This is a Jupyter Notebook in Python3 for extracting possible statements of permission from informed consent documents. The goal is to generate a training set that will generalize beyond the current dataset for extracting terms related to 'permitting' in the consent context. This work is loosely organized and changing rapidly.
Dependencies can be found in docs
The command-line training and prediction scripts depend on having BERT cloned, including a language model.
- J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” arXiv:1810.04805 [cs], Oct. 2018.