This directory includes:
- ArticleSearch: Search and retrieval of articles from PMC and PubMedCentral, plus preparation of text files for analysis
- Corpus: Set of files forming the corpus
- Guide d'annotation: Annotation guide containing definitions of the entities and relations, as well as examples of occurrences from the corpus (in French)

The scripts are written in Python as Jupyter notebooks and require the following packages:
- lxml=4.9.1 (xml.etree.ElementTree)
- entrezpy=2.1.3
- datetime
- pandas=1.5.0
- time
- sys
- os
- re
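
Before running the notebooks, it can help to confirm that the environment matches the pinned versions. The following is a minimal sketch, not part of the repository: the helper name `check_pins` is hypothetical, and it assumes the three versioned packages were installed under the distribution names `lxml`, `entrezpy` and `pandas`. The remaining entries (`datetime`, `time`, `sys`, `os`, `re`) are Python standard-library modules and need no installation.

```python
from importlib import metadata

# Pinned versions copied from the list above (third-party packages only).
PINNED = {"lxml": "4.9.1", "entrezpy": "2.1.3", "pandas": "1.5.0"}


def check_pins(pinned=PINNED, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch.

    `check_pins` is a hypothetical helper; `get_version` is injectable
    so the logic can be exercised without touching the real environment.
    """
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    # Print one line per package whose installed version differs from the pin.
    for name, (expected, installed) in check_pins().items():
        print(f"{name}: expected {expected}, found {installed or 'not installed'}")
```

An empty result means the installed versions match the pins exactly. Newer versions may also work, but that is not stated here.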