Commit 5f9fcf45 authored by jcorvi's avatar jcorvi

Update README.md

parent 6f51a0ee
Pipeline #17186 failed with stage
......@@ -6,11 +6,44 @@ Nextflow pipeline that made use of software container (dockers) in order to dete
![preclinical text-mining solution overview](overview.png)
![preclinical text-mining solution ner](terminology.png)
## To run the pipeline.
Open the run.sh file and configure the corresponding parameters when needed.
Then to run the pipeline simple execute **bash run.sh**
## Components and modules used in the pipeline
### (Text Mining - Generic Tools GitLab Group) https://gitlab.bsc.es/inb/text-mining/generic-tools
* **nlp-gate-generic-component**: Text mining GATE generic component for run in Batch/Pipeline mode using software containers (dockers). This tool execute the Default Gazetteer or Flexible Gazetteer Lookup given dictionaries passed as parameters and, in a second stage, execute JAPE rules given a main.jape file. This component is instantiated by other specific domain modules.
* **nlp-standard-preprocessing**: dockerization of stanford corenlp preprocessing tasks; sentence splitting, tokenization, part of speech (POS), other features: word types, lemma, kinds, formats, length and masks.
* **import-json-to-mongo**: this component inserts and push json files into a mongo database.
### (Text Mining - Bio Tools GitLab Group) https://gitlab.bsc.es/inb/text-mining/bio-tools
* **hepatotoxicity-annotation**: This library annotated text with hepatotoxicity terms related to liver findings, liver markers and CYPs genes that are relevant in hepatotoxicity events. It uses data that were obtained in a previous work: The LIMTOX system http://limtox.bioinfo.cnio.es/. Is an instance of nlp-gate-generic-component.
* **cdisc-etox-annotation**: This component annotated text using CDISC SEND and eTOX (OntoBrowser) terminologies. These terminologies are oriented to the preclinical study reports. Is an instance of nlp-gate-generic-component.
* **dnorm-gate-wrapper**: This component is a Gate wrapper of the Dnorm application (Diseases Tagger). Could be easily downloaded and run as a docker container.
* **linnaeus-gate-wrapper**: This component is a Gate wrapper of the Linnaeus application (Species Tagger). Could be easily downloaded and run as a docker container.
### (eTRANSAFE GitLab Group) https://gitlab.bsc.es/inb/etransafe
* **pretox-app**: Web Application written in angular in order to show and manually curate preclinical toxicological findings.
* **pretox-rest-api**: Api rest that retrieves relevant preclinical toxicological information.
* **ades-relation-extraction**: Relation extraction in order to obtain preclinical toxicological findings.
* **ades-ner-postprocessing**: Postprocessing jape rules execution for the detection of preclinical toxicological findings.
* **ades-export-to-json**: export findings annotated in XML GATE format to JSON format.
## Built With
* [Docker](https://www.docker.com/) - Docker Containers
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment