README.md 1.98 KB
Newer Older
javi's avatar
javi committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
# stanford-classifier

Wrapper and dockerized solution for the execution of a Sentence Stanford Classifier.

[Stanford Classifier](https://nlp.stanford.edu/software/classifier.shtml).

## Description

This is a Wrapper that receives GATE Xml format files and classify the sentences with a given Stanford Classifier already train.

This Wrapper receive a Stanford Classifier Model and return GATE documents in which the annotated sentences contain a feature "RELEVANT" that contain the class. 
For example, TOXICITY_RELEVANT o NOT_TOXICITY_RELEVANT.

Prerequisites,  Token and Sentence annotations must be present inside an annotation set that will also be a parameter.

For training a classifier you can go to , another dockerized component that can be run without any instalation.

Javi Corvi's avatar
Javi Corvi committed
18
## Actual Version: 2.0, 2021-10-05
javi's avatar
javi committed
19
20
21
## [Changelog](https://gitlab.bsc.es/inb/text-mining/generic-tools/stanford-classifier/blob/master/CHANGELOG) 
## Docker

Javi Corvi's avatar
Javi Corvi committed
22
registry.gitlab.bsc.es/inb/text-mining/generic-tools/stanford-classifier
javi's avatar
javi committed
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54

## Build and Run the Docker 
	#To build the docker, just go into the stanford-training-classifier folder and execute
	docker build -t stanford-classifier .
	#To run the docker
	docker run --rm -u $UID stanford-classifier stanford-classifier -i inputFolder -o outPutFolder -a annotationSet -model pathToModel

Parameters:
<p>
-i input folder with documents to classify
-o output folder
-a annotation set in which are the Sentences annotated
-model Stanford Trainned Model to classify sentences
</p>

## Built With

* [Docker](https://www.docker.com/) - Docker Containers
* [Maven](https://maven.apache.org/) - Dependency Management

## Versioning

We use [SemVer](http://semver.org/) for versioning. For the versions available, see the [tags on this repository](https://gitlab.bsc.es/inb/text-mining/generic-tools/stanford-classifier/-/tags). 

## Authors

* **Javier Corvi** 

## License

This project is licensed under the GNU GENERAL PUBLIC LICENSE Version 3 - see the [LICENSE.md](LICENSE.md) file for details