Skip to content

yohanesgultom/nlp-experiments

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Open NLP

POS tagging and Named-entity recognizing

Distribution

Binary distribution can be downloaded here (JRE 1.7 or later required, Unix or Windows only)

Usage

Please find usage guide in the README

Building

Prerequisites:

  • JDK 1.7 or later
  • Maven 3.3.9 or later

Building program:

$ cd java/nlp
$ mvn clean package

NLTK

Prequisites

POS Tagging

POS tagging with predefined training and test data:

$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv ../data/pos-tagging/Wikipedia.txt

POS tagging by splitting training data to training and test data:

$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv 1000 sentences.tag