Authors: Alan Ritter, Sam Clark
contact: aritter@cs.washington.edu
export TWITTER_NLP=./
cat test.1k.txt | python python/ner/extractEntities2.py
Note: loading the models from disk takes a minute or so.
To include classification, simply add the --classify switch:
cat test.1k.txt | python python/ner/extractEntities2.py --classify
For higher-quality but slower results, optionally include features based on POS and chunk tags (chunk tags require POS):
cat test.1k.txt | python python/ner/extractEntities2.py --classify --pos
cat test.1k.txt | python python/ner/extractEntities2.py --classify --pos --chunk
Event tags can also be included (requires POS):
cat test.1k.txt | python python/ner/extractEntities2.py --classify --pos --event
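The switch combinations above can also be assembled programmatically. The following is a minimal sketch, not part of the distribution: `build_command` and `tag_file` are hypothetical helper names, and the only things taken from this README are the script path, the four switches, the stated dependencies (--chunk and --event each require --pos), and the TWITTER_NLP=./ setting. It assumes it is run from the repository root.

```python
import os
import subprocess

# Script path as used in the commands above.
SCRIPT = "python/ner/extractEntities2.py"


def build_command(classify=False, pos=False, chunk=False, event=False,
                  interpreter="python"):
    """Return the argument list for the tagger, checking switch dependencies."""
    # The README states that chunk tags and event tags both require POS tags.
    if chunk and not pos:
        raise ValueError("--chunk requires --pos")
    if event and not pos:
        raise ValueError("--event requires --pos")

    cmd = [interpreter, SCRIPT]
    if classify:
        cmd.append("--classify")
    if pos:
        cmd.append("--pos")
    if chunk:
        cmd.append("--chunk")
    if event:
        cmd.append("--event")
    return cmd


def tag_file(path, **options):
    """Hypothetical helper: feed a file of tweets to the tagger on stdin
    and return whatever it writes to stdout, undecoded structure and all."""
    # Mirrors `export TWITTER_NLP=./` from the usage instructions.
    env = dict(os.environ, TWITTER_NLP="./")
    with open(path, "rb") as tweets:
        result = subprocess.run(build_command(**options), stdin=tweets,
                                stdout=subprocess.PIPE, env=env, check=True)
    return result.stdout.decode("utf-8")
```

For example, `tag_file("test.1k.txt", classify=True, pos=True, chunk=True)` would run the same pipeline as the --classify --pos --chunk command above. The output is returned as raw text because this README does not describe its format.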
- Linux
- Libraries and executables can be compiled with build.sh
@inproceedings{Ritter11,
author = {Ritter, Alan and Clark, Sam and Mausam and Etzioni, Oren},
title = {Named Entity Recognition in Tweets: An Experimental Study},
booktitle = {EMNLP},
year = {2011}
}
@inproceedings{Ritter12,
author = {Ritter, Alan and Mausam and Etzioni, Oren and Clark, Sam},
title = {Open Domain Event Extraction from Twitter},
booktitle = {KDD},
year = {2012}
}
statuscalendar.cs.washington.edu
Junming Sui
Ming-Wei Chang
Tuan Anh Hoang Vu
sumant81