PNLP | Persian Natural Language Processing
This is an experimental project and I try to develop everything for Persian (Farsi) from scratch or use open source libraries.
Repository contains:
-
utility
- Clean documents (remove punctuation, non-persian characters)
- Stop word removal
-
word embedding
- Word2Vec
- Glove
-
Vector space models
- TF-IDF
-
Language models
- Bigram with smoothing
- Translation
Comming soon..