Arabic-NLP-Engine

A Natural Language Processing (NLP) engine for tokenizing and analyzing Arabic text. The engine reads the content of a folder of text files, tokenizes it, and finds the words or sequences that most frequently follow each word. It also checks the syntax of the extracted tokens against a provided Arabic dictionary.
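The pipeline described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code in `engine.py` or `dictionnary.py`. The function names (`read_corpus`, `tokenize`, `next_word_counts`, `unknown_words`), the regex-based tokenizer, and the treatment of the dictionary check as a plain set lookup are all assumptions made for the example.

```python
import re
from collections import Counter, defaultdict
from pathlib import Path

# Assumption: a "word" is a run of letters from the basic Arabic Unicode block.
ARABIC_WORD = re.compile(r"[\u0621-\u064A]+")
# Short-vowel diacritics (tashkeel) and the tatweel elongation character.
DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")


def read_corpus(folder):
    """Concatenate the text of every .txt file in `folder` (hypothetical layout)."""
    paths = sorted(Path(folder).glob("*.txt"))
    return "\n".join(p.read_text(encoding="utf-8") for p in paths)


def tokenize(text):
    """Strip diacritics, then return the Arabic words in order of appearance."""
    return ARABIC_WORD.findall(DIACRITICS.sub("", text))


def next_word_counts(tokens):
    """Map each word to a Counter of the words that immediately follow it."""
    counts = defaultdict(Counter)
    for current, following in zip(tokens, tokens[1:]):
        counts[current][following] += 1
    return counts


def unknown_words(tokens, dictionary):
    """Return the sorted, de-duplicated tokens that are absent from `dictionary`."""
    return sorted(set(tokens) - set(dictionary))


if __name__ == "__main__":
    # In-memory sample so the sketch runs without any data folder.
    sample = "ذهب الولد إلى المدرسة ذهب الولد"
    tokens = tokenize(sample)
    print(next_word_counts(tokens)["ذهب"].most_common(1))
    print(unknown_words(tokens, {"ذهب", "الولد", "إلى"}))
```

To run the same steps on files, pass the result of `read_corpus("data")` to `tokenize`; the `data` folder name is taken from the repository listing, but its contents and format are not documented here.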

Installation

Clone the repository to your local machine:

Nourine-Nadir/Arabic-NLP-Engine

Folders and files

Latest commit

History

Repository files navigation

Arabic-NLP-Engine

Installation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages