Skip to content

Predicting the language of a piece of text using an ANN and character N-grams

License

Notifications You must be signed in to change notification settings

conorosully/language-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

language-classification

Predicting the language of a piece of text using an Artificial Neural Network (ANN) and character N-grams. This code was used in one of my MSc projects.

Project report: final_research_paper.pdf

Code demonstation: 0_demo

File Purpose
0_demo A simplified demostration of project's code
1_clean_data Clean text dataset and obtain train, valid & test split
2_create_model_features Create trigram character features used to train ML models
3_ann_tuning Fine tune the ANN hyper-parameters
4_ann_final Used to train the final ANN model with optimal hyper-parameters
5_ann_metrics Calculate and visualise ANN result metrics

About

Predicting the language of a piece of text using an ANN and character N-grams

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published