This model has been trained from the MP3 Training Dataset. CMU's pocketsphinx has been used to convert the speech to text. On the text various classifiers have been applied.
After the benchmarking, Passive Aggressive Classifier is considered to be best suited for training model.
Following python modules are required for running the sample.py
-
numpy
-
pocketsphinx
-
sphinxbase
-
sklearn
Once you've cloned this repo and all the above requirements are met.
-
Place all the mp3 files in the main folder containing the sample.py file.
-
Run the sample.py file (python sample.py) or run the shell script run.sh (./run.sh).
From mp3 the text converted data for training the model is in the file TrainingSet.txt
Two output file's are being generated:
-
results.txt > predicted sentiment separated by line
-
filename_pred.txt > a map with file name and it's predicted sentiment like one.mp3->unhappy