NB: This project is a part of the interview process for VLabs.
The report for the file can be found here
This project is an analysis of the Enron email dataset, which is part of an interview process. The Enron email dataset consists of approximately 500,000 emails from 150 users within the Enron corporation. The goal of this project is to gain insights into the communication patterns and behavior of the employees at Enron through analysis of their emails. For our use case we will be working with the "A subset of about 1700 labelled email messages".
Files in the repository
Model_Word2Vec.py : Implementation of the models with word2vec
Models_TF-IDF.py : Implementation of the models with TF-IDF
MLP.py : Implementation of the models with Multilaper perceptron
RNN.py : Implementation of the models with RNN
libraries.py : Libraries used
preprocessing.py : The entire preprocessing for this project in one file
Vlabs.ipynb : A notebook for everything in one place