sohamtalukdar/Enron-Email-Analysis

The Enron email dataset analysis

NB: This project is a part of the interview process for VLabs.

The report for the file can be found here

This project analyzes the Enron email dataset, which consists of approximately 500,000 emails from about 150 users within the Enron corporation. The goal is to gain insight into the communication patterns and behavior of Enron employees through their emails. The analysis works with the labelled portion of the dataset, described as "a subset of about 1700 labelled email messages".

Files in the repository

Model_Word2Vec.py : Implementation of the models with word2vec

Models_TF-IDF.py : Implementation of the models with TF-IDF

MLP.py : Implementation of the models with a multilayer perceptron

RNN.py : Implementation of the models with RNN

libraries.py : Libraries used

preprocessing.py : The entire preprocessing for this project in one file

Vlabs.ipynb : A notebook for everything in one place
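
For readers unfamiliar with the TF-IDF features that Models_TF-IDF.py is named after, the idea can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the code in this repository: the function names `tokenize` and `tfidf_vectors`, the whitespace tokenizer, the smoothed IDF formula, and the three sample emails are all hypothetical and chosen only to keep the example self-contained.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical stand-in for real preprocessing: lowercase, split on whitespace.
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document (illustrative sketch)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Smoothed inverse document frequency (an assumed variant, one of several in use).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # Term frequency is the count normalised by document length.
        vectors.append({t: (c / total) * idf[t] for t, c in counts.items()})
    return vectors


# Made-up sample messages, not taken from the Enron dataset.
emails = [
    "please review the trading report",
    "the meeting is moved to friday",
    "please send the report by friday",
]
vectors = tfidf_vectors(emails)
```

In this sketch a word that occurs in every message, such as "the", receives a lower weight than a word that occurs in only one, such as "trading". That property is what makes TF-IDF vectors a common input for text classifiers.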
