The Enron email dataset analysis

NB: This project is a part of the interview process for VLabs.

The report for the file can be found here

This project is an analysis of the Enron email dataset, which is part of an interview process. The Enron email dataset consists of approximately 500,000 emails from 150 users within the Enron corporation. The goal of this project is to gain insights into the communication patterns and behavior of the employees at Enron through analysis of their emails. For our use case we will be working with the "A subset of about 1700 labelled email messages".

Files in the repository

Model_Word2Vec.py : Implementation of the models with word2vec

Models_TF-IDF.py : Implementation of the models with TF-IDF

MLP.py : Implementation of the models with Multilaper perceptron

RNN.py : Implementation of the models with RNN

libraries.py : Libraries used

preprocessing.py : The entire preprocessing for this project in one file

Vlabs.ipynb : A notebook for everything in one place

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
Data_Visualisation		Data_Visualisation
__pycache__		__pycache__
enron_with_categories		enron_with_categories
.DS_Store		.DS_Store
.gitattributes		.gitattributes
LICENSE		LICENSE
MLP.py		MLP.py
Model_Word2Vec.py		Model_Word2Vec.py
Models_TF-IDF.py		Models_TF-IDF.py
README.md		README.md
RNN.py		RNN.py
Report.pdf		Report.pdf
Vlabs.ipynb		Vlabs.ipynb
libraries.py		libraries.py
preprocessing.py		preprocessing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Enron email dataset analysis

About

Releases

Packages

Languages

License

sohamtalukdar/Enron-Email-Analysis

Folders and files

Latest commit

History

Repository files navigation

The Enron email dataset analysis

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages