Skip to content

This Jupyter Notebook file was created with Ian Wilke-Tomasik to extract the words and word counts of the Bronte sisters' novels and two Charles Dickens novels. The word-count vectors for each novel were used to compute cosine similarities.

Notifications You must be signed in to change notification settings

tcho6319/Cosine-Similarity-of-Dickens-and-Brontes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Cosine-Similarity-of-Dickens-and-Brontes

This Jupyter Notebook file was created with Ian Wilke-Tomasik as part of a Intro Data Science Project.

The objective was to compute the cosine similarities among the novels written by the Bronte sisters and Charles Dickens. Words were extracted from .csv files in order to vectorize word counts of the Bronte sisters' novels and two Charles Dickens novels. The word-count vectors for each novel were used to compute cosine similarities.

View the notebook here: https://nbviewer.jupyter.org/github/tcho6319/Cosine-Similarity-of-Dickens-and-Brontes/blob/master/INFO%202950%20Final%20Project-Final%20Code.ipynb

About

This Jupyter Notebook file was created with Ian Wilke-Tomasik to extract the words and word counts of the Bronte sisters' novels and two Charles Dickens novels. The word-count vectors for each novel were used to compute cosine similarities.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published