Skip to content

This project works with the GoodReads Book-Review Dataset to find all pairs of books that a given user reviews for all users within the dataset. These pairs and their frequencies are computed using pySpark and are used to create a tsv file with book titles using SparkSQL.

Notifications You must be signed in to change notification settings

aaguirre321/GoodReadsBookPairs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

About

This project works with the GoodReads Book-Review Dataset to find all pairs of books that a given user reviews for all users within the dataset. These pairs and their frequencies are computed using pySpark and are used to create a tsv file with book titles using SparkSQL.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages