Skip to content

Working with Atomic Wikipedia Edits dataset. This dataset provided by the LUNAR Lab contains atomic Wikipedia edits containing both insertions (13.7 million examples) and deletions (9.3 million examples) of a contiguous chunk of text in an English-language sentence.

Notifications You must be signed in to change notification settings

ShyamSubramanian/Brown-Datathon-2020

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Brown-Datathon-2020

Working with Atomic Wikipedia Edits dataset. This dataset provided by the LUNAR Lab contains atomic Wikipedia edits containing both insertions (13.7 million examples) and deletions (9.3 million examples) of a contiguous chunk of text in an English-language sentence.

Data collected from Amazon Mechanical Turk - Amazon_Mturk_Category_data.csv
Intent Classification - edit_intent_classification.ipynb
Pretrained Model Weights - pre_trained_glove_model.h5
Presentation - Final_Presentation_Results.ipynb

About

Working with Atomic Wikipedia Edits dataset. This dataset provided by the LUNAR Lab contains atomic Wikipedia edits containing both insertions (13.7 million examples) and deletions (9.3 million examples) of a contiguous chunk of text in an English-language sentence.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%