Skip to content

collection of jupyter notebooks for planing & development

Notifications You must be signed in to change notification settings

unreally-ai/notebooks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 

Repository files navigation

Unreally Notebooks

This repo contains all jupyter notebooks used for planing and developing the Unreally pipeline. (Each folder represents a step in the pipeline)

The following contains brief discribtions of each notebook. If you commit one yourself, please add some infos to this file as well.

Step 1 - Vectors

This folder deals with the first step of the ML pipeline: Vectorization of text & the necessesary cleaning of data. Also note that it contains a .zip with the datasets used

Top5k Bow-TF

  • explores the SemEval2016 A Gold dataset
  • contains a custom function for cleaning text
  • aims to understand how one could create a dictionary with the top 5k words of a dataset

5k-MultiDF Vocab

  • extends the function of the previous notebook to compute one 5k vocab of multiple data frames
  • more precise functions
  • displays the top 5k words as a data frame

Step 2 - MLP

The second step of our pipeline: The Multi-Layer Perceptron.

the_unreally_notebook

The current state-of-the-art notebook. This model reached about 72.5% accuracy.

data_balancing

Notebook for balancing the mapped data to imporove training

NN_and_eval

(Outdated) First tests to evaluate the neural net

About

collection of jupyter notebooks for planing & development

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published