Modelling #1

jmc-bbk · 2021-01-01T16:08:17Z

The main purpose of this PR is to add modelling.ipynb.

This notebook demonstrates how to build a text classification model with Keras and TensorFlow.

I use the amazon_cells_labelled.txt data source and achieve a test accuracy of 81% 🚀.
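For readers who want a feel for the approach without opening the notebook, here is a minimal sketch of a Keras binary text classifier. It is not the architecture used in modelling.ipynb: the layer choices, `build_classifier`, the hyperparameters, and the four toy sentences are all assumptions made for illustration, and it assumes a TensorFlow version that ships `tf.keras.layers.TextVectorization` (2.6 or later).

```python
import numpy as np
import tensorflow as tf


def build_classifier(vocab_size: int, embedding_dim: int = 16) -> tf.keras.Model:
    """Hypothetical helper: a small bag-of-embeddings sentiment classifier."""
    model = tf.keras.Sequential([
        # Map each token id to a learned dense vector.
        tf.keras.layers.Embedding(vocab_size, embedding_dim),
        # Average the token vectors into one vector per sentence.
        tf.keras.layers.GlobalAveragePooling1D(),
        tf.keras.layers.Dense(16, activation="relu"),
        # Single sigmoid unit: probability that the sentence is positive.
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model


# Toy data standing in for the real reviews (illustrative only).
texts = [
    "Great phone, works perfectly.",
    "Battery died after one day.",
    "Excellent sound quality.",
    "Terrible reception, do not buy.",
]
labels = np.array([1, 0, 1, 0], dtype="float32")

# Turn raw strings into fixed-length integer sequences.
vectorizer = tf.keras.layers.TextVectorization(max_tokens=1000, output_sequence_length=20)
vectorizer.adapt(tf.constant(texts))
features = vectorizer(tf.constant(texts)).numpy()

model = build_classifier(vocab_size=vectorizer.vocabulary_size())
model.fit(features, labels, epochs=2, verbose=0)

# One probability per input sentence, shape (4, 1).
probabilities = model.predict(features, verbose=0)
```

On the real data you would hold out a test split before fitting and report accuracy on that split only.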

I also update extract.ipynb so it no longer returns one pd.DataFrame for all three raw data sources.

This notebook has been changed to return a single pd.DataFrame for one data source.
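As a rough sketch of what loading one source into a pd.DataFrame can look like: the code below assumes each line of the file is a sentence, a tab, and a 0/1 label, which is how I understand amazon_cells_labelled.txt to be laid out. `load_source` and the column names `text` and `label` are hypothetical and are not taken from extract.ipynb.

```python
import csv
import io

import pandas as pd


def load_source(path_or_buffer) -> pd.DataFrame:
    """Hypothetical helper: read one labelled-sentences file into a DataFrame.

    Assumes tab-separated lines of the form "<sentence>\t<0 or 1>".
    """
    return pd.read_csv(
        path_or_buffer,
        sep="\t",
        header=None,
        names=["text", "label"],
        # Review text can contain stray quote characters; treat them literally.
        quoting=csv.QUOTE_NONE,
    )


# In-memory sample so the sketch runs without the real file.
SAMPLE = "Great phone.\t1\nBattery died fast.\t0\n"
df = load_source(io.StringIO(SAMPLE))
```

For the real data, pass the file path instead of the `io.StringIO` buffer, for example `load_source("amazon_cells_labelled.txt")`.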

When training models on the combined data sources, accuracy was no better than chance. It's likely that sentiment is a domain-specific problem and we can't train a single sentiment model (that's performant!) on multiple disparate data sources.

jmc-bbk added 2 commits January 1, 2021 14:45

Created model

19ec998

Updated notebooks, added comments

6ff27c2

jmc-bbk merged commit 608f2dc into master Jan 1, 2021

jmc-bbk deleted the modelling branch January 1, 2021 16:08

jmc-bbk mentioned this pull request Jan 4, 2021

Google Cloud Modelling #2

Merged
