Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modelling #1

Merged
merged 2 commits into from
Jan 1, 2021
Merged

Modelling #1

merged 2 commits into from
Jan 1, 2021

Conversation

jmc-bbk
Copy link
Owner

@jmc-bbk jmc-bbk commented Jan 1, 2021

The main purpose of this PR is to add modelling.ipynb.

This notebook demonstrates how to make a text classification model in Keras and Tensorflow.

I use the amazon_cells_labelled.txt data source and achieve a test accuracy of 81% 🚀 .


I also update extract.ipynb so it no longer loads returns one pd.DataFrame for all three raw data sources.

This notebook has been changed to return a single pd.DataFrame for one data source.

When training models on the combined data sources, accuracy was no better than chance. It's likely that sentiment is a domain specific problem and we can't train a single sentiment model (that's performant!) on multiple disparate data sources.

@jmc-bbk jmc-bbk merged commit 608f2dc into master Jan 1, 2021
@jmc-bbk jmc-bbk deleted the modelling branch January 1, 2021 16:08
@jmc-bbk jmc-bbk mentioned this pull request Jan 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant