Skip to content

Analysing Disaster related tweets dataset and build a classifier using deep learning and deploy it using Heroku

License

Notifications You must be signed in to change notification settings

raklugrin01/Disaster-Tweets-Analysis-and-Classification

Repository files navigation

Disaster Tweets Analysis and Classification

Dataset Language Library ML Library DL Library

Project Description

Twitter has become an important communication channel in times of emergency.
The ubiquitousness of smartphones enables people to announce an emergency they’re observing in real-time.
Because of this, more agencies are interested in programatically monitoring Twitter (i.e. disaster relief organizations and news agencies).

Project Contents

  1. Exploratory Data Analysis
  2. EDA after Data Cleaning
  3. Data Preprocessing using NLP
  4. Machine Learning models for classifying Tweets data
  5. Deep Learning approach for classifying Tweets data
  6. Model Deployment

Resources Used

1. Exploratory Data Analysis

  • Visualising Target Variable of the Dataset

    Target Variable
  • Visualising Length of Tweets

    Tweet Length
  • Visualising Average word lengths of Tweets

    Avg Word Lengths
  • Visualising most common stop words in the text data

    Stopwords
  • Visualising most common punctuations in the text data

    Punctuations

2. EDA after Data Cleaning

  • We use Python Regex library and nltk lemmatizing methods for Data Cleaning