Applying machine learning classification methods to identify suicidal posts in "SuicideWatch" subreddit.
Suicide is one of the leading causes of death globally, with an estimated 800,000 deaths annually. That is one death every 40 seconds. A key aspect in suicide prevention is addressing suicidal thoughts and ideas before they turn into actions. By the help of Machine learning, detecting suicidal ideation can assist public health agency in better resource allocation to target people at risk
Data used in this project can be found here. Below is the information required to reproduce the project:
Helper functions
saved asutils.py
- Dataset is saved in
_data
folder asSuicide_Detection.csv
Data cleaning
notebook saved asdata_cleaning.ipynb
(the data produced from this notebook is found in the_data
folder asclean_reddit.csv
)Data Processing
notebook found inprocessing.ipynb
Modelling
notebook found inmodelling.ipynb
Final report
saved asreport.pdf
After creating your virtual environment, please run the following in command line to replicate
git clone https://github.com/eugenebaraka/Predict-Suicidal-Ideation-on-Reddit.git
cd Predict-Suicidal-Ideation-on-Reddit
pip install -r requirements.txt
- Building a suicidal tweet classifier using NLP
- Detecting suicidality on Twitter
- Detecting Suicide Ideation in the Era of Social Media
- Tracking suicide risk factors through Twitter in the US
- "Right time, right place" health communication on Twitter
- Tweeting celebrity suicides
- A comparative analysis on suicidal ideation detection using NLP, ML, and DL
- Suicidal profiles detection in Twitter
- A machine learning approach predicts future risk to suicidal ideation from social media data
- Systematic review: machine learning and suicidal ideation
- An unsupervised learning approach for automatically to categorize potential suicide
- A lexicon-based approach to detecting suicide-related messages on Twitter
- Detection of Suicide Ideation in Social Media Forums Using Deep Learning
- Supervised learning for suicidal ideation detection in online user content
- Machine learning for suicidal ideation identification: A systematic literature review
- A brief introduction to weakly supervised learning