Deep Learning for Depression Detection

This repository is dedicated to an investigation of various Deep Learning techniques for depression detection in text data. The techniques explored include several variants of Recurrent Neural Networks (RNNs), namely Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Bidirectional LSTM (BiLSTM), as well as a Transformer model, BERT (Bidirectional Encoder Representations from Transformers), and a simple Feed-Forward Neural Network (FFNN) used as a baseline.

Dataset

The dataset used is a collection of text data gathered from different threads of depression-related discussions. Each sample has been labeled as either "depression" or "non-depression" based on the content of the discussion. The text has been preprocessed to be in a suitable format for analysis by various machine learning algorithms.
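
For illustration, a minimal preprocessing pass with the Keras tokenizer might look like the sketch below. The file name `depression_dataset.csv` and the column names `text` and `label` are assumptions for this example, not the repository's actual names; see the individual scripts for the real pipeline.

```python
# Minimal sketch of text preprocessing for the RNN models.
# Assumes a CSV with hypothetical columns "text" and "label"
# (1 = depression, 0 = non-depression); the actual scripts may differ.
import pandas as pd
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

df = pd.read_csv("depression_dataset.csv")  # hypothetical file name

tokenizer = Tokenizer(num_words=10000, oov_token="<OOV>")
tokenizer.fit_on_texts(df["text"])

sequences = tokenizer.texts_to_sequences(df["text"])
X = pad_sequences(sequences, maxlen=100, padding="post")  # fixed-length inputs
y = df["label"].values
```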

Models

The models used in the study are listed below, with illustrative code sketches after the list:

  1. GRU: This model utilizes a Gated Recurrent Unit architecture, a variant of RNNs, which is capable of preserving long-term dependencies in sequence data while being computationally more efficient than traditional LSTMs.

  2. LSTM: The Long Short-Term Memory model is another variant of RNNs. It effectively captures long-term dependencies in sequence data by maintaining a separate memory cell that updates and exposes its contents only when deemed necessary.

  3. BiLSTM: The Bidirectional LSTM model is an extension of traditional LSTM. It processes the data in both forward and backward directions to keep the context of both past and future data.

  4. BERT: BERT is a Transformer model specifically designed for NLP tasks. It uses bidirectional training of transformers, which allows for a deep understanding of the context of a word based on all its surroundings (left and right of the word).

  5. FFNN (Baseline): This is a simple Feed-Forward Neural Network used as a baseline against which the more complex models are compared.
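
As an illustration of how the recurrent classifiers above are typically wired in Keras, here is a minimal sketch; the layer sizes and hyperparameters are assumptions, not necessarily those used in the repository's scripts.

```python
# Illustrative Keras binary classifier; swap the recurrent layer
# (LSTM / GRU / Bidirectional(LSTM)) to obtain each variant.
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, GRU, Bidirectional, Dense

def build_model(rnn_layer, vocab_size=10000, max_len=100):
    model = Sequential([
        Embedding(vocab_size, 128, input_length=max_len),
        rnn_layer,                       # e.g. LSTM(64), GRU(64), Bidirectional(LSTM(64))
        Dense(1, activation="sigmoid"),  # binary: depression vs. non-depression
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model

lstm_model = build_model(LSTM(64))
gru_model = build_model(GRU(64))
bilstm_model = build_model(Bidirectional(LSTM(64)))
```

Swapping only the recurrent layer keeps everything else in the network identical, which makes the comparison between LSTM, GRU, and BiLSTM fair.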
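For BERT, a common fine-tuning pattern uses the Hugging Face transformers library. The following is a hedged sketch of that pattern only; BERT.py may differ in library, checkpoint, or training setup.

```python
# Sketch of BERT fine-tuning with Hugging Face transformers;
# the checkpoint and hyperparameters are illustrative assumptions.
import tensorflow as tf
from transformers import BertTokenizer, TFBertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = TFBertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

texts = ["i feel hopeless lately", "had a great day outside"]  # toy examples
enc = tokenizer(texts, padding=True, truncation=True, max_length=128, return_tensors="tf")

model.compile(optimizer=tf.keras.optimizers.Adam(2e-5),
              loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True))
# model.fit(dict(enc), labels, epochs=3)  # labels: 0 = non-depression, 1 = depression
```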

Autoencoder

An Autoencoder was implemented to understand the semantics of the text data, which was then used for unsupervised learning using KMeans clustering.
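
A minimal sketch of this pipeline is shown below; the dimensions and hyperparameters are illustrative assumptions, and AUTOENCODER.py remains the authoritative implementation.

```python
# Sketch: LSTM autoencoder -> encoded vectors -> KMeans / t-SNE.
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, LSTM, RepeatVector
from sklearn.cluster import KMeans
from sklearn.manifold import TSNE

timesteps, n_features, latent_dim = 100, 128, 32

inputs = Input(shape=(timesteps, n_features))
encoded = LSTM(latent_dim)(inputs)                    # sequence -> fixed-size code
decoded = RepeatVector(timesteps)(encoded)            # code -> sequence scaffold
decoded = LSTM(n_features, return_sequences=True)(decoded)

autoencoder = Model(inputs, decoded)
encoder = Model(inputs, encoded)
autoencoder.compile(optimizer="adam", loss="mse")
# autoencoder.fit(X, X, epochs=10)                    # X: embedded text sequences

# codes = encoder.predict(X)
# clusters = KMeans(n_clusters=2, n_init=10).fit_predict(codes)
# embedded_2d = TSNE(n_components=2).fit_transform(codes)  # for visualization
```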

Results

The models' performance was evaluated based on precision, recall, F1-score, and accuracy. The results are as follows:

| Model  | Precision | Recall | F1-Score | Accuracy |
|--------|-----------|--------|----------|----------|
| GRU    | 0.51      | 0.51   | 0.51     | 0.51     |
| LSTM   | 0.95      | 0.95   | 0.95     | 0.95     |
| BERT   | 0.96      | 0.96   | 0.96     | 0.96     |
| FFNN   | 0.99      | 0.99   | 0.99     | 0.99     |
| BiLSTM | 0.96      | 0.96   | 0.96     | 0.95     |

Graphical and tabular comparisons between these models are available in the 'RESULTS.pdf' file.
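
For reference, metrics of this kind are commonly computed with scikit-learn. The snippet below is a generic sketch with toy labels, not the repository's actual evaluation code.

```python
# Sketch: precision, recall, F1, and accuracy for a binary classifier.
from sklearn.metrics import classification_report, accuracy_score

y_true = [1, 0, 1, 1, 0]  # toy ground-truth labels: 1 = depression
y_pred = [1, 0, 0, 1, 0]  # toy model predictions

print(classification_report(y_true, y_pred,
                            target_names=["non-depression", "depression"]))
print("accuracy:", accuracy_score(y_true, y_pred))
```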

Repository Files

  1. Baseline.py: Contains the code for the implementation of the FFNN model.
  2. BERT.py: Contains the code for the implementation of the BERT model.
  3. DATA VISUALISATION.py: Contains the code for various data visualization techniques implemented on the dataset.
  4. DEPRESSION_DETECTION_USING_LSTM_AND_GRU.py: Contains the code for the implementation of both the LSTM and GRU models.
  5. RESULTS.pdf: Contains graphical and tabular comparisons between the implemented models.
  6. wordcloud_depression.png: A word cloud visualization of the most common words in depression-related discussion threads.
  7. BILSTM.py: Contains the code for the implementation of the BiLSTM model.
  8. AUTOENCODER.py: An LSTM autoencoder is used to understand the semantics of the text, and the encoded representations are visualized using t-SNE. KMeans clustering is also implemented for unsupervised learning.

Future Improvements

Future work may involve finer hyperparameter tuning, exploring attention mechanisms, and examining the potential of other state-of-the-art models. Combining the predictions of several models (ensembling) may also yield higher accuracy.

Running the Repository

To run the repository, first clone it to your local machine. Ensure that Python 3.x and the necessary packages (such as TensorFlow, Keras, pandas, etc.) are installed. Then navigate to the repository's directory and execute the Python scripts. For example, to run the LSTM and GRU models, type `python DEPRESSION_DETECTION_USING_LSTM_AND_GRU.py`. Note: it is advisable to use a virtual environment to avoid package conflicts.

For visualizing the results, open RESULTS.pdf. If you're interested in how the data was visualized, refer to DATA VISUALISATION.py.

Please note that running deep learning models can be resource-intensive, and it is recommended to run these scripts on a machine with an adequate GPU.
