Railroad Trespassing and Safety: A Systematic Analysis of Twitter Data

https://doi.org/10.1016/j.cstp.2024.101154

Overview

This is the completed version of codes for the paper "Railroad Trespassing and Safety: A Systematic Analysis of Twitter Data"

Data

This folder contains code data_collection.ipynb is used in data collection and saved in csv file format.

Data Processing

The code preprocessing.py is used to preprocess data and save it in csv file format.

User analysis

Simple Python codes for data analysis.

Topic modeling, Sentiment and Emotion prediction

We used some existing models for topic modeling, sentiment, and emotion prediction.

Topic modeling: "https://github.com/MaartenGr/BERTopic"

Sentiment prediction: "https://huggingface.co/cardiffnlp/twitter-roberta-base-sentiment"

Emotion prediction: "https://github.com/nikicc/twitter-emotion-recognition"

Results

All numerical results are created in csv format

Description of steps

The railroad-related twitter data analysis is divided into six parts:

Data cleaning process: Remove tweets that are not related to railroad, rail, or safety.
User analysis: Identify users and the type of users involved in posting rail safety-related tweets worldwide. Group the users in different subgroups.
Topic modeling: Extract topics of rail safety-related tweets.
Sentiment analysis: Identify the sentiments(pos,neg,neu) of the tweets.
Emotion Analysis: Identify the emotional (joy,..) of tweets.
Hashtags and Mention analysis: Extract hashtags and mentions from tweets and utilize that information for extracting organizational and geographical information.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
code		code
.DS_Store		.DS_Store
README.md		README.md
data_collection.ipynb		data_collection.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Railroad Trespassing and Safety: A Systematic Analysis of Twitter Data

Overview

Data

Data Processing

User analysis

Topic modeling, Sentiment and Emotion prediction

Results

Description of steps

About

Releases

Packages

Languages

srbnghosh99/Railroad-Safety-Evidence-from-Twitter-Analysis

Folders and files

Latest commit

History

Repository files navigation

Railroad Trespassing and Safety: A Systematic Analysis of Twitter Data

Overview

Data

Data Processing

User analysis

Topic modeling, Sentiment and Emotion prediction

Results

Description of steps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages