This repository provides the data and code to reproduce the results of Correcting misinformation on social media with a large language model.
-
data/notes_all.csv
: The Community Notes data that our evaluation is based on. -
data/responses.csv
: It contains the tweets and responses made by- Laypeople with high helpfulness;
- Laypeople with average helpfulness;
- MUSE that simulates correcting misinformation at the same time as the laypeople's high-helpfulness response;
- MUSE that simulates correcting misinformation at the same time as the laypeople's average-helpfulness response;
- MUSE that simulates correcting misinformation right after it appears on social media;
- MUSE\retrieval (multimodal inputs only, otherwise it is the same as GPT-4);
- MUSE\vision (multimodal inputs only, otherwise it is the same as GPT-4); and
- GPT-4.
'~' indicates the response is the same as (iii). '|||': the same as (iv). '*': the same as (vi). '$': the same as (vii). '///': the same as (viii).
-
data/Q[..].csv
: It contains the experts' evaluation results of the responses indata/responses.csv
. -
data/username_tweetids.csv
: The assignment of the tweets and responses to every expert in the annotation phase. -
data/tweetid_domain
: The identified domain of each tweet. -
data/tweetid_misleadtype
: The identified tactic(s) of each tweet. -
data/tweetid_politics
: The identified political learning of each tweet. -
data/tweetid_factchecked
: The identified tweets that have been fact-checked online. -
code/
: The code to reproduce the main results in our paper. The results were generated with Python 3.7 and dependencies inrequirements.txt
.
- We comply with X/Twitter Terms of Service by only releasing the IDs of tweets. Most code files are runnable without further obtaining the tweet data, except
fig_s23.ipynb
, where the creation times of tweets are necessary. - The names of the experts are anonymized.
@article{zhou2024muse,
title={Correcting misinformation on social media with a large language model},
author={Zhou, Xinyi and Sharma, Ashish and Zhang, Amy X and Althoff, Tim},
journal={arXiv preprint arXiv:2403.11169},
year={2024}
}