In the year 2020, misinformation seemed to spread just as quickly as the virus that had defined it. Established facts devolved into hotly debated topics and low-brow journalism had presided over the advice of experts and scientific research. Consequently, Americans are largely fragmented in their opinions toward the severity of COVID (in turn contributing to its spread) and thousands are disillusioned with modern journalism, uncertain as to what to believe on the internet.
This project aims to effectively address this misinformation crisis through the restoration of trust in journalism and the identification of false information on social media. Using a NN classifier trained on recent and comprehensive datasets, the goal is to flag articles, specifically those shared on platforms like Facebook and WhatsApp, that either intentionally or unintentionally include untrue statements regarding COVID or COVID-related topics. To ensure accessibility, it is my hope that the end product can be accessed via a web extension and can progressively improve through continual learning.
- Use the tools of NLP to create a Long short-term neural network that can classify news within in a dataset as either fake or true
- Extend dataset beyond just COVID related news to include fake news in general
- If possible: Deploy classifier as a Chrome web extension