Propaganda Dection: Submission to SemEval-2020 Shared Task 11

PsuedoProp at SemEval-2020 Task 11: Propaganda Span Detection Using BERT-CRF and Ensemble Sentence Level Classifier

We propose a sequential BERT-CRF based Span Identification model where the fine-grained detection is carried out only on the articles that are flagged as containing propaganda by an ensemble SLC model. We propose this setup bearing in mind the practicality of this approach in identifying propaganda spans in the exponentially increasing content base where the fine-tuned analysis of the entire data repository may not be the optimal choice due to its massive computational resource requirements. We present our analysis on different voting ensembles for the SLC model. Our system ranks 14th on the test set and 22nd on the development set and with an F1 score of 0.41 and 0.39 respectively.

Directory Structure

├── bert_lstm_ner.py                :BERT-CRF defintion
├── inference.ipynb                 :Permutations for ensembles
├── roberta_slc.ipynb               :Sequence Level Classification of the sentences as being propaganda or not. 
├── span_detection.ipynb            :Fine-Grained span prediction on the propaganda samples. 
└── terminal_predict.py             :e2e pipeline.

Cite: Aniruddha Chauhan and Harshita Diddee. 2020. PsuedoProp at SemEval-2020 Task 11: Propaganda Span Detection Using BERT-CRF and Ensemble Sentence Level Classifier. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1779–1785, Barcelona (online).International Committee for Computational Linguistics.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
BERT_NER_LSTM.ipynb		BERT_NER_LSTM.ipynb
README.md		README.md
bert_lstm_ner.py		bert_lstm_ner.py
inference.ipynb		inference.ipynb
pseudoprop.png		pseudoprop.png
roberta_slc.ipynb		roberta_slc.ipynb
span_detection.ipynb		span_detection.ipynb
terminal_predict.py		terminal_predict.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Propaganda Dection: Submission to SemEval-2020 Shared Task 11

Directory Structure

About

Releases

Languages

harshitadd/SemEval-Task-11

Folders and files

Latest commit

History

Repository files navigation

Propaganda Dection: Submission to SemEval-2020 Shared Task 11

Directory Structure

About

Topics

Resources

Stars

Watchers

Forks

Releases

Languages