Tracking Progress in Natural Language Processing

It aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones such as reading comprehension and natural language inference. The main objective is to provide the reader with a quick overview of benchmark datasets and the state-of-the-art for their task of interest, which serves as a stepping stone for further research. To this end, if there is a place where results for a task are already published and regularly maintained, such as a public leaderboard, the reader will be pointed there.

Contributing

If you would like to add a new result, you can do so with a pull request. In order to minimize noise and to make maintenance somewhat manageable, results reported in published papers will be preferred (indicate the venue of publication in your PR); an exception may be made for influential preprints. The result should include the name of the method, the citation, the score, and a link to the paper and should be added so that the table is sorted.

To add a new dataset or task, follow the below steps. Any new datasets should have been used for evaluation in at least one published paper besides the one that introduced the dataset.

Fork the repository.
If your task is completely new, create a new file and link to it in the table of contents above. If not, add your task or dataset to the respective section of the corresponding file.
Briefly describe the dataset/task and include relevant references.
Describe the evaluation setting and evaluation metric.
Show how an annotated example of the dataset/task looks like.
Add a download link if available.
Copy the below table and fill in at least two results (including the state-of-the-art) for your dataset/task (chang Score to the metric of your dataset).
Submit your change as a pull request.

Model	Score	Paper / Source

Things to do

Add pointers on how to retrieve data.
Provide more details regarding the evaluation setup of each task.
Add an example to every task/dataset.
Add statistics to every dataset.
Provide a description and details for every task / dataset.
We could potentially use readthedocs to provide a clearer structure.
All current datasets in this list are for the English language (except for UD). In a separate section, we could add datasets for other languages.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
LICENSE		LICENSE
README.md		README.md
ccg_supertagging.md		ccg_supertagging.md
chunking.md		chunking.md
constituency_parsing.md		constituency_parsing.md
coreference_resolution.md		coreference_resolution.md
dependency_parsing.md		dependency_parsing.md
dialog.md		dialog.md
domain_adaptation.md		domain_adaptation.md
language_modeling.md		language_modeling.md
machine_translation.md		machine_translation.md
multi-task_learning.md		multi-task_learning.md
multimodal.md		multimodal.md
named_entity_recognition.md		named_entity_recognition.md
natural_language_inference.md		natural_language_inference.md
part-of-speech_tagging.md		part-of-speech_tagging.md
question_answering.md		question_answering.md
semantic_parsing.md		semantic_parsing.md
semantic_role_labeling.md		semantic_role_labeling.md
semantic_textual_similarity.md		semantic_textual_similarity.md
sentiment_analysis.md		sentiment_analysis.md
summarization.md		summarization.md
text_classification.md		text_classification.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tracking Progress in Natural Language Processing

Table of contents

Contributing

Things to do

About

Releases

Packages

License

FredRodrigues/NLP-progress

Folders and files

Latest commit

History

Repository files navigation

Tracking Progress in Natural Language Processing

Table of contents

Contributing

Things to do

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages