Tahrirgoh is a web platform for dataset collection for the Grammatical Error Correction (GEC) task.

Tahrirgoh: Data Collection Platform for Grammatical Error Correction (GEC)

Tahrirgoh is a web platform developed by Tahrirchi for dataset collection for the Grammatical Error Correction (GEC) task.

Tahrirgoh is pronounced T-AH-REER-GOKH and is a made-up word in Uzbek meaning "a place to proofread".

Interface

img.png

Running locally

To run the app locally, follow the instructions in backend/README.md and frontend/README.md.

Running in production

Use the docker-compose file to run the platform on a production server. Be sure to adjust the frontend/nginx.conf file to your needs.

docker compose up -d

Workflow

After you have successfully launched the platform:

  1. Create worker users (API)
  2. Add sentences to the platform (API)
  3. Workers annotate sentences
  4. Repeat steps 2 and 3 until you have enough sentences annotated
  5. Download annotated sentences (API)

The frontend has no admin panel, but the backend exposes Swagger docs with a user-friendly interface. See the API Documentation section for how to access and use them.
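The API steps of the workflow (1, 2 and 5) can also be scripted instead of clicked through in Swagger. The following is only a minimal sketch: the base URL, the endpoint paths (`/users`, `/sentences`, `/sentences/annotated`), the payload field names, and the bearer-token header are all hypothetical placeholders, not the platform's confirmed API. Check the Swagger docs for the real routes and schemas before using it.

```python
import json
import urllib.request

# Assumption: the backend is reachable here; adjust to your deployment.
BASE_URL = "http://localhost:8000"


def build_request(method, path, payload=None, token=None):
    """Build (but do not send) a JSON request for the backend API."""
    headers = {"Accept": "application/json"}
    data = None
    if payload is not None:
        data = json.dumps(payload).encode("utf-8")
        headers["Content-Type"] = "application/json"
    if token is not None:
        # Assumption: the API accepts a bearer token; it may use another scheme.
        headers["Authorization"] = f"Bearer {token}"
    return urllib.request.Request(
        BASE_URL + path, data=data, headers=headers, method=method
    )


def create_worker_request(username, password, token=None):
    """Step 1: create a worker user (hypothetical path and fields)."""
    payload = {"username": username, "password": password}
    return build_request("POST", "/users", payload, token)


def add_sentences_request(sentences, token=None):
    """Step 2: add sentences to annotate (hypothetical path and fields)."""
    return build_request("POST", "/sentences", {"sentences": sentences}, token)


def download_annotations_request(token=None):
    """Step 5: download annotated sentences (hypothetical path)."""
    return build_request("GET", "/sentences/annotated", token=token)


def send(request):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Building the requests separately from sending them makes it easy to inspect or log each call first. For example, `send(add_sentences_request(["First sentence.", "Second sentence."]))` would submit a batch once the path and payload have been matched to the real API.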

Cite

If you use this platform in your research, please cite us.

@software{Mamasaidov_Tahrirgoh_is_a_2023,
author = {Mamasaidov, Mukhammadsaid and Yusupov, Jasur},
month = oct,
title = {{Tahrirgoh is a web platform for dataset collection for the Grammatical Error Correction (GEC) task.}},
url = {https://github.com/tahrirchi/tahrirgoh},
version = {1.0.0},
year = {2023}
}