Bitextor generates translation memories from multilingual websites
-
Updated
Nov 11, 2024 - Python
Bitextor generates translation memories from multilingual websites
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Yet another search platform for linguistic corpora.
OPUS (opus.nlpl.eu) Python3 API
A simple and efficient tool for mining and aligning sentences with pre-trained models.
Cod hwyluso alinio testunau gyda hunalign a dogfennaeth ar sut i ddefnyddio LFAligner // Code for simplifying aligning texts with hunalign and documentation for LFAligner
Add a description, image, and links to the parallel-corpora topic page so that developers can more easily learn about it.
To associate your repository with the parallel-corpora topic, visit your repo's landing page and select "manage topics."