parallel-corpora

Here are 6 public repositories matching this topic...

bitextor / bitextor

Bitextor generates translation memories from multilingual websites

Updated Nov 11, 2024
Python

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.

machine-translation neural-machine-translation parallel-corpus parallel-corpora bangla-nlp low-resource-languages bangla-machine-translation bangla-dataset-machine-translation emnlp-2020 low-resource-nlp low-resource-machine-translation

Updated Oct 23, 2024
Python

timarkh / tsakorpus

Star

Yet another search platform for linguistic corpora.

flask elasticsearch corpus linguistics corpus-linguistics corpus-tools linguistic-corpora language-documentation parallel-corpora media-aligned-corpora

Updated Feb 15, 2025
Python

korenyoni / opus-api

Star

OPUS (opus.nlpl.eu) Python3 API

python api machine-learning corpus corporate opus corpora language-model parallel-corpus parallel-corpora

Updated Nov 23, 2024
Python

rggdmonk / hadal

Star

A simple and eﬀicient tool for mining and aligning sentences with pre-trained models.

nlp machine-translation alignment nlp-library parallel-corpus sentence-alignment parallel-corpora parallel-sentence-mining

Updated May 17, 2024
Python

techiaith / alinio

Star

Cod hwyluso alinio testunau gyda hunalign a dogfennaeth ar sut i ddefnyddio LFAligner // Code for simplifying aligning texts with hunalign and documentation for LFAligner

machine-translation alignment welsh cymraeg parallel-corpora

Updated Mar 6, 2016
Python

Improve this page

Add a description, image, and links to the parallel-corpora topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the parallel-corpora topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

parallel-corpora

Here are 6 public repositories matching this topic...

bitextor / bitextor

csebuetnlp / banglanmt

timarkh / tsakorpus

korenyoni / opus-api

rggdmonk / hadal

techiaith / alinio

Improve this page

Add this topic to your repo