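An intelligent BI (Business Intelligence) project built on Spring Boot and the ChatGPT API. Users only need to enter an analysis request and import XLS data; the AI then generates charts and analyzes the data, cutting the cost and improving the efficiency of data analysis.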

Java 80 13 Updated Dec 8, 2023

带带弟弟: general-purpose CAPTCHA recognition OCR, PyPI version

Python 11,047 1,890 Updated Dec 30, 2024

CAPTCHA recognition

Python 432 122 Updated Dec 12, 2024

A model that predicts the punctuation of English, Italian, French and German texts.

Python 78 14 Updated Feb 22, 2023

A python package for deep multilingual punctuation prediction.

Python 115 27 Updated Aug 21, 2024

A Fairseq fork for sequence tagging/labeling tasks

Python 31 4 Updated Jun 7, 2020

LaTeX template for East China Normal University graduate theses

TeX 97 20 Updated Mar 22, 2022

The case study and multilingual performance of the ICASSP submission

Python 20 3 Updated Sep 24, 2022

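A summary of the knowledge a natural language processing (NLP) engineer needs to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness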

Python 7,090 1,195 Updated Aug 24, 2022
Python 14 2 Updated Dec 7, 2020

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,502 520 Updated Oct 16, 2024

Python port of Moses tokenizer, truecaser and normalizer

Python 489 59 Updated May 26, 2024

Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>

Python 41 10 Updated Nov 10, 2020

This is a code repository for the ACL 2022 paper "Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation"

52 Updated May 10, 2022

Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-2021.

Python 31 1 Updated Jul 16, 2021

Tool to fix bitexts and tag near-duplicates for removal

Python 29 3 Updated Feb 5, 2025
Perl 57 6 Updated Jul 16, 2022

A list of awesome Machine Translation frameworks, libraries, software and papers

187 24 Updated Jul 15, 2024

OpusFilter - Parallel corpus processing toolkit

Python 104 20 Updated Jan 29, 2025

Korean corpus repository

Python 711 79 Updated Oct 3, 2022

Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

Python 1,212 324 Updated Oct 24, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,018 6,467 Updated Jan 9, 2025

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

Python 442 46 Updated Feb 2, 2022

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

Python 9,821 1,392 Updated Jul 31, 2023

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Python 1,800 169 Updated Apr 15, 2024

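Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character-decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name list, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …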

Python 71,075 14,689 Updated May 10, 2024

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,227 618 Updated Nov 21, 2022

Multilingual word vectors in 78 languages

Jupyter Notebook 1,195 121 Updated Mar 10, 2023

Open-Source Machine Translation Quality Estimation in PyTorch

Python 230 47 Updated Jun 23, 2022

Source code for the Apple reproduction

Jupyter Notebook 31 12 Updated Apr 23, 2021