Skip to content
This repository has been archived by the owner on May 10, 2023. It is now read-only.

Removal Request: Turkish #650

Closed
HarikalarKutusu opened this issue Nov 25, 2022 · 1 comment
Closed

Removal Request: Turkish #650

HarikalarKutusu opened this issue Nov 25, 2022 · 1 comment
Assignees

Comments

@HarikalarKutusu
Copy link
Contributor

In the last few days, 5k+ sentences are added from a Internet service provider support chat history, but they were not corrected and does not confirm to quality standards. Fortunately they contacted me and we will be working together on these.

I checked the data, they are mostly fine wrt being public domain, but there are full proper names, sentences with abbreviations (ADSL, DSL, TT Turkish Telekom etc), no punctuation, no Turkish characters (such as i is used instead of ı, we do it on mobile) etc. So, the result is not usable.

If these can be deleted, we will add corrected ones, also adding other sentences to dilute them.

Thank you
Bülent

Source:
⁨firmamız veri tabanından bu proje bizim için önemli Türkçe olarak 1500 saate yakın hedefimiz var personel mesai yapacak bunun için cümleler ekliyoruz.⁩

@MichaelKohler MichaelKohler self-assigned this Dec 1, 2022
MichaelKohler pushed a commit that referenced this issue Dec 1, 2022
# [2.19.0](v2.18.2...v2.19.0) (2022-12-01)

### Bug Fixes

* remove Turkish sentences from a specific source (fixes [#650](#650)) ([#652](#652)) ([34fd2be](34fd2be))

### Features

* add validator for Lao (lo) ([#651](#651)) ([f56d8a0](f56d8a0))
@MichaelKohler
Copy link
Member

🎉 This issue has been resolved in version 2.19.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants