-
Hi Antoine, Thanks for your interest.
All I know currently is that my paper was accepted on the 6th of August, 2021, in ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP). I don't know when they will publish it. I hope that will be by the end of this year. But I have no guarantee.
Tashkeela processed. I used only a subset of it, as described in the article (the first 10 files for training, the first file for both validation and testing). I wanted to use all of it, but due to the COVID-19 outbreak I lost access to the server I had been using, so I had to continue my research with limited resources.
On the testing file, when counting all Arabic letters at all positions, DER = 3% and WER = 8.99%
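For readers unfamiliar with the two metrics, here is a minimal sketch of how DER (diacritic error rate) and WER (word error rate) are commonly computed when every Arabic letter at every position is counted. It is not the evaluation script used for the paper; the function names `split_word` and `error_rates`, the choice of diacritic range (U+064B to U+0652), and the requirement that both texts share the same undiacritized letters are all assumptions made for illustration.

```python
# Assumption: the diacritics of interest are the eight harakat U+064B..U+0652
# (tanween, fatha, damma, kasra, shadda, sukun).
DIACRITICS = {chr(c) for c in range(0x064B, 0x0653)}


def is_arabic_letter(ch):
    """True for basic Arabic letters (U+0621..U+064A), excluding tatweel."""
    return "\u0621" <= ch <= "\u064A" and ch != "\u0640"


def split_word(word):
    """Split one word into (base character, attached diacritics) pairs."""
    pairs = []
    for ch in word:
        if ch in DIACRITICS:
            if pairs:  # a diacritic with no preceding character is ignored
                base, marks = pairs[-1]
                pairs[-1] = (base, marks + ch)
        else:
            pairs.append((ch, ""))
    return pairs


def error_rates(reference, prediction):
    """Return (DER, WER) as fractions between 0 and 1.

    DER: share of Arabic letters whose diacritics differ from the reference.
    WER: share of words containing at least one such letter.
    """
    ref_words, pred_words = reference.split(), prediction.split()
    if len(ref_words) != len(pred_words):
        raise ValueError("texts must contain the same number of words")

    letters = letter_errors = words = word_errors = 0
    for ref_word, pred_word in zip(ref_words, pred_words):
        ref_pairs, pred_pairs = split_word(ref_word), split_word(pred_word)
        if [b for b, _ in ref_pairs] != [b for b, _ in pred_pairs]:
            raise ValueError("texts must share the same base characters")

        word_has_letter = word_is_wrong = False
        for (base, ref_marks), (_, pred_marks) in zip(ref_pairs, pred_pairs):
            if not is_arabic_letter(base):
                continue
            word_has_letter = True
            letters += 1
            # Sort the marks so that shadda+vowel order does not matter.
            if sorted(ref_marks) != sorted(pred_marks):
                letter_errors += 1
                word_is_wrong = True

        if word_has_letter:
            words += 1
            word_errors += word_is_wrong

    der = letter_errors / letters if letters else 0.0
    wer = word_errors / words if words else 0.0
    return der, wer
```

Called as `error_rates(gold_text, predicted_text)`, it returns both rates as fractions; multiply by 100 to compare with the percentages quoted above. Letters that carry no diacritic in the reference are still counted, which is what "all Arabic letters at all positions" implies here.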
The model included in this repository is already trained; otherwise, the diacritization would be completely random, which is not the case, as you can see if you have tried the web interface. The default weights file is
I have added all the required instructions in the
I hope my comments are helpful.
Best regards,
Hamza.
-
Hi Hamza, Thanks for your swift reply.
Too bad... :( What kind of server would you need to run the training on the whole dataset?
Thanks. Have you already deployed it on your own Heroku account? What I would love is a link that I could send to friends (who don't know how to code at all) so that they can try it and play with it. I thought you might have already done that.
-
Hi @a455bcd9, the link to the paper is now available in the README.md, in case you need it.
-
Hi Hamza,
Thanks a lot for your amazing work!
I wondered:
I asked these questions because I would love to use (or build) a Chrome extension that automatically adds diacritics to Arabic text online (for instance if the page or some paragraphs are tagged as lang=ar).
Thanks for any help you can provide.
Kind regards,
Antoine