-
Notifications
You must be signed in to change notification settings - Fork 238
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
a tricky mdx file #384
Comments
Looks like the file is problematic.
Can you test it with a dictionary that supports MDX, like BlueDict? |
I tested it on MDict app on Android and works fine! @xiaoqiangwang @csarron Any ideas? |
There are two such blocks in total that zlib decompress fails. They are not truncated or anything and seem like valid zlib compressed data. Without the original source text, it is hard to say wether MDict app ignores the error silently. Maybe it is worth to regenerate the mdx file and check again. Or modify readmdict.py to skip the failed blocks
|
Thank you @xiaoqiangwang @florentinovame |
It works! thanks a lot.
чт, 18 авг. 2022 г. в 23:59, Saeed Rasooli ***@***.***>:
… Thank you @xiaoqiangwang <https://github.com/xiaoqiangwang>
@florentinovame <https://github.com/florentinovame>
I pushed a commit that skips these few blocks and completes the conversion.
Please try again.
—
Reply to this email directly, view it on GitHub
<#384 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A2Q6ZU4K4XCXSB45YHDR4L3VZ2PZJANCNFSM56RRQ3PA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
for some unknown reason using dictzip makes the dictionary useless. I know that you are not to blame as the conversing process is flawless. my comment is a kind of warning to those run up against this problem (dictzip). |
MDX has internal compression (zlib / lzo), so running dictzip (which is basically gzip) on top of that won't change the size much. |
sorry for being ambiguous... I meant to compress dsl which is 2.4 gb with the help of dictzip. it turned out that dictzip has a size limitation to 1.8 gb. |
Interesting. What error do you get when file is too big? |
it silently creates a dz file, which makes goldendict crash and xarchiver is impossible to open the archive. if i try dictzip -d i get |
the manpage for dictzip says: |
Thanks. |
Can you find the exact number of bytes for the "about 1.8GB" limit? |
I am sorry, but I am not able to perform this calculation. As for the dictionary mentioned before it turned out to be not a standalone dictionary but a combination of several sources of reference. dictzip crashed long before the stated limit of 1.8GB. |
thank you for your prompt reply. I use goldendict which supports mdx based
files. I am under the illusion that ifo based dictionaries are smaller than
mdx equivalents. cambridge online dictionary has extracts from cambridge
corpus. this data is also available online but in the light of coming
closure of lexico.com one should take interest in having an offline copy.
чт, 18 авг. 2022 г. в 19:37, Saeed Rasooli ***@***.***>:
… I tested it on MDict app on Android and works fine!
@xiaoqiangwang <https://github.com/xiaoqiangwang> Any ideas?
—
Reply to this email directly, view it on GitHub
<#384 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A2Q6ZU6EI3JLING2CZ3ZQP3VZZRGNANCNFSM56RRQ3PA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Thank you for your wonderful software.
I managed to come across an mdict file which fails to convert to an ifo file.
Here is the file in question.
whenever I run pyglossary it stucks at [INFO] extracting links...
The text was updated successfully, but these errors were encountered: