Can not download DI_Vietnamese-UVD dataset #641

Closed
KhuongDuy-Nguyen opened this issue Feb 7, 2023 · 6 comments

KhuongDuy-Nguyen commented Feb 7, 2023

When I try to download the DI_Vietnamese-UVD dataset with underthesea download-data DI_Vietnamese-UVD, I get this error. How can I fix it?

image

rain1024 (Contributor) commented Feb 8, 2023

@KhuongDuy-Nguyen Thanks for reporting the issue.

The dataset's filename was configured incorrectly. I've fixed it and released a new version, 6.1.0. Please update underthesea and let me know whether the command now works.
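Before retrying the download, it can help to confirm that the upgrade actually took effect in the environment where the command runs. The following is a minimal sketch, not part of underthesea: the helper names parse_version, is_at_least, and installed_underthesea_version are hypothetical, and the parsing assumes a plain numeric X.Y.Z version string such as the 6.1.0 mentioned above.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn a plain 'X.Y.Z' string into a tuple of ints (assumes numeric parts only)."""
    return tuple(int(part) for part in text.split(".")[:3])


def is_at_least(installed, required):
    """Return True when the installed version is the required one or newer."""
    return parse_version(installed) >= parse_version(required)


def installed_underthesea_version():
    """Return the installed underthesea version string, or None if it is not installed."""
    try:
        return version("underthesea")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    current = installed_underthesea_version()
    if current is None:
        print("underthesea is not installed in this environment")
    elif is_at_least(current, "6.1.0"):
        print(f"underthesea {current} includes the filename fix")
    else:
        print(f"underthesea {current} is older than 6.1.0; upgrade it first")
```

Comparing tuples of integers rather than raw strings avoids ranking "6.10.0" below "6.9.0".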

rain1024 added the bug label Feb 8, 2023
KhuongDuy-Nguyen (Author) commented Feb 8, 2023

@rain1024 I ran cmd as administrator and restarted my laptop, but I still get the error.

image

Btw, when I check all of them, other datasets still give the same error.

image

image

image

image

rain1024 (Contributor) commented Feb 10, 2023

@KhuongDuy-Nguyen Thanks for your comment

The process cannot access the file because it is being used by another process...

The bug related to opening a zip file has been fixed.

I also fixed errors related to the UTS2017_BANK, VNESES, CP_Vietnamese-UNC, and VNTC datasets.

Please update underthesea to version 6.1.1 and let me know whether it now works.
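For context, this Windows error typically appears when a file is deleted or moved while a handle to it is still open. The sketch below is an assumption about the general pattern, not underthesea's actual code: the function extract_and_remove is hypothetical, and it shows one common way to avoid the problem, which is to close the archive with a context manager before removing it.

```python
import os
import tempfile
import zipfile


def extract_and_remove(zip_path, target_dir):
    """Extract zip_path into target_dir, then delete the archive."""
    # The with-block closes the archive handle as soon as extraction finishes.
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target_dir)
        names = archive.namelist()
    # The handle is closed here, so the removal does not hit a file that is still in use.
    os.remove(zip_path)
    return names


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        zip_path = os.path.join(tmp, "demo.zip")
        # Build a tiny archive so the sketch runs without any download.
        with zipfile.ZipFile(zip_path, "w") as archive:
            archive.writestr("hello.txt", "xin chao")
        print(extract_and_remove(zip_path, os.path.join(tmp, "out")))
```

Calling os.remove inside the with-block instead would be the failing variant on Windows, because the archive would still be open at that point.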

KhuongDuy-Nguyen (Author) commented Feb 10, 2023

@rain1024 I can download them now, but I can't open DI_Vietnamese-UVD.

image

rain1024 (Contributor) commented Feb 10, 2023

@KhuongDuy-Nguyen The "UVD.bin" file is a dump created by pickle. To access its contents, you can use the following code:

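import pickle

# Path to the downloaded dump; adjust it to where UVD.bin was saved.
filepath = "UVD.bin"

# Open in binary mode ("rb"), since pickle dumps are not text files.
with open(filepath, "rb") as f:
    data = pickle.load(f)

# Print each key, followed by the value stored under it.
for item in data:
    print(item)
    print(data[item])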
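Printing every entry can flood the terminal if the dump is large, so a short summary may be a more comfortable first look. This is a minimal sketch under assumptions: the helpers summarize and load_and_summarize are hypothetical, and nothing here is specific to the real layout of UVD.bin beyond the snippet above treating it as a mapping. Keep in mind that pickle.load can execute arbitrary code, so only unpickle files from a source you trust.

```python
import pickle


def summarize(data, limit=5):
    """Return short text lines describing an unpickled object."""
    lines = [f"type: {type(data).__name__}"]
    if isinstance(data, dict):
        lines.append(f"entries: {len(data)}")
        # Show only the first few keys and the type of each value.
        for key in list(data)[:limit]:
            lines.append(f"{key!r} -> {type(data[key]).__name__}")
    elif isinstance(data, (list, tuple)):
        lines.append(f"entries: {len(data)}")
        # Show only the first few items.
        for item in data[:limit]:
            lines.append(repr(item))
    return lines


def load_and_summarize(filepath, limit=5):
    """Unpickle filepath and summarize its contents."""
    with open(filepath, "rb") as f:
        data = pickle.load(f)
    return summarize(data, limit)


if __name__ == "__main__":
    # Demo on an in-memory object; for the real file, call
    # load_and_summarize("UVD.bin") with the path adjusted as needed.
    for line in summarize({"a": 1, "b": [2, 3]}):
        print(line)
```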

KhuongDuy-Nguyen (Author) commented

@rain1024 Got it, thanks for your support

This issue was closed.