Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

details about dataset .txt file encoding type. #69

Open
udita23 opened this issue Oct 10, 2023 · 1 comment
Open

details about dataset .txt file encoding type. #69

udita23 opened this issue Oct 10, 2023 · 1 comment

Comments

@udita23
Copy link

udita23 commented Oct 10, 2023

Can you please mention in what format the .txt files are encoded in the current datasets directory. I was trying out with my own custom dataset which I imported from pandas dataframe using both utf-8 and asci encoding. but I received the following error while running the train.sh script
File "/home/Social-STGCNN/utils.py", line 122, in init
data = read_file(path, delim)
File "/home/Social-STGCNN/utils.py", line 79, in read_file
for line in f:
File "/home/miniconda3/envs/newenv/lib/python3.6/codecs.py", line 321, in decode
(result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 3131: invalid start byte

@AdityaKharmaleGoat7
Copy link

Great work @abduallahmohamed !!

Could you please explain me about the attribute of the text dataset? Like the significance of the columns in the text file, so that we could also create our own similar dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants