-
-
Notifications
You must be signed in to change notification settings - Fork 355
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
llm embed-multi --files
should handle encodings other than utf-8
#225
Comments
In the debugger:
I think it's this:
|
So... should the |
I'm tempted to go with the cheapest option: fall back to Another option would be to support a |
Code at fault: Lines 1083 to 1087 in 206e691
|
llm embed-multi --files
error: UnicodeDecodeError: 'utf-8' codec can't decode byte 0x96 in position 522llm embed-multi --files
should handle encodings other than utf-8
(I love the way you've documented things like this even when you were talking to yourself. It means that if someone has a similar problem. they can find a relevant issue. Am just posting here rather than opening a new issue but will if you want. ) I've been using folders of old (mostly plain text) files full of notes from meetings. They are all a bit ropey as they've been in different systems and encodings over time. I'm sending them to a model by
Where the offending line was I can weed them out manually but having |
Got this while running this command:
Traceback:
The text was updated successfully, but these errors were encountered: