UnicodeDecodeError: 'ascii' codec can't decode byte 0xcb in position 2542: ordinal not in range(128) #40

docfate111 · 2020-03-29T00:49:44Z


Checklist

[ x ] I have included the [relevant portions of the] grammar used that caused the bug
[ x ] I have filled out the environment section

Environment

Platform

Windows
Mac
[ x ] Linux
Other (please specify)

Python Version

Describe the bug

The grammar file isn't opened with encoding='utf-8', so I get this error when I run main.py (https://github.com/mgree/smoosh-fuzz/blob/master/src/posix/main.py) with python3 in an Ubuntu Docker container:

  File "main.py", line 4, in <module>
    fuzzer.load_grammar('words.py')
  File "/usr/local/lib/python3.6/dist-packages/gramfuzz/__init__.py", line 125, in load_grammar
    data = f.read()
  File "/usr/lib/python3.6/encodings/ascii.py", line 26, in decode
    return codecs.ascii_decode(input, self.errors)[0]
UnicodeDecodeError: 'ascii' codec can't decode byte 0xcb in position 2542: ordinal not in range(128)
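
For anyone hitting the same traceback, here is a minimal sketch that reproduces the failure without gramfuzz. It assumes the grammar file contains a non-ASCII character stored as UTF-8 (byte 0xcb is the first byte of a two-byte UTF-8 sequence), and it passes the ASCII codec explicitly to mimic what open() appears to pick up from the container's locale. The helper names and the file contents are made up for illustration.

```python
import os
import tempfile

# Hypothetical grammar source containing one non-ASCII character.
# U+02DC (SMALL TILDE) is encoded as the bytes 0xCB 0x9C in UTF-8.
source = 'NAME = "\u02dc"\n'


def read_with(encoding):
    """Write `source` to a temp file as UTF-8, then read it back with `encoding`."""
    fd, path = tempfile.mkstemp(suffix=".py")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as f:
            f.write(source)
        with open(path, "r", encoding=encoding) as f:
            return f.read()
    finally:
        os.remove(path)


def ascii_read_fails():
    """Return the decode error reason if an ASCII read fails, else None."""
    try:
        read_with("ascii")
    except UnicodeDecodeError as exc:
        return exc.reason
    return None
```

Reading the same file back with "utf-8" succeeds, which is why the problem does not show up on machines whose default encoding is already UTF-8.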

To Reproduce

The issue is in the function load_grammar in class GramFuzzer. The open() call needs encoding='utf-8':

  with open(path, "r", encoding='utf-8') as f:
      data = f.read()
  code = compile(data, path, "exec")

Expected Behavior

I expected the same behavior as with Python 3.7 on Mac: the grammar loads and strings are generated from it.


d0c-s4vage · 2020-03-31T00:13:24Z

Oof, thanks for this! I'll take a look this week

docfate111 · 2020-04-07T18:53:10Z

May I have permission to git push please?
The solution is just 1 line of code that I wrote above.

docfate111 · 2020-04-11T23:12:11Z

The line is 124 in __init__.py. Change
with open(path, "r") as f:
to
with open(path, "r", encoding='utf-8') as f:
Otherwise this breaks on Linux.
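
A sketch of the proposed change as a standalone function, in case it helps review. This is not gramfuzz's actual code: load_source and default_text_encoding are hypothetical names, and the sketch only covers the read-and-compile step shown in the traceback.

```python
import locale


def default_text_encoding():
    """Report the locale's preferred encoding, which open() uses by default
    on Python 3.6 when no encoding= argument is passed."""
    return locale.getpreferredencoding(False)


def load_source(path):
    """Read a Python grammar file and compile it, always decoding as UTF-8."""
    # An explicit encoding makes the result independent of the locale.
    with open(path, "r", encoding="utf-8") as f:
        data = f.read()
    return compile(data, path, "exec")
```

Printing default_text_encoding() inside the failing container should show which codec open() is falling back to. If Python 2 support matters, io.open accepts the same encoding argument on both major versions, whereas the built-in open() in Python 2 does not.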

d0c-s4vage · 2020-04-23T17:10:52Z

@docfate111 Thanks for being patient on this! Things have been up in the air lately...

> May I have permission to git push please?

PRs are definitely welcome! You can fork the repository, create a new branch in your forked repository, and then submit a new PR from your branch into this project.

docfate111 added the bug and needs-triage labels Mar 29, 2020

docfate111 assigned d0c-s4vage Mar 29, 2020

d0c-s4vage mentioned this issue Apr 23, 2020

Fixes unicode decoding error when loading grammars #41

Merged

d0c-s4vage closed this as completed in #41 Apr 23, 2020
