Solve JSONDecodeError #799

HuZixia · 2024-01-26T15:38:58Z

Solve JSONDecodeError, about issue #749

To avoid JSONDecodeError:

Remove comments in output json str, after json value content, maybe start with #, maybe start with //, particularly, it is not inside the string value

Addtionly, if you do not want JSONDecodeError to occur, you can add 'Delete comments in json' after FORMAT_CONSTRAINT in action_node.py

Features

The json content returned by a LLM may contain comments, maybe start with #, maybe start with //, it's random. This can lead to subsequent json parsing errors that affect the overall code execution.
The Error has happened whether I use gpt-4-1106-preview or GLM-4.
These code changes are intended to fix this problem.

Feature Docs

Influence

These code changes are intended to fix JSONDecodeError.

Result

See the issue #749 in detail

The json comments maybe start with #

The json comments maybe start with //

So we need to fix the code repair_llm_raw_output.py.

Add the second way to fix bug about raised JSONDecodeError:
Add prompt "Delete comments in json" of FORMAT_CONSTRAINT in action_node.py, as follows:

raised JSONDecodeError as follows:

add "Delete comments in json", solve the problem:

Remove comments in output json str, after json value content, maybe start with #, maybe start with //, particularly, it is not inside the string value Addtionly, if you do not want JSONDecodeError to occur, you can add 'Delete comments in json' after FORMAT_CONSTRAINT in action_node.py

…Delete comments in json' after FORMAT_CONSTRAINT in action_node.py

metagpt/utils/repair_llm_raw_output.py

PR/# remove comments in output json str, after js

merge

better629 · 2024-01-27T08:52:12Z

metagpt/actions/action_node.py

@@ -23,7 +23,10 @@
 TAG = "CONTENT"

 LANGUAGE_CONSTRAINT = "Language: Please use the same language as Human INPUT."
-FORMAT_CONSTRAINT = f"Format: output wrapped inside [{TAG}][/{TAG}] like format example, nothing else."
+FORMAT_CONSTRAINT = (f"Format: output wrapped inside [{TAG}][/{TAG}] like format example, nothing else. "
+                     f"Delete comments in json")


due to ActionNode can generate json or markdown format example, we suggest not to add explicit json keyword in template str. Can the new added code in repair_llm_xx solved the problem, If so, maybe it's no need to add here.

Well, I agree with you.

better629 · 2024-01-27T08:53:44Z

metagpt/actions/action_node.py

+FORMAT_CONSTRAINT = (f"Format: output wrapped inside [{TAG}][/{TAG}] like format example, nothing else. "
+                     f"Delete comments in json")
+# Delete comments in json
+# If you don't want JSONDecodeError to occur, you can add Delete comments in json after FORMAT_CONSTRAINT


seems no need to add this explanation in extra lines.

better629 · 2024-01-27T08:58:11Z

metagpt/utils/repair_llm_raw_output.py

@@ -198,6 +214,12 @@ def repair_invalid_json(output: str, error: str) -> str:
            new_line = line.replace("}", "")
        elif line.endswith("},") and output.endswith("},"):
            new_line = line[:-1]
+        # remove comments in output json str, after json value content, maybe start with #, maybe start with //
+        elif rline[col_no] == "#" or rline[col_no] == "/":


since you have removed comments in the repair pipeline in repair_json_format , there is no need to do it again here.

better629 · 2024-01-27T09:07:24Z

metagpt/utils/repair_llm_raw_output.py

@@ -105,6 +105,23 @@ def judge_potential_json(routput: str, left_key: str) -> Union[str, None]:
    return output


+def remove_comments_from_line(line):


add unittest case for // like https://github.com/geekan/MetaGPT/blob/main/tests/metagpt/utils/test_repair_llm_raw_output.py#L131-L141

Modify code based on feedback of action_node.py and repair_llm_raw_output.py, add code in test_repair_llm_raw_output.py. Please have a look, thanks.

…tput.py, add code in test_repair_llm_raw_output.py

better629 · 2024-01-27T13:32:38Z

LGTM

geekan

LGTM

HuZixia added 2 commits January 26, 2024 22:59

Addtionly, if you do not want JSONDecodeError to occur, you can add '…

43b069f

…Delete comments in json' after FORMAT_CONSTRAINT in action_node.py

HuZixia had a problem deploying to unittest January 26, 2024 15:39 — with GitHub Actions Failure

geekan requested a review from better629 January 27, 2024 06:01

geekan added the bug-fix label Jan 27, 2024

geekan reviewed Jan 27, 2024

View reviewed changes

metagpt/utils/repair_llm_raw_output.py Outdated Show resolved Hide resolved

merge code with similar logic to avoid duplication

f16b247

HuZixia had a problem deploying to unittest January 27, 2024 07:32 — with GitHub Actions Failure

geekan reviewed Jan 27, 2024

View reviewed changes

PR/# remove comments in output json str, after js Outdated Show resolved Hide resolved

HuZixia added 3 commits January 27, 2024 17:00

delete PR dir

8b5f784

Merge branch 'huzixia' of github.com:HuZixia/MetaGPT into huzixia

b9a03c3

merge

delete PR dir

2361c7e

HuZixia had a problem deploying to unittest January 27, 2024 09:07 — with GitHub Actions Failure

better629 reviewed Jan 27, 2024

View reviewed changes

modify code based on feedback of action_node.py and repair_llm_raw_ou…

11f70ca

…tput.py, add code in test_repair_llm_raw_output.py

HuZixia had a problem deploying to unittest January 27, 2024 10:09 — with GitHub Actions Failure

update repair_llm_raw_output.py

c3b4c69

HuZixia had a problem deploying to unittest January 27, 2024 10:24 — with GitHub Actions Failure

geekan approved these changes Jan 28, 2024

View reviewed changes

geekan merged commit ee0801a into geekan:main Jan 28, 2024
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Solve JSONDecodeError #799

Solve JSONDecodeError #799

HuZixia commented Jan 26, 2024

better629 Jan 27, 2024 •

edited

Loading

HuZixia Jan 27, 2024

better629 Jan 27, 2024

better629 Jan 27, 2024

better629 Jan 27, 2024

HuZixia Jan 27, 2024

HuZixia Jan 27, 2024

better629 commented Jan 27, 2024

geekan left a comment

		@@ -105,6 +105,23 @@ def judge_potential_json(routput: str, left_key: str) -> Union[str, None]:
		return output


		def remove_comments_from_line(line):

Solve JSONDecodeError #799

Solve JSONDecodeError #799

Conversation

HuZixia commented Jan 26, 2024

better629 Jan 27, 2024 • edited Loading

Choose a reason for hiding this comment

HuZixia Jan 27, 2024

Choose a reason for hiding this comment

better629 Jan 27, 2024

Choose a reason for hiding this comment

better629 Jan 27, 2024

Choose a reason for hiding this comment

better629 Jan 27, 2024

Choose a reason for hiding this comment

HuZixia Jan 27, 2024

Choose a reason for hiding this comment

HuZixia Jan 27, 2024

Choose a reason for hiding this comment

better629 commented Jan 27, 2024

geekan left a comment

Choose a reason for hiding this comment

better629 Jan 27, 2024 •

edited

Loading