fix(autoedit): fix shrink prediction logic #6404

valerybugakov · 2024-12-18T04:34:47Z

Follow up for autoedit: shrink prediction repro issue #6386

Test plan

CI
New unit tests
Manually tested on cody-chat-eval examples

vscode/src/autoedits/shrink-prediction.ts

hitesh-1997 · 2024-12-18T07:09:57Z

vscode/src/autoedits/shrink-prediction.ts

@@ -16,27 +14,25 @@ export function shrinkPredictionUntilSuffix(
    const suffix = codeToReplaceData.suffixInArea + codeToReplaceData.suffixAfterArea

    // Split the prediction and suffix into arrays of lines
-    const predictionLines = lines(prediction)
+    const predictionLines = lines(stripLastEmptyLineIfExists(prediction))


can we add a comment why do we want to strip the last empty line

hitesh-1997 · 2024-12-18T07:13:21Z

vscode/src/autoedits/shrink-prediction.ts

-            if (!suffixSlice[j].trim().startsWith(predictionSlice[j].trim())) {
-                matches = false
+            if (
+                (suffixSlice[j].length > 0 && predictionSlice[j].startsWith(suffixSlice[j])) ||


Why are we only checking if predictionSlice[j].startsWith(suffixSlice[j]). If we want to see that prediction lines matches with suffix, shouldn't we check === condition

See the test case with self.email = email in vscode/src/autoedits/shrink-prediction.test.ts. I want to catch cases where something is added to the end of the suffix line, too.

I don't think that should be the expectation since suffix self.email = and not self.email = email which is in the prediction. So, we can't match it suffix. There is risk of prediction getting trimmed just because suffix prefixed with prediction.

I added a test case here which will have issues because .startsWith logic just because the suffix had bunch of empty lines below and prediction added lines at the same indentation level, so prediction.startwith(suffix) becomes true because suffix is only the spaces with same indentation level, and all the prediction lines gets trimmed.

(please note that I had to turn off trim_trailing_whitespace = false in .editorconfig in our code to add trailing spaces in the code, so if we git pull and save, the trailing space would get trimmed and test case might not work, so please turn off this settting before git pull)

The partial match logic is giving us too many false positives. Even after fixing the test case you pushed (thanks for that!), I found this behavior popping up in several places during manual testing. So, I reverted the logic back to an exact match.

Could we handle this at the model layer? Is there a way to nudge the model to only make changes within the code-to-rewrite block? I guess this issue comes from the training dataset, where suffix changes sometimes get duplicated in the updated code section. Could this be the case?

You can check the test case with self.email = email to see an example. Even though this line is part of the suffix, the model still modifies it and includes it in the response. Ideally, it should only return new line insertions.

The goal of the shrink-until-suffix logic was to ensure we still show parts of suggestions that are relevant, even if they only partially match the suffix. Previously, we had logic using startsWith() that would hide the entire suggestion if its last lines were partially present in the document. Let’s chat about the best way to solve this over Zoom.

I traced the addition of this logic to the feature, which comes from this PR, which does not give much context for this specific change. I wonder how critical was the starsWith() bit back then when it was introduced and why didn't we opt-in for the exact match instead.

The reasoning behind the old logic was that, if model includes the suffix code, than they can only be new added lines from the code to rewrite, and I used startsWith there because we were not doing the line level comparison, but the whole suffix is the whole suffix in the file.
But the logic earlier also had its flaws and not very well though, but it mitigated some issues I encountered while manual testing.
I think it would be okay to just let the model output wrong predictions and we can figure out the severity from the logs.

vscode/src/autoedits/shrink-prediction.ts

hitesh-1997

I had confusion in some statements, added comments to clarify

hitesh-1997

Added test case which would have issue with the current logic, please check this commit (comment)

vscode/src/autoedits/shrink-prediction.test.ts

repro issue because of partial suffix match with predictions repro issue because of partial suffix match with predictions

fix(autoedit): fix shrink prediction logic

1ae4fa2

valerybugakov added the autoedit label Dec 18, 2024

valerybugakov requested a review from hitesh-1997 December 18, 2024 04:34

valerybugakov self-assigned this Dec 18, 2024

hitesh-1997 reviewed Dec 18, 2024

View reviewed changes

vscode/src/autoedits/shrink-prediction.ts Outdated Show resolved Hide resolved

hitesh-1997 reviewed Dec 18, 2024

View reviewed changes

vscode/src/autoedits/shrink-prediction.ts Outdated Show resolved Hide resolved

hitesh-1997 reviewed Dec 18, 2024

View reviewed changes

vscode/src/autoedits/shrink-prediction.ts Outdated Show resolved Hide resolved

hitesh-1997 requested changes Dec 18, 2024

View reviewed changes

valerybugakov added 3 commits December 18, 2024 15:23

feat(audoedit): fix the condition

8aa254f

feat(audoedit): add a comment

7ed4de3

feat(audoedit): use codeToRewrite to get the new line char

86fbe0c

valerybugakov requested a review from hitesh-1997 December 18, 2024 07:29

hitesh-1997 requested changes Dec 18, 2024

View reviewed changes

vscode/src/autoedits/shrink-prediction.test.ts Outdated Show resolved Hide resolved

repro issue because of partial suffix match with predictions

e159e22

repro issue because of partial suffix match with predictions repro issue because of partial suffix match with predictions

hitesh-1997 force-pushed the vb/shrink-prediction-fix branch from 7d781bd to e159e22 Compare December 18, 2024 16:42

valerybugakov added 2 commits December 19, 2024 10:59

fix(audoedit): use exact match

a7bdee3

Merge branch 'main' into vb/shrink-prediction-fix

1f5f544

valerybugakov requested a review from hitesh-1997 December 19, 2024 03:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(autoedit): fix shrink prediction logic #6404

fix(autoedit): fix shrink prediction logic #6404

valerybugakov commented Dec 18, 2024

hitesh-1997 Dec 18, 2024

valerybugakov Dec 18, 2024

hitesh-1997 Dec 18, 2024

valerybugakov Dec 18, 2024

hitesh-1997 Dec 18, 2024 •

edited

Loading

valerybugakov Dec 19, 2024 •

edited

Loading

valerybugakov Dec 19, 2024

valerybugakov Dec 19, 2024

hitesh-1997 Dec 19, 2024

hitesh-1997 left a comment

hitesh-1997 left a comment •

edited

Loading

fix(autoedit): fix shrink prediction logic #6404

Are you sure you want to change the base?

fix(autoedit): fix shrink prediction logic #6404

Conversation

valerybugakov commented Dec 18, 2024

Test plan

hitesh-1997 Dec 18, 2024

Choose a reason for hiding this comment

valerybugakov Dec 18, 2024

Choose a reason for hiding this comment

hitesh-1997 Dec 18, 2024

Choose a reason for hiding this comment

valerybugakov Dec 18, 2024

Choose a reason for hiding this comment

hitesh-1997 Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

valerybugakov Dec 19, 2024 • edited Loading

Choose a reason for hiding this comment

valerybugakov Dec 19, 2024

Choose a reason for hiding this comment

valerybugakov Dec 19, 2024

Choose a reason for hiding this comment

hitesh-1997 Dec 19, 2024

Choose a reason for hiding this comment

hitesh-1997 left a comment

Choose a reason for hiding this comment

hitesh-1997 left a comment • edited Loading

Choose a reason for hiding this comment

hitesh-1997 Dec 18, 2024 •

edited

Loading

valerybugakov Dec 19, 2024 •

edited

Loading

hitesh-1997 left a comment •

edited

Loading