Showcase off-by-one error in language-html #99

winstliu · 2017-09-20T09:58:37Z

This PR adds a failing test to showcase an off-by-one error that can occasionally occur. It was originally discovered in atom/atom#14982, though it can now be reproduced by using atom/language-html#170 as well. Note: I had to bring in a more recent version of language-css so that the proper rules could be hooked up. html-with-css.cson is a very minimal example grammar that will reproduce the issue, given <span style="s:"></span>.

I will attempt to revert the caching changes to see if they were the cause of this regression.

/cc @maxbrunsfeld

winstliu · 2017-09-20T10:32:40Z

Interestingly, this test still seems to fail in the 6.x versions (and 5.x throws errors when trying to start specs). I'll try copying over the repro case from atom/atom#14982 to see if that one's any different.

Ingramz · 2017-09-20T16:04:12Z

The cause for atom/atom#14982 seems to be 7c16504#diff-30b5df382fb36749d0d1d282b2df9a53

Rule::getNextTags doesn't handle it well when a newline is appended along this path. If you are saying that the html test fails with versions 7.0.2 and lower, these issues must be different.

winstliu · 2017-09-20T16:39:01Z

Ok, I'll open a new PR for that one.

maxbrunsfeld · 2017-09-20T18:29:35Z

⚡ Thanks so much for moving this forward @50Wliu! The theory about the appended newline seems very promising. You're sure it's failing in the same way w/ #94 reverted?

Ingramz · 2017-09-20T18:33:13Z

@maxbrunsfeld the following patch fixes both cases for me, and the rest of the specs pass. I am not sure what it might break, though.

diff --git a/src/grammar.coffee b/src/grammar.coffee
index 82fd12c..7880b8b 100644
--- a/src/grammar.coffee
+++ b/src/grammar.coffee
@@ -98,7 +98,7 @@ class Grammar
   # * `ruleStack` An {Array} of rules representing the tokenized state at the
   #   end of the line. These should be passed back into this method when
   #   tokenizing the next line in the file.
-  tokenizeLine: (inputLine, ruleStack, firstLine=false, compatibilityMode=true) ->
+  tokenizeLine: (inputLine, ruleStack, firstLine=false, compatibilityMode=true, withNewLine=true) ->
     tags = []
 
     truncatedLine = false
@@ -108,7 +108,8 @@ class Grammar
     else
       line = inputLine
 
-    string = new OnigString(line + '\n')
+    string = new OnigString(line)
+    stringWithNewLine = if withNewLine then new OnigString(line + '\n') else string
 
     if ruleStack?
       ruleStack = ruleStack.slice()
@@ -139,7 +140,7 @@ class Grammar
         truncatedLine = true
         break
 
-      if match = _.last(ruleStack).rule.getNextTags(ruleStack, string, position, firstLine)
+      if match = _.last(ruleStack).rule.getNextTags(ruleStack, string, stringWithNewLine, position, firstLine)
         {nextTags, tagsStart, tagsEnd} = match
 
         # Unmatched text before next tags
diff --git a/src/pattern.coffee b/src/pattern.coffee
index 2aff1d4..30a2c91 100644
--- a/src/pattern.coffee
+++ b/src/pattern.coffee
@@ -171,7 +171,7 @@ class Pattern
 
   tagsForCaptureRule: (rule, line, captureStart, captureEnd, stack) ->
     captureText = line.substring(captureStart, captureEnd)
-    {tags} = rule.grammar.tokenizeLine(captureText, [stack..., {rule}])
+    {tags} = rule.grammar.tokenizeLine(captureText, [stack..., {rule}], false, true, false)
 
     # only accept non empty tokens that don't exceed the capture end
     openScopes = []
diff --git a/src/rule.coffee b/src/rule.coffee
index 3301060..84d7528 100644
--- a/src/rule.coffee
+++ b/src/rule.coffee
@@ -96,8 +96,8 @@ class Rule
       @normalizeCaptureIndices(lineWithNewline, result.captureIndices)
       result
 
-  getNextTags: (ruleStack, line, position, firstLine) ->
-    result = @findNextMatch(ruleStack, line, position, firstLine)
+  getNextTags: (ruleStack, line, lineWithNewline, position, firstLine) ->
+    result = @findNextMatch(ruleStack, lineWithNewline, position, firstLine)
     return null unless result?
 
     {index, captureIndices, scanner} = result
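
As a rough illustration of why an appended newline can matter when tokenizing capture text, here is a toy sketch in TypeScript. It does not use first-mate or Oniguruma: `matchEnd` and `colonRule` are hypothetical stand-ins, and the rule is only assumed to resemble one that consumes whitespace after a colon. It shows that a match against the capture text plus '\n' can end one character past the capture itself, while text with a value after the colon is unaffected.

```typescript
// Toy model only: `matchEnd` is a hypothetical helper, not part of first-mate.
// It returns the end offset of the first match of `pattern` in `text`,
// or -1 if there is no match.
function matchEnd(pattern: RegExp, text: string): number {
  const m = pattern.exec(text);
  return m === null ? -1 : m.index + m[0].length;
}

// Assumed stand-in for a grammar rule: a colon plus any trailing whitespace.
const colonRule = /:\s*/;

const emptyValue = "s:";        // capture text with nothing after the colon
const filledValue = "top: 1px"; // capture text with a value after the colon

// Matching the capture text on its own stays within its bounds.
const endPlain = matchEnd(colonRule, emptyValue);

// With "\n" appended, \s* also consumes the newline, so the match
// ends one character past the end of the capture text.
const endWithNewline = matchEnd(colonRule, emptyValue + "\n");

// With a value present, \s* stops before "1", so the newline is never reached.
const endFilled = matchEnd(colonRule, filledValue + "\n");

console.log(endPlain, endWithNewline, endFilled);
```

In this toy model only the empty-value case overshoots, which is at least consistent with the patch passing withNewLine=false from tagsForCaptureRule. Whether this is the exact mechanism inside Rule::getNextTags is an assumption.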

maxbrunsfeld · 2017-09-20T18:41:56Z

@50Wliu Just so I understand your unit test, will the error reproduce if we put valid CSS in the style value, like top: 1px or something? Or does it only happen if we have some partial content like s:?

maxbrunsfeld · 2017-09-20T18:42:36Z

@Ingramz Awesome! Want to open a PR that applies that patch with this branch as the base branch?

winstliu · 2017-09-20T18:47:08Z

I'm fairly confident that #94 was not the change responsible, because I checked out v6.3.0, nuked node_modules, and the tests still failed.

And the off-by-one error only reproduces when there's no property-value present. Once you add a non-space after the colon, the error disappears. So top: 1px is fine.

Showcase off-by-one error in language-html

327f6e0

winstliu force-pushed the wl-tags-off-by-one branch from dc917f7 to 327f6e0 on September 20, 2017 10:00

Fix test to actually pass when it should

4f864a2

Ingramz mentioned this pull request Sep 21, 2017

Fix insertion of newline characters to the end of lines #100

Merged

maxbrunsfeld closed this in #100 Sep 22, 2017

winstliu deleted the wl-tags-off-by-one branch September 22, 2017 17:30
