Something changed with printing, now syntax highlighting is confused #2628

harold · 2019-04-23T21:18:46Z

Actual behavior

I often return sequences of maps to have them printed in the repl. Since upgrading to 0.21.0 I am seeing a lot of confused syntax highlighting like this:

Steps to reproduce the problem

Evaluating this form in the repl is enough to do it:

(for [i (range 40)]
     {:a (java.util.UUID/randomUUID)
     :b "String String String String String String String"})

Environment & Version information

CIDER version information

;; CIDER 0.21.0 (New York), nREPL 0.6.0
;; Clojure 1.9.0, Java 1.8.0_171

Lein/Boot version

$ lein --version
Leiningen 2.9.1 on Java 1.8.0_171 Java HotSpot(TM) 64-Bit Server VM

Emacs version

GNU Emacs 26.2 (build 2, x86_64-pc-linux-gnu, GTK+ Version 3.22.30) of 2019-04-12

Operating system

Ubuntu 18.04

PS. I love Cider it's the best part of my day every day. <3

The text was updated successfully, but these errors were encountered:

bbatsov · 2019-04-24T02:59:49Z

I've noticed this as well recently. Probably it's some regression related to the REPL performance work done by @cichli in that release. We'll have to investigate this further.

bbatsov · 2019-04-24T03:00:17Z

PS. I love Cider it's the best part of my day every day. <3

Thanks for the kind words! 🙇

harold · 2019-04-24T03:03:39Z

We'll have to investigate this further.

Great. Let me know if there's anything I can do to help.

dpsutton · 2019-04-24T03:04:57Z

@harold i feel like i've seen this for a while so:

thanks for reporting. things don't happen without that
do you know which version you were last on that didn't have this bug?

harold · 2019-04-24T03:14:07Z

On my main machine I have this in package-list-packages:

0.16.0 I used for a long time, and never saw this particular brand of syntax highlighting confusion.

0.20.0 I used for a much shorter time, and don't recall seeing this, but it could be in there.

For a long time (years) I have had this in my config as well: (setq cider-print-fn 'puget), but after reading about the print improvements in 0.21.0 I commented it out. So if puget use could mask this, it may have been around for a long time.

Does that help/make sense?

dpsutton · 2019-04-24T03:17:35Z

more info is always helpful :)

i'm thinking this has been there for a while and I may have been the one to break it a few years ago. The breaks in font-locking happen when the response is split over several messages:

(<--
  id         "10"
  session    "714f85bb-5674-4149-8a7e-a08d8f4318fd"
  time-stamp "2019-04-23 22:13:51.769192492"
  value      "({:a0 #uuid "5e7120c0-38a9-4087-a4b5-156d92337968",
  :b0 "S..."
)
(<--
  id         "10"
  session    "714f85bb-5674-4149-8a7e-a08d8f4318fd"
  time-stamp "2019-04-23 22:13:51.811907403"
  value      "bb-2cbf37bdf8a5",
  :b9 "String String String String String ..."
)
(<--
  id         "10"
  session    "714f85bb-5674-4149-8a7e-a08d8f4318fd"
  time-stamp "2019-04-23 22:13:51.813519873"
  value      "
  :b18 "String String String String String String String"}
..."
)

so here it begins a new message in the middle of a response with "bb...". And the font locking breaks there:

I remember there's a function that has to do with context and ansi coloring and I'm looking for that now.

It would be nice if you could confirm if puget as pretty printer can suppress this behavior

harold · 2019-04-24T03:24:38Z

Ok, just tried w/ puget, syntax highlighting gets lost at the exact same spot as in my first screenshot (see the first gray 'g' about halfway through the print).

So puget no effect as far as I can tell.

Malabarba · 2019-04-29T12:06:11Z

I get the same bug in other types of buffer (like compilation-mode, or inf-ruby) whenever there's a lot of output being produced really fast. I think it's just a matter of emacs font-locking "skipping" portions of the buffer when it can't keep-up with the speed of the output.

harold · 2019-04-29T15:21:51Z

I get the same bug in other types of buffer

Be that as it may, some recent change has exacerbated the problem for the Cider repl - since upgrading I experience this on basically every print, while before I don't recall ever having seen it.

bbatsov · 2019-05-11T20:39:06Z

I'm almost certain this was broken by the streamed printing. It's like the font-locking got applied to the individual chunks that were streamed in the REPL independently, instead of to the REPL buffer as a whole. Someone will have to dig a bit deeper into that problem.

stale · 2019-08-09T20:40:27Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contribution and understanding!

harold · 2020-04-24T00:09:54Z

@bbatsov - Is it true that you're being paid to work on this project again?

https://www.clojuriststogether.org/news/q2-2020-funding-announcement/

Found out today (on the anniversary of this issue Apr 23!).

This must be a sign.

Let's fix this issue! What can I do to help?

bbatsov · 2020-04-24T05:03:22Z

Must be a sign indeed! 😆 I'll see what I can do about this, but I first have to wrap up some new nREPL middleware that I'm working on currently.

harold · 2020-04-24T12:41:20Z

Good, good, good. This is tremendous news. Long live CIDER.

bbatsov · 2020-04-25T09:32:00Z

I didn't mention this before, but a simple workaround is to just disable font-locking in the REPL completely:

(seq cider-repl-use-clojure-font-lock nil)

I'm almost certain this was broken by the streamed printing. It's like the font-locking got applied to the individual chunks that were streamed in the REPL independently, instead of to the REPL buffer as a whole. Someone will have to dig a bit deeper into that problem.

Seems I was right in the past. The potential solutions I see for this are:

add an option to disable streaming in the REPL, which would fix the font-locking, but degrade performance
try to find some way to force the font-locking of results to redone once they are fully sent (this would probably have some impact on performance as well)

bbatsov · 2020-04-25T09:37:22Z

For more context - the problem is here:

(defun cider-repl-emit-result (buffer string show-prefix &optional bol)
  "Emit into BUFFER the result STRING and mark it as an evaluation result.
If SHOW-PREFIX is non-nil insert `cider-repl-result-prefix' at the beginning
of the line.  If BOL is non-nil insert at the beginning of the line."
  (with-current-buffer buffer
    (save-excursion
      (cider-save-marker cider-repl-output-start
        (goto-char cider-repl-output-end)
        (when (and bol (not (bolp)))
          (insert-before-markers "\n"))
        (when show-prefix
          (insert-before-markers (propertize cider-repl-result-prefix 'font-lock-face 'font-lock-comment-face)))
        (if cider-repl-use-clojure-font-lock
            (progn
              (insert-before-markers (cider-font-lock-as-clojure string)))
          (cider-propertize-region
              '(font-lock-face cider-repl-result-face rear-nonsticky (font-lock-face))
            (insert-before-markers string)))))))

As you can see we font-lock the result in chunks (they weren't chunks before nREPL 0.6, though) and this is where the breakage is coming from. Anyways, I'll figure something out.

bbatsov · 2020-04-25T11:06:22Z

After playing with the code a bit more I'll probably just remove the font-locking completely. It's going to be a big mess to keep track of the result boundaries and fontify them, so I guess we're better off without this.

harold · 2020-04-25T12:35:27Z

remove the font-locking completely

Does this imply losing syntax highlighting on printed results in the repl?

harold · 2020-04-25T12:42:24Z

One more neophyte question, how is the chunk-size determined?

If I understand your diagnosis correctly, it seems like the likelihood of a font-locking error is inversely proportionate to this 'chunk size' (bigger chunks are more likely to be font-locked correctly because there are fewer boundaries per result).

If the chunk size were enormous, or perhaps configurable, one could tune it to encounter this bug less.

Thanks again for looking into this, anyone doing anything nontrivial with CIDER is hitting this every day.

WorldsEndless · 2020-04-25T12:45:33Z

I have just received with this issue for time immemorial. I haven't ever expected repl to give pretty feedback for all but small data. You're telling me there's supposed to be another way?

bbatsov · 2020-04-25T15:09:50Z

Does this imply losing syntax highlighting on printed results in the repl?

Yes. I simply don't see a simple way to preserve streaming the result in chunks and do the font-locking effectively. The current approach was meant for the case when the entire result comes in one piece.

If the chunk size were enormous, or perhaps configurable, one could tune it to encounter this bug less.

True, but that you defeat the purpose of streaming the result in chunks, right? :-) The chunk size is configurable, but so is using streaming or not. However, without streaming if you make a mistake and print something big this will lock-up both nREPL and CIDER, which is why we introduced this feature in the first place.

Thanks again for looking into this, anyone doing anything nontrivial with CIDER is hitting this every day.

Well, it's annoying for me as well, especially now that I'm actually doing some Clojure development again. :-)

I have just received with this issue for time immemorial. I haven't ever expected repl to give pretty feedback for all but small data. You're telling me there's supposed to be another way?

In the past the result was returned from the server in one piece and this was font-locked just before it was inserted in the REPL. Now the server splits the result in 1024 byte chunks (by default) and because they are font-locked individually this breaks the font-locking on anything bigger.

As noted above I don't see any reasonable way around this, as we can't really collect all results and print them in bulk - that'd undo the massive performance gains of streaming results in chunks and would make printing uninterruptible. I can write some logic that constantly goes back to the expression boundary and font-locks the code again, but that'd be super slow and quite complex.

That's why it seems to me this font-locking should go away completely. I can also expose the streaming settings to the end users, so they can disable it if they want to or increase the result chunk size.

harold · 2020-04-25T15:38:31Z

I got it now. Thanks for the clarification. I think the screenshot in the original report is actually typical.

Syntax highlighting on printed results is too good to lose.

What about just cranking the default chunk size? Like, maybe 8x? Was the current size chosen carefully? Does it optimize some other concern?

If font locking breaks on ginormous prints that’s definitely better than bringing Emacs and cider down (we’ve all done that).

What do you think about just increasing the chunk size? Can you tell me how to do it locally so I can test for myself?

Welcome back into the fold, can’t express how sweet it is to have you back.

dpsutton · 2020-04-25T16:06:25Z

is it conceivable to not font-lock on messages until we get a done message and then call (font-lock-fontify-region beginning-of-first-message point-at-done-message-time) or something equivalent?

bbatsov · 2020-04-25T16:11:52Z

It is, but it's not very easy and it will also result in you seeing bigger results without font-locking until they are streamed until the end. The main complexity comes from the fact you can't just take everything between the previous and the next prompt, as some of it might be output.

dpsutton · 2020-04-25T16:14:04Z

good point. Does the result viewer functionality give us some breathing room? Maybe the repl output isn't font-locked but you could call some type of cider-inspect-last-result and see a font-locked-version if needed?

bbatsov · 2020-04-25T16:17:27Z

What about just cranking the default chunk size? Like, maybe 8x? Was the current size chosen carefully? Does it optimize some other concern?

Frankly, I don't remember how we chose the default. :D Maybe increasing it 4/8x is fine. 1k buffer-sizes are relatively small indeed.

What do you think about just increasing the chunk size? Can you tell me how to do it locally so I can test for myself?

You can redefine the cider--nrepl-print-request-map function like this:

(defun cider--nrepl-print-request-map (&optional right-margin)
  "Map to merge into requests that require pretty-printing.
RIGHT-MARGIN specifies the maximum column-width of the printed result, and
is included in the request if non-nil."
  (let* ((width-option (cider--print-option "right-margin" cider-print-fn))
         (print-options (thread-last
                            (map-merge 'hash-table
                                       `((,width-option ,right-margin))
                                       cider-print-options)
                          (map-pairs)
                          (seq-mapcat #'identity)
                          (apply #'nrepl-dict))))
    (map-merge 'list
               `(("nrepl.middleware.print/stream?" "1")
                  ("nrepl.middleware.print/buffer-size" "8192"))
               (when cider-print-fn
                 `(("nrepl.middleware.print/print" ,(cider--print-fn))))
               (when cider-print-quota
                 `(("nrepl.middleware.print/quota" ,cider-print-quota)))
               (unless (nrepl-dict-empty-p print-options)
                 `(("nrepl.middleware.print/options" ,print-options))))))```

It's current definition doesn't specify an explicit buffer-size, but I can expose this via a defcustom.

bsless · 2020-04-25T21:11:49Z

Is it possible to perhaps play dynamically with the quota (looking at replying-PrintWriter) such that streamed chunks will end at distinct Clojure objects? It might require double buffering but if the next object can only be partially added to the stream, maybe it should be moved to the next chunk?
This could enable maintaining correct syntax highlighting with higher probability with less need to increase buffer size (it might have to be temporarily exceeded for hefty object though)
What do you think, could this work?

harold · 2020-04-26T01:33:48Z

Ok! We've learned a lot.

There are a couple of typos in what you shared, @bbatsov - but I got this working:

(defun cider--nrepl-print-request-map (&optional right-margin)
  "Map to merge into requests that require pretty-printing.
RIGHT-MARGIN specifies the maximum column-width of the printed result, and
is included in the request if non-nil."
  (let* ((width-option (cider--print-option "right-margin" cider-print-fn))
         (print-options (thread-last
                            (map-merge 'hash-table
                                       `((,width-option ,right-margin))
                                       cider-print-options)
                          (map-pairs)
                          (seq-mapcat #'identity)
                          (apply #'nrepl-dict))))
    (map-merge 'list
               `(("nrepl.middleware.print/stream?" "1")
                 ("nrepl.middleware.print/buffer-size" 8192))
               (when cider-print-fn
                 `(("nrepl.middleware.print/print" ,(cider--print-fn))))
               (when cider-print-quota
                 `(("nrepl.middleware.print/quota" ,cider-print-quota)))
               (unless (nrepl-dict-empty-p print-options)
                 `(("nrepl.middleware.print/options" ,print-options))))))

And with that, the original code in the report works nicely:

Now, 8192 is a number, but 16384, 32768, and 65536 are also numbers. The last one maybe being the best choice here as Gates once said, "640K ought to be enough for anyone." .. err maybe 1/10th that, or well, whatever.

Some of the more clever solutions proposed in this issue sound interesting, but I think for simplicity my vote would currently be for just cranking this print buffer size a bit. For sure, turning off font locking by default for repl prints is bad.

One other interesting tidbit I discovered in playing today is that there seems to be some kind of upper bound on the size of output that CIDER will attempt to font-lock at all.

This (with a suitably large buffer) works:

But this (regarless of the buffer size), just gives up:

This all seems super-reasonable to me; and I think we're dialing in toward something great here.

Thanks everyone.

dpsutton · 2020-04-26T01:57:49Z

I like the abandoned font locking much more than inconsistent font locking this seems like a good avenue as a first measure.

…

On Sat, Apr 25, 2020 at 8:34 PM Harold ***@***.***> wrote: Ok! We've learned a lot. There are a couple of typos in what you shared, @bbatsov <https://github.com/bbatsov> - but I got this working: (defun cider--nrepl-print-request-map (&optional right-margin) "Map to merge into requests that require pretty-printing.RIGHT-MARGIN specifies the maximum column-width of the printed result, andis included in the request if non-nil." (let* ((width-option (cider--print-option "right-margin" cider-print-fn)) (print-options (thread-last (map-merge 'hash-table `((,width-option ,right-margin)) cider-print-options) (map-pairs) (seq-mapcat #'identity) (apply #'nrepl-dict)))) (map-merge 'list `(("nrepl.middleware.print/stream?" "1") ("nrepl.middleware.print/buffer-size" 8192)) (when cider-print-fn `(("nrepl.middleware.print/print" ,(cider--print-fn)))) (when cider-print-quota `(("nrepl.middleware.print/quota" ,cider-print-quota))) (unless (nrepl-dict-empty-p print-options) `(("nrepl.middleware.print/options" ,print-options)))))) And with that, the original code in the report works nicely: [image: image] <https://user-images.githubusercontent.com/7443/80294879-c3b4be80-872a-11ea-8ecf-925ac342d08f.png> Now, 8192 is a number, but 16384, 32768, and 65536 are also numbers. The last one maybe being the best choice here as Gates once said, "640K ought to be enough for anyone." .. err maybe 1/10th that, or well, whatever. Some of the more clever solutions proposed in this issue sound interesting, but I think for simplicity my vote would currently be for just cranking this print buffer size a bit. For sure, turning off font locking by default for repl prints is bad. One other interesting tidbit I discovered in playing today is that there seems to be some kind of upper bound on the size of output that CIDER will attempt to font-lock at all. This (with a suitably large buffer) works: [image: image] <https://user-images.githubusercontent.com/7443/80294949-7d139400-872b-11ea-9e29-fb190c984179.png> But this (regarless of the buffer size), just gives up: [image: image] <https://user-images.githubusercontent.com/7443/80294960-974d7200-872b-11ea-9171-be71448b3f44.png> This all seems super-reasonable to me; and I think we're dialing in toward something great here. Thanks everyone. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#2628 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABQU6TPX3JV6DUQ4Q4OPCSTROOFQXANCNFSM4HH6PZKA> .

bbatsov · 2020-04-26T08:20:19Z

There are a couple of typos in what you shared

Sorry about this! I've updated my example. Code editing outside of Emacs is not my strong suit! :D

One other interesting tidbit I discovered in playing today is that there seems to be some kind of upper bound on the size of output that CIDER will attempt to font-lock at all.

There's no limit in CIDER's font-locking function, so I'm guessing there's some Emacs limit after which font-locking doesn't kick-in at all.

I'll think a bit more about the solution, but I'm leaning towards disabling this by default as I tend to value simplicity and consistency.

harold · 2020-04-26T13:00:39Z

I'm leaning towards disabling this by default as I tend to value simplicity and consistency

To be clear, the current behavior (inconsistent font locking) is much better than having no font locking (syntax highlighting) on printed repl results.

I never thought logging this issue would lead to losing syntax highlighting functionality.

Increasing nrepl.middleware.print/buffer-size is minimally invasive, fixes the problem as originally reported, and preserves imporant CIDER functionality (human readability of printed repl results).

pbwolf · 2020-04-26T13:19:04Z

Legibility is nice when the decorating can be quick and accurate. The larger the output, the less I care about decoration. (In other words, I should refine my query.) Best would be to decorate whole outputs if small (32k?), and for anything bigger, optimize ruthlessly for speed.

practicalli-johnny · 2020-04-28T07:19:15Z

I stopped using the REPL buffer directly a long time ago. The cider-inspect tools are far quicker and more effective for browsing data, especially for nested or large results.
I've recently been working with Clojure versions of large GeoJSON files and they worked flawlessly in cider-inspect.
The way the inspector works means I don't feel the need for highlighting to understand the result.

All the other evaluation is done in the source code buffers.

Also fixes #1971.

bbatsov · 2020-05-19T06:09:30Z

For those curious about the solution - I just opted to check if the result is a balanced expression and font-lock it only in this case. Almost always this would mean that we're doing with a single chunk result that we can safely font-lock.

This makes it possible to stream results in bigger chunks (and as a corollary - fontify as Clojure bigger results).

bbatsov · 2020-05-19T08:20:27Z

I've also introduced cider-print-buffer-size which is set to 4k by default. I'm wary of using bigger buffer sizes by default, but everyone can bump them if they want to. I'm wondering whether to allow pretty-printing without streaming, but I probably won't do it as I don't see many benefits of this.

bbatsov added the bug label Apr 24, 2019

bbatsov mentioned this issue Jun 26, 2019

Cider 0.21 miscolors some output at the REPL seemingly randomly, maybe fixed in 0.22 snapshot #2660

Closed

stale bot added the stale label Aug 9, 2019

bbatsov added the high priority Tickets of particular importance label Aug 9, 2019

stale bot removed the stale label Aug 9, 2019

bbatsov added the help wanted label Aug 9, 2019

harold mentioned this issue May 18, 2020

Eval'ing (keyword "\"|") gives "Search failed" error #1971

Closed

bbatsov closed this as completed in 4f080e4 May 18, 2020

bbatsov added a commit that referenced this issue May 18, 2020

[Fix #2628] Don't try to font-lock multi-chunk results in the REPL

e0c2a2e

Also fixes #1971.

bbatsov added a commit that referenced this issue May 19, 2020

[#2628] Add a defcustom controlling nREPL's print buffer size

81662e1

This makes it possible to stream results in bigger chunks (and as a corollary - fontify as Clojure bigger results).

Something changed with printing, now syntax highlighting is confused #2628

Something changed with printing, now syntax highlighting is confused #2628

Comments

harold commented Apr 23, 2019

Actual behavior

Steps to reproduce the problem

Environment & Version information

CIDER version information

Lein/Boot version

Emacs version

Operating system

bbatsov commented Apr 24, 2019

bbatsov commented Apr 24, 2019

harold commented Apr 24, 2019

dpsutton commented Apr 24, 2019

harold commented Apr 24, 2019

dpsutton commented Apr 24, 2019

harold commented Apr 24, 2019

Malabarba commented Apr 29, 2019

harold commented Apr 29, 2019

bbatsov commented May 11, 2019

stale bot commented Aug 9, 2019

harold commented Apr 24, 2020

bbatsov commented Apr 24, 2020

harold commented Apr 24, 2020

bbatsov commented Apr 25, 2020

bbatsov commented Apr 25, 2020

bbatsov commented Apr 25, 2020

harold commented Apr 25, 2020

harold commented Apr 25, 2020

WorldsEndless commented Apr 25, 2020

bbatsov commented Apr 25, 2020

harold commented Apr 25, 2020

dpsutton commented Apr 25, 2020

bbatsov commented Apr 25, 2020

dpsutton commented Apr 25, 2020

bbatsov commented Apr 25, 2020 • edited Loading

bsless commented Apr 25, 2020

harold commented Apr 26, 2020

dpsutton commented Apr 26, 2020 via email

bbatsov commented Apr 26, 2020

harold commented Apr 26, 2020

pbwolf commented Apr 26, 2020

practicalli-johnny commented Apr 28, 2020

bbatsov commented May 19, 2020

bbatsov commented May 19, 2020

bbatsov commented Apr 25, 2020 •

edited

Loading