Deviations from the spec in the kdl4j test suite #121

larsgw · 2021-09-01T10:52:55Z

When I implemented the kdl4j test suite in kdljs I came across a few deviations from the spec. Since some of them are sensible I wanted to discuss them here.

Multiline comments in node-space

The spec currently does not allow the following, in either the grammar or the spec text:

node /* comment */ "arg"

Single-line comments at EOF

The spec currently requires a single-line comment to be ended by a newline, which makes the following invalid (without a trailing newline):

// hi
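To make the difference concrete, here is a minimal Python sketch of the two readings. The pattern names `STRICT` and `RELAXED` and the helper `is_single_line_comment` are made up for illustration and are not taken from the KDL grammar; the newline set is also simplified to CR, LF and CRLF.

```python
import re

# Hypothetical patterns for illustration only (simplified newline set).
# STRICT requires a newline after the comment body.
STRICT = re.compile(r"//[^\r\n]*(?:\r\n|\r|\n)")
# RELAXED also accepts end of input in place of the newline.
RELAXED = re.compile(r"//[^\r\n]*(?:\r\n|\r|\n|\Z)")


def is_single_line_comment(text: str, pattern: re.Pattern) -> bool:
    """Return True if the whole input is exactly one single-line comment."""
    return pattern.fullmatch(text) is not None


print(is_single_line_comment("// hi\n", STRICT))   # comment followed by a newline
print(is_single_line_comment("// hi", STRICT))     # comment at EOF, strict reading
print(is_single_line_comment("// hi", RELAXED))    # comment at EOF, relaxed reading
```

Under the strict reading only the first input is accepted; the relaxed reading accepts both.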

Esclines between `/-` and props/values/children blocks

The spec currently only allows for pure whitespace between /- and the thing it is commenting out. Therefore, the following is invalid (I would definitely agree with the spec here, I just added it for completeness):

node /-    \
    "arg"

Escline outside node-space

Esclines can currently only be in node-space, so the following is invalid (again, I agree):

node1
  \// hey
   node2

Large numbers

The question also arose of how to deal with large numbers. Apart from considering BigInt and BigDecimal, parsing large integers as Infinity seems relatively fine to me. However, a problem arises when Infinity has to be formatted. Should Infinity, -Infinity and NaN be allowed for full float compatibility?
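The formatting problem can be reproduced in a few lines. This is only a sketch using Python's built-in `float`, which is a 64-bit IEEE 754 double; `parse_number` and `format_number` are hypothetical helper names, not part of kdljs or kdl4j.

```python
import math


def parse_number(text: str) -> float:
    """Parse a decimal literal as a float64; out-of-range input overflows to infinity."""
    return float(text)


def format_number(value: float) -> str:
    """Format a float64 the way a naive writer might, via the language's default repr."""
    return repr(value)


big = parse_number("1e400")   # beyond the float64 maximum (about 1.8e308)
print(math.isinf(big))        # True
print(format_number(big))     # inf
```

The parser accepts the input without complaint, but the writer then emits `inf`, which is not a decimal number literal, so the document no longer round-trips unless such keywords are allowed.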

zkat · 2021-09-01T16:05:32Z

Multiline comments in node-space
Single-line comments at EOF

These all smell like spec bugs to me. What do you think? Should we fix this in spec, or consider them quirks?

Esclines between /- and props/values/children blocks

I'm on the fence about this one. I feel like we should be treating esclines as if they were single-line spaces? Maybe we should fix the spec here, even though this is a weird corner case.

Escline outside node-space

That doesn't look like a linespace to me. Which makes me realize that /*foo*/ should be valid because multi-line comments are allowed as part of linespace???

Large numbers

I've been thinking about this one a bit. What do you think of saying "All numbers that can be represented as 64-bit IEEE 754 floating point numbers MUST be losslessly accepted by KDL parsers. Larger (or "smaller", if negative) numbers MAY be supported by individual KDL implementations, but are not guaranteed to be portable."?

hkolbeck · 2021-09-01T18:16:39Z

Apologies for the misaligned tests, but I'm glad they're exposing some worthwhile corner cases. It's definitely worth some close eyes on the test suite, as it's fallen out of sync with the spec due to my not having time to keep kdl4j up to date. A few folks expressed interest in helping out there and I proposed updating the test suite as the first step, but as far as I can tell nothing has been pushed up so far.

On the large number question, I think it's worth discussing the full gap between a float64 and a BigDecimal, specifically including a mention of underflow.
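As an illustration of the underflow case, the sketch below compares Python's `float` (a 64-bit IEEE 754 double) with `decimal.Decimal`, used here as a stand-in for BigDecimal. The helper name `underflows` is hypothetical.

```python
from decimal import Decimal


def underflows(literal: str) -> bool:
    """True if a nonzero decimal literal collapses to zero when read as a float64."""
    return float(literal) == 0.0 and Decimal(literal) != 0


print(underflows("1e-400"))   # True: far below the smallest subnormal float64
print(underflows("5e-324"))   # False: rounds to the smallest subnormal, still nonzero
print(underflows("0"))        # False: the literal really is zero
```

So the gap is not only about very large magnitudes: a literal such as `1e-400` silently becomes `0.0` in a float64-only implementation, while an arbitrary-precision one keeps it.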

zkat · 2021-09-01T18:34:33Z

@hkolbeck no worries. The spec has changed in subtle ways since you wrote these tests, which is probably what caused this. This is to be expected, really.

As far as float64/BigDecimal goes, I'd love to hear more of your thoughts, taking into account what I suggested about large numbers in my post above?

hkolbeck · 2021-09-01T18:47:20Z

I'd phrase things something like "Implementations must accept, store, and roundtrip any value representable as a 64 bit IEEE754 floating point number exactly. Any number not representable exactly may be rounded to the nearest representable value or stored and roundtripped exactly at the author's discretion."

That's admittedly a bit clumsy still, and it might be worthwhile to follow with examples of values not representable exactly.
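A few such examples, as a rough sketch: the helper below (the name `is_exact_float64` is made up here) converts a decimal literal to a Python `float`, which is a 64-bit IEEE 754 double, and checks with `decimal.Decimal` whether the stored value equals the written one.

```python
from decimal import Decimal


def is_exact_float64(literal: str) -> bool:
    """True if the decimal literal converts to a float64 with no rounding at all."""
    value = float(literal)
    # Decimal(value) is the exact value stored in the double; compare it with the literal.
    return Decimal(value) == Decimal(literal)


for literal in ["0.5", "0.1", "9007199254740992", "9007199254740993", "1e400"]:
    print(literal, is_exact_float64(literal))
```

Here `0.5` and `9007199254740992` (2^53) are exact, while `0.1`, `9007199254740993` (2^53 + 1) and `1e400` are not: the first has no finite binary expansion, the second needs 54 bits of significand, and the third is out of range.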

hkolbeck · 2021-09-01T18:54:35Z

One wrinkle: I'm not sure how universal language handling of float rounding is, and it may be that specifying round-to-nearest, ties-to-even makes implementations in some languages difficult.
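For reference, this is what round-to-nearest, ties-to-even looks like in practice. The sketch relies on CPython's string-to-float conversion, which rounds correctly; other languages or libraries may behave differently, which is exactly the concern. `round_trip_int` is a hypothetical helper.

```python
def round_trip_int(literal: str) -> int:
    """Read an integer literal as a float64 and return the integer it was rounded to."""
    return int(float(literal))


# 2**53 + 1 lies exactly halfway between 2**53 and 2**53 + 2; the tie goes to the
# neighbour with an even significand, which is 2**53.
print(round_trip_int("9007199254740993") == 2**53)
# 2**53 + 3 lies halfway between 2**53 + 2 and 2**53 + 4; the even neighbour is 2**53 + 4.
print(round_trip_int("9007199254740995") == 2**53 + 4)
```

One tie rounds down and the other rounds up, so an implementation that always truncates or always rounds half up would disagree on one of the two inputs.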

zkat · 2021-09-01T19:11:30Z

I'd rather not require roundtripping, because a lot of KDL implementations won't necessarily have writers, but I think requiring that they could roundtrip if they had a writer is a good direction.

I'm also kinda unconcerned with "larger numbers" and I think it's 100% fine to say "here be dragons"? Is that bad?

hkolbeck · 2021-09-01T19:14:21Z

Good point about not all implementations having writers, though I'd lean toward making that a requirement for those that do. I've definitely been burned by wonky float handling in libraries before, so I'm pretty strongly inclined to be as explicit as can reasonably be done in a few sentences.

zkat · 2021-09-01T19:53:57Z

I've started a PR so we can wordsmith: #122

zkat · 2021-09-02T22:23:32Z

Ok looking into this more:

node /* comment */ "arg"

This is legal, per the following spec grammar items:

node-space := ws* escline ws* | ws+
...
ws := bom | unicode-space | multi-line-comment
...
multi-line-comment := '/*' (commented-block | multi-line-comment) '*/'

So, that seems fine to me.
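A rough Python sketch of how a tokenizer might apply those rules. It is heavily simplified and makes several assumptions: only spaces and tabs count as unicode-space, esclines and the BOM are ignored, and quoted strings containing spaces or `/*` are not handled. All function names are hypothetical.

```python
def skip_multiline_comment(text: str, pos: int) -> int:
    """Return the index just past a (possibly nested) /* ... */ that starts at pos."""
    depth, i = 0, pos
    while i < len(text):
        if text.startswith("/*", i):
            depth += 1
            i += 2
        elif text.startswith("*/", i):
            depth -= 1
            i += 2
            if depth == 0:
                return i
        else:
            i += 1
    raise ValueError("unterminated multi-line comment")


def skip_ws(text: str, pos: int) -> int:
    """Skip a run of ws: spaces, tabs and multi-line comments (simplified)."""
    while pos < len(text):
        if text[pos] in " \t":
            pos += 1
        elif text.startswith("/*", pos):
            pos = skip_multiline_comment(text, pos)
        else:
            break
    return pos


def tokens(line: str) -> list:
    """Split one node line into tokens separated by ws+."""
    out, pos = [], skip_ws(line, 0)
    while pos < len(line):
        start = pos
        # A token runs until the next space, tab or comment opener.
        while pos < len(line) and line[pos] not in " \t" and not line.startswith("/*", pos):
            pos += 1
        out.append(line[start:pos])
        pos = skip_ws(line, pos)
    return out


print(tokens('node /* comment */ "arg"'))   # ['node', '"arg"']
```

Because the comment is consumed as part of the whitespace run, the node name and its argument come out as two adjacent tokens, the same as if they were separated by a plain space.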

zkat · 2021-09-02T22:25:31Z

I've fixed the single-line-comment thing in #126

zkat · 2021-09-02T22:32:38Z

Alright, with #127 I think I've addressed everything here.

As per #122, we're going to continue not specifying any representation for numbers, so they're not actually guaranteed to round-trip. That limits the usability of the test suite, but folks can still use the input part to make sure everything is parsing correctly, and just generate their own output relevant to their own parsers/writers.

So, I'm gonna close this :)

larsgw · 2021-09-02T22:42:15Z

This is legal, per the following spec grammar items:

Oh I definitely missed that, my bad. (I thought it was weird that it wasn't allowed, I guess I should have looked better)

zkat added the bug label Sep 1, 2021

zkat added a commit that referenced this issue Sep 1, 2021

add a note about representations

c11ac24

Ref: #121

zkat mentioned this issue Sep 1, 2021

add a note about representations #122

Closed

zkat mentioned this issue Sep 2, 2021

Roadmap for KDL 1.0 #112

Closed

12 tasks

zkat added a commit that referenced this issue Sep 2, 2021

allow eof termination for single line comments

bbe7bd9

Ref: #121

zkat mentioned this issue Sep 2, 2021

allow eof termination for single line comments #126

Merged

zkat added a commit that referenced this issue Sep 2, 2021

allow /- to cross linespaces

ab7c9f3

Ref: #121

zkat mentioned this issue Sep 2, 2021

allow /- to cross linespaces #127

Merged

zkat added a commit that referenced this issue Sep 2, 2021

allow /- to cross linespaces (#127)

cbb500a

Ref: #121

zkat closed this as completed Sep 2, 2021

larsgw mentioned this issue Oct 6, 2021

I *think* escline_comment_node.kdl test is wrong (or the grammar is) #223

Closed

alightgoesout mentioned this issue Feb 11, 2024

Line continuations should not be allowed between nodes kdl-org/kdl4j#10

Open

yerke mentioned this issue Feb 12, 2024

Add support for NaN, +Infinity, and -Infinity #374

Closed
