Add tests for incorrect filter usage #56

Marcono1234 · 2024-02-25T16:04:18Z

This mainly covers using filter on primitives, which should have no result, see RFC 9535 section 2.3.5.2:

The filter selector works with arrays and objects exclusively. [...] Applied to a primitive value, it selects nothing
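The quoted rule can be sketched in a few lines of Python. This is only an illustration of the behavior the tests expect, not any library's implementation; `apply_filter` and `is_one` are hypothetical names introduced here.

```python
from typing import Any, Callable, List


def apply_filter(value: Any, predicate: Callable[[Any], bool]) -> List[Any]:
    """Apply a filter selector to a single input value (hypothetical helper).

    Arrays are tested element by element, objects member value by member
    value. Any other value is a primitive and selects nothing.
    """
    if isinstance(value, list):
        children = value
    elif isinstance(value, dict):
        children = list(value.values())
    else:
        # Number, string, boolean or null: nothing to iterate over.
        return []
    return [child for child in children if predicate(child)]


def is_one(current: Any) -> bool:
    """Stand-in for the filter expression `@ == 1` (excludes Python's True)."""
    return type(current) is int and current == 1


# $.a[?@ == 1] against {"a": 1}: the name selector yields 1, a primitive,
# so the filter applied to it selects nothing.
assert apply_filter({"a": 1}["a"], is_one) == []

# $[?@ == 1] against an array or an object tests the children instead.
assert apply_filter([1, 2, 1], is_one) == [1, 1]
assert apply_filter({"a": 1, "b": 2}, is_one) == [1]
```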

(This was inspired by hiltontj/serde_json_path#49; CC @hiltontj)

Please let me know if any of the tests are incorrect, or if I should adjust them or split this into separate pull requests. Any feedback is appreciated!

Using filter on primitives should have no result, see RFC 9535 section 2.3.5.2

hiltontj

Thank you @Marcono1234 for opening the issue and for CC'ing me. I have left a couple of comments with suggestions, but feel that approval should come from one of the other contributors who has worked on this test suite more extensively than I have.

One general comment that I wanted to make was this: I had thought of adding a test case back when hiltontj/serde_json_path#49 was resolved, but felt the problem in that case was more due to a logical error in serde_json_path's code, as opposed to some violation of the standard.

So, that at least was the excuse I made for not taking the time to do what you have done here 🙂. Having these test cases will help prevent other future implementors from making the same logical errors I did, so I would give a 👍.

cts.json

Marcono1234 · 2024-02-25T17:49:29Z

Thanks for the feedback! I hope I addressed everything as you had it in mind. As part of that I removed the "@ refers to ..." from the names; hopefully it is still clear why the queries should have no result or don't work as one might expect.

To the maintainers: Please feel free to squash the commits on merge, or let me know if I should squash them for a cleaner Git history.

gregsdennis

There's some duplication of existing tests in here. I'd like to be sure we have good reason to add them and that the test cases are minimal.

Also, I'd like @glyn's opinion.

FWIW, these all pass in my implementation.

gregsdennis · 2024-02-25T19:56:48Z

tests/filter.json

+    {
+      "name": "on primitive, selects nothing",
+      "selector": "$.a[?@ == 1]",
+      "document": {"a": 1},
+      "result": []
+    },


This could be improved since the .a isn't necessary.

Suggested change

-    {
-      "name": "on primitive, selects nothing",
-      "selector": "$.a[?@ == 1]",
-      "document": {"a": 1},
-      "result": []
-    },
+    {
+      "name": "on primitive, selects nothing",
+      "selector": "$[?@ == 1]",
+      "document": 1,
+      "result": []
+    },

Also, I don't think having a filter here improves the test. It could be any selector.

Suggested change

-    {
-      "name": "on primitive, selects nothing",
-      "selector": "$.a[?@ == 1]",
-      "document": {"a": 1},
-      "result": []
-    },
+    {
+      "name": "on primitive, selects nothing",
+      "selector": "$['a']",
+      "document": 1,
+      "result": []
+    },

would do just as well. If we use this, then we need to have a series of these, they should be put in basic.json, and we should test all of the JSON types. (We already have one testing for an array index on an object and a couple, single- and double-quote versions, testing for a string index on an array.)
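A series like that could be generated rather than written by hand. The following is a rough sketch only: the test names and the choice of sample values are assumptions made for illustration, while the four fields mirror the test entries shown in this pull request.

```python
import json

# One sample document per primitive JSON type (labels are hypothetical).
PRIMITIVE_SAMPLES = {
    "number": 1,
    "string": "a",
    "true": True,
    "false": False,
    "null": None,
}


def make_cases(selector: str = "$['a']") -> list:
    """Build one test entry per primitive type; each must select nothing."""
    return [
        {
            "name": f"name selector on {label}, selects nothing",
            "selector": selector,
            "document": document,
            "result": [],
        }
        for label, document in PRIMITIVE_SAMPLES.items()
    ]


cases = make_cases()
print(json.dumps(cases, indent=2))
```

Every generated entry has a primitive as its top-level document, which is the kind of test proposed separately in #57.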

This could be improved since the .a isn't necessary.

It seems that would be the first test then with primitive top-level value. I had created #57 to propose in general adding tests for that.

Also, I don't think having a filter here improves the test. It could be any selector.

But those are (at least in the specification) two separate cases which independently say that only arrays and objects are supported, so maybe it would be good to have separate tests?

(I thought we had those tests for primitives but we only have tests for e.g. "index selector against object".)

gregsdennis · 2024-02-25T19:59:04Z

tests/functions/length.json

+    {
+      "name": "filter on primitive array element, selects nothing",
+      "selector": "$[?length(@)==1]",
+      "document": [1],
+      "result": []
+    },


We're already effectively testing this with "number arg" on line 32.

Same as with #56 (comment); the intention was to make sure no library misinterprets @ as representing the JSON array (instead of its elements).

gregsdennis · 2024-02-25T20:00:10Z

tests/functions/length.json

+      "result": []
+    },
+    {
+      "name": "filter on primitive object member value, selects nothing",


This is effectively the same test as before: checking to see what length() does with a number input.

You are right; I mainly added this to make sure no implementation misinterprets @ as being the whole JSON object (instead of just the member value 1).

I think that can be tested without the length() function.

Could you provide an example for such a test please?

My intention was that if a library misinterprets @ as being the array, then length(@) == 1 passes and it erroneously selects the array as result. Arguably this is not so much about the length function.

Maybe using @==@ could work here though (and this could be in filter.json then)? The expected result is [1], but a library which erroneously considers @ as referring to the JSON array would have [[]] or [[], 1] or similar as result?

$[?@ == 1] against either an object or an array works just fine to test this. (And, yes, this should be in filter.json.)

But that would not detect @ being misinterpreted as referring also to the array (instead of or in addition to just its items). It would in all cases only have the 1 as result.

It would in all cases only have the 1 as result.

Yes, exactly, and in doing so, it correctly tests that the implementation is interpreting @ not as the container but as the value within it. (Which it seems is the result you're after.)

Once it's known that the implementation interprets @ as the value, and assuming it passes the rest of the test suite, it's safe to assume that it will interpret @.a correctly.
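To make the two positions in this thread concrete, here is a small sketch comparing a correct evaluator with one deliberately broken variant that binds @ to the container. Both evaluators and all names are hypothetical; the sketch covers only the "instead of" misreading, not the "in addition to" one.

```python
NOTHING = object()  # stand-in for the RFC's special result Nothing


def length(value):
    """length() yields a count for strings, arrays and objects, else Nothing."""
    if isinstance(value, (str, list, dict)):
        return len(value)
    return NOTHING


def children_of(container):
    return container if isinstance(container, list) else list(container.values())


def filter_correct(container, test):
    # @ is bound to each child of the container in turn.
    return [child for child in children_of(container) if test(child)]


def filter_buggy(container, test):
    # Deliberately wrong: @ is bound to the container itself.
    return [container] if test(container) else []


def length_is_one(cur):      # $[?length(@)==1]
    return length(cur) == 1


def equals_one(cur):         # $[?@ == 1]
    return type(cur) is int and cur == 1


# $[?length(@)==1] against [1]: the correct evaluator selects nothing,
# the broken one selects the array.
assert filter_correct([1], length_is_one) == []
assert filter_buggy([1], length_is_one) == [[1]]

# $[?@ == 1] against [1] also tells the two apart.
assert filter_correct([1], equals_one) == [1]
assert filter_buggy([1], equals_one) == []
```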

gregsdennis · 2024-02-25T20:02:30Z

tests/filter.json

+      "name": "on primitive object member value, selects nothing",
+      "selector": "$[?@.a == 1]",


This and the following three tests seem to be checking that a relative path (@<...>) can have the same path syntax as a global path ($<...>). If that's something we're testing, that's fine, but I'd argue that we'd need to almost replicate the entire suite for relative paths.

This one was based on hiltontj/serde_json_path#49 where the reporter assumed @ refers to the JSON object instead of the member value.

And the tests "name selector on array" and "index selector on object" are supposed to cover the same issue in serde_json_path where the underlying bug was that an unexpected type (which should select nothing) erroneously always passed the filter. I assume such a bug pattern could affect other libraries as well.

where the reporter assumed @ refers to the JSON object instead of the member value.

Two things here:

We shouldn't be interested in testing what an end user expects, even (especially) if that expectation is wrong. We should be testing implementation behavior per the spec. If the implementation is doing it right and the user expected something different, then the solution is to correct the user. We don't need a test for it.

If we are going to test this, then we should test for interpreting @ directly; we don't need the .a. "$[?@ == 1]" would be a much more targeted test.

We shouldn't be interested in testing what an end user expects, even (especially) if that expectation is wrong. We should be testing implementation behavior per the spec.

Fair point. But I think here if the document is {"a": 1}, then it is necessary to have .a. If you used $[?@ == 1], then [1] is the correct result.

But if you add .a (i.e. ?@.a == 1), then you should get no result. However, libraries might get this wrong if they silently ignore the .a because it cannot be applied to a non-object. And with hiltontj/serde_json_path#49 there is a precedent for that exact issue.
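A minimal sketch of that failure mode, assuming a hypothetical evaluator that skips a name segment it cannot apply. It is not a reconstruction of the serde_json_path bug, only an illustration of why the `.a` changes the expected result.

```python
NOTHING = object()  # stand-in for "the relative query selected no node"


def child_by_name(value, name):
    """Correct: a name segment applied to a non-object selects nothing."""
    if isinstance(value, dict) and name in value:
        return value[name]
    return NOTHING


def child_by_name_buggy(value, name):
    """Deliberately wrong: silently ignores a segment it cannot apply."""
    if isinstance(value, dict) and name in value:
        return value[name]
    return value


def select(document, lookup):
    """Evaluate $[?@.a == 1] against an object, using the given lookup."""
    return [v for v in document.values() if lookup(v, "a") == 1]


document = {"a": 1}

# @ is the member value 1, so @.a selects nothing and the filter fails.
assert select(document, child_by_name) == []

# The broken lookup falls back to @ itself and wrongly selects 1.
assert select(document, child_by_name_buggy) == [1]
```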

I view this differently. The test should be that @ represents the 1. This is the simple test that verifies the implementation has the correct behavior.

Separately, we test that .a doesn't apply to a non-object (which is your primitive tests).

That .a can't be applied to a non-object in the context of @ is merely a consequence of the correct behavior and doesn't specifically need a test.

singular-querys, i.e., relative paths with @, are defined separately in the standard ABNF. So, this leads implementations to have separate parsing, AST representations, evaluation mechanisms, etc. - as was the case for serde_json_path.

Therefore, I think there is value in having tests that ensure the behaviour of queries against primitives for both absolute and relative paths. Relative paths can only appear in filters, so having separate tests for those requires queries like @Marcono1234 has used here.

I suppose that the motivation for this test case is that one implementor (me) made a logic error in the handling of singular queries, so, having this test would prevent another from doing the same. It is not possible to come up with tests for every conceivable logic error, or misinterpretation of JSONPath, but I don't see the harm in having a test that guards against one that did happen (and the .a was relevant in that case).

singular-querys, i.e., relative paths with @, are defined separately in the standard ABNF.

While true, @-paths can also be existence tests, which can handle all selector constructs. I believe (on mobile now; can verify later) that we have some tests that ensure only singular paths are used in comparisons, though. If not, we should add them in another PR.

I suppose that the motivation for this test case is that one implementor (me) made a logic error in the handling of singular queries, so, having this test would prevent another from doing the same.

Okay. That this was an implementation error wasn't understood. I thought this was a user that expected something incorrect.

However, I still maintain that this test is overly complex and can be broken down into smaller constituent pieces that still achieve the goal of verifying proper operation.

Sorry for taking so long to get back...

While true, @-paths can also be existence tests

This escaped me, thanks for pointing that out. I was incorrect there. Relative path != singular query. The singular query defined in the ABNF is only relevant to comparisons. Relative paths in existence tests or as function arguments can be non-singular.
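The distinction can be sketched as a small check. The tuple representation of segments below is an assumption made for illustration (one selector per segment), and only the two contexts discussed here are modeled.

```python
def is_singular(segments) -> bool:
    """A query is singular if every segment is a child segment holding a
    single name or index selector. A bare `@` (no segments) is singular."""
    return all(
        segment_kind == "child" and selector_kind in ("name", "index")
        for segment_kind, selector_kind in segments
    )


def allowed_in(context: str, segments) -> bool:
    """Comparisons need singular queries; existence tests take any query."""
    if context == "comparison":
        return is_singular(segments)
    if context == "existence":
        return True
    raise ValueError(f"unknown context: {context}")


at_a = [("child", "name")]                 # @.a
at_wild = [("child", "wildcard")]          # @.*
at_desc_a = [("descendant", "name")]       # @..a

assert allowed_in("comparison", at_a)          # $[?@.a == 1]
assert not allowed_in("comparison", at_wild)   # $[?@.* == 1] is not valid
assert allowed_in("existence", at_wild)        # $[?@.*]
assert allowed_in("existence", at_desc_a)      # $[?@..a]
```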

glyn

LGTM with some suggestions. I can't comment on whether tests are duplicates as I'm not keeping track of all the tests.

glyn · 2024-02-26T08:46:06Z

cts.json

@@ -4096,6 +4153,22 @@
      ],
      "result": []
    },
+    {
+      "name": "functions, length, filter on primitive array element, selects nothing",
+      "selector": "$[?length(@)==1]",


This test could be made broader, thus:

Suggested change

-      "selector": "$[?length(@)==1]",
+      "selector": "$[?length(@)>=0]",

Also, it would be good to add a test for the comparison of two Nothing values if we don't already, e.g.:

$[?length(@)==length(@)]

should select all the elements of an array argument.

https://github.com/jsonpath-standard/jsonpath-compliance-test-suite/blob/main/tests%2Ffilter.json

We have Nothing equality.
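The two selectors discussed here behave as follows under the comparison rules for Nothing. This is a minimal sketch restricted to the numeric results of length(); the helper names are hypothetical.

```python
NOTHING = object()  # stand-in for the RFC's special result Nothing


def length(value):
    """Count for strings, arrays and objects; Nothing for anything else."""
    if isinstance(value, (str, list, dict)):
        return len(value)
    return NOTHING


def compare_eq(a, b):
    # == with Nothing on one side is true only if the other side is
    # Nothing as well.
    if a is NOTHING or b is NOTHING:
        return a is NOTHING and b is NOTHING
    return a == b


def compare_lt(a, b):
    # < with Nothing on either side is false.
    if a is NOTHING or b is NOTHING:
        return False
    return a < b


def compare_ge(a, b):
    # a >= b holds if b < a or a == b.
    return compare_lt(b, a) or compare_eq(a, b)


document = [1, "ab", [], {}]

# $[?length(@)>=0]: the number has no length, so it is not selected.
assert [v for v in document if compare_ge(length(v), 0)] == ["ab", [], {}]

# $[?length(@)==length(@)]: Nothing == Nothing is true, so every element
# is selected, including the number.
assert [v for v in document if compare_eq(length(v), length(v))] == document
```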

@glyn I think these changes are moot given the conversation I've raised. What these tests are trying to verify can be accomplished with several more targeted tests.

glyn · 2024-02-26T08:47:31Z

cts.json

+    },
+    {
+      "name": "functions, length, filter on primitive object member value, selects nothing",
+      "selector": "$[?length(@)==1]",


Suggested change

-      "selector": "$[?length(@)==1]",
+      "selector": "$[?length(@)>=0]",

Marcono1234 · 2024-02-26T18:04:03Z

Thanks for all your reviews! Sorry that I did not approach this in a very structured way, and also mixed multiple test cases in this pull request.

@gregsdennis, if you prefer, feel free to add tests in a more organized and simplified way; I will then close this pull request. Otherwise, should I only address @glyn's comments, or try to organize and simplify the tests more as you suggested, @gregsdennis?

gregsdennis · 2024-02-29T22:21:01Z

I think @glyn and I (at a minimum) should agree on the approach for this. I'm still of the mind that these tests are too complex and the desired behavior can be covered with a few more targeted test cases.

glyn · 2024-03-01T02:44:59Z

I think @glyn and I (at a minimum) should agree on the approach for this. I'm still of the mind that these tests are too complex and the desired behavior can be covered with a few more targeted test cases.

@Marcono1234 May I suggest that you close this PR (or convert it into a draft PR) and then submit some small PRs, each targeting one case? I hope that won't be much more work and it would mean that we can quickly merge the non-contentious PRs. (If you have a better suggestion, I'm all ears!)

Marcono1234 · 2024-03-23T12:11:00Z

I have split this now into #69 and #70, and omitted some of the potentially redundant tests. I hope that is ok and makes it easier to review the changes.

Add tests for incorrect filter usage

40c8e74

Using filter on primitives should have no result, see RFC 9535 section 2.3.5.2

hiltontj reviewed Feb 25, 2024

Address review feedback

a9fcedf

gregsdennis requested changes Feb 25, 2024

gregsdennis requested a review from glyn February 25, 2024 20:05

glyn approved these changes Feb 26, 2024

Marcono1234 marked this pull request as draft March 2, 2024 16:28

This was referenced Mar 23, 2024

Add test for existence test without segments #69

Merged

Add filter tests for segment on wrong type #70

Merged

Marcono1234 closed this Mar 23, 2024

Marcono1234 deleted the selectors-on-primitive branch March 23, 2024 12:11
