
Feature request: NUL-delimited output #1271

Closed
charles-dyfis-net opened this issue Nov 6, 2016 · 44 comments · Fixed by #1990

Comments

@charles-dyfis-net

Right now, the standard-practice way to read an array from jq into a shell script is to use raw output and parse on newlines.

However, JSON strings can contain literal newlines; this makes such parsing error-prone.

NUL-delimited output, allowing IFS= read -r -d '' string to read exactly one C string unambiguously, would resolve this.
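As a sketch of the consumer side (assuming the requested output mode existed; printf '%s\0' stands in here for jq producing NUL-delimited raw strings):

```shell
#!/usr/bin/env bash
# Consuming a NUL-delimited stream: read -d '' stops at each NUL byte,
# so embedded newlines in the values survive intact.
# printf '%s\0' stands in for the proposed jq NUL-delimited output.
count=0
while IFS= read -r -d '' string; do
  count=$((count + 1))
  printf 'item %d: %q\n' "$count" "$string"
done < <(printf '%s\0' $'multi\nline value' 'plain value')
```

The loop sees exactly two items, with the embedded newline preserved in the first.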

@eric-brechemier
Contributor

@charles-dyfis-net is it not simpler in this case to keep newline escaping instead of using raw output? That keeps a single item per line, which is easier to loop over in a shell script:

input.json

[
  "LF\nLF",
  "TAB\tTAB",
  "FF\fFF"
]

Filter

.[]

Command Line

$ jq '.[]' input.json

Output

"LF\nLF"
"TAB\tTAB"
"FF\fFF"

Otherwise, you can actually add a character of your choice at the end of each line, directly from your jq filter:

Filter + NUL

.[]
| ( . + "\u0000")

Command Line + NUL

$ jq '.[] | ( . + "\u0000")' input.json

Output + NUL

"LF\nLF\u0000"
"TAB\tTAB\u0000"
"FF\fFF\u0000"

Command Line + NUL as Raw (View as Hex)

$ jq -r '.[] | ( . + "\u0000")' input.json | xxd

Output + NUL as Raw (Viewed as Hex)

0000000: 4c46 0a4c 4600 0a54 4142 0954 4142 000a  LF.LF..TAB.TAB..
0000010: 4646 0c46 4600 0a                        FF.FF..

@charles-dyfis-net
Author

Thank you -- I actually have a few StackOverflow answers I'm going to want to amend in light of the patterns suggested in this ticket.

That said, this still would be a desirable feature to have.

Newline escaping requires the consumer's code to perform unescaping -- and while printf '%b' is POSIX-defined, it's hardly a common idiom, and without extensions such as bash's printf -v, the command substitutions used to invoke it are themselves side-effecting, stripping trailing newlines. Moreover, a missing unescaping step is only visible/obvious in the error case, whereas reading a NUL-delimited stream as a line-delimited one (or the inverse) is an easily detected failure. Finally, whereas common tools (xargs -0, sort -z, etc.) can deal with NUL-delimited streams, very few correctly grok "newline-delimited text, but with the specific correct set of escape sequences".
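For illustration, the %b mechanics and the command-substitution side effect mentioned above (a sketch; printf -v is the bash extension in question):

```shell
#!/usr/bin/env bash
# Unescaping with POSIX printf %b, and the trailing-newline stripping
# that a command substitution performs as a side effect.
escaped='line1\nline2\n'
printf -v decoded '%b' "$escaped"     # bash extension: no subshell involved
via_subst=$(printf '%b' "$escaped")   # $(...) strips the trailing newline
printf 'decoded keeps %d chars, substitution keeps %d\n' \
  "${#decoded}" "${#via_subst}"
```

The printf -v form retains all 12 characters of the decoded text; the command-substitution form silently loses the trailing newline, keeping 11.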

The patterns given here are helpful: though \x00\x0a is a bit harder to process on the consumer side than just \x00 (for purposes of xargs -0 &c), it's certainly better than where we were without them.

@thedward

@charles-dyfis-net

If you use -j instead of -r then it won't output the newline (\u000a) characters.

@wtlangford
Contributor

JSON (at least RFC 7159 JSON) does not permit unescaped ASCII control
characters (U+0000 through U+001F), which include the newline/linefeed
character. jq neither accepts nor outputs JSON strings containing literal newlines.

I'm not sure how you've come across this as an issue. Can you show me a
use case for this?


@charles-dyfis-net
Author

charles-dyfis-net commented Nov 12, 2016

@wtlangford, gladly.

Consider the following contrived example:

#!/usr/bin/env bash
input_json='[{"value": "I am\na multiline\nvalue\twith a tab"}, {"value": "I am a second value"}]'
while IFS= read -r item; do
  printf 'Shell script interpreted item as: %q\n' "$item"
  printf '...as a literal: <<<%s>>>\n' "$item"
done < <(jq -r '.[] | .value' <<<"$input_json")

...where the intended output is something equivalent to the following (not all ksh-derivative shells implement printf %q in exactly the same way):

Shell script interpreted item as: $'I am\na multiline\nvalue\twith a tab'
...as a literal: <<<I am
a multiline
value   with a tab>>>
Shell script interpreted item as: I\ am\ a\ second\ value
...as a literal: <<<I am a second value>>>

Instead, as given above, the actual output is:

Shell script interpreted item as: I\ am
...as a literal: <<<I am>>>
Shell script interpreted item as: a\ multiline
...as a literal: <<<a multiline>>>
Shell script interpreted item as: $'value\twith a tab'
...as a literal: <<<value       with a tab>>>
Shell script interpreted item as: I\ am\ a\ second\ value
...as a literal: <<<I am a second value>>>

Now, to fix this, we can use NUL delimiters. That would modify our expression to be something like the following:

#!/usr/bin/env bash
input_json='[{"value": "I am\na multiline\nvalue\twith a tab"}, {"value": "I am a second value"}]'
while IFS= read -r -d '' item; do
  printf 'Shell script interpreted item as: %q\n' "$item"
  printf '...as a literal: <<<%s>>>\n' "$item"
done < <(jq -j '.[] | .value | (. + "\u0000")' <<<"$input_json")

...and it does in fact work exactly as desired. The only problem is that it requires the user to use some idioms that aren't completely obvious unless they read this ticket. :)

@wtlangford
Contributor

Ah, I see -- you're using the raw output mode. It does, as you've found,
output unescaped newline characters, since it outputs the values of the JSON
strings and not the strings themselves. :)

I see your use case now. I'm not strictly averse to adding a new flag, but
at the same time, we try not to add new flags to the binary. I'd
definitely like to see some form of this added to the wiki, though.


@eric-brechemier
Contributor

@charles-dyfis-net you could also keep the list of values encoded as JSON, then use jq again within the loop to decode each JSON value into a raw string:

#!/bin/sh
{
  jq '.[] | .value' << INPUT_JSON
[
  {"value": "I am\na multiline\nvalue\twith a tab"},
  {"value": "I am a second value"}
]
INPUT_JSON
} | {
  while read -r jsonString
  do
    printf 'JSON Value: <<<%s>>>\n' "$jsonString"
    printf 'Text Value: <<<%s>>>\n' "$( jq -r -n "$jsonString")"
  done
}
JSON Value: <<<"I am\na multiline\nvalue\twith a tab">>>
Text Value: <<<I am
a multiline
value   with a tab>>>
JSON Value: <<<"I am a second value">>>
Text Value: <<<I am a second value>>>

The conversion from JSON to text is done by jq -r -n "$jsonString": the JSON string is provided as the filter (with the -n flag, so no input is read), and it prints itself as a raw string thanks to the -r flag.

@charles-dyfis-net
Author

@eric-brechemier, noted, though that's considerably less efficient than a single jq run.

I think I'm entirely happy with @wtlangford's suggestion of treating this as a doc enhancement rather than a software enhancement -- now it's just a question of whether and when I have the time to assign this to myself and generate a wiki edit incorporating the many suggestions given here. :)

@eric-brechemier
Contributor

@wtlangford without adding a new flag, you could repurpose the -j flag to accept an optional argument:

-j # join with empty character
--join-output='\u0000' # join with NUL

@pkoppstein
Contributor

pkoppstein commented Nov 17, 2016

It seems to me that the matter of enhancing jq to support "joining with NUL" is of rather low priority, and certainly much lower than several other issues (notably the release of jq 1.6).

In any case, I suspect that most users who actually have the need to join with NUL can simply use the idiom:

   jq -c ..... | tr '\n' '\0'

That is, I suspect that most such users are working in an environment that has tr.

If using tr is not an option, then chances are that using the -c option in some other way, perhaps in conjunction with jq's support for @TSV and/or "\u0000", will suffice to solve the problem at hand.

Rather than expending the very limited resources available on supporting NUL-as-delimiter, I believe it would be far better to enhance support for the application/json-seq MIME type. Specifically, it should be easy to use jq to accept a JSON stream as input but produce json-seq as output (and vice versa), but currently the --seq option does not provide the flexibility to make this convenient.

(Note: To convert a stream of JSON texts to json-seq, one could use the form: jq -n --seq --slurpfile in <(STREAM) '$in[]' )

@charles-dyfis-net
Author

@pkoppstein, tr does not address the use case given in the sample code above, where there is a need to distinguish literal newlines from delimiter newlines. Conflating the two (by converting all newlines to delimiters) reintroduces the very ambiguity this feature -- by selecting a delimiter not allowed in JSON strings even in escaped form -- is intended to address.

@pkoppstein
Contributor

@charles-dyfis-net - My point is that one can use jq -c (without the -r option) to insert the NULs, and then later on in the processing convert to "raw output" if that is really needed.
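That two-pass idiom might look like the following sketch (the input and the .value path are borrowed from the earlier example; note the extra jq process per item):

```shell
#!/usr/bin/env bash
# Sketch of the two-pass idiom: jq -c emits one JSON value per line
# (newlines inside strings stay escaped), tr swaps the line delimiters
# for NULs, and a second jq invocation decodes each item to raw text.
input_json='[{"value": "I am\na multiline\nvalue"}, {"value": "second"}]'
jq -c '.[] | .value' <<<"$input_json" | tr '\n' '\0' |
while IFS= read -r -d '' json; do
  value=$(jq -r . <<<"$json")   # one extra jq process per item
  printf '<<<%s>>>\n' "$value"
done
```

The tr step is what makes the NUL-delimited read loop possible; the per-item jq run is the overhead discussed in this thread.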

@charles-dyfis-net
Author

charles-dyfis-net commented Nov 17, 2016

@pkoppstein, ...so what you have then is essentially the same proposal @eric-brechemier offered of using multiple passes, with the same performance overhead -- namely, the need to invoke a separate jq instance for each output item to convert it to its final raw form.

@pkoppstein
Contributor

@charles-dyfis-net - My comments were mainly directed to the question of whether joining with NUL is really needed, not to the example which you yourself described as contrived.

For non-contrived problems, I suspect your concerns about efficiency are probably misplaced. Consider, for example, pipelines of the form:

while read -r line ; do MUNGE <<< "$line" | jq WHATEVER ; done < <(jq -c HEAVYLIFTING)

In realistic scenarios, the additional cost associated with the inner invocations of jq will almost certainly be relatively small, perhaps even to the point of insignificance if reasonable care is taken with the details.

The real issue here is probably #147

@eric-brechemier
Contributor

Rather than expending the very limited resources available on supporting NUL-as-delimiter, I believe it would be far better to enhance support for the application/json-seq MIME type. Specifically, it should be easy to use jq to accept a JSON stream as input but produce json-seq as output (and vice versa), but currently the --seq option does not provide the flexibility to make this convenient.

@pkoppstein are you referring to this?

@pkoppstein
Contributor

@eric-brechemier - That does seem to be related.

@nicowilliams
Contributor

So, yeah, a -0 would actually be nice.

@pvdb

pvdb commented Feb 12, 2018

So, yeah, a -0 would actually be nice.

Yes please... pretty, pretty please!

@pabs3
Contributor

pabs3 commented Oct 14, 2019

I was thinking of working on this (it looks pretty simple), which option do people want?

  • -0 / --nul-output
  • -j/--join-output '\u0000'

Personally I think I would prefer the first one.

@pabs3
Contributor

pabs3 commented Oct 14, 2019

I ended up implementing the first option, but I'll be happy to change the PR to the other option if people prefer that.

@eric-brechemier
Contributor

Thanks! I suggested the second option to address the reluctance to introduce a new flag.
But using -0 directly would make the usage simpler.

@andrii-pukhalevych

When can we expect a release with this --nul-output support?

@pcworld

pcworld commented Sep 16, 2021

Note that JSON strings can also contain null bytes ("\u0000"), which could break the --nul-output feature and in some cases might be a security issue. I'm not sure there's a way to split such values properly that would be supported by POSIX shells.
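A sketch of the hazard: if a value itself contains a NUL byte and the producer appends NUL delimiters blindly, the consumer sees two items where there was one (printf stands in for the producer here):

```shell
#!/usr/bin/env bash
# One logical value, "evil<NUL>payload", followed by the delimiter NUL:
# the consumer cannot distinguish data bytes from the delimiter.
produce() { printf 'evil\0payload\0'; }
count=0
while IFS= read -r -d '' item; do
  count=$((count + 1))
done < <(produce)
echo "$count"   # prints 2, not 1
```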

@pabs3
Contributor

pabs3 commented Sep 18, 2021 via email

@pabs3
Contributor

pabs3 commented Sep 18, 2021 via email

pabs3 added a commit to pabs3/jq that referenced this issue Sep 18, 2021
pabs3 added a commit to pabs3/jq that referenced this issue Sep 21, 2021
@vdukhovni

vdukhovni commented Oct 17, 2021

There is not, and cannot be, a solution or workaround for failing to properly encode data into the syntax of the consuming application. The only reason that -0 works with xargs et al. is that filenames found in the filesystem are NUL-terminated and can't contain ASCII NUL characters.

The -r option works correctly for output of properly encoded strings in some non-JSON format. These could be just raw lines if the output is plain text, but if it is expected to have some structure (consist of elements that are not necessarily "lines") then the jq program needs to generate that structure. This is fundamental in all applications that serialise and deserialise data. So no new features to try to paper over the problem are warranted or desirable.

As to documentation, the news won't reach the audience that most needs it, they'll just cargo-cult some naïve code and suffer the consequences.

If some guidance to the perplexed is to be delivered, it should be quite clear, that this is fundamentally a correctness issue that is germane to all programming languages and essentially all data formats. Yes, there can be security consequences to getting this wrong, but even absent a security issue, the result is liable to be wrong in various corner or even common cases.

In terms of working with shell commands, the jq interpreter has an @sh serialiser that robustly quotes strings as potential literal arguments for shell commands:

$ jq -nr '["echo","foo\nbar\nbaz", "$HOME"]| @sh'
'echo' 'foo
bar
baz' '$HOME'

and thus assuming the arguments are validated as part of building the shell command, one can be sure that the command is executed as intended, without deserialisation errors:

$ jq -nr '["echo","foo\nbar\nbaz", "$HOME"]| @sh' | sh -
foo
bar
baz $HOME

If the output is an SQL query, then the serialisation needs to be escaped correctly for the intended SQL dialect (perhaps not a job for JQ, and so one might pass JSON into some other tool that has an SQL API and can quote SQL data).

So while I am not ultimately opposed to some mention of the issues in the docs, I don't think the currently pending PR is the right way to handle this.

@pabs3
Contributor

pabs3 commented Oct 17, 2021 via email

@vdukhovni

I wonder if the -0 option should just get removed.

That would be my recommendation. IIRC it has not been released yet, and if so, it should not be released.

Probably also the -r option should be deprecated or removed too, in favour of external programs checking and transforming the JSON output of jq into the needed formats.

No, sorry, that would be completely unacceptable. It makes @csv and @sh, for example, completely useless, along with the various contexts where unstructured text output is fine, or where the user's jq program constructed a robust serialisation.

Just because some users are sloppy CANNOT mean that jq is then made unusable for everyone else. The cargo cultists can shoot themselves in the foot in any language, and jq is by far one of the safer choices.

They can also print raw strings in Python, Perl, ... and I don't see any warnings in those languages about the dangers of text output.

@pabs3
Contributor

pabs3 commented Oct 17, 2021 via email

@vaab

vaab commented Feb 8, 2023

@vdukhovni The @sh output still requires the equivalent of an evaluation in bash, which is costly (and can be risky, and needs to be treated with care). Second, neither YAML nor JSON, nor a lot of other data, contains the NUL char (or is expected to contain it). Most shell code would gain substantial time in most cases if jq could directly output raw strings (the -r case) and separate values with the NUL char via a -0: the shell glue wouldn't need to exist, and you could pipe jq directly to other processes. If the output of a jq query happens to contain a 'rogue' NUL char itself, I would expect -0 to bail out with an explicit error, so that the calling code can warn the user that its expectations were broken, like any normal syntax error.

You are suggesting that all data needs some formatting, when here it actually does not: shell variables can hold any binary data that doesn't contain the NUL char, and pipes handle any binary data. As long as a program can ensure that it properly uses NUL and that the values it separates contain no NUL chars, you'll be orders of magnitude faster and safer than going through converters, formatters, and re-interpretation of data.

As I see it, jq is for the command line -- not for the shell, but for processes. The command line is systems programming: it is all about binary data and NUL-separated values. It is crude, but efficient. You are talking directly to xargs, find, grep, git, etc.

For these reasons, and for what it is worth, I'm not in favor of removing the -0 option. But I am clearly in favor of bailing out with an error when this option is used and one of the separated values contains a NUL char.

@vdukhovni

What is the compelling use-case for extracting a stream of multi-line strings from a JSON document to feed into a program that supports NUL-separated inputs?

For xargs, cpio, ... the compelling use-case is that they can consume the output of find ... -print0. Where does jq enter into this picture?

I don't want to give users a false sense of security. Any "raw" output form (be it -r or the proposed -0) carries risks of various injection-style attacks, and the user should not assume safety.

The suggestion to fail if an item for raw output already contains a NUL does provide some safety, at the cost of throwing errors that should have been handled in some manner before attempting to serialise the data in question as a NUL-separated (terminated) stream.

If that's to be done, then one might argue that the same should be available (another option?) with newline-separated output, but even protecting against separator injection is not generally sufficient, sometimes injection of unexpected spaces or unexpected ../ path components, ... are also problematic.

So if such a feature is to be provided, it should be more general:

$ jq --raw-terminator <codepoint> ...

This would support -0 as well as the current -r, but with a guaranteed absence of newlines in each output item.
In such a case, it should also be configurable whether to skip the problem item or to terminate.

All that said, I am not convinced there are compelling practical and then sufficiently safe use-cases for this sort of feature.

@pabs3
Contributor

pabs3 commented Jul 10, 2023

The idea is that you have some JSON data and want to safely pass parts of it to other programs via either stdin or command-line arguments. So you process the data with jq, output the data with a safe separator (usually NUL) and use xargs to convert stdin to command-line arguments. For extra safety you pass an option processing terminator before the arguments.

curl https://example.com/foo.json | jq -0 '.[].foo' | xargs -0 foo -- | ...
curl https://example.com/foo.json | jq -0 '.[].foo' | sort -z | ...

Agreed that injection attacks are always possible. The existing documentation for -r and -j should mention this problem. Protecting against them can't be the sole responsibility of jq though, since you never know what people are passing the output of it to and how badly they are handling it. Even if jq only passes JSON along instead of raw data, subsequent commands could mishandle that too. The documentation probably should have a section on jq and safety listing all the possible attacks.

The handling of the failure when encountering output separators in the output data could be done by jq withholding all output until all of the input is processed. Or you could leave it to subsequent commands to handle the error exit code (likely via shell set -o pipefail) and partial output.

Without having the -0 feature, people are going to continue to use the jq -j '.foo + "\u0000"' workaround when they want some semblance of safety, but still be subject to injection attacks. Of course, it is more likely they will just use -r, not think about newlines in the input, and still be subject to the same attacks. It is unlikely they would bother with the alternative of writing a script/program to process the JSON output of jq; if they were going to do that, they would never have used jq in the first place.
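That workaround, end to end, looks something like this sketch (the .foo path and file name are illustrative; printf stands in below so the sketch runs without input data):

```shell
#!/usr/bin/env bash
# The -j workaround: append a NUL after each raw value (written as
# "\u0000" in the jq filter) and let xargs -0 split on the delimiters.
# With real data this would be:
#   jq -j '.[] | .foo + "\u0000"' foo.json | xargs -0 printf '<%s>\n'
printf '%s\0' $'two\nlines' 'plain' | xargs -0 printf '<%s>\n'
```

xargs -0 receives exactly two arguments, the first still containing its embedded newline.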

You can see here the original context where I personally wanted to use -0: getting some date/version data from an API, safely comparing versions using dpkg, saving the data to files and doing git bisect on the results. Probably more of that could be done within jq than what I wrote, but it wouldn't be possible to do the dpkg version comparison without having the data leave JSON mode, go into raw mode, and then into dpkg commands. Other folks probably have other use-cases.

@nicowilliams
Contributor

The idea is that you have some JSON data and want to safely pass parts of it to other programs via either stdin or command-line arguments. So you process the data with jq, output the data with a safe separator (usually NUL) and use xargs to convert stdin to command-line arguments. For extra safety you pass an option processing terminator before the arguments.

curl https://example.com/foo.json | jq -0 '.[].foo' | xargs -0 foo -- | ...
curl https://example.com/foo.json | jq -0 '.[].foo' | sort -z | ...

Ah, but if you just use jq -c then you don't need -0, because jq -c will not output newlines within the JSON text, only after each JSON text; therefore it is safe to run curl https://example.com/foo.json | jq -c '.[].foo' | xargs foo -- | ....

@vdukhovni

vdukhovni commented Jul 10, 2023

Thanks for the examples. FWIW, instead of attempting to carefully serialise whatever happened to come in, I'd have restricted the values to a known safe subset:

$ curl -s https://snapshot.debian.org/mr/binary/perl/ |
  jq -r '.result[].binary_version | select(test("^[-.+:~0-9a-zA-Z]+$"))'

This is then safe to newline separate, and easier to work with. And I'd probably also take care with positional arguments that might look like short or long options, thus make sure to include a -- at the appropriate point in constructed command-lines:

sh -c '
    dpkg --compare-versions -- "$1" ge 5.24.1-3 &&
    dpkg --compare-versions -- "$1" le 5.28.1-6 &&
    printf "%s\0" "$1"'

Finally, it is still not clear to me whether the correct thing to do with unexpected values is to abort, or just to skip that value.
Safe serialisation of untrusted data sadly requires attention to detail; there's no silver bullet.
So even if there's sufficient user-community support for -0, it would have to come with sufficient disclaimers not to create a false sense of security. I'd still recommend being explicit about validation, and documenting some examples (perhaps in a project wiki linked from the manpage, if too intrusive for the main reference document).

@pabs3
Contributor

pabs3 commented Jul 10, 2023

The jq -c option isn't useful here because the commands being passed data don't support JSON and -c outputs JSON.

I've updated the wiki page to include your dpkg -- suggestion, thanks. I'm not sure how I feel about the select suggestion.

There could be an option for choosing to skip or abort on separator bytes in the output items, that could be made mandatory for -0/-r/-j to ensure that people think about injection possibilities and corresponding error handling.

The manual page is reasonably long as-is, so it feels OK to add a new section about safety in general, then the -0/-r/-j documentation could refer to the subsection of that about separator injection. The wiki page idea sounds good for extra examples too.

@nicowilliams
Contributor

The jq -c option isn't useful here because the commands being passed data don't support JSON and -c outputs JSON.

Ah, then do this:

a) use jq -j,
b) in your jq program check whether inputs have embedded delimiters and reject or map those,
c) output whatever outputs and a delimiter.

This is much more general than --nul-output. It does put the onus on you to make sure that your jq program does the right thing, and I think that's quite fair.

@pabs3
Contributor

pabs3 commented Jul 10, 2023

That is a lot more complicated for folks who know shell much better than jq.
A command-line option for -0 would make it much easier for them.
A subset of them will just YOLO it, use -r, and skip (b) and (c) anyway.
The proposed mandatory skip/abort option would make the -r folks safer.

@nicowilliams
Contributor

That is a lot more complicated for folks who know shell much better than jq. A command-line option for -0 would make it much easier for them. A subset of them will just YOLO it, use -r, and skip (b) and (c) anyway. The proposed mandatory skip/abort option would make the -r folks safer.

I'm thinking we'll make -0 mean NUL-delimited input, and we might keep (but deprecate?) --nul-output.

@pabs3
Contributor

pabs3 commented Jul 10, 2023

Hmm, I thought jq always required JSON input, not randomly formatted input. Changing the meaning of an option is a major backwards compatibility issue too, so please don't do that.

PS: my request to update the documentation to mention injection issues was already rejected in #2350, I can resubmit that if it is wanted.

@nicowilliams
Contributor

nicowilliams commented Jul 10, 2023

Hmm, I thought jq always required JSON input, not randomly formatted input.

There's -R which means "raw input".

Changing the meaning of an option is a major backwards compatibility issue too, so please don't do that.

-0 hasn't shipped in any version of jq.

PS: my request to update the documentation to mention injection issues was already rejected in #2350, I can resubmit that if it is wanted.

There's no need to re-submit it. I'll review #2350.

@nicowilliams
Contributor

See also #2659.

@svdb0

svdb0 commented Jul 10, 2023

Repeating what I said in #2659, if --nul-output is retained, I suggest renaming it --raw-output0, for the following reasons:

  • It is intuitive; the added 0 suggests 'as --raw-output but with null bytes'.
  • Some standard tools use a similar naming: find -print0, rsync --from0, xz --files0, du --files0-from, wc --files0-from.
  • --nul-output suggests a symmetry with --null-input (-n), but they are completely different.
  • If Feature request: support for streaming input delimited by null characters #2659 is accepted, you'll have matching --raw-input0 and --raw-output0.

There is another option though: you could have a format filter @null similar to @json, @sh, @csv, etc.
It is after all just another way to format your output.
Or (and?), more general: @delimited("\u0000").

I also like the idea suggested here, of having jq raise an error when the value to be output contains the terminator character.
In particular when it is treated as just another output format filter (@null/@delimited("\u0000")), because in this case it is just another instance of 'the value cannot be encoded in the output format', which could happen for other formats too (depending on the format, and even more so if jq were ever to support arbitrary byte sequences).
Because almost always, when you use some character(s) as a delimiter, you do not intend them to occur in the fields themselves.
And if you really do mean to do that, there's still (. + "\u0000"), but then it's a conscious choice. Better to have the default be the more secure option.

I agree with @vdukhovni's comments in #1271 that if you're about to output a value containing your separator, something else is probably wrong (e.g. missing input validation).
But the fact is that people will make mistakes, and many may not even be aware of the issues, and for such cases, raising an error will add a welcome extra layer of protection.

@vdukhovni

I have no objections to @nul as an output format, to be used in combination with -j. The -0 option can then, as Nico suggested, be used more naturally as a parallel to -R on input, to read raw nul-delimited strings.

FWIW, I use NUL for the ASCII code point and NULL for the pointer, but if that's considered obscure/esoteric by others, I can live with @null (which to me also suggests the JSON null, which is unrelated).

Finally, I am not sure whether this should throw an error, or just drop non-conforming inputs. I'd be inclined to silently drop them, and if someone wants errors, they can arrange for that with explicit checks, or we could have two versions:

- @nul
- @enul 

With @enul throwing an error. Unlike command line flags, adding new conversion forms seems cleaner to me.
We could even add @nl and @enl. And then have safer (but usual disclaimers about residual syntax issues apply) new-line separated output (again via -j).

@svdb0

svdb0 commented Jul 12, 2023

In #2660 we were discussing methods for handling errors — in that case in the input.
I think some of the same considerations hold here.
In particular, there are more options for what to do when an error is encountered than just silently dropping it or raising an error. See #2660 (comment).

I would personally prefer a more general way to specify what to do with encoding errors, rather than having multiple versions of each relevant output format.

I'm also not in favour of silently dropping non-conforming characters as a default, as I'm against possibly surprising behaviour.
I hold the opinion that the default should be secure, and lowering security should be a conscious decision.

Perhaps this could be the way to override the default error handling behaviour, inspired by #2660 (comment) :

@null({unencodable:"skip"})

Regarding 'NUL'/'null':
I want to avoid making this a bikeshedding exercise, and I don't have a strong opinion on this, but for the consideration of the reader, some collected information:

The ASCII standard (also RFC 20) uses 'NUL' as an acronym for what it calls the 'null character'.

ISO/IEC 6429:1992 does the same thing, and so do UNICODE, and POSIX, referring to the ISO standard.
UNICODE also has U+2400 as the symbol for NULL, represented graphically as 'NUL' diagonally (␀).

The C standard does not mention 'NUL' at all, and only talks about the 'null character'.

ECMAScript mentions the code unit 0x0000 (NULL) and U+0000 (NULL), but also says that \0 represents the <NUL> character.
The JSON standard does not refer to the character/byte/code point at all.

And for what it's worth, Wikipedia currently calls it the 'null character', 'often abbreviated as NUL (or NULL, though in some contexts that term is used for the null pointer)'.

So I'd say 'null' is the name of the character/byte/code point, but 'NUL' is a common abbreviation, which has the advantage of being unambiguous.
Which is better for the purpose of specifying an output format? I don't know.
