jq 允许你直接在命令行下对 JSON 进行操作,包括分片、过滤、转换等等。

让我们通过几个例子来说明 jq 的功能:

一、输出格式化,漂亮的打印效果如果我们用文本编辑器打开 JSON,有时候可能看起来会一团糟,但是通过 jq 的 .(点)过滤器就可以立马让 JSON 的格式规整起来。

1.用文本编辑器打开后的样子


2.用 jq 显示的结果


复制代码 代码如下:


% jq . soundtag.json



二、快速查询JSON数据

利用 jq 能够以 key 作为关键字来对 JSON 作出快速查询, 例如:


复制代码 代码如下:


% jq .cn soundtag.json


这将仅仅显示 cn 键对应的值。

jq 的键查询也支持链式调用,如:


复制代码 代码如下:


% jq .cn[0].pNum soundtag.json


三、jq的管道操作

熟悉命令行的朋友可能都知道 | (管道)是一个非常强大的 武器。幸运的是,jq 也提供了对管道的支持。


复制代码 代码如下:


% jq '.cn[] | { pNum }' soundTag.json


在这里,我们使用管道过滤并构造出 pNum 对象。

四、总结

如果你需要在命令行下处理 JSON,我强烈推荐 jq。jq 不仅 能够满足一般性的常见需求,更包含运算、内置函数、条件比较、 变量声明、自定函数等强大功能。对此感兴趣的朋友,不妨通过 jq 的官方手册 进行学习。

 

五、jq搜索对应项的路径:

​https://stackoverflow.com/questions/34165201/how-to-get-the-index-path-of-found-values-using-jq​



$ cat 2.json |jq '[paths as $path | select(getpath($path) == "美容护肤/美体/精油") | $path]'
[
[
"mods",
"nav",
"data",
"common",
0,
"text"
]
]


测试一下:



cat 2.json |jq .mods.nav.data.common[0]|grep "美容护肤/美体/精油"


 

 

 

 

jq处理JSON数据



2、高亮并按属性排序显示

用 jq '.'即可:

curl 'https://api.github.com/repos/stedolan/jq/commits?per_page=5' | jq '.'


结果:

[
{
"parents": [
{
"html_url": "https://github.com/stedolan/jq/commit/54b9c9bdb225af5d886466d72f47eafc51acb4f7",
"url": "https://api.github.com/repos/stedolan/jq/commits/54b9c9bdb225af5d886466d72f47eafc51acb4f7",
"sha": "54b9c9bdb225af5d886466d72f47eafc51acb4f7"
},
{
"html_url": "https://github.com/stedolan/jq/commit/8b1b503609c161fea4b003a7179b3fbb2dd4345a",
"url": "https://api.github.com/repos/stedolan/jq/commits/8b1b503609c161fea4b003a7179b3fbb2dd4345a",
"sha": "8b1b503609c161fea4b003a7179b3fbb2dd4345a"
}
],
"committer": {
"type": "User",
"received_events_url": "https://api.github.com/users/stedolan/received_events",
"events_url": "https://api.github.com/users/stedolan/events{/privacy}",
"repos_url": "https://api.github.com/users/stedolan/repos",
"organizations_url": "https://api.github.com/users/stedolan/orgs",
...


这里的评论内容比较多,我们现在想拿第一个评论。

3、获取数组某一项

curl 'https://api.github.com/repos/stedolan/jq/commits?per_page=5' | jq '.[0]'


结果:


jq处理JSON数据, jq Manual (development version)_html

{
"parents": [
{
"html_url": "https://github.com/stedolan/jq/commit/54b9c9bdb225af5d886466d72f47eafc51acb4f7",
"url": "https://api.github.com/repos/stedolan/jq/commits/54b9c9bdb225af5d886466d72f47eafc51acb4f7",
"sha": "54b9c9bdb225af5d886466d72f47eafc51acb4f7"
},
{
"html_url": "https://github.com/stedolan/jq/commit/8b1b503609c161fea4b003a7179b3fbb2dd4345a",
"url": "https://api.github.com/repos/stedolan/jq/commits/8b1b503609c161fea4b003a7179b3fbb2dd4345a",
"sha": "8b1b503609c161fea4b003a7179b3fbb2dd4345a"
}
],
"committer": {
"type": "User",
"received_events_url": "https://api.github.com/users/stedolan/received_events",
"events_url": "https://api.github.com/users/stedolan/events{/privacy}",
"repos_url": "https://api.github.com/users/stedolan/repos",
"organizations_url": "https://api.github.com/users/stedolan/orgs",
"subscriptions_url": "https://api.github.com/users/stedolan/subscriptions",
"starred_url": "https://api.github.com/users/stedolan/starred{/owner}{/repo}",
"gists_url": "https://api.github.com/users/stedolan/gists{/gist_id}",
"login": "stedolan",
"id": 79765,
"avatar_url": "https://1.gravatar.com/avatar/31de909d8e55dd07ed782d92ece59842?d=https%3A%2F%2Fidenticons.github.com%2Ffc5b6765b1c9cfaecea48ae71df4d279.png",
"gravatar_id": "31de909d8e55dd07ed782d92ece59842",
"url": "https://api.github.com/users/stedolan",
"html_url": "https://github.com/stedolan",
"followers_url": "https://api.github.com/users/stedolan/followers",
"following_url": "https://api.github.com/users/stedolan/following{/other_user}"
},
"author": {
"type": "User",
"received_events_url": "https://api.github.com/users/stedolan/received_events",
"events_url": "https://api.github.com/users/stedolan/events{/privacy}",
"repos_url": "https://api.github.com/users/stedolan/repos",
"organizations_url": "https://api.github.com/users/stedolan/orgs",
"subscriptions_url": "https://api.github.com/users/stedolan/subscriptions",
"starred_url": "https://api.github.com/users/stedolan/starred{/owner}{/repo}",
"gists_url": "https://api.github.com/users/stedolan/gists{/gist_id}",
"login": "stedolan",
"id": 79765,
"avatar_url": "https://1.gravatar.com/avatar/31de909d8e55dd07ed782d92ece59842?d=https%3A%2F%2Fidenticons.github.com%2Ffc5b6765b1c9cfaecea48ae71df4d279.png",
"gravatar_id": "31de909d8e55dd07ed782d92ece59842",
"url": "https://api.github.com/users/stedolan",
"html_url": "https://github.com/stedolan",
"followers_url": "https://api.github.com/users/stedolan/followers",
"following_url": "https://api.github.com/users/stedolan/following{/other_user}"
},
"comments_url": "https://api.github.com/repos/stedolan/jq/commits/d25341478381063d1c76e81b3a52e0592a7c997f/comments",
"html_url": "https://github.com/stedolan/jq/commit/d25341478381063d1c76e81b3a52e0592a7c997f",
"url": "https://api.github.com/repos/stedolan/jq/commits/d25341478381063d1c76e81b3a52e0592a7c997f",
"commit": {
"comment_count": 0,
"url": "https://api.github.com/repos/stedolan/jq/git/commits/d25341478381063d1c76e81b3a52e0592a7c997f",
"tree": {
"url": "https://api.github.com/repos/stedolan/jq/git/trees/6ab697a8dfb5a96e124666bf6d6213822599fb40",
"sha": "6ab697a8dfb5a96e124666bf6d6213822599fb40"
},
"message": "Merge pull request #162 from stedolan/utf8-fixes\n\nUtf8 fixes. Closes #161",
"committer": {
"date": "2013-06-22T16:30:59Z",
"email": "mu@netsoc.tcd.ie",
"name": "Stephen Dolan"
},
"author": {
"date": "2013-06-22T16:30:59Z",
"email": "mu@netsoc.tcd.ie",
"name": "Stephen Dolan"
}
},
"sha": "d25341478381063d1c76e81b3a52e0592a7c997f"
}

View Code

可以看到,我们已经拿到了一条完整的评论内容。但我们真正关心的只是评论内容和用户名,下面来获取这两项内容。

4、自定义格式输出某一项

jq '.[0] | {message: .commit.message, name: .commit.committer.name}'


结果:

{
"name": "Stephen Dolan",
"message": "Merge pull request #162 from stedolan/utf8-fixes\n\nUtf8 fixes. Closes #161"
}


我们可以看到,已经拿到了想要的内容,并且已经按我们自己定义的格式显示了。

这里 | 后面的内容是以前面的内容为输入的, .commit  中的 . 就是指 .[0] 中的内容。

5、自定义格式输出多项

jq '.[] | {message: .commit.message, name: .commit.committer.name}'


结果:

{
"name": "Stephen Dolan",
"message": "Merge pull request #162 from stedolan/utf8-fixes\n\nUtf8 fixes. Closes #161"
}
{
"name": "Stephen Dolan",
"message": "Reject all overlong UTF8 sequences."
}
{
"name": "Stephen Dolan",
"message": "Fix various UTF8 parsing bugs.\n\nIn particular, parse bad UTF8 by replacing the broken bits with U+FFFD\nand resychronise correctly after broken sequences."
}
{
"name": "Stephen Dolan",
"message": "Fix example in manual for `floor`. See #155."
}
{
"name": "Nicolas Williams",
"message": "Document floor"
}


这里 .[] 获取的是数组中的所有项。

我们看到,结果是一个个独立的JSON对象,如何把结果组合成一个数组呢?

6、以数组形式自定义输出多项

jq '[.[] | {message: .commit.message, name: .commit.committer.name}]'


结果:

[
{
"name": "Stephen Dolan",
"message": "Merge pull request #162 from stedolan/utf8-fixes\n\nUtf8 fixes. Closes #161"
},
{
"name": "Stephen Dolan",
"message": "Reject all overlong UTF8 sequences."
},
{
"name": "Stephen Dolan",
"message": "Fix various UTF8 parsing bugs.\n\nIn particular, parse bad UTF8 by replacing the broken bits with U+FFFD\nand resychronise correctly after broken sequences."
},
{
"name": "Stephen Dolan",
"message": "Fix example in manual for `floor`. See #155."
},
{
"name": "Nicolas Williams",
"message": "Document floor"
}
]


我们可以看到,只要在上一步的命令中内容的两端加个中括号即可。

最后,我们如果想获取每个评论的引用评论的url(在parents节点中,有一个或多个)呢?

7、获取其他内容

jq '[.[] | {message: .commit.message, name: .commit.committer.name, parents: [.parents[].html_url]}]'


结果:

[
{
"parents": [
"https://github.com/stedolan/jq/commit/54b9c9bdb225af5d886466d72f47eafc51acb4f7",
"https://github.com/stedolan/jq/commit/8b1b503609c161fea4b003a7179b3fbb2dd4345a"
],
"name": "Stephen Dolan",
"message": "Merge pull request #162 from stedolan/utf8-fixes\n\nUtf8 fixes. Closes #161"
},
{
"parents": [
"https://github.com/stedolan/jq/commit/ff48bd6ec538b01d1057be8e93b94eef6914e9ef"
],
"name": "Stephen Dolan",
"message": "Reject all overlong UTF8 sequences."
},
{
"parents": [
"https://github.com/stedolan/jq/commit/54b9c9bdb225af5d886466d72f47eafc51acb4f7"
],
"name": "Stephen Dolan",
"message": "Fix various UTF8 parsing bugs.\n\nIn particular, parse bad UTF8 by replacing the broken bits with U+FFFD\nand resychronise correctly after broken sequences."
},
{
"parents": [
"https://github.com/stedolan/jq/commit/3dcdc582ea993afea3f5503a78a77675967ecdfa"
],
"name": "Stephen Dolan",
"message": "Fix example in manual for `floor`. See #155."
},
{
"parents": [
"https://github.com/stedolan/jq/commit/7c4171d414f647ab08bcd20c76a4d8ed68d9c602"
],
"name": "Nicolas Williams",
"message": "Document floor"
}
]


这里用 .parents[].html_url 获取当前项的 parents 节点中的所有项的 html_url 属性的内容,然后两边加个中括号组装成数组输出。

怎么样,经过这个例子可以看出,用jq处理JSON数据还是很方便强大的吧。

8、注意点

jq能处理的需要是严格的JSON格式数据,JSON对象和JSON字符串是不行的,如下面的两种格式数据jq是不能处理的:

json对象:

{
a: 1,
b: {
c: "abc"
}
}


json字符串:

'{"a":1,"b":{"c":"abc"}}'


正确的JSON格式:

{
"a": 1,
"b": {
"c": "abc"
}
}


关于什么是JSON格式数据,请参考: ​​http://www.json.org/​

 

 

 



在Shell命令行处理JSON数据的方法



这篇文章主要介绍了在Shell命令行处理JSON数据的方法,使用jq工具实现,需要的朋友可以参考下



因为最近要处理一些 JSON 数据格式,所以在经过一番搜索后 最终找到了 jq 这个很棒的工具。jq 允许你直接在命令行下对 JSON 进行操作,包括分片、过滤、转换等等。

让我们通过几个例子来说明 jq 的功能:

一、输出格式化,漂亮的打印效果如果我们用文本编辑器打开 JSON,有时候可能看起来会一团糟,但是通过 jq 的 .(点)过滤器就可以立马让 JSON 的格式规整起来。

1.用文本编辑器打开后的样子


2.用 jq 显示的结果


复制代码代码如下:


% jq . soundtag.json


jq处理JSON数据, jq Manual (development version)_sed_02

二、快速查询JSON数据

利用 jq 能够以 key 作为关键字来对 JSON 作出快速查询, 例如:

复制代码代码如下:


% jq .cn soundtag.json


这将仅仅显示 cn 键对应的值。

jq 的键查询也支持链式调用,如:

复制代码代码如下:


% jq .cn[0].pNum soundtag.json


三、jq的管道操作

熟悉命令行的朋友可能都知道 | (管道)是一个非常强大的 武器。幸运的是,jq 也提供了对管道的支持。

复制代码代码如下:


% jq '.cn[] | { pNum }' soundTag.json


在这里,我们使用管道过滤并构造出 pNum 对象。

四、总结

如果你需要在命令行下处理 JSON,我强烈推荐 jq。jq 不仅 能够满足一般性的常见需求,更包含运算、内置函数、条件比较、 变量声明、自定函数等强大功能。对此感兴趣的朋友,不妨通过 jq 的官方手册 进行学习。

 

 

 


jq : Linux下json的命令行工具



   JSON是前端编程经常用到的格式,对于PHP或者Python,解析JSON都不是什么大事,尤其是PHP的json_encode和json_decode,干的相当的漂亮。Linux下也有处理处理JSON的神器:jq。

   对于JSON格式而言,jq就像sed/awk/grep这些神器一样的方便,而也,jq没有乱七八糟的依赖,只需要一个binary文件jq,就足矣。下面我们看下jq的使用。

   1 格式化JSON 

  1. manu@manu:~/code/php/json$ cat json_raw.txt 
  2. {"name":"Google","location":{"street":"1600 Amphitheatre Parkway","city":"Mountain View","state":"California","country":"US"},"employees":[{"name":"Michael","division":"Engineering"},{"name":"Laura","division":"HR"},{"name":"Elise","division":"Marketing"}]}

   上面的JSON是PHP json_encode之后,echo出来的字符串,很明显,可读性太差。前一阵子写文档,需要将前后段JSON写入文档,我当时是用是网上的JSON格式化工具做的。事实上,jq就可以检查JSON的合法性,并把JSON格式化成更友好更可读的格式:

  1.     cat json_raw.txt | jq .

    jq处理JSON数据, jq Manual (development version)_sed_03


   看到上图,将一团乱麻的JSON格式化成个更可读的形式。其实背后另外检查了JSON的合法性。如果JSON不合法,jq .会报错。我故意写个错误的JSON:  

  1. manu@manu:~/code/php/json$ cat json_err.txt 
  2. {"name":"Google","location":{"street":"1600 Amphitheatre Parkway","city":"Mountain View","state":"California","country":"US"},"employees":[{"name":"Michael","division":"Engineering"}{"name":"Laura","division":"HR"},{"name":"Elise","division":"Marketing"}]}

   上面JSON中加粗和斜体部分,遗漏了一个逗号,所以这个JSON是错误的,jq轻松的可以轻松的检查出来:

  1.     manu@manu:~/code/php/json$ cat json_err.txt |jq .
  2.     parse error: Expected separator between values at line 1, column 183

    2 JSON parse

   如上图json,jq如何解析JSON,根据key获取value?   

  1.     {
  2.         “key_1”:"value_1",
  3.         “key_2”:"value_2",
  4.     }

    如何根据key获取value? 

  1.     jq '.key'

    jq处理JSON数据, jq Manual (development version)_ide_04

   解析不存在的元素,会返回null

  1.     echo '{"foo": 42, "bar": "less interesting data"}' | jq .nofoo
  2.     null

   3 JSON nested parse(嵌套解析)  

  1.     cat json_raw.txt | jq '.location.state'
  2.     "California"

   4 JSON parse array   

  1.     cat json_raw.txt | jq '.employees[1].name'
  2.     "Laura"

   5 内建函数
  
jq还有一些内建函数如 key,has

  key是用来获取JSON中的key元素的: 

  1. cat json_raw.txt | jq 'keys'
  2. [
  3.   "employees",
  4.   "location",
  5.   "name"
  6. ]

   has是用来是判断是否存在某个key: 

  1. cat json_raw.txt | jq 'has("name")'
  2. true

  3. cat json_raw.txt | jq 'has("noexisted")'
  4. false

   与JSON相关的我就暂时介绍到这里,希望进一步了解jq并使用的可以去​​http://stedolan.github.io/jq/manual/​​,希望了解源码实现的,可以去​​https://github.com/stedolan/jq​​,意料之中的事情是 作者用来flex和bison来parse json。我们的示例JSON来自参考文献第一篇。用google搜索,kernalpanic中有篇文章介绍了jshon和json.sh提供了另外的思路。不过我挺喜欢及


参考文献:

1 ​How to parse JSON string via command line on Linux
​2 ​ j​​q - command-line JSON processor

 

 

 

 

 

 

 



Contents





jq Manual (development version)

For released versions, see jq 1.5​jq 1.4​ or ​jq 1.3​.

A jq program is a “filter”: it takes an input, and produces an output. There are a lot of builtin filters for extracting a particular field of an object, or converting a number to a string, or various other standard tasks.

Filters can be combined in various ways - you can pipe the output of one filter into another filter, or collect the output of a filter into an array.

Some filters produce multiple results, for instance there’s one that produces all the elements of its input array. Piping that filter into a second runs the second filter for each element of the array. Generally, things that would be done with loops and iteration in other languages are just done by gluing filters together in jq.

It’s important to remember that every filter has an input and an output. Even literals like “hello” or 42 are filters - they take an input but always produce the same literal as output. Operations that combine two filters, like addition, generally feed the same input to both and combine the results. So, you can implement an averaging filter as ​​add / length​​ - feeding the input array both to the ​​add​​ filter and the ​​length​​ filter and then performing the division.

But that’s getting ahead of ourselves. :) Let’s start with something simpler:

Invoking jq

jq filters run on a stream of JSON data. The input to jq is parsed as a sequence of whitespace-separated JSON values which are passed through the provided filter one at a time. The output(s) of the filter are written to standard out, again as a sequence of whitespace-separated JSON data.

Note: it is important to mind the shell’s quoting rules. As a general rule it’s best to always quote (with single-quote characters) the jq program, as too many characters with special meaning to jq are also shell meta-characters. For example, ​​jq "foo"​​ will fail on most Unix shells because that will be the same as ​​jq foo​​, which will generally fail because ​​foo is not defined​​. When using the Windows command shell (cmd.exe) it’s best to use double quotes around your jq program when given on the command-line (instead of the ​​-f program-file​​ option), but then double-quotes in the jq program need backslash escaping.

You can affect how jq reads and writes its input and output using some command-line options:

  • --version:
    Output the jq version and exit with zero.
  • --seq:
    Use the application/json-seq MIME type scheme for separating JSON texts in jq’s input and output. This means that an ASCII RS (record separator) character is printed before each value on output and an ASCII LF (line feed) is printed after every output. Input JSON texts that fail to parse are ignored (but warned about), discarding all subsequent input until the next RS. This mode also parses the output of jq without the --seq option.
  • --stream:
    Parse the input in streaming fashion, outputing arrays of path and leaf values (scalars and empty arrays or empty objects). For example, "a" becomes [[],"a"], and [[],"a",["b"]]becomes [[0],[]][[1],"a"], and [[1,0],"b"].
    This is useful for processing very large inputs. Use this in conjunction with filtering and the reduce and foreach syntax to reduce large inputs incrementally.
  • --slurp/-s:
    Instead of running the filter for each JSON object in the input, read the entire input stream into a large array and run the filter just once.
  • --raw-input/-R:
    Don’t parse the input as JSON. Instead, each line of text is passed to the filter as a string. If combined with --slurp, then the entire input is passed to the filter as a single long string.
  • --null-input/-n:
    Don’t read any input at all! Instead, the filter is run once using null as the input. This is useful when using jq as a simple calculator or to construct JSON data from scratch.
  • --compact-output / -c:
    By default, jq pretty-prints JSON output. Using this option will result in more compact output by instead putting each JSON object on a single line.
  • --tab:
    Use a tab for each indentation level instead of two spaces.
  • --indent n:
    Use the given number of spaces (no more than 8) for indentation.
  • --color-output / -C and --monochrome-output / -M:
    By default, jq outputs colored JSON if writing to a terminal. You can force it to produce color even if writing to a pipe or a file using -C, and disable color with -M.
  • --ascii-output / -a:
    jq usually outputs non-ASCII Unicode codepoints as UTF-8, even if the input specified them as escape sequences (like “\u03bc”). Using this option, you can force jq to produce pure ASCII output with every non-ASCII character replaced with the equivalent escape sequence.
  • --unbuffered
    Flush the output after each JSON object is printed (useful if you’re piping a slow data source into jq and piping jq’s output elsewhere).
  • --sort-keys / -S:
    Output the fields of each object with the keys in sorted order.
  • --raw-output / -r:
    With this option, if the filter’s result is a string then it will be written directly to standard output rather than being formatted as a JSON string with quotes. This can be useful for making jq filters talk to non-JSON-based systems.
  • --join-output / -j:
    Like -r but jq won’t print a newline after each output.
  • -f filename / --from-file filename:
    Read filter from the file rather than from a command line, like awk’s -f option. You can also use ‘#’ to make comments.
  • -Ldirectory / -L directory:
    Prepend directory to the search list for modules. If this option is used then no builtin search list is used. See the section on modules below.
  • -e / --exit-status:
    Sets the exit status of jq to 0 if the last output values was neither false nor null, 1 if the last output value was either false or null, or 4 if no valid result was ever produced. Normally jq exits with 2 if there was any usage problem or system error, 3 if there was a jq program compile error, or 0 if the jq program ran.
    Another way to set the exit status is with the halt_error builtin function.
  • --arg name value:
    This option passes a value to the jq program as a predefined variable. If you run jq with --arg foo bar, then $foo is available in the program and has the value "bar". Note that valuewill be treated as a string, so --arg foo 123 will bind $foo to "123".
    Named arguments are also available to the jq program as $ARGS.named.
  • --argjson name JSON-text:
    This option passes a JSON-encoded value to the jq program as a predefined variable. If you run jq with --argjson foo 123, then $foo is available in the program and has the value 123.
  • --slurpfile variable-name filename:
    This option reads all the JSON texts in the named file and binds an array of the parsed JSON values to the given global variable. If you run jq with --argfile foo bar, then $foo is available in the program and has an array whose elements correspond to the texts in the file named bar.
  • --argfile variable-name filename:
    Do not use. Use --slurpfile instead.
    (This option is like --slurpfile, but when the file has just one text, then that is used, else an array of texts is used as in --slurpfile.)
  • --args:
    Remaining arguments are positional string arguments. These are available to the jq program as $ARGS.positional[].
  • --jsonargs:
    Remaining arguments are positional JSON text arguments. These are available to the jq program as $ARGS.positional[].
  • --run-tests [filename]:
    Runs the tests in the given file or standard input. This must be the last option given and does not honor all preceding options. The input consists of comment lines, empty lines, and program lines followed by one input line, as many lines of output as are expected (one per output), and a terminating empty line. Compilation failure tests start with a line containing only “%%FAIL”, then a line containing the program to compile, then a line containing an error message to compare to the actual.
    Be warned that this option can change backwards-incompatibly.

Basic filters

Identity: ​​.​

The absolute simplest filter is ​​.​​ . This is a filter that takes its input and produces it unchanged as output. That is, this is the identity operator.

Since jq by default pretty-prints all output, this trivial program can be a useful way of formatting JSON output from, say, ​​curl​​.


​ Example​


Object Identifier-Index: ​​.foo​​, ​​.foo.bar​

The simplest useful filter is ​​.foo​​. When given a JSON object (aka dictionary or hash) as input, it produces the value at the key “foo”, or null if there’s none present.

A filter of the form ​​.foo.bar​​ is equivalent to ​​.foo|.bar​​.

This syntax only works for simple, identifier-like keys, that is, keys that are all made of alphanumeric characters and underscore, and which do not start with a digit.

If the key contains special characters, you need to surround it with double quotes like this: ​​."foo$"​​, or else ​​.["foo$"]​​.

For example ​​.["foo::bar"]​​ and ​​.["foo.bar"]​​ work while ​​.foo::bar​​ does not, and ​​.foo.bar​​means ​​.["foo"].["bar"]​​.


​ Examples​


Optional Object Identifier-Index: ​​.foo?​

Just like ​​.foo​​, but does not output even an error when ​​.​​ is not an array or an object.


​ Examples​


Generic Object Index: ​​.[<string>]​

You can also look up fields of an object using syntax like ​​.["foo"]​​ (.foo above is a shorthand version of this, but only for identifier-like strings).

Array Index: ​​.[2]​

When the index value is an integer, ​​.[<value>]​​ can index arrays. Arrays are zero-based, so ​​.[2]​​returns the third element.

Negative indices are allowed, with -1 referring to the last element, -2 referring to the next to last element, and so on.


​ Examples​


Array/String Slice: ​​.[10:15]​

The ​​.[10:15]​​ syntax can be used to return a subarray of an array or substring of a string. The array returned by ​​.[10:15]​​ will be of length 5, containing the elements from index 10 (inclusive) to index 15 (exclusive). Either index may be negative (in which case it counts backwards from the end of the array), or omitted (in which case it refers to the start or end of the array).


​ Examples​


Array/Object Value Iterator: ​​.[]​

If you use the ​​.[index]​​ syntax, but omit the index entirely, it will return all of the elements of an array. Running ​​.[]​​ with the input ​​[1,2,3]​​ will produce the numbers as three separate results, rather than as a single array.

You can also use this on an object, and it will return all the values of the object.


​ Examples​


​.[]?​

Like ​​.[]​​, but no errors will be output if . is not an array or object.

Comma: ​​,​

If two filters are separated by a comma, then the same input will be fed into both and the two filters’ output value streams will be concatenated in order: first, all of the outputs produced by the left expression, and then all of the outputs produced by the right. For instance, filter ​​.foo, .bar​​, produces both the “foo” fields and “bar” fields as separate outputs.


​ Examples​


Pipe: ​​|​

The | operator combines two filters by feeding the output(s) of the one on the left into the input of the one on the right. It’s pretty much the same as the Unix shell’s pipe, if you’re used to that.

If the one on the left produces multiple results, the one on the right will be run for each of those results. So, the expression ​​.[] | .foo​​ retrieves the “foo” field of each element of the input array.

Note that ​​.a.b.c​​ is the same as ​​.a | .b | .c​​.

Note too that ​​.​​ is the input value at the particular stage in a “pipeline”, specifically: where the ​​.​​expression appears. Thus ​​.a | . | .b​​ is the same as ​​.a.b​​, as the ​​.​​ in the middle refers to whatever value ​​.a​​ produced.


​ Example​


Parenthesis

Parenthesis work as a grouping operator just as in any typical programming language.


​ Example​


Types and Values

jq supports the same set of datatypes as JSON - numbers, strings, booleans, arrays, objects (which in JSON-speak are hashes with only string keys), and “null”.

Booleans, null, strings and numbers are written the same way as in javascript. Just like everything else in jq, these simple values take an input and produce an output - ​​42​​ is a valid jq expression that takes an input, ignores it, and returns 42 instead.

Array construction: ​​[]​

As in JSON, ​​[]​​ is used to construct arrays, as in ​​[1,2,3]​​. The elements of the arrays can be any jq expression, including a pipeline. All of the results produced by all of the expressions are collected into one big array. You can use it to construct an array out of a known quantity of values (as in ​​[.foo, .bar, .baz]​​) or to “collect” all the results of a filter into an array (as in ​​[.items[].name]​​)

Once you understand the “,” operator, you can look at jq’s array syntax in a different light: the expression ​​[1,2,3]​​ is not using a built-in syntax for comma-separated arrays, but is instead applying the ​​[]​​ operator (collect results) to the expression 1,2,3 (which produces three different results).

If you have a filter ​​X​​ that produces four results, then the expression ​​[X]​​ will produce a single result, an array of four elements.


​ Examples​


Object Construction: ​​{}​

Like JSON, ​​{}​​ is for constructing objects (aka dictionaries or hashes), as in: ​​{"a": 42, "b": 17}​​.

If the keys are “identifier-like”, then the quotes can be left off, as in ​​{a:42, b:17}​​. Keys generated by expressions need to be parenthesized, e.g., ​​{("a"+"b"):59}​​.

The value can be any expression (although you may need to wrap it in parentheses if it’s a complicated one), which gets applied to the {} expression’s input (remember, all filters have an input and an output).

{foo: .bar}


will produce the JSON object ​​{"foo": 42}​​ if given the JSON object ​​{"bar":42, "baz":43}​​ as its input. You can use this to select particular fields of an object: if the input is an object with “user”, “title”, “id”, and “content” fields and you just want “user” and “title”, you can write

{user: .user, title: .title}


Because that is so common, there’s a shortcut syntax for it: ​​{user, title}​​.

If one of the expressions produces multiple results, multiple dictionaries will be produced. If the input’s

{"user":"stedolan","titles":["JQ Primer", "More JQ"]}


then the expression

{user, title: .titles[]}


will produce two outputs:

{"user":"stedolan", "title": "JQ Primer"}
{"user":"stedolan", "title": "More JQ"}


Putting parentheses around the key means it will be evaluated as an expression. With the same input as above,

{(.user): .titles}


produces

{"stedolan": ["JQ Primer", "More JQ"]}


​ Examples​


Recursive Descent: ​​..​

Recursively descends ​​.​​, producing every value. This is the same as the zero-argument ​​recurse​​builtin (see below). This is intended to resemble the XPath ​​//​​ operator. Note that ​​..a​​ does not work; use ​​..|.a​​ instead. In the example below we use ​​..|.a?​​ to find all the values of object keys “a” in any object found “below” ​​.​​.

This is particularly useful in conjunction with ​​path(EXP)​​ (also see below) and the ​​?​​ operator.


​ Example​


Builtin operators and functions

Some jq operator (for instance, ​​+​​) do different things depending on the type of their arguments (arrays, numbers, etc.). However, jq never does implicit type conversions. If you try to add a string to an object you’ll get an error message and no result.

Addition: ​​+​

The operator ​​+​​ takes two filters, applies them both to the same input, and adds the results together. What “adding” means depends on the types involved:

  • Numbers are added by normal arithmetic.
  • Arrays are added by being concatenated into a larger array.
  • Strings are added by being joined into a larger string.
  • Objects are added by merging, that is, inserting all the key-value pairs from both objects into a single combined object. If both objects contain a value for the same key, the object on the right of the + wins. (For recursive merge use the * operator.)

​null​​ can be added to any value, and returns the other value unchanged.


​ Examples​


Subtraction: ​​-​

As well as normal arithmetic subtraction on numbers, the ​​-​​ operator can be used on arrays to remove all occurrences of the second array’s elements from the first array.


​ Examples​


Multiplication, division, modulo: ​​*​​, ​​/​​, and ​​%​

These infix operators behave as expected when given two numbers. Division by zero raises an error. ​​x % y​​ computes x modulo y.

Multiplying a string by a number produces the concatenation of that string that many times. ​​"x" * 0​​produces null.

Dividing a string by another splits the first using the second as separators.

Multiplying two objects will merge them recursively: this works like addition but if both objects contain a value for the same key, and the values are objects, the two are merged with the same strategy.


​ Examples​


​length​

The builtin function ​​length​​ gets the length of various different types of value:

  • The length of a string is the number of Unicode codepoints it contains (which will be the same as its JSON-encoded length in bytes if it’s pure ASCII).
  • The length of an array is the number of elements.
  • The length of an object is the number of key-value pairs.
  • The length of null is zero.


​ Example​


​utf8bytelength​

The builtin function ​​utf8bytelength​​ outputs the number of bytes used to encode a string in UTF-8.


​ Example​


​keys​​, ​​keys_unsorted​

The builtin function ​​keys​​, when given an object, returns its keys in an array.

The keys are sorted “alphabetically”, by unicode codepoint order. This is not an order that makes particular sense in any particular language, but you can count on it being the same for any two objects with the same set of keys, regardless of locale settings.

When ​​keys​​ is given an array, it returns the valid indices for that array: the integers from 0 to length-1.

The ​​keys_unsorted​​ function is just like ​​keys​​, but if the input is an object then the keys will not be sorted, instead the keys will roughly be in insertion order.


​ Examples​


​has(key)​

The builtin function ​​has​​ returns whether the input object has the given key, or the input array has an element at the given index.

​has($key)​​ has the same effect as checking whether ​​$key​​ is a member of the array returned by ​​keys​​, although ​​has​​ will be faster.


​ Examples​


​in​

The builtin function ​​in​​ returns whether or not the input key is in the given object, or the input index corresponds to an element in the given array. It is, essentially, an inversed version of ​​has​​.


​ Examples​


​map(x)​​, ​​map_values(x)​

For any filter ​​x​​, ​​map(x)​​ will run that filter for each element of the input array, and return the outputs in a new array. ​​map(.+1)​​ will increment each element of an array of numbers.

Similarly, ​​map_values(x)​​ will run that filter for each element, but it will return an object when an object is passed.

​map(x)​​ is equivalent to ​​[.[] | x]​​. In fact, this is how it’s defined. Similarly, ​​map_values(x)​​ is defined as ​​.[] |= x​​.


​ Examples​


​path(path_expression)​

Outputs array representations of the given path expression in ​​.​​. The outputs are arrays of strings (object keys) and/or numbers (array indices).

Path expressions are jq expressions like ​​.a​​, but also ​​.[]​​. There are two types of path expressions: ones that can match exactly, and ones that cannot. For example, ​​.a.b.c​​ is an exact match path expression, while ​​.a[].b​​ is not.

​path(exact_path_expression)​​ will produce the array representation of the path expression even if it does not exist in ​​.​​, if ​​.​​ is ​​null​​ or an array or an object.

​path(pattern)​​ will produce array representations of the paths matching ​​pattern​​ if the paths exist in ​​.​​.

Note that the path expressions are not different from normal expressions. The expression ​​path(..|select(type=="boolean"))​​ outputs all the paths to boolean values in ​​.​​, and only those paths.


​ Examples​


​del(path_expression)​

The builtin function ​​del​​ removes a key and its corresponding value from an object.


​ Examples​


​getpath(PATHS)​

The builtin function ​​getpath​​ outputs the values in ​​.​​ found at each path in ​​PATHS​​.


​ Examples​


​setpath(PATHS; VALUE)​

The builtin function ​​setpath​​ sets the ​​PATHS​​ in ​​.​​ to ​​VALUE​​.


​ Examples​


​delpaths(PATHS)​

The builtin function ​​delpaths​​ sets the ​​PATHS​​ in ​​.​​. ​​PATHS​​ must be an array of paths, where each path is an array of strings and numbers.


​ Example​


​to_entries​​, ​​from_entries​​, ​​with_entries​

These functions convert between an object and an array of key-value pairs. If ​​to_entries​​ is passed an object, then for each ​​k: v​​ entry in the input, the output array includes ​​{"key": k, "value": v}​​.

​from_entries​​ does the opposite conversion, and ​​with_entries(foo)​​ is a shorthand for ​​to_entries | map(foo) | from_entries​​, useful for doing some operation to all keys and values of an object. ​​from_entries​​ accepts key, Key, name, Name, value and Value as keys.


​ Examples​


​select(boolean_expression)​

The function ​​select(foo)​​ produces its input unchanged if ​​foo​​ returns true for that input, and produces no output otherwise.

It’s useful for filtering lists: ​​[1,2,3] | map(select(. >= 2))​​ will give you ​​[2,3]​​.


​ Examples​


​arrays​​, ​​objects​​, ​​iterables​​, ​​booleans​​, ​​numbers​​, ​​normals​​, ​​finites​​, ​​strings​​, ​​nulls​​, ​​values​​, ​​scalars​

These built-ins select only inputs that are arrays, objects, iterables (arrays or objects), booleans, numbers, normal numbers, finite numbers, strings, null, non-null values, and non-iterables, respectively.


​ Example​


​empty​

​empty​​ returns no results. None at all. Not even ​​null​​.

It’s useful on occasion. You’ll know if you need it :)


​ Examples​


​error(message)​

Produces an error, just like ​​.a​​ applied to values other than null and objects would, but with the given message as the error’s value. Errors can be caught with try/catch; see below.

​halt​

Stops the jq program with no further outputs. jq will exit with exit status ​​0​​.

​halt_error​​, ​​halt_error(exit_code)​

Stops the jq program with no further outputs. The input will be printed on ​​stderr​​ as raw output (i.e., strings will not have double quotes) with no decoration, not even a newline.

The given ​​exit_code​​ (defaulting to ​​5​​) will be jq’s exit status.

For example, ​​"Error: somthing went wrong\n"|halt_error(1)​​.

​$__loc__​

Produces an object with a “file” key and a “line” key, with the filename and line number where ​​$__loc__​​ occurs, as values.


​ Example​


​paths​​, ​​paths(node_filter)​​, ​​leaf_paths​

​paths​​ outputs the paths to all the elements in its input (except it does not output the empty list, representing . itself).

​paths(f)​​ outputs the paths to any values for which ​​f​​ is true. That is, ​​paths(numbers)​​ outputs the paths to all numeric values.

​leaf_paths​​ is an alias of ​​paths(scalars)​​; ​​leaf_paths​​ is deprecated and will be removed in the next major release.


​ Examples​


​add​

The filter ​​add​​ takes as input an array, and produces as output the elements of the array added together. This might mean summed, concatenated or merged depending on the types of the elements of the input array - the rules are the same as those for the ​​+​​ operator (described above).

If the input is an empty array, ​​add​​ returns ​​null​​.


​ Examples​


​any​​, ​​any(condition)​​, ​​any(generator; condition)​

The filter ​​any​​ takes as input an array of boolean values, and produces ​​true​​ as output if any of the elements of the array are ​​true​​.

If the input is an empty array, ​​any​​ returns ​​false​​.

The ​​any(condition)​​ form applies the given condition to the elements of the input array.

The ​​any(generator; condition)​​ form applies the given condition to all the outputs of the given generator.


​ Examples​


​all​​, ​​all(condition)​​, ​​all(generator; condition)​

The filter ​​all​​ takes as input an array of boolean values, and produces ​​true​​ as output if all of the elements of the array are ​​true​​.

The ​​all(condition)​​ form applies the given condition to the elements of the input array.

The ​​all(generator; condition)​​ form applies the given condition to all the outputs of the given generator.

If the input is an empty array, ​​all​​ returns ​​true​​.


​ Examples​


​flatten​​, ​​flatten(depth)​

The filter ​​flatten​​ takes as input an array of nested arrays, and produces a flat array in which all arrays inside the original array have been recursively replaced by their values. You can pass an argument to it to specify how many levels of nesting to flatten.

​flatten(2)​​ is like ​​flatten​​, but going only up to two levels deep.


​ Examples​


​range(upto)​​, ​​range(from;upto)​​ ​​range(from;upto;by)​

The ​​range​​ function produces a range of numbers. ​​range(4;10)​​ produces 6 numbers, from 4 (inclusive) to 10 (exclusive). The numbers are produced as separate outputs. Use ​​[range(4;10)]​​ to get a range as an array.

The one argument form generates numbers from 0 to the given number, with an increment of 1.

The two argument form generates numbers from ​​from​​ to ​​upto​​ with an increment of 1.

The three argument form generates numbers ​​from​​ to ​​upto​​ with an increment of ​​by​​.


​ Examples​


​floor​

The ​​floor​​ function returns the floor of its numeric input.


​ Example​


​sqrt​

The ​​sqrt​​ function returns the square root of its numeric input.


​ Example​


​tonumber​

The ​​tonumber​​ function parses its input as a number. It will convert correctly-formatted strings to their numeric equivalent, leave numbers alone, and give an error on all other input.


​ Example​


​tostring​

The ​​tostring​​ function prints its input as a string. Strings are left unchanged, and all other values are JSON-encoded.


​ Example​


​type​

The ​​type​​ function returns the type of its argument as a string, which is one of null, boolean, number, string, array or object.


​ Example​


​infinite​​, ​​nan​​, ​​isinfinite​​, ​​isnan​​, ​​isfinite​​, ​​isnormal​

Some arithmetic operations can yield infinities and “not a number” (NaN) values. The ​​isinfinite​​builtin returns ​​true​​ if its input is infinite. The ​​isnan​​ builtin returns ​​true​​ if its input is a NaN. The ​​infinite​​ builtin returns a positive infinite value. The ​​nan​​ builtin returns a NaN. The ​​isnormal​​builtin returns true if its input is a normal number.

Note that division by zero raises an error.

Currently most arithmetic operations operating on infinities, NaNs, and sub-normals do not raise errors.


​ Examples​


​sort, sort_by(path_expression)​

The ​​sort​​ functions sorts its input, which must be an array. Values are sorted in the following order:

  • ​null​
  • ​false​
  • ​true​
  • numbers
  • strings, in alphabetical order (by unicode codepoint value)
  • arrays, in lexical order
  • objects

The ordering for objects is a little complex: first they’re compared by comparing their sets of keys (as arrays in sorted order), and if their keys are equal then the values are compared key by key.

​sort​​ may be used to sort by a particular field of an object, or by applying any jq filter.

​sort_by(foo)​​ compares two elements by comparing the result of ​​foo​​ on each element.


​ Examples​


​group_by(path_expression)​

​group_by(.foo)​​ takes as input an array, groups the elements having the same ​​.foo​​ field into separate arrays, and produces all of these arrays as elements of a larger array, sorted by the value of the ​​.foo​​ field.

Any jq expression, not just a field access, may be used in place of ​​.foo​​. The sorting order is the same as described in the ​​sort​​ function above.


​ Example​


​min​​, ​​max​​, ​​min_by(path_exp)​​, ​​max_by(path_exp)​

Find the minimum or maximum element of the input array.

The ​​min_by(path_exp)​​ and ​​max_by(path_exp)​​ functions allow you to specify a particular field or property to examine, e.g. ​​min_by(.foo)​​ finds the object with the smallest ​​foo​​ field.


​ Examples​


​unique​​, ​​unique_by(path_exp)​

The ​​unique​​ function takes as input an array and produces an array of the same elements, in sorted order, with duplicates removed.

The ​​unique_by(path_exp)​​ function will keep only one element for each value obtained by applying the argument. Think of it as making an array by taking one element out of every group produced by ​​group​​.


​ Examples​


​reverse​

This function reverses an array.


​ Example​


​contains(element)​

The filter ​​contains(b)​​ will produce true if b is completely contained within the input. A string B is contained in a string A if B is a substring of A. An array B is contained in an array A if all elements in B are contained in any element in A. An object B is contained in object A if all of the values in B are contained in the value in A with the same key. All other types are assumed to be contained in each other if they are equal.


​ Examples​


​indices(s)​

Outputs an array containing the indices in ​​.​​ where ​​s​​ occurs. The input may be an array, in which case if ​​s​​ is an array then the indices output will be those where all elements in ​​.​​ match those of ​​s​​.


​ Examples​


​index(s)​​, ​​rindex(s)​

Outputs the index of the first (​​index​​) or last (​​rindex​​) occurrence of ​​s​​ in the input.


​ Examples​


​inside​

The filter ​​inside(b)​​ will produce true if the input is completely contained within b. It is, essentially, an inversed version of ​​contains​​.


​ Examples​


​startswith(str)​

Outputs ​​true​​ if . starts with the given string argument.


​ Example​


​endswith(str)​

Outputs ​​true​​ if . ends with the given string argument.


​ Example​


​combinations​​, ​​combinations(n)​

Outputs all combinations of the elements of the arrays in the input array. If given an argument ​​n​​, it outputs all combinations of ​​n​​ repetitions of the input array.


​ Examples​


​ltrimstr(str)​

Outputs its input with the given prefix string removed, if it starts with it.


​ Example​


​rtrimstr(str)​

Outputs its input with the given suffix string removed, if it ends with it.


​ Example​


​explode​

Converts an input string into an array of the string’s codepoint numbers.


​ Example​


​implode​

The inverse of explode.


​ Example​


​split(str)​

Splits an input string on the separator argument.


​ Example​


​join(str)​

Joins the array of elements given as input, using the argument as separator. It is the inverse of ​​split​​: that is, running ​​split("foo") | join("foo")​​ over any input string returns said input string.

Numbers and booleans in the input are converted to strings. Null values are treated as empty strings. Arrays and objects in the input are not supported.


​ Examples​


​ascii_downcase​​, ​​ascii_upcase​

Emit a copy of the input string with its alphabetic characters (a-z and A-Z) converted to the specified case.

​while(cond; update)​

The ​​while(cond; update)​​ function allows you to repeatedly apply an update to ​​.​​ until ​​cond​​ is false.

Note that ​​while(cond; update)​​ is internally defined as a recursive jq function. Recursive calls within ​​while​​ will not consume additional memory if ​​update​​ produces at most one output for each input. See advanced topics below.


​ Example​


​until(cond; next)​

The ​​until(cond; next)​​ function allows you to repeatedly apply the expression ​​next​​, initially to ​​.​​then to its own output, until ​​cond​​ is true. For example, this can be used to implement a factorial function (see below).

Note that ​​until(cond; next)​​ is internally defined as a recursive jq function. Recursive calls within ​​until()​​ will not consume additional memory if ​​next​​ produces at most one output for each input. See advanced topics below.


​ Example​


​recurse(f)​​, ​​recurse​​, ​​recurse(f; condition)​​, ​​recurse_down​

The ​​recurse(f)​​ function allows you to search through a recursive structure, and extract interesting data from all levels. Suppose your input represents a filesystem:

{"name": "/", "children": [
{"name": "/bin", "children": [
{"name": "/bin/ls", "children": []},
{"name": "/bin/sh", "children": []}]},
{"name": "/home", "children": [
{"name": "/home/stephen", "children": [
{"name": "/home/stephen/jq", "children": []}]}]}]}


Now suppose you want to extract all of the filenames present. You need to retrieve ​​.name​​, ​​.children[].name​​, ​​.children[].children[].name​​, and so on. You can do this with:

recurse(.children[]) | .name


When called without an argument, ​​recurse​​ is equivalent to ​​recurse(.[]?)​​.

​recurse(f)​​ is identical to ​​recurse(f; . != null)​​ and can be used without concerns about recursion depth.

​recurse(f; condition)​​ is a generator which begins by emitting . and then emits in turn .|f, .|f|f, .|f|f|f, … so long as the computed value satisfies the condition. For example, to generate all the integers, at least in principle, one could write ​​recurse(.+1; true)​​.

For legacy reasons, ​​recurse_down​​ exists as an alias to calling ​​recurse​​ without arguments. This alias is considered deprecated and will be removed in the next major release.

The recursive calls in ​​recurse​​ will not consume additional memory whenever ​​f​​ produces at most a single output for each input.


​ Examples​


​walk(f)​

The ​​walk(f)​​ function applies f recursively to every component of the input entity. When an array is encountered, f is first applied to its elements and then to the array itself; when an object is encountered, f is first applied to all the values and then to the object. In practice, f will usually test the type of its input, as illustrated in the following examples. The first example highlights the usefulness of processing the elements of an array of arrays before processing the array itself. The second example shows how all the keys of all the objects within the input can be considered for alteration.


​ Examples​


​$ENV​​, ​​env​

​$ENV​​ is an object representing the environment variables as set when the jq program started.

​env​​ outputs an object representing jq’s current environment.

At the moment there is no builtin for setting environment variables.


​ Examples​


​transpose​

Transpose a possibly jagged matrix (an array of arrays). Rows are padded with nulls so the result is always rectangular.


​ Example​


​bsearch(x)​

bsearch(x) conducts a binary search for x in the input array. If the input is sorted and contains x, then bsearch(x) will return its index in the array; otherwise, if the array is sorted, it will return (-1 - ix) where ix is an insertion point such that the array would still be sorted after the insertion of x at ix. If the array is not sorted, bsearch(x) will return an integer that is probably of no interest.


​ Examples​


String interpolation - ​​\(foo)​

Inside a string, you can put an expression inside parens after a backslash. Whatever the expression returns will be interpolated into the string.


​ Example​


Convert to/from JSON

The ​​tojson​​ and ​​fromjson​​ builtins dump values as JSON texts or parse JSON texts into values, respectively. The tojson builtin differs from tostring in that tostring returns strings unmodified, while tojson encodes strings as JSON strings.


​ Examples​


Format strings and escaping

The ​​@foo​​ syntax is used to format and escape strings, which is useful for building URLs, documents in a language like HTML or XML, and so forth. ​​@foo​​ can be used as a filter on its own, the possible escapings are:

  • @text:
    Calls tostring, see that function for details.
  • @json:
    Serializes the input as JSON.
  • @html:
    Applies HTML/XML escaping, by mapping the characters <>&'" to their entity equivalents &lt;&gt;&amp;&apos;&quot;.
  • @uri:
    Applies percent-encoding, by mapping all reserved URI characters to a %XX sequence.
  • @csv:
    The input must be an array, and it is rendered as CSV with double quotes for strings, and quotes escaped by repetition.
  • @tsv:
    The input must be an array, and it is rendered as TSV (tab-separated values). Each input array will be printed as a single line. Fields are separated by a single tab (ascii 0x09). Input characters line-feed (ascii 0x0a), carriage-return (ascii 0x0d), tab (ascii 0x09) and backslash (ascii 0x5c) will be output as escape sequences \n\r\t\\ respectively.
  • @sh:
    The input is escaped suitable for use in a command-line for a POSIX shell. If the input is an array, the output will be a series of space-separated strings.
  • @base64:
    The input is converted to base64 as specified by RFC 4648.
  • @base64d:
    The inverse of @base64, input is decoded as specified by RFC 4648. Note: If the decoded string is not UTF-8, the results are undefined.

This syntax can be combined with string interpolation in a useful way. You can follow a ​​@foo​​ token with a string literal. The contents of the string literal will not be escaped. However, all interpolations made inside that string literal will be escaped. For instance,

@uri "https://www.google.com/search?q=\(.search)"


will produce the following output for the input ​​{"search":"what is jq?"}​​:

"https://www.google.com/search?q=what%20is%20jq%3F"


Note that the slashes, question mark, etc. in the URL are not escaped, as they were part of the string literal.


​ Examples​


Dates

jq provides some basic date handling functionality, with some high-level and low-level builtins. In all cases these builtins deal exclusively with time in UTC.

The ​​fromdateiso8601​​ builtin parses datetimes in the ISO 8601 format to a number of seconds since the Unix epoch (1970-01-01T00:00:00Z). The ​​todateiso8601​​ builtin does the inverse.

The ​​fromdate​​ builtin parses datetime strings. Currently ​​fromdate​​ only supports ISO 8601 datetime strings, but in the future it will attempt to parse datetime strings in more formats.

The ​​todate​​ builtin is an alias for ​​todateiso8601​​.

The ​​now​​ builtin outputs the current time, in seconds since the Unix epoch.

Low-level jq interfaces to the C-library time functions are also provided: ​​strptime​​, ​​strftime​​, ​​strflocaltime​​, ​​mktime​​, ​​gmtime​​, and ​​localtime​​. Refer to your host operating system’s documentation for the format strings used by ​​strptime​​ and ​​strftime​​. Note: these are not necessarily stable interfaces in jq, particularly as to their localization functionality.

The ​​gmtime​​ builtin consumes a number of seconds since the Unix epoch and outputs a “broken down time” representation of Greenwhich Meridian time as an array of numbers representing (in this order): the year, the month (zero-based), the day of the month, the hour of the day, the minute of the hour, the second of the minute, the day of the week, and the day of the year – all one-based unless otherwise stated.

The ​​localtime​​ builtin works like the ​​gmtime​​ builtin, but using the local timezone setting.

The ​​mktime​​ builtin consumes “broken down time” representations of time output by ​​gmtime​​ and ​​strptime​​.

The ​​strptime(fmt)​​ builtin parses input strings matching the ​​fmt​​ argument. The output is in the “broken down time” representation consumed by ​​gmtime​​ and output by ​​mktime​​.

The ​​strftime(fmt)​​ builtin formats a time (GMT) with the given format. The ​​strflocaltime​​ does the same, but using the local timezone setting.

The format strings for ​​strptime​​ and ​​strftime​​ are described in typical C library documentation. The format string for ISO 8601 datetime is ​​"%Y-%m-%dT%H:%M:%SZ"​​.

jq may not support some or all of this date functionality on some systems.


​ Examples​


SQL-Style Operators

jq provides a few SQL-style operators.

  • INDEX(stream; index_expression):
    This builtin produces an object whose keys are computed by the given index expression applied to each value from the given stream.
  • JOIN($idx; stream; idx_expr; join_expr):
    This builtin joins the values from the given stream to the given index. The index’s keys are computed by applying the given index expression to each value from the given stream. An array of the value in the stream and the corresponding value from the index is fed to the given join expression to produce each result.
  • JOIN($idx; stream; idx_expr):
    Same as JOIN($idx; stream; idx_expr; .).
  • JOIN($idx; idx_expr):
    This builtin joins the input . to the given index, applying the given index expression to . to compute the index key. The join operation is as described above.
  • IN(s):
    This builtin outputs true if . appears in the given stream, otherwise it outputs false.
  • IN(source; s):
    This builtin outputs true if any value in the source stream appears in the second stream, otherwise it outputs false.

​builtins​

Returns a list of all builtin functions in the format ​​name/arity​​. Since functions with the same name but different arities are considered separate functions, ​​all/0​​, ​​all/1​​, and ​​all/2​​ would all be present in the list.

Conditionals and Comparisons

​==​​, ​​!=​

The expression ‘a == b’ will produce ‘true’ if the result of a and b are equal (that is, if they represent equivalent JSON documents) and ‘false’ otherwise. In particular, strings are never considered equal to numbers. If you’re coming from Javascript, jq’s == is like Javascript’s === - considering values equal only when they have the same type as well as the same value.

!= is “not equal”, and ‘a != b’ returns the opposite value of ‘a == b’


​ Example​


if-then-else

​if A then B else C end​​ will act the same as ​​B​​ if ​​A​​ produces a value other than false or null, but act the same as ​​C​​ otherwise.

Checking for false or null is a simpler notion of “truthiness” than is found in Javascript or Python, but it means that you’ll sometimes have to be more explicit about the condition you want: you can’t test whether, e.g. a string is empty using ​​if .name then A else B end​​, you’ll need something more like ​​if (.name | length) > 0 then A else B end​​ instead.

If the condition ​​A​​ produces multiple results, then ​​B​​ is evaluated once for each result that is not false or null, and ​​C​​ is evaluated once for each false or null.

More cases can be added to an if using ​​elif A then B​​ syntax.


​ Example​


​>, >=, <=, <​

The comparison operators ​​>​​, ​​>=​​, ​​<=​​, ​​<​​ return whether their left argument is greater than, greater than or equal to, less than or equal to or less than their right argument (respectively).

The ordering is the same as that described for ​​sort​​, above.


​ Example​


and/or/not

jq supports the normal Boolean operators and/or/not. They have the same standard of truth as if expressions - false and null are considered “false values”, and anything else is a “true value”.

If an operand of one of these operators produces multiple results, the operator itself will produce a result for each input.

​not​​ is in fact a builtin function rather than an operator, so it is called as a filter to which things can be piped rather than with special syntax, as in ​​.foo and .bar | not​​.

These three only produce the values “true” and “false”, and so are only useful for genuine Boolean operations, rather than the common Perl/Python/Ruby idiom of “value_that_may_be_null or default”. If you want to use this form of “or”, picking between two values rather than evaluating a condition, see the “//” operator below.


​ Examples​


Alternative operator: ​​//​

A filter of the form ​​a // b​​ produces the same results as ​​a​​, if ​​a​​ produces results other than ​​false​​and ​​null​​. Otherwise, ​​a // b​​ produces the same results as ​​b​​.

This is useful for providing defaults: ​​.foo // 1​​ will evaluate to ​​1​​ if there’s no ​​.foo​​ element in the input. It’s similar to how ​​or​​ is sometimes used in Python (jq’s ​​or​​ operator is reserved for strictly Boolean operations).


​ Examples​


try-catch

Errors can be caught by using ​​try EXP catch EXP​​. The first expression is executed, and if it fails then the second is executed with the error message. The output of the handler, if any, is output as if it had been the output of the expression to try.

The ​​try EXP​​ form uses ​​empty​​ as the exception handler.


​ Examples​


Breaking out of control structures

A convenient use of try/catch is to break out of control structures like ​​reduce​​, ​​foreach​​, ​​while​​, and so on.

For example:

# Repeat an expression until it raises "break" as an
# error, then stop repeating without re-raising the error.
# But if the error caught is not "break" then re-raise it.
try repeat(exp) catch .=="break" then empty else error;


jq has a syntax for named lexical labels to “break” or “go (back) to”:

label $out | ... break $out ...


The ​​break $label_name​​ expression will cause the program to to act as though the nearest (to the left) ​​label $label_name​​ produced ​​empty​​.

The relationship between the ​​break​​ and corresponding ​​label​​ is lexical: the label has to be “visible” from the break.

To break out of a ​​reduce​​, for example:

label $out | reduce .[] as $item (null; if .==false then break $out else ... end)


The following jq program produces a syntax error:

break $out


because no label ​​$out​​ is visible.

Error Suppresion / Optional Operator: ​​?​

The ​​?​​ operator, used as ​​EXP?​​, is shorthand for ​​try EXP​​.


​ Example​


Regular expressions (PCRE)

jq uses the Oniguruma regular expression library, as do php, ruby, TextMate, Sublime Text, etc, so the description here will focus on jq specifics.

The jq regex filters are defined so that they can be used using one of these patterns:

STRING | FILTER( REGEX )
STRING | FILTER( REGEX; FLAGS )
STRING | FILTER( [REGEX] )
STRING | FILTER( [REGEX, FLAGS] )


where:

  • STRING, REGEX and FLAGS are jq strings and subject to jq string interpolation;
  • REGEX, after string interpolation, should be a valid PCRE regex;
  • FILTER is one of ​​test​​, ​​match​​, or ​​capture​​, as described below.

FLAGS is a string consisting of one of more of the supported flags:

  • ​g​​ - Global search (find all matches, not just the first)
  • ​i​​ - Case insensitive search
  • ​m​​ - Multi line mode (‘.’ will match newlines)
  • ​n​​ - Ignore empty matches
  • ​p​​ - Both s and m modes are enabled
  • ​s​​ - Single line mode (‘^’ -> ‘\A’, ‘$’ -> ‘\Z’)
  • ​l​​ - Find longest possible matches
  • ​x​​ - Extended regex format (ignore whitespace and comments)

To match whitespace in an x pattern use an escape such as \s, e.g.

  • test( “a\sb”, “x” ).

Note that certain flags may also be specified within REGEX, e.g.

  • jq -n ‘(“test”, “TEst”, “teST”, “TEST”) | test( “(?i)te(?-i)st” )’

evaluates to: true, true, false, false.

​test(val)​​, ​​test(regex; flags)​

Like ​​match​​, but does not return match objects, only ​​true​​ or ​​false​​ for whether or not the regex matches the input.


​ Examples​


​match(val)​​, ​​match(regex; flags)​

match outputs an object for each match it finds. Matches have the following fields:

  • ​offset​​ - offset in UTF-8 codepoints from the beginning of the input
  • ​length​​ - length in UTF-8 codepoints of the match
  • ​string​​ - the string that it matched
  • ​captures​​ - an array of objects representing capturing groups.

Capturing group objects have the following fields:

  • ​offset​​ - offset in UTF-8 codepoints from the beginning of the input
  • ​length​​ - length in UTF-8 codepoints of this capturing group
  • ​string​​ - the string that was captured
  • ​name​​ - the name of the capturing group (or ​​null​​ if it was unnamed)

Capturing groups that did not match anything return an offset of -1


​ Examples​


​capture(val)​​, ​​capture(regex; flags)​

Collects the named captures in a JSON object, with the name of each capture as the key, and the matched string as the corresponding value.


​ Example​


​scan(regex)​​, ​​scan(regex; flags)​

Emit a stream of the non-overlapping substrings of the input that match the regex in accordance with the flags, if any have been specified. If there is no match, the stream is empty. To capture all the matches for each input string, use the idiom ​​[ expr ]​​, e.g. ​​[ scan(regex) ]​​.

​split(regex; flags)​

For backwards compatibility, ​​split​​ splits on a string, not a regex.

​splits(regex)​​, ​​splits(regex; flags)​

These provide the same results as their ​​split​​ counterparts, but as a stream instead of an array.

​sub(regex; tostring)​​ ​​sub(regex; string; flags)​

Emit the string obtained by replacing the first match of regex in the input string with ​​tostring​​, after interpolation. ​​tostring​​ should be a jq string, and may contain references to named captures. The named captures are, in effect, presented as a JSON object (as constructed by ​​capture​​) to ​​tostring​​, so a reference to a captured variable named “x” would take the form: “(.x)”.

​gsub(regex; string)​​, ​​gsub(regex; string; flags)​

​gsub​​ is like ​​sub​​ but all the non-overlapping occurrences of the regex are replaced by the string, after interpolation.

Advanced features

Variables are an absolute necessity in most programming languages, but they’re relegated to an “advanced feature” in jq.

In most languages, variables are the only means of passing around data. If you calculate a value, and you want to use it more than once, you’ll need to store it in a variable. To pass a value to another part of the program, you’ll need that part of the program to define a variable (as a function parameter, object member, or whatever) in which to place the data.

It is also possible to define functions in jq, although this is is a feature whose biggest use is defining jq’s standard library (many jq functions such as ​​map​​ and ​​find​​ are in fact written in jq).

jq has reduction operators, which are very powerful but a bit tricky. Again, these are mostly used internally, to define some useful bits of jq’s standard library.

It may not be obvious at first, but jq is all about generators (yes, as often found in other languages). Some utilities are provided to help deal with generators.

Some minimal I/O support (besides reading JSON from standard input, and writing JSON to standard output) is available.

Finally, there is a module/library system.

Variable / Symbolic Binding Operator: ​​... as $identifier | ...​

In jq, all filters have an input and an output, so manual plumbing is not necessary to pass a value from one part of a program to the next. Many expressions, for instance ​​a + b​​, pass their input to two distinct subexpressions (here ​​a​​ and ​​b​​ are both passed the same input), so variables aren’t usually necessary in order to use a value twice.

For instance, calculating the average value of an array of numbers requires a few variables in most languages - at least one to hold the array, perhaps one for each element or for a loop counter. In jq, it’s simply ​​add / length​​ - the ​​add​​ expression is given the array and produces its sum, and the ​​length​​ expression is given the array and produces its length.

So, there’s generally a cleaner way to solve most problems in jq than defining variables. Still, sometimes they do make things easier, so jq lets you define variables using ​​expression as $variable​​. All variable names start with ​​$​​. Here’s a slightly uglier version of the array-averaging example:

length as $array_length | add / $array_length


We’ll need a more complicated problem to find a situation where using variables actually makes our lives easier.

Suppose we have an array of blog posts, with “author” and “title” fields, and another object which is used to map author usernames to real names. Our input looks like:

{"posts": [{"title": "Frist psot", "author": "anon"},
{"title": "A well-written article", "author": "person1"}],
"realnames": {"anon": "Anonymous Coward",
"person1": "Person McPherson"}}


We want to produce the posts with the author field containing a real name, as in:

{"title": "Frist psot", "author": "Anonymous Coward"}
{"title": "A well-written article", "author": "Person McPherson"}


We use a variable, $names, to store the realnames object, so that we can refer to it later when looking up author usernames:

.realnames as $names | .posts[] | {title, author: $names[.author]}


The expression ​​exp as $x | ...​​ means: for each value of expression ​​exp​​, run the rest of the pipeline with the entire original input, and with ​​$x​​ set to that value. Thus ​​as​​ functions as something of a foreach loop.

Just as ​​{foo}​​ is a handy way of writing ​​{foo: .foo}​​, so ​​{$foo}​​ is a handy way of writing ​​{foo:$foo}​​.

Multiple variables may be declared using a single ​​as​​ expression by providing a pattern that matches the structure of the input (this is known as “destructuring”):

. as {realnames: $names, posts: [$first, $second]} | ...


The variable declarations in array patterns (e.g., ​​. as [$first, $second]​​) bind to the elements of the array in from the element at index zero on up, in order. When there is no value at the index for an array pattern element, ​​null​​ is bound to that variable.

Variables are scoped over the rest of the expression that defines them, so

.realnames as $names | (.posts[] | {title, author: $names[.author]})


will work, but

(.realnames as $names | .posts[]) | {title, author: $names[.author]}


won’t.

For programming language theorists, it’s more accurate to say that jq variables are lexically-scoped bindings. In particular there’s no way to change the value of a binding; one can only setup a new binding with the same name, but which will not be visible where the old one was.


​ Examples​


Defining Functions

You can give a filter a name using “def” syntax:

def increment: . + 1;


From then on, ​​increment​​ is usable as a filter just like a builtin function (in fact, this is how many of the builtins are defined). A function may take arguments:

def map(f): [.[] | f];


Arguments are passed as filters (functions with no arguments), not as values. The same argument may be referenced multiple times with different inputs (here ​​f​​ is run for each element of the input array). Arguments to a function work more like callbacks than like value arguments. This is important to understand. Consider:

def foo(f): f|f;
5|foo(.*2)


The result will be 20 because ​​f​​ is ​​.*2​​, and during the first invocation of ​​f​​ ​​.​​ will be 5, and the second time it will be 10 (5 * 2), so the result will be 20. Function arguments are filters, and filters expect an input when invoked.

If you want the value-argument behaviour for defining simple functions, you can just use a variable:

def addvalue(f): f as $f | map(. + $f);


Or use the short-hand:

def addvalue($f): ...;


With either definition, ​​addvalue(.foo)​​ will add the current input’s ​​.foo​​ field to each element of the array. Do note that calling ​​addvalue(.[])​​ will cause the ​​map(. + $f)​​ part to be evaluated once per value in the value of ​​.​​ at the call site.

Multiple definitions using the same function name are allowed. Each re-definition replaces the previous one for the same number of function arguments, but only for references from functions (or main program) subsequent to the re-definition. See also the section below on scoping.


​ Examples​


Scoping

There are two types of symbols in jq: value bindings (a.k.a., “variables”), and functions. Both are scoped lexically, with expressions being able to refer only to symbols that have been defined “to the left” of them. The only exception to this rule is that functions can refer to themselves so as to be able to create recursive functions.

For example, in the following expression there is a binding which is visible “to the right” of it, ​​... | .*3 as $times_three | [. + $times_three] | ...​​, but not “to the left”. Consider this expression now, ​​... | (.*3 as $times_three | [.+ $times_three]) | ...​​: here the binding ​​$times_three​​ is not visible past the closing parenthesis.

Reduce

The ​​reduce​​ syntax in jq allows you to combine all of the results of an expression by accumulating them into a single answer. As an example, we’ll pass ​​[3,2,1]​​ to this expression:

reduce .[] as $item (0; . + $item)


For each result that ​​.[]​​ produces, ​​. + $item​​ is run to accumulate a running total, starting from 0. In this example, ​​.[]​​ produces the results 3, 2, and 1, so the effect is similar to running something like this:

0 | (3 as $item | . + $item) |
(2 as $item | . + $item) |
(1 as $item | . + $item)


If the reduction update expression outputs ​​empty​​ (that is, no values), then the reduction state is left as-is. For example, ​​reduce range(4) as $n ({}; if .==2 then empty else .[$n|tostring] |= $n)​​ will produce ​​{"0":0,"1":1,"3":3}​​.


​ Examples​


​limit(n; exp)​

The ​​limit​​ function extracts up to ​​n​​ outputs from ​​exp​​.


​ Example​


​first(expr)​​, ​​last(expr)​​, ​​nth(n; expr)​

The ​​first(expr)​​ and ​​last(expr)​​ functions extract the first and last values from ​​expr​​, respectively.

The ​​nth(n; expr)​​ function extracts the nth value output by ​​expr​​. This can be defined as ​​def nth(n; expr): last(limit(n + 1; expr));​​. Note that ​​nth(n; expr)​​ doesn’t support negative values of ​​n​​.


​ Example​


​first​​, ​​last​​, ​​nth(n)​

The ​​first​​ and ​​last​​ functions extract the first and last values from any array at ​​.​​.

The ​​nth(n)​​ function extracts the nth value of any array at ​​.​​.


​ Example​


​foreach​

The ​​foreach​​ syntax is similar to ​​reduce​​, but intended to allow the construction of ​​limit​​ and reducers that produce intermediate results (see example).

The form is ​​foreach EXP as $var (INIT; UPDATE; EXTRACT)​​. Like ​​reduce​​, ​​INIT​​ is evaluated once to produce a state value, then each output of ​​EXP​​ is bound to ​​$var​​, ​​UPDATE​​ is evaluated for each output of ​​EXP​​ with the current state and with ​​$var​​ visible. Each value output by ​​UPDATE​​replaces the previous state. Finally, ​​EXTRACT​​ is evaluated for each new state to extract an output of ​​foreach​​.

This is mostly useful only for constructing ​​reduce​​- and ​​limit​​-like functions. But it is much more general, as it allows for partial reductions (see the example below).


​ Example​


Recursion

As described above, ​​recurse​​ uses recursion, and any jq function can be recursive. The ​​while​​builtin is also implemented in terms of recursion.

Tail calls are optimized whenever the expression to the left of the recursive call outputs its last value. In practice this means that the expression to the left of the recursive call should not produce more than one output for each input.

For example:

def recurse(f): def r: ., (f | select(. != null) | r); r;

def while(cond; update):
def _while:
if cond then ., (update | _while) else empty end;
_while;

def repeat(exp):
def _repeat:
exp, _repeat;
_repeat;


Generators and iterators

Some jq operators and functions are actually generators in that they can produce zero, one, or more values for each input, just as one might expect in other programming languages that have generators. For example, ​​.[]​​ generates all the values in its input (which must be an array or an object), ​​range(0; 10)​​ generates the integers between 0 and 10, and so on.

Even the comma operator is a generator, generating first the values generated by the expression to the left of the comma, then for each of those, the values generate by the expression on the right of the comma.

The ​​empty​​ builtin is the generator that produces zero outputs. The ​​empty​​ builtin backtracks to the preceding generator expression.

All jq functions can be generators just by using builtin generators. It is also possible to define new generators using only recursion and the comma operator. If the recursive call(s) is(are) “in tail position” then the generator will be efficient. In the example below the recursive call by ​​_range​​ to itself is in tail position. The example shows off three advanced topics: tail recursion, generator construction, and sub-functions.


​ Examples​


Math

jq currently only has IEEE754 double-precision (64-bit) floating point number support.

Besides simple arithmetic operators such as ​​+​​, jq also has most standard math functions from the C math library. C math functions that take a single input argument (e.g., ​​sin()​​) are available as zero-argument jq functions. C math functions that take two input arguments (e.g., ​​pow()​​) are available as two-argument jq functions that ignore ​​.​​. C math functions that take three input arguments are available as three-argument jq functions that ignore ​​.​​.

Availability of standard math functions depends on the availability of the corresponding math functions in your operating system and C math library. Unavailable math functions will be defined but will raise an error.

One-input C math functions: ​​acos​​ ​​acosh​​ ​​asin​​ ​​asinh​​ ​​atan​​ ​​atanh​​ ​​cbrt​​ ​​ceil​​ ​​cos​​ ​​cosh​​ ​​erf​​​​erfc​​ ​​exp​​ ​​exp10​​ ​​exp2​​ ​​expm1​​ ​​fabs​​ ​​floor​​ ​​gamma​​ ​​j0​​ ​​j1​​ ​​lgamma​​ ​​log​​ ​​log10​​ ​​log1p​​ ​​log2​​​​logb​​ ​​nearbyint​​ ​​pow10​​ ​​rint​​ ​​round​​ ​​significand​​ ​​sin​​ ​​sinh​​ ​​sqrt​​ ​​tan​​ ​​tanh​​ ​​tgamma​​ ​​trunc​​​​y0​​ ​​y1​​.

Two-input C math functions: ​​atan2​​ ​​copysign​​ ​​drem​​ ​​fdim​​ ​​fmax​​ ​​fmin​​ ​​fmod​​ ​​frexp​​ ​​hypot​​ ​​jn​​​​ldexp​​ ​​modf​​ ​​nextafter​​ ​​nexttoward​​ ​​pow​​ ​​remainder​​ ​​scalb​​ ​​scalbln​​ ​​yn​​.

Three-input C math functions: ​​fma​​.

See your system’s manual for more information on each of these.

I/O

At this time jq has minimal support for I/O, mostly in the form of control over when inputs are read. Two builtins functions are provided for this, ​​input​​ and ​​inputs​​, that read from the same sources (e.g., ​​stdin​​, files named on the command-line) as jq itself. These two builtins, and jq’s own reading actions, can be interleaved with each other.

Two builtins provide minimal output capabilities, ​​debug​​, and ​​stderr​​. (Recall that a jq program’s output values are always output as JSON texts on ​​stdout​​.) The ​​debug​​ builtin can have application-specific behavior, such as for executables that use the libjq C API but aren’t the jq executable itself. The ​​stderr​​ builtin outputs its input in raw mode to stder with no additional decoration, not even a newline.

Most jq builtins are referentially transparent, and yield constant and repeatable value streams when applied to constant inputs. This is not true of I/O builtins.

​input​

Outputs one new input.

​inputs​

Outputs all remaining inputs, one by one.

This is primarily useful for reductions over a program’s inputs.

​debug​

Causes a debug message based on the input value to be produced. The jq executable wraps the input value with ​​["DEBUG:", <input-value>]​​ and prints that and a newline on stderr, compactly. This may change in the future.

​stderr​

Prints its input in raw and compact mode to stderr with no additional decoration, not even a newline.

​input_filename​

Returns the name of the file whose input is currently being filtered. Note that this will not work well unless jq is running in a UTF-8 locale.

​input_line_number​

Returns the line number of the input currently being filtered.

Streaming

With the ​​--stream​​ option jq can parse input texts in a streaming fashion, allowing jq programs to start processing large JSON texts immediately rather than after the parse completes. If you have a single JSON text that is 1GB in size, streaming it will allow you to process it much more quickly.

However, streaming isn’t easy to deal with as the jq program will have ​​[<path>, <leaf-value>]​​ (and a few other forms) as inputs.

Several builtins are provided to make handling streams easier.

The examples below use the streamed form of ​​[0,[1]]​​, which is ​​[[0],0],[[1,0],1],[[1,0]],[[1]]​​.

Streaming forms include ​​[<path>, <leaf-value>]​​ (to indicate any scalar value, empty array, or empty object), and ​​[<path>]​​ (to indicate the end of an array or object). Future versions of jq run with ​​--stream​​ and ​​-seq​​ may output additional forms such as ​​["error message"]​​ when an input text fails to parse.

​truncate_stream(stream_expression)​

Consumes a number as input and truncates the corresponding number of path elements from the left of the outputs of the given streaming expression.


​ Example​


​fromstream(stream_expression)​

Outputs values corresponding to the stream expression’s outputs.


​ Example​


​tostream​

The ​​tostream​​ builtin outputs the streamed form of its input.


​ Example​


Assignment

Assignment works a little differently in jq than in most programming languages. jq doesn’t distinguish between references to and copies of something - two objects or arrays are either equal or not equal, without any further notion of being “the same object” or “not the same object”.

If an object has two fields which are arrays, ​​.foo​​ and ​​.bar​​, and you append something to ​​.foo​​, then ​​.bar​​ will not get bigger, even if you’ve previously set ​​.bar = .foo​​. If you’re used to programming in languages like Python, Java, Ruby, Javascript, etc. then you can think of it as though jq does a full deep copy of every object before it does the assignment (for performance it doesn’t actually do that, but that’s the general idea).

This means that it’s impossible to build circular values in jq (such as an array whose first element is itself). This is quite intentional, and ensures that anything a jq program can produce can be represented in JSON.

All the assignment operators in jq have path expressions on the left-hand side (LHS). The right-hand side (RHS) procides values to set to the paths named by the LHS path expressions.

Values in jq are always immutable. Internally, assignment works by using a reduction to compute new, replacement values for ​​.​​ that have had all the desired assignments applied to ​​.​​, then outputting the modified value. This might be made clear by this example: ​​{a:{b:{c:1}}} | (.a.b|=3), .​​. This will output ​​{"a":{"b":3}}​​ and ​​{"a":{"b":{"c":1}}}​​ because the last sub-expression, ​​.​​, sees the original value, not the modified value.

Most users will want to use modification assignment operators, such as ​​|=​​ or ​​+=​​, rather than ​​=​​.

Note that the LHS of assignment operators refers to a value in ​​.​​. Thus ​​$var.foo = 1​​ won’t work as expected (​​$var.foo​​ is not a valid or useful path expression in ​​.​​); use ​​$var | .foo = 1​​ instead.

Note too that ​​.a,.b=0​​ does not set ​​.a​​ and ​​.b​​, but ​​(.a,.b)=0​​ sets both.

Update-assignment: ​​|=​

This is the “update” operator ‘|=’. It takes a filter on the right-hand side and works out the new value for the property of ​​.​​ being assigned to by running the old value through this expression. For instance, (.foo, .bar) |= .+1 will build an object with the “foo” field set to the input’s “foo” plus 1, and the “bar” field set to the input’s “bar” plus 1.

The left-hand side can be any general path expression; see ​​path()​​.

Note that the left-hand side of ‘|=’ refers to a value in ​​.​​. Thus ​​$var.foo |= . + 1​​ won’t work as expected (​​$var.foo​​ is not a valid or useful path expression in ​​.​​); use ​​$var | .foo |= . + 1​​instead.

If the right-hand side outputs no values (i.e., ​​empty​​), then the left-hand side path will be deleted, as with ​​del(path)​​.

If the right-hand side outputs multiple values, only the first one will be used (COMPATIBILITY NOTE: in jq 1.5 and earlier releases, it used to be that only the last one was used).


​ Example​


Arithmetic update-assignment: ​​+=​​, ​​-=​​, ​​*=​​, ​​/=​​, ​​%=​​, ​​//=​

jq has a few operators of the form ​​a op= b​​, which are all equivalent to ​​a |= . op b​​. So, ​​+= 1​​ can be used to increment values, being the same as ​​|= . + 1​​.


​ Example​


Plain assignment: ​​=​

This is the plain assignment operator. Unlike the others, the input to the right-hand-side (RHS) is the same as the input to the left-hand-side (LHS) rather than the value at the LHS path, and all values output by the RHS will be used (as shown below).

If the RHS of ‘=’ produces multiple values, then for each such value jq will set the paths on the left-hand side to the value and then it will output the modified ​​.​​. For example, ​​(.a,.b)=range(2)​​outputs ​​{"a":0,"b":0}​​, then ​​{"a":1,"b":1}​​. The “update” assignment forms (see above) do not do this.

This example should show the difference between ‘=’ and ‘|=’:

Provide input ‘{“a”: {“b”: 10}, “b”: 20}’ to the programs:

.a = .b

.a |= .b

The former will set the “a” field of the input to the “b” field of the input, and produce the output {“a”: 20, “b”: 20}. The latter will set the “a” field of the input to the “a” field’s “b” field, producing {“a”: 10, “b”: 20}.

Another example of the difference between ‘=’ and ‘|=’:

null|(.a,.b)=range(3)

outputs ‘{“a”:0,“b”:0}’, ‘{“a”:1,“b”:1}’, and ‘{“a”:2,“b”:2}’, while

null|(.a,.b)|=range(3)

outputs just ‘{“a”:0,“b”:0}’.

Complex assignments

Lots more things are allowed on the left-hand side of a jq assignment than in most languages. We’ve already seen simple field accesses on the left hand side, and it’s no surprise that array accesses work just as well:

.posts[0].title = "JQ Manual"


What may come as a surprise is that the expression on the left may produce multiple results, referring to different points in the input document:

.posts[].comments |= . + ["this is great"]


That example appends the string “this is great” to the “comments” array of each post in the input (where the input is an object with a field “posts” which is an array of posts).

When jq encounters an assignment like ‘a = b’, it records the “path” taken to select a part of the input document while executing a. This path is then used to find which part of the input to change while executing the assignment. Any filter may be used on the left-hand side of an equals - whichever paths it selects from the input will be where the assignment is performed.

This is a very powerful operation. Suppose we wanted to add a comment to blog posts, using the same “blog” input above. This time, we only want to comment on the posts written by “stedolan”. We can find those posts using the “select” function described earlier:

.posts[] | select(.author == "stedolan")


The paths provided by this operation point to each of the posts that “stedolan” wrote, and we can comment on each of them in the same way that we did before:

(.posts[] | select(.author == "stedolan") | .comments) |=
. + ["terrible."]


Modules

jq has a library/module system. Modules are files whose names end in ​​.jq​​.

Modules imported by a program are searched for in a default search path (see below). The ​​import​​and ​​include​​ directives allow the importer to alter this path.

Paths in the a search path are subject to various substitutions.

For paths starting with “~/”, the user’s home directory is substituted for “~”.

For paths starting with “$ORIGIN/”, the path of the jq executable is substituted for “$ORIGIN”.

For paths starting with “./” or paths that are “.”, the path of the including file is substituted for “.”. For top-level programs given on the command-line, the current directory is used.

Import directives can optionally specify a search path to which the default is appended.

The default search path is the search path given to the ​​-L​​ command-line option, else ​​["~/.jq", "$ORIGIN/../lib/jq", "$ORIGIN/../lib"]​​.

Null and empty string path elements terminate search path processing.

A dependency with relative path “foo/bar” would be searched for in “foo/bar.jq” and “foo/bar/bar.jq” in the given search path. This is intended to allow modules to be placed in a directory along with, for example, version control files, README files, and so on, but also to allow for single-file modules.

Consecutive components with the same name are not allowed to avoid ambiguities (e.g., “foo/foo”).

For example, with ​​-L$HOME/.jq​​ a module ​​foo​​ can be found in ​​$HOME/.jq/foo.jq​​ and ​​$HOME/.jq/foo/foo.jq​​.

If “$HOME/.jq” is a file, it is sourced into the main program.

​import RelativePathString as NAME [<metadata>];​

Imports a module found at the given path relative to a directory in a search path. A “.jq” suffix will be added to the relative path string. The module’s symbols are prefixed with “NAME::”.

The optional metadata must be a constant jq expression. It should be an object with keys like “homepage” and so on. At this time jq only uses the “search” key/value of the metadata. The metadata is also made available to users via the ​​modulemeta​​ builtin.

The “search” key in the metadata, if present, should have a string or array value (array of strings); this is the search path to be prefixed to the top-level search path.

​include RelativePathString [<metadata>];​

Imports a module found at the given path relative to a directory in a search path as if it were included in place. A “.jq” suffix will be added to the relative path string. The module’s symbols are imported into the caller’s namespace as if the module’s content had been included directly.

The optional metadata must be a constant jq expression. It should be an object with keys like “homepage” and so on. At this time jq only uses the “search” key/value of the metadata. The metadata is also made available to users via the ​​modulemeta​​ builtin.

​import RelativePathString as $NAME [<metadata>];​

Imports a JSON file found at the given path relative to a directory in a search path. A “.json” suffix will be added to the relative path string. The file’s data will be available as ​​$NAME::NAME​​.

The optional metadata must be a constant jq expression. It should be an object with keys like “homepage” and so on. At this time jq only uses the “search” key/value of the metadata. The metadata is also made available to users via the ​​modulemeta​​ builtin.

The “search” key in the metadata, if present, should have a string or array value (array of strings); this is the search path to be prefixed to the top-level search path.

​module <metadata>;​

This directive is entirely optional. It’s not required for proper operation. It serves only the purpose of providing metadata that can be read with the ​​modulemeta​​ builtin.

The metadata must be a constant jq expression. It should be an object with keys like “homepage”. At this time jq doesn’t use this metadata, but it is made available to users via the ​​modulemeta​​ builtin.

​modulemeta​

Takes a module name as input and outputs the module’s metadata as an object, with the module’s imports (including metadata) as an array value for the “deps” key.

Programs can use this to query a module’s metadata, which they could then use to, for example, search for, download, and install missing dependencies.


 


 

 

 

 

 

 

 

 

​Using jq to parse and display multiple fields in a json serially​​ 



{
"users": [
{
"first": "Stevie",
"last": "Wonder"
},
{
"first": "Michael",
"last": "Jackson"
}
]
}
jq '.users[] | "\(.first)####\(.last)"'



{
"baseinfo": {
"code": "123456",
},
"finance": {
"profit": "6543210",
},


 cat *.json |jq '"\(.baseinfo.code)####\(.finance.profit)"'