All versions are tagged by the major Postgres version, plus an individual semver for this library itself.
- Add support for compiling on Windows
- In order to build on Windows when using MSVC, use the new "Makefile.msvc" with nmake, or directly compile all .c files in the src folder into a library
- If compiling directly, add src/postgres/include/port/win32 to the include path, and when using MSVC also add src/postgres/include/port/win32_msvc
- Add support for compiling on 32-bit systems
- The relevant code is enabled at compile time by checking the pointer size (SIZEOF_POINTER == 4)
- Move internal headers and included .c files to src/include folder
- This avoids having any .c files in the top-level src/ folder that can't be directly compiled, and thus lets us simplify the logic for defining which source units are to be compiled.
- Remove strnlen fallback implementation
- Avoid use of mmap, asprintf and strndup to improve portability
- Improve compatibility with non-POSIX systems and C89 compilers
- Update to Postgres 16.1
- Drop support for arbitrary trailing junk on integer literals
- Support for parsing junk after parameters, e.g. `$1OR`, is retained
- Deparser:
- Fix deparsing of `SYSTEM_USER`
- Add support for deparsing `STORAGE` mode
- Add support for deparsing `REVOKE ... CASCADE`
- Rework a_expr/b_expr/c_expr deparsing to match gram.y structure
- Add support for deparsing `COMPRESSION` option for columns
- Add support for deparsing `NULLS NOT DISTINCT` in unique constraints
- Add support for deparsing new SQL/JSON functionality
- Scanner: Add token `ASCII_36` ("$") to support queries like "SELECT $identifier" #211, #219
- Whilst these queries are not valid SQL and would fail parsing, this token can show up when using `pg_query_scan` or `pg_query_split_with_scanner` directly
- Normalize: Fix incorrect type cast #223
- Support changing parse mode and config settings affecting the parser #216
- Alternate parse modes are useful for parsing PL/pgSQL expressions, as well as type names
- Additionally, you can now change config settings that affect parsing, like `standard_conforming_strings`
- To pass options, use the new methods ending in `_opts`, e.g. `pg_query_parse_opts`
- Fix builds when compiling with `glibc >= 2.38` #203
- Deparser: Add support for COALESCE and other expressions in LIMIT clause #199
- Deparser: Handle INTERVAL correctly when used in SET statements #184
- Deparser: Ensure index names are quoted as identifiers #182
- Remove limits.h from pg_query_deparse.c #181
- Update copyright notice years and authors #175
- Allow trailing junk in numeric literals #177
- Allows parsing queries like `SELECT * FROM a WHERE b=$1ORc=$2`
- NetBSD support #172
- Add `Boolean` fingerprinting
- `Boolean` nodes are now output during fingerprinting
- Fix parsing issue on 32-bit big endian machines
- Now we use `size_t` to indicate the protobuf message size
- Update to Postgres 15.1
- Add support for `MERGE` statements
- Add support for `ALTER TABLE ALL IN TABLESPACE ...` statements
- Add support for publication objects
- E.g. `CREATE PUBLICATION foo FOR TABLES IN SCHEMA CURRENT_SCHEMA`
- Deparser now attempts to deparse `COPY` statements first using the old Postgres 8.4-style syntax (e.g. `COPY foo FROM STDIN FREEZE CSV`).
Special thanks to @wolfgangwalther and @tlisanti for most of the work done on this release.
- Update to Postgres 14.6
- Drop support for `?` parameter syntax
- Update `fingerprint.json` to include newly added tests, regenerate tests
- Update to Postgres 13.8 patch release #156
- Backport Xcode 14.1 build fix from upcoming 13.9 release #156
- Fingerprinting version 3.1 #155
- Fixes issue with "SELECT DISTINCT" having the same fingerprint as "SELECT" (fingerprints for "SELECT DISTINCT" will change with this revision)
- Group additional DDL statements together that otherwise generate a lot of unique fingerprints (ListenStmt, UnlistenStmt, NotifyStmt, CreateFunctionStmt, FunctionParameter and DoStmt)
- Normalize additional DDL statements #155
- Normalizes arguments to CreateFunctionStmt, DoStmt, CreateSubscriptionStmt, AlterSubscriptionStmt, CreateUserMapping and AlterUserMapping.
- Note that this is different from pg_stat_statements itself, which does not normalize utility statements at all today.
- Add support for analyzing PL/pgSQL code inside DO blocks #142
- Fix memory leak in pg_query_fingerprint error handling #141
- PL/pgSQL parser
- Add support for parsing more operators that include a `?` character (special cased to support old pg_stat_statements query texts)
- Deparser improvements
- Normalize: add funcname error object #121
- Normalize: Match GROUP BY against target list and re-use param refs #124
- PL/pgSQL: Setup namespace items for parameters, support RECORD types #123
- This significantly improves parsing for PL/pgSQL functions, to the extent that most functions should now parse successfully
- Normalize: Don't modify constants in TypeName typmods/arrayBounds fields (#118)
- This matches how pg_stat_statements behaves, and avoids causing parsing errors on the normalized statement
- Don't fail builds on systems that have strchrnul support (FreeBSD)
- Normalize: Don't touch "ORDER BY 1" expressions, keep original text #115
- This avoids obscuring the semantic meaning of integers in the ORDER BY clause, which is to reference a particular column in the target list.
- Update to Postgres 13.3 patch release #114
- Add optional Makefile target to build as shared library #100
- Normalize: Don't touch "GROUP BY 1" type statements, keep original text #113
- This avoids obscuring the semantic meaning of integers in the GROUP BY clause, which is to reference a particular column in the target list.
- Fingerprint: Cache list item hashes to fingerprint complex queries faster #112
- Previously this exhibited quite bad runtime behaviour, causing both an explosion in memory use and very high CPU time for complex queries.
- The new approach ensures we don't calculate the hashes for a particular list more than once, which gives roughly quadratic runtime instead of exponential runtime.
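The effect of caching hashes can be illustrated with a toy model. The sketch below is not the library's fingerprinting code: `ToyNode`, `hash_uncached`, `hash_cached` and `count_hash_computations` are hypothetical names, and the assumption that each nested item gets hashed twice (once for ordering, once for output) is only a stand-in for repeated work. It does not reproduce the library's exact complexity; it only shows how storing a hash on first computation removes the exponential blow-up.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical node type for illustration; not a libpg_query structure. */
typedef struct ToyNode {
    struct ToyNode *child;   /* one nested item keeps the example small */
    uint64_t value;
    uint64_t cached_hash;
    int has_cached_hash;
} ToyNode;

#define TOY_SEED 14695981039346656037ULL
#define TOY_MAX_DEPTH 24

static unsigned long hash_computations;  /* counts real hashing work */

static uint64_t mix(uint64_t hash, uint64_t value)
{
    return (hash ^ value) * 1099511628211ULL;  /* FNV-1a style step */
}

/* Uncached: the nested item is hashed twice on every visit. */
static uint64_t hash_uncached(const ToyNode *node)
{
    uint64_t first, second;

    if (node == NULL)
        return TOY_SEED;
    hash_computations++;
    first = hash_uncached(node->child);
    second = hash_uncached(node->child);
    (void) first;
    return mix(second, node->value);
}

/* Cached: the second request for the same node is served from the cache. */
static uint64_t hash_cached(ToyNode *node)
{
    uint64_t first, second;

    if (node == NULL)
        return TOY_SEED;
    if (node->has_cached_hash)
        return node->cached_hash;
    hash_computations++;
    first = hash_cached(node->child);
    second = hash_cached(node->child);
    (void) first;
    node->cached_hash = mix(second, node->value);
    node->has_cached_hash = 1;
    return node->cached_hash;
}

/* Builds a chain of `depth` nested nodes and reports the hashing work done. */
static unsigned long count_hash_computations(size_t depth, int use_cache)
{
    ToyNode nodes[TOY_MAX_DEPTH];
    size_t i;

    if (depth == 0 || depth > TOY_MAX_DEPTH)
        return 0;
    for (i = 0; i < depth; i++) {
        nodes[i].child = (i + 1 < depth) ? &nodes[i + 1] : NULL;
        nodes[i].value = i;
        nodes[i].cached_hash = 0;
        nodes[i].has_cached_hash = 0;
    }
    hash_computations = 0;
    if (use_cache)
        hash_cached(&nodes[0]);
    else
        hash_uncached(&nodes[0]);
    return hash_computations;
}
```

For a chain of 10 nested nodes, `count_hash_computations(10, 0)` reports 1023 computations, while `count_hash_computations(10, 1)` reports 10.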
- Deparser: Emit the RangeVar catalogname if present #105
- Fix crash in pg_scan function when encountering backslash escapes #109
- Integrate oss-fuzz fuzzer #106
- Deparser: Fix crash in CopyStmt with HEADER or FREEZE inside WITH parens
- The parse tree does not contain an explicit argument in those cases, but does when specified in the legacy mode without the wrapping WITH.
- With this change we only output the "1" argument when the original tree also had this, to ensure parse tree comparisons match. Note the intent here is technically the same, which is to enable these options.
- Normalize: Fix handling of two subsequent DefElem elements #96
- We were incorrectly adding too many DefElem locations to the recorded constant values, causing a crash when more than a single DefElem is present in a utility statement.
- srcdata/nodetypes.json: Avoid bogus values accidentally parsed from inside comments
- Fix ARM builds: Avoid dependency on cpuid.h header
- Simplify deparser of TableLikeClause #91 Lele Gaifax
- Fix asprintf warnings by ensuring _GNU_SOURCE is set early enough
- Update to PostgreSQL 13 parser (13.2 release)
- Changes to JSON output format
- WARNING: These JSON format changes are incompatible with prior releases.
- New top-level result object that contains the Postgres version number the parser is based on
- Node type names are only output when the field is a generic field (Node*), but not when the field always has the same type. This matches what the Postgres source looks like, and ensures the JSON and (new) Protobuf formats match in their structure. You can use the `srcdata/struct_defs.json` file as needed to get the necessary context on field types.
- Whitespace between control characters in JSON is no longer added
- "<" and ">" characters are escaped to avoid browser HTML injections
- Enum values are output with the value's name, instead of the integer value
- Introduce new Protobuf parse tree output format
- Up until now, this library relied on JSON to pass the parse result back to the caller, which has a number of downsides, most importantly that we don't have a readily available parser for JSON that's not tied to a running Postgres server. That in turn makes it hard to provide cross-language features such as deparsing directly in this library (which would require reading back a parse tree that gets passed in).
- Protobuf isn't perfect, but it's straightforward enough to generate the schema definitions for the parse tree nodes, and output the tree using a bundled C protobuf library, which has a small enough SLOC count (~3k) to not be noticeable in the big picture.
- Add support for returning Postgres scanner result
- This allows utilizing pg_query for use cases that need the raw token information, instead of a parse tree. Due to additional modifications to the Postgres source, this also contains information about comments in the query string, and their location.
- Add deparsing functionality that turns parse tree back into a SQL query
- This is based on the deparser that was written over multiple years for the pg_query Ruby library, and is now accessible for all bindings through this new API and implementation.
- Fingerprinting: Introduce v3 version and 64-bit XXH3 hash
- See full details in the wiki page here: https://github.com/pganalyze/libpg_query/wiki/Fingerprinting#version-30-based-on-postgresql-13
- Add new pg_query_split_with_scanner/pg_query_split_with_parser functions to split up multi-statement strings
- Naively one could assume that splitting a string by ";" is sufficient, but it becomes tricky once one takes into consideration that this character can also show up in identifiers, constants or comments.
- We provide both a parser-based split function and a scanner-based split function. The scanner-based function is most useful when splitting statements in a file that may contain syntax errors that cause a parser error, but are accepted by the scanner. Otherwise the parser-based split function is recommended due to better accuracy.
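To see why splitting on ";" alone goes wrong, consider the following self-contained sketch. It is an illustration only and not how the library's split functions are implemented: `count_semicolons_naive` and `count_semicolons_aware` are hypothetical helpers, and the toy state machine ignores cases such as dollar-quoted strings and nested block comments.

```c
#include <stddef.h>

/* Naive approach: every ';' is treated as a statement terminator. */
static size_t count_semicolons_naive(const char *sql)
{
    size_t n = 0;

    for (; *sql; sql++)
        if (*sql == ';')
            n++;
    return n;
}

/* Toy scanner: skips ';' inside '...' constants, "..." identifiers,
 * "--" line comments and (non-nested) block comments. */
static size_t count_semicolons_aware(const char *sql)
{
    enum { NORMAL, SQUOTE, DQUOTE, LINE_COMMENT, BLOCK_COMMENT } state = NORMAL;
    size_t n = 0;
    const char *p;

    for (p = sql; *p; p++) {
        switch (state) {
        case NORMAL:
            if (*p == '\'')
                state = SQUOTE;
            else if (*p == '"')
                state = DQUOTE;
            else if (p[0] == '-' && p[1] == '-') {
                state = LINE_COMMENT;
                p++;
            } else if (p[0] == '/' && p[1] == '*') {
                state = BLOCK_COMMENT;
                p++;
            } else if (*p == ';')
                n++;
            break;
        case SQUOTE:
            /* A doubled '' leaves and immediately re-enters the constant. */
            if (*p == '\'')
                state = NORMAL;
            break;
        case DQUOTE:
            if (*p == '"')
                state = NORMAL;
            break;
        case LINE_COMMENT:
            if (*p == '\n')
                state = NORMAL;
            break;
        case BLOCK_COMMENT:
            if (p[0] == '*' && p[1] == '/') {
                state = NORMAL;
                p++;
            }
            break;
        }
    }
    return n;
}
```

For the input `SELECT ';'; SELECT 1;` the naive count is 3, while the quote-aware count is 2, which matches the actual number of statements.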
- Add experimental Protobuf C++ outfuncs, converge JSON output to match Protobuf mapped output
- Extract source with USE_ASSERT_CHECKING enabled
- This ensures we have the necessary functions to compile an assert-enabled build if necessary. Note that this doesn't mean that asserts are enabled by default (they are not, you need to explicitly use DEBUG=1).
- Ensure codebase has a clean Valgrind run
- PL/pgSQL: Output NEW/OLD variable numbers, record dno fields Ethan Resnick
- Makefile: Allow passing in customized CFLAGS/PG_CONFIGURE_FLAGS/TEST_* Ethan Resnick
- Update to latest Postgres 10 patch release (10.16)
- Free Postgres top-level memory context on thread exit / with function
- Previously there was no way to free the top-level Postgres memory context, causing threaded programs that churn through a lot of threads to leak memory with each newly initialized thread-local top-level memory context.
- Instead, this uses a newly introduced cleanup method to free the memory when a pthread exits (note this causes a pthread dependency to be added to this library). In addition, primarily for memory testing purposes, add a new method "pg_query_exit" that performs the same cleanup on demand.
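The general pthread mechanism for freeing per-thread state on thread exit can be sketched as follows. This is a minimal illustration under assumptions, not the library's code: `get_thread_ctx`, `ctx_destroy` and the other names are hypothetical, and a plain `malloc` buffer stands in for the thread-local top-level memory context.

```c
#include <pthread.h>
#include <stdlib.h>

static pthread_key_t ctx_key;
static pthread_once_t ctx_once = PTHREAD_ONCE_INIT;
static int cleanup_calls = 0;  /* only read after pthread_join in this sketch */

/* Destructor: called automatically when a thread that set the key exits. */
static void ctx_destroy(void *ctx)
{
    free(ctx);
    cleanup_calls++;
}

static void ctx_make_key(void)
{
    pthread_key_create(&ctx_key, ctx_destroy);
}

/* Lazily creates the per-thread state (a stand-in for a memory context). */
static void *get_thread_ctx(void)
{
    void *ctx;

    pthread_once(&ctx_once, ctx_make_key);
    ctx = pthread_getspecific(ctx_key);
    if (ctx == NULL) {
        ctx = malloc(64);
        pthread_setspecific(ctx_key, ctx);
    }
    return ctx;
}

static void *worker(void *arg)
{
    (void) arg;
    get_thread_ctx();  /* allocate state; no explicit free in the thread */
    return NULL;
}

/* Runs one worker thread and returns how many cleanups have happened. */
static int run_worker_and_count_cleanups(void)
{
    pthread_t thread;

    if (pthread_create(&thread, NULL, worker, NULL) != 0)
        return -1;
    pthread_join(thread, NULL);
    return cleanup_calls;
}
```

The destructor registered with `pthread_key_create` runs for each exiting thread whose value is non-NULL, which is what lets short-lived threads release their state without an explicit call. Depending on the platform, building this may require the `-pthread` compiler flag.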
- Resolve correctness issues and possible memory leak in PL/pgSQL parser
- Add arch-ppc.h for PPC architectures #80 @pkubaj
- Update to latest Postgres 10 patch release (10.15)
- PL/pgSQL parsing: Handle asprintf failures (and prevent compiler warning)
- Update to latest Postgres 10 patch release (10.14)
- Add support for ARM builds by explicitly copying ARM header file
- Ignore return value of asprintf without compiler warnings @herwinw
- Free stderr_buffer when parsing plpgsql @herwinw
- Avoid compiler warning due to unused result in pg_query_parse_plpgsql
- Fix fingerprint tests
- First release based on PostgreSQL 10.0
- Parse tree output may have changed in backwards-incompatible ways!
- Fingerprint base version bumped to "02" to reflect the change in parse trees
- Allow "$1 FROM $2" to be parsed
- This is new with Postgres 10 output of pg_stat_statements, so we should treat this the same as "? FROM ?" in earlier versions.
- Update to Postgres 9.5.9
- Support gcc versions earlier than 4.6.0
- Export version information in pg_query.h directly
- Fingerprinting Version 1.3
- Attributes to be ignored:
- RangeVar.relname (if node also has RangeVar.relpersistence = "t")
- Special cases: List nodes where parent field name is valuesLists
- Follow same logic described for fromClause/targetList/cols/rexpr
- Fingerprinting Version 1.2
- Ignore portal_name in DeclareCursorStmt, FetchStmt and ClosePortalStmt
- Change normalization methods to output $1/$2 .. $N instead of ? characters
- BREAKING CHANGE in pg_query_normalize(..) output
- This matches the change in the upcoming Postgres 10, and makes it easier to migrate applications to the new normalization format
- Update to Postgres 9.5.7
- Disable 128-bit integer support (they are not used), to support 32-bit archs @herwinw
- Cleanup efforts @herwinw
- Improve concurrency tests
- Make sure we have a valid proc_source
- Normalized whitespace in pg_query_parse_plpgsql
- Move inclusion of stdio.h in plpgsql parser
- Cut off fingerprints at 100 nodes deep to avoid excessive runtimes/memory
- Fix warning on Linux due to missing asprintf include
- Automatically call pg_query_init as needed to ease threaded usage
- Clean up includes to avoid dependency on stdbool.h and xlocale.h
- Change PL/pgSQL input to be the full CREATE FUNCTION statement
- This is necessary for parsing, since we need the argument and return types
- Fingerprinting Version 1.1
- Only ignore ResTarget.name when parent field name is targetList and we have a SelectStmt as a parent node (fixes UpdateStmt fingerprinting)
- Normalize the password in ALTER ROLE ... PASSWORD '123' statements
- Make library thread-safe through thread-local storage #13
- Extract source code using LLVM instead of manually compiling the right objects
- This speeds up build times considerably since we don't download the Postgres source anymore, instead shipping a partial copy created as part of a release.
- Experimental support for parsing PL/pgSQL source code (output format subject to change)
- Fix stack overflow when parsing CREATE FOREIGN TABLE (#9)
- Update to PostgreSQL 9.5.3
- Add pg_query_fingerprint() method that uniquely identifies SQL queries, whilst ignoring formatting and individual constant values
- Update to PostgreSQL 9.5.2
- First release based on PostgreSQL 9.5.1
- Make JSON_OUTPUT_V2 the default and remove outfuncs patch
- NOTE: This is a backwards incompatible change in the output parsetree format!
- First tagged release based on PostgreSQL 9.4.5