fix: add missing handler for deserializing json value #1587
Conversation
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request.
@tswast bump, please review
Thanks!

One concern I have is the other direction. If someone supplies a parsed JSON object to an API like `insert_rows`, do we correctly serialize that?
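For context, a minimal sketch of that direction, assuming a hypothetical table `mydataset.table` with an `INT64` column `id` and a `JSON` column `payload` (the names and the explicit `json.dumps` call are illustrative, not something prescribed by this PR):

```python
# Hypothetical sketch: supplying an already-parsed JSON value (a dict)
# on insert rather than a pre-serialized string. Table and column
# names are made up for illustration.
import json

from google.cloud import bigquery

client = bigquery.Client()
parsed = {"first_name": "Jane", "last_name": "Doe"}  # parsed JSON object

# Serializing explicitly before the insert sidesteps the question of
# whether the client library serializes parsed objects itself.
rows_to_insert = [{"id": 1, "payload": json.dumps(parsed)}]
errors = client.insert_rows_json("mydataset.table", rows_to_insert)
assert not errors
```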
Looks like everything is working well except for the type annotations: …
This change is causing the following error for … Is there a recommended fix for this?
Thanks @MartijnvanElferen for the report. I've filed #1756 to investigate further.
As a workaround …
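The suggested workaround itself is truncated above; one plausible stopgap, assuming the breakage is that JSON cells now come back as parsed Python objects rather than `str`, would be to normalize the cell before any code that still expects a string:

```python
# Hypothetical workaround sketch (the original suggestion above is
# truncated): re-serialize the cell so downstream code that expects
# the pre-3.15.0 str behavior keeps working on either library version.
import json


def ensure_json_str(cell):
    # Newer versions of google-cloud-bigquery may return the JSON cell
    # already parsed (dict/list); older versions returned the raw string.
    return cell if isinstance(cell, str) else json.dumps(cell)
```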
@MartijnvanElferen Fix pending: #1757. Note: this PR will change the behavior of JSON data in …
Thanks, @tswast. I'll refactor a couple of things to avoid the type conversion in the meantime. Looking forward to the fix!
@tswast This change should probably be listed as a potentially breaking change in the release notes, since before this change reading the entire column would return a value of type `str`. For example:

```sql
CREATE TABLE mydataset.table(
  id INT64,
  proto_as_json JSON
);
```

```python
from google.cloud import bigquery
from google.protobuf import json_format

from com.example.my_proto_pb2 import MyProto

client = bigquery.Client()

proto = MyProto(first_name="Jane", last_name="Doe")
proto_as_json = json_format.MessageToJson(
    proto, use_integers_for_enums=True, indent=None)

rows_to_insert = [
    {"id": 1, "proto_as_json": f'JSON """{proto_as_json}"""'},
]
errors = client.insert_rows("mydataset.table", rows_to_insert)
assert not errors

row = client.query("SELECT proto_as_json FROM mydataset.table").result().__next__()
json_str = row["proto_as_json"]

# Prior to this change, json_str was of type str, so we could do this:
read_proto = MyProto()
json_format.Parse(json_str, read_proto)
assert proto == read_proto
```

Luckily, in our project we created a common helper method for the …
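The helper mentioned above isn't shown in the thread; a minimal sketch of what such a helper might look like, assuming its job is to hand `json_format.Parse` a string on either library version (the name `parse_json_cell` is made up):

```python
# Hypothetical helper: parse a BigQuery JSON cell into a protobuf
# message whether the client returned it as a raw JSON string
# (python-bigquery < 3.15.0) or as an already-parsed Python object
# (>= 3.15.0). Not the thread author's actual implementation.
import json

from google.protobuf import json_format


def parse_json_cell(cell, message):
    if not isinstance(cell, str):
        cell = json.dumps(cell)  # re-serialize a parsed dict/list
    json_format.Parse(cell, message)
    return message
```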
Yeah, it's a gray area with this, since we never said we supported the JSON type until this change. We still don't support JSON in some code paths, like `to_dataframe()`.
@tswast We saw the breakage when we tried to update a number of Python libraries to newer versions. The release notes were very unhelpful in determining which library and version could have caused the problem (because the PR and bug listed in the release notes suggested that it only affected selecting subfields). It wasn't until we found a workaround that we were able to track the issue to v3.15.0 and this change. Even if there were no prior promises of support for the JSON type, a short blurb in the release notes would be very much appreciated.
Fixes #1500