fix: use pandas function to check for NaN #750

LinuxChristian · 2021-07-11T12:12:55Z

Starting with pandas 1.0, an experimental pandas.NA value (a singleton) is available to represent scalar missing values, as
opposed to numpy.nan. Comparing pandas.NA with itself yields pandas.NA, which does not support casting
to boolean. The built-in pandas.isna function handles all missing values that pandas supports (None, pd.NA, np.nan, NaT).
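
For reviewers who want to reproduce the behaviour described above, here is a minimal sketch, assuming pandas >= 1.0 and numpy are installed. The helper names `self_comparison_is_usable` and `is_missing` are hypothetical and are not part of this PR or of the library.

```python
import numpy as np
import pandas as pd

# The four scalar "missing" values named in the description.
MISSING_VALUES = (None, np.nan, pd.NaT, pd.NA)


def self_comparison_is_usable(value):
    """Return False when `value != value` cannot be cast to a boolean."""
    try:
        bool(value != value)
    except TypeError:
        # pandas.NA != pandas.NA is pandas.NA, whose truth value is ambiguous.
        return False
    return True


def is_missing(value):
    """Scalar-only check that delegates to pandas.isna (hypothetical helper)."""
    return bool(pd.isna(value))


if __name__ == "__main__":
    for value in MISSING_VALUES:
        print(repr(value), self_comparison_is_usable(value), is_missing(value))
```

The self-comparison idiom still evaluates for np.nan, but raises a TypeError for pandas.NA, whereas pandas.isna reports all four values as missing.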

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #729

plamut

Thanks for the fix and the test, it generally looks good! Just made two improvement suggestions.

plamut · 2021-07-12T10:49:42Z

FWIW, the prerelease-deps failure is flakiness, while the Python 3.6 unit test failure is legitimate: we use the oldest supported dependency versions there, and the new test failed with an attribute error:

 AttributeError: module 'pandas' has no attribute 'NA'

LinuxChristian · 2021-07-12T12:18:50Z

I added one commit per comment so it's easier to review. If you would like me to squash a final version let me know.

plamut · 2021-07-12T12:22:57Z

tests/unit/test__pandas_helpers.py

-@pytest.mark.skipif(pandas is None, reason="Requires `pandas`")
+@pytest.mark.skipIf(
+    pandas is None or PANDAS_INSTALLED_VERSION < PANDAS_MINIUM_VERSION,
+    reason="Requires `pandas version >= 1.0.0` which introduces pandas.NA",


Neat to also include the reason why a minimum version is needed. 👍
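
As an illustration of the version-gated skip discussed here, the following is a rough sketch and not the code in this PR. It assumes Python 3.8+ (for `importlib.metadata`) and an installed pytest; `version_tuple`, `below_minimum`, and `requires_pandas_na` are hypothetical names. The parser is deliberately simplified and ignores pre-release ordering, so a dedicated version-parsing library would be the more robust choice.

```python
import importlib.metadata
import itertools

import pytest


def version_tuple(version, width=3):
    """Parse the leading numeric parts of 'X.Y.Z...' into a fixed-width tuple."""
    numbers = []
    for piece in version.split(".")[:width]:
        digits = "".join(itertools.takewhile(str.isdigit, piece))
        numbers.append(int(digits) if digits else 0)
    # Pad so that "1.0" and "1.0.0" compare equal.
    numbers.extend([0] * (width - len(numbers)))
    return tuple(numbers)


def below_minimum(package, minimum):
    """Return True if `package` is missing or older than `minimum`."""
    try:
        found = importlib.metadata.version(package)
    except importlib.metadata.PackageNotFoundError:
        return True
    return version_tuple(found) < version_tuple(minimum)


# Reusable marker: note the all-lowercase `skipif`.
requires_pandas_na = pytest.mark.skipif(
    below_minimum("pandas", "1.0.0"),
    reason="Requires pandas >= 1.0.0, which introduces pandas.NA",
)
```

A test decorated with `@requires_pandas_na` would then be skipped both when pandas is absent and when it predates pandas.NA.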

plamut

LGTM, thanks for the quick update!

Nothing to worry about squashing the commits, we do that when the PR is merged.

plamut · 2021-07-12T15:39:39Z

It appears that the test was not skipped?

    def test_dataframe_to_json_generator(module_under_test):
        utcnow = datetime.datetime.utcnow()
        df_data = collections.OrderedDict(
            [
>               ("a_series", [pandas.NA, 2, 3, 4]),
                ("b_series", [0.1, float("NaN"), 0.3, 0.4]),
                ("c_series", ["a", "b", pandas.NA, "d"]),
                ("d_series", [utcnow, utcnow, utcnow, pandas.NaT]),
                ("e_series", [True, False, True, None]),
            ]
        )
E       AttributeError: 'NoneType' object has no attribute 'NA'

Also reproducible locally.

Edit: Ah, typo in the decorator name, I actually remember seeing that before. Fixed it myself, should be fine now.
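
For context on why the misspelled decorator did not skip anything: pytest accepts any attribute name on `pytest.mark` and turns it into a mark, but only the lowercase `skipif` mark has skipping behaviour. An unknown mark such as `skipIf` is simply attached to the test (with a warning, or an error under `--strict-markers`). A small sketch, assuming pytest is installed; `mark_name` is a hypothetical helper:

```python
import pytest

# Misspelled: creates a custom mark called "skipIf" that pytest never acts on.
misspelled = pytest.mark.skipIf(True, reason="never honoured")

# Correct: the built-in conditional skip marker.
correct = pytest.mark.skipif(True, reason="honoured")


def mark_name(decorator):
    """Return the name of the mark wrapped by a pytest MarkDecorator."""
    return decorator.mark.name


if __name__ == "__main__":
    print(mark_name(misspelled), mark_name(correct))
```

Because the two marks differ only in case, the mistake is easy to miss in review; running the suite with `--strict-markers` is one way to surface it early.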

LinuxChristian requested a review from a team July 11, 2021 12:12

LinuxChristian requested a review from a team as a code owner July 11, 2021 12:12

LinuxChristian requested review from loferris and removed request for a team July 11, 2021 12:12

google-cla bot added the cla: yes This human has signed the Contributor License Agreement. label Jul 11, 2021

product-auto-label bot added the api: bigquery Issues related to the googleapis/python-bigquery API. label Jul 11, 2021

plamut added the kokoro:run Add this label to force Kokoro to re-run the tests. label Jul 12, 2021

yoshi-kokoro removed the kokoro:run Add this label to force Kokoro to re-run the tests. label Jul 12, 2021

plamut suggested changes Jul 12, 2021

LinuxChristian added 2 commits July 12, 2021 14:12

tests: Skip tests if pandas below required version

f86d103

tests: compare expected and actual directly as lists

659e9f7

plamut reviewed Jul 12, 2021

plamut approved these changes Jul 12, 2021

plamut mentioned this pull request Jul 12, 2021

Unify the way how unit tests are conditionally skipped #752

Closed

plamut added kokoro:run Add this label to force Kokoro to re-run the tests. automerge Merge the pull request once unit tests and other checks pass. labels Jul 12, 2021

yoshi-kokoro removed the kokoro:run Add this label to force Kokoro to re-run the tests. label Jul 12, 2021

plamut reviewed Jul 12, 2021

Fix pytest.mark.skipif spelling

dc0529b

plamut added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jul 12, 2021

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jul 12, 2021

plamut merged commit 67bc5fb into googleapis:master Jul 12, 2021

gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Jul 12, 2021

plamut mentioned this pull request Aug 27, 2021

Can't use Pandas to upload a REPEATED field (e.g. list of strings) #913

Closed

release-please bot mentioned this pull request Jan 4, 2022

chore(main): release python-bigquery 1.27.1 #1097

Closed
