
Unit Testing in Beam Blog Post #31701

Closed

Conversation

svetakvsundhar (Contributor):

This blog post details opinionated examples and practices for unit testing in Beam. The examples use the Python SDK.


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make the review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.

@svetakvsundhar svetakvsundhar marked this pull request as draft June 27, 2024 18:11
@svetakvsundhar svetakvsundhar marked this pull request as ready for review June 27, 2024 18:18

Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control




The following cover other testing best practices:

Contributor Author:

Are these useful for customers writing pipelines? IIUC these utils are designed more for tests written in the Beam repo

Contributor:

Users should generally use assert_that, equal_to, and (for streaming) test stream.

@@ -0,0 +1,149 @@
---
title: "So You Want to Write Tests on Your Beam Pipeline?"
date: 2024-07-08 00:00:01 -0800
Contributor:

Note - this isn't rendering in https://apache-beam-website-pull-requests.storage.googleapis.com/31701/blog/index.html

I suspect it is because the date is in the future, but I'm not sure. If so, that's a neat way to gate blog posts for a given time I guess

Contributor:

Hm, I also don't see it rendered at https://apache-beam-website-pull-requests.storage.googleapis.com/31701/blog/unit-testing-blog/index.html

Maybe something is malformatted? If you stage it locally with an older date, does it render?

Contributor:

If needed, let's temporarily set this to an older date so we can review the staged version

Comment on lines +88 to +94
numbers=[1,2,3]


with TestPipeline() as p:
output = p | beam.Create([1,2,3])
| beam.Map(compute_square)
assert_that(output, equal_to([1,4,9]))
Contributor:

Looks like this is poorly formatted (indentation needed, too much white space). I'd also recommend defining examples=[1,2,3] and expected=[1,4,9] and using those in the pipeline rather than explicit lists

with TestPipeline() as p:
output = p | beam.Create(strings)
| beam.Map(str.strip)
assert_that(output,['Strawberry','Carrot','Eggplant'])
Contributor:

Same as above - formatting seems off, plus let's make an expected=['Strawberry','Carrot','Eggplant'] instead of an explicit list here

Contributor:

Also, we're missing an equal_to. Let's make sure we quickly run these test snippets locally before submitting.

Contributor:

I commented on this above, but let's ensure they're continuously run.

Comment on lines +116 to +120
class TestBeam(unittest.TestCase):
def test_custom_function(self):
with TestPipeline() as p:
input = p | beam.ParDo(custom_function(("1","2","3"))
assert_that(input, equal_to(["1","2","3"]))
Contributor:

I find this example is confusing - is the custom_function just the identity function? We have examples of assert_that above, can we use that instead?

Contributor:

I don't think this works, but I'm also trying to understand what it's even trying to do. You can't apply a ParDo to a raw pipeline object. ParDo accepts a DoFn, is custom_function(some_tuple) returning a DoFn that yields the tuple?

Again, if we actually ran this code we'd uncover these issues.




For more pointed guidance on testing on Beam/Dataflow, see the [Google Cloud documentation](https://cloud.google.com/dataflow/docs/guides/develop-and-test-pipelines).
Contributor:

Could we also point to some example tests here as well? The Beam repo is probably the best place for example PTransform tests. For example, we have quite extensive RunInference tests (e.g. class RunInferenceBaseTest(unittest.TestCase)), and we could add similarly well-tested transforms in Java and Go.

svetakvsundhar (Contributor Author):

Thanks for the feedback. To address the code running as intended, I plan to create a colab notebook with the runnable snippets, and link it to the blog post (as well as check it in to the Beam repo). Will create after the holiday.

damccorm (Contributor):

damccorm commented Jul 3, 2024

SGTM - at that point we may be able to ignore the date related comments anyways and just validate that it does indeed render the blog

svetakvsundhar (Contributor Author):

I've added a colab notebook and tested these examples locally. PTAL, once we have a consensus on the notebook examples, I will update the blog post and address the other comments.

examples/notebooks/blogposts/unittests_in_beam.ipynb (Outdated)
" result = (\n",
" p2\n",
" | ReadFromText(\"/content/sample_data/anscombe.json\")\n",
" | beam.Map(str.strip)\n",
Contributor:

I wonder if a better way to do this would be to split beam.Map(str.strip) out into a separate function which can be called from the test. As it is, the test isn't actually invoking any of the code we've written.

A more interesting example might be:

def manipulate_strings(incoming_pcoll):
   return incoming_pcoll | beam.Map(str.strip) | beam.Map(str.upper)

The functions themselves don't need to be tested, but the Beam transforms do. That would let you test the actual code you've written below with:

  with TestPipeline() as p:
    inputs = p | beam.Create(strings)
    output = manipulate_strings(inputs)
    assert_that(output, equal_to(expected))

Contributor:

Basically, I don't like that you could totally change the user code (which you're supposed to be testing) without a test failing.

Contributor:

Seeing output = manipulate_strings(inputs) makes me wonder how we could make composite PTransforms even easier/more natural.

Contributor Author:

Basically, I don't like that you could totally change the user code (which you're supposed to be testing) without a test failing.

I see your point and think it's valid (I can make the change). Out of curiosity though, what if a user's entire transform was a built-in function (like str.strip)? Would the guidance be that they wouldn't need to test the Beam transform?

Co-authored-by: Danny McCormick <dannymccormick@google.com>
"# The following packages are imported for unit testing.\n",
"import unittest\n",
"import apache_beam as beam\n",
"from apache_beam.testing.test_pipeline import TestPipeline\n",
Contributor:

Is there any advantage to users in using TestPipeline?

Contributor Author:

AFAICT, no distinct advantages outside of the reasons mentioned here, as well as it being the de-facto choice for tests from previous documentation.

robertwb (Contributor) left a comment:

Thanks for writing this up, guidance on good unit testing practices for Beam is much needed.

" HttpError = None\n",
"\n",
"\n",
"@unittest.skipIf(HttpError is None, 'GCP dependencies are not installed')\n",
Contributor:

It would seem one of the main points of unit testing is to not have heavyweight dependencies. I wouldn't say skipping like this is best practice unless absolutely necessary.

"class TestBeam(unittest.TestCase):\n",
"\n",
"# This test corresponds to pipeline p1, and is written to confirm the compute_square function works as intended.\n",
" def test_compute_square(self):\n",
Contributor:

If compute_square is an ordinary Python function, I would recommend writing "ordinary" unit tests for it rather than testing it as part of a pipeline.
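For instance, an "ordinary" unit test for a plain Python compute_square needs no pipeline at all (a sketch; compute_square is illustrative):

```python
# Plain unittest for an ordinary Python function, with no Beam pipeline.
import unittest

def compute_square(x):
    return x * x

class ComputeSquareTest(unittest.TestCase):
    def test_positive(self):
        self.assertEqual(compute_square(3), 9)

    def test_zero_and_negative(self):
        self.assertEqual(compute_square(0), 0)
        self.assertEqual(compute_square(-2), 4)

# Run the suite programmatically (avoids unittest.main()'s sys.exit).
suite = unittest.defaultTestLoader.loadTestsFromTestCase(ComputeSquareTest)
result = unittest.TextTestRunner().run(suite)
```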

"cell_type": "code",
"source": [
"# We import the mock package for mocking functionality.\n",
"import mock\n",
Contributor:

Mocking often interacts poorly with serialization; I would avoid this when possible. (Also, are these examples automatically tested?)

" with self.assertRaisesRegex(ValueError,\n",
" \"Length of record does not match expected length'\"):\n",
" p = beam.Pipeline()\n",
" result = p | beam.ParDo(CustomClass.custom_function())\n",
Contributor:

CustomClass.custom_function() returns a DoFn? I'm a bit confused at what you're trying to test here.


with beam.Pipeline(argv=self.args) as p:
result = p | ReadFromText("gs://my-storage-bucket/csv_location.csv")
| beam.ParDo(lambda x: custom_function(x))
Contributor:

You can't ParDo(lambda) like this. I would look into how snippets are used in the programming guide to ensure the code is (and remains) correct.


class TestBeam(unittest.TestCase):
def test_custom_function(self):
with TestPipeline() as p:
input = p | beam.ParDo(custom_function(("1","2","3"))
Contributor:

Nit: If we want ints, let's use ints; if we want strings, let's use strings (like "a", "b", "c" or fruit names or whatever, not numeric strings).


#The following packages are used to run the example pipelines
import apache_beam as beam
import apache_beam.io.textio.ReadFromText
Contributor:

Java-style imports?

import apache_beam.io.textio.WriteToText

with beam.Pipeline(argv=self.args) as p:
result = p | ReadFromText("gs://my-storage-bucket/csv_location.csv")
Contributor:

So, the problem is one can't really test this pipeline without reproducing it in the test. I think there are a couple of ways of re-structuring the code to make it more testable (and realistic).

(1) For an end-to-end test, put your pipeline in a function that takes the input and output paths as parameters. Your production code will call this with, e.g., GCS paths, but your test could create temporary directories and files and validate those. This tests that your whole pipeline, including IOs, is structured correctly.

(2) Factor out the "processing" part of your pipeline into its own PTransform, or at least function. E.g. your pipeline would be

with beam.Pipeline(argv=self.args) as p:
  _ = (p
       | beam.io.ReadFromText("gs://my-storage-bucket/csv_location.csv")
       | ProcessData(...)
       | beam.io.WriteToText(...))
and then your unit test would look like

with beam.Pipeline(argv=self.args) as p:
  _ = (p
       | beam.Create(["some", "sample", "elements"])
       | ProcessData(...)
       | AssertEqualTo(["expected", "outputs"]))

or (equivalently, but less parallel)

with beam.Pipeline(argv=self.args) as p:
  output_pcoll = (p
       | beam.Create(["some", "sample", "elements"])
       | ProcessData(...))
  assert_that(output_pcoll, equal_to(...))

If writing a custom ProcessData PTransform is too much work, one could at least have

output_pcoll = process_data(input_pcoll)

@mock.patch.object(CustomFunction, 'get_record')
def test_error_message_wrong_length(self, get_record):
record = ["field1","field2",...]
get_record.return_value = record
Contributor:

I am not following this at all. Where is get_record being called? Why is it being mocked?

svetakvsundhar (Contributor Author):

Hi @robertwb, thanks for taking a look! The blog post and the colab notebook are actually currently out of sync so that we could test runnable examples. Once we're aligned on those, I was going to update the blog. Anyhow, I will take a look at the comments and see where they apply.


github-actions bot commented Sep 9, 2024

This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@beam.apache.org list. Thank you for your contributions.
