schema tests defined by macros #339

drewbanin · 2017-03-17T22:04:38Z

Use "globally" scoped macros instead of hardcoded SQL in Python files.
Supports user-defined schema tests via macros (see "custom schema tests (via macros)" #200).

To create a custom schema test, create macros prefixed with test_. The test_ macros must take model as the first argument. If the test takes a single argument (as with not null & unique tests), then the second argument should be called arg. Otherwise, the remaining arguments can be named as you like. Test configurations stated in schema.yml will be mapped to the macro arguments as long as the names match. For example:

-- macros/tests.sql
{% macro test_something(model, arg) %}

select *
from {{ ref(model) }}

where id = '{{ arg }}'

{% endmacro %}


{% macro test_something_complicated(model, first_argument, another_argument) %}

select *
from {{ ref(model) }}

where id = '{{ first_argument }}'
   or id = '{{ another_argument }}'

{% endmacro %}

# schema.yml
some_model:
  constraints:
    my_project.test_something:
      - abc
      - def

    my_project.test_something_complicated:
      - {first_argument: abc, another_argument: def}
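The mapping described above, from schema.yml entries to macro arguments, can be sketched in Python roughly as follows. This is only an illustration of the rule as stated in this description, not the code in dbt/parser.py: `macro_kwargs` is a hypothetical helper, and the assumption that each list item under a test name becomes its own test invocation is my reading of the example.

```python
def macro_kwargs(model, test_config):
    """Map one schema.yml test entry to keyword arguments for a test_ macro.

    Hypothetical helper for illustration; not dbt's actual implementation.
    """
    if isinstance(test_config, dict):
        # Multi-argument test: keys are matched to macro argument names.
        kwargs = dict(test_config)
    else:
        # Single-argument test: the bare value is passed as `arg`.
        kwargs = {"arg": test_config}
    # The model is always supplied as the first macro argument.
    kwargs["model"] = model
    return kwargs


# The `constraints` block from the schema.yml example, as parsed YAML.
constraints = {
    "my_project.test_something": ["abc", "def"],
    "my_project.test_something_complicated": [
        {"first_argument": "abc", "another_argument": "def"},
    ],
}

# Assumption: every list item under a test name yields one invocation.
calls = [
    (test_name, macro_kwargs("some_model", config))
    for test_name, configs in constraints.items()
    for config in configs
]

for test_name, kwargs in calls:
    print(test_name, kwargs)
```

Under these assumptions, `test_something` would be invoked twice (once per value) and `test_something_complicated` once, with both named arguments filled in.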

cmcarthur · 2017-03-20T15:45:54Z

dbt/parser.py

+        elif type(arg_val) in (list, tuple):
+            parts = arg_val
+        else:
+            parts = [arg_val]


can this be anything but a dict? looks like below we make sure it's a dict before calling this. parts = kwargs.values() should cover all the cases, right?

This func takes arbitrary schema test configs and spits out a human readable, unique name.

my_model:
  constraints:
    test_something:
      - { some_value: True, other_thing: ['abc', 'def'] }

So this func would operate on { some_value: True, other_thing: ['abc', 'def'] } and spit out:

test_something_my_model_abc_def__True

I actually think that in practice this is kind of annoying/confusing. This "nice" name becomes the compiled filename and shows up in the dbt test output. Ideally, we'd keep the test config args around and show something like:

ERROR running test "test_something" for model "my_model" with args:
  some_value: True
  other_thing: ['abc', 'def']

but that's not really how things work currently. Something to consider for the future though

oh gosh, and to answer your question, yes: the arg_val var will be the value of each item in the supplied dict (args). So here it is a bool, then a list. It could also be a string/int/dict etc.
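For reference, the flattening discussed in this thread might look roughly like the sketch below. Only the list/tuple branch, the scalar fallback, and the `str(part)` conversion come from the diff; the dict branch, the `"_"` separator, and the helper name `build_test_name` are assumptions, so the result does not reproduce the exact ordering or separators of the name quoted above.

```python
def flatten_args(kwargs):
    """Flatten arbitrary test config values into a list of strings."""
    flat_args = []
    for arg_val in kwargs.values():
        if isinstance(arg_val, dict):
            # Assumed branch: the diff hunk does not show this case.
            parts = list(arg_val.values())
        elif isinstance(arg_val, (list, tuple)):
            parts = arg_val
        else:
            # Scalars (bool, int, string, ...) become a one-item list.
            parts = [arg_val]
        flat_args.extend(str(part) for part in parts)
    return flat_args


def build_test_name(test_type, model_name, kwargs):
    # Hypothetical: join everything with underscores to get a readable name.
    return "_".join([test_type, model_name] + flatten_args(kwargs))


name = build_test_name(
    "test_something",
    "my_model",
    {"some_value": True, "other_thing": ["abc", "def"]},
)
print(name)
```

Because every value type is reduced to strings, two different configs can in principle produce the same name; that is one more reason keeping the original args around for error output, as suggested above, could be nicer.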

cmcarthur · 2017-03-20T15:46:03Z

dbt/parser.py

-        child_field = test_config.get('from')
-        parent_field = test_config.get('field')
-        parent_model = test_config.get('to')
+        flat_args.extend([str(part) for part in parts])


use dbt.compat.basestring

cmcarthur · 2017-03-20T15:47:16Z

dbt/utils.py

-                )
-                logger.info(str(e))
+def dependency_projects(project, include_global=True):
+    if include_global:


is there a time when we would not want to include globals?

nope, i'll remove this

drewbanin · 2017-03-26T00:07:29Z

dbt/parser.py

-            parent_ref=("{{ref('"+parent_model+"')}}"))
+def as_kwarg(key, value):
+    test_value = to_string(value)
+    is_function = re.match(r'^\s*(ref|var)\(.+\)$', test_value) is not None


@cmcarthur how do you feel about this?


cmcarthur · 2017-03-26T00:24:08Z

dbt/parser.py

-            parent_ref=("{{ref('"+parent_model+"')}}"))
+def as_kwarg(key, value):
+    test_value = to_string(value)
+    is_function = re.match(r'^\s*(ref|var)\(.+\)$', test_value) is not None


i feel ok about it. not in love with regex parsing but what can you do.

is the intention to support ref and var in custom schema tests? it doesn't look like those would be passed in here unless i'm missing something

cmcarthur · 2017-03-26T00:28:00Z

test/integration/008_schema_tests_test/models/schema.yml

@@ -29,7 +29,7 @@ table_summary:
            - { field: favorite_color, values: ['blue', 'green'] }

        relationships:
-            - { from: favorite_color, to: table_copy, field: favorite_color }
+            - { from: favorite_color, to: ref('table_copy'), field: favorite_color }


ahh i understand now. is this a breaking change in how schema tests are defined?

drewbanin · 2017-03-26T00:42:22Z

Yeah, sorry for the lack of context. It is a breaking change, but we don't really have much of a choice! I think it's more canonical like this anyway -- kind of weird that the ref was implicit before. I ran it by @jthandy and he agreed
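To make the old and new forms of `to:` concrete, here is a rough sketch of what the regex in `as_kwarg` distinguishes. The regex is copied from the diff; everything after the `is_function` check (emitting calls unquoted and quoting other values with `repr`) is an assumption about the intent, and `str()` stands in for the `to_string` helper, whose body is not shown.

```python
import re

# Regex copied from the diff: matches values that look like ref(...) or var(...).
FUNCTION_CALL = re.compile(r'^\s*(ref|var)\(.+\)$')


def as_kwarg(key, value):
    """Render one schema test argument as `key=value` for a macro call."""
    test_value = str(value)  # stand-in for to_string(value)
    is_function = FUNCTION_CALL.match(test_value) is not None
    if is_function:
        # Assumed: emit the call unquoted so Jinja evaluates it.
        formatted = test_value.strip()
    else:
        # Assumed: emit anything else as a quoted or literal value.
        formatted = repr(value)
    return "{}={}".format(key, formatted)


new_style = as_kwarg("to", "ref('table_copy')")
old_style = as_kwarg("to", "table_copy")
print(new_style)
print(old_style)
```

With an explicit `ref('table_copy')` the argument is passed through as a function call, while a bare `table_copy` is treated as a plain string, which is why existing relationships tests need to be updated.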


drewbanin requested a review from cmcarthur March 18, 2017 15:53

drewbanin added the 0.8.0 Release label Mar 18, 2017

drewbanin added this to the Dependency Graph Improvements milestone Mar 18, 2017

cmcarthur requested changes Mar 20, 2017


drewbanin added 6 commits March 25, 2017 17:10

a4c3e1c make things work
9c826b7 remove debug code
de49d22 remove hardcoded SQL for schema tests
171587a fix tests
c0e5d36 fix unit tests
858d423 make include path a python module

drewbanin force-pushed the feature/schema-tests-defined-by-macros branch from cc2e91a to 858d423 Compare March 25, 2017 21:11

drewbanin added 5 commits March 25, 2017 17:13

ad70fe3 remove loader file
69d6890 work with refs passed in as args
a6e953b use to_string, basestring(var) doesn't work
531467e fix tests
4ee7ed2 fix integration tests

drewbanin commented Mar 26, 2017


cmcarthur reviewed Mar 26, 2017


drewbanin added 2 commits March 25, 2017 21:29

71bc023 less stringent whitespace checking
e8d70a6 redefine materializations as macros (#356)

drewbanin closed this Apr 5, 2017

drewbanin deleted the feature/schema-tests-defined-by-macros branch June 19, 2017 19:24

drewbanin mentioned this pull request Jul 17, 2017

schema tests in dependent projects #491

Closed
