[CT-3204] [implementation] Automate creation of metricflow_time_spine if the project defines semantic objects #8825

graciegoheen · 2023-10-11T14:12:23Z

Housekeeping

I am a maintainer of dbt-core

Short description

Currently, if the user defines semantic objects in their project, but not a model named metricflow_time_spine, we raise an error.

We should simply create the model automatically, if it is not found in the project, using the recommended definition.

Users should still have the ability to create it themselves with a custom implementation if they so choose.

Acceptance criteria

- If the user defines semantic objects in their project, but not a model named metricflow_time_spine, we create the model automatically using the recommended definition.
- Users still have the ability to create it themselves with a custom implementation if they so choose.

Impact to Other Teams

semantic layer

Will backports be required?

no

Context

I believe there are two main concerns:

1. Squaring partial parsing with generating a metricflow_time_spine model when one isn't specified
2. Auto-generating the metricflow_time_spine correctly for any adapter

For issue (1), there are four possible states:

a. metricflow_time_spine was specified by the user and that's still the case
b. metricflow_time_spine was specified by the user and now isn't (and thus should be generated)
c. metricflow_time_spine wasn't specified by the user (and thus generated) and that's still the case
d. metricflow_time_spine wasn't specified by the user (and thus generated) but now it is specified by the user

I think the solution is this: at the end of parsing, if there are semantic layer nodes and no metricflow_time_spine model, we add one and mark it as auto-generated. At the start of parsing, if there is a saved manifest, we drop the metricflow_time_spine node if it is marked as auto-generated. This workflow produces the following outcomes in the corresponding cases:

a. the metricflow_time_spine is handled by the user specification
b. the user specified metricflow_time_spine gets dropped during partial parsing, and then re-added via the auto-generation
c. the auto generated metricflow_time_spine gets dropped, and then re-added at the end
d. the auto generated metricflow_time_spine gets dropped, and then the user specified metricflow_time_spine gets added
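The drop-then-regenerate workflow above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not dbt-core's actual parser: `Node`, its `auto_generated` flag, and the `parse` function are hypothetical names, and partial parsing is reduced to "keep the nodes the user still defines".

```python
from dataclasses import dataclass

TIME_SPINE = "metricflow_time_spine"


@dataclass
class Node:
    name: str
    auto_generated: bool = False  # hypothetical marker, not an existing dbt-core field


def parse(saved_manifest, user_models, has_semantic_nodes):
    """Return the new manifest (name -> Node) for one parse run."""
    # Start of parsing: carry over saved nodes, except an auto-generated spine.
    manifest = {
        name: node
        for name, node in (saved_manifest or {}).items()
        if not (name == TIME_SPINE and node.auto_generated)
    }
    # Stand-in for partial parsing: drop nodes the user no longer defines,
    # then add whatever the user defines now.
    manifest = {name: node for name, node in manifest.items() if name in user_models}
    for name in user_models:
        manifest.setdefault(name, Node(name))
    # End of parsing: generate the spine only if it is needed and missing.
    if has_semantic_nodes and TIME_SPINE not in manifest:
        manifest[TIME_SPINE] = Node(TIME_SPINE, auto_generated=True)
    return manifest
```

Running `parse` twice, once to build the saved manifest and once against it, reproduces cases a through d: the spine ends up auto-generated exactly when the user does not currently define it.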

For issue (2), I don't think we need a cross-database macro for date types, though it would be nice. Instead, we could use the same Jinja template we use for the date_spine macro tests, where we make different calls to the macro depending on the target data warehouse.

graciegoheen · 2023-10-11T14:13:32Z

@graciegoheen this idea makes sense to me to support a built-in metricflow_time_spine 🚀

cast_text_to_date (similar to cast_bool_to_text)

What do you think about calling it to_date()?

I'd suggest we call it to_date rather than cast_text_to_date. Even though to_date isn't in the SQL standard, Databricks, Postgres, Redshift, and Snowflake all have a to_date function that does what we want. BigQuery is an outlier, but that's nothing we can't solve with a little dispatch magic ✨

Prototype of to_date()

Assuming to_date() is a cross-database macro that takes an ISO 8601 (YYYY-MM-DD) date string as input, here's a completely untested prototype for dbt-postgres:
```sql
{% macro to_date(date_str) %}
  {{ return(adapter.dispatch('to_date', 'dbt') (date_str)) }}
{% endmacro %}

{% macro default__to_date(date_str) -%}
    {# Postgres to_date takes (text, format); the format matches the ISO 8601 input #}
    to_date({{ dbt.string_literal(date_str) }}, 'YYYY-MM-DD')
{%- endmacro %}
```

Pulling it all together for metricflow_time_spine

The cross-database Jinja template might look like this:
```sql
select cast(date_day as date) as date_day
from ({{ dbt.date_spine("day", dbt.to_date("2023-09-01"), dbt.to_date("2023-09-10")) }})
```

Appendix

Validation and error checking

If we want, we could always add some format validation to the default implementation of to_date() by using the datetime module:
```sql
    {%- set dt = modules.datetime.datetime.strptime(date_str, "%Y-%m-%d") -%}
    ...
```
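Since `modules.datetime` exposes Python's `datetime` module, the effect of that check can be previewed in plain Python. This is a minimal sketch; `validate_iso_date` is a hypothetical helper name, not part of dbt.

```python
from datetime import datetime


def validate_iso_date(date_str: str) -> str:
    """Return date_str unchanged if it parses as YYYY-MM-DD, else raise ValueError."""
    # strptime raises ValueError for a wrong layout or an impossible calendar date.
    datetime.strptime(date_str, "%Y-%m-%d")
    return date_str
```

A well-formed string such as "2023-09-01" passes through unchanged, while "09/01/2023" or "2023-02-30" raises `ValueError`, which in a macro would fail the run at parse or compile time rather than in the warehouse.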

type_date macro

We may (or may not) want to also create a cross-database type_date macro (which doesn't exist today). I haven't seen any database that doesn't call this data type DATE, so that makes it either easy-peasy or extraneous depending how you look at it.

Originally posted by @dbeatty10 in #8319 (comment)

adamcunnington-mlg · 2024-06-06T08:57:18Z

@graciegoheen I guess this is low priority but is there an ETA for when this would happen?

ChenyuLInx · 2024-12-09T19:21:21Z

@Jstein77 is this still needed?

dbeatty10 · 2024-12-09T19:28:55Z

This feature is completely independent from #7442, but linking it here because they are spiritually similar -- they would both create database objects (and/or dbt nodes) without the user explicitly defining and configuring them.

Jstein77 · 2024-12-16T20:17:35Z

@ChenyuLInx @MichelleArk @dbeatty10 I think we can close this issue. We updated the behavior so you can point at any date spine model in your project; it doesn't have to be called metricflow_time_spine. Users typically already have a date spine model in their dbt project, and if they don't, I think using the dbt_utils.date_spine() macro should be enough to create one.

graciegoheen added user docs [docs.getdbt.com] Needs better documentation Impact: SL labels Oct 11, 2023

github-actions bot changed the title ~~[implementation] Automate creation of metricflow_time_spine if the project defines semantic objects~~ [CT-3204] [implementation] Automate creation of metricflow_time_spine if the project defines semantic objects Oct 11, 2023

dbeatty10 mentioned this issue May 2, 2024: [CT-3208] [Feature] Cross-database date macro dbt-labs/dbt-adapters#192 (Closed, 3 tasks)

This was referenced May 1, 2024:

Cross-database date macro dbt-labs/dbt-adapters#191 (Merged)
[Epic] Cross-database date macro #10075 (Closed)
