Replies: 3 comments
-
After some discussion with @shangyian this morning we concluded that (in the short term) we should focus on the materialization use case wrt parameters and partitions. The use case of live querying data is separate and we can address that later. We concluded that:
Few more thoughts on this:
|
Beta Was this translation helpful? Give feedback.
-
@agorajek do you think this statement can be restricted further, with parameters only being defined on a single partition column rather than potentially multiple columns? A few other things we talked about yesterday (just jotting this down so that we don't forget):
|
Beta Was this translation helpful? Give feedback.
-
@shangyian yes, I think so. Moreover I think we may not need to define any new parameter attributes (#fingers-crossed) and simply use the partitions as the guide-rails of the materialization. In the current world of partitioned tables, they are really equivalent. And in the future world of row-level table modifications, we can simply treat partitions as parameters.
^ +1
^ +1
I think that eventually we'll need to support two modes:
|
Beta Was this translation helpful? Give feedback.
-
My goal here is to make us think about these ^ ideas and how useful or necessary they are in the early days of DJ. I was wondering if I should add my thoughts to #407 because they are related, but I wanted to highlight the filter/parameter side of things here.
Use cases:
Let's consider a transform node:
The above model has all we need to build materialization jobs and to require the query builder to provide (or ask the user for) values for the required partition filters.
I could think of addition
parameters
section at the node level that would not need to be tied to the partition columns (in any way) but this seems like a nice-to-have option that can be added later on (if at all at this level).Some related questions:
Beta Was this translation helpful? Give feedback.
All reactions