-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DO - enrichment] Column types problem #1243
Comments
It has been already done by @arredond by setting the |
Right now we're only supporting at enrichment by polygons (not points) aggregation by numeric datasets. Aggregation of non-numeric is a cool feature for the future. The aim of this issue is to skip the variables with For example, for strings we can take the mode with an average weight, but not a priority right now, we'll include it in the roadmap |
After a second thought, you're totally right @oleurud! |
After talking, the objectives are:
The objective for this iteration is to do at least 1 and 2 (reduced scope and avoiding errrors), and 3 (doing it better) if posssible |
As we are not going to tackle the points 4 and 5 now, I have created a new issue for them #1259 |
🚀 |
Problems
Until now, I thought our datasets in the catalog only have numeric columns, but I have realized we have different types. And for the enrichment, this is something to take into account in the polygons enrichment.
Right now, if you use a variable with a different type of
NUMERIC
we are going to fail because for each variable, we are doing the following:where we are basically getting a numeric value (getting the porportional value of the area intersected).
But, what happens if the column is a STRING? a date? a boolean? We can not apply the same logic in these cases.
Even more: there are 2 numeric types of variables: extensive (it grows with area, like population) and intensive (it doesn’t grow with size, like population density). Doing the previous query, we are working fine with extensive ones, but we are falling with the intensive ones.
Problems recap:
data types different to a numeric value
I would say: right now we are only supporting numeric ones. And then, one by one, start adding support for the rest of the types. In enrichment, we can know the data type because it is the catalog.
intensive and numeric values
I am not sure, but at least, we need to add a new property in the catalog to set this type.
cc @alasarr @arredond @cmongut
The text was updated successfully, but these errors were encountered: