-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Semantic data for fields #17087
Comments
A possible use-case is described in #22486 |
Another use-case for index-pattern semantic data is identifying a default geospatial field for an index-pattern. It would be helpful to apply a geospatial filter to dashboard and have the filter applied to the default geospatial field for each embeddable's index-pattern(s). Without a default geospatial field, users have to ensure all indices use the same field name for the geospatial field so the query can be applied across disparate indices. |
cc @mattkime @stacey-gammon @lukeelmers @ppisljar Basically that topic comes up every other week as a solution to some of the problems we're having that we can't automatically calculate (like whether a numeric field is actually a continuous number or identifiers (like user ids)). Since Matt you're anyway rethinking index patterns at the moment I just wanted to raise awareness for that issue again. |
Pinging @elastic/kibana-app-arch |
How would this relate to ECS? Would that simply be one source of this data? |
Imho we could build a "shortcut" for ECS data, so that if we detect an ECS index (however we would do that exactly), we can set all the semantic information in the index pattern, according to our knowledge about ECS. But ECS is way more generic than what this suggestion here would allow, so a user could still finetune their index pattern potentially to enrich it, even if it's ECS, with more semantic information to get more benefit out of their data. |
We now have an agreement to have standard metadata for numeric fields in the stack, which is a good place to start. It's not yet part of ECS but it will soon be part of many solutions. |
This feels like there is overlap with the new Aggregated View in Discover, powered by the ML system. |
@mfinkle could you please elaborate a bit on where you see the overlap? I think this issue was mostly about having the ability to define semantic data which we can't automatically determine to fields to improve the UI in several places. I'm personally not seeing a huge overlap with the data visualizer from ML? |
I was just assuming that the underlying code in the Data Visualizer had to run some basic descriptive statistics on the fields, some of which might also be able to used or extended to find ranges, step sizes, and maybe potential enum-able fields. |
There are some automatic pulled information about e.g. the min/max/median value. What I described with "semantic data" here is actually exactly the things that are not able to be determined automatically. I'll give a couple of examples:
I believe when we can automatically determine information about a fields, we should do so (and already partially do). Besides that there are still a log ot use-cases left, where we can't determine the semantic of the field automatically, but would still benefit from knowing it and could improve our UX for those fields. |
Thank you for contributing to this issue, however, we are closing this issue due to inactivity as part of a backlog grooming effort. If you believe this feature/bug should still be considered, please reopen with a comment. |
I think adding more semantic information about fields in your index, could provide us quite some benefits. Currently our only semantic information we are having about a field, is it's type. But a user might know way more about the data inside that field, than just it's type. Some information that a user knowing the data could provide:
error
,warning
,info
) providing that semantic information improves over doing terms aggregation, since not all possible values might yet be existing in the data.ISO2 country code
or this field actually containsIATA airport codes
Some examples of where these semantic information could be used and come in handy:
ISO2
code, highlight or limit to these fields in Region maps.I see two possible ways of implementing that:
Store the information in the
_meta
field of the mappingThe advantage here would be, that this information is directly sitting "besides" the data. That way the data, thus the person responsible for defining the data mapping anyway would also have full control of semantic information. It also would be available to all tools not only Kibana.
The disadvantage is, that this would lack a good UI to configure it, make it harder to configure it later on and also a lot of people are working with dynamic field mappings, which might make this more complex.
Store the information in the index pattern
That way we could directly provide an UI when configuring an index pattern to select the semantic information. Would make it way easier to configure and change that configuration later on.
The hugest disadvantage here would be, that this information would purely be available for that single Kibana instance and not shared between different tools.
/cc @trevan @chrisronline
The text was updated successfully, but these errors were encountered: