forked from elastic/ecs
data_stream.yml
---
- name: data_stream
  title: Data Stream
  short: The data_stream fields take part in defining the new data stream naming scheme.
  beta: >
    These fields are in beta and are subject to change.
  description: >
    The data_stream fields take part in defining the new data stream naming scheme.
    In the new data stream naming scheme, the values of the data stream fields combine to form the name of the actual data
    stream in the following manner: `{data_stream.type}-{data_stream.dataset}-{data_stream.namespace}`. This means the fields
    can only contain characters that are valid as part of data stream names. More details can be found in
    this https://www.elastic.co/blog/an-introduction-to-the-elastic-data-stream-naming-scheme[blog post].
    An Elasticsearch data stream consists of one or more backing indices, and the data stream name forms part of the names of its backing indices.
    Due to this convention, data streams must also follow index naming restrictions. For example, data stream names cannot include `\`, `/`, `*`, `?`, `"`, `<`, `>`, `|`, ` ` (space character), `,`, or `#`.
    Please see the Elasticsearch reference for additional https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-create-index.html#indices-create-api-path-params[restrictions].
  fields:
    - name: type
      level: extended
      type: constant_keyword
      example: logs
      # Any future values for `data_stream.type` should also adhere to the following restrictions (these are derived from the Elasticsearch index restrictions):
      #   * Must not contain `-`
      #   * Must not start with `+` or `_`
      description: >
        An overarching type for the data stream.
        Currently allowed values are "logs" and "metrics". We expect to also add "traces" and "synthetics" in the near future.
      short: An overarching type for the data stream.
    - name: dataset
      level: extended
      type: constant_keyword
      example: nginx.access
      description: >
        The field can contain anything that makes sense to signify the source of the data.
        Examples include `nginx.access`, `prometheus`, `endpoint`, etc. For data streams that otherwise fit but
        do not have a dataset set, the value "generic" is used for the dataset. `event.dataset` should have the
        same value as `data_stream.dataset`.
        Beyond the Elasticsearch data stream naming criteria noted above, the `dataset` value has additional restrictions:
          * Must not contain `-`
          * No longer than 100 characters
      short: The field can contain anything that makes sense to signify the source of the data.
    - name: namespace
      level: extended
      type: constant_keyword
      example: production
      description: >
        A user-defined namespace. Namespaces are useful to allow grouping of data.
        Many users already organize their indices this way, and the data stream naming scheme now provides this
        best practice as a default. Many users will populate this field with `default`. If no value is used, it falls back to `default`.
        Beyond the Elasticsearch index naming criteria noted above, the `namespace` value has additional restrictions:
          * Must not contain `-`
          * No longer than 100 characters
      short: A user-defined namespace. Namespaces are useful to allow grouping of data.
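
# Illustrative example only (not part of the schema definition): a sketch of
# how the three fields combine into a data stream name, using the example
# values given for each field above.
#
#   data_stream.type:      logs
#   data_stream.dataset:   nginx.access
#   data_stream.namespace: production
#
# Following the pattern
# `{data_stream.type}-{data_stream.dataset}-{data_stream.namespace}`,
# these values would yield the data stream name:
#
#   logs-nginx.access-production
#
# Because `-` is the separator, none of the three values may itself
# contain `-`; otherwise the name could not be split back into its parts.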