diff --git a/home/docs/help/yarn.md b/home/docs/help/yarn.md new file mode 100644 index 00000000000..176a3209fee --- /dev/null +++ b/home/docs/help/yarn.md @@ -0,0 +1,83 @@ +--- +id: yarn +title: Monitoring Apache Yarn Monitoring +sidebar_label: Apache Yarn +keywords: [Big Data Monitoring System, Apache Yarn Monitoring, ResourceManager Monitoring] +--- + +> Hertzbeat monitors Apache Yarn node monitoring metrics. + +**Protocol Used: HTTP** + +## Pre-monitoring Actions + +Retrieve the HTTP monitoring port of Apache Yarn. Value: `yarn.resourcemanager.webapp.address` + +## Configuration Parameters + +| Parameter Name | Parameter Description | +| ---------------- |----------------------------------------------------| +| Target Host | IP address, IPV6, or domain name of the monitored endpoint. Without protocol header. | +| Port | Monitoring port number of Apache Yarn, default is 8088. | +| Query Timeout | Timeout for querying Apache Yarn, in milliseconds, default is 6000 milliseconds. | +| Metrics Interval | Time interval for monitoring data collection, in seconds, minimum interval is 30 seconds. | + +### Collected Metrics + +#### Metric Set: ClusterMetrics + +| Metric Name | Unit | Metric Description | +| ----------------------- | ---- | -----------------------------------------| +| NumActiveNMs | | Number of currently active NodeManagers | +| NumDecommissionedNMs | | Number of currently decommissioned NodeManagers | +| NumDecommissioningNMs | | Number of nodes currently decommissioning | +| NumLostNMs | | Number of lost nodes in the cluster | +| NumUnhealthyNMs | | Number of unhealthy nodes in the cluster | + +#### Metric Set: JvmMetrics + +| Metric Name | Unit | Metric Description | +| ----------------------- | ---- | -------------------------------------------- | +| MemNonHeapCommittedM | MB | Current committed size of non-heap memory in JVM | +| MemNonHeapMaxM | MB | Maximum available non-heap memory in JVM | +| MemNonHeapUsedM | MB | Current used size of non-heap memory in JVM | +| MemHeapCommittedM | MB | Current committed size of heap memory in JVM | +| MemHeapMaxM | MB | Maximum available heap memory in JVM | +| MemHeapUsedM | MB | Current used size of heap memory in JVM | +| GcTimeMillis | | JVM GC time | +| GcCount | | Number of JVM GC occurrences | + +#### Metric Set: QueueMetrics + +| Metric Name | Unit | Metric Description | +| --------------------------- | ---- | -------------------------------------------- | +| queue | | Queue name | +| AllocatedVCores | | Allocated virtual cores (allocated) | +| ReservedVCores | | Reserved cores | +| AvailableVCores | | Available cores (unallocated) | +| PendingVCores | | Blocked scheduling cores | +| AllocatedMB | MB | Allocated (used) memory size | +| AvailableMB | MB | Available memory (unallocated) | +| PendingMB | MB | Blocked scheduling memory | +| ReservedMB | MB | Reserved memory | +| AllocatedContainers | | Number of allocated (used) containers | +| PendingContainers | | Number of blocked scheduling containers | +| ReservedContainers | | Number of reserved containers | +| AggregateContainersAllocated| | Total aggregated containers allocated | +| AggregateContainersReleased| | Total aggregated containers released | +| AppsCompleted | | Number of completed applications | +| AppsKilled | | Number of killed applications | +| AppsFailed | | Number of failed applications | +| AppsPending | | Number of pending applications | +| AppsRunning | | Number of currently running applications | +| AppsSubmitted | | Number of submitted applications | +| running_0 | | Number of jobs running for less than 60 minutes | +| running_60 | | Number of jobs running between 60 and 300 minutes | +| running_300 | | Number of jobs running between 300 and 1440 minutes | +| running_1440 | | Number of jobs running for more than 1440 minutes | + +#### Metric Set: runtime + +| Metric Name | Unit | Metric Description | +| ----------------------- | ---- | --------------------------| +| StartTime | | Startup timestamp | \ No newline at end of file diff --git a/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/yarn.md b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/yarn.md new file mode 100644 index 00000000000..2c88fe1e5a9 --- /dev/null +++ b/home/i18n/zh-cn/docusaurus-plugin-content-docs/current/help/yarn.md @@ -0,0 +1,83 @@ +--- +id: yarn +title: 监控:Apache Yarn监控 +sidebar_label: Apache Yarn +keywords: [大数据监控系统, Apache Yarn监控, 资源管理器监控] +--- + +> Hertzbeat 对 Apache Yarn 节点监控指标进行监控。 + +**使用协议:HTTP** + +## 监控前操作 + +获取 Apache Yarn 的 HTTP 监控端口。 取值:`yarn.resourcemanager.webapp.address` + +## 配置参数 + +| 参数名称 | 参数帮助描述 | +| ---------------- |---------------------------------------| +| 目标Host | 被监控的对端IPV4,IPV6或域名。不带协议头。 | +| 端口 | Apache Yarn 的监控端口号,默认为8088。 | +| 查询超时时间 | 查询 Apache Yarn 的超时时间,单位毫秒,默认6000毫秒。 | +| 指标采集间隔 | 监控数据采集的时间间隔,单位秒,最小间隔为30秒。 | + +### 采集指标 + +#### 指标集合:ClusterMetrics + +| 指标名称 | 指标单位 | 指标帮助描述 | +| -------------------- | -------- | ---------------------------------- | +| NumActiveNMs | | 当前存活的 NodeManager 个数 | +| NumDecommissionedNMs | | 当前 Decommissioned 的 NodeManager 个数 | +| NumDecommissioningNMs| | 集群正在下线的节点数 | +| NumLostNMs | | 集群丢失的节点数 | +| NumUnhealthyNMs | | 集群不健康的节点数 | + +#### 指标集合:JvmMetrics + +| 指标名称 | 指标单位 | 指标帮助描述 | +| -------------------- | -------- | ------------------------------------ | +| MemNonHeapCommittedM | MB | JVM当前非堆内存大小已提交大小 | +| MemNonHeapMaxM | MB | JVM非堆最大可用内存 | +| MemNonHeapUsedM | MB | JVM当前已使用的非堆内存大小 | +| MemHeapCommittedM | MB | JVM当前已使用堆内存大小 | +| MemHeapMaxM | MB | JVM堆内存最大可用内存 | +| MemHeapUsedM | MB | JVM当前已使用堆内存大小 | +| GcTimeMillis | | JVM GC时间 | +| GcCount | | JVM GC次数 | + +#### 指标集合:QueueMetrics + +| 指标名称 | 指标单位 | 指标帮助描述 | +| ------------------------ | -------- | ------------------------------------ | +| queue | | 队列名称 | +| AllocatedVCores | | 分配的虚拟核数(已分配) | +| ReservedVCores | | 预留核数 | +| AvailableVCores | | 可用核数(尚未分配) | +| PendingVCores | | 阻塞调度核数 | +| AllocatedMB | MB | 已分配(已用)的内存大小 | +| AvailableMB | MB | 可用内存(尚未分配) | +| PendingMB | MB | 阻塞调度内存 | +| ReservedMB | MB | 预留内存 | +| AllocatedContainers | | 已分配(已用)的container数 | +| PendingContainers | | 阻塞调度container个数 | +| ReservedContainers | | 预留container数 | +| AggregateContainersAllocated | | 累积的container分配总数 | +| AggregateContainersReleased | | 累积的container释放总数 | +| AppsCompleted | | 完成的任务数 | +| AppsKilled | | 被杀掉的任务数 | +| AppsFailed | | 失败的任务数 | +| AppsPending | | 阻塞的任务数 | +| AppsRunning | | 提正在运行的任务数 | +| AppsSubmitted | | 提交过的任务数 | +| running_0 | | 运行时间小于60分钟的作业个数 | +| running_60 | | 运行时间介于60~300分钟的作业个数 | +| running_300 | | 运行时间介于300~1440分钟的作业个数 | +| running_1440 | | 运行时间大于1440分钟的作业个数 | + +#### 指标集合:runtime + +| 指标名称 | 指标单位 | 指标帮助描述 | +| -------------------- | -------- | ---------------------------- | +| StartTime | | 启动时间戳 | diff --git a/home/sidebars.json b/home/sidebars.json index f84b92cffe3..0a0ecb1f5a4 100755 --- a/home/sidebars.json +++ b/home/sidebars.json @@ -229,6 +229,7 @@ "help/doris_be", "help/doris_fe", "help/hadoop", + "help/yarn", "help/hbase_master", "help/hbase_regionserver", "help/hdfs_namenode", diff --git a/manager/src/main/resources/define/app-yarn.yml b/manager/src/main/resources/define/app-yarn.yml new file mode 100644 index 00000000000..24886b22364 --- /dev/null +++ b/manager/src/main/resources/define/app-yarn.yml @@ -0,0 +1,469 @@ +# Licensed to the Apache Software Foundation (ASF) under one or more +# contributor license agreements. See the NOTICE file distributed with +# this work for additional information regarding copyright ownership. +# The ASF licenses this file to You under the Apache License, Version 2.0 +# (the "License"); you may not use this file except in compliance with +# the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. + +# The monitoring type category:service-application service monitoring db-database monitoring custom-custom monitoring os-operating system monitoring +category: bigdata +# The monitoring type eg: linux windows tomcat mysql aws... +app: yarn +# The monitoring i18n name +name: + zh-CN: Apache Yarn + en-US: Apache Yarn +# The description and help of this monitoring type +help: + zh-CN: Hertzbeat 对 Yarn 节点监控指标进行监控。
您可以点击 “新建 Apache Yarn” 并进行配置,或者选择“更多操作”,导入已有配置。 + en-US: Hertzbeat monitors the Yarn metrics.
You can click "New Apache Yarn" to configure, or select "More Actions" to import an existing configuration. + zh-TW: Hertzbeat 對 Yarn 節點監控指標進行監控。
您可以點擊 “新建 Apache Yarn” 並進行配置,或者選擇“更多操作”,導入已有配置。 + +helpLink: + zh-CN: https://hertzbeat.apache.org/zh-cn/docs/help/yarn/ + en-US: https://hertzbeat.apache.org/docs/help/yarn/ +# Input params define for monitoring(render web ui by the definition) +params: + # field-param field key + - field: host + # name-param field display i18n name + name: + zh-CN: 目标Host + en-US: Target Host + # type-param field type(most mapping the html input type) + type: host + # required-true or false + required: true + # field-param field key + - field: port + # name-param field display i18n name + name: + zh-CN: 端口 + en-US: Port + # type-param field type(most mapping the html input type) + type: number + # when type is number, range is required + range: '[0,65535]' + # required-true or false + required: true + # default value + defaultValue: 8088 + # field-param field key + - field: timeout + # name-param field display i18n name + name: + zh-CN: 查询超时时间 + en-US: Query Timeout + # type-param field type(most mapping the html input type) + type: number + # required-true or false + required: false + # hide param-true or false + hide: true + # default value + defaultValue: 6000 +# collect metrics config list +metrics: + # metrics - Server + - name: ClusterMetrics + # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel + # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue + priority: 0 + # collect metrics content + fields: + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: NumActiveNMs + type: 0 + i18n: + zh-CN: 当前存活的 NodeManager 个数 + en-US: NumActiveNMs + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: NumDecommissionedNMs + type: 0 + i18n: + zh-CN: 当前 Decommissioned 的 NodeManager 个数 + en-US: NumDecommissionedNMs + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: NumDecommissioningNMs + type: 0 + i18n: + zh-CN: 集群正在下线的节点数 + en-US: NumDecommissioningNMs + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: NumLostNMs + type: 0 + i18n: + zh-CN: 集群丢失的节点数 + en-US: NumLostNMs + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: NumUnhealthyNMs + type: 0 + i18n: + zh-CN: 集群不健康的节点数 + en-US: NumUnhealthyNMs + # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field + aliasFields: + - $.NumActiveNMs + - $.NumDecommissionedNMs + - $.NumDecommissioningNMs + - $.NumLostNMs + - $.NumUnhealthyNMs + - $.RpcProcessingTimeAvgTime + - $.CallQueueLength + calculates: + - NumActiveNMs=$.NumActiveNMs + - NumDecommissionedNMs=$.NumDecommissionedNMs + - NumDecommissioningNMs=$.NumDecommissioningNMs + - NumLostNMs=$.NumLostNMs + - NumUnhealthyNMs=$.NumUnhealthyNMs + - RpcProcessingTimeAvgTime=$.RpcProcessingTimeAvgTime + - CallQueueLength=$.CallQueueLength + protocol: http + http: + host: ^_^host^_^ + port: ^_^port^_^ + url: /jmx + method: GET + ssl: ^_^ssl^_^ + parseType: jsonPath + parseScript: '$.beans[?(@.name == "Hadoop:service=ResourceManager,name=ClusterMetrics")]' + - name: JvmMetrics + # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel + # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue + priority: 0 + # collect metrics content + # collect metrics content + fields: + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemNonHeapCommittedM + type: 0 + unit: MB + i18n: + zh-CN: JVM当前非堆内存大小已提交大小 + en-US: MemNonHeapCommittedM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemNonHeapMaxM + type: 0 + unit: MB + i18n: + zh-CN: JVM非堆最大可用内存 + en-US: MemNonHeapMaxM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemNonHeapUsedM + type: 0 + unit: MB + i18n: + zh-CN: JVM当前已使用的非堆内存大小 + en-US: MemNonHeapUsedM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemHeapCommittedM + type: 0 + unit: MB + i18n: + zh-CN: JVM当前已使用堆内存大小 + en-US: MemHeapCommittedM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemHeapMaxM + type: 0 + unit: MB + i18n: + zh-CN: JVM堆内存最大可用内存 + en-US: MemHeapMaxM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: MemHeapUsedM + type: 0 + unit: MB + i18n: + zh-CN: JVM当前已使用堆内存大小 + en-US: MemHeapUsedM + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: GcTimeMillis + type: 0 + i18n: + zh-CN: JVM GC时间 + en-US: GcTimeMillis + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: GcCount + type: 0 + i18n: + zh-CN: JVM GC次数 + en-US: GcCount + # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field + aliasFields: + - $.MemNonHeapCommittedM + - $.MemNonHeapMaxM + - $.MemNonHeapUsedM + - $.MemHeapCommittedM + - $.MemHeapMaxM + - $.MemHeapUsedM + - $.GcTimeMillis + - $.GcCount + calculates: + - MemNonHeapCommittedM=$.MemNonHeapCommittedM + - MemNonHeapMaxM=$.MemNonHeapMaxM + - MemNonHeapUsedM=$.MemNonHeapUsedM + - MemHeapCommittedM=$.MemHeapCommittedM + - MemHeapMaxM=$.MemHeapMaxM + - MemHeapUsedM=$.MemHeapUsedM + - GcTimeMillis=$.GcTimeMillis + - GcCount=$.GcCount + protocol: http + http: + host: ^_^host^_^ + port: ^_^port^_^ + url: /jmx + method: GET + ssl: ^_^ssl^_^ + parseType: jsonPath + parseScript: '$.beans[?(@.name == "Hadoop:service=ResourceManager,name=JvmMetrics")]' + - name: QueueMetrics + # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel + # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue + priority: 0 + # collect metrics content + # collect metrics content + fields: + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: queue + type: 1 + label: true + i18n: + zh-CN: 队列名称 + en-US: queue + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AllocatedVCores + type: 0 + i18n: + zh-CN: 分配的虚拟核数(已分配) + en-US: AllocatedVCores + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: ReservedVCores + type: 0 + i18n: + zh-CN: 预留核数 + en-US: ReservedVCores + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AvailableVCores + type: 0 + i18n: + zh-CN: 可用核数(尚未分配) + en-US: AvailableVCores + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: PendingVCores + type: 0 + i18n: + zh-CN: 阻塞调度核数 + en-US: PendingVCores + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AllocatedMB + type: 0 + unit: MB + i18n: + zh-CN: 已分配(已用)的内存大小 + en-US: AllocatedMB + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AvailableMB + type: 0 + unit: MB + i18n: + zh-CN: 可用内存(尚未分配) + en-US: AvailableMB + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: PendingMB + type: 0 + unit: MB + i18n: + zh-CN: 阻塞调度内存 + en-US: PendingMB + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: ReservedMB + type: 0 + unit: MB + i18n: + zh-CN: 预留内存 + en-US: ReservedMB + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AllocatedContainers + type: 0 + i18n: + zh-CN: 已分配(已用)的container数 + en-US: AllocatedContainers + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: PendingContainers + type: 0 + i18n: + zh-CN: 阻塞调度container个数 + en-US: PendingContainers + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: ReservedContainers + type: 0 + i18n: + zh-CN: 预留container数 + en-US: ReservedContainers + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AggregateContainersAllocated + type: 0 + i18n: + zh-CN: 累积的container分配总数 + en-US: AggregateContainersAllocated + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AggregateContainersReleased + type: 0 + i18n: + zh-CN: 累积的container释放总数 + en-US: AggregateContainersReleased + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsCompleted + type: 0 + i18n: + zh-CN: 完成的任务数 + en-US: AppsCompleted + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsKilled + type: 0 + i18n: + zh-CN: 被杀掉的任务数 + en-US: AppsKilled + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsFailed + type: 0 + i18n: + zh-CN: 失败的任务数 + en-US: AppsFailed + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsPending + type: 0 + i18n: + zh-CN: 阻塞的任务数 + en-US: AppsPending + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsRunning + type: 0 + i18n: + zh-CN: 提正在运行的任务数 + en-US: AppsRunning + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: AppsSubmitted + type: 0 + i18n: + zh-CN: 提交过的任务数 + en-US: AppsSubmitted + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: running_0 + type: 0 + i18n: + zh-CN: 运行时间小于60分钟的作业个数 + en-US: running_0 + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: running_60 + type: 0 + i18n: + zh-CN: 运行时间介于60~300分钟的作业个数 + en-US: running_60 + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: running_300 + type: 0 + i18n: + zh-CN: 运行时间介于300~1440分钟的作业个数 + en-US: running_300 + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: running_1440 + type: 0 + i18n: + zh-CN: 运行时间大于1440分钟的作业个数 + en-US: running_1440 + # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field + aliasFields: + - $.['tag.Queue'] + - $.AllocatedVCores + - $.ReservedVCores + - $.AvailableVCores + - $.PendingVCores + - $.AllocatedMB + - $.AvailableMB + - $.PendingMB + - $.ReservedMB + - $.AllocatedContainers + - $.PendingContainers + - $.ReservedContainers + - $.AggregateContainersAllocated + - $.AggregateContainersReleased + - $.AppsCompleted + - $.AppsKilled + - $.AppsFailed + - $.AppsPending + - $.AppsRunning + - $.AppsSubmitted + - $.running_0 + - $.running_60 + - $.running_300 + - $.running_1440 + calculates: + - queue=$.['tag.Queue'] + - AllocatedVCores=$.AllocatedVCores + - ReservedVCores=$.ReservedVCores + - AvailableVCores=$.AvailableVCores + - PendingVCores=$.PendingVCores + - AllocatedMB=$.AllocatedMB + - AvailableMB=$.AvailableMB + - PendingMB=$.PendingMB + - ReservedMB=$.ReservedMB + - AllocatedContainers=$.AllocatedContainers + - PendingContainers=$.PendingContainers + - ReservedContainers=$.ReservedContainers + - AggregateContainersAllocated=$.AggregateContainersAllocated + - AggregateContainersReleased=$.AggregateContainersReleased + - AppsCompleted=$.AppsCompleted + - AppsKilled=$.AppsKilled + - AppsFailed=$.AppsFailed + - AppsPending=$.AppsPending + - AppsRunning=$.AppsRunning + - AppsSubmitted=$.AppsSubmitted + - running_0=$.running_0 + - running_60=$.running_60 + - running_300=$.running_300 + - running_1440=$.running_1440 + protocol: http + http: + host: ^_^host^_^ + port: ^_^port^_^ + url: /jmx + method: GET + ssl: ^_^ssl^_^ + parseType: jsonPath + parseScript: '$.beans[?(@.name =~ /Hadoop:service=ResourceManager,name=QueueMetrics.*/)]' + + - name: runtime + # metrics scheduling priority(0->127)->(high->low), metrics with the same priority will be scheduled in parallel + # priority 0's metrics is availability metrics, it will be scheduled first, only availability metrics collect success will the scheduling continue + priority: 0 + # collect metrics content + fields: + # field-metric name, type-metric type(0-number,1-string), unit-metric unit('%','ms','MB'), label-whether it is a metrics label field + - field: StartTime + type: 0 + i18n: + zh-CN: 启动时间戳 + en-US: StartTime + # (optional)metrics field alias name, it is used as an alias field to map and convert the collected data and metrics field + aliasFields: + - $.StartTime + calculates: + - StartTime=$.StartTime + protocol: http + http: + host: ^_^host^_^ + port: ^_^port^_^ + url: /jmx + method: GET + ssl: ^_^ssl^_^ + parseType: jsonPath + parseScript: '$.beans[?(@.name == "java.lang:type=Runtime")]'