Releases: DataDog/datadog-agent
7.57.0
Agent
7.57.0
Known bugs
- ECS Fargate deployments may cause increases in RAM and CPU usage. For more information see #27523.
Prelude
Released on: 2024-09-09
- Please refer to the 7.57.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- Update cURL to 8.9.1.
- Update OpenSSL from 3.0.14 to 3.3.1 (on Linux and macOS).
New Features
- The agent diagnose command now includes a --json option to output the results in JSON format.
- Add integration value for device metadata.
- APM: To allow automatic instrumentation to work in Kubernetes clusters that enforce a Restricted Pod Security Standard, which requires all containers to explicitly set a securityContext, an option to configure a securityContext to be used for all initContainers created by the auto instrumentation has been added. This can be done through the DD_ADMISSION_CONTROLLER_AUTO_INSTRUMENTATION_INIT_SECURITY_CONTEXT environment variable or the admission_controller.auto_instrumentation.init_security_context configuration. In both cases, a JSON string should be supplied.
- Adds a kube_runtime_class tag to metrics associated with Kubernetes pods and their containers.
- Expose the Agent's get host tags function to python checks using the new datadog_agent.get_host_tags method.
- Implement static allowlist of Kubernetes events to send by default. This feature is only enabled when filtering_enabled is set to true in the kubernetes_apiserver integration configuration.
- Adds a new launcher to handle incoming logs from integrations.
- Add optional reverse DNS enrichment of private IP addresses to NDM NetFlow.
- On Windows, the default value for the service inference feature is now enabled.
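As a rough illustration of the init_security_context option described above, the following Python sketch builds a JSON string that could be supplied through DD_ADMISSION_CONTROLLER_AUTO_INSTRUMENTATION_INIT_SECURITY_CONTEXT. The keys are standard Kubernetes SecurityContext fields, but the chosen values are an assumption about what a Restricted Pod Security Standard typically requires; they are not taken from these release notes.

```python
import json

# Hypothetical securityContext for the injected init containers.
# The keys are standard Kubernetes SecurityContext fields; the values are
# an assumption chosen to satisfy a Restricted Pod Security Standard.
init_security_context = {
    "allowPrivilegeEscalation": False,
    "runAsNonRoot": True,
    "capabilities": {"drop": ["ALL"]},
    "seccompProfile": {"type": "RuntimeDefault"},
}

# The option expects a JSON string, so serialize the mapping compactly.
env_value = json.dumps(init_security_context, separators=(",", ":"))

print("DD_ADMISSION_CONTROLLER_AUTO_INSTRUMENTATION_INIT_SECURITY_CONTEXT=" + env_value)
```

Generating the string with a JSON serializer rather than writing it by hand avoids quoting mistakes when the value is placed in an environment variable.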
Enhancement Notes
- Turn on Orchestrator Explorer by default in the core agent
- Added new source_host tag to TCP/UDP logs to help users understand where their logs came from.
- Added support for handling UDP/TCP logs when running the containerized agent.
- APM: Allow custom HTTP client to be provided when instantiating the trace-agent configuration. This feature is primarily intended for the OpenTelemetry exporter.
- APM: Add default UDS listeners for traces (trace-agent) and dogstatsd (core-agent) on /var/run/datadog/apm.socket and /var/run/datadog/dsd.socket, respectively. These are used in the Single Step APM Instrumentation, improving the onboarding experience and minimizing the agent configuration.
- For the [Inferred Service Dependencies beta](https://docs.datadoghq.com/tracing/guide/inferred-service-opt-in/?tab=java), add two new peer.hostname precursor attributes, out.host and dns.hostname. This will improve coverage of inferred services because some tracer integrations only place the peer hostname in one of those attributes.
- APM stats for internal service overrides are now aggregated by the _dd.base_service tag only, enhancing visibility into specific base services.
- Include spans with span.kind=consumer for aggregation of stats on peer tags.
- IP address quantization on all peer tags is done by the backend during ingestion. This change updates the Agent to apply the same IP address quantization. This reduces unnecessary aggregation that is currently done on raw IP addresses and therefore improves the aggregation performance of stats on peer tags.
- APM: Add new setting to disable the HTTP receiver in the trace-agent. This setting should almost never be disabled and is only a convenience parameter for OpenTelemetry extensions. Disabling the receiver is semantically equivalent to setting the receiver_port to 0 and receiver_socket to "".
- Agents are now built with Go 1.22.6.
- [NDM] Adds the option to collect BGP neighbors metrics from Cisco SD-WAN.
- [NDM] Add option to collect cloud application metrics from Cisco SD-WAN.
- [Cisco SD-WAN] Allow enabling/disabling metrics collection.
- Report the hostname of Kubernetes events based on the associated pod that the event relates to.
- Introduces a parser to extract tags from integration logs and attach them to outgoing logs.
- Implement External Data environment variable injection in the Admission Controller. Format for this new environment variable is it-INIT_CONTAINER,cn-CONTAINER_NAME,pu-POD_UID. This new variable is needed for the New Origin Detection spec. It is used for Origin Detection in case Local Data are unavailable, for example with Kata Containers and CGroups v2.
- Upgraded JMXFetch to 0.49.3 which adds support for jsr77 j2ee statistics and custom ConnectionFactory. See 0.49.3 for more details.
- Windows Agent Installer gives a better error message when a gMSA account is provided for ddagentuser that Windows does not recognize.
- Uninstalling the Windows Agent MSI Installer removes specific subdirectories of the install path to help prevent data loss when PROJECTLOCATION is misconfigured to an existing directory.
- Adds a default upper limit of 10000 to the number of network traffic paths that are captured at a single time. The user can increase or decrease this limit as needed.
- Language detection can run on the core Agent without needing a gRPC server.
- Add Hostname and ExtraTags to CollectorECSTask.
- Collect SystemInfo for Pods and ECS Tasks.
- Implement API that allows Python checks to send logs for eventual submission.
- Users can use DD_ORCHESTRATOR_EXPLORER_CUSTOM_SENSITIVE_ANNOTATIONS_LABELS to remove sensitive annotations and labels. For example: DD_ORCHESTRATOR_EXPLORER_CUSTOM_SENSITIVE_ANNOTATIONS_LABELS="sensitive-key-1 sensitive-key-2". Keys should be separated by spaces. The agent removes any annotations and labels matching these keys.
- Add the ability to tag interface metrics with user-defined tags.
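The External Data format mentioned in the enhancement notes above (it-INIT_CONTAINER,cn-CONTAINER_NAME,pu-POD_UID) can be read with a few lines of Python. This is only a sketch of how a consumer might parse such a string: the function name is hypothetical, and treating the it- field as the literal string "true" or "false" is an assumption, since the release notes only give the overall layout.

```python
def parse_external_data(value: str) -> dict:
    """Parse a string of the form it-INIT_CONTAINER,cn-CONTAINER_NAME,pu-POD_UID."""
    fields = {}
    for item in value.split(","):
        # Split on the first "-" only, because pod UIDs contain dashes themselves.
        prefix, sep, rest = item.partition("-")
        if not sep:
            continue  # skip malformed items that lack a prefix separator
        fields[prefix] = rest
    return {
        # Assumption: the init-container flag is the literal string "true" or "false".
        "init_container": fields.get("it", "").lower() == "true",
        "container_name": fields.get("cn", ""),
        "pod_uid": fields.get("pu", ""),
    }


example = parse_external_data("it-false,cn-nginx,pu-5f4dcc3b-aaaa-bbbb-cccc-000000000000")
print(example)
```

Missing fields fall back to empty values instead of raising, which keeps the parser tolerant of partial data.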
Security Notes
- Fix CVE-2024-41110.
Bug Fixes
- Results of agent config did not reflect the actual runtime config for the other services. This will have other Datadog Agent services (e.g. trace-agent) running as a systemd service read the same environment variables from a text file /etc/datadog-agent/environment as the core Agent process.
- [DBM] Bump go-sqllexer to 0.0.13 to fix a bug where the table name is incorrectly collected on PostgreSQL SELECT ONLY statement.
- [Cisco SD-WAN] Do not collect unspecified IP addresses.
- Fix container.net.* metrics accuracy on Linux. Currently container.net.* metrics are always emitted with high cardinality tags while the values may not represent actual container-level values but POD-level values (multiple containers in a pod) or host-level values (containers running in host network). With this bug fix, the container.net.* metrics aren't emitted for containers running in host network and a single timeseries is emitted by pods when running multiple containers. Finally, in non-Kubernetes environments, if multiple containers share the same network namespace, container.net.* metrics won't be emitted.
- Fix duplicate logging in Process Agent component's Enabled() method.
- Fixed bug in kubelet check when running in core agent that was causing kubernetes.kubelet.container.log_filesystem.used_bytes to be reported by the check for excluded/non-existing containers. The metric was being reported in this case without tags. This bug does not exist in the python integration version of the kubelet check.
- Fixes a bug on Windows in the driver installation custom actions that could prevent rollback from working properly if an installation failed or was canceled.
- Update pro-bing library to include fix for a Windows specific issue with large ICMP packets
- [oracle] Fix wrong durations for cloud databases.
- Stop chunking outputs in manual checks for container, process, and process_discovery checks to allow JSON unmarshaler to parse output.
- Remove the original pod annotation on consul
- Fix pod status for pods using native sidecars.
- Fix a regression where the Agent would fail to start on systems with SysVinit.
- APM: Fixes issue where the number of HTTP decoders was incorrectly set if setting GOMAXPROCS to milli-cpu values.
Other Notes
- Add metrics origins for vLLM integration.
- Add deprecation warnings when running process checks on the Process Agent in Linux. This change prepares for the deprecation of processes and container collection in the Process Agent, occurring in a future release.
- Add metric origin for the AWS Neuron integration
Datadog Cluster Agent
7.57.0
Prelude
Released on: 2024-09-09 Pinned to datadog-agent v7.57.0: CHANGELOG.
New Features
- The Cluster Agent now supports activating Continuous Profiling using Admission Controller.
- LimitRange and StorageClass resources are now collected by the orchestrator check.
Enhancement Notes
- The auto-ins...
7.56.2
Agent
7.56.2
Prelude
Released on: 2024-09-02
- Please refer to the 7.56.2 tag on integrations-core for the list of changes on the Core Checks
Bug Fixes
- Fix issue causing GUI to fail when opening with Internet Explorer on Windows.
Datadog Cluster Agent
7.56.2
Prelude
Released on: 2024-09-02 Pinned to datadog-agent v7.56.2: CHANGELOG.
7.56.1
Agent
7.56.1
Prelude
Released on: 2024-08-29
- Please refer to the 7.56.1 tag on integrations-core for the list of changes on the Core Checks
Bug Fixes
- Fixed a nil pointer dereference issue in the Tailer.DidRotate function that was causing the Agent to panic.
Datadog Cluster Agent
7.56.1
Prelude
Released on: 2024-08-29 Pinned to datadog-agent v7.56.1: CHANGELOG.
7.56.0
Agent
Prelude
Released on: 2024-08-16
- Please refer to the 7.56.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- Change default bind address in OTLP ingest from 0.0.0.0 to localhost. Please refer to the following blog post: https://opentelemetry.io/blog/2024/hardening-the-collector-one for additional information around this change.
- Update cURL to 8.7.1.
New Features
- The core Agent now supports multiple configuration files in addition to the main datadog.yaml file. The -E flag can be used to specify additional configuration files to be loaded sequentially after the main datadog.yaml.
- When DD_SERVERLESS_STREAM_LOGS is enabled, DD_EXTENSION prints collected logs like agent stream-logs.
. - Add full support of CIS Amazon Linux 2 Benchmark in CSPM.
- Add full support of CIS Amazon Linux 2023 Benchmark in CSPM.
- NPM - Adds the capability to track and report failed TCP connections to the Datadog backend. This feature is disabled by default.
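To picture what loading additional configuration files sequentially after the main datadog.yaml could mean in practice, here is a small Python sketch that merges configuration dictionaries in order. It rests on a stated assumption that is not confirmed by these notes: that a value in a later file overrides the same key from an earlier file, with nested sections merged key by key. The helper names are hypothetical, and plain dictionaries stand in for parsed YAML files.

```python
from copy import deepcopy


def merge_config(base: dict, extra: dict) -> dict:
    """Return a new dict in which values from extra override those in base."""
    merged = deepcopy(base)
    for key, value in extra.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Assumption: nested sections are merged rather than replaced wholesale.
            merged[key] = merge_config(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def load_sequentially(main: dict, extras: list) -> dict:
    """Apply each extra configuration on top of the main one, in order."""
    result = deepcopy(main)
    for extra in extras:
        result = merge_config(result, extra)
    return result


# Stand-ins for the parsed main file and two additional files.
main_config = {"site": "datadoghq.com", "logs_config": {"container_collect_all": False}}
extra_configs = [
    {"logs_enabled": True},
    {"logs_config": {"container_collect_all": True}},
]

effective = load_sequentially(main_config, extra_configs)
print(effective)
```

The inputs are copied rather than mutated, so the main configuration stays available for comparison after the merge.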
Enhancement Notes
- Add the use_apiserver_cache option to the kubernetes_state_metrics check to reduce the pressure on the underlying storage engine etcd. Requires Kubernetes 1.19+.
- APM: Add obfuscation support for OpenSearch statements within span metadata. This feature works in the same way as the existing Elasticsearch one, and is enabled by default. It is configured by binding apm_config.obfuscation.opensearch.* parameters to new obfuscation environment variables. In particular, bind:
  - apm_config.obfuscation.opensearch.enabled to DD_APM_OBFUSCATION_OPENSEARCH_ENABLED: It accepts a boolean value with default value true.
  - apm_config.obfuscation.opensearch.keep_values to DD_APM_OBFUSCATION_OPENSEARCH_KEEP_VALUES: It accepts a list of strings of the form ["id1", "id2"].
  - apm_config.obfuscation.opensearch.obfuscate_sql_values to DD_APM_OBFUSCATION_OPENSEARCH_OBFUSCATE_SQL_VALUES: It accepts a list of strings of the form ["key1", "key2"].
- Agents are now built with Go 1.22.4.
- Agents are now built with Go 1.22.5.
- Bump dependency msodbcsql18 to version 18.3.3.1.
- Adds config kubernetes_events_source_detection.enabled which is false by default. When set to true, this sets the source of kubernetes events to specific integrations based on the name of the controller that emitted it. All kubernetes events will have the tag orchestrator:kubernetes. For controller names that do not match any of the known integrations, the source will still be set to kubernetes by default.
- Introduces a bundle_unspecifed_events config to the docker integration. When bundle_unspecifed_events and unbundle_events are true, Docker events are unbundled according to collected_event_types and the remaining events are bundled after excluding the filtered_event_types and collected_event_types.
- The Agent will now ignore empty configuration files in conf.d. Users can use this functionality to avoid creating broken integrations when deploying agents with provisioning systems that do not allow skipping files entirely.
- Introduces a bundle_unspecified_events config to the kubernetes_apiserver integration. When bundle_unspecified_events and unbundle_events are true, Kubernetes events are unbundled according to collected_event_types and the remaining events are bundled.
- Improve utility functions that start or stop a service to better manage the SERVICE_START_PENDING and SERVICE_STOP_PENDING states by waiting for the service transition to complete. This will improve handling of concurrent execution of agent.exe start and agent.exe restart commands for the Windows Agent.
- Make the oom_kill check capture the OOM score and the OOM score adjustment of the process being killed.
- Oracle integration will now auto-adjust the size of the SQL substring requested by the activity sampler to better support users with multi-byte character sets, for example, Korean, Arabic, etc. This should alleviate crashes caused by long queries using these characters.
- The OTLP ingestion endpoint now supports the same settings and protocol as the OpenTelemetry Collector OTLP receiver v0.103.0.
- APM: Probabilistic Sampler now only looks at the lower 64 bits of a trace ID by default to improve compatibility in distributed systems where some apps may truncate the trace ID. To maintain the previous behavior use the feature flag probabilistic_sampler_full_trace_id.
- Adds the source of the payload for Processes-owned messages.
- Add tags to CollectorManifest
- Add image and imageID fields to pods ContainerStatuses.
- The orchestrator check can now scrub sensitive data from probes in pods specifications.
- The Agent now populates the git.repository_url and git.commit.sha tags from the values of the DD_GIT_REPOSITORY_URL and DD_GIT_COMMIT_SHA container environment variables.
- Implement the kubernetes_persistent_volume_claims_as_tags configuration that allows users to disable PersistentVolumeClaim for Kubernetes resources.
- Upgrade the NTP check client library 'beevik/ntp' from v0.3.0 to v1.3.4
- Use cloud-provided hostname as default when running the Agent in AKS.
- APM: Enabled zstd compression by default on trace payloads.
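The Probabilistic Sampler note above says that only the lower 64 bits of a trace ID are considered by default. The Python sketch below shows why that helps when some services truncate 128-bit trace IDs: masking the full ID and the truncated ID gives the same value, so any decision derived from it would match. The trace ID values and the helper name are made up for illustration, and the sketch does not reproduce the Agent's actual sampling decision.

```python
LOWER_64_MASK = (1 << 64) - 1


def lower_64_bits(trace_id: int) -> int:
    """Return the lower 64 bits of a (possibly 128-bit) trace ID."""
    return trace_id & LOWER_64_MASK


# Illustrative values: a 128-bit trace ID and the same ID after an
# application truncated it to its lower 64 bits.
full_id = 0x5B8EFFF798038103D269B633813FC60C
truncated_id = 0xD269B633813FC60C

# Both forms reduce to the same value, so a sampler keyed on it stays consistent.
print(lower_64_bits(full_id) == lower_64_bits(truncated_id))  # True
```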
Deprecation Notes
- APM: DD_APM_MAX_TPS config setting is deprecated in favor of the more accurate DD_APM_TARGET_TPS. Accordingly, when configured through YAML, max_traces_per_second is deprecated in favor of target_traces_per_second. The setting behavior remains the same, only the name is changed to more accurately reflect the existing logic.
Security Notes
- Updating OpenSSL to 3.0.14 to address CVE-2024-4741 (on Linux and macOS).
Bug Fixes
- Upgrades the pro-bing library to fix a Windows-only bug with too-long ICMP packets being received
- Fix ExtraTags mapping for CollectorManifest.
- Fix a bug in the Agent where it could potentially fetch logs of short-lived Kubernetes jobs twice if the CRI is Docker.
- Re-enable printing of checks metadata in the datadog-agent status collector section.
- Fix OTLP status output not being displayed in the GUI.
- Fix issue where init config for ping took priority over instance config.
- Fix diagnose command for logs endpoints and related warnings about unknown config keys.
- Fixes oracle.tablespace.offline metric not emitting 1 when tablespace is offline.
- APM: Show probabilistic sampling configuration in Agent status when enabled.
- Add a field to differentiate between empty and undefined podSelector or namespaceSelector for network policies.
- Fixed a bug where the file tailing position was always set to the beginning. This fix allows users to explicitly set a starting position.
- All Datadog public endpoints close idle connections after at most 60 seconds. If a client keeps an idle connection open for longer, the server closes it, and the client will likely only see the issue during the next write, leading to a connection reset error. The client idle timeout should therefore be set under a minute. This change reduces the timeout from 90s to 30s.
- Windows: Added driver rollback properties to ensure that all services and drivers are uninstalled or rolled back after an installation or upgrade failure.
Other Notes
- Add metric origins for community Python integrations.
Datadog Cluster Agent
Prelude
Released on: 2024-08-16 Pinned to datadog-agent v7.56.0: CHANGELOG.
Upgrade Notes
- Disables default injection of the .NET profiler dependency for Kubernetes auto_instrumentation.
Enhancement Notes
- Mark the NetworkPolicy collector as stable in the Cluster Agent
- Enabled language detection automatically in the injected agent sidecar on EKS Fargate when APM SSI is enabled. This is only available for users using the admission controller to automatically inject the agent sidecar.
- The orchestrator check can now scrub sensitive data from probes in pods specifications.
Bug Fixes
- Fixes issue where the external metrics server would sometimes return metrics which had not been updated for longer than the configured external_metrics_provider.max_age as valid. In connection with this fix, a new config (external_metrics_provider.query_validity_period) has been added to account for the delay between when metrics are resolved and when they are queried by the various autoscaling controllers. It is set to 30 seconds by default.
7.55.3
Agent
7.55.3
Prelude
Released on: 2024-08-01
- Please refer to the 7.55.3 tag on integrations-core for the list of changes on the Core Checks
Enhancement Notes
- Agents are now built with Go 1.21.12.
Security Notes
- Fix CVE-2024-41110.
Datadog Cluster Agent
7.55.3
Prelude
Released on: 2024-08-01 Pinned to datadog-agent v7.55.3: CHANGELOG.
7.55.2
Agent
7.55.2
Prelude
Released on: 2024-07-25
- Please refer to the 7.55.2 tag on integrations-core for the list of changes on the Core Checks
Security Notes
- Fix CVE-2024-6257.
Bug Fixes
- Fixes an issue introduced in 7.55.0 with container metrics. In some rare cases, container metrics (cpu, memory, limits, etc.) could be incorrect and not reflect actual resources usage.
Datadog Cluster Agent
7.55.2
Prelude
Released on: 2024-07-25 Pinned to datadog-agent v7.55.2: CHANGELOG.
7.55.1
Agent
Prelude
Released on: 2024-07-12
- Please refer to the 7.55.1 tag on integrations-core for the list of changes on the Core Checks
Bug Fixes
- Fix a regression where the agent would fail to start on systems with SysVinit
Datadog Cluster Agent
Prelude
Released on: 2024-07-12 Pinned to datadog-agent v7.55.1: CHANGELOG.
7.55.0
Agent
Known bug
The datadog-agent service fails to start on systems using the SysVInit init system, with a datadog-agent: unrecognized service error. See #27523 for details.
Prelude
Released on: 2024-07-11
- Please refer to the 7.55.0 tag on integrations-core for the list of changes on the Core Checks
New Features
- Add new parameter djm_config.enabled to enable Data Jobs Monitoring
- You can now set the buffering configuration of Lambda Logs API using DD_LOGS_CONFIG_LAMBDA_LOGS_BUFFERING_TIMEOUT_MS, DD_LOGS_CONFIG_LAMBDA_LOGS_BUFFERING_MAX_BYTES, and DD_LOGS_CONFIG_LAMBDA_LOGS_BUFFERING_MAX_ITEMS.
- Adds the new service_discovery check, which is currently only supported on Linux.
- APM: OTLP: Added an API in pkg/trace/stats that converts eligible OTLP spans to the input to APM stats concentrator.
- Experimental support of SBOM generation on Windows.
- Expose the Agent's MongoDB command string obfuscation to Python checks using the new datadog_agent.obfuscate_mongodb_string method.
- [network path] Add feature to traceroute network connections.
- Implement OpenTelemetry SDK resource attributes as unified service tags.
- CSM captures and generates events based on IMDS traffic
- Add support for kubernetes_namespace_annotations_as_tags. This new option is equivalent to the existing kubernetes_namespace_labels_as_tags, but it considers namespace annotations instead of namespace labels. With this new option, users can enrich tagging based on namespace annotations.
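The kubernetes_namespace_annotations_as_tags option above can be pictured as a lookup from annotation names to tag names. The Python sketch below assumes the option takes such a mapping, by analogy with kubernetes_namespace_labels_as_tags; the function name, the annotation names, and the tag names are all hypothetical, and any wildcard or templating behavior is not modeled.

```python
def annotations_to_tags(annotations: dict, mapping: dict) -> list:
    """Build "tag:value" strings for annotations listed in the mapping."""
    tags = []
    for annotation_name, tag_name in mapping.items():
        if annotation_name in annotations:
            tags.append(f"{tag_name}:{annotations[annotation_name]}")
    return sorted(tags)


# Hypothetical namespace annotations and an annotation-to-tag mapping.
namespace_annotations = {
    "example.com/team": "payments",
    "example.com/cost-center": "cc-42",
    "unrelated/annotation": "ignored",
}
annotations_as_tags = {
    "example.com/team": "team",
    "example.com/cost-center": "cost_center",
}

print(annotations_to_tags(namespace_annotations, annotations_as_tags))
```

Annotations that are not listed in the mapping are left out, so only explicitly selected metadata becomes a tag.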
Enhancement Notes
- Add a section in the Agent status to show device profile parsing errors.
- Add Go execution traces to Agent Flare.
- Introduce the port conflict suite in the Agent diagnose command. It is available on macOS only at first; support for Linux and Windows will be added incrementally. The Agent might need particular permissions to read information about processes using ports.
- Agents are now built with Go 1.21.11.
- [NDM] Add option to collect hardware status from Cisco SD-WAN.
- Fix concurrency handling by Agent restart-service CLI command when the Agent starts and simultaneously restarts. The Agent may start and restart concurrently during the Agent's installation or upgrade.
- OOM kill events in the OOM kill check are now reported at ERROR level instead of INFO level.
- Improve concurrency handling for Agent start/simultaneous restart. Agent starting and restarting concurrent execution may happen during the Agent's installation or upgrade.
- Add support for parsing pool and peer directives in locally defined configuration files in the NTP check.
- Improve error handling in the Lambda Logs API handler not to ignore error details
- Improve NTP check log messages.
- Also add kube_service tag to pods that run with hostNetwork: true.
- Added a new -l flag to the datadog-agent status command. This flag allows users to print all available sections.
- Improve NTP check error messages to show the specific hosts that are failing.
- Store SNMP device tags on disk to avoid losing them when the agent restarts.
- The Datadog agent container image is now using Ubuntu 24.04 noble as the base image.
- Improve GUI security authorization flow.
Deprecation Notes
- Deprecated datadog-agent status component [section] in favor of datadog-agent status [section].
Security Notes
- Current GPG keys that are used to sign new releases of the Agent package are about to expire. Following our 2024 GPG key rotation plan, we rotated RPM and APT GPG keys.
Bug Fixes
- Fixes the validation of permissions for the secret backend helper.
- APM Stats type for is_trace_root flag is now renamed to trilean.
- [DBM] Bump go-sqllexer to 0.0.12 to fix a bug where table name is incorrectly collected on statement CREATE TABLE IF NOT EXISTS.
- Correct log level of trace writer log message
- Fix an issue introduced in Agent 7.51 for Windows that would cause a panic during Agent shutdown if the etwtracer component failed to initialize.
- Fixed an issue where the datadog-agent status and datadog-agent flare commands could hang while trying to reach other agent processes.
- Fix an issue where the Agent incorrectly reports JMX integrations as having issues.
- Fix leader election error when running checks manually from the Cluster Agent.
- [oracle] Fix missing query metrics.
- Fix metric reporting for process checks on the core Agent.
- Fix the status date so that it is computed for every request.
- Fix a bug where containerd container metrics and container tags were not being collected for containers with arbitrary container IDs.
Other Notes
- Added metrics origins for python integrations.
- Added metrics origins for the most recent python integrations.
Datadog Cluster Agent
Released on: 2024-07-11 Pinned to datadog-agent v7.55.0: CHANGELOG.
New Features
- Add support for kubernetes_namespace_annotations_as_tags. This new option is equivalent to the existing kubernetes_namespace_labels_as_tags, but it considers namespace annotations instead of namespace labels. With this new option, users can enrich tagging based on namespace annotations.
- Support namespace labels as tags on kubernetes events.
Enhancement Notes
- Add reason:backofflimitexceeded,deadlineexceeded to the kubernetes_state.job.failed metric to help users understand why a job failed.
- Reduced the memory used to store the tags.
- The Datadog cluster-agent container image is now using Ubuntu 24.04 noble as the base image.
Bug Fixes
- Fixes an issue with large clusters where the Cluster Agent fails to collect all tags when cluster_agent.collect_kubernetes_tags is enabled.
7.54.1
7.54.0
Agent
Prelude
Released on: 2024-05-29
- Please refer to the 7.54.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- Upgraded JMXFetch to 0.49.1 which adds support for ZGC Major and Minor Cycles and ZGC Major and Minor Pauses beans.
- Add a configuration option admission_controller.inject_config.inject_container_name to retrieve the container tags for dogstatsd metrics and apm traces in setups where origin detection is not supported (windows, nested virtualization with cgroupv2...). This option will inject the container name along with the pod uid in DD_ENTITY_ID if it was not set manually. Note that this option is disabled by default and is incompatible with prior versions of the agent. Thus, we recommend enabling it only if you are using the latest version of the agent. In case a rollback to an incompatible version is needed, it will be necessary to delete the pods in which DD_ENTITY_ID was injected by the latest agent to preserve pod tags in dogstatsd metrics and traces.
New Features
- APM: Add a new Probabilistic Sampler sampling mechanism, which is compatible with OpenTelemetry's Probabilistic Sampling Processor.
- CWS: Add support for write events on Windows.
- CWS: Add support for delete event on Windows.
- CWS: Add chdir, mount and umount in ptracer mode.
- CWS: Add APM spans in ptracer mode.
- CWS: Add support for file hashing in ptracer mode.
- CWS: Allow writing rules for rename events on Windows.
- Set kubelet core check to be enabled by default.
- Add dogstatsd_pipeline_autoadjust_strategy setting with max_throughput and per_origin strategies. max_throughput is the already existing strategy (when dogstatsd_pipeline_autoadjust is true). per_origin will let you create an arbitrary number of pipelines (based on dogstatsd_pipeline_count), and will try to isolate containers based on their advertised origin. This will improve compression.
- [NDM] Add Cisco SD-WAN integration.
- In the Logs Agent, add support for the Sensitive Data Scanner library to process logs.
- USM now captures TLS traffic from NodeJS applications.
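The per_origin strategy described above isolates containers into pipelines based on their advertised origin. The release notes do not say how an origin is assigned to a pipeline, so the following Python sketch uses a stable hash modulo the pipeline count purely as a stand-in. The function name and the origin strings are hypothetical, and this should not be read as the Agent's actual algorithm.

```python
import zlib


def pipeline_for_origin(origin: str, pipeline_count: int) -> int:
    """Map an origin to a pipeline index using a stable hash (illustrative only)."""
    if pipeline_count < 1:
        raise ValueError("pipeline_count must be at least 1")
    # crc32 is deterministic across runs, unlike Python's built-in hash() for strings.
    return zlib.crc32(origin.encode("utf-8")) % pipeline_count


pipeline_count = 4  # plays the role of dogstatsd_pipeline_count
for origin in ["container-a", "container-b", "container-a"]:
    print(origin, "->", pipeline_for_origin(origin, pipeline_count))
```

Because the mapping is deterministic, every sample from the same origin lands in the same pipeline. Grouping samples that share an origin in this way plausibly makes each payload more repetitive, which fits the compression benefit mentioned in the note.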
Enhancement Notes
- This change allows DBM Autodiscovery users to optionally set the region for where their aurora clusters are running. This can be used in lieu of relying on IMDS to discover the region through instance metadata. This is a nicer experience for users running in Docker, who would be required to complete extra steps in their instance metadata configuration to allow the Docker container access to the instance metadata.
- Activate the autodiscovery automatically if at least one configuration is given in network_devices.autodiscovery. Remove the listeners: -snmp requirement.
- Updates the Agent status output to show if a check is in a cancelling state. Once canceled, the check is removed from the status output.
- Report EKS Fargate configuration to the Agent metadata payload.
- Adding dirname tag for full filepath log configurations. This tag was only added previously if using a wildcard filepath log configuration.
- Agents are now built with Go 1.21.9.
- Collect clusters' extended resources capacity and allocatable.
- Collect all node resource capacities and allocatable metrics for cluster management.
- Collect pod limits and requests.
- CWS: Always snapshot memory mapped files in activity dumps. This allows easier mapping of workload to loaded shared libraries.
- CWS: Normalize Windows filenames before going through glob matching.
- CWS: Support non seccomp mode in ptracer mode.
- Add agent status [name] subcommand to fetch only a given section status (for example, datadog-agent status forwarder to get forwarder status).
- dogstatsd: Implement new config option origin_detection_unified. This new option will allow users to configure the origin detection behavior for DogStatsD. When enabled, the DogStatsD server will use the default Origin Detection logic.
- snmp_listener now lives under network_devices with the name network_devices.autodiscovery. While the old configuration will continue to work, if both configurations are present, the new one will take priority.
- [oracle] Add oracle.user_sessions metric.
- The OTLP ingestion endpoint now supports the same settings and protocol as the OpenTelemetry Collector OTLP receiver v0.97.0.
- Supports newer Podman versions (4.8+) that use SQLite instead of BoltDB for the containers database backend. Setting podman_db_path to the path with the db.sql file (e.g. /var/lib/containers/storage/db.sql) will make the Datadog Agent use the SQLite format. Note: If podman_db_path is not set (default), the Datadog Agent attempts to use the default file libpod/bolt_state.db and db.sql from /var/lib/containers/storage.
- Allow certain Process Agent checks to be run from the core agent using the processchecks subcommand.
- Check output from the Process Agent component is added to the flare when used in the core Agent.
- expvars from the Process Agent component are added to the flare when used in the core Agent.
- Status of the Process Agent component will be shown when used in the core Agent.
- NDM: SNMP devices are now tagged by device_ip and device_id.
Deprecation Notes
- The datadog-agent status component [name] syntax will be replaced by datadog-agent status [name] in Datadog Agent 7.55
- Removed log that was noisy when Process Agent was running on ECS Fargate.
- The flare_stripped_keys configuration is now deprecated. Use scrubber.additional_keys instead.
Bug Fixes
- APM: Fix potential connection issues by ensuring connection semaphore release during errors.
- APM: Removed unsupported configuration parameter apm_config.log_throttling from code and documentation.
- Disables the creation of build-id files in RPM packages. These are provided for debugging tools, but can lead to conflicts when multiple packages have the same build-id files. This conflict prevents the second package from being installed.
- Fix a race condition that could prevent JMX checks from running.
- OTLP ingest for traces now supports stable (v1.23.0+) semantic conventions for HTTP Spans. Old (v1.20.0 and older) semantic conventions are still supported. When both are reported, the new semantic conventions take precedence and old semantic conventions are ignored.
- Don't log when failing to fetch config of other Agents
- The User-Agent header is now set to datadog-agent/<version> for logs forwarding requests to /api/v2/logs. Previously it was set to Go-http-client/1.1.
- APM: Fixes issue where match-all replace tags rules could inadvertently affect required datadog tags. It is still possible to redact specific Datadog tags by targeting them explicitly.
- Fixes containers file-based log collection when using the k8s-file podman log driver and the logs_config.use_podman_logs parameter.
- The Process Agent no longer crashes when pidMode is misconfigured on ECS Fargate. A warning is logged instead.
- Fix panic when running process checks in the core Agent with telemetry enabled.
- The Agent health check will now continue running even if the API key validation endpoint returns a 403 response code. Because this can occur due to transient issues, retrying will allow the Agent health indicator to recover in some cases.
- Fix type conversion error while generating the trace-agent status.
- APM: fix a bug where the Trace Agent would ignore proxy.no_proxy configuration values
- Create missing default configuration files during the upgrade, change, and repair actions of the Windows Installer.
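The OTLP bug fix above states that when both old and new HTTP semantic conventions are reported on a span, the new ones take precedence and the old ones are ignored. A minimal Python sketch of that precedence rule follows. The attribute names come from the OpenTelemetry HTTP semantic conventions, but the function name, the choice of only two attribute pairs, and the output shape are assumptions made for illustration rather than the Agent's implementation.

```python
# (new, old) attribute name pairs from the OpenTelemetry HTTP semantic conventions.
HTTP_ATTRIBUTE_PAIRS = [
    ("http.request.method", "http.method"),
    ("http.response.status_code", "http.status_code"),
]


def resolve_http_attributes(span_attributes: dict) -> dict:
    """Pick the new-convention value when present, else fall back to the old one."""
    resolved = {}
    for new_name, old_name in HTTP_ATTRIBUTE_PAIRS:
        if new_name in span_attributes:
            resolved[new_name] = span_attributes[new_name]  # new convention wins
        elif old_name in span_attributes:
            resolved[new_name] = span_attributes[old_name]  # old one is still supported
    return resolved


# A span that reports the method under both conventions and the status code
# only under the old one.
span = {
    "http.request.method": "POST",
    "http.method": "GET",
    "http.status_code": 200,
}
print(resolve_http_attributes(span))
```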
Other Notes
- APM: While adding another sampler, we have modified the Rare sampler so that it no longer has a separate TTL for priority traces, meaning it no longer distinguishes between traces with priority > 0 and priority <= 0. It is necessary to detangle the various samplers so they can work independently from each other, in this case, so the Rare sampler can work with the Probabilistic Sampler. This should not have a noticeable impact on users.
- CWS: Allow fim_enabled to explicitly be set to false on Windows to enable process monitoring only.
Datadog Cluster Agent
Prelude
Released on: 2024-05-29 Pinned to datadog-agent v7.54.0: CHANGELOG.
New Features
- Add LimitRange and StorageClass collection in the orchestrator check.
Enhancement Notes
- Added retry mechanism to language detection patcher in order to retry failed patching operations.
Bug Fixes
- Fix collection of numeric rolling update options in Kubernetes deployments and daemonsets.
- Fixed initialization of language expiration time for detected languages.