You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During a conversation in the pollinators slack we noticed that log lines emitted by refinery for some messages showed that the event's api_host option was empty, even though the event was being sent to honeycomb:
This caused timeout errors in the upstream honeycomb API to be treated as peer errors:
This is a bit misleading as it causes the operator to believe there's a problem with their cluster, when the issue is really coming from the honeycomb API.
Given metrics are already prefixed with upstream_ or peer_, I'm wondering whether we really need the _api and _peer suffixes, as it's a bit confusing. e.g. How should upstream_response_errors_peer differ from peer_response_errors_peer?
Would it be possible to remove this suffix so that operators can determine whether a metric is related to the honeycomb API, or the peer, by its prefix?
The text was updated successfully, but these errors were encountered:
It seems refinery checks the API host option in outgoing events to see if the message is destined for a peer, or the honeycomb API:
refinery/transmit/transmit.go
Lines 164 to 171 in 003f55f
During a conversation in the pollinators slack we noticed that log lines emitted by refinery for some messages showed that the event's
api_host
option was empty, even though the event was being sent to honeycomb:This caused timeout errors in the upstream honeycomb API to be treated as peer errors:
This is a bit misleading as it causes the operator to believe there's a problem with their cluster, when the issue is really coming from the honeycomb API.
Given metrics are already prefixed with
upstream_
orpeer_
, I'm wondering whether we really need the_api
and_peer
suffixes, as it's a bit confusing. e.g. How shouldupstream_response_errors_peer
differ frompeer_response_errors_peer
?Would it be possible to remove this suffix so that operators can determine whether a metric is related to the honeycomb API, or the peer, by its prefix?
The text was updated successfully, but these errors were encountered: