Vector unable to slow down the source when buffer is full #18578
-
In our configuration, we have deployed Vector in an AKS cluster as a DaemonSet. The agent collects logs using [...] The setup was working perfectly until we got a big surge in the amount of logs being generated. We found that during high load Vector was shipping all events to Azure Eventhub, which was throttling the requests. This caused the Vector pods to get into a state where they were being OOM-killed. We then read about the buffering model and thought it could be used to slow down the source when Azure Eventhub is unable to keep up with the pace of logs. Initially we started with [...] The configuration is as follows:
The pod was also continuously OOM-killed during this time. Later we switched to disk-based buffering to see if that would help. However, it was still hammering Azure Eventhub with many requests. The only way we have been able to handle the situation so far is by increasing the Azure Eventhub capacity. So we would like to know whether we are making a mistake in the buffer configuration that prevents the source from slowing down when the sink is unable to keep up with the pace. Any other suggestions are also appreciated.
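For readers following along, here is a minimal sketch of the kind of sink buffer settings being discussed. It is an illustration only, not the configuration from this deployment: the sink name `my_sink` is hypothetical, the sink type and its other options are left out, and the size is just the documented minimum for a disk buffer. In Vector's buffering model, `when_full: block` is the setting meant to apply backpressure upstream instead of dropping events.

```yaml
# Illustrative sketch only; names and values are assumptions, not the poster's config.
sinks:
  my_sink:                 # hypothetical sink name; sink type and other options omitted
    buffer:
      type: disk           # "memory" is the other buffer type
      max_size: 268435488  # bytes; applies to disk buffers (memory buffers use max_events)
      when_full: block     # wait for space and push back on upstream; alternative: drop_newest
```

Whether the source actually slows down under this setting is the open question in this thread.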
Replies: 2 comments
-
I believe this is actually a bug.
-
I created a fix here, #18634.