You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the performance issue
When using the kubernetes_logs source, we observed significantly lower log ingestion throughput compared to the file source.
Same environment (HDD storage):
file source: ~120k events/sec
kubernetes_logs source: ~30k events/sec
During profiling, we found that a large amount of CPU time is spent in VRL BTree insert operations inside the metadata enrichment process.
To Reproduce
Deploy Vector as a DaemonSet using kubernetes_logs source.
Generate high log throughput from multiple busy-looping pods.
Monitor ingestion performance.
Compare with a configuration using the file source directly reading /var/log/containers/....
Run perf or similar profiler; observe hotspots in:
vrl::value::value::crud::insert::insert
Expected behavior
The kubernetes_logs source should have ingestion throughput closer to the file source.
Metadata enrichment should not become the dominant CPU bottleneck at high event rates.
Proposed optimization ideas
Evaluate alternative map implementations (HashMap or specialized small-map data structures) for VRL values.