sql_exporter is losing metrics if compute is very busy #9960
Labels
a/observability
Area: related to observability
a/performance
Area: relates to performance of the system
c/compute
Component: compute, excluding postgres itself
t/bug
Issue Type: Bug
Steps to reproduce
run ingest benchmark
doc
Expected result
We see metrics collected by sql_exporter for the complete run
Actual result
we are losing metrics - most likely because sql_exporter is exceeding its scrape_timout
we observe this especially when there is large amount of backpressure from PS to compute
Environment
staging
Logs, links
https://neonprod.grafana.net/d/de3mupf4g68e8e/perf-test3a-ingest-benchmark?orgId=1&from[…]ge_tenant_endpoint_id=ep-misty-river-w2vdg495&viewPanel=19
first reported here
another observation of this - probably related
https://neondb.slack.com/archives/C04DGM6SMTM/p1731526874214679
The text was updated successfully, but these errors were encountered: