It would be nice to allow a dynamic SendDelay #1184

kentquirk · 2024-06-04T13:13:41Z

Is your feature request related to a problem? Please describe.

In async systems, different parts of the system sometimes need to wait different amounts of time for the rest of a trace to arrive. A global SendDelay setting forces every trace to wait for the worst case.

Customer:

Async tasks where the trace context is carried across a message queue often complete long after the global SendDelay has elapsed. It would be nice to catch these traces and wait longer for them to complete (in many cases, for us, the root span is an async function that returns almost immediately after dispatching work).

Describe the solution you'd like

If the root span has a numeric field called refinery.trace_send_delay, then instead of using the configured SendDelay, Refinery will wait for the number of seconds specified in that field before making a decision on that trace.
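
As a rough illustration of the proposal above, the lookup could work something like the following Python sketch. The function name effective_send_delay and the fallback for missing, non-numeric, or negative values are assumptions made for illustration only; the issue does not specify them, and nothing here is taken from Refinery's actual code.

```python
from numbers import Real

# Field name proposed in this issue.
TRACE_SEND_DELAY_FIELD = "refinery.trace_send_delay"


def effective_send_delay(root_span_fields, configured_send_delay):
    """Return the number of seconds to wait before deciding on a trace.

    root_span_fields is a dict of the root span's fields (hypothetical shape).
    configured_send_delay is the global SendDelay, in seconds.
    """
    value = root_span_fields.get(TRACE_SEND_DELAY_FIELD)
    # Assumption: only non-negative numbers count as a valid override.
    # bool is excluded explicitly because it is a subclass of int in Python.
    if isinstance(value, Real) and not isinstance(value, bool) and value >= 0:
        return float(value)
    # Anything else (missing, string, negative, NaN) falls back to the global setting.
    return float(configured_send_delay)
```

For example, effective_send_delay({"refinery.trace_send_delay": 30}, 2.0) would return 30.0, while a root span without the field would get the configured 2.0. This sketch applies no upper bound to the override; whether one is needed is an open design question.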

Describe alternatives you've considered

Rules for setting SendDelay that are similar to decision rules, but are applied immediately when the root span arrives. This is complex to specify and evaluate.
Allow a debounce parameter as an alternative to SendDelay (call it DecisionTimeout or something). If configured, any new span that arrives for a trace resets that trace's clock, so the decision is deferred as long as late spans aren't TOO late; the decision is made only once spans stop arriving within the timeout.
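
To make the debounce alternative concrete, here is a minimal Python sketch of the clock-reset behavior. The class DebouncedTrace and its methods are hypothetical names invented for this example, and timestamps are passed in explicitly as seconds so the logic is easy to follow; this is not how Refinery is implemented.

```python
class DebouncedTrace:
    """Tracks when a trace becomes eligible for a sampling decision.

    Every arriving span pushes the deadline out by decision_timeout seconds,
    so the decision is made only after spans stop arriving for that long.
    """

    def __init__(self, decision_timeout, now):
        self.decision_timeout = decision_timeout
        # The first span starts the clock.
        self.deadline = now + decision_timeout

    def add_span(self, now):
        # A new span resets the trace's clock.
        self.deadline = now + self.decision_timeout

    def ready(self, now):
        # The trace is ready once no span has arrived within the timeout.
        return now >= self.deadline
```

With a 5-second timeout, a trace that starts at t=0 and receives another span at t=4 would not be decided until t=9. A steady trickle of spans could postpone the decision indefinitely in this sketch, so a real design would probably also want some overall limit.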

Additional context

Slack thread in pollinators


bixu · 2024-06-04T13:18:24Z

Ideally, we'd be able to tune delays around the sampling decision per-rule, since the context we want the rule to evaluate in is usually enough for us to know if we want to wait longer than normal (or ignore the root span closing early).

But also, in our environment, asking users to add refinery.trace_send_delay to their trace doesn't feel like an excessive lift. The users most affected tend to understand tracing, and the need for usable traces, pretty well.

kentquirk · 2024-09-10T18:01:23Z

Came across a situation today where allowing SendDelay to be reset whenever a new span arrives would be helpful. (Long trace, variable number of async actions which arrive frequently but over many seconds; it would help to have a debounce model where the trace sends once spans stop arriving.)

VinozzZ · 2024-11-08T16:01:16Z

A user in the community just requested this feature because they set a high SendDelay for long-running jobs but would like to lower it for certain traffic.

kentquirk added the "type: enhancement" (New feature or request) label on Jun 4, 2024
