experimental/stats: Expose Telemetry Label Callback #8877

seth-epps · 2026-02-02T22:22:45Z

Expose a new experimental API for registering a telemetry label callback function.

Some clients may not be instrumented with OpenTelemetry, which prevents valuable information from being propagated to stats handlers. This change lets such clients collect otel labels by registering a label callback on the context and gathering the information themselves in their stats handlers.

RELEASE NOTES:

experimental/stats: Expose Telemetry Label Callback

codecov · 2026-02-03T14:46:30Z

Codecov Report

❌ Patch coverage is 88.88889% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 83.25%. Comparing base (49e224f) to head (fe8c628).
⚠️ Report is 8 commits behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| experimental/stats/telemetry.go | 83.33% | 1 Missing and 1 partial ⚠️ |
| internal/xds/balancer/clusterimpl/picker.go | 87.50% | 0 Missing and 1 partial ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #8877      +/-   ##
==========================================
+ Coverage   83.15%   83.25%   +0.10%     
==========================================
  Files         414      415       +1     
  Lines       32751    32825      +74     
==========================================
+ Hits        27235    27330      +95     
+ Misses       4096     4076      -20     
+ Partials     1420     1419       -1

| Files with missing lines | Coverage Δ | |
|---|---|---|
| internal/stats/labels.go | `92.30% <100.00%> (-7.70%)` | ⬇️ |
| internal/xds/balancer/clusterimpl/picker.go | `76.82% <87.50%> (-18.30%)` | ⬇️ |
| experimental/stats/telemetry.go | `83.33% <83.33%> (ø)` | |

... and 25 files with indirect coverage changes

mbissa · 2026-02-05T04:31:32Z

internal/xds/balancer/clusterimpl/picker.go

 	}

+	estats.ExecuteTelemetryLabelCallback(info.Ctx, "grpc.lb.locality", xdsinternal.LocalityString(lID))
+	estats.ExecuteTelemetryLabelCallback(info.Ctx, "grpc.lb.backend_service", d.clusterName)


I know this solves your use case, and this looks good. But this forces the LB policy to know about the callbacks and can get fragmented over a period of time when labels are updated from multiple places.

Can we instead centralize this? When the central function to update telemetry state is called, it should be responsible for triggering any registered callbacks. This decouples the LB logic from the telemetry hook logic and ensures consistency across the library.

I agree that centralizing this would be better, but I was struggling to see how we could support it without a much more complex change, or at least one that doesn't interfere with the existing internal API, since these labels are currently only set if the OpenTelemetry labels are initialized (which requires otel instrumentation).

I wonder if we should encapsulate the telemetry label collection into its own component and then inject it here. That way we establish the pattern for label collection in a single place, and all future cases would just inject the same struct. I think a less invasive approach could be to just use a single function...

I took a slightly different approach in my change set that basically moves the update to the existing internal/stats package and accepts the full map to merge. This felt like the least intrusive way to make the change and it side-steps the Key/Value pairs as variadic arguments to the execute function.

fe8c628

Let me know what you think!

mbissa · 2026-02-05T04:34:02Z

experimental/stats/telemetry_test.go

+				tracker[key] = value
+			},
+			additionalLabels: map[string]string{"grpc.lb.backend_service": "grpc.lb.backend_service_other_val"},
+			wantLabels:       map[string]string{"grpc.lb.backend_service": "grpc.lb.backend_service_other_val", "grpc.lb.locality_val": "grpc.lb.locality_val"},


nit: should the key be "grpc.lb.locality" ?

fixed fe8c628

mbissa · 2026-02-05T04:36:14Z

experimental/stats/telemetry_test.go

+// TestTelemetryLabels tests registering a callback function with the context and
+// the effects of executing the callback on a local label state tracker. Each test
+// case constructs a new context with the provided callback registered.
+func (s) TestTelemetryLabels(t *testing.T) {


There are some end-to-end tests that check backend service label values in different scenarios. Could you modify them to validate that your scenario works end to end?

addressed in fe8c628

mbissa · 2026-02-05T04:49:05Z

experimental/stats/telemetry.go

+// key and value. If no callback is registered it does nothing.
+//
+// If the registered callback panics it will be swallowed and logged
+func ExecuteTelemetryLabelCallback(ctx context.Context, key, value string) {


nit: would it make more sense to have
func ExecuteTelemetryLabelCallback(ctx context.Context, kvs ...string)
so that we can batch multiple key/value callbacks in one loop and have the defer only once?

I think that's a good suggestion

I wonder if there's value in encoding the data into a struct to make it a touch more extensible later on. Something along these lines

type Label struct {
	Key   string
	Value string
}

func ExecuteTelemetryLabelCallback(ctx context.Context, labels ...Label) {
	// ...unpack and callback
}

As mentioned in the other comment, I took a slightly different approach when looking at how we merge the labels into the stats label key: passing an updates map (fe8c628).

Let me know if this feels more ergonomic

mbissa · 2026-02-05T04:51:44Z

experimental/stats/telemetry.go

+type telemetryLabelCallbackKey struct{}
+
+// WithTelemetryLabelCallback registers a callback function that is executed whenever
+// telemetry labels will be updated. This does _not_ require opentelemetry instrumentation


We should maybe add a warning to the doc string that this callback is intended to execute in the RPC hot path, and that users need to be careful of the performance impact when ExecuteTelemetryLabelCallback is eventually invoked.

addressed in fe8c628

mbissa

LGTM, sending for second review.

[i-8682][WIP] expose telemetry label callback

6dda7c7

seth-epps force-pushed the i-8682/expose-recorder branch from 09e6612 to 6dda7c7 Compare February 2, 2026 22:27

seth-epps mentioned this pull request Feb 2, 2026

Expose the labels package to use with custom stats handler #8682

Open

arjan-bal requested a review from mbissa February 3, 2026 07:44

arjan-bal added the Type: Feature New features or improvements in behavior label Feb 3, 2026

arjan-bal assigned mbissa Feb 3, 2026

arjan-bal added this to the 1.80 Release milestone Feb 3, 2026

vet

12bf10e

mbissa reviewed Feb 5, 2026

mbissa assigned seth-epps and unassigned mbissa Feb 5, 2026

feedback

fe8c628

mbissa approved these changes Feb 6, 2026

mbissa requested review from arjan-bal and dfawley February 6, 2026 08:37

mbissa assigned arjan-bal and dfawley and unassigned seth-epps Feb 6, 2026

arjan-bal assigned mbissa and unassigned dfawley and arjan-bal Feb 10, 2026
