Skip to content

Conversation

@axiomofjoy
Copy link
Contributor

@axiomofjoy axiomofjoy commented Feb 5, 2026

Note

Medium Risk
Touches core playground LLM execution/tracing paths and span persistence, so regressions could affect observability data and experiment runs across providers; changes are mostly refactor/instrumentation but broad in surface area.

Overview
Routes playground LLM tracing through the shared Tracer/OpenTelemetry span pipeline instead of the bespoke streaming_llm_span implementation, including persisting spans via Tracer.save_db_traces() for both chat completions and dataset experiment runs.

Unifies message handling by replacing tuple-based chat messages with a typed PlaygroundMessage dict (plus helpers), and centralizes OpenInference attribute emission (input/output, tools, invocation params with sensitive-key filtering) inside PlaygroundStreamingClient.chat_completion_create(); call sites (chat_mutations, subscriptions, evaluators) now pass a tracer into clients.

Updates dependency pins (notably anthropic and arize) and tightens type-check requirements by pinning OpenTelemetry package versions; adjusts and adds tests/cassettes to validate the new span name (ChatCompletion) and recorded attributes (including request URL, input payload, tools, and invocation parameters).

Written by Cursor Bugbot for commit 8432b71. This will update automatically on new commits. Configure here.

@github-project-automation github-project-automation bot moved this to 📘 Todo in phoenix Feb 5, 2026
@axiomofjoy axiomofjoy force-pushed the xander/playground-traces-v2 branch 7 times, most recently from 7848e72 to 533c979 Compare February 10, 2026 18:21
@axiomofjoy axiomofjoy marked this pull request as ready for review February 10, 2026 18:41
@axiomofjoy axiomofjoy requested a review from a team as a code owner February 10, 2026 18:41
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Feb 10, 2026
@axiomofjoy axiomofjoy requested a review from a team as a code owner February 10, 2026 20:09
@axiomofjoy axiomofjoy force-pushed the xander/playground-traces-v2 branch from 533c979 to 0f8e96f Compare February 10, 2026 20:13
Copy link
Contributor

@anticorrelator anticorrelator left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good! I think the main thing is that some evaluator codepaths were removed

Copy link
Contributor

@anticorrelator anticorrelator left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm after discussion

@github-project-automation github-project-automation bot moved this from 📘 Todo to 👍 Approved in phoenix Feb 10, 2026
@claude
Copy link

claude bot commented Feb 10, 2026

Code review

No issues found. Checked for bugs and CLAUDE.md compliance.

Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.

dataset_example_id=None,
repetition_number=repetition_number,
)
await results.put((tracer, repetition_number))
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Evaluator logic removed from single chat completion subscription

High Severity

The entire evaluator flow has been removed from _stream_single_chat_completion and the chat_completion subscription method no longer calls get_evaluators. The ChatCompletionInput schema still exposes an evaluators field, so clients can pass evaluators, but they're silently ignored. The chat_completion_over_dataset path still processes evaluators correctly, suggesting this removal from the single completion path was unintentional.

Additional Locations (1)

Fix in Cursor Fix in Web

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This was on purpose

@axiomofjoy axiomofjoy merged commit 15cecd4 into version-13 Feb 11, 2026
53 checks passed
@axiomofjoy axiomofjoy deleted the xander/playground-traces-v2 branch February 11, 2026 02:05
@github-project-automation github-project-automation bot moved this from 👍 Approved to ✅ Done in phoenix Feb 11, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:XXL This PR changes 1000+ lines, ignoring generated files.

Projects

Status: ✅ Done

Development

Successfully merging this pull request may close these issues.

2 participants