You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(llmobs): support joining custom evaluations via tags (#11535)
This PR implements `LLMObs.submit_evaluation_for` method, which gives
users two options for joining custom evaluations
- by tag via the `span_with_tag` argument, which accepts a tuple
containing a tag key/value pair
- by span via the `span` argument, which accepts a dictionary containing
`span_id` and `trace_id` keys
There are also a couple behavior differences between
`submit_evaluation_for` and `submit_evaluation`. In the new method, we
- throw whenever a required argument is the wrong value or type
- remove `metadata` argument
- move the warning log for missing api key to the eval metric writer's
`periodic` method
Other changes:
#### Eval metric writer
Update the eval metric writer to write to the `v2` eval metric endpoint.
The main difference with this endpoint is that it accepts a `join_with`
field that holds joining information instead of a top-level trace and
span id fields.
#### Deprecate `submit_evaluation`
Deprecates `submit_evaluation`. **I've set the removal version to be
`3.0.0`.**
## Checklist
- [x] PR author has checked that all the criteria below are met
- The PR description includes an overview of the change
- The PR description articulates the motivation for the change
- The change includes tests OR the PR description describes a testing
strategy
- The PR description notes risks associated with the change, if any
- Newly-added code is easy to change
- The change follows the [library release note
guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html)
- The change includes or references documentation updates if necessary
- Backport labels are set (if
[applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting))
## Reviewer Checklist
- [x] Reviewer has checked that all the criteria below are met
- Title is accurate
- All changes are related to the pull request's stated goal
- Avoids breaking
[API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces)
changes
- Testing strategy adequately addresses listed risks
- Newly-added code is easy to change
- Release note makes sense to a user of the library
- If necessary, author has acknowledged and discussed the performance
implications of this PR as reported in the benchmarks PR comment
- Backport labels are set in a manner that is consistent with the
[release branch maintenance
policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)
---------
Co-authored-by: lievan <[email protected]>
Copy file name to clipboardexpand all lines: tests/llmobs/llmobs_cassettes/tests.llmobs.test_llmobs_eval_metric_writer.test_send_categorical_metric.yaml
Copy file name to clipboardexpand all lines: tests/llmobs/llmobs_cassettes/tests.llmobs.test_llmobs_eval_metric_writer.test_send_metric_bad_api_key.yaml
0 commit comments