[Evaluator] Add component #1375

Guikingone · 2026-01-12T09:58:10Z

Q	A
Bug fix?	no
New feature?	yes
Docs?	yes
Issues	--
License	MIT

This PR (as a POC for now) introduce the Evaluator component, this component is mainly used to evaluate and add a score to output from platforms / agents, I moved it to a new directory as a mirror of the Validator component, the goal is to allows to add new scorers without impacting the platforms (even if they can define their own scorers).

The PR is built around the following concepts:

An Evaluator that receive Scorers and compute the final score
ScorerInterface implementations that defines score using score method (could be improved, I'm not locked on the name)
An AbstractScorer used to define a reason (mostly used for "LLM as judge" scorers) for a score
An implementation on AiBundle via the Profiler and a subscriber (no configuration for now).

OskarStark · 2026-01-12T10:05:42Z

splitsh.json

        "ai-redis-message-store": "src/chat/src/Bridge/Redis",
        "ai-surreal-db-message-store": "src/chat/src/Bridge/SurrealDb",
+        "ai-evaluator": {
+            "prefixes": [{ "from": "src/evaluator", "to": "", "excludes": ["src/Bridge"] }]


Suggested change

"prefixes": [{ "from": "src/evaluator", "to": "", "excludes": ["src/Bridge"] }]

"prefixes": [{ "from": "src/evaluator", "to": "" }]

not sure, as we don't have a bridge folder

Oops, wrong copy/paste 😅

OskarStark · 2026-01-12T10:06:45Z

Does this need to be a dedicated component? If not, where would it fit best? Platform or Agent?

Guikingone · 2026-01-12T10:11:28Z

Does this need to be a dedicated component?

Open to debate and I don't have a strong opinion on this one, the first version was a sub-directory of Platform but as PR's tend to stay "simple" and once complexity is added, we tend to move it as "bridge", I preferred to move it as a new component to keep clean the Platform one.

If it needs to be moved to an existing one, I would say Platform, we don't depends on Agent.

aszenz · 2026-01-12T13:55:47Z

Does this need to be a dedicated component?

Open to debate and I don't have a strong opinion on this one, the first version was a sub-directory of Platform but as PR's tend to stay "simple" and once complexity is added, we tend to move it as "bridge", I preferred to move it as a new component to keep clean the Platform one.

If it needs to be moved to an existing one, I would say Platform, we don't depends on Agent.

Can it be integrated into PHPUnit, i think it kind of provides testing utilities for the platform component so it should ideally be in platform component itself

Guikingone · 2026-01-12T16:26:10Z

Can it be integrated into PHPUnit

Like Mailer / Messenger assertions? Of course 🙂

chr-hertel · 2026-01-12T22:09:34Z

First of all: yes we need this! i wonder if it's too early tho ...

To me this is rather a standalone component, yes, but the integration aspect with PHPUnit is interesting, but not sure we want to do this - i might want to use evaluations not only in my test suite 🤔

For me it's too early since i feel like putting something on top of agent/chat without them being stable enough - and i would rather focus on that first - that shouldn't keep you from exploring the field here ofc. it's just another thing in parallel and i'm a bit worried about too many moving parts tbh.

what's your blueprint for tackling this? do you use some kind of reference?
i recall discussing this briefly with @tgalopin in Amsterdam - was it dspy or pydantic? or sth else?

feat(core): Evaluator component

96c8a1e

Guikingone force-pushed the src/evaluator branch from 28290b4 to 96c8a1e Compare January 12, 2026 09:58

ref

270ce6a

OskarStark reviewed Jan 12, 2026

View reviewed changes

OskarStark changed the title ~~[Core] Introduce the Evaluator component~~ [Evaluator] Add component Jan 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Evaluator] Add component #1375

[Evaluator] Add component #1375

Uh oh!

Guikingone commented Jan 12, 2026

Uh oh!

OskarStark Jan 12, 2026

Uh oh!

Guikingone Jan 12, 2026

Uh oh!

OskarStark commented Jan 12, 2026

Uh oh!

Guikingone commented Jan 12, 2026

Uh oh!

aszenz commented Jan 12, 2026

Uh oh!

Guikingone commented Jan 12, 2026

Uh oh!

chr-hertel commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	"prefixes": [{ "from": "src/evaluator", "to": "", "excludes": ["src/Bridge"] }]
	"prefixes": [{ "from": "src/evaluator", "to": "" }]

Uh oh!

[Evaluator] Add component #1375

Are you sure you want to change the base?

[Evaluator] Add component #1375

Uh oh!

Conversation

Guikingone commented Jan 12, 2026

Uh oh!

OskarStark Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Guikingone Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

OskarStark commented Jan 12, 2026

Uh oh!

Guikingone commented Jan 12, 2026

Uh oh!

aszenz commented Jan 12, 2026

Uh oh!

Guikingone commented Jan 12, 2026

Uh oh!

chr-hertel commented Jan 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants