Skip to content

[Feature]: Run Evals for Harness agent #6684

Description

@alliscode

Description

Run extensive competitive evals for our agent harness.

Code Sample

Language/SDK

Both

Metadata

Metadata

Assignees

Labels

No labels
No labels
No fields configured for Feature.

Projects

Status
No status

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions