Open
Description
I ran
agentv eval evals/cargowise/cw-refvessel.eval.yaml --test-id rule1-antipattern-firstordefault
The reported score was 1, and no error or warning was shown.
However, results.jsonl contains this excerpt, in which one assertion failed with a grader parse error:
test_id: rule1-antipattern-firstordefault
eval_set: cw-refvessel.eval
score: 1
assertions:
- text: "Grader parse failure after 3 attempts: Failed to parse evaluator response after 3 attempts: Resource not found"
  passed: false
- text: 'Skill tool invoked via tool name "Using skill: cw-refvessel"'
  passed: true
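To check whether other runs are affected, something like the following could flag results that report a score of 1 while containing a failed assertion. This is only a rough sketch: it assumes results.jsonl holds one JSON object per line with the `test_id`, `score`, `assertions`, `text`, and `passed` fields shown in the excerpt above, and `find_inconsistent` is a hypothetical helper, not part of agentv.

```python
import json
from typing import Iterable, Iterator


def find_inconsistent(lines: Iterable[str]) -> Iterator[dict]:
    """Yield records whose score is 1 although at least one assertion failed.

    Assumes each line is a JSON object shaped like the excerpt above.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        failed = [
            a.get("text", "")
            for a in record.get("assertions", [])
            if a.get("passed") is False
        ]
        if record.get("score") == 1 and failed:
            yield {"test_id": record.get("test_id"), "failed": failed}


# Sample record mirroring the excerpt from this issue.
sample = json.dumps({
    "test_id": "rule1-antipattern-firstordefault",
    "eval_set": "cw-refvessel.eval",
    "score": 1,
    "assertions": [
        {
            "text": "Grader parse failure after 3 attempts: Failed to parse "
                    "evaluator response after 3 attempts: Resource not found",
            "passed": False,
        },
        {
            "text": 'Skill tool invoked via tool name "Using skill: cw-refvessel"',
            "passed": True,
        },
    ],
})

hits = list(find_inconsistent([sample]))
for hit in hits:
    print(hit["test_id"], "has", len(hit["failed"]), "failed assertion(s)")
```

Against a real file, the same function could be called with an open file handle, e.g. `find_inconsistent(open("results.jsonl"))`, assuming the file layout matches the assumption above.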