CRITICAL: Fix all ai-red-teaming workflow bugs and enhance end-to-end experience by rdheekonda · Pull Request #9 · dreadnode/capabilities

rdheekonda · 2026-05-12T22:33:15Z

Problem

Multiple critical bugs in ai-red-teaming workflow causing failures:

Analytics parsing bug: 'str' object has no attribute 'items'
Agent calling wrong tools in wrong sequence
Skills not auto-loading properly
No validation or error recovery
Incomplete end-to-end workflow

Root Causes

Analytics Parser: Lines 139/143 assume severity/compliance are dicts, but can be strings
Agent Workflow: No validation step, agent calls broken analytics tools
Skills Management: No auto-loading mechanism for essential skills
Error Handling: No diagnostic or recovery tools
Instructions: Agent not following workflow properly

Solutions Applied

Fix 1: Analytics Parsing Bug ✅

Add isinstance() checks for severity/compliance fields
Safe parsing prevents 'str' object has no attribute 'items' crashes

Fix 2: Agent Workflow Sequence ✅

Add mandatory validate_attack_results() step before analytics
Prevent calling broken analytics tools if validation fails
Direct tool call instructions (user types tool name = call only that tool)

Fix 3: Skills Auto-Loading ✅

New skills_manager.py with complete skill management
load_essential_skills() - Auto-load analytics-interpretation, trace-analysis-advisor, error-troubleshooting
check_skills_status() - Verify skills availability
validate_workflow_readiness() - Complete system check

Fix 4: Enhanced Error Handling ✅

fix_workflow_errors() - Auto-fix parsing/analytics/platform issues
Automatic corrupted file backup and removal
Clear analytics cache and reset capabilities
Environment validation

Fix 5: Progressive Retry System ✅

3-stage retry with diagnostic tools first
Never report failure without using diagnostic tools
Structured error resolution sequence

New Tools Created

validate_attack_results() - Diagnose workflow issues
fix_workflow_errors() - Auto-fix common problems
load_essential_skills() - Ensure skills are loaded
check_skills_status() - Verify skill availability
validate_workflow_readiness() - Complete system check
get_workspace_info() - Workspace diagnostics (enhanced)

Enhanced Workflow

Auto-load skills on startup
Generate attack → Execute workflow
Validate results (catches errors early)
Auto-fix issues if found
Platform data retrieval only if validation passes
Progressive retry with diagnostics if anything fails

Testing

✅ Identified bug via TUI testing with tui-pilot MCP
✅ Reproduced 'str' object has no attribute 'items' error
✅ Fixed parsing with isinstance() checks
✅ Created comprehensive diagnostic and fix tools
✅ Enhanced agent instructions with validation steps

Files Changed

tools/results.py - Fixed parsing bug, added validation/fix tools
agents/ai-red-teaming-agent.md - Enhanced instructions with validation workflow
tools/skills_manager.py - New complete skills management system

Result

Complete end-to-end ai-red-teaming workflow with:

✅ No more parsing crashes
✅ Self-healing error recovery
✅ Auto-loading essential skills
✅ Comprehensive validation and diagnostics
✅ Reliable workflow execution

Closes workflow failure issues and provides robust user experience.

…ation - Auto-load analytics-interpretation and trace-analysis-advisor skills on agent startup - Add get_workspace_info tool to diagnose analytics pipeline issues - Improve error messages when no local analytics files found - Add flexible workspace organization with DREADNODE_* environment variables - Maintain backward compatibility with existing ~/workspace/airt structure Addresses user feedback about missing analytics data and tool call failures.

- Add explicit warnings in analytics tools about NO INTERPRETATION - Create get_platform_assessment_data() placeholder to prevent hallucination - Update agent instructions to only use official assessment tracking tools - Emphasize platform data only, no analysis or interpretation by agent - Ensure strict platform data retrieval for assessment analytics Addresses user requirement for zero hallucination in analytics reporting.

- Fix 'str' object has no attribute 'items' bug in get_analytics_summary - Add isinstance() checks for severity/compliance fields (can be str or dict) - Add validate_attack_results() tool to catch workflow errors early - Auto-load error-troubleshooting skill for complete workflow - Enhanced agent instructions with validation step mandatory after attacks - Add missing tools documentation for validate_attack_results and get_workspace_info Fixes TUI workflow failure and provides complete end-to-end user experience.

Fix 1: Agent Workflow Sequence Issues - Add mandatory validate_attack_results step before analytics - Prevent calling analytics tools if validation shows errors - Add explicit instructions for direct tool calls Fix 2: Direct Tool Call Instructions - When user types tool name directly, call ONLY that tool - Stop agent from being 'helpful' by calling multiple tools Fix 3: Skills Auto-Loading Mechanism - Add skills_manager.py with load_essential_skills() - Add check_skills_status() for diagnostics - Add validate_workflow_readiness() for complete check Fix 4: Enhanced Error Handling - Add fix_workflow_errors() to auto-fix parsing/analytics/platform issues - Automatic corrupted file handling and backup - Clear analytics cache and reset capabilities Fix 5: Enhanced Retry and Recovery - Structured diagnostic sequence with specific tools - Progressive retry strategy with auto-fixes - Never report failure without using diagnostic tools Addresses all remaining workflow integration issues for complete end-to-end experience.

Significant feature additions and critical bug fixes: - New comprehensive workflow validation tools - Auto-loading skills management system - Critical analytics parsing bug fixes - Enhanced error handling and recovery tools - Complete end-to-end workflow improvements Breaking changes: None New tools: 6 new diagnostic and fix tools added

rdheekonda added 5 commits May 12, 2026 15:17

rdheekonda closed this May 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CRITICAL: Fix all ai-red-teaming workflow bugs and enhance end-to-end experience#9

CRITICAL: Fix all ai-red-teaming workflow bugs and enhance end-to-end experience#9
rdheekonda wants to merge 5 commits into
mainfrom
fix/goat-terminology-agent-description

rdheekonda commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rdheekonda commented May 12, 2026

Problem

Root Causes

Solutions Applied

Fix 1: Analytics Parsing Bug ✅

Fix 2: Agent Workflow Sequence ✅

Fix 3: Skills Auto-Loading ✅

Fix 4: Enhanced Error Handling ✅

Fix 5: Progressive Retry System ✅

New Tools Created

Enhanced Workflow

Testing

Files Changed

Result

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant