Benchmark self-evolving Agent upon realistic large-scale file workspaces
benchmark dataset autonomous-agents ai-agents large-language-models llm file-dependencies workspace-learning
-
Updated
May 19, 2026 - Python