ArXiv ID: 2605.09998
Summary: This paper introduces the concept of a 'reset-free' self-improving harness. Instead of discrete training and inference phases, Continual Harness allows embodied agents to adapt online. The agent autonomously alternates between environment interaction and meta-optimization (refining its prompt, sub-agents, and skills) based on its own trajectory data.