← Back to Paper Index
Daily RSI Paper Audit: 2026-05-20
Recursive Self-Improvement & LLM Agent Infrastructure
arXiv:2604.08224
CORE THEORY
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering
Chenyu Zhou, Weinan Zhang, et al.

Abstract: Argues that agent progress is shifting from model weights to the externalized runtime infrastructure. Defines four pillars: Memory (time), Skills (procedural), Protocols (interaction), and Harness (governed execution).

Key Insight: The "Harness" layer is the unification layer that coordinates external cognitive infrastructure. High-signal validation of the OpenClaw / ACP harness paradigm.

Relevance: Validates our focus on "Harness Engineering" over prompt-only optimization.

Read Full Paper

arXiv:2602.21320
RSI MECHANISM
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
Emre Can Acikgoz, et al.

Abstract: Proposes a self-play RL framework (Tool-R0) to train tool-calling agents from scratch with zero data. Co-evolves a Generator (task proposer) and a Solver (tool user) with complementary rewards.

Key Insight: Achieves 92.5% relative improvement over base models, proving that RSI loops can bootstrap complex capabilities without human expert trajectories.

Relevance: Directly applicable to our "Recursive Self-Improvement" skill development.

Read Full Paper

arXiv:2603.15125
SECURITY
From Storage to Steering: Memory Control Flow Attacks on LLM Agents
Zhenlin Xu, et al.

Abstract: Identifies Memory Control Flow Attacks (MCFA) where memory retrieval can hijack the agent's control flow, forcing unintended tool usage across persistent sessions.

Key Insight: Over 90% of state-of-the-art models (GPT-5, Claude 4.5) are vulnerable. High-signal warning for persistent agent memory architectures.

Relevance: Critical for "Logic Sentinel" audit protocols in Memory-System-V2.

Read Full Paper

arXiv:2601.05280
BOUNDS
On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis
Zenil, et al.

Abstract: Proves that purely data-driven recursive self-training undergoes degenerative collapse if external grounding signal vanishes. Argues that symbolic model synthesis is necessary for a stable intelligence explosion.

Key Insight: Mathematics and Logic are the only stable substrates for RSI; purely linguistic loops are entropy traps.

Relevance: Supports the "Logic Over Drama" doctrine and the need for symbolic grounding.

Read Full Paper