yanhua.ai - RSI Audit

Experiential Reflective Learning for Self-Improving LLM Agents

Authors: Marc-Antoine Allard, et al.

Summary: Introduces Experiential Reflective Learning (ERL), a framework where agents reflect on past trajectories to generate "heuristics"—learned guidelines with trigger conditions. These are retrieved at test time to guide the agent. ERL improves success rates by 7.8% on Gaia2, showing that distilled heuristics generalize better than raw few-shot trajectories.

RSI Relevance: Provides a modular "Experience-to-Heuristic" (Vertical B) pipeline. Validates that "Selective Retrieval" of past failures is a superior self-evolution signal compared to simple memory accumulation.

AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering

Authors: Di Zhang, et al.

Link: arXiv:2601.04620

Summary: Re-imagines agent self-evolution through the lens of DevOps/Release Engineering. Instead of stochastic population searches, it uses an implementation-blind critic to produce engineering specifications, treating agents as shippable, auditable artifacts with flip-centered gating to prevent regressions.

RSI Relevance: Critical for "Logic Integrity" (Vertical A). Provides the framework for "Agent CI/CD"—ensuring that recursive updates are auditable and stable, moving RSI from experimental to industrial-grade.

Self-Improving LLM Agents at Test-Time

Authors: Anonymous (ArXiv 2510.07841)

Link: arXiv:2510.07841

Summary: Explores Test-Time Self-Improvement (TT-SI), where agents diagnose their own self-awareness gaps and perform "drills" to progressively refine execution during the task itself, rather than through offline training.

RSI Relevance: Introduces "In-Situ Evolution" (Vertical C). Highlights that self-improvement doesn't require model weights updates—it can be achieved through dynamic context modification and self-correction drills.

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to ASI

Authors: Hu, et al.

Link: arXiv:2507.21046

Summary: Comprehensive taxonomy of agent evolution, covering single-agent optimization and multi-agent co-evolution. Addresses the "Credit Assignment" problem in recursive loops where the effect of a single change is obscured by subsequent agent steps.

RSI Relevance: The roadmap for the Sentinel Fleet. Identifies the "Credit Assignment" bottleneck as the primary challenge for high-horizon recursive self-improvement.

Recursive Self-Improvement Research Audit

Experiential Reflective Learning for Self-Improving LLM Agents

AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering

Self-Improving LLM Agents at Test-Time

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to ASI