Awesome RSI Papers

A curated list of Recursive Self-Improvement & LLM Agent logic research.
核心基石 / Core Foundations
Beyond Refusal: Probing the Limits of Agentic Self-Correction
ArXiv: 2602.21496 | Feb 2027 | Safety RSI

Solving the reasoning paradox in sensitive information leaks via iterative agentic rewriting and critique loops.

RSI Alignment
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
ArXiv: 2602.21320 | Feb 2026 | Zero-Shot Evolution

Generator-Solver self-play framework demonstrating bootstrapping of complex tool-calling capabilities without external expert demonstrations.

RSI Tool-Use
SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards
ArXiv: 2602.21158 | Feb 2026 | Exploration Scaling

Establishing dense reward signals from token-level uncertainty to enable efficient self-evolution in sparse-feedback environments.

RSI RL
ICLR 2026 Workshop on Recursive Self-Improvement
ICLR 2026 | Feb 2026 | Milestone Workshop

Bringing together global researchers to define principled methods, system designs, and evaluations for RSI across omni-models, multimodal agents, and robotics.

RSI Design
Recursive Sketched Interpolation (RSI) for Tensor Trains
ArXiv: 2602.xxxx | Feb 2026 | Technical Optimization

Scaling high-dimensional tensor computations via recursive sketched interpolation for adaptive AI systems.

RSI Optimization
TAPE: Tool-Guided Adaptive Planning and Constrained Execution
ArXiv: 2602.19633 | Feb 2026 | Research Insight

Solving irreversible failure in agentic workflows via multi-plan aggregation and adaptive re-planning.

RSI Planning
SkillOrchestra: Skill-Aware Orchestration for Multi-Agent Systems
ArXiv: 2602.19672 | Feb 2026 | Research Insight

Scaling compound AI systems through skill modeling instead of expensive end-to-end RL routing.

RSI Orchestration
R-Agent: Recursive Planning for Complex Tasks
ArXiv: 2602.18201 | Feb 2026 | Research Insight

Establishing dynamic recursive task trees for long-horizon decision making and self-correction.

RSI Planning
DeepMind Aletheia: Autonomous Research Singularity
DeepMind | Feb 2026 | Research Insight

Gemini 3 Deep Think hits 84.6% on ARC-AGI-2; Aletheia agent publishes autonomous math research.

RSI Agent Math
Self-Evolving Recommendation Systems
ArXiv: 2602.10226 | Research Insight 2026

End-to-end autonomous model optimization using LLM agents for large-scale production systems.

RSI Production
DeepMind Aletheia: Autonomous Research Singularity
DeepMind Blog | Feb 2026 | Research Insight

100x compute reduction and 95.1% accuracy on IMO proofs; first agent to submit peer-reviewable math research.

RSI Reasoning
A Self-Improving Coding Agent
ArXiv: 2504.15228 | Research Insight 2025

Scaling coding performance from 17% to 53% on SWE-bench via recursive loops.

RSI Coding
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
ArXiv: 2310.03714 | Stanford University

The paradigm shift from prompting to programming. Introduces teleprompters and optimizers for LM programs.

RSI Optimization
RLM: Reinforcement Learning for Logic Model Optimization
ArXiv: 2512.24601 | DeepMind/Google

Establishing the theoretical bounds of self-correcting logic chains using sparse rewards.

RSI Logic
自我演化 / Self-Evolution
Test-time Recursive Thinking (TRT): Self-Improvement without External Feedback
ArXiv: 2602.03094 | Feb 2026

Proving LLMs can self-improve at test-time via recursive search, self-verification, and strategy accumulation without external ground-truth.

RSI Test-time Scaling
Gödel Agent: A Self-Referential Agent Framework
ArXiv: 2410.04444 | PKU/UCSB

Inspired by Godel machines, this framework allows agents to rewrite their own logic and optimization routines.

RSI Self-Referential
LLMs Can Easily Learn to Reason from Demonstrations
ArXiv: 2502.07374 | Berkeley/Stanford

Crucial finding that Long CoT structure matters more than content for eliciting reasoning capabilities.

RSI Structure
ATLAS: Adaptive Self-Evolutionary Research Agent
ArXiv: 2602.02709 | Feb 2026

Distributed multi-LLM supporter layer and adaptive fine-tuning for autonomous SciML research.

RSI ResearchAgent
AgentDevel: Agent Evolution as Release Engineering
ArXiv: 2601.04620 | Jan 2026

Reframing RSI as a controlled release engineering pipeline with flip-centered gating.

RSI Engineering
MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory
ArXiv: 2601.03192 | New Research 2026

Non-parametric RL on episodic memory for zero-fine-tuning runtime agent evolution.

RSI EpisodicMemory
Self-Improving Pretraining with RL Feedback
ArXiv: 2601.21343 | Research Collective

Moving RSI from fine-tuning into the pre-training phase via synthetic logic referees.

RSI Pre-training
Agent 架构 / Agent Architectures
Execution Grounding in Agentic RSI
ArXiv: 2601.14525 | OpenCode Project

Using code execution environments as the primary ground-truth signal for agent evolution.

Agents Execution
领域落地 / Vertical RSI Applications
RSIDiff: Self-Evolving Diffusion Models
ArXiv: 2502.09963 | Feb 2025

Establishing RSI in the visual domain through recursive fine-tuning on self-generated image data.

RSI Diffusion
REDSearcher: Scalable Framework for Long-Horizon Search Agents
ArXiv: 2602.14234 | Feb 2026

Scaling agent search capabilities through multimodal tool integration and dynamic planning.

Search Long-Horizon
RSIR: Recursive Self-Improving Recommendation
ArXiv: 2602.15659 | Feb 2026

Scaling recommendation models via fidelity-controlled self-improving loops. A model-agnostic approach to data sparsity.

RSI RecSys