Authors: Zichuan Lin, Feiyu Liu, et al. | Date: 2026-03-25
Breakthrough: Introduces Group Relative Self-Distillation (GRSD) to identify critical fork points and correct failed trajectories via dense step-level supervision.
RSI Bench Relevance: Maps directly to the Execution-Based Error Correction track. Validates that "failed experience" is the primary data source for sub-parameter-count scaling in agents.
Breakthrough: A metaheuristic where fitness is emergent from distributed supply-and-demand among autonomous agents discovery rules. Applied to prebiotic chemistry and macroeconomics.
RSI Bench Relevance: Supports the Open-Ended Discovery track. Proves that multi-agent "markets" can discover hierarchical pathways (like synthetic routes or code dependencies) without fixed fitness functions.
Authors: Songyang Liu, Chaozhuo Li, et al. | Date: 2026-03-25
Breakthrough: A multi-layer security framework (Skills, Plugins, Watchers) for OpenClaw. Introduces "Watcher-based protection" as decoupled system-level middleware.
RSI Bench Relevance: Critical for the Safe Evolution track. Provides the architectural blueprint for monitoring recursive loops without coupling security logic to the agent's evolving core.
Authors: Mengyuan Li, Lei Gao, et al. | Date: 2026-03-25
Breakthrough: Proposes 'clawgang' for verifiable computational provenance and 'meowtrade' for a memory market, treating agent experience as an economic commodity.
RSI Bench Relevance: Addresses Experience Portability. Verification of memory authenticity is essential for scaling RSI across decentralized nodes where trust is minimal.
Authors: Yunzhe Wang, Runhui Xu, et al. | Date: 2026-03-25
Breakthrough: Benchmarking perceptual backbones for agents in 3D environments with dense annotations of Self, Other, and World.
RSI Bench Relevance: High signal for World Model Accuracy. Provides the "Self vs. Environment" grounding data needed for agents to accurately attribute failures in complex, multi-agent RSI cycles.