— Ch. 1 · The Seed Improver Concept —
Recursive self-improvement.
~3 min read · Ch. 1 of 5
Eliezer Yudkowsky coined the term "Seed AI" to describe a foundational framework for recursive self-improvement. This architecture equips an artificial general intelligence system with initial capabilities required to rewrite its own code. Human engineers develop this initial code-base to program software that can plan, read, write, compile, test, and execute arbitrary instructions. The system maintains original goals while performing validations to ensure abilities do not degrade over iterations. A goal-following autonomous agent continuously learns and adapts itself to become more efficient in achieving objectives. Recursive self-prompting loops enable the large language model to iterate on tasks through execution cycles. Basic programming capabilities allow the system to modify its own algorithms and codebase. Validation protocols form a suite of tests ensuring the agent does not regress or derail itself during development.
Evolutionary Coding Experiments
In 2023, the Voyager agent learned diverse tasks in Minecraft by iteratively prompting a large language model for code. It refined this code based on feedback from the game environment and stored working programs in an expanding skills library. Researchers proposed the STOP framework in 2024 as a Self-Taught OPtimizer where a scaffolding program recursively improves itself using a fixed large language model. Meta AI conducted research on large language models capable of self-improvement through their work on Self-Rewarding Language Models. These studies examine how agents receive super-human feedback within training processes. Google DeepMind unveiled AlphaEvolve in May 2025 as an evolutionary coding agent that uses a large language model to design and optimize algorithms. Starting with initial algorithms and performance metrics, AlphaEvolve repeatedly mutates or combines existing algorithms to generate new candidates. The system selects the most promising candidates for further iterations while making algorithmic discoveries. A key limitation remains the need for automated evaluation functions to drive the process forward.