Back to AI Research

AI Research

DreamProver: Evolving Transferable Lemma Libraries... | AI Research

Key Takeaways

  • DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent DreamProver is an agentic framework designed to improve automated t...
  • We introduce DreamProver, an agentic framework that leverages a "wake-sleep" program induction paradigm to discover reusable lemmas for formal theorem proving.
  • Existing approaches either rely on fixed lemma libraries, which limit adaptability, or synthesize highly specific intermediate lemmas tailored to individual theorems, thereby lacking generality.
  • DreamProver addresses this gap through an iterative two-stage process.
  • In the wake stage, DreamProver attempts to prove theorems from a training set using the current lemma library while proposing new candidate lemmas.
Paper AbstractExpand

We introduce DreamProver, an agentic framework that leverages a "wake-sleep" program induction paradigm to discover reusable lemmas for formal theorem proving. Existing approaches either rely on fixed lemma libraries, which limit adaptability, or synthesize highly specific intermediate lemmas tailored to individual theorems, thereby lacking generality. DreamProver addresses this gap through an iterative two-stage process. In the wake stage, DreamProver attempts to prove theorems from a training set using the current lemma library while proposing new candidate lemmas. In the "sleep" stage, it abstracts, refines, and consolidates these candidates to compress and optimize the library. Through this alternating cycle, DreamProver progressively evolves a compact set of high-level, transferable lemmas that can be effectively used to prove unseen theorems in related domains. Experimental results demonstrate that DreamProver substantially improves proof success rates across a diverse set of mathematical benchmarks, while also producing more concise proofs and reducing computational cost.

DreamProver: Evolving Transferable Lemma Libraries via a Wake-Sleep Theorem-Proving Agent
DreamProver is an agentic framework designed to improve automated theorem proving by enabling AI to learn and reuse mathematical knowledge over time. While many existing systems treat every mathematical problem as a unique challenge, DreamProver mimics human mathematical practice by systematically building a library of reusable lemmas—intermediate proofs or "shortcuts"—that can be applied to solve future, unseen problems. By evolving this library through an iterative cycle, the system becomes more efficient and capable at solving complex mathematical tasks.

The Wake-Sleep Learning Cycle

The core of DreamProver is a two-stage "wake-sleep" process inspired by cognitive learning models. During the "wake" stage, the agent attempts to solve a set of training theorems. If it encounters a difficult problem, it uses recursive decomposition to break the theorem into smaller, manageable sub-problems. It then collects these successful intermediate steps as potential learning material.
In the "sleep" stage, the system processes these collected experiences. It clusters similar intermediate theorems based on their semantic meaning and abstracts them into generalized, reusable lemmas. To keep the library effective, the system prunes redundant or rarely used lemmas and verifies the new ones to ensure they are mathematically sound. This cycle allows the library to grow into a compact, high-quality repository of transferable knowledge.

Improving Efficiency and Performance

By relying on this evolving library rather than generating problem-specific solutions from scratch every time, DreamProver significantly outperforms existing state-of-the-art theorem provers. Experimental results across diverse fields—including number theory, inequalities, and combinatorics—show that DreamProver improves proof success rates by an average of 61% compared to previous agentic systems.
Beyond just solving more problems, the system also improves the quality of the proofs themselves. Because the agent can utilize its library of lemmas, it produces proofs that are more concise, reducing proof length by an average of 50%. This approach also lowers the computational cost, as the system requires 48% fewer tokens to generate its solutions, making it a more efficient tool for complex mathematical reasoning.

Key Advantages of the Approach

DreamProver addresses a major limitation in current AI theorem proving: the lack of persistent, transferable knowledge. By moving away from instance-specific proof generation, the framework creates a system that learns from its own history. The "forgetting" mechanism, which removes rarely used lemmas, ensures that the library remains compact and focused, preventing the system from becoming overwhelmed by irrelevant information. This combination of abstraction, verification, and strategic pruning allows DreamProver to maintain a high level of performance across both well-represented and underrepresented mathematical domains.

Comments (0)

No comments yet

Be the first to share your thoughts!