2602.05762

Total: 1

#1 RocqSmith: Can Automatic Optimization Forge Better Proof Agents? [PDF] [Copy] [Kimi2] [REL]

Authors: Andrei Kozyrev, Nikita Khramov, Denis Lochmelis, Valerio Morelli, Gleb Solovev, Anton Podkopaev

This work studies the applicability of automatic AI agent optimization methods to real-world agents in formal verification settings, focusing on automated theorem proving in Rocq as a representative and challenging domain. We evaluate how different automatic agent optimizers perform when applied to the task of optimizing a Rocq proof-generation agent, and assess whether parts of the fine-grained tuning of agentic systems, such as prompt design, contextual knowledge, and control strategies, can be automated. Our results show that while several optimizers yield measurable improvements, simple few-shot bootstrapping is the most consistently effective; however, none of the studied methods matches the performance of a carefully engineered state-of-the-art proof agent.

Subjects: Artificial Intelligence , Machine Learning , Logic in Computer Science , Software Engineering

Publish: 2026-02-05 15:28:26 UTC