MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling
Problem: The paper addresses limitations of existing mathematical proof systems, particularly in competition-level settings, where the ability to generate, verify, and refine proofs is critical. Prior works have not...