COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Artificial Intelligence Research Group Talks (Computer Laboratory) > Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal ProofsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Mateja Jamnik. Join us in Lecture Theatre 2 or on Zoom The formalization of existing mathematical proofs is a notoriously difficult process. Despite decades of research on automation and proof assistants, writing formal proofs remains arduous and only accessible to a few experts. While previous studies to automate formalization focused on powerful search algorithms, no attempts were made to take advantage of available informal proofs. In this work, we introduce Draft, Sketch, and Prove (DSP), a method that maps informal proofs to formal proof sketches, and uses the sketches to guide an automated prover by directing its search to easier sub-problems. We investigate two relevant setups where informal proofs are either written by humans or generated by a language model. Our experiments and ablation studies show that large language models are able to produce well-structured formal sketches that follow the same reasoning steps as the informal proofs. Guiding an automated prover with these sketches enhances its performance from 20.9% to 39.3% on a collection of mathematical competition problems. This talk is part of the Artificial Intelligence Research Group Talks (Computer Laboratory) series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsWorkshop in Microeconomics Galaxies Discussion Group Engineering - Mechanics, Materials and Design (Div C) - talks and eventsOther talksA Respectable Living and Women’s Work, England, 1270-1860 Networking Reception Roundtable on Jessica Patterson’s (Cambridge) Book, 'Religion, Enlightenment and Empire' Insights into cortical organisation and neurodevelopment from a genome wide association study (GWAS) of 2,347 structural cortical phenotypes CANCELLED - TO BE RE-SCHEDULED - Border town: Tharros and the Roman Frontier in Sardinia Broad community development of the Unified Forecast System, using the WAVEWATCH III™ wind model as an example |