
First Neural Conjecturing Datasets and Experiments

Conference paper in: Intelligent Computer Mathematics (CICM 2020). Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12236).

Abstract

We describe several datasets and first experiments with creating conjectures by neural methods. The datasets are based on the Mizar Mathematical Library processed in several forms and the problems extracted from it by the MPTP system and proved by the E prover using the ENIGMA guidance. The conjecturing experiments use the Transformer architecture and in particular its GPT-2 implementation.
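As a rough illustration of the data preparation such experiments involve, a corpus of Mizar articles can be concatenated into a single plain-text GPT-2 training file with the model's `<|endoftext|>` token between articles. This is a minimal sketch under our own assumptions (the function name `build_training_text` and the header-line convention are hypothetical, not the authors' exact pipeline); only the separator token itself is standard GPT-2 usage:

```python
def build_training_text(articles):
    """Join article texts into one GPT-2 training corpus.

    `articles` maps article names to their text.  Articles are separated by
    GPT-2's end-of-text token so the model can learn article boundaries.
    Illustrative sketch only, not the authors' exact preprocessing.
    """
    sep = "<|endoftext|>"
    parts = []
    for name in sorted(articles):
        # Keep the article name as a lightweight header line.
        parts.append(name + "\n" + articles[name].strip())
    return ("\n" + sep + "\n").join(parts) + "\n"


corpus = build_training_text({
    "xboole_0": "theorem ... ;",   # placeholder contents, not real MML text
    "tarski":   "theorem ... ;",
})
print(corpus.count("<|endoftext|>"))  # one separator between the two articles
```

The resulting file can then be fed to any standard GPT-2 fine-tuning script; the delimiter choice only matters insofar as sampling can later be stopped at article boundaries.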


Notes

  1. http://karpathy.github.io/2015/05/21/rnn-effectiveness/

  2. http://aitp-conference.org/2019/abstract/AITP_2019_paper_27.pdf, http://aitp-conference.org/2020/abstract/paper_21.pdf

  3. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/

  4. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/datasets/mmlall.txt2

  5. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/datasets/html2.tar.gz

  6. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/datasets/prf2.tar.gz

  7. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/datasets/prf7.tar.gz

  8. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/samples/, http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/models/

  9. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/samples/premises/, http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/samples/html2/

  10. http://grid01.ciirc.cvut.cz:8000/

  11. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/samples/html2/00cardmizout1_t1

  12. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preds3.tar.gz

  13. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preds5.tar.gz

  14. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preds6.tar.gz

  15. We used E with a 6 s time limit and its auto-schedule mode for this initial check.

  16. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/xxreal_1.html#T48

  17. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/t48_xxreal_1___5

  18. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/t48_xxreal_1___5.out

  19. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preddatagpt1.out.tar.gz

  20. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preddatagpt1.tar.gz

  21. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/groupp_1.html#T10

  22. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/out4.tar.gz

  23. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/sincos10.html#T17

  24. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/t17_sincos10___1

  25. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/functor1.html#T9

  26. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/functor1.html#T7

  27. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preddata128.tar.gz

  28. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/preddata128.out.tar.gz

  29. http://grid01.ciirc.cvut.cz/~mptp/nn_conj20/results/t20_borsuk_3___7__1

  30. http://grid01.ciirc.cvut.cz/~mptp/7.13.01_4.181.1147/html/borsuk_3.html#R2
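The provability check described in note 15 can be sketched as a small wrapper around E. This is a hypothetical helper under our assumptions (that `eprover` is on the PATH and the conjecture has already been written out as a TPTP problem file); the `--auto-schedule` and `--cpu-limit` options and the `# SZS status ...` output lines are standard E behavior, but the sample status line below is illustrative, not a captured log:

```python
import subprocess


def szs_status(e_output):
    """Extract the SZS status (e.g. 'Theorem', 'CounterSatisfiable')
    from E prover output, or None if no status line is present."""
    for line in e_output.splitlines():
        # E reports results in standard SZS lines: "# SZS status Theorem"
        if "SZS status" in line:
            return line.split("SZS status", 1)[1].split()[0]
    return None


def check_conjecture(problem_file, timeout=6):
    """Run E on a TPTP problem with the settings from note 15
    (auto-schedule, 6 s CPU limit) and return the SZS status."""
    run = subprocess.run(
        ["eprover", "--auto-schedule", "--cpu-limit=%d" % timeout,
         problem_file],
        capture_output=True, text=True)
    return szs_status(run.stdout)


if __name__ == "__main__":
    # Illustrative parse of a typical status line (not a captured log):
    print(szs_status("# SZS status Theorem"))  # -> Theorem
```

A conjecture would count as ATP-provable in this initial check when the returned status is `Theorem`.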



Funding

Funded by the AI4REASON ERC Consolidator grant no. 649043, by the Czech project AI&Reasoning CZ.02.1.01/0.0/0.0/15_003/0000466, and by the European Regional Development Fund. We thank K. Chvalovský and T. Gauthier for discussions.

Author information

Corresponding author: Josef Urban.

A Additional Data From the Experiments


Fig. 1. Dataset 2 training and loss.

A.1 XXREAL_1:48 and its GPT-2 predictions

figure e

Following are the Mizar premises in the order proposed by GPT-2. The fifth and sixth were not needed for the ATP proof.

figure f

A.2 GROUPP_1:10 and its generalization conjectured by GPT-2

figure g

The generalization that avoids finiteness:

figure h

We don’t have an ATP proof of the generalization yet. We thank algebraists Michael Kinyon and David Stanovský for confirming that this generalization is provable. Based on this example, Stanovský commented that related Mizar theorems can be similarly generalized.

A.3 SINCOS10:17 and a false conjecture by GPT-2

figure i

GPT-2 generated the following conjecture, which is false. Together with another GPT-2 conjecture about the differentiability of sec on the interval, it nevertheless yields an ATP proof of SINCOS10:17.

figure j

A.4 FUNCTOR1:9 and a GPT-2 conjecture reducing it to FUNCTOR1:7

figure k


Copyright information

© 2020 Springer Nature Switzerland AG

About this paper


Cite this paper

Urban, J., Jakubův, J. (2020). First Neural Conjecturing Datasets and Experiments. In: Benzmüller, C., Miller, B. (eds.) Intelligent Computer Mathematics. CICM 2020. Lecture Notes in Computer Science, vol. 12236. Springer, Cham. https://doi.org/10.1007/978-3-030-53518-6_24


  • DOI: https://doi.org/10.1007/978-3-030-53518-6_24

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-53517-9

  • Online ISBN: 978-3-030-53518-6

  • eBook Packages: Computer Science, Computer Science (R0)
