lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047

Preview meta tags from the lemmata.substack.com website.

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

https://lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047

Kevin on Lemmata

I think we will we need both LLMs and FPMs working together. The LLMs have a lot of "holes" that the FPMs can fill. But there are a lot of tasks that the FPM can't really even start on. I think we need a step like "looking for interesting intermediate results". In practice, you don't solve these problems by guessing the answer and then cranking away. You start asking things like, what if a = b, what can we prove. Or, oh it looks like g has to divide both a - 1 and b - 1, that seems useful. And then while you look for interesting intermediate results, you notice things that pop up. For example, on the snail problem, there's an obvious way to look for intermediate results. Think of some strategies and see how well they do. One strategy is, just try column 1, then just try column 2, etc. (spoiler alert) After a little while you think and see, well if I just try column n, and find the monster, I can try "dodging around" just that monster. Well, columns n-1 and n+1 could have the monster one step away diagonally. So that's the worst case, and actually, the worse case is finding the monster right in the middle of column n. Hmm, if only you could find the monster *not* in the middle of the column. And then this line of reasoning suggests the actual optimal strategy. I think the FPMs can just never really do this evaluation of, is this an interesting intermediate result or not. That is a very "LLM" task. Plus, in actual non-contest mathematics, looking for interesting intermediate results is useful all the time. You aren't going to be able to tell it "please prove P != NP now" but you could tell it, okay here are all of these problems and the best known algorithms for them, what other interesting stuff is there. New problems, new algorithms, etc. Anyway this is a very, very interesting post and thank you for making it.

Bing

Kevin on Lemmata

https://lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047

DuckDuckGo

https://lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047

Kevin on Lemmata

General Meta Tags
19
- title
  Comments - AlphaProof and the IMO - by Greg Burnham
- title
- title
- title
- title
Open Graph Meta Tags
9
- og:url
  https://lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047
- og:type
  article
- og:title
  Kevin on Lemmata
- og:description
  I think we will we need both LLMs and FPMs working together. The LLMs have a lot of "holes" that the FPMs can fill. But there are a lot of tasks that the FPM can't really even start on. I think we need a step like "looking for interesting intermediate results". In practice, you don't solve these problems by guessing the answer and then cranking away. You start asking things like, what if a = b, what can we prove. Or, oh it looks like g has to divide both a - 1 and b - 1, that seems useful. And then while you look for interesting intermediate results, you notice things that pop up. For example, on the snail problem, there's an obvious way to look for intermediate results. Think of some strategies and see how well they do. One strategy is, just try column 1, then just try column 2, etc. (spoiler alert) After a little while you think and see, well if I just try column n, and find the monster, I can try "dodging around" just that monster. Well, columns n-1 and n+1 could have the monster one step away diagonally. So that's the worst case, and actually, the worse case is finding the monster right in the middle of column n. Hmm, if only you could find the monster *not* in the middle of the column. And then this line of reasoning suggests the actual optimal strategy. I think the FPMs can just never really do this evaluation of, is this an interesting intermediate result or not. That is a very "LLM" task. Plus, in actual non-contest mathematics, looking for interesting intermediate results is useful all the time. You aren't going to be able to tell it "please prove P != NP now" but you could tell it, okay here are all of these problems and the best known algorithms for them, what other interesting stuff is there. New problems, new algorithms, etc. Anyway this is a very, very interesting post and thank you for making it.
- og:image
  https://substackcdn.com/image/fetch/w_680,h_680,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack.com%2Fnote%2Fc-112809047%2Fpreview.jpeg%3Fsize%3Dsm
Twitter Meta Tags
8
- twitter:label1
  Likes
- twitter:data1
  1
- twitter:label2
  Replies
- twitter:data2
  1
- twitter:title
  Kevin on Lemmata
Link Tags
52
- alternate
  /feed
- apple-touch-icon
  https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a8de5ad-261a-47b6-a836-293bcde84acb%2Fapple-touch-icon-57x57.png
- apple-touch-icon
  https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a8de5ad-261a-47b6-a836-293bcde84acb%2Fapple-touch-icon-60x60.png
- apple-touch-icon
  https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a8de5ad-261a-47b6-a836-293bcde84acb%2Fapple-touch-icon-72x72.png
- apple-touch-icon
  https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a8de5ad-261a-47b6-a836-293bcde84acb%2Fapple-touch-icon-76x76.png

lemmata.substack.com/p/alphaproof-and-the-imo/comment/112809047

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

Kevin on Lemmata

Bing

Kevin on Lemmata

DuckDuckGo

Kevin on Lemmata

General Meta Tags

Open Graph Meta Tags

Twitter Meta Tags

Link Tags

Links