They haven't made the chart very clear, but it seems it has configurable passes ...

ainch · 2026-03-17T01:05:09 1773709509

pass@k means that you run the model k times and give it a pass if any of the answers is correct. I guess Lean is one of the few use cases where pass@k actually makes sense, since you can automatically validate correctness.

andai · 2026-03-16T22:01:02 1773698462

Oh my bad. I'm not sure how that works in practice. Do you just keep running it until the tests pass? I guess with formal verification you can run it as many times as you need, right?