| | Batched reward model inference and Best-of-N sampling (raw.sh) |
| 34 points by rawsh on Nov 19, 2024 | past |
|
| | Teaching LLMs to solve chess puzzles with DSPy and Finetuning (raw.sh) |
| 1 point by rawsh on Sept 12, 2024 | past |
|
| | Teaching chat models to solve chess puzzles (raw.sh) |
| 4 points by rawsh on Aug 24, 2024 | past |
|