Core Technique: Compute-Optimal Test-Time Scaling

The key insight is that different problems benefit from different test-time computation strategies. Choosing the strategy adaptively, based on problem difficulty, gives better results at a fixed compute budget, or the same results with less compute, than applying one fixed strategy to every problem.
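To make "adaptively choosing the strategy" concrete, here is a minimal sketch of a per-problem budget allocator. Everything in it is an assumption for illustration: the `Plan` type, the `choose_plan` function, the difficulty score in [0, 1], the 0.5 threshold, and the linear split rule are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Plan:
    strategy: str      # "revisions" or "search" (labels are illustrative)
    n_sequential: int  # revision steps per chain
    n_parallel: int    # independent chains or samples scored by a verifier


def choose_plan(difficulty: float, budget: int) -> Plan:
    """Split a generation budget between sequential and parallel compute.

    difficulty: estimated problem difficulty in [0, 1] (hypothetical score).
    budget: maximum number of model generations to spend on this problem.
    """
    if not 0.0 <= difficulty <= 1.0:
        raise ValueError("difficulty must be in [0, 1]")
    if budget < 1:
        raise ValueError("budget must be at least 1")

    # Placeholder rule: the harder the problem, the larger the share of the
    # budget spent on parallel samples instead of sequential revisions.
    n_parallel = max(1, round(budget * difficulty))
    n_sequential = max(1, budget // n_parallel)
    strategy = "search" if difficulty >= 0.5 else "revisions"
    return Plan(strategy, n_sequential, n_parallel)
```

For example, `choose_plan(0.0, 16)` spends the whole budget on one chain of 16 revisions, while `choose_plan(1.0, 16)` spends it on 16 independent samples. In practice the mapping from difficulty to plan would be fit on held-out data, not hard-coded.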

The Two Main Mechanisms:

  1. Improving the Proposal Distribution (Revisions)
  2. Optimizing with Verifiers (Search)
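The two mechanisms can be sketched in a few lines each. This is an illustration under stated assumptions, not the paper's implementation: `revise_sequentially` and `best_of_n` are hypothetical names, best-of-N is used here as the simplest form of verifier-guided search, and the `toy_propose` and `toy_verify` functions stand in for a real language model and a learned verifier.

```python
from typing import Callable, List


def revise_sequentially(
    propose: Callable[[str, List[int]], int], question: str, steps: int
) -> int:
    """Mechanism 1: each new attempt is conditioned on the earlier attempts."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    attempts: List[int] = []
    for _ in range(steps):
        attempts.append(propose(question, attempts))
    return attempts[-1]


def best_of_n(
    sample: Callable[[str], int],
    verify: Callable[[str, int], float],
    question: str,
    n: int,
) -> int:
    """Mechanism 2: draw n independent answers, keep the best-scored one."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [sample(question) for _ in range(n)]
    return max(candidates, key=lambda c: verify(question, c))


# Toy stand-ins (assumptions): the "model" guesses integers and the
# "verifier" scores a guess by its closeness to the true answer, 42.
def toy_propose(question: str, previous: List[int]) -> int:
    last = previous[-1] if previous else 30
    return last + (42 - last + 1) // 2  # move about halfway to the target


def toy_verify(question: str, candidate: int) -> float:
    return -abs(candidate - 42)
```

With these stand-ins, `revise_sequentially(toy_propose, "17 + 25", 5)` walks the guess from 36 toward 42 one revision at a time, while `best_of_n` picks whichever independent sample the verifier scores highest.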

The Adaptive Strategy:

The paper's key contribution is showing that:

Concrete Example: Math Problem Solving