O1: A Technical Primer – LessWrong

Anon84 | 10 points

I really dislike the phrase "test time." Why not just say, "More time to think" (aka more inference time)? To make it more accessible, why not just use the straightforward concepts:

1. Bigger networks

2. More data

3. Longer training time

4. More time to think

The bitter lesson is the complicated patterns are self-revealing.

patrickhogan1 | 11 hours ago