[qa] smaller, weaker, yet better: training llm reasoners via compute-optimal sampling
Published 1 month ago • 61 plays • Length 7:47Download video MP4
Download video MP3
Similar videos
-
18:20
smaller, weaker, yet better: training llm reasoners via compute-optimal sampling
-
53:02
scaling llm test-time compute optimally can be more effective than scaling model parameters (paper)
-
6:15
creative beam search: llm-as-a-judge for improving response generation - arxiv:2405.0009
-
2:18:02
solving chollet's arc-agi with gpt4o
-
46:22
it's not about scale, it's about abstraction
-
8:57
rag vs. fine tuning
-
2:01
paper walkthrough: react (https://arxiv.org/abs/2210.03629)
-
1:15
using scaling laws for smaller, but still accurate models
-
18:15
on scalable oversight with weak llms judging strong llms
-
38:00
large language monkeys: scaling inference compute with repeated sampling
-
0:59
llms vs generative ai: what’s the difference?