arXiv:2604.21018v1 Announce Type: new Abstract: While scaling test-time compute can substantially improve model performance, existing approaches either rely on static compute allocation or sample from fixed generation distributions. In this work, we introduce a test-time compute allocation framework that jointly adapts where computation is spent and how generation is performed. Our method begins w
Adaptive Test-Time Compute Allocation with Evolving In-Context Demonstrations
Bowen Zuo, Dongruo Zhou, Yinglun Zhu·arXiv cs.AI··1 min read
a
Continue reading on arXiv cs.AI
This article was sourced from arXiv cs.AI's RSS feed. Visit the original for the complete story.