Quick answer
AI Summary: Applies test-time compute scaling to web-based agents, using search and reward guidance to outperform static, larger-scale models.
AI Summary: Applies test-time compute scaling to web-based agents, using search and reward guidance to outperform static, larger-scale models.
Current WebAgents struggle with long-horizon tasks and complex navigation. We propose an agentic scaling framework that increases compute at test-time through iterative trajectory pruning and reward-guided search. By treating the agent as a search process over the browser's state space, we show that performance scales predictably with the number of explored paths. Our method significantly outperforms larger, more expensive base models by using a smaller model with an optimized search-and-verify loop.
Share your opinion to help other learners triage faster.
Write a reviewInvite someone by email to share an invited review for Agentic Test-Time Scaling for WebAgents.