How to improve web agent planning?
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
September 4, 2024
https://arxiv.org/pdf/2409.01927
This paper analyzes the performance bottlenecks of AI agents designed for web navigation. It dissects these agents into two components: Planning (deciding the sequence of actions) and Grounding (identifying and interacting with web elements).
Contrary to prior assumptions, the research found that the primary bottleneck is not grounding but planning. Even with perfect grounding, current LLM-based agents struggle to plan complex action sequences. This highlights the need for incorporating external knowledge and contextual information to improve the planning capabilities of LLM-based multi-agent systems.
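To make the planning/grounding split concrete, here is a minimal Python sketch of how one might isolate planning errors by swapping a model's grounder for an oracle. It is an illustration only, not the paper's evaluation protocol: the `Step`, `Planner`, and `Grounder` types, the `step_accuracy` function, and the toy task are all hypothetical names and data introduced for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Step:
    """One gold step: what to do (intent) and which element to act on."""
    intent: str
    element_id: str


# Hypothetical interfaces: a planner picks the next intent from the task and
# the history so far; a grounder maps an intent to an element on the page.
Planner = Callable[[str, List[str]], str]
Grounder = Callable[[str], Optional[str]]


def make_oracle_grounder(gold_steps: List[Step]) -> Grounder:
    """Perfect grounding: any gold intent resolves to its gold element."""
    lookup: Dict[str, str] = {s.intent: s.element_id for s in gold_steps}
    return lambda intent: lookup.get(intent)


def step_accuracy(task: str, gold_steps: List[Step],
                  planner: Planner, grounder: Grounder) -> float:
    """Fraction of steps where both the intent and the element are correct."""
    if not gold_steps:
        raise ValueError("gold_steps must not be empty")
    history: List[str] = []
    correct = 0
    for gold in gold_steps:
        intent = planner(task, history)
        element = grounder(intent)
        if intent == gold.intent and element == gold.element_id:
            correct += 1
        # Continue from the gold history so one mistake does not cascade.
        history.append(gold.intent)
    return correct / len(gold_steps)


# Toy data (made up for illustration).
GOLD = [
    Step("type query", "input#q"),
    Step("click search", "button#go"),
    Step("open first result", "a.result-0"),
]


def toy_planner(task: str, history: List[str]) -> str:
    # Gets the first two steps right, then plans the wrong action.
    plan = ["type query", "click search", "go back"]
    return plan[len(history)]


def toy_grounder(intent: str) -> Optional[str]:
    # Grounds "click search" to the wrong element.
    learned = {"type query": "input#q", "click search": "a.logo"}
    return learned.get(intent)


with_model = step_accuracy("find a paper", GOLD, toy_planner, toy_grounder)
with_oracle = step_accuracy("find a paper", GOLD, toy_planner,
                            make_oracle_grounder(GOLD))
print(f"model grounding:  {with_model:.2f}")
print(f"oracle grounding: {with_oracle:.2f}")
```

In this toy setup, the gain from switching to the oracle grounder is attributable to grounding, while any error that remains under the oracle can only come from the planner. That is the kind of comparison behind the claim that planning remains the bottleneck even with perfect grounding.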