Can APIs outperform web browsing for AI agents?
BEYOND BROWSING: API-BASED WEB AGENTS
October 23, 2024
https://arxiv.org/pdf/2410.16464
This research paper explores whether AI agents can perform web tasks more efficiently using APIs (interfaces designed for machines) rather than traditional web browsers (interfaces designed for humans).
The key points relevant to LLM-based multi-agent systems:
- APIs outperform browsing: Agents using APIs demonstrably outperformed those using only web browsers, especially on sites with well-documented APIs (like GitLab).
- Hybrid approach for flexibility: A hybrid agent capable of both browsing and API interaction proved most effective, allowing flexibility for websites with limited API support.
- API quality matters: Well-documented and comprehensive APIs were crucial for agent performance. Poorly documented or limited APIs hampered the agent's effectiveness.
- LLM limitations: The size of API documentation posed a challenge for LLMs; two-stage documentation retrieval helped manage large sets of APIs.
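
The last point can be made concrete with a small sketch. The summary above does not spell out how the two stages work, so this assumes one plausible reading: stage one shows the model only a compact index of endpoint names with one-line summaries, and stage two fetches full documentation for just the endpoints it selects. The endpoint names, the catalog contents, and the keyword-based `pick_endpoints` function (a stand-in for the LLM's choice) are all hypothetical and not taken from the paper.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoint:
    name: str      # short identifier shown in the stage-one index
    summary: str   # one-line description shown in the stage-one index
    full_doc: str  # detailed documentation, loaded only in stage two


# Hypothetical catalog; names and docs are illustrative, not a real API.
CATALOG = [
    Endpoint("list_issues", "List issues in a project",
             "Parameters: project_id (int), state (open|closed). Returns a list of issues."),
    Endpoint("create_issue", "Create a new issue in a project",
             "Parameters: project_id (int), title (str), description (str)."),
    Endpoint("list_merge_requests", "List merge requests in a project",
             "Parameters: project_id (int), state (opened|merged|closed)."),
]


def build_index(catalog: list[Endpoint]) -> str:
    """Stage one: a compact index that fits in the prompt even for large API sets."""
    return "\n".join(f"{e.name}: {e.summary}" for e in catalog)


def pick_endpoints(task: str, catalog: list[Endpoint], limit: int = 2) -> list[str]:
    """Stand-in for the LLM's selection: rank endpoints by word overlap with the task."""
    task_words = set(task.lower().split())
    scored = []
    for e in catalog:
        overlap = len(task_words & set(e.summary.lower().split()))
        if overlap:
            scored.append((overlap, e.name))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:limit]]


def fetch_docs(catalog: list[Endpoint], names: list[str]) -> dict[str, str]:
    """Stage two: load full documentation only for the selected endpoints."""
    by_name = {e.name: e for e in catalog}
    return {n: by_name[n].full_doc for n in names if n in by_name}


if __name__ == "__main__":
    task = "list open issues in a project"
    print(build_index(CATALOG))
    chosen = pick_endpoints(task, CATALOG, limit=1)
    print(fetch_docs(CATALOG, chosen))
```

In a real agent the selection step would be a model call that reads the index and returns endpoint names; the point of the split is that the full documentation for unselected endpoints never enters the context window.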