Exploring the Limits of Language Agents
Principal Investigator
Oracle Fellowship Recipient
Carlos Jimenez, John Yang, Ofir Press, Shunyu Yao
Oracle Principal Investigator
Ari Kobren, Principal Research Scientist
Jason Peck, Research Director
Summary
This project will develop new benchmarks for real-world language agents, with tasks derived from software engineering and from writing research papers. These benchmarks will provide realistic evaluations of language agents' capabilities and support the development of more advanced techniques. We will then explore (1) implementing new cognitive architectures for agents, based on the previously developed theoretical CoALA framework, (2) developing new memory mechanisms that let agents handle long-horizon tasks, and (3) designing novel learning mechanisms for self-improving agents.