AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Jianguo Zhang,Tian Lan, Rithesh Murthy,Zhiwei Liu,Weiran Yao,Juntao Tan, Thai Hoang,Liangwei Yang,Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles,Silvio Savarese,Shelby Heinecke,Huan Wang,Caiming Xiong

CoRR（2024）

引用 0|浏览8

暂无评分

摘要

Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce AgentOhana as a comprehensive solution to address these challenges. AgentOhana aggregates agent trajectories from distinct environments, spanning a wide array of scenarios. It meticulously standardizes and unifies these trajectories into a consistent format, streamlining the creation of a generic data loader optimized for agent training. Leveraging the data unification, our training pipeline maintains equilibrium across different data sources and preserves independent randomness across devices during dataset partitioning and model training. Additionally, we present xLAM-v0.1, a large action model tailored for AI agents, which demonstrates exceptional performance across various benchmarks.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要