Chrome Extension
WeChat Mini Program
Use on ChatGLM

A Generalist Dynamics Model for Control

CoRR(2023)

Cited 4|Views135
Abstract
We investigate the use of transformer sequence models as dynamics models (TDMs) for control. In a number of experiments in the DeepMind control suite, we find that first, TDMs perform well in a single-environment learning setting when compared to baseline models. Second, TDMs exhibit strong generalization capabilities to unseen environments, both in a few-shot setting, where a generalist model is fine-tuned with small amounts of data from the target environment, and in a zero-shot setting, where a generalist model is applied to an unseen environment without any further training. We further demonstrate that generalizing system dynamics can work much better than generalizing optimal behavior directly as a policy. This makes TDMs a promising ingredient for a foundation model of control.
More
Translated text
Key words
generalist dynamics model,control
PDF
Bibtex
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种基于变压器序列模型的通用动力学模型(TDMs),在DeepMind控制套件中的多个实验表明,TDMs在单环境学习设置中表现良好,且具有很强的泛化能力,为控制基础模型的构建提供了有前景的方法。

方法】:研究使用变压器序列模型作为控制中的动力学模型,通过学习环境的序列数据来预测未来的状态。

实验】:在DeepMind控制套件中进行了实验,实验结果证明TDMs在单环境学习、少量样本微调和零样本泛化情况下均表现出优异性能。