|
Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia, 2024, Volume 520, Number 2, Pages 124–130 DOI: https://doi.org/10.31857/S2686954324700449
(Mi danma594)
|
|
|
|
This article is cited in 1 scientific paper (total in 1 paper)
SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES
Review of multimodal environments for reinforcement learning
Z. A. Volovikovaab, M. A. Kuznetsovaa, A. A. Skrynnikbc, A. I. Panovabc a Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow Region
b Artificial Intelligence Research Institute, Moscow, Russia
c Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow, Russia
DOI:
https://doi.org/10.31857/S2686954324700449
Abstract:
This article presents a review and comparative analysis of multimodal virtual environments for reinforcement learning. Seven different environments are considered, including the HomeGrid, BabyAI, RTFM, Messenger, Touchdown, Alfred, and IGLU, and research is focused on their peculiarities and requirements to agents. The main attention is paid to such parameters as complexity of text instructions and the dynamic properties of the environment. The conducted analysis identifies the strengths and weaknesses of each environment, which allows determining the optimal conditions for effective agent training, and also emphasizes the need to create more balanced environments combining high requirements to both understanding of language and interaction with the surrounding.
Keywords:
multimodal learning, language grounding, reinforcement learning.
Received: 01.10.2024 Accepted: 07.10.2024
Citation:
Z. A. Volovikova, M. A. Kuznetsova, A. A. Skrynnik, A. I. Panov, “Review of multimodal environments for reinforcement learning”, Dokl. RAN. Math. Inf. Proc. Upr., 520:2 (2024), 124–130; Dokl. Math., 110:suppl. 1 (2024), S110–S116
Linking options:
https://www.mathnet.ru/eng/danma594 https://www.mathnet.ru/eng/danma/v520/i2/p124
|
| Statistics & downloads: |
| Abstract page: | 102 | | References: | 2 |
|