|
Proceedings of Machine Learning Research (PMLR), 2024, том 247, страницы 4511–4547
(Mi pmlr3)
|
|
|
|
Improved high-probability bounds for the temporal difference learning algorithm via exponential stability
Sergey Samsonova, Daniil Tiapkinbc, Alexey Naumovad, Eric Moulinesb a HSE University, Moscow, Russia
b Centre de Mathématiques Appliquées – CNRS – École polytechnique – Institut Polytechnique de Paris, France
c Université Paris-Saclay, CNRS, Laboratoire de mathématiques d'Orsay, France
d Steklov Mathematical Institute of Russian Academy of Sciences, Moscow, Russia
Образцы ссылок на эту страницу:
https://www.mathnet.ru/rus/pmlr3
|
Статистика просмотров: |
Страница аннотации: | 24 |
|