|
|
Proceedings of Machine Learning Research (PMLR), 2024, Volume 247, Pages 4511–4547
(Mi pmlr3)
|
|
|
|
Improved high-probability bounds for the temporal difference learning algorithm via exponential stability
Sergey Samsonova, Daniil Tiapkinbc, Alexey Naumovad, Eric Moulinesb a HSE University, Moscow, Russia
b Centre de Mathématiques Appliquées – CNRS – École polytechnique – Institut Polytechnique de Paris, France
c Université Paris-Saclay, CNRS, Laboratoire de mathématiques d'Orsay, France
d Steklov Mathematical Institute of Russian Academy of Sciences, Moscow, Russia
Linking options:
https://www.mathnet.ru/eng/pmlr3
|
|