Sergey V. Garbar, Alexander V. Kolnogorov, Alexey N. Lazutchenko, “UCB strategies and optimization of batch processing in a one-armed bandit problem”, Mat. Teor. Igr Pril., 15:4 (2023), 3

Matematicheskaya Teoriya Igr i Ee Prilozheniya

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Mat. Teor. Igr Pril.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Matematicheskaya Teoriya Igr i Ee Prilozheniya, 2023, Volume 15, Issue 4, Pages 3–27 (Mi mgta328)

UCB strategies and optimization of batch processing in a one-armed bandit problem

Sergey V. Garbar, Alexander V. Kolnogorov, Alexey N. Lazutchenko

Yaroslav-the-Wise Novgorod State University

Full-text PDF (2136 kB)

References:

PDF

HTML

Abstract: We consider a Gaussian one-armed bandit problem, which arises when optimizing batch data processing if there are two alternative processing methods with a priori known efficiency of the first method. During processing, it is necessary to determine a more effective method and ensure its preferential use. This optimal control problem is interpreted as a game with nature. We investigate cases of known and a priori unknown variance of income corresponding to the second method. The control goal is considered in a minimax setting, and UCB strategies are used to ensure it. In all the studied cases, invariant descriptions of control on a horizon equal to one are obtained, which depend only on the number of batches into which the data is divided, but not on their full number. These descriptions allow us to determine approximately optimal parameters of strategies using Monte Carlo simulation. Numerical results show the high efficiency of the proposed UCB strategies.

Keywords: Gaussian one-armed bandit, minimax approach, UCB rule, invariant description, Monte-Carlo simulations.

Funding agency	Grant number
Russian Science Foundation	23-21-00447

Received: 07.05.2023
Revised: 24.10.2023
Accepted: 01.12.2023

Document Type: Article

UDC: 519.832, 519.245

BBC: 22.18

Language: Russian

Citation: Sergey V. Garbar, Alexander V. Kolnogorov, Alexey N. Lazutchenko, “UCB strategies and optimization of batch processing in a one-armed bandit problem”, Mat. Teor. Igr Pril., 15:4 (2023), 3–27

Citation in format AMSBIB

\Bibitem{GarKolLaz23}

\by Sergey~V.~Garbar, Alexander~V.~Kolnogorov, Alexey~N.~Lazutchenko

\paper UCB strategies and optimization of batch processing in a one-armed bandit problem

\jour Mat. Teor. Igr Pril.

\yr 2023

\vol 15

\issue 4

\pages 3--27

\mathnet{http://mi.mathnet.ru/mgta328}

Linking options:

https://www.mathnet.ru/eng/mgta328

https://www.mathnet.ru/eng/mgta/v15/i4/p3

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Математическая теория игр и её приложения

Registration to the website

Logotypes