Matematicheskaya Teoriya Igr i Ee Prilozheniya
Matematicheskaya Teoriya Igr i Ee Prilozheniya, 2023, Volume 15, Issue 4, Pages 3–27 (Mi mgta328)  

UCB strategies and optimization of batch processing in a one-armed bandit problem

Sergey V. Garbar, Alexander V. Kolnogorov, Alexey N. Lazutchenko

Yaroslav-the-Wise Novgorod State University
Abstract: We consider a Gaussian one-armed bandit problem that arises in optimizing batch data processing when there are two alternative processing methods and the efficiency of the first method is known a priori. During processing, the more effective method must be identified and used preferentially. This optimal control problem is interpreted as a game against nature. We study the cases in which the variance of the income from the second method is known and in which it is a priori unknown. The control goal is stated in a minimax setting, and UCB strategies are used to achieve it. In all cases studied, we obtain invariant descriptions of the control on a horizon equal to one; these descriptions depend only on the number of batches into which the data is divided, not on the total amount of data. They make it possible to determine approximately optimal strategy parameters by Monte Carlo simulation. Numerical results show that the proposed UCB strategies are highly efficient.
Keywords: Gaussian one-armed bandit, minimax approach, UCB rule, invariant description, Monte Carlo simulation.
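The setting described in the abstract can be illustrated with a minimal Monte Carlo sketch. It is not the authors' strategy or their invariant description: the index form `mean + a * sqrt(var / n2)`, the function name `simulate_regret`, and all parameter names and default values are assumptions made for illustration only. The sketch assumes a known variance, processes the first batch with the second method, and switches permanently to the first method (known mean `m1`) once the upper confidence bound of the second method falls below `m1`.

```python
import math
import random


def simulate_regret(m, n_items=1000, n_batches=20, a=1.0, m1=0.0, var=1.0,
                    n_runs=2000, seed=0):
    """Estimate the regret of a batch UCB-type rule, scaled by sqrt(var * N).

    m  : true (unknown to the rule) mean income of the second method
    m1 : known mean income of the first method
    a  : UCB width parameter (hypothetical form, see the lead-in)
    """
    rng = random.Random(seed)
    batch = n_items // n_batches
    total = batch * n_batches            # items actually processed
    best = max(m1, m) * total            # income of always using the better method
    loss_sum = 0.0
    for _ in range(n_runs):
        income = 0.0                     # expected income of the methods chosen
        x_sum = 0.0                      # observed cumulative income of method 2
        n2 = 0                           # items processed by method 2 so far
        for k in range(n_batches):
            if n2 > 0:
                ucb = x_sum / n2 + a * math.sqrt(var / n2)
                if ucb < m1:
                    # The known method yields no new information, so the
                    # rule keeps it for all remaining batches.
                    income += m1 * batch * (n_batches - k)
                    break
            # Total income of one batch of method 2 is Gaussian with
            # mean m * batch and variance var * batch.
            x_sum += rng.gauss(m * batch, math.sqrt(var * batch))
            n2 += batch
            income += m * batch
        loss_sum += best - income
    return loss_sum / n_runs / math.sqrt(var * total)


if __name__ == "__main__":
    # Scan a few values of the unknown mean for a fixed width parameter.
    for m in (-1.0, -0.1, 0.0, 0.1, 1.0):
        print(m, simulate_regret(m))
```

Under these assumptions, a minimax choice of `a` could be approximated by evaluating the maximum of `simulate_regret` over a grid of `m` values for each candidate `a` and taking the `a` with the smallest maximum.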
Funding: Russian Science Foundation, grant no. 23-21-00447
Received: 07.05.2023
Revised: 24.10.2023
Accepted: 01.12.2023
Document Type: Article
UDC: 519.832, 519.245
BBC: 22.18
Language: Russian
Citation: Sergey V. Garbar, Alexander V. Kolnogorov, Alexey N. Lazutchenko, “UCB strategies and optimization of batch processing in a one-armed bandit problem”, Mat. Teor. Igr Pril., 15:4 (2023), 3–27
Citation in format AMSBIB
\Bibitem{GarKolLaz23}
\by Sergey~V.~Garbar, Alexander~V.~Kolnogorov, Alexey~N.~Lazutchenko
\paper UCB strategies and optimization of batch processing in a one-armed bandit problem
\jour Mat. Teor. Igr Pril.
\yr 2023
\vol 15
\issue 4
\pages 3--27
\mathnet{http://mi.mathnet.ru/mgta328}
Linking options:
  • https://www.mathnet.ru/eng/mgta328
  • https://www.mathnet.ru/eng/mgta/v15/i4/p3