|
This article is cited in 16 scientific papers (total in 16 papers)
On the efficiency of a randomized mirror descent algorithm in online optimization problems
A. V. Gasnikovabc, Yu. E. Nesterovbca, V. G. Spokoinyacb a Moscow Institute of Physics and Technology, Institutskii per. 9, Dolgoprudnyi, Moscow oblast, 141700, Russia
b National Research University Higher School of Economics, Myasnitskaya ul. 20, Moscow, 101000, Russia
c Institute for Information Transmission Problems, Russian Academy of Sciences, Bolshoi Karetnyi per. 19, Moscow, 127051, Russia
Abstract:
A randomized online version of the mirror descent method is proposed. It differs from the existing versions by the randomization method. Randomization is performed at the stage of the projection of a subgradient of the function being optimized onto the unit simplex rather than at the stage of the computation of a subgradient, which is common practice. As a result, a componentwise subgradient descent with a randomly chosen component is obtained, which admits an online interpretation. This observation, for example, has made it possible to uniformly interpret results on weighting expert decisions and propose the most efficient method for searching for an equilibrium in a zero-sum two-person matrix game with sparse matrix.
Key words:
mirror descent method, dual averaging method, online optimization, exponential weighting, multi-armed bandits, weighting of experts, stochastic optimization, randomization.
Received: 03.09.2014 Revised: 29.10.2014
Citation:
A. V. Gasnikov, Yu. E. Nesterov, V. G. Spokoiny, “On the efficiency of a randomized mirror descent algorithm in online optimization problems”, Zh. Vychisl. Mat. Mat. Fiz., 55:4 (2015), 582–598; Comput. Math. Math. Phys., 55:4 (2015), 580–596
Linking options:
https://www.mathnet.ru/eng/zvmmf10186 https://www.mathnet.ru/eng/zvmmf/v55/i4/p582
|
|