Russian Journal of Nonlinear Dynamics
RUS  ENG    ЖУРНАЛЫ   ПЕРСОНАЛИИ   ОРГАНИЗАЦИИ   КОНФЕРЕНЦИИ   СЕМИНАРЫ   ВИДЕОТЕКА   ПАКЕТ AMSBIB  
Общая информация
Последний выпуск
Архив
Импакт-фактор

Поиск публикаций
Поиск ссылок

RSS
Последний выпуск
Текущие выпуски
Архивные выпуски
Что такое RSS



Rus. J. Nonlin. Dyn.:
Год:
Том:
Выпуск:
Страница:
Найти






Персональный вход:
Логин:
Пароль:
Запомнить пароль
Войти
Забыли пароль?
Регистрация


Russian Journal of Nonlinear Dynamics, 2024, том 20, номер 5, страницы 759–788
DOI: https://doi.org/10.20537/nd241209
(Mi nd922)
 

NONLINEAR SYSTEMS IN ROBOTICS

Accelerated Zero-Order SGD under High-Order Smoothness and Overparameterized Regime

G. K. Bychkovab, D. M. Dvinskikhcad, A. V. Antsiferovaea, A. V. Gasnikovfgc, A. V. Lobanovcha

a ISP RAS Research Center for Trusted Artificial Intelligence, ul. Alexandra Solzhenitsyna 25, Moscow, 109004 Russia
b Lomonosov Moscow State University, ul. Leninskiye Gory 1, Moscow, 119991 Russia
c Moscow Institute of Physics and Technology, Institutskiy per. 9, Dolgoprudny, 141701 Russia
d National Research University Higher School of Economics, Pokrovsky bul. 11, Moscow, 109028 Russia
e Institute for Artificial Intelligence, Lomonosov Moscow State University, Lomonosovsky pr. 27/1, Moscow, 119192 Russia
f Innopolis University, ul. Universitetskaya 1, Innopolis, 420500, Russia
g Institute for Information Transmission Problems of RAS, per. Bolshoy Karetny 19, Moscow, 127051 Russia
h Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, bld. 1, Moscow, 121205 Russia
Список литературы:
Аннотация: We present a novel gradient-free algorithm to solve a convex stochastic optimization problem, such as those encountered in medicine, physics, and machine learning (e.g., the adversarial multi-armed bandit problem), where the objective function can only be computed through numerical simulation, either as the result of a real experiment or as feedback given by the function evaluations from an adversary. Thus, we suppose that only black-box access to the function values of the objective is available, possibly corrupted by adversarial noise: deterministic or stochastic. The noisy setup can arise naturally from modeling randomness within a simulation or by computer discretization, or when exact values of the function are forbidden due to privacy issues, or when solving nonconvex problems as convex ones with an inexact function oracle. By exploiting higher-order smoothness, fulfilled, e.g., in logistic regression, we improve the performance of zero-order methods developed under the assumption of classical smoothness (or having a Lipschitz gradient). The proposed algorithm enjoys optimal oracle complexity and is designed under an overparameterization setup, i.e., when the number of model parameters is much larger than the size of the training dataset. Overparametrized models fit to the training data perfectly while also having good generalization and outperforming underparameterized models on unseen data. We provide convergence guarantees for the proposed algorithm under both types of noise. Moreover, we estimate the maximum permissible adversarial noise level that maintains the desired accuracy in the Euclidean setup, and then we extend our results to a non-Euclidean setup. Our theoretical results are verified using the logistic regression problem.
Ключевые слова: zero-order optimization, gradient-free algorithms, high-order smoothness, kernel approximation, overparametrization
Финансовая поддержка Номер гранта
Российский научный фонд 21-71-30005
The research was supported by Russian Science Foundation (project No. 21-71-30005), https://rscf.ru/en/project/21-71-30005/.
Поступила в редакцию: 03.11.2024
Принята в печать: 17.12.2024
Тип публикации: Статья
MSC: 49M30
Язык публикации: английский
Образец цитирования: G. K. Bychkov, D. M. Dvinskikh, A. V. Antsiferova, A. V. Gasnikov, A. V. Lobanov, “Accelerated Zero-Order SGD under High-Order Smoothness and Overparameterized Regime”, Rus. J. Nonlin. Dyn., 20:5 (2024), 759–788
Цитирование в формате AMSBIB
\RBibitem{BycDviAnt24}
\by G. K. Bychkov, D. M. Dvinskikh, A. V. Antsiferova, A. V. Gasnikov, A. V. Lobanov
\paper Accelerated Zero-Order SGD under High-Order Smoothness and Overparameterized Regime
\jour Rus. J. Nonlin. Dyn.
\yr 2024
\vol 20
\issue 5
\pages 759--788
\mathnet{http://mi.mathnet.ru/nd922}
\crossref{https://doi.org/10.20537/nd241209}
Образцы ссылок на эту страницу:
  • https://www.mathnet.ru/rus/nd922
  • https://www.mathnet.ru/rus/nd/v20/i5/p759
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Russian Journal of Nonlinear Dynamics
     
      Обратная связь:
     Пользовательское соглашение  Регистрация посетителей портала  Логотипы © Математический институт им. В. А. Стеклова РАН, 2025