Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

In this paper the gradient-free modification of the mirror descent method for convex stochastic online optimization problems is proposed. The crucial assumption in the problem setting is that function realizations are observed with minor noises. The aim of this paper is to derive the convergence rate of the proposed methods and to determine a noise level which does not significantly affect the convergence rate.

About the authors

A. V. Gasnikov

Moscow Institute of Physics and Technology (State University); Institute for Information Transmission Problems (Kharkevich Institute)

Author for correspondence.
Email: gasnikov@yandex.ru
Russian Federation, Moscow; Moscow

E. A. Krymova

Institute for Information Transmission Problems (Kharkevich Institute)

Email: gasnikov@yandex.ru
Russian Federation, Moscow

A. A. Lagunovskaya

Keldysh Institute of Applied Mathematics; Moscow Institute of Physics and Technology (State University)

Email: gasnikov@yandex.ru
Russian Federation, Moscow; Moscow

I. N. Usmanova

Moscow Institute of Physics and Technology (State University); Institute for Information Transmission Problems (Kharkevich Institute)

Email: gasnikov@yandex.ru
Russian Federation, Moscow; Moscow

F. A. Fedorenko

Moscow Institute of Physics and Technology (State University)

Email: gasnikov@yandex.ru
Russian Federation, Moscow

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Pleiades Publishing, Ltd.