Median Clipping for Zeroth-order Non-Smooth Convex Optimization and Multi Arm Bandit Problem with Heavy-tailed Symmetric Noise

Nikita Kornilov, Yuriy Dorn,Aleksandr Lobanov, Nikolay Kutuzov,Innokentiy Shibaev, Eduard Gorbunov, Alexander Gasnikov, Alexander Nazin

arxiv(2024)

引用 0|浏览2
暂无评分
摘要
In this paper, we consider non-smooth convex optimization with a zeroth-order oracle corrupted by symmetric stochastic noise. Unlike the existing high-probability results requiring the noise to have bounded κ-th moment with κ∈ (1,2], our results allow even heavier noise with any κ > 0, e.g., the noise distribution can have unbounded 1-st moment. Moreover, our results match the best-known ones for the case of the bounded variance. To achieve this, we use the mini-batched median estimate of the sampled gradient differences, apply gradient clipping to the result, and plug in the final estimate into the accelerated method. We apply this technique to the stochastic multi-armed bandit problem with heavy-tailed distribution of rewards and achieve O(√(Td)) regret by incorporating noise symmetry.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要