No Dimensional Sampling Coresets for Classification
CoRR(2024)
摘要
We refine and generalize what is known about coresets for classification
problems via the sensitivity sampling framework. Such coresets seek the
smallest possible subsets of input data, so one can optimize a loss function on
the coreset and ensure approximation guarantees with respect to the original
data. Our analysis provides the first no dimensional coresets, so the size does
not depend on the dimension. Moreover, our results are general, apply for
distributional input and can use iid samples, so provide sample complexity
bounds, and work for a variety of loss functions. A key tool we develop is a
Radamacher complexity version of the main sensitivity sampling approach, which
can be of independent interest.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要