THE IMPORTANCE OF CONVEXITY IN LEARNING WITH SQUARED LOSS

Citation
Ws. Lee et al., THE IMPORTANCE OF CONVEXITY IN LEARNING WITH SQUARED LOSS, IEEE transactions on information theory, 44(5), 1998, pp. 1974-1980
Citations number
21
Categorie Soggetti
Computer Science Information Systems","Engineering, Eletrical & Electronic","Computer Science Information Systems
ISSN journal
00189448
Volume
44
Issue
5
Year of publication
1998
Pages
1974 - 1980
Database
ISI
SICI code
0018-9448(1998)44:5<1974:TIOCIL>2.0.ZU;2-Y
Abstract
We show that if the closure of a function class F under the metric ind uced by some probability distribution is not convex, then the sample c omplexity for agnostically learning F with squared loss (using only hy potheses in F) is Omega(ln(1/delta)/epsilon(2)) where 1 - delta is the probability of success and epsilon is the required accuracy. In compa rison, if the class F is convex and has finite pseudodimension, then t he sample complexity is O(1/epsilon(ln 1/epsilon + ln 1/delta)). If a nonconvex class F has finite pseudodimension, then the sample complexi ty for agnostically learning the closure of the convex hull of F, is O (1/epsilon(1/epsilon ln 1/epsilon + ln 1/delta)). Hence, for agnostic learning, learning the convex hull provides better approximation capa bilities with little sample complexity penalty.