Multiclass learnability and the ERM principle

  • Amit Daniely
  • Sivan Sabato
  • Shai Ben-David
  • Shai Shalev-Shwartz

    Research output: Contribution to journal, Article, peer-review

    70 Scopus citations

    Abstract

    We study the sample complexity of multiclass prediction in several learning settings. For the PAC setting our analysis reveals a surprising phenomenon: In sharp contrast to binary classification, we show that there exist multiclass hypothesis classes for which some Empirical Risk Minimizers (ERM learners) have lower sample complexity than others. Furthermore, there are classes that are learnable by some ERM learners, while other ERM learners will fail to learn them. We propose a principle for designing good ERM learners, and use this principle to prove tight bounds on the sample complexity of learning symmetric multiclass hypothesis classes, that is, classes that are invariant under permutations of label names. We further provide a characterization of mistake and regret bounds for multiclass learning in the online setting and the bandit setting, using new generalizations of Littlestone's dimension.

    Original language: English
    Pages (from-to): 2377-2404
    Number of pages: 28
    Journal: Journal of Machine Learning Research
    Volume: 16
    State: Published - 1 Dec 2015

    Keywords

    • ERM
    • Multiclass
    • Sample complexity

    ASJC Scopus subject areas

    • Software
    • Control and Systems Engineering
    • Statistics and Probability
    • Artificial Intelligence
