
List and Certificate Complexities in Replicable Learning

  • Peter Dixon
  • A. Pavan
  • Jason Vander Woude
  • N. V. Vinodchandran

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    7 Scopus citations

    Abstract

    We investigate replicable learning algorithms. Informally, a learning algorithm is replicable if it outputs the same canonical hypothesis over multiple runs with high probability, even when different runs observe different sets of samples from the unknown data distribution. In general, such a strong notion of replicability is not achievable. Thus we consider two feasible notions of replicability called list replicability and certificate replicability. Intuitively, these notions capture the degree of (non-)replicability. The goal is to design learning algorithms with optimal list and certificate complexities while minimizing the sample complexity. Our contributions are the following.

    • We first study the learning task of estimating the biases of d coins, up to an additive error of ε, by observing samples. For this task, we design a (d + 1)-list replicable algorithm. To complement this result, we establish that the list complexity is optimal: no learning algorithm for this task has a list size smaller than d + 1. We also design learning algorithms with certificate complexity Õ(log d). The sample complexity of both these algorithms is Õ(d²/ε²), where ε is the approximation error parameter (for a constant error probability).

    • In the PAC model, we show that any hypothesis class that is learnable with d nonadaptive statistical queries can be learned via a (d + 1)-list replicable algorithm and also via an Õ(log d)-certificate replicable algorithm. The sample complexity of both these algorithms is Õ(d²/ν²), where ν is the approximation error of the statistical query. We also show that for the concept class dTHRESHOLD, the list complexity is exactly d + 1 with respect to the uniform distribution.

    To establish our upper bound results, we use rounding schemes induced by geometric partitions with certain properties; the lower bounds follow from the Sperner/KKM lemma. A toy sketch of the rounding idea appears below.
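
    As a concrete illustration of the rounding idea, the Python sketch below (a toy under our own assumptions, not the paper's algorithm) shows the d = 1 case: deterministically rounding the empirical bias of a single coin to a grid of width ε means the output lands on one of at most two adjacent grid points across runs, matching the (d + 1)-list bound at d = 1. The paper's d-coin algorithm needs geometric partitions precisely to avoid the 2^d list-size blowup that naive per-coordinate rounding would incur.

        import random

        def estimate_bias_2_list(samples, eps):
            """Round the empirical mean of coin flips to a grid of width eps.

            With enough samples the empirical mean concentrates near the
            true bias, so the rounded output falls on one of at most two
            adjacent grid points across runs: a 2-list replicable
            estimator for one coin (the d = 1 case of the (d + 1) bound).
            """
            empirical_mean = sum(samples) / len(samples)
            # Deterministic rounding: snap to the nearest multiple of eps.
            return eps * round(empirical_mean / eps)

        # Two runs on fresh samples from the same coin can only disagree
        # between the two grid points bracketing the true bias.
        true_bias, eps, n = 0.37, 0.05, 20_000
        run1 = estimate_bias_2_list([random.random() < true_bias for _ in range(n)], eps)
        run2 = estimate_bias_2_list([random.random() < true_bias for _ in range(n)], eps)
        print(run1, run2)  # e.g. 0.35 0.35, or occasionally 0.35 0.4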

    Original language: English
    Title of host publication: Advances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
    Editors: A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
    Publisher: Neural Information Processing Systems Foundation
    ISBN (Electronic): 9781713899921
    State: Published - 1 Jan 2023
    Event: 37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
    Duration: 10 Dec 2023 – 16 Dec 2023

    Publication series

    Name: Advances in Neural Information Processing Systems
    Volume: 36
    ISSN (Print): 1049-5258

    Conference

    Conference: 37th Conference on Neural Information Processing Systems, NeurIPS 2023
    Country/Territory: United States
    City: New Orleans
    Period: 10/12/23 – 16/12/23

    ASJC Scopus subject areas

    • Signal Processing
    • Information Systems
    • Computer Networks and Communications
