Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood

  • Ziyi Liu
  • , Idan Attias
  • , Daniel M. Roy

    Research output: Contribution to journalConference articlepeer-review

    Abstract

    We study the fundamental problem of sequential probability assignment, also known as online learning with logarithmic loss, with respect to an arbitrary, possibly nonparametric hypothesis class. Our goal is to obtain a complexity measure for the hypothesis class that characterizes the minimax regret and to determine a general, minimax optimal algorithm. Notably, the sequential ℓ entropy, extensively studied in the literature (Rakhlin and Sridharan, 2015, Bilodeau et al., 2020, Wu et al., 2023), was shown to not characterize minimax regret in general. Inspired by the seminal work of Shtarkov (1987) and Rakhlin, Sridharan, and Tewari (2010), we introduce a novel complexity measure, the contextual Shtarkov sum, corresponding to the Shtarkov sum after projection onto a multiary context tree, and show that the worst case log contextual Shtarkov sum equals the minimax regret. Using the contextual Shtarkov sum, we derive the minimax optimal strategy, dubbed contextual Normalized Maximum Likelihood (cNML). Our results hold for sequential experts, beyond binary labels, which are settings rarely considered in prior work. To illustrate the utility of this characterization, we provide a short proof of a new regret upper bound in terms of sequential ℓ entropy, unifying and sharpening state-of-the-art bounds by Bilodeau et al. (2020) and Wu et al. (2023).

    Original languageEnglish
    JournalAdvances in Neural Information Processing Systems
    Volume37
    StatePublished - 1 Jan 2024
    Event38th Conference on Neural Information Processing Systems, NeurIPS 2024 - Vancouver, Canada
    Duration: 9 Dec 202415 Dec 2024

    ASJC Scopus subject areas

    • Signal Processing
    • Information Systems
    • Computer Networks and Communications

    Fingerprint

    Dive into the research topics of 'Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood'. Together they form a unique fingerprint.

    Cite this