Clustering on sliding windows in polylogarithmic space

  • Vladimir Braverman
  • , Harry Lang
  • , Keith Levin
  • , Morteza Monemizadeh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

In PODS 2003, Babcock, Datar, Motwani and O'Callaghan [4] gave the first streaming solution for the k-median problem on sliding windows using O( k/τ4W log2W) space, with a O(2O(1/τ)) approximation factor, where W is the window size and ∈ 2 (0, 1/2 ) is a user-specified parameter. They left as an open question whether it is possible to improve this to polylogarithmic space. Despite much progress on clustering and sliding windows, this question has remained open for more than a decade. In this paper, we partially answer the main open question posed by Babcock, Datar, Motwani and O'Callaghan. We present an algorithm yielding an exponential improvement in space compared to the previous result given in Babcock, et al. In particular, we give the first polylogarithmic space (-,-)-approximation for metric k-median clustering in the sliding window model, where- and- are constants, under the assumption, also made by Babcock et al., that the optimal k-median cost on any given window is bounded by a polynomial in the window size. We justify this assumption by showing that when the cost is exponential in the window size, no sublinear space approximation is possible. Our main technical contribution is a simple but elegant extension of smooth functions as introduced by Braverman and Ostrovsky [9], which allows us to apply well-known techniques for solving problems in the sliding window model to functions that are not smooth, such as the k-median cost.

Original languageEnglish
Title of host publication35th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, FSTTCS 2015
EditorsPrahladh Harsha, G. Ramalingam
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Pages350-364
Number of pages15
ISBN (Electronic)9783939897972
DOIs
StatePublished - 1 Dec 2015
Externally publishedYes
Event35th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, FSTTCS 2015 - Bangalore, India
Duration: 16 Dec 201518 Dec 2015

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume45
ISSN (Print)1868-8969

Conference

Conference35th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, FSTTCS 2015
Country/TerritoryIndia
CityBangalore
Period16/12/1518/12/15

Keywords

  • Clustering
  • Sliding windows
  • Streaming

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Clustering on sliding windows in polylogarithmic space'. Together they form a unique fingerprint.

Cite this