Generalizing the layering method of indyk and Woodruff: Recursive sketches for frequency-based vectors on streams

Vladimir Braverman, Rafail Ostrovsky

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

32 Scopus citations

Abstract

In their ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute the k-th frequency moment Fk (for k > 2) in space O(poly-log(n,m)·n1-2/k), giving the first optimal result up to poly-logarithmic factors in n and m (here m is the length of the stream and n is the size of the domain.) The method of Indyk and Woodruff reduces the problem of Fk to the problem of computing heavy hitters in the streaming manner. Their reduction only requires polylogarithmic overhead in term of the space complexity and is based on the fundamental idea of "layering". Since 2005 the method of Indyk and Woodruff has been used in numerous applications and has become a standard tool for streaming computations. We propose a new recursive sketch that generalizes and improves the reduction of Indyk and Woodruff. Our method works for any non-negative frequency-based function in several models, including the insertion-only model, the turnstile model and the sliding window model. For frequency-based functions with sublinear polynomial space complexity our reduction only requires log(c)(n) overhead, where log(c)(n) is the iterative log function. Thus, we improve the reduction of Indyk and Woodruff by polylogarithmic factor. We illustrate the generality of our method by several applications: frequency moments, frequency based functions, spatial data streams and measuring independence of data sets.

Original languageEnglish
Title of host publicationApproximation, Randomization, and Combinatorial Optimization
Subtitle of host publicationAlgorithms and Techniques - 16th International Workshop, APPROX 2013 and 17th International Workshop, RANDOM 2013, Proceedings
Pages58-70
Number of pages13
DOIs
StatePublished - 15 Oct 2013
Externally publishedYes
Event16th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2013 and the 17th International Workshop on Randomization and Computation, RANDOM 2013 - Berkeley, CA, United States
Duration: 21 Aug 201323 Aug 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8096 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2013 and the 17th International Workshop on Randomization and Computation, RANDOM 2013
Country/TerritoryUnited States
CityBerkeley, CA
Period21/08/1323/08/13

Keywords

  • Data streams
  • frequencies
  • recursion
  • sketches

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Generalizing the layering method of indyk and Woodruff: Recursive sketches for frequency-based vectors on streams'. Together they form a unique fingerprint.

Cite this