AMS without 4-wise independence on product domains

Vladimir Braverman, Kai Min Chung, Zhenming Liu, Michael Mitzenmacher, Rafail Ostrovsky

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

In their seminal work, Alon, Matias, and Szegedy introduced several sketching techniques, including showing that 4-wise independence is sufficient to obtain good approximations of the second frequency moment. In this work, we show that their sketching technique can be extended to product domains [n] k by using the product of 4-wise independent functions on [n]. Our work extends that of Indyk and McGregor, who showed the result for k = 2. Their primary motivation was the problem of identifying correlations in data streams. In their model, a stream of pairs (i, j) ∈ [n]2 arrive, giving a joint distribution (X, Y), and they find approximation algorithms for how close the joint distribution is to the product of the marginal distributions under various metrics, which naturally corresponds to how close X and Y are to being independent. By using our technique, we obtain a new result for the problem of approximating the ℓ2 distance between the joint distribution and the product of the marginal distributions for k-ary vectors, instead of just pairs, in a single pass. Our analysis gives a randomized algorithm that is a (1 ± ∈) approximation (with probability 1 - δ) that requires space logarithmic in n and m and proportional to 3k.

Original languageEnglish
Title of host publicationSTACS 2010 - 27th International Symposium on Theoretical Aspects of Computer Science
Pages119-130
Number of pages12
DOIs
StatePublished - 1 Dec 2010
Externally publishedYes
Event27th International Symposium on Theoretical Aspects of Computer Science, STACS 2010 - Nancy, France
Duration: 4 Mar 20106 Mar 2010

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume5
ISSN (Print)1868-8969

Conference

Conference27th International Symposium on Theoretical Aspects of Computer Science, STACS 2010
Country/TerritoryFrance
CityNancy
Period4/03/106/03/10

Keywords

  • Data streams
  • Independence
  • Randomized algorithms
  • Sketches
  • Streaming algorithms

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'AMS without 4-wise independence on product domains'. Together they form a unique fingerprint.

Cite this