The design of a similarity based deduplication system

Lior Aronovich, Ron Asher, Eitan Bachmat, Haim Bitner, Michael Hirsch, Shmuel T. Klein

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

51 Scopus citations


We describe some of the design choices that were made during the development of a fast, scalable, inline, deduplication device. The system's design goals and how they were achieved are presented. This is the firs deduplication device that uses similarity matching. The paper provides the following original research contributions: We show how similarity signatures can serve in a deduplication scheme; a novel type of similarity signatures is presented and its advantages in the context of deduplication requirements are explained. It is also shown how to combine similarity matching schemes with byte by byte comparison or hash based identity schemes.

Original languageEnglish
Title of host publicationProceedings of the Israeli Experimental Systems Conference, SYSTOR 2009
Number of pages1
StatePublished - 16 Nov 2009
EventSYSTOR 2009: The Israeli Experimental Systems Conference - Haifa, Israel
Duration: 4 May 20096 May 2009

Publication series

NameACM International Conference Proceeding Series


ConferenceSYSTOR 2009: The Israeli Experimental Systems Conference

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications


Dive into the research topics of 'The design of a similarity based deduplication system'. Together they form a unique fingerprint.

Cite this