Systems and methods for searching and storage of data

Michael Hirsch (Inventor), Haim Bitner (Inventor), Lior Aronovich (Inventor), Ron Aaher (Inventor), Eitan Bachmat (Inventor), Shmuel T Klein (Inventor)

Research output: Patent

Abstract

A method comprising identifying input data in repository data wherein the repository data comprises repository data chunks and the input data comprise input data chunks and wherein each repository data chunk has a corresponding set of repository data chunk distinguishing characteristics, each distinguishing characteristic being stored with an RDC characteristic location, the method including the steps of, for each input data chunk: determining a set of input data chunk distinguishing characteristics, each distinguishing characteristic having an IDC characteristic location; then comparing the determined set of IDCs to one or more sets of RDCs; identifying a repository data chunk that is similar to the input data chunk as a function of the comparing of the determined set of IDCs to the one or more sets of RDCs, wherein a repository data chunk is identified as similar when a predetermined number of the distinguishing characteristics in the set of IDCs is found to match in a set of RDCs; outputting the IDC and RDC locations of at least one pair of matching IDC and RDC; and computing at least one common section of the input data chunk and the identified similar repository data chunk using the at least one pair of matching IDC and RDC as an anchor to define corresponding intervals in the input data chunk and the identified similar repository data chunk.

Original languageEnglish
Patent numberEP1962209
IPCG06F 17/ 30 A I
Priority date15/09/05
StatePublished - 27 Aug 2008

Fingerprint

Dive into the research topics of 'Systems and methods for searching and storage of data'. Together they form a unique fingerprint.

Cite this