Massively parallel computations of the LZ-complexity of strings

Alexander Belousov, Joel Ratsaby

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

We introduce a new parallel algorithm LZMP for computing the Lempel-Ziv complexity of a string of characters from a finite alphabet. The LZ-complexity is a mathematical quantity that is related to the amount of non-redundant information that a string contains. It has been recently shown to be useful in pattern recognition in measuring the distance between various kinds of input patterns [1],[2]. We implement the algorithm on a software parallel-computing platform (CUDA) which works with a Graphical Processing Unit (GPU) on a TESLA K20c board. The GPU has 2, 496 computing cores and 5G bytes of global memory. This platform enables high increase in computing performance. We exploit multiple places in the sequential version of the code and use CUDA to implement these code sections to run in parallel on the GPU cores. For example, the string XYZMXZXYZKR has a complexity of 7. The sequential version of the algorithm takes 27 time steps to compute this while the parallel version takes only 11 time steps. The advantage of the parallel implementation over the sequential one becomes more relevant as the string's length increases. Our results show that for single strings of length up to 5k bytes there is no significant improvement but for strings of length greater than 5k the speedup factor grows at a linear rate with respect to the string length. When we compute the LZ-complexity of multiple strings all of length 48k bytes in a parallel our results indicate that the speedup ratio is approximately 100. This is advantageous for pattern recognition where the object to be recognized can be represented as a collection of substrings and LZ-complexity based distance ca be computed to each of them in parallel.

Original languageEnglish
Title of host publication2014 IEEE 28th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781479959877
DOIs
StatePublished - 1 Jan 2014
Externally publishedYes
Event2014 28th IEEE Convention of Electrical and Electronics Engineers in Israel, IEEEI 2014 - Eilat, Israel
Duration: 3 Dec 20145 Dec 2014

Publication series

Name2014 IEEE 28th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2014

Conference

Conference2014 28th IEEE Convention of Electrical and Electronics Engineers in Israel, IEEEI 2014
Country/TerritoryIsrael
CityEilat
Period3/12/145/12/14

Fingerprint

Dive into the research topics of 'Massively parallel computations of the LZ-complexity of strings'. Together they form a unique fingerprint.

Cite this