PROX: Approximated summarization of data provenance

Eleanor Ainy, Pierre Bourhis, Susan B. Davidson, Daniel Deutch, Tova Milo

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citation


Many modern applications collect large amounts of data from multiple sources and then aggregate and manipulate it in intricate ways. The complexity of such applications, combined with the size of the collected data, makes it difficult to understand the application logic and how information was derived. Data provenance has proven helpful in this respect in various contexts; however, maintaining and presenting the full and exact provenance may be infeasible because of its size and complex structure. We therefore introduce the notion of approximated summarized provenance, where we seek a compact representation of the provenance at the possible cost of information loss. Based on this notion, we have developed PROX, a system for the management, presentation and use of data provenance for complex applications. We propose to demonstrate PROX in the context of a movie-rating crowdsourcing system, letting participants view provenance summaries and use them to gain insights into the application and its underlying data.
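
The trade-off described in the abstract, a smaller provenance representation at the cost of some detail, can be illustrated with a toy sketch. This is not PROX's algorithm or data model; the provenance terms, the user-to-group mapping, and the function names below are all hypothetical and chosen only for illustration. The sketch replaces fine-grained user annotations with coarser group annotations and merges terms that become identical.

```python
from collections import Counter

# Hypothetical provenance for one movie's aggregate rating:
# each term records which user contributed which rating.
provenance = [
    ("alice", 5),
    ("bob", 5),
    ("carol", 3),
    ("dave", 3),
    ("erin", 5),
]

# Hypothetical mapping from user annotations to coarser group annotations.
user_group = {
    "alice": "critics",
    "bob": "critics",
    "carol": "audience",
    "dave": "audience",
    "erin": "audience",
}


def summarize(terms, mapping):
    """Replace each user annotation by its group and merge identical terms.

    Returns a Counter mapping (group, rating) to the number of merged terms.
    """
    return Counter((mapping[user], rating) for user, rating in terms)


def average_rating(summary):
    """Recompute the average rating from the summarized provenance."""
    total = sum(rating * count for (_, rating), count in summary.items())
    n = sum(summary.values())
    return total / n


summary = summarize(provenance, user_group)
print(len(provenance), "terms summarized into", len(summary))
print(dict(summary))
print("average rating:", average_rating(summary))
```

In this toy case the average rating can still be recomputed from the summary, but the identity of the individual contributors is lost, which is the kind of information loss the abstract refers to. A coarser mapping would shrink the summary further and lose more detail.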

Original language: English
Title of host publication: Advances in Database Technology - EDBT 2016
Subtitle of host publication: 19th International Conference on Extending Database Technology, Proceedings
Editors: Ioana Manolescu, Evaggelia Pitoura, Amelie Marian, Sofian Maabout, Letizia Tanca, Georgia Koutrika, Kostas Stefanidis
Number of pages: 4
ISBN (Electronic): 9783893180707
State: Published - 1 Jan 2016
Externally published: Yes
Event: 19th International Conference on Extending Database Technology, EDBT 2016 - Bordeaux, France
Duration: 15 Mar 2016 → 18 Mar 2016

Publication series

Name: Advances in Database Technology - EDBT
ISSN (Electronic): 2367-2005


Conference: 19th International Conference on Extending Database Technology, EDBT 2016

ASJC Scopus subject areas

  • Information Systems
  • Software
  • Computer Science Applications

