Inequality in utility of data and its implications for data management

Adir Even, G. Shankaranarayanan, Paul D. Berger

Research output: Contribution to conferencePaperpeer-review

5 Scopus citations

Abstract

The volumes and complexity of organizational data-resources are rapidly increasing and hence demand larger investments in data-management solutions. We posit that justifying these investments needs a better understanding of the utility of data - the business value gained by using it. As a first step in this direction, we explore utility inequality - the extent to which records in a large dataset differ in their business-value contribution. The distribution of utility and the magnitude of utility inequality in a data resource have important implications for data management. These can impact design and administration of the data resource, inform data acquisition and retention policies, and assist in prioritizing quality improvement efforts. We propose analytical tools for modeling and quantifying inequality, and demonstrate their application for assessing inequality in a large alumni data repository. We also examine the implications of utility inequality for data-management decisions associated with this repository.

Original languageEnglish
Pages164-169
Number of pages6
StatePublished - 1 Jan 2007
Externally publishedYes
Event17th Workshop on Information Technologies and Systems, WITS 2007 - Montreal, QC, Canada
Duration: 8 Dec 20079 Dec 2007

Conference

Conference17th Workshop on Information Technologies and Systems, WITS 2007
Country/TerritoryCanada
CityMontreal, QC
Period8/12/079/12/07

ASJC Scopus subject areas

  • Information Systems

Fingerprint

Dive into the research topics of 'Inequality in utility of data and its implications for data management'. Together they form a unique fingerprint.

Cite this