A continuous Markov-Chain model of data quality transition: Application in insurance-claim handling

Yuval Zak, Adir Even

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data quality (DQ) might degrade over time, due to changes in realworld entities or behaviors that are not reflected correctly in datasets that describe them. This study presents a continuous-time Markov-Chain model that reflects DQ as a dynamic process. The model may help assessing and predicting accuracy degradation over time. Taking into account cost-benefit tradeoffs, it can also be used to recommend an economically-optimal point in time at which data values should be evaluated and possibly reacquired. The model addresses data-acquisition scenarios that reflect real-world processes with a finite number of states, each described by certain data-attribute values. It takes into account state-transition probabilities, the distribution of time spent in each state, the damage associated with incorrect data that fails to reflect the real-world state, and the cost of data reacquisition. Given current state and the time passed since the last transition, the model estimates the expected damage of a data record and recommends whether or not to correct it, by comparing the potential benefits of correction (elimination of potential damage), versus reacquisition cost. Following common design science research guidelines, the applicability and the potential contribution of the model is demonstrated with a real-world dataset that reflects a process of handling insurance claims. Insurants' status must be kept up-to-date, to avoid potential monetary damages; however, contacting an insurant for status update is costly and time consuming. Currently the contact decision is guided by some heuristics that are based on employees' experience. The evaluation shows that applying the model has major cost-saving potential, compared to the current state.

Original languageEnglish
Title of host publicationNew Horizons in Design Science
Subtitle of host publicationBroadening the Research Agenda - 10th International Conference, DESRIST 2015, Proceedings
EditorsBrian Donnellan, Debra VanderMeer, Jim Kenneally, Robert Winter, Marcus Rothenberger, Markus Helfert
PublisherSpringer Verlag
Pages199-214
Number of pages16
ISBN (Electronic)9783319187136
DOIs
StatePublished - 1 Jan 2015
Event10th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2015 - Dublin, Ireland
Duration: 20 May 201522 May 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9073
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2015
Country/TerritoryIreland
CityDublin
Period20/05/1522/05/15

Keywords

  • Accuracy
  • Continuous-Time Markov Chain
  • Data Quality
  • Design Science Research

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Fingerprint

Dive into the research topics of 'A continuous Markov-Chain model of data quality transition: Application in insurance-claim handling'. Together they form a unique fingerprint.

Cite this