Towards collaborative data analysis with diverse crowds – a design science approach

  • Michael Feldman
  • Cristian Anastasiu
  • Abraham Bernstein

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

1 Scopus citation

Abstract

Recent years have seen a growing shortage of data experts who can analyze the ever-present data and produce meaningful insights. Moreover, some data scientists report that data preprocessing takes up to 80% of total project time. This paper proposes a method for collaborative data analysis that involves a crowd without data analysis expertise. Orchestrated by an expert, a team of novices conducts the analysis by iteratively refining results until it is successfully completed. To evaluate the proposed method, we implemented a tool that supports collaborative data analysis for teams with mixed levels of expertise. Our evaluation demonstrates that, with proper guidance, data analysis tasks, especially preprocessing, can be distributed to and successfully accomplished by non-experts. Following the design science approach, iterative development also revealed several important features for the collaboration tool, such as support for dynamic development, code deliberation, and a project journal. We thereby pave the way for building tools that can leverage the crowd to address the shortage of data analysts.

Original language: English
Title of host publication: Designing for a Digital and Globalized World - 13th International Conference, DESRIST 2018, Proceedings
Editors: Samir Chatterjee, Kaushik Dutta, Rangaraja P. Sundarraj
Publisher: Springer Verlag
Pages: 218-235
Number of pages: 18
ISBN (Print): 9783319917993
State: Published - 1 Jan 2018
Externally published: Yes
Event: 13th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2018 - Chennai, India
Duration: 3 Jun 2018 to 6 Jun 2018

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 10844 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 13th International Conference on Design Science Research in Information Systems and Technology, DESRIST 2018
Country/Territory: India
City: Chennai
Period: 3/06/18 to 6/06/18

Keywords

  • Collaborative data analysis
  • Crowdsourcing
  • Design science

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
