Extracting user profiles from large scale data

  • Michal Shmueli-Scheuer
  • , Haggai Roitman
  • , David Carmel
  • , Yosi Mass
  • , David Konopnicki

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In this work we present the details of a large scale user profiling framework that we developed here in IBM on top of Apache Hadoop. We address the problem of extracting and maintaining a very large number of user profiles from large scale data. We first describe an efficient user profiling framework with high user profiling quality guarantees. We then describe a scalable implementation of the proposed framework in Apache Hadoop and discuss its challenges.

Original languageEnglish
Title of host publicationProceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, MDAC 2010, in Association with the 19th Annual World Wide Web Conference, WWW2010
DOIs
StatePublished - 16 Jul 2010
Externally publishedYes
Event2010 Workshop on Massive Data Analytics on the Cloud, MDAC 2010, in Association with the 19th Annual World Wide Web Conference, WWW2010 - Raleigh, NC, United States
Duration: 26 Apr 201026 Apr 2010

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2010 Workshop on Massive Data Analytics on the Cloud, MDAC 2010, in Association with the 19th Annual World Wide Web Conference, WWW2010
Country/TerritoryUnited States
CityRaleigh, NC
Period26/04/1026/04/10

Keywords

  • Hadoop
  • large scale
  • user profile

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Extracting user profiles from large scale data'. Together they form a unique fingerprint.

Cite this