Folksonomy-based term extraction for word cloud generation

David Carmel, Erel Uziel, Ido Guy, Yosi Mass, Haggai Roitman

Research output: Contribution to journal › Article › peer-review

20 Scopus citations


In this work we study the task of term extraction for word cloud generation in sparsely tagged domains, in which manual tags are scarce. We present a folksonomy-based term extraction method, called tag-boost, which boosts terms that are frequently used by the public to tag content. Our experiments with tag-boost based term extraction over different domains demonstrate tremendous improvement in word cloud quality, as reflected by the agreement between manual tags of the testing items and the cloud's terms extracted from the items' content. Moreover, our results demonstrate the high robustness of this approach, as compared to alternative cloud generation methods that exhibit a high sensitivity to data sparseness. Additionally, we show that tag-boost can be effectively applied even in nontagged domains, by using an external rich folksonomy borrowed from a well-tagged domain.
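The boosting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' formula: the abstract does not give the scoring function, so the logarithmic boost, the `boost_weight` parameter, and all function names below are assumptions chosen only for illustration. The sketch scores each term by its frequency in an item's content, multiplies that score by a factor that grows with how often the term appears as a tag in a folksonomy, and then measures agreement between the resulting cloud and an item's manual tags as simple precision.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tag_boost_scores(text, tag_counts, boost_weight=1.0):
    """Score terms by in-text frequency, boosted by folksonomy tag usage.

    tag_counts maps a term to the number of times the public used it as a tag.
    The log-based boost is an illustrative assumption, not the paper's formula.
    """
    term_freq = Counter(tokenize(text))
    scores = {}
    for term, freq in term_freq.items():
        boost = 1.0 + boost_weight * math.log1p(tag_counts.get(term, 0))
        scores[term] = freq * boost
    return scores


def top_terms(scores, k):
    """Return the k highest-scoring terms, breaking ties alphabetically."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def precision(cloud_terms, manual_tags):
    """Fraction of cloud terms that also appear among the item's manual tags."""
    if not cloud_terms:
        return 0.0
    hits = sum(1 for term in cloud_terms if term in manual_tags)
    return hits / len(cloud_terms)


if __name__ == "__main__":
    # Hypothetical item content and folksonomy counts, invented for the demo.
    content = "the jazz concert featured the band and the jazz trio"
    folksonomy = {"jazz": 50, "concert": 20, "band": 5}

    plain = top_terms(tag_boost_scores(content, {}), 2)
    boosted = top_terms(tag_boost_scores(content, folksonomy), 2)
    print("without boost:", plain)
    print("with boost:   ", boosted)
    print("precision:    ", precision(boosted, {"jazz", "music"}))
```

With an empty folksonomy the ranking falls back to raw term frequency, so function words such as "the" dominate. Passing a folksonomy borrowed from another domain corresponds loosely to the non-tagged setting mentioned in the abstract.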

Original language: English
Article number: 60
Journal: ACM Transactions on Intelligent Systems and Technology
Issue number: 4
State: Published - 1 Sep 2012
Externally published: Yes


Keywords

  • Keyword extraction
  • Tag-boost
  • Tag-cloud generation

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Artificial Intelligence

