TY - GEN
T1 - Folksonomy-based term extraction for word cloud generation
AU - Carmel, David
AU - Uziel, Erel
AU - Guy, Ido
AU - Mass, Yosi
AU - Roitman, Haggai
PY - 2011/12/13
Y1 - 2011/12/13
N2 - In this work we study the task of term extraction for word cloud generation. We present a folksonomy-based term extraction method, called tag-boost, which boosts terms that are frequently used by the public to tag content. Our experiments with tag-boost-based term extraction over different domains demonstrate tremendous improvement in word cloud quality, as reflected by the agreement between extracted terms and manually assigned tags of the testing items. Additionally, we show that tag-boost can be effectively applied even in non-tagged domains, by using an external rich folksonomy borrowed from a well-tagged domain.
AB - In this work we study the task of term extraction for word cloud generation. We present a folksonomy-based term extraction method, called tag-boost, which boosts terms that are frequently used by the public to tag content. Our experiments with tag-boost-based term extraction over different domains demonstrate tremendous improvement in word cloud quality, as reflected by the agreement between extracted terms and manually assigned tags of the testing items. Additionally, we show that tag-boost can be effectively applied even in non-tagged domains, by using an external rich folksonomy borrowed from a well-tagged domain.
KW - tag-boost
KW - term extraction
KW - word-cloud generation
UR - http://www.scopus.com/inward/record.url?scp=83055161464&partnerID=8YFLogxK
U2 - 10.1145/2063576.2063986
DO - 10.1145/2063576.2063986
M3 - Conference contribution
AN - SCOPUS:83055161464
SN - 9781450307178
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 2437
EP - 2440
BT - CIKM'11 - Proceedings of the 2011 ACM International Conference on Information and Knowledge Management
T2 - 20th ACM Conference on Information and Knowledge Management, CIKM'11
Y2 - 24 October 2011 through 28 October 2011
ER -