TY - GEN
T1 - Authorship attribution of micro-messages
AU - Schwartz, Roy
AU - Tsur, Oren
AU - Rappoport, Ari
AU - Koppel, Moshe
N1 - Funding Information:
The authors wish to acknowledge the work done by Mariam Amer and Arij Nabil, Graduate Research Assistants at the American University in Cairo in the development work of this paper.
Publisher Copyright:
© 2013 Association for Computational Linguistics.
PY - 2013/1/1
Y1 - 2013/1/1
N2 - Work on authorship attribution has traditionally focused on long texts. In this work, we tackle the question of whether the author of a very short text can be successfully identified. We use Twitter as an experimental testbed. We introduce the concept of an author's unique "signature", and show that such signatures are typical of many authors when writing very short texts. We also present a new authorship attribution feature ("flexible patterns") and demonstrate a significant improvement over our baselines. Our results show that the author of a single tweet can be identified with good accuracy in an array of flavors of the authorship attribution task.
UR - http://www.scopus.com/inward/record.url?scp=84906925986&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84906925986
T3 - EMNLP 2013 - 2013 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
SP - 1880
EP - 1891
BT - EMNLP 2013 - 2013 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013
Y2 - 18 October 2013 through 21 October 2013
ER -