Authorship attribution of micro-messages

Roy Schwartz, Oren Tsur, Ari Rappoport, Moshe Koppel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

88 Scopus citations

Abstract

Work on authorship attribution has traditionally focused on long texts. In this work, we tackle the question of whether the author of a very short text can be successfully identified. We use Twitter as an experimental testbed. We introduce the concept of an author's unique "signature", and show that such signatures are typical of many authors when writing very short texts. We also present a new authorship attribution feature ("flexible patterns") and demonstrate a significant improvement over our baselines. Our results show that the author of a single tweet can be identified with good accuracy in an array of flavors of the authorship attribution task.

Original languageEnglish
Title of host publicationEMNLP 2013 - 2013 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages1880-1891
Number of pages12
ISBN (Electronic)9781937284978
StatePublished - 1 Jan 2013
Externally publishedYes
Event2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013 - Seattle, United States
Duration: 18 Oct 201321 Oct 2013

Publication series

NameEMNLP 2013 - 2013 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

Conference

Conference2013 Conference on Empirical Methods in Natural Language Processing, EMNLP 2013
Country/TerritoryUnited States
CitySeattle
Period18/10/1321/10/13

Fingerprint

Dive into the research topics of 'Authorship attribution of micro-messages'. Together they form a unique fingerprint.

Cite this