TY - GEN
T1 - Recommending citations
T2 - 21st ACM International Conference on Information and Knowledge Management, CIKM 2012
AU - Huang, Wenyi
AU - Kataria, Saurabh
AU - Caragea, Cornelia
AU - Mitra, Prasenjit
AU - Giles, C. Lee
AU - Rokach, Lior
PY - 2012/12/19
Y1 - 2012/12/19
N2 - When we write or prepare to write a research paper, we always have appropriate references in mind. However, there are most likely references we have missed and should have been read and cited. As such a good citation recommendation system would not only improve our paper but, overall, the efficiency and quality of literature search. Usually, a citation's context contains explicit words explaining the citation. Using this, we propose a method that "translates" research papers into references. By considering the citations and their contexts from existing papers as parallel data written in two different "languages", we adopt the translation model to create a relationship between these two "vocabularies". Experiments on both CiteSeer and CiteULike dataset show that our approach outperforms other baseline methods and increase the precision, recall and f-measure by at least 5% to 10%, respectively. In addition, our approach runs much faster in the both training and recommending stage, which proves the effectiveness and the scalability of our work.
AB - When we write or prepare to write a research paper, we always have appropriate references in mind. However, there are most likely references we have missed and should have been read and cited. As such a good citation recommendation system would not only improve our paper but, overall, the efficiency and quality of literature search. Usually, a citation's context contains explicit words explaining the citation. Using this, we propose a method that "translates" research papers into references. By considering the citations and their contexts from existing papers as parallel data written in two different "languages", we adopt the translation model to create a relationship between these two "vocabularies". Experiments on both CiteSeer and CiteULike dataset show that our approach outperforms other baseline methods and increase the precision, recall and f-measure by at least 5% to 10%, respectively. In addition, our approach runs much faster in the both training and recommending stage, which proves the effectiveness and the scalability of our work.
KW - citation recommendation
KW - machine translation
UR - http://www.scopus.com/inward/record.url?scp=84871049973&partnerID=8YFLogxK
U2 - 10.1145/2396761.2398542
DO - 10.1145/2396761.2398542
M3 - Conference contribution
AN - SCOPUS:84871049973
SN - 9781450311564
T3 - ACM International Conference Proceeding Series
SP - 1910
EP - 1914
BT - CIKM 2012 - Proceedings of the 21st ACM International Conference on Information and Knowledge Management
Y2 - 29 October 2012 through 2 November 2012
ER -