Multi-document summarization using tensor decomposition

Marina Litvak, Natalia Vanetik

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

The problem of extractive text summarization for a collection of documents is defined as selecting a small subset of sentences so the contents and meaning of the original document set are preserved in the best possible way. In this paper we present a new model for the problem of extractive summarization, where we strive to obtain a summary that preserves the information coverage as much as possible, when compared to the original document set. We construct a new tensor-based representation that describes the given document set in terms of its topics. We then rank topics via Tensor Decomposition, and compile a summary from the sentences of the highest ranked topics.

Original languageEnglish
Pages (from-to)581-589
Number of pages9
JournalComputacion y Sistemas
Volume18
Issue number3
DOIs
StatePublished - 1 Jul 2014
Externally publishedYes

Keywords

  • Multilingual multifocument summarization
  • Tensor decomposition

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Multi-document summarization using tensor decomposition'. Together they form a unique fingerprint.

Cite this