Abstract
The problem of extractive text summarization for a collection of documents is defined as selecting a small subset of sentences so the contents and meaning of the original document set are preserved in the best possible way. In this paper we present a new model for the problem of extractive summarization, where we strive to obtain a summary that preserves the information coverage as much as possible, when compared to the original document set. We construct a new tensor-based representation that describes the given document set in terms of its topics. We then rank topics via Tensor Decomposition, and compile a summary from the sentences of the highest ranked topics.
Original language | English |
---|---|
Pages (from-to) | 581-589 |
Number of pages | 9 |
Journal | Computacion y Sistemas |
Volume | 18 |
Issue number | 3 |
DOIs | |
State | Published - 1 Jul 2014 |
Externally published | Yes |
Keywords
- Multilingual multifocument summarization
- Tensor decomposition
ASJC Scopus subject areas
- General Computer Science