Polytope model for extractive summarization

Marina Litvak, Natalia Vanetik

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

The problem of text summarization for a collection of documents is defined as the problem of selecting a small subset of sentences so that the contents and meaning of the original document set are preserved in the best possible way. In this paper we present a linear model for the problem of text summarization, where we strive to obtain a summary that preserves the information coverage as much as possible in comparison to the original document set. We construct a system of linear inequalities that describes the given document set and its possible summaries and translate the problem of finding the best summary to the problem of finding the point on a convex polytope closest to the given hyperplane. This re-formulated problem can be solved efficiently with the help of quadratic programming.

Original languageEnglish
Title of host publicationKDIR 2012 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval
Pages281-286
Number of pages6
StatePublished - 1 Dec 2012
Externally publishedYes
Event4th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2012 - Barcelona, Spain
Duration: 4 Oct 20127 Oct 2012

Publication series

NameKDIR 2012 - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval

Conference

Conference4th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2012
Country/TerritorySpain
CityBarcelona
Period4/10/127/10/12

Keywords

  • Polytope model
  • Quadratic programming
  • Text summarization

ASJC Scopus subject areas

  • Information Systems

Fingerprint

Dive into the research topics of 'Polytope model for extractive summarization'. Together they form a unique fingerprint.

Cite this