Quality-Diversity Summarization with Unsupervised Autoencoders

Lei Li, Zuying Huang, Natalia Vanetik, Marina Litvak

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper introduces a novel perspective on unlabeled data driven technology for extractive summarization. Because unsupervised autoencoders, combined with neural network language models, help to capture deep semantic features for sentence quality, we propose to integrate autoencoders with sampling method based on Determinantal point processes (DPPs) [1] to extract diverse sentences with high qualities, and generate brief summaries. The unique fusion of unsupervised autoencoders and DPPs sampling has never been adopted before. We illustrate the advantages of this attempt against statistics based approaches through experiments in multilingual environment for single-document and multi-document summarization tasks. Our algorithms evaluated with ROUGE F-measure [2] obtain better scores in several varieties of languages on MMS-2015 dataset and MSS-2015 dataset.

Original languageEnglish
Title of host publicationArtificial Neural Networks and Machine Learning – ICANN 2019
Subtitle of host publicationText and Time Series - 28th International Conference on Artificial Neural Networks, 2019, Proceedings
EditorsIgor V. Tetko, Pavel Karpov, Fabian Theis, Vera Kurková
PublisherSpringer Verlag
Pages293-299
Number of pages7
ISBN (Print)9783030304898
DOIs
StatePublished - 1 Jan 2019
Externally publishedYes
Event28th International Conference on Artificial Neural Networks, ICANN 2019 - Munich, Germany
Duration: 17 Sep 201919 Sep 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11730 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference28th International Conference on Artificial Neural Networks, ICANN 2019
Country/TerritoryGermany
CityMunich
Period17/09/1919/09/19

Keywords

  • Autoencoders
  • DPPs
  • Summarization

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Quality-Diversity Summarization with Unsupervised Autoencoders'. Together they form a unique fingerprint.

Cite this