Generalized Viterbi-based models for time-series segmentation applied to speaker diarization

  • Itshak Lapidot
  • , Jean Francois Bonastre

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Time-series clustering is a process which takes into account the input samples chronological sequence. So, in time-series clustering, the samples are not processed independently as a result for a given sample depends on the clustering result of the whole sequence. One of the popular clustering algorithms to handle such dependency is the well-known Hidden-Markov-Model (HMM) trained by the Viterbi statistics. In this work we propose a generalization of the broadly used HMM, denoted Hidden-Distortion-Models (HDMs). Our proposal is based on distortion-based models and transition count, for which probabilistic calculations are no longer mandatory. We will introduce our approach by its mathematical bases. It will be shown that Viterbi based HMM can be seen as a special case of HDM. This proximity allows to us to apply similar approaches for state-model training when the new paradigm is used to learn the sequence dependencies. Speaker diarization application will be presented to show the advantages of the HDM as a clustering algorithm.

Original languageEnglish
Title of host publicationOdyssey 2012 - Speaker and Language Recognition Workshop
EditorsHaizhou Li, Bin Ma, Kong Aik Lee
PublisherChinese and Oriental Languages Information Processing Society (COLIPS), Speaker and Language Characterization SIG
Pages138-145
Number of pages8
ISBN (Electronic)9789810730932
StatePublished - 1 Jan 2012
Externally publishedYes
EventSpeaker and Language Recognition Workshop, Odyssey 2012 - Singapore, Singapore
Duration: 25 Jun 201228 Jun 2012

Publication series

NameOdyssey 2012 - Speaker and Language Recognition Workshop

Conference

ConferenceSpeaker and Language Recognition Workshop, Odyssey 2012
Country/TerritorySingapore
CitySingapore
Period25/06/1228/06/12

ASJC Scopus subject areas

  • Signal Processing
  • Software
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'Generalized Viterbi-based models for time-series segmentation applied to speaker diarization'. Together they form a unique fingerprint.

Cite this