Speakers clustering with stochastic VQ and clustering quality estimator

  • Yishai Cohen
  • , Itshak Lapidot

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Short segments speaker clustering has significant importance both for diarization and applications such as short push-To-Tatk (PTT) segments clustering. In this paper we present a new way to cluster speech segments by applying a stochastic vector quantization (VQ) with a cosine metric together with a speaker clustering quality estimator based on logistic regression. The VQ is performed on codebooks of different sizes, and the choice of the best clustering result is estimated using logistic regression. The algorithm is tested on a large range of speakers, between 2 to 60. The results are compared to those of the mean-shift clustering method, which was already tested for this task several times. The results are a bit below those of the cosine similarity measure-based mean-shift clustering. The advantage is in the run-Time which is approximately 10 times faster.

Original languageEnglish
Title of host publication2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018
PublisherInstitute of Electrical and Electronics Engineers
ISBN (Electronic)9781538663783
DOIs
StatePublished - 2 Jul 2018
Externally publishedYes
Event2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018 - Eilat, Israel
Duration: 12 Dec 201814 Dec 2018

Publication series

Name2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018

Conference

Conference2018 IEEE International Conference on the Science of Electrical Engineering in Israel, ICSEE 2018
Country/TerritoryIsrael
CityEilat
Period12/12/1814/12/18

Keywords

  • clustering quality estimation
  • cosine metric
  • logistic regression
  • mean-shift
  • speaker clustering
  • vector quantization (VQ)

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Speakers clustering with stochastic VQ and clustering quality estimator'. Together they form a unique fingerprint.

Cite this