Perceptual time varying linear prediction model for speech applications

Oron Gamliel, Ilan D. Shallom

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

5 Scopus citations

Abstract

A new perceptual time-varying model for non-stationary analysis of speech signals is presented. Previous studies have shown that applying the Time Varying Linear Prediction Coding (TVLPC) model to speech signals improves the recognition performance of Automatic Speech Recognition (ASR) systems. This improvement is attributed to the incorporation of speech dynamics information into the model. Separately, Perceptual Linear Prediction (PLP) analysis of speech has shown that a modified estimate of the autocorrelation function (ACF) of a stationary speech frame yields a major improvement in recognition rate. The presented model, Perceptual Time Varying Linear Prediction (PTVLP) analysis of speech, incorporates the perceptual approach to ACF estimation into the TVLPC model. This research shows that the proposed PTVLP model is more accurate and more robust to noise, and achieves better recognition rates than PLP and TVLPC over a wide SNR range.
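
The abstract does not give the model equations. As a rough illustration of the general time-varying linear prediction idea it builds on, the sketch below assumes that each predictor coefficient is a low-order polynomial in time within a frame and fits the expansion weights by ordinary least squares. The polynomial basis, the least-squares criterion, and all function names (`tvlp_fit`, `tvlp_coeffs`, `tvlp_residual`, `synth_tvar`) are assumptions of this sketch, not the authors' PTVLP method. No perceptual (PLP-style) processing is included.

```python
import numpy as np

def tvlp_coeffs(c, N):
    """Evaluate a_k(n) = sum_i c[k-1, i] * (n / N)**i for n = 0..N-1.

    Returns an array of shape (N, order).
    """
    t = np.arange(N) / N                                # normalized time in [0, 1)
    basis = np.vander(t, c.shape[1], increasing=True)   # (N, n_basis)
    return basis @ c.T

def tvlp_fit(x, order=2, n_basis=2):
    """Least-squares fit of x[n] ~ sum_k a_k(n) * x[n-k] (assumed model).

    Returns the expansion weights c with shape (order, n_basis).
    """
    x = np.asarray(x, dtype=float)
    N = len(x)
    n = np.arange(order, N)
    basis = np.vander(n / N, n_basis, increasing=True)  # (N-order, n_basis)
    # Lagged samples x[n-1], ..., x[n-order] as columns
    lagged = np.stack([x[n - k] for k in range(1, order + 1)], axis=1)
    # One regressor column per (lag k, basis index i): x[n-k] * (n/N)**i
    A = (lagged[:, :, None] * basis[:, None, :]).reshape(len(n), order * n_basis)
    c, *_ = np.linalg.lstsq(A, x[n], rcond=None)
    return c.reshape(order, n_basis)

def tvlp_residual(x, c):
    """Prediction error x[n] - sum_k a_k(n) * x[n-k] for n >= order."""
    x = np.asarray(x, dtype=float)
    order = c.shape[0]
    a = tvlp_coeffs(c, len(x))
    n = np.arange(order, len(x))
    pred = sum(a[n, k - 1] * x[n - k] for k in range(1, order + 1))
    return x[n] - pred

def synth_tvar(c, N, noise_std=0.0, seed=0):
    """Synthesize a test signal from known weights c (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    order = c.shape[0]
    a = tvlp_coeffs(c, N)
    e = noise_std * rng.standard_normal(N)
    x = np.zeros(N)
    x[:order] = 1.0                                     # arbitrary initial state
    for n in range(order, N):
        # x[n-order:n][::-1] is [x[n-1], ..., x[n-order]]
        x[n] = a[n] @ x[n - order:n][::-1] + e[n]
    return x

# Example: a second-order resonance whose first coefficient drifts linearly.
c_true = np.array([[1.0, 0.8], [-0.98, 0.0]])
x = synth_tvar(c_true, 400, noise_std=0.1, seed=1)
c_tv = tvlp_fit(x, order=2, n_basis=2)   # time-varying fit
c_st = tvlp_fit(x, order=2, n_basis=1)   # stationary fit (constant coefficients)
print("time-varying residual energy:", np.sum(tvlp_residual(x, c_tv) ** 2))
print("stationary residual energy:  ", np.sum(tvlp_residual(x, c_st) ** 2))
```

Setting `n_basis=1` reduces the fit to a conventional stationary predictor over the frame. Because that model is nested inside the time-varying one, the time-varying fit cannot have a larger squared residual on the same data.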

Original language: English
Title of host publication: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
Pages: 4601-4604
Number of pages: 4
State: Published - 23 Sep 2009
Event: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009 - Taipei, Taiwan, Province of China
Duration: 19 Apr 2009 to 24 Apr 2009

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print): 1520-6149

Conference

Conference: 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Country/Territory: Taiwan, Province of China
City: Taipei
Period: 19/04/09 to 24/04/09

Keywords

  • Auto regressive
  • HMM
  • PLP
  • PSD
  • TVLPC

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering
