TY - JOUR
T1 - Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model
AU - Fisher, Etan
AU - Tabrikian, Joseph
AU - Dubnov, Shlomo
N1 - Funding Information:
Manuscript received March 4, 2003; revised March 15, 2005. This work was supported in part by the Bi-national Science Foundation (BSF). The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Ramesh A. Gopinath.
PY - 2006/12/1
Y1 - 2006/12/1
N2 - In this paper, a novel method for voiced-unvoiced decision within a pitch tracking algorithm is presented. Voiced-unvoiced decision is required for many applications, including modeling for analysis/synthesis, detection of model changes for segmentation purposes and signal characterization for indexing and recognition applications. The proposed method is based on the generalized likelihood ratio test (GLRT) and assumes colored Gaussian noise with unknown covariance. Under voiced hypothesis, a harmonic plus noise model is assumed. The derived method is combined with a maximum a-posteriori probability (MAP) scheme to obtain a pitch and voicing tracking algorithm. The performance of the proposed method is tested using several speech databases for different levels of additive noise and phone speech conditions. Results show that the GLRT is robust to speaker and environmental conditions and performs better than existing algorithms.
AB - In this paper, a novel method for voiced-unvoiced decision within a pitch tracking algorithm is presented. Voiced-unvoiced decision is required for many applications, including modeling for analysis/synthesis, detection of model changes for segmentation purposes and signal characterization for indexing and recognition applications. The proposed method is based on the generalized likelihood ratio test (GLRT) and assumes colored Gaussian noise with unknown covariance. Under voiced hypothesis, a harmonic plus noise model is assumed. The derived method is combined with a maximum a-posteriori probability (MAP) scheme to obtain a pitch and voicing tracking algorithm. The performance of the proposed method is tested using several speech databases for different levels of additive noise and phone speech conditions. Results show that the GLRT is robust to speaker and environmental conditions and performs better than existing algorithms.
KW - Generalized likelihood ratio test (GLRT)
KW - Harmonic model
KW - Likelihood ratio test (LRT)
KW - Maximum a-posteriori probability
KW - Noisy speech
KW - Pitch tracking
KW - Voice activity detection (VAD)
KW - Voiced-unvoiced decision
UR - https://www.scopus.com/pages/publications/33847129521
U2 - 10.1109/TSA.2005.857806
DO - 10.1109/TSA.2005.857806
M3 - Article
AN - SCOPUS:33847129521
SN - 1558-7916
VL - 14
SP - 502
EP - 510
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
IS - 2
ER -