TY - GEN
T1 - The Lombard Effect's influence on automatic speaker verification systems and methods for its compensation
AU - Goldenberg, Roman
AU - Cohen, Arnon
AU - Shallom, Ilan
PY - 2006/1/1
Y1 - 2006/1/1
N2 - Speaker identification/verification applications have progressed significantly during the last few years. Performance levels of between 70% - 99% success in speaker recognition systems are normal, depending on the type of application and quality of the signal. Several techniques for robust speaker recognition have been developed. Until now, however, the problem posed by variations in speech characteristics due to acoustical noise has not been thoroughly investigated in the context of speaker recognition. The change a noisy acoustic environment can produce in speech signal parameters is known as the "Lombard Effect." In this paper the Lombard Effect's influence on speaker verification system performance is investigated and several compensation methods are proposed. The verification system is based on a 24 Gaussian Mixture Model (GMM) and speech feature orders of 12 to 60. It was found that, based on the mean Equal Error Rate (EER), verification performance deteriorated by 10.1% (from 3.8% to 13.9%) relative to speech veriflcation in a normal environment due to the Lombard Effect. Two types of Lombard Effect compensation methods are proposed. The first is based on robust speech features that are resistant to the Lombard Effect. The second is based on studying how the Lombard Effect changes speech feature and then transforming the Lombard affected speech back to normal speech. The proposed methods significantly reduce speaker veriflcation system error rates. An improvement in the EER of up to 5.4% (from 13.5% to 8.5%) was achieved,
AB - Speaker identification/verification applications have progressed significantly during the last few years. Performance levels of between 70% - 99% success in speaker recognition systems are normal, depending on the type of application and quality of the signal. Several techniques for robust speaker recognition have been developed. Until now, however, the problem posed by variations in speech characteristics due to acoustical noise has not been thoroughly investigated in the context of speaker recognition. The change a noisy acoustic environment can produce in speech signal parameters is known as the "Lombard Effect." In this paper the Lombard Effect's influence on speaker verification system performance is investigated and several compensation methods are proposed. The verification system is based on a 24 Gaussian Mixture Model (GMM) and speech feature orders of 12 to 60. It was found that, based on the mean Equal Error Rate (EER), verification performance deteriorated by 10.1% (from 3.8% to 13.9%) relative to speech veriflcation in a normal environment due to the Lombard Effect. Two types of Lombard Effect compensation methods are proposed. The first is based on robust speech features that are resistant to the Lombard Effect. The second is based on studying how the Lombard Effect changes speech feature and then transforming the Lombard affected speech back to normal speech. The proposed methods significantly reduce speaker veriflcation system error rates. An improvement in the EER of up to 5.4% (from 13.5% to 8.5%) was achieved,
UR - http://www.scopus.com/inward/record.url?scp=47849083819&partnerID=8YFLogxK
U2 - 10.1109/ITRE.2006.381571
DO - 10.1109/ITRE.2006.381571
M3 - Conference contribution
AN - SCOPUS:47849083819
SN - 1424408598
SN - 9781424408597
T3 - ITRE 2006 - 4th International Conference on Information Technology: Research and Education, Proceedings
SP - 233
EP - 237
BT - ITRE 2006 - 4th International Conference on Information Technology
PB - Institute of Electrical and Electronics Engineers
T2 - ITRE 2006 - 4th International Conference on Information Technology: Research and Education
Y2 - 17 October 2006 through 18 October 2006
ER -