TY - GEN
T1 - Feature selection for room volume identification from room impulse response
AU - Shabtai, Noam R.
AU - Zigel, Yaniv
AU - Rafaely, Boaz
PY - 2009/12/1
Y1 - 2009/12/1
N2 - The room impulse response (RIR) can be used to calculate many room acoustical parameters, such as the reverberation time (RT). However, estimating the room volume, another important room parameter, from the RIR is typically a more difficult task requiring extraction of other features from the RIR. Most of the existing fully-blind methods for estimating the room volume from the RIR do not combine features from different feature sets. This can be one reason to the fact that these methods are sensitive to differences in source-to-receiver distance and wall reflection coefficients. We propose a new approach in which hypothetical-volume room models are trained with room volume features from different feature sets. Estimation is performed by identifying the hypothesis with maximum-likelihood (ML) using background model normalization. The different feature sets are compared using equal error rate (EER) of hypothesis verification. A combination of features from the different feature sets is selected so that minimum EER is achieved. Using the selected features, we achieve average detection rate of 98.8% with a standard deviation (STD) of 1.5% for eight rooms with different volumes, source-to-receiver distances, and wall reflection coefficients.
AB - The room impulse response (RIR) can be used to calculate many room acoustical parameters, such as the reverberation time (RT). However, estimating the room volume, another important room parameter, from the RIR is typically a more difficult task requiring extraction of other features from the RIR. Most of the existing fully-blind methods for estimating the room volume from the RIR do not combine features from different feature sets. This can be one reason to the fact that these methods are sensitive to differences in source-to-receiver distance and wall reflection coefficients. We propose a new approach in which hypothetical-volume room models are trained with room volume features from different feature sets. Estimation is performed by identifying the hypothesis with maximum-likelihood (ML) using background model normalization. The different feature sets are compared using equal error rate (EER) of hypothesis verification. A combination of features from the different feature sets is selected so that minimum EER is achieved. Using the selected features, we achieve average detection rate of 98.8% with a standard deviation (STD) of 1.5% for eight rooms with different volumes, source-to-receiver distances, and wall reflection coefficients.
UR - http://www.scopus.com/inward/record.url?scp=77950179467&partnerID=8YFLogxK
U2 - 10.1109/ASPAA.2009.5346458
DO - 10.1109/ASPAA.2009.5346458
M3 - Conference contribution
AN - SCOPUS:77950179467
SN - 9781424436798
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 249
EP - 252
BT - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
T2 - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Y2 - 18 October 2009 through 21 October 2009
ER -