TY - GEN
T1 - Realtime multiple-pitch and multiple-instrument recognition for music signals using sparse non-negative constraints
AU - Cont, Arshia
AU - Dubnov, Shlomo
AU - Wessel, David
PY - 2007/1/1
Y1 - 2007/1/1
N2 - In this paper we introduce a simple and fast method for realtime recognition of multiple pitches produced by multiple musical instruments. Our proposed method is based on two important facts: (1) that timbral information of any instrument is pitch-dependant and (2) that the modulation spectrum of the same pitch seems to result into a persistent representation of the characteristics of the instrumental family. Using these basic facts, we construct a learning algorithm to obtain pitch templates of all possible notes on various instruments and then devise an online algorithm to decompose a realtime audio buffer using the learned templates. The learning and decomposition proposed here are inspired by non-negative matrix factorization methods but differ by introduction of an explicit sparsity control. Our test results show promising recognition rates for a realtime system on real music recordings. We discuss further improvements that can be made over the proposed system.
AB - In this paper we introduce a simple and fast method for realtime recognition of multiple pitches produced by multiple musical instruments. Our proposed method is based on two important facts: (1) that timbral information of any instrument is pitch-dependant and (2) that the modulation spectrum of the same pitch seems to result into a persistent representation of the characteristics of the instrumental family. Using these basic facts, we construct a learning algorithm to obtain pitch templates of all possible notes on various instruments and then devise an online algorithm to decompose a realtime audio buffer using the learned templates. The learning and decomposition proposed here are inspired by non-negative matrix factorization methods but differ by introduction of an explicit sparsity control. Our test results show promising recognition rates for a realtime system on real music recordings. We discuss further improvements that can be made over the proposed system.
UR - https://www.scopus.com/pages/publications/80052992790
M3 - Conference contribution
AN - SCOPUS:80052992790
SN - 9788890147913
T3 - Proceedings of the International Conference on Digital Audio Effects, DAFx
SP - 85
EP - 92
BT - Proceedings of the 10th International Conference on Digital Audio Effects, DAFx 2007
T2 - 10th International Conference on Digital Audio Effects, DAFx 2007
Y2 - 10 September 2007 through 15 September 2007
ER -