Abstract
An algorithm for unsupervised speaker classification using Kohonen SOM is presented. The system employs 6×10 SOM networks for each speaker and for non-speech segments. The algorithm was evaluated using high quality as well as telephone quality conversations between two speakers. Correct classification of more than 90% was demonstrated. High quality conversation between three speakers yielded 80% correct classification. The high quality speech required the use of 12th order cepstral coefficients vector. In telephone quality speech, additional 12 features of the difference of the cepstrum were required.
Original language | English |
---|---|
Pages | 578-587 |
Number of pages | 10 |
State | Published - 1 Dec 1997 |
Event | Proceedings of the 1997 7th IEEE Workshop on Neural Networks for Signal Processing, NNSP'97 - Amelia Island, FL, USA Duration: 24 Sep 1997 → 26 Sep 1997 |
Conference
Conference | Proceedings of the 1997 7th IEEE Workshop on Neural Networks for Signal Processing, NNSP'97 |
---|---|
City | Amelia Island, FL, USA |
Period | 24/09/97 → 26/09/97 |
ASJC Scopus subject areas
- Signal Processing
- Software
- Electrical and Electronic Engineering