Abstract
An algorithm for unsupervised speaker classification using Kohonen SOM is presented. The system employs 6×10 SOM networks for each speaker and for non-speech segments. The algorithm was evaluated using high quality as well as telephone quality conversations between two speakers. Correct classification of more than 90% was demonstrated. High quality conversation between three speakers yielded 80% correct classification. The high quality speech required the use of 12th order cepstral coefficients vector. In telephone quality speech, additional 12 features of the difference of the cepstrum were required.
| Original language | English |
|---|---|
| Pages | 578-587 |
| Number of pages | 10 |
| State | Published - 1 Dec 1997 |
| Event | Proceedings of the 1997 7th IEEE Workshop on Neural Networks for Signal Processing, NNSP'97 - Amelia Island, FL, USA Duration: 24 Sep 1997 → 26 Sep 1997 |
Conference
| Conference | Proceedings of the 1997 7th IEEE Workshop on Neural Networks for Signal Processing, NNSP'97 |
|---|---|
| City | Amelia Island, FL, USA |
| Period | 24/09/97 → 26/09/97 |
ASJC Scopus subject areas
- Signal Processing
- Software
- Electrical and Electronic Engineering