Abstract
A new approach for initial assignment of data in a speaker clustering application is presented. This approach employs Segmental K-Means clustering algorithm prior to competitive based learning. The clustering system relies on Self-Organizing Maps (SOM) for speaker modeling and as a likelihood estimator. Performance is evaluated on 108 two speaker conversations taken from LDC CALLHOME American English Speech corpus using NIST criterion and shows an improvement of 20%-30% in Cluster Error Rate (CER) relative to the randomly initialized clustering system. The number of iterations was reduced significantly, which contributes to both speed and efficiency of the clustering system.
| Original language | English |
|---|---|
| Article number | 4747495 |
| Pages (from-to) | 305-308 |
| Number of pages | 4 |
| Journal | Proceedings Elmar - International Symposium Electronics in Marine |
| Volume | 1 |
| State | Published - 1 Dec 2008 |
| Event | ELMAR-2008 - 50th International Symposium ELMAR 2008 - Zadar, Croatia Duration: 10 Sep 2008 → 12 Sep 2008 |
Keywords
- Clustering
- Initial Conditions
- K-means
- SOM
- Speech
ASJC Scopus subject areas
- Electrical and Electronic Engineering