A new approach for initial assignment of data in a speaker clustering application is presented. This approach employs Segmental K-Means clustering algorithm prior to competitive based learning. The clustering system relies on Self-Organizing Maps (SOM) for speaker modeling and as a likelihood estimator. Performance is evaluated on 108 two speaker conversations taken from LDC CALLHOME American English Speech corpus using NIST criterion and shows an improvement of 20%-30% in Cluster Error Rate (CER) relative to the randomly initialized clustering system. The number of iterations was reduced significantly, which contributes to both speed and efficiency of the clustering system.
|Number of pages||4|
|Journal||Proceedings Elmar - International Symposium Electronics in Marine|
|State||Published - 1 Dec 2008|
|Event||ELMAR-2008 - 50th International Symposium ELMAR 2008 - Zadar, Croatia|
Duration: 10 Sep 2008 → 12 Sep 2008
- Initial Conditions