Identification of label dependencies for multi-label classification

Lena Tenenboim-Chekina, Lior Rokach, Bracha Shapira

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The main feature distinguishing multi-label classification from a regular classification task is that a number of labels have to be predicted simultaneously. Thus, it is obviously important to exploit potential dependencies between labels. However, surprisingly only a few of the existing algorithms address this problem directly by identifying dependent labels explicitly from the dataset. In this research we propose a new approach for identification and modeling existing dependencies between labels. We define and develop an algorithm that, first, identifies dependencies among the labels, then divides the whole set of labels into several mutually exclusive subsets, and finally performs multilabel classification incorporating the discovered dependencies. In this paper we utilize the ChiSquare test for independence to identify interdependent labels. We then apply a combination of the Binary Relevance and Label Power-set approaches for multi-label classification incorporating the discovered relations. An empirical evaluation on wide range of datasets shows that the proposed method achieves high and stable accuracy results and is competitive with recent state-of-the-art algorithms for multi-label classification
Original languageEnglish
Title of host publicationWorking Notes of the Second International Workshop on Learning from Multi-Label Data
Place of PublicationHaifa, Israel
Pages53-60
StatePublished - 2010

Fingerprint

Dive into the research topics of 'Identification of label dependencies for multi-label classification'. Together they form a unique fingerprint.

Cite this