Abstract
Convolutional neural networks (CNNs) have been widely used in image classification tasks and have achieved remarkable results compared with traditional methods. Their main advantage is the ability to extract hidden features automatically using local connectivity and spatial locality. However, CNN cannot be applied to tabular data, mainly due to the unsuitability of the tabular data structure to the CNN input. In this paper, we propose a new generic method for the representation of multidimensional tabular data as color-encoded images that can be used both for data visualization and classification with CNN. Our approach, named FC-Viz (Feature Clustering-Visualization), is based on user-oriented data visualization ideas, such as pixel-oriented techniques, feature clustering, and feature interactions. The proposed approach includes a transformation of each instance of the tabular data into a 2D pixel-based representation, where pixels representing features with strong correlation and interaction are adjacent to each other. We applied FC-Viz to ten multidimensional tabular datasets with dozens to thousands of features and compared its classification and visualization performance with a state-of-the-art tabular data transformation method. The evaluation experiments show that our approach is as accurate as the state-of-the-art, but with much smaller images resulting in much more compact and faster CNN models.
Original language | English |
---|---|
Pages (from-to) | 1023-1043 |
Number of pages | 21 |
Journal | Neural Computing and Applications |
Volume | 36 |
Issue number | 2 |
DOIs | |
State | Published - 1 Jan 2024 |
Keywords
- Convolutional neural networks
- Data transformation
- Data visualization
- Feature clustering
- Feature interaction
- Tabular data representation
ASJC Scopus subject areas
- Software
- Artificial Intelligence