Towards efficient image-based representation of tabular data

Amit Damri, Mark Last, Niv Cohen

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Convolutional neural networks (CNNs) have been widely used in image classification tasks and have achieved remarkable results compared with traditional methods. Their main advantage is the ability to extract hidden features automatically using local connectivity and spatial locality. However, CNN cannot be applied to tabular data, mainly due to the unsuitability of the tabular data structure to the CNN input. In this paper, we propose a new generic method for the representation of multidimensional tabular data as color-encoded images that can be used both for data visualization and classification with CNN. Our approach, named FC-Viz (Feature Clustering-Visualization), is based on user-oriented data visualization ideas, such as pixel-oriented techniques, feature clustering, and feature interactions. The proposed approach includes a transformation of each instance of the tabular data into a 2D pixel-based representation, where pixels representing features with strong correlation and interaction are adjacent to each other. We applied FC-Viz to ten multidimensional tabular datasets with dozens to thousands of features and compared its classification and visualization performance with a state-of-the-art tabular data transformation method. The evaluation experiments show that our approach is as accurate as the state-of-the-art, but with much smaller images resulting in much more compact and faster CNN models.

Original languageEnglish
Pages (from-to)1023-1043
Number of pages21
JournalNeural Computing and Applications
Volume36
Issue number2
DOIs
StatePublished - 1 Jan 2024

Keywords

  • Convolutional neural networks
  • Data transformation
  • Data visualization
  • Feature clustering
  • Feature interaction
  • Tabular data representation

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Towards efficient image-based representation of tabular data'. Together they form a unique fingerprint.

Cite this