This repository contains the resources and code for the study: "Improving the Use of Deep Neural Networks with Tabular Data by Exploiting Synthetic Images." It benchmarks eight tabular-to-image transformation techniques and evaluates their effectiveness in combination with hybrid architectures (CNN+MLP and ViT+MLP) across diverse datasets and machine learning tasks.
- Comprehensive Benchmark: Evaluation of eight transformation techniques for tabular data:
- TINTO
- REFINED
- IGTD
- FeatureWrap
- SuperTML
- BarGraph
- DistanceMatrix
- Combination
- Hybrid Architectures: Analysis of CNN+MLP and ViT+MLP combinations.
- Diverse Datasets: Includes regression, binary classification, and multiclass classification tasks.
- Metrics: Performance evaluation using RMSE, Accuracy, Precision, Recall, and F1-score.
TODO:
The experiments span a variety of datasets, including:
- Regression: Boston Housing, California Housing, MIMO
- Binary Classification: Dengue/Chikungunya, HELOC
- Multiclass Classification: Covertype, GAS
Put dataset links...
Clone the repository and install the dependencies:
git clone https://github.com/manwestc/TINTOlib-benchmark