PyTorch implementation of TabNet for tabular data
Top 17.3% on sourcepulse
This repository provides a PyTorch implementation of the TabNet architecture, designed for interpretable and attentive tabular learning. It addresses classification and regression tasks, including multi-task scenarios, and is suitable for researchers and practitioners seeking high-performance, transparent models for tabular data.
How It Works
TabNet employs a sequential attention mechanism to learn feature importance at each decision step, allowing for efficient processing of high-dimensional tabular data. It utilizes gated linear units (GLUs) and feature re-usage through a gamma parameter to control sparsity and model capacity. The implementation also supports embedding-aware attention and grouped features for improved handling of categorical and correlated inputs.
Quick Start & Requirements
pip install pytorch-tabnet
or conda install -c conda-forge pytorch-tabnet
make start-gpu
(or make start
for CPU) followed by poetry install
and make notebook
.Highlighted Details
Maintenance & Community
The project is actively maintained by dreamquark-ai. Community interaction is encouraged via Slack.
Licensing & Compatibility
The repository is licensed under the Apache 2.0 license, permitting commercial use and integration with closed-source projects.
Limitations & Caveats
The README notes that some implementation choices may differ from the original TabNet paper. Reconstruction during pre-training can be challenging with Batch Normalization and large batch sizes.
9 months ago
Inactive