pytorch-frame  by pyg-team

PyTorch extension for heterogeneous tabular deep learning

Created 2 years ago
705 stars

Top 48.4% on SourcePulse

GitHubView on GitHub
Project Summary

PyTorch Frame is a modular deep learning library for PyTorch, designed to simplify the creation and training of neural network models on heterogeneous tabular data. It caters to researchers and practitioners looking to leverage deep learning for tabular datasets, offering a flexible framework that integrates various column types and state-of-the-art architectures.

How It Works

The library employs a modular architecture consisting of FeatureEncoder, TableConv, and Decoder components. FeatureEncoder transforms raw tabular data into embeddings, TableConv models interactions between features, and Decoder produces the final output. This design allows for easy experimentation with different model architectures and facilitates integration with other PyTorch libraries, such as PyG for graph neural networks.

Quick Start & Requirements

Highlighted Details

  • Supports diverse column types: numerical, categorical, text, image, and embeddings.
  • Implements state-of-the-art deep tabular models (e.g., FTTransformer, TabNet) and integrates GBDTs (XGBoost, CatBoost, LightGBM).
  • Provides benchmark datasets and performance comparisons against GBDTs.
  • Facilitates integration with external embedding models (OpenAI, Cohere, Hugging Face) for text data.

Maintenance & Community

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Compatible with commercial and closed-source applications.

Limitations & Caveats

While deep tabular models show competitive performance, the benchmarks indicate they can be significantly slower to train than GBDTs. Some models may also have higher memory requirements, with "OOM" (Out Of Memory) noted for Trompt and FTTransformerBucket on certain datasets.

Health Check
Last Commit

3 days ago

Responsiveness

1 day

Pull Requests (30d)
9
Issues (30d)
1
Star History
10 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
1 more.

tapas by google-research

0.1%
1k
Table QA models for end-to-end neural table-text understanding
Created 5 years ago
Updated 1 year ago
Feedback? Help us improve.