pytorch-frame  by pyg-team

PyTorch extension for heterogeneous tabular deep learning

created 2 years ago
688 stars

Top 50.4% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

PyTorch Frame is a modular deep learning library for PyTorch, designed to simplify the creation and training of neural network models on heterogeneous tabular data. It caters to researchers and practitioners looking to leverage deep learning for tabular datasets, offering a flexible framework that integrates various column types and state-of-the-art architectures.

How It Works

The library employs a modular architecture consisting of FeatureEncoder, TableConv, and Decoder components. FeatureEncoder transforms raw tabular data into embeddings, TableConv models interactions between features, and Decoder produces the final output. This design allows for easy experimentation with different model architectures and facilitates integration with other PyTorch libraries, such as PyG for graph neural networks.

Quick Start & Requirements

Highlighted Details

  • Supports diverse column types: numerical, categorical, text, image, and embeddings.
  • Implements state-of-the-art deep tabular models (e.g., FTTransformer, TabNet) and integrates GBDTs (XGBoost, CatBoost, LightGBM).
  • Provides benchmark datasets and performance comparisons against GBDTs.
  • Facilitates integration with external embedding models (OpenAI, Cohere, Hugging Face) for text data.

Maintenance & Community

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Compatible with commercial and closed-source applications.

Limitations & Caveats

While deep tabular models show competitive performance, the benchmarks indicate they can be significantly slower to train than GBDTs. Some models may also have higher memory requirements, with "OOM" (Out Of Memory) noted for Trompt and FTTransformerBucket on certain datasets.

Health Check
Last commit

5 days ago

Responsiveness

Inactive

Pull Requests (30d)
5
Issues (30d)
1
Star History
37 stars in the last 90 days

Explore Similar Projects

Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Ross Taylor Ross Taylor(Cofounder of General Reasoning; Creator of Papers with Code), and
3 more.

pixel-cnn by openai

0.1%
2k
TensorFlow implementation for PixelCNN++ research paper
created 9 years ago
updated 5 years ago
Feedback? Help us improve.