Discover and explore top open-source AI tools and projects—updated daily.
BR-IDLPaddleViT: Vision models for PaddlePaddle
Top 31.9% on SourcePulse
PaddleViT is a comprehensive library for state-of-the-art Vision Transformer (ViT) and MLP models, designed for the PaddlePaddle deep learning framework. It provides implementations, pre-trained weights, and training/validation scripts for various computer vision tasks including image classification, object detection, semantic segmentation, and GANs, targeting researchers and practitioners looking to leverage cutting-edge CV techniques.
How It Works
PaddleViT offers a modular design, with each model architecture implemented in a standalone Python module. This allows for easy modification and experimentation. The library integrates popular layers, utilities, optimizers, schedulers, and data augmentations, enabling users to reproduce state-of-the-art results and fine-tune models on custom datasets. It supports distributed data-parallel training (DDP) and mixed-precision training (AMP) for enhanced performance.
Quick Start & Requirements
pip. A conda environment is recommended.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 years ago
Inactive
cmhungsteve
google-research
timzhang642
huggingface