Discover and explore top open-source AI tools and projects—updated daily.
unum-cloudMultimodal AI library for content understanding and generation
Top 32.8% on SourcePulse
UForm is a compact, multimodal AI library designed for efficient content understanding and generation across text, images, and video. It targets developers and researchers needing to deploy AI capabilities on diverse platforms, from servers to smartphones, offering significant speedups and reduced resource footprints compared to larger models.
How It Works
UForm leverages custom-trained, compact transformer models. Its embedding models utilize Matryoshka-style embeddings, allowing for flexible dimensionality (64-768) and efficient retrieval. Generative models are built on efficient architectures like Qwen and LLaMA, enabling tasks such as chat, image captioning, and Visual Question Answering (VQA). The library emphasizes portability via ONNX and native support for quantization (f32 to i8, b1) and embedding slicing, integrating tightly with SimSIMD for numerical operations and USearch for vector indexing.
Quick Start & Requirements
pip install uformtransformers and torch.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is marked with a ⚠️ emoji for the uform-gen model, suggesting potential instability or deprecation. Future support for video is indicated with 🔜 emojis. The license is not explicitly stated, which may pose a barrier for commercial adoption.
4 days ago
1 day
kohjingyu
kohjingyu
open-mmlab
OpenGVLab
huggingface