visionary  by Visionary-Laboratory

Web-native platform for world model rendering and generation

Created 1 month ago
386 stars

Top 74.2% on SourcePulse

GitHubView on GitHub
Project Summary

Visionary is an open, web-native platform designed for real-time rendering of diverse Gaussian Splatting variants and 3D meshes directly within a browser. It addresses the challenge of deploying and comparing advanced neural rendering techniques by providing a unified, efficient, and accessible environment. The platform targets researchers, engineers, and power users working with world models and 3D graphics, offering a lightweight, "click-to-run" experience that significantly lowers the barrier to reproduction and deployment.

How It Works

Visionary employs a hybrid rendering architecture leveraging WebGPU for high-performance parallel sorting and rendering of millions of Gaussian particles. Core to its design is per-frame ONNX inference, enabling dynamic neural processing and the integration of plug-and-play algorithms through a standardized Gaussian Generator contract. This approach supports not only standard 3DGS but also MLP-based 3DGS, 4DGS, neural avatars, and custom generative models. The platform automatically handles depth compositing between Gaussian point clouds and standard meshes, resolving occlusion issues for complex scene compositions.

Quick Start & Requirements

Highlighted Details

  • Native WebGPU rendering for efficient processing of millions of Gaussian particles.
  • Hybrid rendering architecture supporting seamless integration of Gaussian point clouds and standard meshes.
  • Universal asset loader supporting multiple formats including Static Gaussians (PLY, SPLAT, etc.), standard meshes (GLB, GLTF, etc.), and ONNX for 4DGS/Avatars/custom algorithms.

Maintenance & Community

The project is a collaborative effort involving Shanghai AI Laboratory, Sichuan University, The University of Tokyo, Shanghai Jiao Tong University, and Northwestern Polytechnical University. Specific community channels like Discord or Slack are not detailed in the README.

Licensing & Compatibility

The project is licensed under the Apache-2.0 license. This license is permissive and generally compatible with commercial use and closed-source linking.

Limitations & Caveats

Ubuntu is currently not supported due to a WebGPU bug affecting fp16 ONNX pipeline compatibility. macOS users may experience limited GPU performance on non-high-end chips (e.g., M4 Max or better), potentially leading to slow rendering or stuttering.

Health Check
Last Commit

2 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
4
Star History
105 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Luis Capelo Luis Capelo(Cofounder of Lightning AI), and
6 more.

threestudio by threestudio-project

0.1%
7k
Framework for 3D content generation from text/images using 2D diffusion
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.