Discover and explore top open-source AI tools and projects—updated daily.
CAD-MLLMUnifying multimodal inputs for CAD generation with MLLMs
Top 98.2% on SourcePulse
<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> CAD-MLLM addresses the challenge of unifying multimodality-conditioned Computer-Aided Design (CAD) generation by leveraging Multimodal Large Language Models (MLLMs). It targets researchers and engineers in the CAD and AI fields, providing a novel framework to generate complex CAD models from diverse inputs like text and images, aiming to streamline design processes.
How It Works
The project integrates MLLMs to enable conditional CAD generation, allowing for more intuitive and flexible design workflows. It builds upon the DeepCAD framework for robust data preprocessing, including conversion to STEP formats, point cloud sampling, and image rendering. This approach aims to unify various conditioning modalities for a more comprehensive CAD generation system.
Quick Start & Requirements
pythonocc-core=7.8.1. Setup involves initializing submodules, creating a Conda environment, installing dependencies from ./3rd_party/DeepCAD/requirements.txt, and installing pythonocc-core.CAD-MLLM-metrics. A project page is referenced for demonstrations.Highlighted Details
Maintenance & Community
The project is led by researchers from ShanghaiTech University, Transcengram, DeepSeek AI, and the University of Hong Kong. Acknowledgements are made to the DeepCAD project. Key components like inference and training code are still pending release according to the project's to-do list. No community channels (e.g., Discord, Slack) are explicitly listed.
Licensing & Compatibility
The provided README does not specify a software license. This absence creates ambiguity regarding usage rights, commercial application, and derivative works.
Limitations & Caveats
The inference and training code are not yet publicly available, limiting immediate practical application for model deployment or further development. The project appears to be in an active development phase, with core functionalities still to be released.
6 months ago
1 day
openai
black-forest-labs