OmniDrive by NVlabs

Drive LLM-Agent framework for end-to-end autonomous driving

created 1 year ago
455 stars

Top 67.4% on sourcepulse

Project Summary

OmniDrive is a framework for end-to-end autonomous driving built around a 3D multimodal Large Language Model (LLM) agent. It targets autonomous driving researchers and developers and covers perception, reasoning, and planning, with a focus on interactive conversation and counterfactual analysis of driving scenarios.

How It Works

The core of the framework is the OmniDrive-Agent, a 3D multimodal LLM that uses sparse queries to lift visual representations into 3D space and compress them. The LLM then reasons and plans over this 3D scene information, which supports tasks such as scene description, traffic regulation adherence, 3D object grounding, and counterfactual reasoning about driving decisions.
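To make the sparse-query idea concrete, here is a minimal sketch of how a small set of queries can compress many image tokens through cross-attention. It is not the project's implementation: the function names, the dimensions, and the random stand-in features are all assumptions made for illustration, and the sketch leaves out the 3D positional information and learned projections a real model would use.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, tokens):
    """Let each query attend over all tokens and pool them into one vector.

    queries: Q vectors of length D (hypothetical sparse queries)
    tokens:  N vectors of length D (hypothetical flattened image features)
    returns: Q vectors of length D
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    compressed = []
    for q in queries:
        # Scaled dot-product score between this query and every token.
        scores = [scale * sum(qi * ti for qi, ti in zip(q, t)) for t in tokens]
        weights = softmax(scores)
        # Weighted average of the tokens, one value per feature dimension.
        pooled = [sum(w * t[d] for w, t in zip(weights, tokens)) for d in range(dim)]
        compressed.append(pooled)
    return compressed


# Illustrative sizes only; random values stand in for real features.
rng = random.Random(0)
dim, num_tokens, num_queries = 8, 600, 16
tokens = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_tokens)]
queries = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_queries)]

scene_tokens = cross_attend(queries, tokens)
print(len(tokens), "->", len(scene_tokens))  # prints: 600 -> 16
```

The output length depends only on the number of queries, so a downstream LLM would see a fixed, small number of scene tokens regardless of how many image features came in.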

Quick Start & Requirements

  • Installation: Follow the Environment Setup guide.
  • Prerequisites: TensorRT support is available. Hardware requirements are not documented, though the 3D perception and LLM inference workloads suggest a capable GPU is needed.
  • Resources: Technical report (arXiv), dataset, and model (Hugging Face).

Highlighted Details

  • End-to-end autonomous driving framework built around an LLM agent.
  • Novel 3D multimodal LLM design using sparse queries for 3D representation.
  • OmniDrive-nuScenes benchmark for VQA tasks in reasoning and planning.
  • Supports joint end-to-end planning, interactive conversation, and counterfactual reasoning.

Maintenance & Community

The project is associated with NVlabs and has been accepted to CVPR 2025. TensorRT support was recently added with assistance from the NVIDIA TSE Team.

Licensing & Compatibility

The license is not explicitly stated in the provided README snippet.

Limitations & Caveats

The README does not detail specific limitations, unsupported platforms, or known bugs. The project appears to be in active development, with recent additions like TensorRT support.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1 day
  • Pull requests (30d): 0
  • Issues (30d): 5
  • Star history: 98 stars in the last 90 days
