Discover and explore top open-source AI tools and projects—updated daily.
jmwang0117Human-like end-to-end driving system
Top 99.3% on SourcePulse
HE-Drive is an end-to-end autonomous driving system designed to generate human-like, temporally consistent, and comfortable driving trajectories. It targets researchers and developers in the autonomous driving field, offering a novel approach that significantly reduces collision rates and improves computational speed while prioritizing passenger comfort.
How It Works
HE-Drive employs a multi-stage approach: sparse perception extracts key 3D spatial representations, a DDPM-based motion planner generates diverse, multi-modal trajectories, and a Vision Language Model (VLM)-guided scorer selects the most comfortable option. This integration of VLMs for assessing driving style and comfort is a novel aspect, aiming to mimic human driving nuances.
Quick Start & Requirements
hedrive), installing specific PyTorch (1.13.0+cu116) and torchvision versions, and running pip3 install -r requirement.txt. CUDA operations (deformable_aggregation) must be compiled.llama3.2-vision-11b model.Highlighted Details
Maintenance & Community
The project README does not provide details on community channels (e.g., Discord, Slack), active maintainers, or a roadmap. It encourages users to star the repository.
Licensing & Compatibility
The README does not specify a software license. This omission makes it impossible to determine compatibility for commercial use or closed-source integration without further clarification.
Limitations & Caveats
The primary adoption blocker is the significant VRAM requirement (8GB) for the Llama 3.2-Vision 11B model, which is integral to the system's VLM-guided scoring. Installation involves compiling custom CUDA operations, which can be complex and platform-dependent. The system relies on specific external datasets (NuScenes) and checkpoints (SparseDrive).
6 months ago
Inactive
OpenDriveLab