Research paper exploring GPT-4V as autonomous driving agent
Top 90.1% on sourcepulse
This repository explores the application of GPT-4V as an autonomous driving agent, providing a dataset of test images and detailed prompt/response logs for researchers and developers in the autonomous driving and AI fields. It aims to showcase GPT-4V's capabilities in understanding driving scenarios, reasoning, and decision-making.
How It Works
The project leverages GPT-4V's visual-language understanding to interpret driving scenes. It organizes test cases into categories like "Scenario Understanding," "Reasoning," and "Serving as a Driving Agent." Each case includes a .png
image of a driving scenario and a .json
file containing the specific prompts given to GPT-4V and its corresponding textual responses, demonstrating its decision-making process.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
This repository primarily showcases results and data; it does not provide a runnable autonomous driving agent or code for direct integration. Users must have API access to GPT-4V and implement their own inference pipelines.
1 year ago
Inactive