OpenFly-Platform  by SHAILAB-IPEC

Advancing aerial Vision-Language Navigation research

Created 1 year ago
268 stars

Top 95.9% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

OpenFly-Platform addresses the need for robust research tools in outdoor aerial Vision-Language Navigation (VLN). It provides an automated data collection toolchain, a large-scale benchmark dataset of 100,000 flight trajectories, and the OpenFly-Agent VLN model. This open-source platform aims to accelerate research by offering comprehensive resources for the aerial VLN community.

How It Works

The platform integrates multiple simulation environments (Unreal Engine, AirSim, 3DGS, GTAV) with a sophisticated toolchain for generating diverse datasets. It automates the collection of high-quality visual data and trajectory information, enabling the creation of large-scale benchmarks. The core innovation lies in its end-to-end approach, from simulation and data processing to training and evaluation of keyframe-aware VLN models like OpenFly-Agent, facilitating reproducible research.

Quick Start & Requirements

  • Installation: Requires cloning the repository, setting up a Python 3.10+ conda environment, installing dependencies (flash-attn, dlimp, packaging, ninja, xvfb, libgoogle-glog-dev, ROS2 Humble, etc.), and building with colcon.
  • Prerequisites: Linux (Ubuntu 22.04 recommended), CUDA, Python 3.10+, ROS2 Humble. GTAV simulation requires Windows.
  • Simulation Setup: Involves downloading pre-configured environments (UE, AirSim, 3DGS) or setting up custom ones, with specific instructions for each simulator.
  • Resources: GPU-intensive rendering, especially for UE.
  • Links: Repository: https://github.com/SHAILAB-IPEC/OpenFly-Platform.git.

Highlighted Details

  • Features a benchmark dataset comprising 100,000 diverse flight trajectories.
  • Utilizes cutting-edge rendering techniques for high-quality visual data generation.
  • Includes the OpenFly-Agent, a keyframe-aware VLN model, and provides pretrained weights.
  • Supports multiple simulation backends: Unreal Engine, AirSim, 3DGS, and GTAV.

Maintenance & Community

No specific details regarding maintainers, community channels (e.g., Discord, Slack), or project roadmap are provided in the README.

Licensing & Compatibility

The README includes a "License" section but does not specify the license type. This omission requires clarification for commercial use or integration into closed-source projects.

Limitations & Caveats

  • GTAV simulation and instruction generation functionalities are marked as "Coming soon".
  • Setup requires specific OS and ROS2 versions, and GTAV is Windows-only with strict graphics configuration.
  • Instruction generation necessitates an OpenAI API key.
  • UE rendering is GPU-intensive and may require configuration adjustments for higher quality.
Health Check
Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
11 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.