Discover and explore top open-source AI tools and projects—updated daily.
Neuro-symbolic system for compositional visual reasoning using natural language
Top 46.5% on SourcePulse
This repository provides the official code for VisProg, a neuro-symbolic system designed for compositional visual reasoning based on natural language instructions. It targets researchers and developers working on complex visual question answering and image manipulation tasks, offering an interpretable and extensible framework.
How It Works
VisProg leverages GPT-3's in-context learning to generate Python programs that execute off-the-shelf computer vision models and image processing routines. This approach allows for compositional reasoning without requiring task-specific training, generating both solutions and interpretable execution rationales. The system is modular, enabling easy extension with new functionalities and tasks.
Quick Start & Requirements
conda env create -f environment.yaml
and activate with conda activate visprog
.notebooks/ok_det.ipynb
, notebooks/image_editing.ipynb
, notebooks/nlvr.ipynb
, notebooks/gqa.ipynb
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 year ago
Inactive