This repository is a curated hub of experiments built on the OpenAI Vision API, aimed at developers and researchers interested in visual AI applications. It showcases use cases ranging from basic image classification to zero-shot learning, with the goal of fostering collaboration and exploration of the API's capabilities.
How It Works
The project demonstrates a range of applications of the OpenAI Vision API, from image classification and zero-shot learning to integrations with other foundation models such as GroundingDINO and Segment Anything (SAM) for object detection and segmentation. These pairings work around the Vision API's native limitations and yield more robust visual understanding.
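For a sense of how such a pairing works, the following is a minimal sketch (not the repository's actual code) in which GPT-4V proposes object labels and GroundingDINO grounds them as boxes. It assumes the official `groundingdino` package with its published SwinT config and weights, the `openai` v1 Python SDK with an `OPENAI_API_KEY` in the environment, and an illustrative `gpt-4-vision-preview` model name and prompt.

```python
# Sketch: GPT-4V names what is in the image; GroundingDINO localizes it.
import base64

from openai import OpenAI
from groundingdino.util.inference import load_model, load_image, predict


def gpt4v_labels(image_path: str) -> str:
    """Ask GPT-4V for a comma-separated list of objects in the image."""
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    with open(image_path, "rb") as f:
        b64 = base64.b64encode(f.read()).decode()
    response = client.chat.completions.create(
        model="gpt-4-vision-preview",
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "List the objects in this image as a comma-separated list."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content


# Paths point at GroundingDINO's published config and checkpoint.
model = load_model(
    "groundingdino/config/GroundingDINO_SwinT_OGC.py",
    "weights/groundingdino_swint_ogc.pth",
)
image_source, image = load_image("example.jpg")
caption = gpt4v_labels("example.jpg")  # e.g. "dog, frisbee, tree"
boxes, logits, phrases = predict(
    model, image, caption=caption, box_threshold=0.35, text_threshold=0.25
)
```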
Quick Start & Requirements
- Requires an OpenAI API key; a minimal usage sketch follows this list.
- Some experiments involve additional foundation models (e.g., GroundingDINO, SAM) that require separate setup.
- Refer to individual experiment directories for specific dependencies and setup instructions.
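As a baseline, a single Vision API request looks roughly like the sketch below, assuming the `openai` v1 Python SDK, an `OPENAI_API_KEY` in the environment, and a placeholder image URL and model name:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
response = client.chat.completions.create(
    model="gpt-4-vision-preview",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What is in this image?"},
            {"type": "image_url",
             "image_url": {"url": "https://example.com/cat.jpg"}},
        ],
    }],
)
print(response.choices[0].message.content)
```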
Highlighted Details
- Showcases zero-shot object detection by combining GPT-4V with GroundingDINO.
- Includes experiments comparing GPT-4V with CLIP on classification tasks (a CLIP sketch follows this list).
- Features a "screenshot-to-code" experiment.
- Provides links to relevant research papers and blog posts detailing methodologies and findings.
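To illustrate the GPT-4V vs. CLIP comparison, the CLIP side of such an experiment can be sketched with the Hugging Face `transformers` library; the checkpoint, label set, and image below are placeholders rather than what the experiments necessarily use:

```python
# Zero-shot classification with CLIP: score an image against text labels.
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]
image = Image.open("example.jpg")

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)
probs = outputs.logits_per_image.softmax(dim=1)  # one probability per label
print(dict(zip(labels, probs[0].tolist())))
```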
Maintenance & Community
- Contributions are welcomed via issues and pull requests, with a contribution guide available.
- Key contributors include @SkalskiP, @capjamesg, and members of the Roboflow team.
Licensing & Compatibility
- The repository itself appears to be under a permissive license, but the use of the OpenAI Vision API is subject to OpenAI's terms of service and API usage policies.
Limitations & Caveats
- The OpenAI Vision API enforces a daily request limit per API key; a retry sketch follows this list.
- The API's native object detection and image segmentation capabilities are limited, which is why the experiments pair it with models like GroundingDINO and SAM.
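One common way to cope with those limits is to wrap calls in exponential backoff. A minimal sketch, assuming the `openai` v1 SDK and illustrative retry parameters:

```python
# Retry a Vision API call with exponential backoff on rate-limit errors.
import time

import openai
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def ask_with_backoff(messages, retries: int = 5):
    delay = 1.0
    for attempt in range(retries):
        try:
            return client.chat.completions.create(
                model="gpt-4-vision-preview", messages=messages
            )
        except openai.RateLimitError:
            if attempt == retries - 1:
                raise  # out of retries; surface the error
            time.sleep(delay)
            delay *= 2  # double the wait between attempts
```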