visionscript by capjamesg

High-level language for computer vision tasks

created 2 years ago

344 stars

Top 81.6% on sourcepulse

Project Summary

VisionScript is a high-level, interpreted programming language designed for simplifying common computer vision tasks like object detection, classification, and segmentation. It targets users new to computer vision concepts or those needing to perform one-off analyses quickly, offering a concise syntax inspired by Python and Wolfram Language.

How It Works

VisionScript leverages lexical inference, allowing operations to implicitly use the output of the previous command without explicit variable assignments. This reduces boilerplate code for sequential vision tasks. The language is built in Python and acts as an abstraction layer over various underlying computer vision models and libraries, including CLIP, YOLOv8, FastSAM, GroundedSAM, BLIP, and ViT.

Quick Start & Requirements

Install: pip install visionscript
Run REPL: visionscript
Run file: visionscript ./your_file.vic
Run notebook: visionscript --notebook
Requires Python.

Highlighted Details

Abstract syntax for common CV tasks (detection, classification, segmentation).
Lexical inference for implicit data flow between commands.
Supports multiple underlying models (CLIP, YOLOv8, FastSAM, etc.).
Offers both a REPL and a web-based notebook interface.

Maintenance & Community

The project is maintained by capjamesg. Links to documentation and a demo are provided in the README.

Licensing & Compatibility

Licensed under the MIT license, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

VisionScript is explicitly stated as not intended to be a full-fledged programming language for all vision tasks, focusing instead on common, one-off operations. The notebook interface is described as ephemeral, requiring code to be saved to files.

visionscript by capjamesg

Explore Similar Projects

lens by ContextualAI

overeasy by overeasy-sh

METER by zdou0830

GPT-4-Unlimited-Tools by d3n7

open-codex by codingmoh

Olympus by yuanze-lin

dotai by udecode

comfyui_segment_anything by storyicon

VL-BERT by jackroos

viper by cvlab-columbia

Ollama-OCR by imanoop7

open_flamingo by mlfoundations