visionscript  by capjamesg

High-level language for computer vision tasks

created 2 years ago
344 stars

Top 81.6% on sourcepulse

GitHubView on GitHub
Project Summary

VisionScript is a high-level, interpreted programming language designed for simplifying common computer vision tasks like object detection, classification, and segmentation. It targets users new to computer vision concepts or those needing to perform one-off analyses quickly, offering a concise syntax inspired by Python and Wolfram Language.

How It Works

VisionScript leverages lexical inference, allowing operations to implicitly use the output of the previous command without explicit variable assignments. This reduces boilerplate code for sequential vision tasks. The language is built in Python and acts as an abstraction layer over various underlying computer vision models and libraries, including CLIP, YOLOv8, FastSAM, GroundedSAM, BLIP, and ViT.

Quick Start & Requirements

  • Install: pip install visionscript
  • Run REPL: visionscript
  • Run file: visionscript ./your_file.vic
  • Run notebook: visionscript --notebook
  • Requires Python.

Highlighted Details

  • Abstract syntax for common CV tasks (detection, classification, segmentation).
  • Lexical inference for implicit data flow between commands.
  • Supports multiple underlying models (CLIP, YOLOv8, FastSAM, etc.).
  • Offers both a REPL and a web-based notebook interface.

Maintenance & Community

The project is maintained by capjamesg. Links to documentation and a demo are provided in the README.

Licensing & Compatibility

Licensed under the MIT license, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

VisionScript is explicitly stated as not intended to be a full-fledged programming language for all vision tasks, focusing instead on common, one-off operations. The notebook interface is described as ephemeral, requiring code to be saved to files.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 90 days

Explore Similar Projects

Starred by Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), Douwe Kiela Douwe Kiela(Cofounder of Contextual AI), and
1 more.

lens by ContextualAI

0%
352
Vision-language research paper using LLMs
created 2 years ago
updated 1 week ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
4 more.

open_flamingo by mlfoundations

0.1%
4k
Open-source framework for training large multimodal models
created 2 years ago
updated 11 months ago
Feedback? Help us improve.