scan-for-webcams by JettChenT

CLI tool for scanning webcams on the internet

Created 5 years ago

268 stars

Top 95.9% on SourcePulse

Project Summary

This project provides a tool for scanning the internet for publicly accessible webcams, targeting security researchers and hobbyists interested in network reconnaissance. It simplifies the discovery and viewing of live streams from various webcam types, offering both command-line and experimental GUI interfaces.

How It Works

The tool leverages Shodan for initial discovery of internet-connected devices, specifically targeting webcam protocols like MJPG, webcamXP, yawCam, hipcam, and RTSP. It then enumerates and attempts to display streams, with an option to use a Places365 model for on-device location classification of footage, enhancing the descriptive capabilities beyond simple tags. Experimental support for Vision-Language Models (VLMs) like LLaVA is also included for natural language descriptions.

Quick Start & Requirements

Install via pip install -r requirements.txt after cloning the repository.
Requires API keys for Shodan, Clarifai (or local Places365 model), and geo.ipify.org.
Setup involves running python sfw setup to input API keys.
Experimental VLM support requires llama-cpp-python and huggingface_hub.
Official Docs: https://github.com/JettChenT/scan-for-webcams

Highlighted Details

Supports multiple webcam stream types (MJPG, webcamXP, yawCam, hipcam, RTSP).
Optional GUI for visual display of scanned streams.
Experimental on-device footage classification using Places365.
Experimental Vision-Language Model integration for natural language descriptions.

Maintenance & Community

The project maintains a Discord channel for community interaction.
The README notes that the PyPI package is deprecated in favor of direct repository installation for better developer experience.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README.

Limitations & Caveats

The Vision-Language Model integration is experimental and requires disabling parallel processing.
RTSP stream enumeration is noted as potentially dangerous.
The project relies on external API services (Shodan, Clarifai, geo.ipify.org) which may have usage limits or costs.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

3 stars in the last 30 days

Explore Similar Projects

ComfyUI_toyxyz_test_nodes by toyxyz

ComfyUI custom nodes for real-time webcam/screen input

Created 2 years ago

Updated 2 months ago

ai-baby-monitor by zeenolife

Local AI baby monitor

Created 10 months ago

Updated 7 months ago

webcamGPT by roboflow

CLI tool for chatting with a webcam video stream

Created 2 years ago

Updated 1 year ago

WebcamGPT-Vision by bdekraker

Webcam app for GPT-4 Vision API processing

Created 2 years ago

Updated 2 years ago

Starred by

Ying Sheng

Ying Sheng(Coauthor of SGLang).

MiniGPT4-video by Vision-CAIR

Video-language model for short and long video understanding

Created 1 year ago

Updated 1 year ago

Starred by

Jason Huggins

Jason Huggins(Creator of Selenium) and

Jonathan Ragan-Kelley

Jonathan Ragan-Kelley(Professor at MIT).

my-yt by christian-fei

Minimal YouTube frontend for local, mindful use

Created 11 months ago

Updated 6 days ago

describe-anything by NVlabs

Image/video captioning model for detailed localized descriptions

Created 9 months ago

Updated 6 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Ettore Di Giacinto

Ettore Di Giacinto(Author of LocalAI).

ha-llmvision by valentinfrlch

Home Assistant integration for multimodal LLM vision

Created 1 year ago

Updated 1 month ago

awesome-video by krzemienski

Curated list of video tools, frameworks, libraries, and learning resources

Created 7 years ago

Updated 9 months ago

bilive by timerring

Live recording/uploading tool for Bilibili, with MLLM integration

Created 1 year ago

Updated 4 months ago

VideoPipe by sherlockchou86

Cross-platform C++ framework for video analysis and structuring

Created 3 years ago

Updated 2 months ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind), and

7 more.

sam2 by facebookresearch

Foundation model for promptable visual segmentation in images and videos

Created 1 year ago

Updated 1 year ago

Feedback? Help us improve.