Owl  by OwlAIProject

Wearable AI captures life experiences, running locally

created 1 year ago
603 stars

Top 55.0% on sourcepulse

GitHubView on GitHub
Project Summary

Owl is a personal, always-on wearable AI system designed for continuous life logging and proactive assistance. It targets individuals interested in memory augmentation, productivity enhancement, and exploring novel human-computer interactions through multimodal data capture and local AI inference. The project aims to provide a transparent and open platform for developing and deploying such systems.

How It Works

Owl operates through a distributed architecture comprising wearable capture devices, an AI server, and presentation clients. Wearable devices (e.g., ESP32, Sony Spresense, Apple Watch) capture audio and location data, with plans for image and video. This data is streamed or chunked to a central AI server. The server processes the data using flexible inference options, supporting local models via Ollama or commercial APIs like GPT-4 and Whisper. It employs VAD-based endpointing for conversation segmentation and utilizes background processing queues for transcription, summarization, and data storage.

Quick Start & Requirements

  • Server Setup: Instructions provided for macOS, Linux, Windows, and Docker.
  • Capture Devices: Supports custom ESP platforms, Sony Spresense, and Apple Watch. Reference hardware "Bee" is available for community testing.
  • Clients: Native iOS and web interfaces are available; Android support is planned.
  • Dependencies: Python, FastAPI, Ollama (for local models), or API keys for commercial services. Specific hardware may require custom firmware.
  • Resources: Server hosting and potential GPU for local inference.
  • Documentation: Setup Guide, Technical Guide

Highlighted Details

  • Multimodal Capture: Supports audio and location, with image and video planned.
  • Flexible Inference: Integrates with Ollama for local LLMs/VLMs and commercial APIs (GPT-4, Deepgram).
  • Wearable Support: Broad device compatibility, including custom hardware like the "Bee" device with 50-hour battery life.
  • Data Flow Transparency: Detailed "Tour de Source" explains data path from capture to processed output.

Maintenance & Community

  • Key Contributors: Ethan Sutin, Bart Trzynadlowski.
  • Community: Primarily via Discord.
  • Roadmap: Not explicitly detailed, but feature development is ongoing.

Licensing & Compatibility

  • License: Not explicitly stated in the README. Compatibility for commercial use or closed-source linking is undetermined.

Limitations & Caveats

  • The project is experimental, with ongoing development for vision/video capture and Android support.
  • Conversation detection relies on VAD, which is noted as a "naive and unreliable heuristic."
  • Security recommendations highlight potential data exposure if not hosted securely (e.g., over HTTP).
Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
0
Star History
22 stars in the last 90 days

Explore Similar Projects

Starred by Anton Bukov Anton Bukov(Cofounder of 1inch Network), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
9 more.

exo by exo-explore

0.3%
29k
AI cluster for running models on diverse devices
created 1 year ago
updated 4 months ago
Starred by Addy Osmani Addy Osmani(Engineering Leader on Google Chrome), Victor Taelin Victor Taelin(Author of Bend, Kind, HVM), and
1 more.

chatbox by chatboxai

0.3%
36k
Desktop client app for AI models/LLMs
created 2 years ago
updated 6 days ago
Feedback? Help us improve.