org-ai by rksm

Emacs minor mode for generative AI in org-mode

Created 3 years ago

809 stars

Top 43.6% on SourcePulse

Project Summary

This Emacs package integrates generative AI models into the Org mode workflow, enabling users to leverage LLMs for text generation and diffusion models for image creation directly within their documents. It targets Emacs users who want to enhance their productivity with AI-powered content creation, summarization, and code refactoring.

How It Works

The package utilizes special #+begin_ai...#+end_ai blocks within Org mode to define AI tasks. Users can specify models (OpenAI, Azure, Anthropic, Perplexity, Stable Diffusion, local LLMs via oobabooga) and parameters like system prompts, temperature, and image dimensions. It supports text generation, image creation/variation, and speech input/output via Whisper.

Quick Start & Requirements

Installation: MELPA, Straight.el, or manual checkout.
Prerequisites: Emacs, OpenAI API key (or other service credentials), optional: Whisper.el, ffmpeg, Stable Diffusion WebUI, oobabooga/text-generation-webui for local models.
Setup: Basic OpenAI integration is quick. Setting up local models or speech requires additional installations and configuration.
Docs: https://github.com/rksm/org-ai

Highlighted Details

Seamless integration with Org mode for text and image generation.
Support for multiple AI providers including OpenAI, Azure, Anthropic, Perplexity, Stable Diffusion, and local LLMs.
Speech input/output capabilities using Whisper.
Global commands for operating on regions, files, and projects outside of Org mode buffers.
Noweb support for dynamic content generation and code evaluation within prompts.

Maintenance & Community

Actively maintained by rksm.
Community support via GitHub issues. Sponsorships are encouraged.

Licensing & Compatibility

MIT License.
Compatible with commercial and closed-source projects.

Limitations & Caveats

Image variation currently requires curl to be installed.
Perplexity.ai API integration does not currently provide references/links.
macOS speech setup involves specific system permissions and microphone configuration.

Health Check

Last Commit

1 month ago

Responsiveness

1 week

Pull Requests (30d)

0

Issues (30d)

0

Star History

6 stars in the last 30 days

Explore Similar Projects

ata by transformrs

CLI tool for multimodal AI in the terminal

Created 3 years ago

Updated 11 months ago

Mini-DALLE3 by Zeqiang-Lai

Text-to-image research paper using LLMs for interactive prompting

Created 2 years ago

Updated 2 years ago

ChatFred by chrislemke

Alfred workflow for AI interactions

Created 3 years ago

Updated 1 year ago

obsidian-ai-assistant by qgrail

Obsidian plugin for AI model interaction within notes

Created 3 years ago

Updated 9 months ago

awesome-prompts by songtianlun

Prompt library for multimodal AI generation

Created 5 months ago

Updated 2 months ago

payload-ai by ashbuilds

AI plugin for Payload CMS, enhancing content creation

Created 1 year ago

Updated 1 day ago

ComfyUI_VLM_nodes by gokayfem

ComfyUI nodes for multimodal generation and prompt engineering

Created 2 years ago

Updated 1 month ago

generative-ai-go by google

Go SDK for generative AI models

Created 2 years ago

Updated 6 months ago

gpt2bot by polakowo

Telegram chatbot using transformers for multi-turn dialogue

Created 6 years ago

Updated 2 years ago

AI-YinMei by worm128

AI-powered virtual streamer/Vtuber project

Created 2 years ago

Updated 3 weeks ago

Starred by

Simon Horup Eskildsen

Simon Horup Eskildsen(Cofounder of Turbopuffer).

gp.nvim by Robitx

Neovim plugin for AI-assisted text/code operations

Created 2 years ago

Updated 6 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI), and

1 more.

InternGPT by OpenGVLab

Interactive demo platform for showcasing AI models

Created 2 years ago

Updated 1 year ago

Feedback? Help us improve.