LLM_AppDev-HandsOn by sroecker

Workshop for local LLM application development

created 1 year ago
401 stars

Top 73.3% on sourcepulse

Project Summary

This repository provides a hands-on workshop and example code for developing applications with local Large Language Models (LLMs). It targets developers and researchers interested in building Retrieval Augmented Generation (RAG) chatbots that can query custom documents, with a focus on open-source tools and local deployment. The primary benefit is enabling users to create private, document-aware AI assistants without relying on external cloud services.

How It Works

The application utilizes Streamlit for the user interface, LlamaIndex for document indexing and retrieval, and Ollama for serving local LLMs. This stack allows for RAG by indexing documents into a vector store and then retrieving relevant chunks to augment LLM prompts. The approach emphasizes using open-source components and local LLMs, making it accessible for users without powerful GPUs or those prioritizing data privacy.
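
A minimal sketch of this pipeline, for orientation only, is shown below. It assumes a recent llama-index release with the llama-index-llms-ollama and llama-index-embeddings-huggingface packages installed, a ./docs folder of sample documents, and an illustrative embedding model name; it is not the workshop's app.py.

    # Sketch: index local documents and answer questions with a local Ollama model.
    from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
    from llama_index.embeddings.huggingface import HuggingFaceEmbedding
    from llama_index.llms.ollama import Ollama

    # Generation goes through the local Ollama service; embeddings run in-process.
    Settings.llm = Ollama(model="zephyr", request_timeout=120.0)
    Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")

    # Load and chunk the documents, then embed them into an in-memory vector store.
    documents = SimpleDirectoryReader("docs").load_data()
    index = VectorStoreIndex.from_documents(documents)

    # Retrieve the most relevant chunks and let the LLM answer with them as context.
    query_engine = index.as_query_engine()
    print(query_engine.query("What do these documents say about deployment?"))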

Quick Start & Requirements

  • Local Setup: a Mac M1 with 16 GB+ RAM is recommended. Install Ollama from ollama.ai.
  • Installation:
    python -m venv venv
    source venv/bin/activate
    pip install -r requirements.txt
    streamlit run app.py
    
  • Prerequisites: the Ollama service must be running and the Zephyr model pulled (ollama pull zephyr); a quick check is shown after this list.
  • Configuration: set the OLLAMA_HOST environment variable if Ollama is not served on the default host and port.
  • Resources: Local LLM inference can be resource-intensive.
  • Docs: Streamlit App, Ollama API
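
To confirm the service is reachable before launching the app, a quick request against Ollama's REST API (default port 11434) works; the prompt below is only illustrative, and the host should match OLLAMA_HOST if it is set:

    # Verify that Ollama responds and the zephyr model is available.
    curl http://localhost:11434/api/generate \
      -d '{"model": "zephyr", "prompt": "Say hello", "stream": false}'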

Highlighted Details

  • Demonstrates RAG with custom documents using local LLMs.
  • Supports deployment via Podman and OpenShift (Kubernetes).
  • Includes options for GPU acceleration with the NVIDIA Container Toolkit or AMD KFD/DRI device passthrough (a sketch follows this list).
  • Offers guidance on disabling the Ollama service for debugging on Linux.
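
For container-based GPU access, the usual pattern is to pass the GPU devices through to the Ollama container. The commands below are a hedged sketch of that pattern (image, volume name, and ports are generic, and the NVIDIA variant assumes a CDI spec generated with the NVIDIA Container Toolkit), not the repository's exact invocation:

    # NVIDIA: expose GPUs via the Container Device Interface (nvidia-ctk cdi generate).
    podman run -d --device nvidia.com/gpu=all -v ollama:/root/.ollama \
      -p 11434:11434 docker.io/ollama/ollama

    # AMD: pass the KFD and DRI device nodes through and use the ROCm image.
    podman run -d --device /dev/kfd --device /dev/dri -v ollama:/root/.ollama \
      -p 11434:11434 docker.io/ollama/ollama:rocm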

Maintenance & Community

  • The repository is maintained by sroecker.
  • References include "AI on Openshift" and "Open Sourcerers."

Licensing & Compatibility

  • The repository itself does not explicitly state a license in the README.
  • The software stack uses open-source tools (Streamlit, LlamaIndex, Ollama), which have their own licenses. Compatibility for commercial use depends on the licenses of these underlying components.

Limitations & Caveats

  • Local LLM performance is highly dependent on hardware, especially for GPU-less setups.
  • Generating embeddings directly within the Streamlit app requires increasing shared memory for PyTorch (see the example after this list), and LlamaIndex does not yet support generating embeddings through the Ollama service itself.
  • GPU support requires specific setup with NVIDIA Container Toolkit or AMD drivers.
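
If embeddings are computed inside the containerized Streamlit app, the container's shared memory typically has to be raised for PyTorch. A hedged example of the flag involved follows; the image name and size are placeholders, not values from the repository:

    # Give the app container more /dev/shm so in-process embedding generation does not fail.
    podman run -d --shm-size=2g -p 8501:8501 localhost/llm-appdev-streamlit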

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 5 stars in the last 90 days
