Text generation server for LLM applications
LoLLMs Server provides a Flask-based API and WebSocket interface for generating text using various large language models. It targets developers seeking to integrate LLM capabilities into applications, offering features like multiple personalities, real-time generation, and local data handling.
How It Works
LoLLMs Server acts as a unified interface to different LLM bindings, abstracting away model-specific complexities. It supports multiple "personalities" to influence generation style and tone. The architecture emphasizes local data storage and processing, even when serving requests remotely, with communication handled via RESTful API and WebSockets for real-time interaction.
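To make the real-time flow concrete, here is a minimal client sketch using python-socketio. The event names (generate_text, text_chunk, text_generated), the payload fields, and the port 9600 are assumptions for illustration, not confirmed API details; the actual names are defined by the server's Socket.IO handlers.

```python
# Minimal streaming-generation sketch over the WebSocket interface.
# Event names, payload fields, and the port are assumptions, not
# confirmed API details.
import socketio

sio = socketio.Client()

@sio.on("text_chunk")
def on_chunk(data):
    # Print partial output as the model streams it back.
    print(data.get("chunk", ""), end="", flush=True)

@sio.on("text_generated")
def on_done(data):
    print("\n--- generation complete ---")
    sio.disconnect()

sio.connect("http://localhost:9600")
sio.emit("generate_text", {
    "prompt": "Write a haiku about local LLMs.",
    "n_predicts": 128,  # assumed name for the max-tokens parameter
})
sio.wait()
```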
Quick Start & Requirements
Install the package with pip:
pip install --upgrade lollms
Or install the latest version from source:
pip install --upgrade git+https://github.com/ParisNeo/lollms.git
Optionally, create a dedicated conda environment first:
conda create --name lollms python=3.10
conda activate lollms
conda install -c anaconda cudatoolkit
After installation, run lollms-settings to select a binding, download models, and configure the server.
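Once the server is configured and running, it can be exercised over plain HTTP. The sketch below is illustrative only: the port (9600), the /generate path, and the payload fields are assumptions rather than confirmed endpoints; check the server's route definitions for the actual API.

```python
# Hypothetical REST call. Port, endpoint path, and payload fields
# ('prompt', 'n_predicts') are assumed for illustration.
import requests

resp = requests.post(
    "http://localhost:9600/generate",
    json={"prompt": "Summarize the LoLLMs server in one sentence.",
          "n_predicts": 64},
    timeout=120,
)
resp.raise_for_status()
print(resp.text)  # response schema may differ; inspect before parsing
```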
Highlighted Details
Maintenance & Community
Contributions are welcome; guidelines are in CONTRIBUTING.md.
Licensing & Compatibility
The project is released under the Apache 2.0 license.
Limitations & Caveats
The README does not detail model compatibility beyond mentioning GGML and general Hugging Face support, and it provides neither performance benchmarks nor hardware requirements for optimal operation.