pygpt4all  by nomic-ai

Python bindings for local LLM inference (deprecated)

Created 2 years ago
1,019 stars

Top 36.7% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides Python bindings for llama.cpp and gpt4all, enabling local execution of large language models. It targets developers and researchers seeking to integrate LLM capabilities into Python applications without relying on cloud APIs. The primary benefit is local, private, and cost-effective LLM inference.

How It Works

The project offers Python bindings that interface directly with the llama.cpp C++ library. This allows users to load and run various quantized LLM architectures (like LLaMA, GPT-J, and GPT-Neo) efficiently on local hardware, leveraging CPU or GPU acceleration through llama.cpp's optimized inference engine.

Quick Start & Requirements

Highlighted Details

  • Supports a wide range of quantized LLM architectures.
  • Leverages llama.cpp for efficient CPU and GPU inference.
  • Enables local, private LLM execution.

Maintenance & Community

This repository is archived. Future development and support are handled by the main nomic-ai/gpt4all repository.

Licensing & Compatibility

The project is released under the MIT License, permitting commercial use and integration into closed-source applications.

Limitations & Caveats

This repository is archived and no longer actively maintained. The pygpt4all PyPI package may diverge from backend model updates. Users are directed to use the gpt4all package for current bindings.

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Yineng Zhang Yineng Zhang(Inference Lead at SGLang; Research Scientist at Together AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
9 more.

LightLLM by ModelTC

0.5%
4k
Python framework for LLM inference and serving
Created 2 years ago
Updated 16 hours ago
Starred by Lianmin Zheng Lianmin Zheng(Coauthor of SGLang, vLLM), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
1 more.

MiniCPM by OpenBMB

0.4%
8k
Ultra-efficient LLMs for end devices, achieving 5x+ speedup
Created 1 year ago
Updated 1 week ago
Feedback? Help us improve.