Pawel Garbacki

Cofounder of Fireworks AI

Starred Projects (322)

DeepSeek-OCR by deepseek-ai

1.1%

21k

Context-aware OCR model for visual-text compression

Starred by

Created 1 month ago

Updated 1 month ago

NeuralFlow by valine

381

Python script for visualizing Mistral 7B intermediate layer outputs

Created 1 year ago

Updated 10 months ago

MagiAttention by SandAI-org

1.6%

570

Distributed attention mechanism research paper for ultra-long context, heterogeneous data training

Starred by

Created 7 months ago

Updated 2 days ago

tinker-cookbook by thinking-machines-lab

3.8%

Advanced LLM fine-tuning SDK and example cookbook

Starred by

Created 4 months ago

Updated 5 days ago

torchtitan by pytorch

0.5%

PyTorch platform for generative AI model training research

Starred by

+11

Created 1 year ago

Updated 1 day ago

checkpoint-engine by MoonshotAI

0.8%

849

Middleware for efficient LLM weight updates during inference

Starred by

Created 2 months ago

Updated 6 days ago

SWE-bench by SWE-bench

0.8%

Benchmark for evaluating LLMs on real-world GitHub issues

Starred by

+11

Created 2 years ago

Updated 2 weeks ago

slime by THUDM

2.3%

LLM post-training framework for RL scaling

Starred by

Created 5 months ago

Updated 1 day ago

Step-Audio2 by stepfun-ai

1.0%

End-to-end audio understanding and speech conversation model

Created 4 months ago

Updated 2 months ago

inspect_ai by UKGovernmentBEIS

1.6%

Framework for large language model evaluations

Starred by

Created 2 years ago

Updated 1 day ago

openbench by groq

0.9%

667

Provider-agnostic LLM evaluation infrastructure

Starred by

Created 4 months ago

Updated 2 days ago

LaCT by a1600012888

1.3%

324

Test-Time Training framework for adaptable models

Starred by

Created 6 months ago

Updated 1 week ago

SkyRL by NovaSky-AI

4.0%

RL training pipeline for multi-turn tool use LLMs, optimized for real-world tasks

ART by OpenPipe

0.7%

RL library for training LLM agents via GRPO

Starred by

Created 8 months ago

Updated 3 days ago

torch-profiling-tutorial by Quentin-Anthony

0.4%

532

PyTorch model profiling tutorial

Created 4 months ago

Updated 3 months ago

OpenHands by OpenHands

0.2%

65k

AI platform for software development agents

Starred by

+36

Created 1 year ago

Updated 23 hours ago

ERNIE by PaddlePaddle

0.1%

PaddlePaddle implementations for ERNIE family pre-training models

Starred by

Created 6 years ago

Updated 2 days ago

codex by openai

0.7%

51k

Coding agent CLI tool for terminal-based chat-driven development

Starred by

+33

Created 7 months ago

Updated 1 day ago

claude-code by anthropics

1.8%

44k

Agentic coding assistant for your terminal

Starred by

+15

Created 9 months ago

Updated 3 days ago

opencode by opencode-ai

0.2%

10k

CLI tool for terminal-based AI coding assistance

Starred by

+11

Created 8 months ago

Updated 2 months ago

gemini-cli by google-gemini

1.3%

85k

AI agent for terminal workflows

MiniMax-M1 by MiniMax-AI

0.2%

Open-weight reasoning model with hybrid attention

Starred by

Created 5 months ago

Updated 4 months ago

RAGEN by mll-lab-nu

0.7%

Train LLM agents with reinforcement learning in interactive environments

Starred by

Created 10 months ago

Updated 2 days ago

VoRA by Hon-Wong

1.4%

353

MLLM with visual capabilities

Created 8 months ago

Updated 5 months ago

awesome-instruction-datasets by jianzhnie

710

Curated list of instruction datasets for training ChatLLMs

Created 2 years ago

Updated 1 year ago

xLAM by SalesforceAIResearch

0.2%

584

xLAM is a family of large action models for AI agent systems

Starred by

Created 1 year ago

Updated 3 months ago

deer-flow by bytedance

1.0%

18k

Deep research framework combining language models with specialized tools

Starred by

Created 6 months ago

Updated 1 day ago

arena-hard-auto by lmarena

0.3%

963

Automatic LLM benchmark for instruction-tuned models, correlating with human preference

Starred by

Created 2 years ago

Updated 5 months ago

atropos by NousResearch

0.3%

756

RL environment framework for LLM trajectory collection/evaluation

Starred by

Created 7 months ago

Updated 4 days ago

12-factor-agents by humanlayer

0.7%

16k

Principles for reliable LLM application development

Starred by

Created 8 months ago

Updated 2 months ago

DAPO by BytedTsinghua-SIA

0.8%

Open-source RL system for large-scale LLM training

Starred by

Created 8 months ago

Updated 6 months ago

system-prompts-and-models-of-ai-tools by x1xhlol

1.5%

98k

AI tool system prompts and models

Starred by

Created 9 months ago

Updated 1 day ago

rllm by rllm-org

0.6%

Framework for post-training language agents via reinforcement learning

Starred by

Created 10 months ago

Updated 4 days ago

ring-flash-attention by zhuzilin

0.2%

923

FlashAttention extension for ring attention

Starred by

Created 1 year ago

Updated 2 months ago

understand-r1-zero by sail-sg

0.4%

Research paper analyzing R1-Zero-like training for LLMs

Starred by

Created 8 months ago

Updated 3 months ago

dynamo by ai-dynamo

0.8%

Inference framework for distributed generative AI model serving

Starred by

Created 9 months ago

Updated 21 hours ago

openai-agents-python by openai

0.7%

18k

Python SDK for multi-agent workflows

OpenManus by FoundationAgents

0.2%

51k

Open-source framework for building general AI agents

Starred by

Created 8 months ago

Updated 1 week ago

verl by volcengine

3.1%

17k

RL training library for LLMs

Search-R1 by PeterGriffinJin

0.8%

RL framework for training LLMs to use search engines

Starred by

Created 9 months ago

Updated 2 weeks ago

s1 by simplescaling

0.1%

Test-time scaling recipe for strong reasoning performance

Starred by

Created 10 months ago

Updated 5 months ago

open-r1 by huggingface

0.1%

26k

SDK for reproducing DeepSeek-R1

Starred by

+17

Created 10 months ago

Updated 6 days ago

openllmetry by traceloop

0.3%

Open-source observability SDK for LLM applications

Kimi-k1.5 by MoonshotAI

0.0%

Research paper on scaling reinforcement learning with LLMs

Starred by

Created 10 months ago

Updated 8 months ago

UI-TARS by bytedance

0.5%

Multimodal agent for GUI interaction in virtual worlds (research paper)

Starred by

Created 10 months ago

Updated 2 weeks ago

DeepSeek-R1 by deepseek-ai

0.1%

92k

Reasoning models research paper

Starred by

+16

Created 10 months ago

Updated 5 months ago

unsloth by unslothai

0.5%

49k

Finetuning tool for LLMs, targeting speed and memory efficiency

ml-cross-entropy by apple

0.7%

555

PyTorch module for memory-efficient cross-entropy in LLMs

Starred by

Created 1 year ago

Updated 2 months ago

Liger-Kernel by linkedin

0.4%

Triton kernels for efficient LLM training

Starred by

Created 1 year ago

Updated 2 days ago

MiniMax-01 by MiniMax-AI

0.1%

Large language & vision-language models based on linear attention

Starred by

Created 10 months ago

Updated 4 months ago

tabby by TabbyML

0.1%

33k

Self-hosted AI coding assistant for on-prem code completion

continue by continuedev

0.4%

30k

IDE extension for custom AI code assistants

SkyThought by NovaSky-AI

0.0%

Training recipes for Sky-T1 family of models

Starred by

Created 10 months ago

Updated 4 months ago

dspy by stanfordnlp

0.5%

30k

Framework for programming language models, not prompting

storm by stanford-oval

0.1%

28k

LLM system for automated knowledge curation and article generation

Starred by

Created 1 year ago

Updated 2 months ago

open-computer-use by e2b-dev

0.6%

AI agent for computer control via LLMs

Starred by

Created 1 year ago

Updated 5 months ago

PaLM-rlhf-pytorch by lucidrains

0.1%

RLHF implementation on PaLM

Starred by

Created 3 years ago

Updated 1 month ago

picotron by huggingface

0.6%

Minimalist distributed training framework for educational use

Starred by

Created 1 year ago

Updated 3 months ago

PRIME by PRIME-RL

0.4%

Scalable RL solution for advanced reasoning of language models

Starred by

Created 11 months ago

Updated 8 months ago

sglang by sgl-project

0.9%

20k

Fast serving framework for LLMs and vision language models

Starred by

+34

Created 1 year ago

Updated 21 hours ago

DeepSeek-V3 by deepseek-ai

0.1%

100k

MoE language model research paper with 671B total parameters

Starred by

+13

Created 11 months ago

Updated 3 months ago

gitingest by coderamp-labs

0.4%

13k

CLI tool for LLM-friendly code ingestion from Git repos

Starred by

Created 1 year ago

Updated 6 days ago

tau-bench by sierra-research

2.4%

971

Benchmark for tool-agent-user interaction research

Starred by

Created 1 year ago

Updated 3 months ago

Genesis by Genesis-Embodied-AI

0.2%

28k

Physics platform for robotics & embodied AI learning

search-and-learn by huggingface

Recipes to scale inference-time compute of open models

Starred by

Created 11 months ago

Updated 6 months ago

open-instruct by allenai

1.3%

Training codebase for instruction-following language models

Starred by

Created 2 years ago

Updated 1 day ago

desktop by e2b-dev

0.8%

SDK for virtual desktop sandboxes for LLM-powered computer use

Starred by

Created 1 year ago

Updated 3 weeks ago

VLMEvalKit by open-compass

1.1%

Evaluation toolkit for large multi-modality models (LMMs)

Created 2 years ago

Updated 3 days ago

Qwen3-VL by QwenLM

1.3%

17k

Multimodal LLM for vision-language tasks, document parsing, and agent functionality

Starred by

Created 1 year ago

Updated 2 days ago

DocBank by doc-analysis

0.2%

629

Layout analysis dataset for document understanding tasks

Created 5 years ago

Updated 1 year ago

Qwen-VL by QwenLM

0.1%

Vision-language model for multimodal understanding, localization, and text reading

Starred by

Created 2 years ago

Updated 1 year ago

OSWorld by xlang-ai

1.0%

Multimodal agent benchmark for open-ended tasks in realistic computer environments

Starred by

Created 2 years ago

Updated 1 week ago

SoM by microsoft

0.5%

Visual prompting method for GPT-4V and LMMs

Starred by

Created 2 years ago

Updated 1 year ago

dynasaur by adobe-research

349

LLM agent framework using dynamic action creation via Python code generation

Starred by

Created 1 year ago

Updated 11 months ago

browser-use by browser-use

0.4%

73k

SDK for AI agent browser control

ShowUI by showlab

0.4%

Vision-language-action model for GUI agent & computer use (CVPR 2025 paper)

Starred by

Created 1 year ago

Updated 6 months ago

WilmerAI by SomeOddCodeGuy

0.1%

789

AI inference router for specialized workflows

Starred by

Created 1 year ago

Updated 1 month ago

aisuite by andrewyng

0.2%

13k

Unified interface for multiple generative AI providers

Starred by

Created 1 year ago

Updated 2 weeks ago

servers by modelcontextprotocol

0.6%

74k

Reference implementations for the Model Context Protocol (MCP) servers

python-sdk by modelcontextprotocol

0.6%

20k

Python SDK for Model Context Protocol (MCP) servers/clients

Starred by

Created 1 year ago

Updated 2 days ago

every-chatgpt-gui by billmei

0.3%

Curated list of ChatGPT, Claude, and other LLM front-end GUI clients

Created 2 years ago

Updated 3 weeks ago

MinerU by opendatalab

0.7%

50k

PDF extraction tool for converting PDFs to Markdown and JSON

Starred by

Created 1 year ago

Updated 3 days ago

Qwen3-Coder by QwenLM

0.5%

14k

Code LLM for code completion, generation, and assistant use cases

Starred by

Created 1 year ago

Updated 4 months ago

LLMxMapReduce by thunlp

0.2%

839

Framework for LLM long-sequence processing via MapReduce-inspired divide-and-conquer

Created 1 year ago

Updated 3 weeks ago

llm-app by pathwaycom

0.4%

48k

LLM app templates for RAG, AI pipelines, and enterprise search

Starred by

Created 2 years ago

Updated 1 month ago

docling by docling-project

1.4%

45k

Prepare documents for generative AI

Qwen3 by QwenLM

0.3%

26k

Large language model series by Qwen team, Alibaba Cloud

meditron by epfLLM

0.1%

Open-source medical LLMs adapted from Llama-2

Starred by

Created 2 years ago

Updated 1 year ago

OmniParser by microsoft

0.1%

24k

Screen parsing tool for vision-based GUI agents

Starred by

Created 1 year ago

Updated 2 months ago

fast-apply by kortix-ai

372

Pipeline for data generation and fine-tuning Qwen2.5 Coder models

Starred by

Created 1 year ago

Updated 2 months ago

Emu3 by baaivision

0.2%

Multimodal model for vision-language understanding and generation

Starred by

Created 1 year ago

Updated 1 week ago

together-cookbook by togethercomputer

0.1%

Cookbook for open-source models via Together AI

Created 1 year ago

Updated 3 days ago

zerox by getomni-ai

0.1%

12k

OCR SDK for AI ingestion of documents with complex layouts

Starred by

Created 1 year ago

Updated 6 months ago

Janus by deepseek-ai

0.1%

18k

Unified multimodal model research paper for understanding and generation

Starred by

Created 1 year ago

Updated 10 months ago

TransformerEngine by NVIDIA

0.6%

Library for Transformer model acceleration on NVIDIA GPUs

Starred by

Created 3 years ago

Updated 4 days ago

O1-Journey by GAIR-NLP

0.1%

Research paper on replicating O1 via "journey learning"

Starred by

Created 1 year ago

Updated 10 months ago

chunkr by lumina-ai-inc

0.1%

Document intelligence API for RAG/LLM workflows

Starred by

Created 1 year ago

Updated 2 months ago

zep by getzep

0.6%

Memory foundation for AI stacks, enabling continuous learning

Starred by

Created 2 years ago

Updated 1 week ago

ColBERT by stanford-futuredata

0.1%

Neural search for fast, accurate retrieval over large text collections

Starred by

Created 5 years ago

Updated 1 month ago

optillm by algorithmicsuperintelligence

0.9%

Optimizing inference proxy for LLMs

Starred by

Created 1 year ago

Updated 23 hours ago

LiveCodeBench by LiveCodeBench

0.7%

723

Benchmark for holistic LLM code evaluation

Starred by

Created 1 year ago

Updated 4 months ago

zml by zml

0.6%

AI inference stack for production

Starred by

Created 1 year ago

Updated 2 days ago

Awesome-LLM-Strawberry by hijkzzz

0.1%

Collection of LLM papers, blogs, and projects focused on reasoning techniques

Starred by

Created 1 year ago

Updated 1 month ago

GOT-OCR2.0 by Ucas-HaoranWei

0.2%

OCR research paper for unified end-to-end model

Created 1 year ago

Updated 9 months ago

colpali by illuin-tech

0.8%

Vision-language model code for document retrieval research

Starred by

Created 1 year ago

Updated 2 days ago

SuperPrompt by NeoVertex1

0.1%

Prompt engineering research for AI agent understanding

Starred by

Created 1 year ago

Updated 2 months ago

rStar by zhentingqi

0.2%

966

Research paper for improving small LLM reasoning via mutual reasoning

Starred by

Created 1 year ago

Updated 10 months ago

llama-stack-apps by llamastack

0.0%

Agentic app examples built on Llama Stack

Starred by

Created 1 year ago

Updated 3 months ago

ms-swift by modelscope

1.3%

11k

SDK for fine-tuning and deploying LLMs/MLLMs

Starred by

Created 2 years ago

Updated 23 hours ago

InternLM-XComposer by InternLM

0.1%

Multimodal model for long-context video/audio interactions, image understanding, and composition

Starred by

Created 2 years ago

Updated 6 months ago

DisTrO by NousResearch

966

Distributed optimizers research paper

Starred by

Created 1 year ago

Updated 1 month ago

llama_cloud_services by run-llama

0.1%

SDK for LlamaCloud GenAI services

Starred by

Created 1 year ago

Updated 3 days ago

DistillKit by arcee-ai

0.5%

785

Open-source toolkit for LLM distillation research

Starred by

Created 1 year ago

Updated 4 months ago

flux by black-forest-labs

0.3%

25k

Inference code for FLUX image generation & editing models

llama-stack by llamastack

0.2%

Composable building blocks for Llama apps

Starred by

Created 1 year ago

Updated 2 days ago

MindSearch by InternLM

0.1%

LLM multi-agent framework for web search (Perplexity AI, SearchGPT)

Starred by

Created 1 year ago

Updated 4 months ago

unstructured by Unstructured-IO

0.4%

13k

ETL solution for structuring unstructured data for language models

VIINA by zhukovyuri

0.3%

325

Event data system for the 2022 Russian Invasion of Ukraine

Created 3 years ago

Updated 1 day ago

MInference by microsoft

0.4%

Framework for long-context LLM inference speedup via sparse attention

Starred by

Created 1 year ago

Updated 2 months ago

ultravox by fixie-ai

0.1%

Multimodal LLM for real-time voice interactions

Starred by

Created 1 year ago

Updated 2 months ago

octo by octo-models

0.1%

Robot policy for generalist manipulation, trained on 800k trajectories

Starred by

Created 1 year ago

Updated 1 year ago

DeepSeek-Coder-V2 by deepseek-ai

0.3%

Open-source code language model comparable to GPT4-Turbo

Starred by

Created 1 year ago

Updated 2 weeks ago

MathBlackBox by trotsky1997

0.1%

Research paper for mathematical reasoning via LLMs

Starred by

Created 1 year ago

Updated 11 months ago

EAGLE by SafeAILab

0.9%

Speculative decoding research paper for faster LLM inference

Starred by

Created 2 years ago

Updated 1 week ago

tianshou by thu-ml

0.2%

PyTorch RL library for algorithm development and application

Starred by

Created 7 years ago

Updated 1 week ago

gemma-2B-10M by mustafaaljadery

0.1%

941

Gemma 2B with 10M context length using Infini-attention

Starred by

Created 1 year ago

Updated 1 year ago

RULER by NVIDIA

0.5%

Evaluation suite for long-context language models research paper

Starred by

Created 1 year ago

Updated 2 weeks ago

VILA by NVlabs

0.3%

Open-source VLMs for efficient video/multi-image understanding

Starred by

Created 1 year ago

Updated 3 days ago

ThunderKittens by HazyResearch

0.5%

CUDA kernel framework for fast deep learning primitives

selfcodealign by bigcode-project

321

Research paper for self-alignment in code generation

Starred by

Created 1 year ago

Updated 9 months ago

VAR by FoundationVision

0.3%

Image generation research paper using visual autoregressive modeling

Starred by

Created 1 year ago

Updated 2 weeks ago

FlagEmbedding by FlagOpen

0.4%

11k

Toolkit for retrieval and RAG applications

Starred by

Created 2 years ago

Updated 1 month ago

FILM by microsoft

0.4%

261

LLM for enhanced context utilization

Created 1 year ago

Updated 1 year ago

PLLaVA by magic-research

0.1%

673

Research paper for parameter-free LLaVA extension to videos

Created 1 year ago

Updated 1 year ago

cohere-toolkit by cohere-ai

0.2%

RAG toolkit for LLM application development and deployment

Starred by

Created 1 year ago

Updated 1 week ago

EasyContext by jzhang38

750

Recipes for language model context length extrapolation to 1M tokens

Starred by

Created 1 year ago

Updated 1 year ago

Spec-Bench by hemingkx

0.3%

338

Benchmark for speculative decoding methods (ACL 2024 paper)

Created 1 year ago

Updated 7 months ago

openai-node by openai

0.2%

10k

TypeScript/JavaScript SDK for the OpenAI API

Starred by

Created 4 years ago

Updated 1 week ago

llama3 by meta-llama

0.1%

29k

*Deprecated* minimal example for loading and running Llama 3 models

Starred by

+13

Created 1 year ago

Updated 10 months ago

distilabel by argilla-io

0.6%

Framework for synthetic data and AI feedback pipelines

Open-Sora-Plan by PKU-YuanGroup

0.1%

12k

Open-source project aiming to reproduce Sora-like T2V model

Starred by

Created 1 year ago

Updated 1 month ago

LLaVA-UHD by thunlp

1.0%

397

Efficient native-resolution encoding for multimodal LLMs

Created 1 year ago

Updated 3 days ago

higgsfield by higgsfield-ai

0.0%

ML framework for large model training and GPU orchestration

SWE-agent by SWE-agent

0.3%

18k

Agent for automated software engineering (NeurIPS 2024)

VoiceCraft by jasonppy

0.0%

Zero-shot speech editing and TTS research paper

Starred by

Created 1 year ago

Updated 8 months ago

Open-Sora by hpcaitech

0.2%

28k

Video generation initiative for efficient, high-quality video production

Starred by

Created 1 year ago

Updated 7 months ago

bark by suno-ai

0.1%

39k

Generative audio model for realistic speech and sound effects

pal by reasoning-machines

0.2%

517

Program-aided language model for reasoning tasks

Starred by

Created 3 years ago

Updated 2 years ago

torchtune by meta-pytorch

0.1%

PyTorch library for LLM post-training and experimentation

self-rag by AkariAsai

0.1%

Self-RAG implementation for learning retrieval, generation, and critique via self-reflection

Starred by

Created 2 years ago

Updated 1 year ago

OpenPipe by OpenPipe

0.1%

Fine-tuning platform for cheaper models

Starred by

Created 2 years ago

Updated 1 year ago

LLM-Blender by yuchenlin

0.1%

971

LLM ensembling framework using pairwise ranking and generative fusion

Starred by

Created 2 years ago

Updated 1 year ago

grok-1 by xai-org

0.1%

51k

JAX example code for loading and running Grok-1 open-weights model

aici by microsoft

0.1%

AICI constrains LLM output using (Wasm) programs

Starred by

Created 2 years ago

Updated 10 months ago

Yi by 01-ai

0.0%

Open-source bilingual LLMs trained from scratch

Starred by

Created 2 years ago

Updated 1 year ago

anthropic-tools by anthropics

0.3%

329

SDK for tool/function calling with Anthropic models (research preview)

Starred by

Created 2 years ago

Updated 1 year ago

self-rewarding-lm-pytorch by lucidrains

0.1%

Training framework for self-rewarding language models

Starred by

Created 1 year ago

Updated 1 year ago

OpenCodeInterpreter by OpenCodeInterpreter

Open-source code generation system for bridging LLMs and code interpreters

Starred by

Created 1 year ago

Updated 1 year ago

LWM by LargeWorldModel

0.1%

Multimodal autoregressive model for long-context video/text

Starred by

Created 1 year ago

Updated 1 year ago

ai by vercel

1.0%

20k

AI SDK for building AI-powered applications and agents

Starred by

+15

Created 2 years ago

Updated 23 hours ago

SPIN by uclaml

0.2%

Self-Play Fine-Tuning (SPIN) research paper implementation

Starred by

Created 1 year ago

Updated 1 year ago

trigger.dev by triggerdotdev

0.3%

13k

Open-source platform for background jobs and AI workflows

Starred by

Created 3 years ago

Updated 2 days ago

AgentBoard by hkust-nlp

0.3%

366

Analytical evaluation board for multi-turn LLM agents

Starred by

Created 1 year ago

Updated 1 year ago

sparrow by katanaml

0.2%

Data processing & instruction calling tool using ML, LLM, and Vision LLM

Starred by

Created 3 years ago

Updated 6 days ago

search_with_lepton by leptonai

0.0%

Conversational search engine demo

Starred by

Created 1 year ago

Updated 2 weeks ago

autogen by microsoft

0.3%

52k

Agentic framework for multi-agent AI applications

m2 by HazyResearch

561

Sub-quadratic architecture research paper

Starred by

Created 2 years ago

Updated 11 months ago

mergekit by arcee-ai

0.3%

CLI tool for merging pretrained language models, combining strengths without retraining

ToolAlpaca by tangqiaoyu

0.2%

886

Tool-learning framework for language models, research paper

Starred by

Created 2 years ago

Updated 1 year ago

ToolBench by OpenBMB

0.2%

Open platform for LLM tool learning (ICLR'24 spotlight)

Starred by

Created 2 years ago

Updated 6 months ago

NexusRaven by nexusflowai

318

Evaluation framework for function-calling LLM, NexusRaven-13B

Starred by

Created 2 years ago

Updated 2 years ago

gpt4free by xtekky

0.1%

66k

API package for multi-provider LLM requests (GPT-4.1, Gemini 2.5, Deepseek R1)

Starred by

Created 2 years ago

Updated 1 day ago

promptbench by microsoft

0.1%

LLM evaluation framework

Starred by

Created 2 years ago

Updated 1 month ago

spinningup by openai

0.1%

11k

Educational resource for learning deep reinforcement learning

MiniGPT-4 by Vision-CAIR

0.0%

26k

Vision-language model for multi-task learning

LLMCompiler by SqueezeAILab

0.4%

LLM compiler for parallel function calling

Starred by

Created 2 years ago

Updated 1 year ago

NexusRaven-V2 by nexusflowai

415

Open-source LLM for function calling, outperforming GPT-4 in some cases

Starred by

Created 2 years ago

Updated 1 year ago

mamba by state-spaces

0.4%

17k

Mamba SSM architecture for sequence modeling

gpt-fast by meta-pytorch

0.1%

PyTorch text generation for efficient transformer inference

generative-models by Stability-AI

0.1%

27k

Generative models SDK for video, image, and 3D synthesis research

Starred by

Created 2 years ago

Updated 3 weeks ago

LLM-Shearing by princeton-nlp

632

Code for LLM pre-training acceleration via structured pruning (ICLR 2024)

Starred by

Created 2 years ago

Updated 1 year ago

TensorRT-LLM by NVIDIA

0.4%

12k

LLM inference optimization SDK for NVIDIA GPUs

gpt-crawler by BuilderIO

0.1%

22k

CLI tool for site crawling to generate custom GPT knowledge files

Starred by

Created 2 years ago

Updated 4 months ago

draw-a-ui by SawyerHood

0.1%

14k

Web app generates HTML from UI wireframes

Starred by

Created 2 years ago

Updated 4 months ago

open-interpreter by openinterpreter

0.1%

61k

Natural language interface for computers

LLaVA-Plus-Codebase by LLaVA-VL

0.1%

762

Multimodal agent for vision tasks using external tools

Starred by

Created 2 years ago

Updated 1 year ago

opengpts by langchain-ai

0.1%

Open-source platform for building custom GPT assistants

Starred by

Created 2 years ago

Updated 5 months ago

openchat by imoneoi

0.1%

Open-source LLM fine-tuned with C-RLFT, inspired by offline reinforcement learning

Starred by

Created 2 years ago

Updated 1 year ago

CLIP by openai

0.3%

32k

Image-text matching model for zero-shot prediction

LLaVA by haotian-liu

0.3%

24k

Multimodal assistant with GPT-4 level capabilities

llmperf by ray-project

0.6%

LLM validation/benchmark library for LLM APIs

Starred by

Created 2 years ago

Updated 11 months ago

LightLLM by ModelTC

0.5%

Python framework for LLM inference and serving

Starred by

Created 2 years ago

Updated 2 days ago

examples by graphcore

332

ML examples for Graphcore IPUs, training and inference

Starred by

Created 7 years ago

Updated 1 year ago

streaming-llm by mit-han-lab

0.1%

Framework for efficient LLM streaming

Starred by

Created 2 years ago

Updated 1 year ago

mistral-inference by mistralai

0.1%

11k

Inference library for Mistral models

Starred by

Created 2 years ago

Updated 1 week ago

can-ai-code by the-crypt-keeper

598

AI coding model evaluation framework

Starred by

Created 2 years ago

Updated 5 months ago

LongLoRA by dvlab-research

0.0%

LongLoRA: Efficient fine-tuning for long-context LLMs

Starred by

Created 2 years ago

Updated 1 year ago

Qwen by QwenLM

0.3%

20k

Chat & pretrained LLM by Alibaba Cloud

ollama by ollama

0.3%

157k

CLI tool for running LLMs locally

alpaca_farm by tatsu-lab

837

RLHF simulation framework for accessible instruction-following/alignment research

Starred by

Created 2 years ago

Updated 1 year ago

Medusa by FasterDecoding

0.1%

Framework for accelerating LLM generation using multiple decoding heads

Starred by

Created 2 years ago

Updated 1 year ago

adept-inference by persimmon-ai-labs

412

Inference code for the Persimmon-8B LLM

Starred by

Created 2 years ago

Updated 2 years ago

butterfish by bakks

0.4%

464

CLI tool for adding AI to your shell

Starred by

Created 2 years ago

Updated 7 months ago

shell_gpt by TheR1D

0.1%

12k

CLI tool for shell command generation and task automation using LLMs

Starred by

Created 2 years ago

Updated 1 month ago

LocalAI by mudler

1.0%

39k

Open-source OpenAI alternative for local AI inference

legalbench by HazyResearch

0.4%

514

Legal reasoning benchmark for evaluating LLMs

Created 3 years ago

Updated 1 year ago

yarn by jquesnelle

0.3%

Context window extension method for LLMs (research paper, models)

Starred by

Created 2 years ago

Updated 1 year ago

shell-ai by ricklamers

0.1%

CLI tool for natural language to shell command translation

Created 2 years ago

Updated 2 months ago

llama-cookbook by meta-llama

0.1%

18k

Guide for building with Llama models

codellama by meta-llama

0.0%

16k

Inference code for CodeLlama models

prm800k by openai

Dataset of LLM solutions to math problems with step-level correctness labels

Starred by

Created 2 years ago

Updated 2 years ago

FLAML by microsoft

0.1%

AutoML library for efficient machine learning and AI operations

Starred by

Created 5 years ago

Updated 1 month ago

engshell by emcf

English-language shell for OS, powered by LLMs

Starred by

Created 2 years ago

Updated 1 year ago

WizardLM by nlpxucan

0.1%

LLMs built using Evol-Instruct for complex instruction following

sqlcoder by defog-ai

0.1%

LLM for natural language to SQL conversion

Starred by

Created 2 years ago

Updated 1 year ago

QuIP by Cornell-RelaxML

0.3%

390

Code for LLM quantization research

Created 2 years ago

Updated 1 year ago

lmdeploy by InternLM

0.5%

Toolkit for LLM compression, deployment, and serving

Starred by

Created 2 years ago

Updated 1 day ago

flexflow-train by flexflow

0.1%

Accelerating distributed deep learning training

Starred by

Created 7 years ago

Updated 1 week ago

Platypus by arielnlee

630

Code for fine-tuning LLMs using LoRA

Starred by

Created 2 years ago

Updated 1 year ago

MetaGPT by FoundationAgents

0.2%

60k

Multi-agent framework for collaborative AI software development

Starred by

Created 2 years ago

Updated 1 month ago

outlines by dottxt-ai

0.6%

13k

SDK for structured LLM text generation

Chinese-Llama-2-7b by LinkSoul-AI

0.0%

Chinese Llama 2 model for chat, fully open-source and commercially available

Created 2 years ago

Updated 2 years ago

sentence-transformers by huggingface

0.2%

18k

Framework for text embeddings, retrieval, and reranking

private-gpt by zylon-ai

0.1%

57k

Private AI API for local document interaction using LLMs

swiss_army_llama by Dicklesworthstone

FastAPI service for semantic text search using precomputed embeddings

Starred by

Created 2 years ago

Updated 9 months ago

transformers-tutorials by abhimishra91

0.1%

856

Tutorials for fine-tuning transformer models on NLP tasks

Starred by

Created 5 years ago

Updated 1 year ago

bert by google-research

0.1%

40k

TensorFlow code and pre-trained models for BERT

ALCE by princeton-nlp

0.2%

501

Benchmark for evaluating LLMs' citation abilities

Starred by

Created 2 years ago

Updated 1 year ago

lorahub by sail-sg

0.1%

659

Framework for efficient cross-task generalization via dynamic LoRA composition

Starred by

Created 2 years ago

Updated 1 year ago

text-to-text-transfer-transformer by google-research

0.1%

Unified text-to-text transformer for NLP research

ai-chatbot by vercel

0.4%

19k

Next.js chatbot template for building AI-powered chat applications

Starred by

Created 2 years ago

Updated 1 day ago

OpenChatKit by togethercomputer

0.0%

Open-source toolkit for building specialized/general-purpose chat models

clownfish by newhouseb

329

Constrained decoding for LLMs against JSON schema

Starred by

Created 2 years ago

Updated 2 years ago

stanford_alpaca by tatsu-lab

0.1%

30k

Instruction-following LLaMA model training and data generation

deep-learning-pytorch-huggingface by philschmid

0.2%

Tutorials for deep learning with PyTorch and Hugging Face libraries

Starred by

Created 3 years ago

Updated 9 months ago

xTuring by stochasticai

SDK for fine-tuning and customizing open-source LLMs

Starred by

Created 2 years ago

Updated 4 days ago

LMOps by microsoft

0.4%

AI research initiative for building AI products with foundation models

Starred by

Created 3 years ago

Updated 1 week ago

DialogStudio by salesforce

517

Unified dataset for conversational AI research

Created 2 years ago

Updated 10 months ago

ggml by ggml-org

0.2%

14k

Tensor library for machine learning

H2O by FMInference

488

KV cache eviction research paper for efficient LLM inference

Starred by

Created 2 years ago

Updated 1 year ago

h2ogpt by h2oai

0.0%

12k

Private chat with local GPT with document, images, video, etc

Starred by

Created 2 years ago

Updated 1 month ago

qlora by artidoro

0.1%

11k

Finetuning tool for quantized LLMs

LLaMA-Factory by hiyouga

0.6%

63k

Unified fine-tuning tool for 100+ LLMs & VLMs (ACL 2024)

long_llama by CStanKonrad

LLM for long context handling, fine-tuned with Focused Transformer

Starred by

Created 2 years ago

Updated 2 years ago

self-instruct by yizhongw

0.1%

Self-Instruct: Research paper for aligning language models with self-generated instructions

Starred by

Created 2 years ago

Updated 2 years ago

llama-cpp-python by abetlen

0.3%

10k

Python bindings for llama.cpp, enabling local LLM inference

helm by stanford-crfm

0.3%

Open-source Python framework for holistic evaluation of foundation models

text-generation-inference by huggingface

0.2%

11k

Rust/Python/gRPC server for fast LLM text generation

alpaca_lora_4bit by johnsmith0031

535

Fine-tuning and inference tool for quantized LLaMA models

Starred by

Created 2 years ago

Updated 2 years ago

minimal-llama by zphang

457

Code for running and fine-tuning LLaMA models

Starred by

Created 2 years ago

Updated 2 years ago

LongChat by DachengLi1

532

Long-context LLM chatbot training and evaluation framework

Starred by

Created 2 years ago

Updated 1 year ago

exllama by turboderp

0.0%

Llama implementation for memory-efficient quantized weights

Starred by

Created 2 years ago

Updated 2 years ago

LMFlow by OptimalScale

0.0%

Toolkit for finetuning and inference of large foundation models

Starred by

Created 2 years ago

Updated 2 days ago

vllm by vllm-project

0.8%

64k

LLM serving engine for high-throughput, memory-efficient inference

openai-python by openai

0.2%

29k

Python SDK for the OpenAI API

Starred by

+16

Created 5 years ago

Updated 1 day ago

starcoder by bigcode-project

0.0%

Code LM for code generation and instruction fine-tuning

axolotl by axolotl-ai-cloud

0.3%

11k

CLI tool for streamlined post-training of AI models

hnswlib by nmslib

0.1%

Header-only C++ library for fast approximate nearest neighbors

Starred by

+13

Created 8 years ago

Updated 2 months ago

open_clip by mlfoundations

0.3%

13k

OpenCLIP: open-source CLIP implementation for vision-language representation learning

accelerate by huggingface

0.2%

PyTorch training helper for distributed execution

SpQR by Vahe1994

550

Weight compression research paper for near-lossless LLM quantization

Starred by

Created 2 years ago

Updated 11 months ago

AutoGPTQ by AutoGPTQ

0.1%

LLM quantization package using GPTQ algorithm

llm-awq by mit-han-lab

0.3%

Weight quantization research paper for LLM compression/acceleration

Starred by

Created 2 years ago

Updated 4 months ago

Alpaca-CoT by PhoebusSi

0.1%

IFT platform for instruction collection, parameter-efficient methods, and LLMs

Starred by

Created 2 years ago

Updated 1 year ago

baize-chatbot by project-baize

Chat model trained via LoRA, using ChatGPT-generated dialogs

Starred by

Created 2 years ago

Updated 1 year ago

falcontune by rmihaylov

465

CLI tool for finetuning Falcon LLMs

Starred by

Created 2 years ago

Updated 2 years ago

gorilla by ShishirPatil

0.1%

13k

LLM tool-use framework for API invocation and function calling

MeZO by princeton-nlp

0.1%

Research paper implementation for memory-efficient LM fine-tuning

Starred by

Created 2 years ago

Updated 1 year ago

lm-evaluation-harness by EleutherAI

0.6%

11k

Framework for few-shot language model evaluation

llama.cpp by ggml-org

0.4%

91k

C/C++ library for local LLM inference

GPTCache by zilliztech

0.2%

Semantic cache for LLM queries, integrated with LangChain and LlamaIndex

Starred by

Created 2 years ago

Updated 4 months ago

tree-of-thoughts by kyegomez

0.0%

Plug-and-play implementation of Tree of Thoughts for LLM reasoning

Starred by

Created 2 years ago

Updated 4 months ago

gpt-neox by EleutherAI

0.1%

Framework for training large-scale autoregressive language models

pythia by EleutherAI

0.2%

LLM suite for interpretability, learning dynamics, ethics, and transparency research

Starred by

Created 3 years ago

Updated 2 weeks ago

llm-foundry by mosaicml

0.1%

LLM training code for Databricks foundation models

sd-webui-controlnet by Mikubill

0.1%

18k

WebUI extension for ControlNet, an image-generation plugin

Starred by

Created 2 years ago

Updated 1 year ago

ControlNet by lllyasviel

0.1%

33k

Neural network structure for adding conditional control to diffusion models

RWKV-LM by BlinkDL

0.1%

14k

RNN for LLM, transformer-level performance, parallelizable training

basaran by hyperonym

Open-source API server for text completion

Starred by

Created 2 years ago

Updated 1 year ago

pandas-ai by sinaptik-ai

0.2%

23k

Python SDK for conversational data analysis using LLMs and RAG

Starred by

Created 2 years ago

Updated 1 month ago

guidance by guidance-ai

0.1%

21k

Guidance is a programming paradigm for steering LLMs

raft by rapidsai

0.1%

955

CUDA-accelerated primitives for ML/data mining algorithms

Starred by

Created 6 years ago

Updated 5 days ago

lit-llama by Lightning-AI

0.1%

LLaMA implementation for pretraining, finetuning, and inference

Starred by

Created 2 years ago

Updated 5 months ago

civitai by civitai

0.2%

Platform for sharing AI models

Starred by

Created 3 years ago

Updated 2 days ago

LyCORIS by KohakuBlueleaf

0.1%

Parameter-efficient fine-tuning algorithms for Stable Diffusion

Created 2 years ago

Updated 2 weeks ago

stable-diffusion-webui by AUTOMATIC1111

0.1%

159k

Web UI for Stable Diffusion

flash-attention by Dao-AILab

0.6%

21k

Fast, memory-efficient attention implementation

LoRA by microsoft

0.3%

13k

PyTorch library for low-rank adaptation (LoRA) of LLMs

Starred by

+12

Created 4 years ago

Updated 11 months ago

EditAnything by sail-sg

0.0%

Image editing research paper using segmentation and diffusion

Starred by

Created 2 years ago

Updated 9 months ago

ImageBind by facebookresearch

0.1%

PyTorch implementation for multimodal embeddings research paper

Starred by

Created 2 years ago

Updated 1 week ago

langchain by langchain-ai

0.4%

121k

Framework for building LLM-powered applications

open-llms by eugeneyan

0.1%

13k

Curated list of commercially-usable open LLMs

Starred by

Created 2 years ago

Updated 9 months ago

IF by deep-floyd

0.0%

Text-to-image model for photorealistic synthesis and language understanding

Starred by

Created 2 years ago

Updated 1 year ago

EasyLM by young-geng

0.0%

LLM training/finetuning framework in JAX/Flax

Starred by

Created 3 years ago

Updated 1 year ago

open_llama by openlm-research

0.0%

Open-source reproduction of LLaMA models

LLaMA-Adapter by OpenGVLab

0.1%

Efficient fine-tuning for instruction-following LLaMA models

Starred by

Created 2 years ago

Updated 1 year ago

bitsandbytes by bitsandbytes-foundation

0.3%

PyTorch library for k-bit quantization, enabling accessible LLMs

composer by mosaicml

0.1%

DL framework for training at scale, optimized for large-scale clusters

fairseq by facebookresearch

0.1%

32k

Sequence modeling toolkit for translation, language modeling, and text generation research

alpaca-lora by tloen

0.0%

19k

LoRA fine-tuning for LLaMA

web-llm by mlc-ai

0.2%

17k

In-browser LLM inference engine using WebGPU for hardware acceleration

transformers by huggingface

0.2%

153k

ML library for pretrained model inference and training

Awesome-LLM by Hannibal046

0.3%

26k

Curated list of Large Language Model resources

Starred by

Created 2 years ago

Updated 4 months ago

agent-ci by pegasi-ai

0.3%

354

AI testing framework for LLM output validation

Created 2 years ago

Updated 3 weeks ago

trl by huggingface

0.6%

16k

Library for transformer RL

peft by huggingface

0.3%

20k

Parameter-efficient fine-tuning (PEFT) library

annotated_deep_learning_paper_implementations by labmlai

0.2%

65k

PyTorch implementations/tutorials of deep learning papers with side-by-side notes

Starred by

Created 5 years ago

Updated 2 weeks ago

Megatron-LM by NVIDIA

0.5%

14k

Framework for training transformer models at scale

minGPT by karpathy

0.3%

23k

Minimal PyTorch re-implementation for GPT training and inference

nanoGPT by karpathy

0.7%

50k

Minimalist repo for training/finetuning GPT models

llama by meta-llama

0.0%

59k

Inference code for Llama 2 models (deprecated)

Starred by

+38

Created 2 years ago

Updated 10 months ago

RedPajama-Data by togethercomputer

0.1%

Dataset pipeline for training large language models

Starred by

Created 2 years ago

Updated 11 months ago

FasterTransformer by NVIDIA

0.1%

Optimized transformer library for inference

dolly by databrickslabs

11k

Instruction-following LLM trained on the Databricks Machine Learning Platform

StableLM by Stability-AI

0.0%

16k

Language models by Stability AI

FastChat by lm-sys

0.1%

39k

Open platform for training, serving, and evaluating LLM-based chatbots

web-stable-diffusion by mlc-ai

Browser-based Stable Diffusion demo with no server support

Starred by

Created 2 years ago

Updated 1 year ago

DeepSpeed by deepspeedai

0.2%

41k

Deep learning optimization library for distributed training and inference

llama_index by run-llama

0.4%

46k

Data framework for building LLM-powered agents

ColossalAI by hpcaitech

0.1%

41k

AI system for large-scale parallel training

DeepLearningExamples by NVIDIA

0.1%

15k

Deep learning examples for training and deployment

Starred by

Created 7 years ago

Updated 1 year ago

tensorflow by tensorflow

0.1%

193k

Open-source ML framework

Feedback? Help us improve.