Omar Khattab
@okhat
Author of DSPy, ColBERT; Professor at MIT
Starred Projects (23)
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Jesse Clark (Cofounder of Marqo), and 1 more.
pylate by lightonai
0.8%
527
PyLate: library for late interaction model training and retrieval
created 1 year ago
updated 1 week ago
Rankify by DataScienceUIBK
0.6%
495
Python toolkit for retrieval, re-ranking, and RAG research
created 6 months ago
updated 1 week ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Elvis Saravia (Founder of DAIR.AI), and 13 more.
markitdown by microsoft
1.4%
72k
Python tool for converting files to Markdown for LLM text analysis
created 9 months ago
updated 4 days ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").
langwatch by langwatch
0.9%
2k
LLM ops platform for traces, analytics, evaluations, datasets, and prompt optimization
created 1 year ago
updated 21 hours ago
Starred by Wes McKinney (Author of Pandas), Jeff Hammerbacher (Cofounder of Cloudera), and 1 more.
lotus by lotus-data
0.4%
1k
Query engine for LLM-powered data processing using semantic operators
created 1 year ago
updated 3 days ago
Starred by Bob van Luijt (Cofounder of Weaviate).
recipes by weaviate
1.0%
825
End-to-end notebooks for Weaviate features and integrations
created 2 years ago
updated 22 hours ago
awesome-dspy by ganarajpr
0.8%
398
Curated list of DSPy resources
created 1 year ago
updated 5 months ago
Starred by Jeremy Howard (Cofounder of fast.ai) and Jeff Hammerbacher (Cofounder of Cloudera).
xmc.dspy by KarelDO
0%
434
In-context learning for extreme multi-label classification
created 1 year ago
updated 1 year ago
Starred by Andrew Kane (Author of pgvector), James Luan (VP Engineering at Zilliz), and 10 more.
RAGatouille by AnswerDotAI
0.4%
4k
SDK for late-interaction retrieval (ColBERT) in RAG pipelines
created 1 year ago
updated 3 months ago
Starred by Chaoyu Yang (Founder of Bento) and Jeff Hammerbacher (Cofounder of Cloudera).
WikiChat by stanford-oval
0.4%
1k
Improved RAG for factual LLM responses using Wikipedia grounding
created 1 year ago
updated 3 months ago
Starred by Tobi Lutke (Cofounder of Shopify), Omar Sanseviero (DevRel at Google DeepMind), and 19 more.
mlc-llm by mlc-ai
0.2%
21k
Universal LLM deployment engine with ML compilation
created 2 years ago
updated 1 day ago
Starred by Tobi Lutke (Cofounder of Shopify), Matei Zaharia (Cofounder of Databricks), and 37 more.
dspy by stanfordnlp
0.8%
27k
Framework for programming language models, not prompting
created 2 years ago
updated 1 day ago
Starred by Yaowei Zheng (Author of LLaMA-Factory), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 8 more.
helm by stanford-crfm
0.5%
2k
Open-source Python framework for holistic evaluation of foundation models
created 3 years ago
updated 21 hours ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Jesse Clark (Cofounder of Marqo), and 4 more.
primeqa by primeqa
0%
736
Open-source repo for multilingual question answering research
created 3 years ago
updated 7 months ago
esci-data by amazon-science
1.3%
309
Benchmark dataset for product search R&D
created 3 years ago
updated 10 months ago
Starred by Elie Bursztein (Cybersecurity Lead at Google DeepMind), Stella Rose Biderman (Executive Director at EleutherAI), and 11 more.
gpt-neo by EleutherAI
0.0%
8k
GPT-2/3-style model implementation using mesh-tensorflow
created 5 years ago
updated 3 years ago
Starred by Daniel Gross (Cofounder of Safe Superintelligence), Matei Zaharia (Cofounder of Databricks), and 7 more.
ColBERT by stanford-futuredata
0.3%
4k
Neural search for fast, accurate retrieval over large text collections
created 5 years ago
updated 5 days ago
Starred by Amanpreet Singh (Cofounder of Contextual AI), Piotr Dąbkowski (Cofounder of ElevenLabs), and 5 more.
transformer-deploy by ELS-RD
0%
2k
CLI tool for optimized Hugging Face Transformer deployment
created 3 years ago
updated 9 months ago
Starred by Patrick Kidger (Core Contributor to JAX ecosystem), Travis Fischer (Founder of Agentic), and 12 more.
aim by aimhubio
0.2%
6k
Experiment tracker for AI model training runs
created 6 years ago
updated 19 hours ago
Starred by Jeff Hammerbacher (Cofounder of Cloudera), Elvis Saravia (Founder of DAIR.AI), and 5 more.
CodeT5 by salesforce
0.2%
3k
Code LLMs for code understanding and generation research
created 4 years ago
updated 1 year ago
Starred by Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow), Jared Palmer (Ex-VP AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and 1 more.
human-eval by openai
0.5%
3k
Evaluation harness for LLMs trained on code
created 4 years ago
updated 7 months ago
Starred by Shizhe Diao (Research Scientist at NVIDIA; Author of LMFlow).
CodeXGLUE by microsoft
0.2%
2k
Benchmark for code intelligence tasks
created 5 years ago
updated 1 year ago
Starred by Jesse Clark (Cofounder of Marqo), Chaoyu Yang (Founder of Bento), and 9 more.
vespa by vespa-engine
0.3%
6k
Platform for AI + data, online serving at scale
created 9 years ago
updated 1 day ago