web-crawl-q-and-a-example  by openai

Q&A bot using OpenAI API

created 1 year ago
314 stars

Top 87.1% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides a practical example for developers looking to build a question-answering system over their own website content using OpenAI's API and embedding models. It's designed for users familiar with Python and the OpenAI ecosystem who want to leverage their data for conversational AI applications.

How It Works

The project demonstrates a common pattern for RAG (Retrieval-Augmented Generation): website content is crawled, chunked, embedded using OpenAI's text-embedding-ada-002 model, and stored in a vector database (likely ChromaDB, though not explicitly stated in the README). When a user asks a question, it's also embedded, and the most similar document chunks are retrieved from the vector database to provide context for the OpenAI API (e.g., gpt-3.5-turbo) to generate an answer.

Quick Start & Requirements

Highlighted Details

  • Demonstrates end-to-end RAG pipeline for custom data.
  • Utilizes OpenAI's embedding and completion models.
  • Focuses on practical implementation for website Q&A.

Maintenance & Community

This is an example repository from OpenAI, likely maintained as part of their documentation and examples. No specific community channels or roadmap are indicated.

Licensing & Compatibility

The repository itself is not explicitly licensed in the provided README snippet. OpenAI's general policy for example code is often permissive, but users should verify the license within the repository for commercial use or closed-source linking.

Limitations & Caveats

The example is a basic demonstration and may require significant adaptation for production use, including robust error handling, scalable data storage, and more sophisticated chunking/retrieval strategies. The specific vector database used is not detailed in the README.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 90 days

Explore Similar Projects

Starred by Jared Palmer Jared Palmer(Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX).

chatgpt-pgvector by gannonh

0%
938
Domain-specific chat completions app
created 2 years ago
updated 2 years ago
Starred by Peter Norvig Peter Norvig(Author of Artificial Intelligence: A Modern Approach; Research Director at Google).

python-openai-demos by pamelafox

0%
374
Python scripts for OpenAI API demos
created 1 year ago
updated 1 week ago
Feedback? Help us improve.