sample-apps  by vespa-engine

Vespa sample apps for big data serving engine demos

created 8 years ago
376 stars

Top 76.7% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

This repository provides a comprehensive collection of sample applications for Vespa.ai, a big data serving engine. It targets developers and researchers looking to leverage Vespa for various use cases, including vector search, hybrid search, RAG, and traditional text search, offering practical examples for both self-hosted and cloud deployments.

How It Works

The sample applications demonstrate Vespa's capabilities by showcasing its schema definition language, data feeding mechanisms, and query language. They highlight Vespa's support for advanced features like multi-vector indexing, hybrid search (combining keyword and vector search), and integration with various embedding models (e.g., Cohere, BGE-M3, OpenAI Matryoshka). The examples often utilize Python (Pyvespa) for interaction and data processing, emphasizing efficient data handling and scalable search.

Quick Start & Requirements

  • Deployment: Samples can be deployed to Vespa Cloud or self-hosted. Specific deployment instructions are detailed within each sample's README.md.
  • Dependencies: May include Python, specific libraries (e.g., pyvespa), and potentially GPU/CUDA for advanced AI models. Refer to individual sample READMEs for precise requirements.
  • Resources: Setup time and resource needs vary significantly based on the complexity and data size of the sample.
  • Documentation: Vespa.ai Documentation

Highlighted Details

  • Extensive examples for vector search, including multi-vector indexing and various embedding models.
  • Demonstrations of Retrieval Augmented Generation (RAG) with both external and local LLM integration.
  • Tutorials on traditional text search, hybrid search, and advanced ranking techniques (e.g., ColBERT, cross-encoders).
  • Showcases for e-commerce search, recommendations, and real-time features like search-as-you-type.
  • Examples for using Vespa as an ML inference server with support for model versioning and custom code.

Maintenance & Community

  • Maintained by the Vespa.ai team.
  • Community resources and support channels are available via the main Vespa.ai website.

Licensing & Compatibility

  • Samples are typically provided under a permissive license, but users should verify the license of each individual sample application.
  • Compatible with Vespa.ai, which is designed for integration into various data pipelines and applications.

Limitations & Caveats

Some sample applications may not yet fully adhere to the described standard structure, requiring users to consult individual READMEs for specific instructions and potential variations.

Health Check
Last commit

2 days ago

Responsiveness

1 week

Pull Requests (30d)
17
Issues (30d)
1
Star History
8 stars in the last 90 days

Explore Similar Projects

Starred by Jason Liu Jason Liu(Author of Instructor) and Ross Taylor Ross Taylor(Cofounder of General Reasoning; Creator of Papers with Code).

Search-R1 by PeterGriffinJin

1.3%
3k
RL framework for training LLMs to use search engines
created 5 months ago
updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

1.0%
19k
RAG framework for fast, simple retrieval-augmented generation
created 10 months ago
updated 1 day ago
Feedback? Help us improve.