modal_finetune_sql by run-llama

Walkthrough for fine-tuning LLaMa 2 7B on Text-to-SQL datasets

Created 2 years ago

321 stars

Top 84.7% on SourcePulse

View on GitHub

2 Experts Love This Project

Akshat Bubna

Cofounder of Modal

Jerry Liu

Cofounder of LlamaIndex

Project Summary

This repository provides a walkthrough for fine-tuning LLaMa 2 7B on a Text-to-SQL dataset and performing inference against databases using LlamaIndex. It is targeted at developers and researchers looking to build custom Text-to-SQL applications.

How It Works

The project leverages LlamaIndex for database interaction and Hugging Face's datasets and peft libraries for efficient fine-tuning of LLaMa 2. The fine-tuning process is designed to be modular and runnable via Modal, a cloud-native development framework, simplifying distributed training and deployment.

Quick Start & Requirements

Install via git clone https://github.com/run-llama/modal_finetune_sql.git and cd modal_finetune_sql.
Run the tutorial.ipynb notebook or individual steps:
- modal run src.load_data_sql
- modal run --detach src.finetune_sql
- modal run src.inference_sql_llamaindex::main --query "Which city has the highest population?" --sqlite-file-path "nbs/cities.db"
Requires Modal CLI, Hugging Face libraries, and potentially GPU resources for fine-tuning.

Highlighted Details

Demonstrates fine-tuning LLaMa 2 7B for Text-to-SQL tasks.
Integrates LlamaIndex for querying structured data.
Utilizes Modal for simplified distributed execution.
Adapts code from the Modal doppel-bot repository.

Maintenance & Community

The project is part of the run-llama organization, suggesting active development in the Llama ecosystem. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The repository's licensing is not specified in the README. Compatibility for commercial use or closed-source linking would require clarification.

Limitations & Caveats

The README indicates that the code is adapted from another repository and provides a walkthrough, suggesting it may be experimental or a proof-of-concept. Specific performance benchmarks or production-readiness claims are not present.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

1 stars in the last 30 days