llama-dl  by shawwn

CLI tool for high-speed LLaMA model download

created 2 years ago
4,161 stars

Top 12.0% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides a high-speed download script for Facebook's LLaMA 65B parameter model, targeting researchers and individual hackers interested in experimenting with large language models. It offers a significantly faster alternative to torrent downloads, enabling quicker access to the model weights.

How It Works

The project utilizes curl to download model weights directly from a mirrored source, bypassing the slower peer-to-peer torrent method. This approach leverages a direct HTTP download, achieving speeds up to 40MB/s, which is approximately 3.6x faster than the original torrent. The script is designed for simplicity, allowing users to download all model variants (7B, 13B, 30B, 65B) with a single command.

Quick Start & Requirements

  • Install/Run: Execute sh llama.sh after downloading the script.
  • Prerequisites: Linux or macOS with bash and wget (macOS requires brew install bash wget).
  • Disk Space: Requires 219GB for all model weights.
  • Docs: GitHub Repository

Highlighted Details

  • Downloads all LLaMA model weights (7B, 13B, 30B, 65B).
  • Achieves download speeds of approximately 40MB/s.
  • Script is version-locked to a specific commit for safety.
  • Includes notes on running LLaMA-65B on a single A100 80GB with 8-bit quantization.

Maintenance & Community

  • The original download link was shut down by Facebook; the script was updated to point to a mirrored source.
  • Author: Shawn Presser (@theshawwn).

Licensing & Compatibility

  • LLaMA model weights are released under a "non-commercial bespoke license."
  • Use for commercial purposes may be restricted and could lead to legal action from Facebook.

Limitations & Caveats

  • The script downloads weights from an unofficial mirror after the original link was disabled.
  • Users are advised to exercise caution when running scripts from the internet.
  • The LLaMA model itself has a non-commercial license, limiting its use in commercial products.
Health Check
Last commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
4 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jaret Burkett Jaret Burkett(Founder of Ostris), and
3 more.

dalai by cocktailpeanut

0.0%
13k
Local LLM inference via CLI tool and Node.js API
created 2 years ago
updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Travis Fischer Travis Fischer(Founder of Agentic), and
6 more.

codellama by meta-llama

0.1%
16k
Inference code for CodeLlama models
created 1 year ago
updated 11 months ago
Feedback? Help us improve.