llama-dl  by shawwn

CLI tool for high-speed LLaMA model download

Created 2 years ago
4,158 stars

Top 11.8% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a high-speed download script for Facebook's LLaMA 65B parameter model, targeting researchers and individual hackers interested in experimenting with large language models. It offers a significantly faster alternative to torrent downloads, enabling quicker access to the model weights.

How It Works

The project utilizes curl to download model weights directly from a mirrored source, bypassing the slower peer-to-peer torrent method. This approach leverages a direct HTTP download, achieving speeds up to 40MB/s, which is approximately 3.6x faster than the original torrent. The script is designed for simplicity, allowing users to download all model variants (7B, 13B, 30B, 65B) with a single command.

Quick Start & Requirements

  • Install/Run: Execute sh llama.sh after downloading the script.
  • Prerequisites: Linux or macOS with bash and wget (macOS requires brew install bash wget).
  • Disk Space: Requires 219GB for all model weights.
  • Docs: GitHub Repository

Highlighted Details

  • Downloads all LLaMA model weights (7B, 13B, 30B, 65B).
  • Achieves download speeds of approximately 40MB/s.
  • Script is version-locked to a specific commit for safety.
  • Includes notes on running LLaMA-65B on a single A100 80GB with 8-bit quantization.

Maintenance & Community

  • The original download link was shut down by Facebook; the script was updated to point to a mirrored source.
  • Author: Shawn Presser (@theshawwn).

Licensing & Compatibility

  • LLaMA model weights are released under a "non-commercial bespoke license."
  • Use for commercial purposes may be restricted and could lead to legal action from Facebook.

Limitations & Caveats

  • The script downloads weights from an unofficial mirror after the original link was disabled.
  • Users are advised to exercise caution when running scripts from the internet.
  • The LLaMA model itself has a non-commercial license, limiting its use in commercial products.
Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Junyang Lin Junyang Lin(Core Maintainer at Alibaba Qwen), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
25 more.

alpaca-lora by tloen

0.0%
19k
LoRA fine-tuning for LLaMA
Created 2 years ago
Updated 1 year ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), John Yang John Yang(Coauthor of SWE-bench, SWE-agent), and
28 more.

stanford_alpaca by tatsu-lab

0.1%
30k
Instruction-following LLaMA model training and data generation
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.