JSON descriptors for Large Language Model (LLM) files
Top 45.0% on sourcepulse
This repository provides a standardized JSON format for describing Large Language Model (LLM) files, enabling consistent metadata across different models. It targets developers and researchers working with LLMs, offering a structured way to manage and discover model details like size, architecture, and file formats.
How It Works
The project defines a JSON schema for individual model descriptors, capturing essential information such as model size (e.g., 7B, 13B), architecture (e.g., Llama, MPT), and file formats including quantization details (e.g., ggml, q4_K_M). A GitHub Action automatically merges these individual JSON files into a central catalog.json
, with another action ensuring schema validation for all contributions.
Quick Start & Requirements
models/
directory, and validate it using the provided validate.py
script or createCatalog.py
.Highlighted Details
Maintenance & Community
CONTRIBUTING.md
.Licensing & Compatibility
Limitations & Caveats
The README does not specify the license, which may impact commercial use or integration into closed-source projects.
1 year ago
1 day