awesome-japanese-llm by llm-jp

Japanese LLM list: models, benchmarks, datasets

Created 2 years ago

1,329 stars

Top 30.0% on SourcePulse

View on GitHub

1 Expert Loves This Project

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Project Summary

This repository serves as a comprehensive, community-curated catalog of Japanese Large Language Models (LLMs) and related evaluation benchmarks. It aims to provide researchers, developers, and enthusiasts with an organized overview of the rapidly evolving Japanese LLM landscape, facilitating discovery and comparison of models, datasets, and evaluation methodologies.

How It Works

The project compiles information from various sources, including academic papers, public releases, and community contributions, to create detailed tables and descriptions of Japanese LLMs. It categorizes models by architecture, parameter count, training data, developer, and license, offering a structured approach to understanding the ecosystem. It also lists and describes various Japanese LLM evaluation benchmarks and datasets.

Quick Start & Requirements

This repository is a curated list and does not require installation or execution. Users can browse the README for information on specific models and benchmarks.

Highlighted Details

Extensive catalog of Japanese LLMs, including full-scratch trained models and models fine-tuned on Japanese data.
Detailed breakdown of models by architecture (Llama, GPT-NeoX, Mistral, etc.), parameter size, training corpus, and licensing.
Comprehensive listing of Japanese LLM evaluation benchmarks and datasets, covering various tasks like language understanding, generation, reasoning, and domain-specific performance.
Includes information on multimodal models (Vision-Language Models) and specialized models for tasks like text-to-image and speech recognition.

Maintenance & Community

The project is actively maintained by the llm-jp community, with contributions from numerous researchers and organizations in Japan. Information is updated regularly, and users are encouraged to contribute via GitHub Issues.

Licensing & Compatibility

Licenses vary significantly across the listed models, ranging from permissive MIT and Apache 2.0 to more restrictive non-commercial licenses (e.g., CC BY-NC-SA 4.0) and custom terms of use. Users must carefully check the specific license for each model before use, especially for commercial applications.

Limitations & Caveats

The README explicitly states that the content is not guaranteed to be complete or accurate and may change without notice. Some information may be based on speculation or individual interpretation. Users should verify details independently.

Health Check

Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

9 stars in the last 30 days