Hy-MT2  by Tencent-Hunyuan

Fast, efficient multilingual translation models for real-world scenarios

Created 1 week ago

New!

352 stars

Top 79.0% on SourcePulse

GitHubView on GitHub
Project Summary

<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> Hy-MT2 is a family of "fast-thinking" multilingual translation models designed for complex, real-world scenarios. It targets developers and researchers needing efficient, high-performance translation across 33 languages, offering models from 1.8B to 30B parameters with significant on-device optimization and instruction-following capabilities, aiming to outperform existing open-source and commercial solutions.

How It Works

This family includes 1.8B, 7B, and 30B-A3B (MoE) models. A key feature is AngelSlim, a 1.25-bit extreme quantization that reduces the 1.8B model to 440MB and boosts inference speed by 1.5x for on-device deployment. The models are engineered for complex real-world tasks and robust instruction following, with performance claims benchmarked against leading alternatives.

Quick Start & Requirements

Highlighted Details

  • 7B and 30B-A3B models outperform DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode.
  • 1.8B model surpasses mainstream commercial APIs (Microsoft, Doubao) overall.
  • AngelSlim 1.25-bit quantization enables 440MB footprint and 1.5x speedup for the 1.8B model.
  • Introduces IFMTBench for evaluating translation instruction-following.
  • Partnered with WMT26 for "Video Subtitle Translation Task" and "General Machine Translation Task."

Maintenance & Community

  • Contact: hunyuan_opensource@tencent.com.
  • Platforms: Models available on Hugging Face and ModelScope.
  • Activity: Recent releases in May 2026 (Hy-MT2), Dec 2025 (HY-MT1.5), Sep 2025 (Hunyuan-MT).

Licensing & Compatibility

  • License type: Not explicitly stated in the README, requiring clarification for adoption.
  • Compatibility notes: No specific notes on commercial use or closed-source linking.

Limitations & Caveats

  • Models lack a default system prompt, necessitating careful prompt engineering.
  • The absence of a clear license is a significant adoption blocker.
  • Detailed performance validation requires consulting the referenced report.
Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
2
Star History
353 stars in the last 9 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Simon Willison Simon Willison(Coauthor of Django), and
10 more.

Yi by 01-ai

0%
8k
Open-source bilingual LLMs trained from scratch
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.