mistral-common by mistralai

Inference library for Mistral models preprocessing

Created 1 year ago

844 stars

Top 42.3% on SourcePulse

8 Experts Love This Project

tobi

Cofounder of Shopify

lysandrejik

Chief Open-Source Officer at Hugging Face

theophilegervet

Théophile Gervet

Cofounder of Genesis AI

patrickvonplaten

Patrick von Platen

Author of Hugging Face Diffusers; Research Engineer at Mistral

and 4 more!

Project Summary

This library provides the official inference tools for Mistral AI models, focusing on advanced tokenization for structured conversations and tool parsing. It's designed for developers and researchers working with Mistral's diverse model ecosystem, offering efficient pre-processing and validation capabilities.

How It Works

The library implements custom tokenizers (v1, v2, v3) that go beyond standard text-to-token conversion. They are specifically designed to parse and handle structured data, including tool calls and conversational formats, which is crucial for instruction-following and function-calling models. This approach allows for more robust and accurate interaction with Mistral's models.

Quick Start & Requirements

Install via pip: pip install mistral-common
Alternatively, install from source using Poetry: poetry install
Requires Python.

Highlighted Details

Supports tokenization for various Mistral models including Mistral 7B, Mixtral 8x7B, Mixtral 8x22B, Codestral, and Mathstral.
Includes validation and normalization code used in the Mistral API.
Provides examples for tokenizing chat completion requests with tool definitions.

Maintenance & Community

Official library from Mistral AI.

Licensing & Compatibility

License not specified in the README.

Limitations & Caveats

The specific license for this repository is not detailed in the README, which may impact commercial use or integration into closed-source projects.

Health Check

Last Commit

6 days ago

Responsiveness

1 day

Pull Requests (30d)

7

Issues (30d)

4

Star History

19 stars in the last 30 days

Explore Similar Projects

ClipboardConqueror by aseichter2007

AI copilot for copy-paste workflows, accessible in any text field

Created 2 years ago

Updated 1 year ago

OrionStar-Yi-34B-Chat by OrionStarAI

Chat model for conversational tasks in both Chinese and English

Created 2 years ago

Updated 1 year ago

ChatGPT.el by joshcho

Emacs package for interacting with ChatGPT

Created 3 years ago

Updated 2 years ago

whatsapp-ai-clone by kinggongzilla

AI chatbot for personalized conversations via WhatsApp data

Created 2 years ago

Updated 1 year ago

cmp-ai by tzachar

AI source for nvim-cmp, enabling remote code completion

Created 2 years ago

Updated 8 months ago

Starred by

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory) and

Gabriel Almeida

Gabriel Almeida(Cofounder of Langflow).

langfun by google

Library for object-oriented LLM prompting

Created 2 years ago

Updated 2 days ago

gpt-cli by kharvd

CLI tool for interacting with chat LLMs (ChatGPT, Claude, Gemini, etc.)

Created 2 years ago

Updated 8 months ago

NLP-Projects-NHV by Vasanthengineer4949

NLP course provides code examples and video walkthroughs

Created 2 years ago

Updated 1 year ago

textbase by cofactoryai

Framework for building AI chatbots

Created 2 years ago

Updated 2 years ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Chaoyu Yang

Chaoyu Yang(Founder of Bento), and

11 more.

mistral-inference by mistralai

Inference library for Mistral models

Created 2 years ago

Updated 1 month ago

Starred by

Sam Partee

Sam Partee(Cofounder of Arcade),

Michael Han

Michael Han(Cofounder of Unsloth), and

15 more.

Qwen by QwenLM

Chat & pretrained LLM by Alibaba Cloud

Created 2 years ago

Updated 1 month ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Alexey Milovidov

Alexey Milovidov(Cofounder of Clickhouse), and

16 more.

LibreChat by danny-avila

Enhanced ChatGPT clone for self-hosting

Created 2 years ago

Updated 18 hours ago

Feedback? Help us improve.