Mistral NeMo

By Mistral May 28, 2026

Today, we are excited to release Mistral NeMo, a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.

Mistral NeMo Mistral NeMo is a new 12B AI model developed with NVIDIA, featuring a 128k token context window that surpasses current benchmarks in reasoning, world knowledge, and coding accuracy for its size. It utilizes a new, more efficient tokenizer called Tekken and is designed for global, multilingual applications, showing particular strength in various languages. Released under the Apache 2.0 license, it’s available as a drop-in replacement for Mistral 7B systems and packaged for NVIDIA NIM inference.

Mistral NeMo is a 12B parameter AI model created in collaboration with NVIDIA.
It features a large context window of up to 128k tokens.
The model demonstrates state-of-the-art performance in reasoning, world knowledge, and coding accuracy for its size.
It uses a new, more efficient tokenizer named Tekken, which is based on Tiktoken and trained on over 100 languages.
Mistral NeMo is designed for global, multilingual applications and excels in English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, and Hindi.
Pre-trained base and instruction-tuned checkpoints are released under the Apache 2.0 license.
The model supports FP8 inference without performance loss due to quantisation-aware training.
Mistral NeMo is available on HuggingFace and as an NVIDIA NIM inference microservice. Continue reading https://foxvector.com/articles/319261cf-6879-4254-9b90-da4a8d9c9c37

Reference: https://foxvector.com/articles/319261cf-6879-4254-9b90-da4a8d9c9c37

Write a comment

No comments yet.