Dongdongniu API

中文 EN

Login

Home
Simple Tutorial
Developer Docs
VS Code: Cline
Models
Benchmark
News

News
News
Mistral Small 4 啟用 Prefix Caching 加速

News

Mistral Small 4 啟用 Prefix Caching 加速

YUI | 2026-06-05 23:19

Mistral Small 4 (119B) 已啟用 Prefix Caching，重複前綴的請求將自動命中 KV Cache 快取，顯著降低延遲與計算開銷。模型自 2026 年 3 月起即在平台提供服務，此次為效能優化更新。模型規格：參數量 119B（MoE 架構）、量化 NVFP4、上下文 262K tokens、支援 Function Calling、推理模式、Prefix Caching。API 名稱：mistral-small-4

Other News

開源模型新時代：巴西里約市府釋出 Rio-3.5-Open-397B — 從 Finetune 看模型國籍與台灣機會

2026-06-14

每日服務摘要 2026-06-14

2026-06-14

Get Started

Get your API key now and start using the LLM service.

Get API Key