Minimax M3 seems to be rolling out on the API
Minimax M3 is rolling out on the API. The model was spotted approximately 15 minutes ago in a screenshot.
Kimi is a conversational AI assistant built by Moonshot AI, designed to handle very long text contexts in a single prompt. For instance, Kimi 1.5 supports context windows exceeding 128,000 tokens for document analysis.
AI search agents like GPT-5.4 and Kimi K2.6 mostly confirm their training knowledge rather than genuinely researching the web. Researchers at Harbin Institute of Technology demonstrated this using LiveBrowseComp, a benchmark based on events from the last 90 days. When the agents cannot fall back on training memory, their performance collapses.
A developer with a budget of roughly $100-150k is looking for the best infrastructure to self-host Kimi K2.6 and DeepSeek V4 for a five-person team doing agentic coding. The post compares a dual GH200 NVL2 setup (1.2 TB unified memory, $95k) with 8x RTX 6000 Blackwell (768 GB VRAM, $140k). A test on a single GH200 reached 23 tok/s decode at 2-bit quantization, but prefill was slow and the models overflowed into slower unified memory.
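The two quoted configurations can be compared on cost per gigabyte of memory with a few lines of arithmetic. This is a minimal sketch that uses only the prices and capacities from the post; treating 1.2 TB as 1200 GB is an assumption, and the names `OPTIONS` and `usd_per_gb` are made up for illustration.

```python
# Prices and capacities as quoted in the post.
# Assumption: 1.2 TB is treated as 1200 GB (decimal units).
OPTIONS = {
    "dual GH200 NVL2": {"memory_gb": 1200, "price_usd": 95_000},
    "8x RTX 6000 Blackwell": {"memory_gb": 768, "price_usd": 140_000},
}


def usd_per_gb(memory_gb: float, price_usd: float) -> float:
    """Hardware cost per gigabyte of memory available to the model."""
    return price_usd / memory_gb


if __name__ == "__main__":
    for name, spec in OPTIONS.items():
        print(f"{name}: ${usd_per_gb(**spec):,.0f} per GB")
```

Cost per gigabyte only captures capacity. The post itself notes that the GH200's unified memory is slower than its GPU memory, so the cheaper option per gigabyte is not necessarily the faster one for prefill or decode.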
Hy-MT2 is a family of multilingual translation models (1.8B, 7B, 30B-MoE) supporting 33 languages. The 1.8B model quantized at 1.25-bit weighs 440 MB and improves inference speed by 1.5x. The 7B and 30B models outperform DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode; the 1.8B surpasses commercial APIs from Microsoft and Doubao.
Tencent releases Hy-MT2, a multilingual translation model family in three sizes (1.8B, 7B, 30B-MoE) supporting 33 languages. The 1.8B model, compressed to 440 MB via 1.25-bit quantization, outperforms commercial APIs from Microsoft and Doubao. The 7B and 30B variants exceed DeepSeek-V4-Pro and Kimi K2.6 performance. The release also includes the IFMTBench benchmark and a WMT26 partnership.
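As a rough sanity check on the 440 MB figure, the size of the weights alone can be estimated from the parameter count and the bits per weight. The sketch below assumes the model has exactly 1.8e9 parameters and uses decimal megabytes; `raw_weight_mb` is a hypothetical helper, and the summary does not say how the 440 MB is composed.

```python
def raw_weight_mb(num_params: float, bits_per_weight: float) -> float:
    """Size of the weights alone in decimal megabytes (1 MB = 10**6 bytes)."""
    return num_params * bits_per_weight / 8 / 1e6


# Assumption: "1.8B" means exactly 1.8e9 parameters, all stored at 1.25 bits.
raw = raw_weight_mb(1.8e9, 1.25)
reported = 440.0  # MB, as quoted in the summary

if __name__ == "__main__":
    print(f"raw 1.25-bit weights: {raw:.2f} MB")
    print(f"reported file size:   {reported:.0f} MB")
    print(f"unexplained by raw weights: {reported - raw:.2f} MB")
```

Under these assumptions the raw weights come to about 281 MB, so roughly 159 MB of the reported size is not explained by 1.25-bit weights alone. One plausible explanation, which is an assumption and not something the summary states, is that some components such as embeddings or quantization scales are stored at higher precision.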
Cursor releases Composer 2.5, a coding model built on Kimi K2.5 and trained on 25x more synthetic tasks than its predecessor. It matches Opus 4.7 and GPT-5.5 benchmark performance at a fraction of the cost.
Busy month with multiple flagship releases: Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1. Nathan Lambert also covers CAISI's V4 assessment of these open-source models.
Kimi K2.6 from Moonshot AI is now available on Vercel AI Gateway. The model excels at long-horizon coding tasks (Rust, Go, Python, front-end, DevOps) and can generate complete interfaces from simple prompts. It is optimized for autonomous agents, with improved stability and safety.
Hugging Face examines the global expansion of Chinese AI models (DeepSeek, Qwen, Kimi). These models are gaining market share in Europe and Southeast Asia through competitive performance and lower costs. Chinese companies are investing heavily in infrastructure and regional partnerships.