Topic

#Kimi

Kimi is a conversational AI assistant built by Moonshot AI, designed to handle very long text contexts in a single prompt. For instance, Kimi 1.5 supports context windows exceeding 128,000 tokens for document analysis.

15Articles

6Sources

61Avg. signal

Reddit r/LocalLLaMA·Jun 17

i post-trained a model to reliably roll a die

A user post-trained a model to reliably simulate a die roll (each face ~1/6), exposing that frontier LLMs (Claude, GPT, Kimi) consistently answer '4'. Uses this toy problem to explore exploration vs. exploitation in RL and model behavior.

Reinforcement learning Claude GPT

SIG

HYP

Reddit r/LocalLLaMA·Jun 13

Unsloth Kimi-K2.7-Code-GGUF

Unsloth releases a GGUF quantized version of Kimi-K2.7-Code model on Hugging Face. The model is currently being uploaded.

Kimi Code generation Open source

SIG

HYP

The Decoder·Jun 13

Moonshot's open model Kimi K2.7 Code undercuts GPT-5.5 and Claude by up to 12x on price per token

Moonshot AI releases Kimi K2.7 Code, an open-weights model with one trillion parameters for programming. Underperforms GPT-5.5 and Claude Opus 4.8 on coding benchmarks but costs up to 12x less per token, offering better value for constrained budgets.

Kimi Code generation Open source

SIG

HYP

Reddit r/LocalLLaMA·Jun 12

moonshotai/Kimi-K2.7-Code · Hugging Face

Kimi K2.7-Code, an agentic coding-focused model, improves long-horizon and complex software engineering tasks. Reduces thinking-token usage by ~30% versus K2.6.

Kimi AI Agents Code generation

SIG

HYP

Vercel AI Blog·Jun 12

Kimi K2.7 Code now available on AI Gateway

Kimi K2.7 Code from Moonshot AI is now available on Vercel AI Gateway. This coding model supports long-horizon programming tasks (frontend, DevOps, optimization) with native multimodal text+vision architecture and thinking mode.

Kimi Code generation Vision

SIG

HYP

The Decoder·Jun 8

Moonshot AI targets a $30 billion valuation, more than six times its late-2025 worth

Moonshot AI, the Chinese company behind the Kimi chatbot, targets a $30 billion valuation in a new funding round, more than six times its late-2025 worth.

Kimi Funding Business

SIG

HYP

Reddit r/LocalLLaMA·Jun 1

Minimax M3 seems to be rolling out on the API

Minimax M3 is rolling out on the API. The model was spotted approximately 15 minutes ago in a screenshot.

Kimi

SIG

HYP

The Decoder·May 31

AI search agents often confirm what they already know instead of actually researching the web

AI search agents like GPT-5.4 and Kimi K2.6 mostly confirm their training knowledge rather than genuinely researching the web. Researchers at Harbin Institute of Technology demonstrated this using LiveBrowseComp, a benchmark based on events from the last 90 days. Without relying on training memory, performance collapses.

Benchmarks AI Agents GPT

SIG

HYP

Reddit r/LocalLLaMA·May 28

GH200 NVL2 or 8x RTX 6000 Blackwell for running Kimi K2.6 / DeepSeek V4 locally? (5 devs, agentic coding)

Developer seeking optimal infrastructure (~$100-150k) to self-host Kimi K2.6 and DeepSeek V4 locally for 5-person team (agentic coding). Compares dual GH200 NVL2 (1.2TB unified memory, $95k) vs 8x RTX 6000 Blackwell (768GB VRAM, $140k). Single GH200 test: 23 tok/s decode at 2-bit quant, but slow prefill and models overflow into slower unified memory.

DeepSeek Kimi AI Agents

SIG

HYP

arXiv cs.CL·May 22

Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild

Hy-MT2 is a family of multilingual translation models (1.8B, 7B, 30B-MoE) supporting 33 languages. The 1.8B model quantized at 1.25-bit weighs 440 MB and improves inference speed by 1.5x. The 7B and 30B models outperform DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode; the 1.8B surpasses commercial APIs from Microsoft and Doubao.

Benchmarks Code generation DeepSeek

SIG

HYP

Reddit r/LocalLLaMA·May 21

Tencent Hy 30B/7B/1.8B

Tencent releases Hy-MT2, a multilingual translation model family in three sizes (1.8B, 7B, 30B-MoE) supporting 33 languages. The 1.8B model compressed to 440 MB via 1.25-bit quantization outperforms commercial APIs from Microsoft and Doubao. The 7B and 30B variants exceed DeepSeek-V4-Pro and Kimi K2.6 performance. Includes IFMTBench benchmark and WMT26 partnership.

Code generation Benchmarks Open source

SIG

HYP

The Decoder·May 18

Cursor's Composer 2.5 matches Opus 4.7 and GPT-5.5 benchmarks at a fraction of the cost

Cursor releases Composer 2.5, a coding model built on Kimi K2.5 and trained on 25x more synthetic tasks than its predecessor. It matches Opus 4.7 and GPT-5.5 benchmark performance at a fraction of the cost.

Code generation Benchmarks Kimi

SIG

HYP

Interconnects (Nathan Lambert)·May 16

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment.

Busy month with multiple flagship releases: Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1. Nathan Lambert also covers CAISI's V4 assessment of these open-source models.

Gemini DeepSeek Kimi

SIG

HYP

Vercel AI Blog·Apr 20

Kimi K2.6 on AI Gateway

Kimi K2.6 from Moonshot AI is now available on Vercel AI Gateway. The model excels at long-horizon coding tasks (Rust, Go, Python, front-end, DevOps) and can generate complete interfaces from simple prompts. Optimized for autonomous agents with improved stability and safety.

Kimi Code generation AI Agents

SIG

HYP

Hugging Face Blog·Oct 3

A Short Summary of Chinese AI Global Expansion

Hugging Face examines the global expansion of Chinese AI models (Deepseek, Qwen, Kimi). These models gain market share in Europe and Southeast Asia through competitive performance and lower costs. Chinese companies invest heavily in infrastructure and regional partnerships.

DeepSeek Qwen Kimi

SIG

HYP