Tiled Hacker news on React Router

Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model

348 points - 07/11/2025

GitHub: https://github.com/MoonshotAI/Kimi-K2

Source

vessenes
07/12/2025
I tried Kimi on a few coding problems that Claude was spinning on. It’s good. It’s huge, way too big to be a “local” model — I think you need something like 16 H200s to run it - but it has a slightly different vibe than some of the other models. I liked it. It would definitely be useful in ensemble use cases at the very least.
simonw
07/11/2025
Pelican on a bicycle result: https://simonwillison.net/2025/Jul/11/kimi-k2/
ozgune
07/12/2025
This is a very impressive general purpose LLM (GPT 4o, DeepSeek-V3 family). It’s also open source.
I think it hasn’t received much attention because the frontier shifted to reasoning and multi-modal AI models. In accuracy benchmarks, all the top models are reasoning ones:
https://artificialanalysis.ai/
If someone took Kimi k2 and trained a reasoning model with it, I’d be curious how that model performs.
exegeist
07/12/2025
Technical strengths aside, I’ve been impressed with how non-robotic Kimi K2 is. Its personality is closer to Anthropic’s best: pleasant, sharp, and eloquent. A small victory over botslop prose.
simonw
07/11/2025
Big release - https://huggingface.co/moonshotai/Kimi-K2-Instruct model weights are 958.52 GB
wiradikusuma
07/11/2025
I've only started using Claude, Gemini, etc in the last few months (I guess it comes with age, I'm no longer interested in trying the latest "tech"). I assume those are "non-agentic" models.
From reading articles online, "agentic" means like you have a "virtual" Virtual Assistant with "hands" that can google, open apps, etc, on their own.
Why not use existing "non-agentic" model and "orchestrate" them using LangChain, MCP etc? Why create a new breed of model?
I'm sorry if my questions sound silly. Following AI world is like following JavaScript world.
fzysingularity
07/12/2025
If I had to guess, the OpenAI open-source model got delayed because Kimi K2 stole their thunder and beat their numbers.
emacdona
07/12/2025
To me, K2 is a mountain and SOTA is “summits on the air”. I saw that headline and thought “holy crap” :-)
jug
07/12/2025
I like new, solid non-reasoning models that push the frontier. These still have nice use cases (basically anything where logic puzzles or STEM subjects don't apply) where you don't want to spend cash on reasoning tokens.
aliljet
07/11/2025
If the SWE Bench results are to be believed... this looks best in class right now for a local LLM. To be fair, show me the guy who is running this locally...
satvikpendem
07/12/2025
This is not open source, they have a "modified MIT license" where they have other restrictions on users over a certain threshold.
```
    Our only modification part is that, if the Software (or any derivative works
    thereof) is used for any of your commercial products or services that have
    more than 100 million monthly active users, or more than 20 million US dollars
    (or equivalent in other currencies) in monthly revenue, you shall prominently
    display "Kimi K2" on the user interface of such product or service.
```
data_maan
07/12/2025
"Open source" lol
Open-weight. As usual, you don't get the dataset, training scripts, etc.
MaxPock
07/11/2025
Would be hilarious if Zuck with his billion dollar poaching failed to beat budget Chinese models.
ksec
07/13/2025
Kimi K2 is the large language model series developed by Moonshot AI team.
Moonshot AI [1] (Moonshot; Chinese: 月之暗面; pinyin: Yuè Zhī Ànmiàn) is an artificial intelligence (AI) company based in Beijing, China. As of 2024, it has been dubbed one of China's "AI Tiger" companies by investors with its focus on developing large language models.
I guess everyone is up to date with AI stuff but this is the first time I heard of Kimi and Moonshot and was wondering where it is from. And it wasn't obvious from a quick glance of comments.
[1] https://en.wikipedia.org/wiki/Moonshot_AI
cyanf
07/11/2025
This is both the largest oss model release thus far, and the largest Muon training run.
fzysingularity
07/12/2025
If I had to guess, the OpenAI open-source model got delayed because Kimi K2 stole their thunder and beat their numbers.
pxc
07/12/2025
So far, I like the answer quality and its voice (a bit less obsequious than either ChatGPT or DeepSeek, more direct), but it seems to badly mangle the format of its answers more often than I've seen with SOTA models (I'd include DeepSeek in that category, or close enough).
awestroke
07/12/2025
This is the model release that made Sam Altman go "Oh wait actually we can't release the new open source model this week, sorry. Something something security concerns".
Perhaps their open source model release doesn't look so good compared to this one
sagarpatil
07/13/2025
All the AI models are no using em-dashes. ChatGPT keeps using them even after explicitly told not to. Anybody know what’s up with these models?
gs17
07/11/2025
> 1T total / 32B active MoE model
Is this the largest open-weight model?
viraptor
07/12/2025
How well separated are experts per domain in a model like that? Specifically, if I'm interested in a programming use only, could we possibly strip it to one or two of them? Or should I assume a much wider spread? (And there would be some overlap anyway from the original root model)
mring33621
07/14/2025
I chatted with this model about stress testing Hazelcast and comparing/contrasting Java Virtual Threads, Goroutines and Kotlin's Coroutines. I really liked its responses. They were concise and useful.
Alifatisk
07/12/2025
Quite impressive benchmark, how come I don't see Kimi in Artificial analysis benchmarks?
LuminaWang7
07/18/2025
kimi K2 really excels at autonomous tool use, complex reasoning, and multi-step task execution.
I developed an intelligent vector database agent using Kimi K2 and Milvus, which enhances document interaction via natural language commands.
RandyOrion
07/13/2025
This is an open weight model, which is in contrast with closed-source models.
However, 1t parameters makes it nearly impossible for local inference, let alone fine-tuning.
bhouston
07/12/2025
Impressive benchmarks!
lvl155
07/13/2025
I love the fact that I can use this right away and test it out in practice. The ecosystem around LLM is simply awesome and improving by the day.
Havoc
07/14/2025
Glad it’s non-reasoning.
Often a faster answer is more useful to me for quick research. Reasoning has its place but don’t think that place is always
Imustaskforhelp
07/11/2025
I really really want to try this model for free since I just don't have a gpu.
Is there any way that I could do so?
Open Router? Or does kimi have their own website? Just curious to really try it out!
jacooper
07/12/2025
The problem with Chinese models is finding decent hosting. The best you can find right now for kimi k2 is only 30 tps, not great.
data_maan
07/12/2025
Open source" lol
It's open-weight. As usual, you don't get the dataset, training scripts, etc.
helloericsf
07/11/2025
How does it stack up against the new Grok 4 model?
07/12/2025
MichaelKSpencer
07/13/2025
[dead]
MichaelKSpencer
07/12/2025
[dead]
38
07/12/2025
The web chat has extremely low limits FYI. I ran into the limit twice before getting a sane answer and gave up
unit149
07/12/2025
[dead]
38
07/12/2025
The web chat has extremely low limits FYI. I ran into the limit twice before getting a sane answer and gave up
mistressgabby
07/11/2025
[flagged]
brcmthrowaway
07/12/2025
Is Kimi the new deep seek?

Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model

vessenes

summarity

wongarsu

winter_blue

terhechte

satvikpendem

refulgentis

segmondy

fzzzy

overfeed

numpad0

londons_explore

numpad0

refulgentis

SV_BubbleTime

gpm

chithanh

kachapopopow

neuroelectron

spaceman_2020

tuananh

handzhiev

airstrike

nathan_compton

moffkalast

Xmd5a

simonw

cosmojg

pabs3

simonw

ebiester

simonw

qmmmur

sergiotapia

GenerWork

CaptainFever

Lennie

sergiotapia

1vuio0pswjnm7

neoromantique

csomar

jug

_alex_

ozgune

GaggiX

Alifatisk

the_precipitate

exegeist

orbital-decay

simonw

c4pt0r

scottyeager

martin_

maven29

JustFinishedBSG

selfhoster11

refulgentis

selfhoster11

refulgentis

jimjimwii

apitman

selfhoster11

homarp

hereme888

maven29

zackangelo

t1amat

selfhoster11

CamperBob2

selfhoster11

apitman

selfhoster11

kkzz99

wiradikusuma

dcre

simonw

ozten

apitman

selfhoster11