GLM Coding Pro users can already use GLM-5, while Lite users will have to wait a while.
What's particularly intriguing is the official announcement's mention of "strong support from domestic chip partners" for achieving the computing power expansion. I don't know if this is just political correctness or if they genuinely use domestically produced chips for inference.
If this is true, then at least the model of using NVIDIA to train SOTA models and domestically produced chips for inference to cope with the surge in traffic has once again proven to be feasible.
I hope Kimi can learn from this lesson. Users literally putting money in your pocket and you can't even take it, how is that acceptable?