Kimi 2.6 is now being used for Notion. Another huge win for kimi.
it is also used on Huawei Cloud on day 1

I'm very skeptical of the Mythos hype. I think there's a very good reason they're not releasing it, and it has little to do with their concern over "cyber-security".There have been many false dawns but is V4 finally on the horizon? 1.6T is pretty decent, but you really need 10T to compete with Mythos-tier models. I guess that's for next year or the year after that.
Kimi has been going from strength to strength so it's time to finally see if DeepSeek can take the crown of "best AI startup in China" back.
its 10 trillion in size, twice as expensive as opus... not unlike back when openai was scaling test time compute on o1/o2/o3 models and spenting $1000 per task (for work that cost a human $5) and saying it was high on AGI benchmarksI'm very skeptical of the Mythos hype. I think there's a very good reason they're not releasing it, and it has little to do with their concern over "cyber-security".
I don't believe it's just cost and compute resources required. Though yes, I think that's the #1 problem for Anthropic. The compute resources and costs required to deploy this are undoubtedly enormous.its 10 trillion in size, twice as expensive as opus... not unlike back when openai was scaling test time compute on o1/o2/o3 models and spenting $1000 per task (for work that cost a human $5) and saying it was high on AGI benchmarks
I'm very skeptical of the Mythos hype. I think there's a very good reason they're not releasing it, and it has little to do with their concern over "cyber-security".
Elite security researchers find bugs that fuzzers can’t largely by reasoning through the source code. This is effective, but time-consuming and bottlenecked on scarce human expertise. Computers were completely incapable of doing this a few months ago, and now they excel at it. We have many years of experience picking apart the work of the world’s best security researchers, and Mythos Preview is every bit as capable. So far we’ve found no category or complexity of vulnerability that humans can find that this model can’t.
I think there is a psychosis spreading with this technology mostly propup by the hype of US AI CEOs. LLMs are mostly databases with probabilistic outputs. They don't have nuance or conscience of the things they are outputting and that is dangerous because these databases output content faster than humans with nuance can review.The team think it is real and as capable as "elite (human) security researchers":
Though Mozilla didn't disclose how many false positives they had. Hopefully they will have followup posts in the future.
China needs to steal the weights to Mythos,but Anthropic is guarding it more securely than nuclear codes, way more than they guarded the Claude Code source code...The team think it is real and as capable as "elite (human) security researchers":
Though Mozilla didn't disclose how many false positives they had. Hopefully they will have followup posts in the future.