Artificial Intelligence thread

horse

Brigadier
Registered Member
The only reason Anthropic likes to talk about distillation is because data and brute force compute is the only advantage they have

Yeah, exactly. You can only do so much with brute force at this stage of the game. Non-computer people will not understand that at all.
 

9dashline

Major
Registered Member
Okay, I make a counter point to that, although you could be right.

1. Software sometimes can be localized (for whatever reasons). People use email in the West. They use WeChat in China. So I think American AI will never disappear. What form it has as a commercial entity has not been finalized.

2. What clearly has changed, is that the Americans have to compete in the open source AI space to maintain relevance. The US government will always buy American, but that does not go for corporate America.

3. American AI will find a way to stick around. By that I mean the current AI leaders in America, will change tack, and all become hyperscalers. That is only logical, and is what Jensen Huang said a few months ago. AI is booming, and we need more compute. But, data centers are a big capital expense. So who, which companies, has that kind of money for capital investment?

4. Once DeepSeek came out, the entire game changed. All this talk about the frontier and AGI, that is the smokescreen. They are lying to the public, and more important they are lying to themselves to protect their fragile egos, like a gimp.

:confused:


Please, Log in or Register to view URLs content!

Reports are just today, Z.ai finished trained a model that meets or exceeds Fable/Mythos on cybersec.....

I am not sure if it will be opensourced but basically chatgpt literally also announced a cyber version of their 5.6 just a day ago

It probably wont even take China but a few more months to get to Mythos tier, broadly

Also Deepseek released a spec today that makes inferencing even more efficent up to 500%


Please, Log in or Register to view URLs content!
Writing is on the wall
 

tphuang

General
Staff member
Super Moderator
VIP Professional
Registered Member
Its about cost benefit ratio. If im paying $200 a month for subscription but costing them way more to serve because im maxing out the qouta, then its a win win for me. In the long arc of things China is going to win the AI race anyway, its structural, and there is nothing Dario or the USA can do about it... what I do or dont know doesnt make a dent of difference in the grand scheme of history. In the meantimes its the VC suckers and 401k bagholders thats subsidizing my AI use
that's nonsense. you are giving free data to Anthropic to make their model better.

All this is bullshit. Don't try to make it sound like you support China's AI effort, because you are doing none of that.

If China's AI is good enough, then you should use it. Otherwise, there is a gap and you are helping to make it wider.

Well to be fair, lots of mainland engineers also use Claude. It's quite popular. I wouldn't regard it as some kind of indictment of disloyalty.
if we are operating under the assumption that Chinese models are a lot worse, then okay. But if the Chinese models are good enough (which they are), then these people are clearly helping the other side. Of course, they probably don't think about it that way. But now that US govt is shutting off access to them, good riddance.

Okay, I make a counter point to that, although you could be right.

1. Software sometimes can be localized (for whatever reasons). People use email in the West. They use WeChat in China. So I think American AI will never disappear. What form it has as a commercial entity has not been finalized.

2. What clearly has changed, is that the Americans have to compete in the open source AI space to maintain relevance. The US government will always buy American, but that does not go for corporate America.

3. American AI will find a way to stick around. By that I mean the current AI leaders in America, will change tack, and all become hyperscalers. That is only logical, and is what Jensen Huang said a few months ago. AI is booming, and we need more compute. But, data centers are a big capital expense. So who, which companies, has that kind of money for capital investment?

4. Once DeepSeek came out, the entire game changed. All this talk about the frontier and AGI, that is the smokescreen. They are lying to the public, and more important they are lying to themselves to protect their fragile egos, like a gimp.

:confused:
DeepSeek really hasn't changed the game.

Please, Log in or Register to view URLs content!

Reports are just today, Z.ai finished trained a model that meets or exceeds Fable/Mythos on cybersec.....

I am not sure if it will be opensourced but basically chatgpt literally also announced a cyber version of their 5.6 just a day ago

It probably wont even take China but a few more months to get to Mythos tier, broadly

Also Deepseek released a spec today that makes inferencing even more efficent up to 500%


Please, Log in or Register to view URLs content!
Writing is on the wall
again, if Zai is actually good enough (and this article is talking about GLM-5.2), then you should use it. Stop telling other people to use it if you are not willing to use it.

I actually used it so much this week, that I hit the limit on my coding plan, so I'm back to Kimi 2.7.

Until you are willing to put your coding and money where your mouth is, keep quiet about how good the Chinese models are. You don't know because you don't use them.

I've used DeepSeek V4 and it's really not up to par for coding. Minimax, I tried that too and it's bad. Chinese models really only got good at coding with GLM-5.1 and Kimi 2.6.
 

Michael90

Senior Member
Registered Member
that's nonsense. you are giving free data to Anthropic to make their model better.

All this is bullshit. Don't try to make it sound like you support China's AI effort, because you are doing none of that.

If China's AI is good enough, then you should use it. Otherwise, there is a gap and you are helping to make it wider.


if we are operating under the assumption that Chinese models are a lot worse, then okay. But if the Chinese models are good enough (which they are), then these people are clearly helping the other side. Of course, they probably don't think about it that way. But now that US govt is shutting off access to them, good riddance.


DeepSeek really hasn't changed the game.


again, if Zai is actually good enough (and this article is talking about GLM-5.2), then you should use it. Stop telling other people to use it if you are not willing to use it.

I actually used it so much this week, that I hit the limit on my coding plan, so I'm back to Kimi 2.7.

Until you are willing to put your coding and money where your mouth is, keep quiet about how good the Chinese models are. You don't know because you don't use them.

I've used DeepSeek V4 and it's really not up to par for coding. Minimax, I tried that too and it's bad. Chinese models really only got good at coding with GLM-5.1 and Kimi 2.6.
On this one I agree with you. Some people will claim to support something but when it comes to putting the money where their mouth is, they make different choices . When even foreigners are increasingly using Chinese open source AI models since they are good enough for the tasks they want, I don’t think he can keep using that excuse anymore . lol. I think most people who use Claude just get hooked on it. Lol
 

tokenanalyst

Lieutenant General
Registered Member
On this one I agree with you. Some people will claim to support something but when it comes to putting the money where their mouth is, they make different choices . When even foreigners are increasingly using Chinese open source AI models since they are good enough for the tasks they want, I don’t think he can keep using that excuse anymore . lol. I think most people who use Claude just get hooked on it. Lol
Until they see the bill and the "hook" become an horror movie.
Smart people know how to combine different models for different workloads and that most companies are learned to do, the hard way. Instead of paying their entire montly staff salary each week on tokens you switch to DeepSeek for like 80% of your tasks.
 
Top