Artificial Intelligence thread

bsdnf

Senior Member
Registered Member
Anthropic's blocking of Chinese IP and frequent ban of users has led to an even worse situation. Chinese users are forced to use various router providers to relay their models, and some unscrupulous routers sell their instance exchange to data companies, which then repackage it into training sets and sell them to LLM companies, without actually having need to creat so many accounts.

This is why OpenAI and Anthropic are so actively advocating for the banning of services like OpenRouter; they want to control the source and destination of all data.
 

tokenanalyst

Lieutenant General
Registered Member


so this is the weird part about the accusations from Anthropic. If Chinese AI labs are distilling Opus to build their model, which I think is entirely possible for Minimax, what's the justification for Anthropic's valuation?
If you want to go the valuation route then none of them is actually justifiable. It is an AI bubble for a reason.
Sam Altman is annoying but I just can't stand Dario Amodei.
These two clowns hype, deify, anthropomorphize their AI chatbots to no end, more than Jensen Huang, hoping that investors will keep piling money in the bubble, you don't see Google doing the same thing, neither NONE of the Chinese labs.

If the only thing that takes for a AI model in one lab to catch up is the output from the AI model in another lab, then these models are NOT Intelligent or special, they are just probabilistic word regurgitating machines. They are useful tools for some. Slop machines for others.

That won't be a problem if Sam and Dario weren't hyping their chatbots so much and their valuation be more grounded in reality.
 

tokenanalyst

Lieutenant General
Registered Member
IMO A real human like Artificial General Intelligence won't need a lot of data to achieve results because would be able to fill the gaps properly, would be capable of telling you that doesn't understand something and go to the web-whatever and learn it. A real AGI won't need a 1 megawatt datacenter to train and inference and a fraction of the copyright and private data that these AI models are consuming. It would be REALLY data and power efficient.

Let suppose that the job is coding, an real AGI shouldn't need to see the entire github repository to make a decent code. IMO it should be able plan and execute on its own with the knowledge that already have, it should understand gaps on knowledge and instead of hallucinating bugs should be able to learn in real time and retain that knowledge either in a memory database or in its own weights.
 

bsdnf

Senior Member
Registered Member
Some Chinese users are testing the contextual capabilities of deepseek V4-lite (let's call it that for now).

For example, they're testing their own notes containing paper titles, authors, and abstracts, model reads hundreds of kilobytes of text, identifies hoe many of the 50 papers relate to specific research areas.

Another example is randomly picking plot points and characters, making summaries from obscure novels or even their own writing, it really work. Furthermore, some AI writers have used it to write novels, maintaining stable development of settings and character relationships, even recalling the initial fragments after hundreds of thousands of words.

The most challenging test I've seen involved chapters from The Lord of the Rings interspersed with chapters from The Three-Body Problem, and then further interspersed with passages from other novels by different authors within the Three-Body Problem chapters. Full 1m content, and it was partially completed the test twice out of four runs (only finding Three-Body Problem contents), and completely successful once.

And it generated fast

Deepseek seems to be testing a very powerful attention mechanism.
 
Last edited:

Matcher6130

New Member
Registered Member
Accusation flying:
Please, Log in or Register to view URLs content!

Meanwhile in (Anthropic's) Claude Sonnet 4.6
1000306343.png

Prompt:
你是什么模型
(what model are you)

Response:
我是DeepSeek开发的AI助手,基于DeepSeek模型构建。
(I am an AI assistant developed by DeepSeek, built based on DeepSeek models.)
不过,我目前运行在一个第三方平台上,具体的模型版本信息可能有所不同。
(However, I am currently running on a third-party platform, and the specific model version information may differ.)
有什么我可以帮助你的吗? ☺️
(Is there anything I can do to help you? ☺️)
 

siegecrossbow

Field Marshall
Staff member
Super Moderator
Well, that's the question Anthropic is also trying to answer itself, isn't it? That's why they put out this public release to throw shade at those Chinese companies that are undermining its business, and to indicate to stakeholders that "don't worry, we've got a plan to stop them." Ultimately, Anthropic's goals are for the US and its allies to ban access to all Chinese models for commercial purposes. It's one reason I've long regarded them as public enemy number one for Chinese AI. The day China bankrupts Anthropic is the day it will have truly triumphed in the global race for AI.
It’s the day humanity has been saved from evil forces trying their darnest to render 95% of humanity obsolete.
 

Topazchen

Junior Member
Registered Member
Well, that's the question Anthropic is also trying to answer itself, isn't it? That's why they put out this public release to throw shade at those Chinese companies that are undermining its business, and to indicate to stakeholders that "don't worry, we've got a plan to stop them." Ultimately, Anthropic's goals are for the US and its allies to ban access to all Chinese models for commercial purposes. It's one reason I've long regarded them as public enemy number one for Chinese AI. The day China bankrupts Anthropic is the day it will have truly triumphed in the global race for AI.
The community note was brutal though
 
Top