Artificial Intelligence thread

tphuang · Apr 9, 2026

https://twitter.com/i/web/status/2042404881508520157

my piece on the compute side of things. Looks like the domestic chips that were severely underutilized before are finally getting put to use. Inference was never the issue. The issue for Chinese AI labs is funding and also access to large enough training cluster (most of then like Nvidia). Not everyone is like ByteDance and have 500+ Blackwell racks in Malaysia for training.

Wrought · Apr 9, 2026

Piece on the AI talent exodus from US -> China last year, driven by both push and pull factors.

For decades, the Valley served as a gravitational well for global tech ambition, a place where ideas met capital and engineering prowess. But the trajectory is starting to invert. Figures such as Wu Yonghui, who gave up a senior role at Google DeepMind to lead ByteDance’s push into next-generation large language models, and Yao Shunyu, who left OpenAI to anchor Tencent’s AI development, have returned in the past year. Other returnees include Roger Jiang, a senior scientist who left OpenAI to found his own robotics start-up in Shenzhen, and Zhou Hao, a researcher Alibaba poached from Google DeepMind to refine models. And three AI-focused headhunters based in China and San Francisco say they helped hire and relocate more than 30 US-based researchers to China in the past 12 months, versus a low single-digit a year earlier. This is not just a trickle of nostalgia — it is a calculated realignment. Macro and micro factors in China are creating a magnetic pull that is becoming more difficult to resist.

Please, Log in or Register to view URLs content!

solarz · Apr 10, 2026

bsdnf said:
1M context window is a lie.

The reality is that the upper limit for all current models is 256k. Once you hit this limit, model performance will drops significantly, you have to go context compression.

Especially now that Opus suffers from severe computational power deduction, problems that could previously be solved by brute force using powerful benchmarking capabilities after exceeding the window limit are now exposed.

Can you imagine? It can't even solve a car wash problem now.

At the end of the day, LLM response generation is O(n * k), where n is the context length, and k is the response length. This means the larger the context, the longer it takes to generate the next token. Therefore, barring a new algorithm or a revolutionary improvement over the current Attention-based Transformer architecture, LLMs are not really scalable for large tasks. The only way to use them effectively is to break the problem down into small pieces (which in Software Engineering is like 90% of the work), and that's where they'll be stuck for the foreseeable future when it comes to coding.

bsdnf · Apr 10, 2026

solarz said:
At the end of the day, LLM response generation is O(n * k), where n is the context length, and k is the response length. This means the larger the context, the longer it takes to generate the next token. Therefore, barring a new algorithm or a revolutionary improvement over the current Attention-based Transformer architecture, LLMs are not really scalable for large tasks. The only way to use them effectively is to break the problem down into small pieces (which in Software Engineering is like 90% of the work), and that's where they'll be stuck for the foreseeable future when it comes to coding.

I'm not that pessimistic. A year ago, there weren't many models even worth using up to the 200k limit.

bsdnf · Apr 10, 2026

HappyHorse is Alibaba's model.

Please, Log in or Register to view URLs content!

GulfLander · Apr 10, 2026

OpenAI shelves Stargate UK in blow to Britain’s AI ambitions
Artificial intelligence company cites high energy costs and regulation for putting landmark project on hold

Please, Log in or Register to view URLs content!

Michael90 · Apr 10, 2026

tphuang said:
Not everyone is like ByteDance and have 500+ Blackwell racks in Malaysia for training

How do they get that when the US has imposed strict sanctions against Chinese tech companies?

tphuang · Apr 10, 2026

Michael90 said:
How do they get that when the US has imposed strict sanctions against Chinese tech companies?

they rent it from someone else that acquired the chips.

tphuang · Apr 10, 2026

https://twitter.com/i/web/status/2042611135434891592

According to Arena rankings, GLM-5.1 is 3rd best, behind just the two different opus-4.6 versions and ahead of sonnet and oput-4.5 as well as Gemini-3.1 and GPT-5.4

Again, I tried GLM-5 turbo, really good

Michael90 · Apr 10, 2026

Wrought said:
Piece on the AI talent exodus from US -> China last year, driven by both push and pull factors.

Please, Log in or Register to view URLs content!

I think it’s moving more to a two way street movement now instead of the one way movement from china to the US like before . So some now leave while others remain or come to the US . Look at Alibabas top AI leaders who moved recently to meta. So its more of a two way things now not just one way unlike before, so china is improving but still a long way to go to match the US in attracting the best talents globally

Artificial Intelligence thread

tphuang

General

Wrought

Captain

solarz

Brigadier

bsdnf

Senior Member

bsdnf

Senior Member

GulfLander

Brigadier

Michael90

Senior Member

tphuang

General

tphuang

General

Michael90

Senior Member