Artificial Intelligence thread

tokenanalyst · Mar 13, 2025

Please, Log in or Register to view URLs content!

GulfLander · Mar 13, 2025

gpt said:
OAI's founding principles have longed been abandoned, its top researchers have jumped ship,
Please, Log in or Register to view URLs content!
.
We'll find out in literally a year tops if they still retain their fabled first mover advantage.
In terms of policy on AI and semis, expect deregulation and tighter export controls/student/work visas. Same old, really.

Here's another banger:
Please, Log in or Register to view URLs content!

Been awhile since the gov not people thing was used.... they hire those people?

tphuang · Mar 13, 2025

https://twitter.com/i/web/status/1900351100156289421

Tencent + likely Alibaba & ByteDance have placed major H20 orders to secure enough compute for DeepSeek and likely before AI chip ban comes into place.

Apparently, Tencent & ByteDance were the 2nd and 3rd largest Nvidia customers last year by number of chips (after Microsoft), but I guess they were getting the less power H20 chips, so the total compute power would be ranked lower

Eventine · Mar 13, 2025

Please, Log in or Register to view URLs content!

Sesame's open source release was a bust. The company only released a 1B model (their original on the website was 8B) and not the rest of the pipeline code for creating their demo, either. The community is not receiving it well, so this is a great opportunity for a Chinese company to come in & leap frog on market share.

9dashline · Mar 13, 2025

Eventine said:
Please, Log in or Register to view URLs content!

Sesame's open source release was a bust. The company only released a 1B model (their original on the website was 8B) and not the rest of the pipeline code for creating their demo, either. The community is not receiving it well, so this is a great opportunity for a Chinese company to come in & leap frog on market share.

More likely, CIA had a talk with them

Hyper · Mar 13, 2025

Now that OpenAI is talking about copyright law means they might have already broken it, which means there must be a lawsuit against them. Now where can I join one for some sweet 50 bucks.

9dashline · Mar 14, 2025

Hyper said:
Now that OpenAI is talking about copyright law means they might have already broken it, which means there must be a lawsuit against them. Now where can I join one for some sweet 50 bucks.

Also means they are getting desperate.... its reeks of desperation to openly ban Open AI, for a company called OpenAI, will saying that IP theft is okay because the ends justify the means, all to beat China at all costs

This is how Amerikkka will lose it all

Wrought · Mar 14, 2025

FT reports that Deepseek is not interested in offers from various investors, and prefers to stay lean and mean.

Industry insiders said Liang has shown little intention to capitalise on DeepSeek’s sudden fame to further commercialise its technology in the near term. The company is instead focusing the majority of its resources on model development and the quest to build artificial general intelligence — machines with humanlike cognitive capabilities. These people added the independently wealthy founder has also declined to entertain interest from China’s tech giants as well as venture and state-backed funds to invest in the group for the time being. Many have found it difficult to even arrange a meeting with the secluded founder.

“We pulled top-level government connections and only got to sit down with someone from their finance department, who said ‘sorry we are not raising’,” said one investor at a multibillion-dollar Chinese tech fund. “They clearly are not interested in scaling up right now. It’s a rare situation where the founder is wealthy and committed enough to keep it lean in a Navy Seal-style for his pursuit of AGI.”

Please, Log in or Register to view URLs content!

OptimusLion · Mar 14, 2025

Tsinghua team open-sources large model inference engine "Chitu" to help domestic chips break through the cost and efficiency problems of FP8 model deployment and DeepSeek deployment

Professor Zhai Jidong's team from the Institute of High Performance Computing at Tsinghua University and Tsinghua-affiliated science and technology company Qingcheng Jizhi jointly announced the open source of the large model inference engine "Chitu", which is the first engine to natively run FP8 precision models on non-NVIDIA Hopper architecture GPUs and various domestic chips, bringing new breakthroughs to the widespread application and ecological construction of domestic AI chips.

Breaking the "hardware binding" dilemma, FP8 model deployment is no longer restricted

The development of DeepSeek has promoted the FP8 precision model to become the mainstream in the industry. With the continued popularity of DeepSeek, the demand for private deployment of large models in enterprises has also shown a blowout trend.

However, the current world-leading FP8 model has long relied on NVIDIA's H-series high-end GPUs, which has limited domestic companies in deploying large models due to the limitations of AI chips. On the one hand, the import of NVIDIA's H-series chips is restricted, making it difficult for domestic companies to obtain high-performance hardware support; on the other hand, most domestic chips do not support the FP8 data type and cannot fully utilize the performance of the new generation of AI models, making enterprise deployment costs high.

To break this dilemma, Tsinghua University and Tsinghua Unigroup jointly launched the open-source "Chitu" inference engine. Through the innovation of underlying technology, the engine has for the first time realized the efficient deployment of native FP8 models on non-H card devices (including GPU cards before NVIDIA's Hopper architecture and various domestic cards), getting rid of the dependence on specific hardware, and greatly reducing the threshold and cost of deploying AI models for enterprises.

Professor Zhai Jidong of Tsinghua University emphasized that Chitu condenses the team's years of accumulation of parallel computing and compilation optimization technology, and its goal is to "bridge the gap between advanced models and diversified hardware, so that domestic computing power can truly 'run' and provide key support for the implementation of China's large model industry." Qingcheng Jizhi CEO Tang Xiongchao said: "Chitu is positioned to become a bridge connecting diversified computing power and large model applications. We not only support NVIDIA's full range of GPUs, but also deeply optimize for domestic chips. In the future, we will gradually open source adaptation versions."

Please, Log in or Register to view URLs content!

tphuang · Mar 14, 2025

https://twitter.com/i/web/status/1900513336976138505

Alibaba revamped its quark app to incorporate latest Qwen advances.

Artificial Intelligence thread

tokenanalyst

Brigadier

GulfLander

Colonel

tphuang

General

Eventine

Junior Member

9dashline

Captain

Hyper

Junior Member

9dashline

Captain

Wrought

Senior Member

OptimusLion

Junior Member

tphuang

General