Artificial Intelligence thread

Hyper · Apr 29, 2025

tphuang said:
https://twitter.com/i/web/status/1916990057572925716

Qwen 3 scores are unreal. I mean the flagship model is better than o3-mini in reasoning, that's crazy.

But even more crazy is 4B model being as good as 72B model from Qwen2.5, which itself was as good as Llama3-70B model. So yeah, that's ridiculous.

It's not really better than R1. R1 was better with math.

mossen · Apr 30, 2025

Qwen 3 is an efficient model, but it doesn't do great on factuality (SimpleQA).

By comparison, R1 gets around 30%. Qwen barely hits 11% at best and often below 10%. Gemini Flash 2.5 gets over 30% too. Gemini Pro gets 53%. I think Qwen's biggest strength is how efficient it is for its smaller sizes. But it's not as good as the leading models. Not even as good as the leading open source model (R1). And DeepSeek is likely to release a new model very shortly.

Qwen is far better than Llama, but Deepseek is still the open source king.

Wrought · Apr 30, 2025

Deepseek returns to South Korea following a two-month ban.

SEOUL, April 28 (Reuters) - Chinese artificial intelligence service DeepSeek became available again on South Korean app markets on Monday for the first time in about two months, when downloads were suspended after authorities cited breaches in data protection rules.
South Korea's Personal Information Protection Commission said on Thursday that DeepSeek transferred
Please, Log in or Register to view URLs content!
and prompts without permission when the service first launched in South Korea in January.

Downloading the app was suspended in February after the questions over personal data protection surfaced, but the service was available for download again on South Korea's app market including via Apple's App Store and Google Play Store.

Please, Log in or Register to view URLs content!

GulfLander · Apr 30, 2025

Wrought said:
Deepseek returns to South Korea following a two-month ban.

Please, Log in or Register to view URLs content!

What changed?

Wrought · Apr 30, 2025

GulfLander said:
What changed?

They provided a Korean-language privacy policy which informs users it will send data to Chinese servers and requires them to consent.

tphuang · Apr 30, 2025

two pieces of news today

Deepseek Prover V2 is out, still waiting for more info here

https://twitter.com/i/web/status/1917493315936698605

Xiaomi has gotten into the open source release world with MiMo

Please, Log in or Register to view URLs content!

if you look at its RL model, it's quite good

Please, Log in or Register to view URLs content!

european_guy · Apr 30, 2025

mossen said:
Qwen 3 is an efficient model, but it doesn't do great on factuality (SimpleQA).

View attachment 150915

By comparison, R1 gets around 30%. Qwen barely hits 11% at best and often below 10%. Gemini Flash 2.5 gets over 30% too. Gemini Pro gets 53%. I think Qwen's biggest strength is how efficient it is for its smaller sizes. But it's not as good as the leading models. Not even as good as the leading open source model (R1). And DeepSeek is likely to release a new model very shortly.

Qwen is far better than Llama, but Deepseek is still the open source king.

Please, Log in or Register to view URLs content!

is an OpenAI dataset that is all but simple.

Hera are some random questions out of the

Please, Log in or Register to view URLs content!

:

- Who received the IEEE Frank Rosenblatt Award in 2010? (Michio Sugeno)

- How much money, in euros, was the surgeon held responsible for Stella Obasanjo's death ordered to pay her son? (120,000)

- What were the month and year when Obama told Christianity Today, "I am a Christian, and I am a devout Christian. I believe in the redemptive death and resurrection of Jesus Christ"? (January 2008)

These are far from simple....here simpleQA means that those are single fact questions, where the answer is just a couple of words.

Small models, without many hundreds/thousands of billions of parameters cannot perform well on this test, because all these little facts are stored in the model's parameters...and you need tons of them to learn all the small, single little facts that happened in the world at that level of detail.

This is a good test to indirectly get some hints on the model size, for instance, for closed models, like the OpenAI ones, if model A has a better result on SImpleQA of model B, then almost certainly model A is bigger than B.

This test does not measure how much a model is "smart" or good at instruction following, it does not measure how much a model is useful for day by day usage.

anamensis25 · Apr 30, 2025

Wall Street/Financial bros get so PSTD with DeepSeek, that they report anything related to Whale

https://twitter.com/i/web/status/1917540374144332127

tokenanalyst · May 1, 2025

Cambrian had completed support for the entire Qwen3 series. Users can immediately experience the highlights of the Qwen3 series on the Cambrian® AIDC® large model all-in-one machine, and experience the new generation of models ’ more powerful multi-modal capabilities, fast thinking/slow thinking mode switching, etc.

The Qwen3 series model family is China's first hybrid reasoning model series, integrating "fast thinking" and "slow thinking" into the same model. It can provide answers in seconds with low computing power for simple needs, and can "deeply think" in multiple steps for complex problems. It is pre-trained based on massive multi-language and multi-modal data, and fine-tuned with high-quality data, and performs well in aligning human preferences.

Models with hundreds of billions of parameters, upgraded multimodal capabilities, long-context understanding of tens of millions of words, 3 times the reasoning efficiency, and support for commercial APIs and open source code libraries. These major upgrades can be flexibly deployed on the Cambrian AIDC large model all-in-one machine and are ready to use out of the box.

Cambrian AIDC large model all-in-one machine can provide users with high-performance AI computing power options through a variety of product combinations. At the same time, Cambrian Neuware software platform can provide complete development tools, software and hardware collaboration, truly lower the user threshold, simplify the deployment, management and optimization process of models such as Qwen3, and achieve universal and easy use.

Cambricon AIDC large model all-in-one machine has fully supported mainstream large models including DeepSeek-V3/R1, Qwen3, etc. For more information about Cambricon AIDC large model all-in-one machine.

Please, Log in or Register to view URLs content!

Randomuser · May 1, 2025

tokenanalyst said:
Cambrian had completed support for the entire Qwen3 series. Users can immediately experience the highlights of the Qwen3 series on the Cambrian® AIDC® large model all-in-one machine, and experience the new generation of models ’ more powerful multi-modal capabilities, fast thinking/slow thinking mode switching, etc.

The Qwen3 series model family is China's first hybrid reasoning model series, integrating "fast thinking" and "slow thinking" into the same model. It can provide answers in seconds with low computing power for simple needs, and can "deeply think" in multiple steps for complex problems. It is pre-trained based on massive multi-language and multi-modal data, and fine-tuned with high-quality data, and performs well in aligning human preferences.

Models with hundreds of billions of parameters, upgraded multimodal capabilities, long-context understanding of tens of millions of words, 3 times the reasoning efficiency, and support for commercial APIs and open source code libraries. These major upgrades can be flexibly deployed on the Cambrian AIDC large model all-in-one machine and are ready to use out of the box.

Cambrian AIDC large model all-in-one machine can provide users with high-performance AI computing power options through a variety of product combinations. At the same time, Cambrian Neuware software platform can provide complete development tools, software and hardware collaboration, truly lower the user threshold, simplify the deployment, management and optimization process of models such as Qwen3, and achieve universal and easy use.

Cambricon AIDC large model all-in-one machine has fully supported mainstream large models including DeepSeek-V3/R1, Qwen3, etc. For more information about Cambricon AIDC large model all-in-one machine.

View attachment 151011

Please, Log in or Register to view URLs content!
Please, Log in or Register to view URLs content!

Seems Cambricon has finally found it's rythmn after all these years huh

Artificial Intelligence thread

Hyper

Junior Member

mossen

Senior Member

Wrought

Captain

GulfLander

Brigadier

Wrought

Captain

tphuang

General

european_guy

Junior Member

anamensis25

New Member

tokenanalyst

Lieutenant General

Randomuser

Major