Artificial Intelligence thread

lych470

Junior Member
Registered Member
It's not creating the model to gain a competitive advantage that's interesting.
It's releasing the model so everyone, including other hedge funds, can share in your advantage, which nullifies it. That's what's interesting.

The PLA's military procurement philosophy is '探索一代、预研一代、研制一代、生产一代' ('explore a generation, pre-research a generation, develop a generation, produce a generation'). I imagine there is a similar ethos in Chinese AI companies. I would be very surprised if they don't have something better up their sleeves.
 

luminary

Senior Member
Registered Member
DeepSeek-R1 model is now available as an NVIDIA NIM microservice preview
So for tokens per second:
Cerebras ??: 1600
Nvidia H200: 3800

I don't think Steve Hsu knows what he's talking about. Cerebras, Groq, SambaNova, etc. are gimmicks.

Cerebras probably claimed the 57x number off of this.
Indeed, on that page the 57x number funnily enough comes up again: the WSE-3 is 57x larger than an H100 die...
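For what it's worth, that ratio checks out as a die-area comparison rather than a throughput one. A minimal sanity check, assuming the commonly cited vendor figures (WSE-3 wafer ≈ 46,225 mm², H100 die ≈ 814 mm²; neither number comes from this thread):

```python
# Rough check of the "57x" claim as a die-area ratio.
# Assumed figures: WSE-3 ~46,225 mm^2 (Cerebras), H100 die ~814 mm^2 (NVIDIA).
wse3_area_mm2 = 46_225
h100_area_mm2 = 814

ratio = wse3_area_mm2 / h100_area_mm2
print(f"WSE-3 is ~{ratio:.1f}x the area of an H100 die")  # ~56.8x, i.e. the ~57x figure
```

So the 57x is silicon area, not tokens per second.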
 

GulfLander

Colonel
Registered Member
"UC Berkeley researchers have developed a small-scale language model reproduction of DeepSeek R1-Zero, an AI language model developed in China, for about $30.[...]
The language model TinyZero is a project led by campus graduate researcher Jiayi Pan and three other researchers, advised by campus professor Alane Suhr and University of Illinois at Urbana-Champaign assistant professor Hao Peng.[...] TinyZero is a small-scale reproduction, with the $30 price going toward server costs to run the experiments. TinyZero is “only useful for very restricted types of tasks” such as countdown and multiplication tasks[...]"
