Artificial Intelligence thread

Kalum Pupeter

Junior Member
Registered Member
HHUpV6dWUAAAZpg

Please, Log in or Register to view URLs content!
 

Wrought

Captain
Registered Member
HHUpV6dWUAAAZpg

Please, Log in or Register to view URLs content!

Same source was already posted on the previous page.

 

meedicx

Junior Member
Registered Member
So a new trend right now for benchmarking LLMs is pitting them against each other in PvP game environments. This is a more scalable method to test agentic capabilities than static benchmarks. Here is a good post on the benefits of this method:

Please, Log in or Register to view URLs content!

Recent Chinese LLMs performance have been surprisingly strong in these benchmarks especially Kimi K2.6 and unexpectedly MiMo V2.5 Pro.

GBENCH Intelligence Benchmark
1777827631923.png
Please, Log in or Register to view URLs content!

AI Coding Contest
1777827564934.png
Please, Log in or Register to view URLs content!
 

jli88

Junior Member
Registered Member
You know is a bubble when these guys are paying 5000 per post to bash the competition.

5000 per post is not a lot, if the content creators are large. So I don't think that is out of ordinary. The language makes it appear that this 5000 is not a default amount paid to everyone.

Also, even I was in the camp that AI is a bubble, but I am personally seeing people building whole companies on vibe code, where they are dying for more claude credits. So, I am starting to believe perhaps its not a bubble.
 

tokenanalyst

Lieutenant General
Registered Member
5000 per post is not a lot, if the content creators are large. So I don't think that is out of ordinary. The language makes it appear that this 5000 is not a default amount paid to everyone.

Also, even I was in the camp that AI is a bubble, but I am personally seeing people building whole companies on vibe code, where they are dying for more claude credits. So, I am starting to believe perhaps its not a bubble.
The whole Vibe Code thing just the hysteria in the bubble. Happens in every new technology. The problem if was valued at the right price there will be no bubble but these CEOs had hyped this tech to the moon and that is the problem, now they have to deliver. But at the cost of electricity increase and the cost serving these models become higher with demand I think these models are not going deliver what they are going to be priced, specially when there are cheaper Chinese models alternatives and even those can see their costs increased.

I VibeCoded myself for my own applications, I used Claude, I used OpenAi, now I mostly use my own AI Server and local models, because they meet my demands. Do the slop they produced worth it? Yes, for some applications, It worth the trillion dollars that these CEOs are hyping for? Absolutely no.

1777910361348.png
 
Top