Artificial Intelligence thread

9dashline

Captain
Registered Member

alright, I tried running qwen3.6 27B locally and it worked fine. had a nice web GUI which I could use. I'm happy with it.
recently google released a paper called TurboQuant that basically found an elegant way of quantizing the KV cache memory itself... allowing up to 4x to 6x longer context at near lossless .... I recompiled llama.cpp with this and now on my rtx 5070ti super that only has 16GB vram I can run qwen3.5 9b at the FULL 256k token context window... which is wild because if yall recall, when chatgpt first came out it only had 8k context... and for the longest time the 200k context was only something that antrophic offered enterprise customers...

qwen3.5 9b on artificalanalysis is only a few points under Gemini 2.5 Pro in Intelligence benchmarks which last year in 2025 it was SOTA...

won't be long before everyone can run near frontier models on their own hardware espeically with the new 1-bit models coming out... pair that with the likes of turboquant, qwen's deltanet and Deepseeks' enngram technique of seperating facts (knowledge ) from reasoning/logic/intelligence to make LLMs much capable at much smaller size/footprints, pretty soon offline will be as good as online, especially if the online versions will be going up in cost, and gated (mythos) and deliberatly handicapp (opus 4.7 got intentionally handicapped in the IT security domain etc) ....

if antrophic really cared about mankind, they would open weight the opus and mythos models to the world
 

Michael90

Senior Member
Registered Member
from someone that actually uses Kimi 2.6 for real work & business. This is the kind of stuff people should listen to about certain models instead of a disgruntled former ByteDance employee. I use K2.6 for my normal programming and it's great.


my experience with K2.6 is that it pretty much does everything I need it to do. The only issue is that since I'm using the free online version, it often runs out of token when I request it to do complex tasks. I guess I need to start paying money.
Yes Kimi is quite good. Think the best Chinese AI model I’ve used so far. It’s my default AI model for my work this days. The only issue is that it sometimes run out of token as you said, but yeah I’ve been reluctant to spend on it . But I think I would have to do so, since it’s quite helpful for my job and it doesn’t break the bank either .
One thing I noticed trying Deepseek is that it seems to give responses quite fast compared to Kimi. It’s almost instantaneous, compared to Kimi which tends to be slow. Not sure why that is the case .
 

meedicx

Junior Member
Registered Member
Yes Kimi is quite good. Think the best Chinese AI model I’ve used so far. It’s my default AI model for my work this days. The only issue is that it sometimes run out of token as you said, but yeah I’ve been reluctant to spend on it . But I think I would have to do so, since it’s quite helpful for my job and it doesn’t break the bank either .
One thing I noticed trying Deepseek is that it seems to give responses quite fast compared to Kimi. It’s almost instantaneous, compared to Kimi which tends to be slow. Not sure why that is the case .

I have a Kimi sub. You get a turbo speed up and the difference is night and day compared to the free version. But DeepSeek Instant still seems a bit faster, but the Kimi thinking mode dives deeper than DeepSeek expert.

Even at the intro sub tier, you can't run out of tokens using the normal chat modes, but may run out using the agent mode to do data processing. They recently changed the token logic, so all modes use the same token pool (except coding for some reason). But this has causes a weird experience, where if I max out my 5-hour token limit using Agent Swarm, I have to wait for hours to use the normal chat.
 

bsdnf

Senior Member
Registered Member
A/B testing and limited to images. Unfortunately, I wasn't given the opportunity to test it.

Based on current information, the visual model cannot perform online searches, and its error rate isn't particularly outstanding, but very fast and its world knowledge seems very deep. I'm curious about how they achieved this.
Please, Log in or Register to view URLs content!
Paper is out. The activation parameter is only 13B, no wonder it was so fast.
 

SanWenYu

Major
Registered Member
english.scio.gov.cn/m/chinavoices/2026-04/30/content_118471189.html

Please, Log in or Register to view URLs content!
Very brave move by the court to protect labour rights in this case. We will see how this court ruling affects future cases where human workers get replaced more and more by automation in general. China does not have a case law legal system but precedents are still referred to in courts of law in China.
 

siegecrossbow

Field Marshall
Staff member
Super Moderator
Very brave move by the court to protect labour rights in this case. We will see how this court ruling affects future cases where human workers get replaced more and more by automation in general. China does not have a case law legal system but precedents are still referred to in courts of law in China.
The SeeSeePee understands that allowing corporations to get away with this will negatively impact societal stability.
 

manqiangrexue

Brigadier
The SeeSeePee understands that allowing corporations to get away with this will negatively impact societal stability.
I guess they weighed it against the benefits of mass AI implementation, which could have helped China advance its technological edge. It's not too useful if you suffer from mass unemployment riots, but I am concerned for the negative consequences in the tech race that will determine the next globally dominant power. I also wonder how effective it will be. From what see, if your company considers you dead weight and wants you gone, they'll find a way.
 
Last edited:

magmunta

Junior Member
Registered Member
english.scio.gov.cn/m/chinavoices/2026-04/30/content_118471189.html

Please, Log in or Register to view URLs content!

View attachment 174251
That will hurt corporate profits and could give rise to inefficiencies; for example, if AI does cheaper and faster what junior developer does and a company is obliged to keep the junior, the company loses profits it would otherwise earn. But china is a socialist country, so CCP sticks to socialism.
 
Top