From my observation, China's LLM industry does lack computing power, but not to that extent, and this is not a problem that money alone can solve.
As DeepSeek pointed out, what they truly lack is data. LLM companies repeatedly claim that internet data has been exhausted and that they need to rely on synthetic data. However, a significant portion of new data does not simply disappear; it flows into those companies' closed databases through chatbots and agents. Leading US companies first provide services and acquire data, then train more powerful models, which bring in more users and more data. At the same time, they use these models to build more powerful development pipelines, creating a self-reinforcing flywheel.
This is likely one of the reasons DeepSeek initiated its major price cut. On the one hand, its extremely powerful KV-caching mechanism makes this possible: while Anthropic keeps its KV cache for only 5 minutes, DeepSeek can retain it for several days at no additional charge. On the other hand, they hope to absorb enough data to re-accelerate their own flywheel. Yes, 99% of the data is useless, but given enough data, some of it will eventually prove useful. No company dares to claim that it can rely solely on synthetic data and human experts.


