Students at SLAI, a pilot AI educational institute in Shenzhen launched just last year, successfully performed SFT post-training of DeepSeek v4 Pro on a cluster of 1k Ascend 910C chips. They achieved an MFU of 34.9% with 3k training set of operations research and mathematical modeling questions, improving performance on select benchmarks
Although small in scale, this is a data point showing the maturity of the Ascend tech stack. With newer versions of Ascend chips at scale, complete pre-training and post-training will be possible.
Projects like this also give students real world AI experience using domestic AI chips and LLMs, strengthening the ecosystem. This will help diffuse knowledge and expertise when they enter industry.