A bigger gun is not always a better weapon
“Over the past few months, we start to move off the concept or the belief of the American tech companies, which they call the scaling law, which required continuous expansion of the training cluster,”
This is the Mythical Man-Month all over again. It is as if people do not learn from history?
We saw this earlier with the USSR. Americans used bigger computers to solve mathematical problems, Soviet mathematicians used smarter methods. And it showed that whole classes of problems were intractable when approached with more computing power. Meanwhile, smarter mathematics resulted not only in solved problems, but also much more insight into even weirder mathematical problems. Even today, the effect on mathematics from the old soviet countries is visible.
When looking at 'AI' in the US, I see a lot of 'bigger guns' being applied, but disappointingly little insight coming out. And we already know that the newest foundational models build with the latest data centers would need more data than is available in the world.
Worse, increasingly the data out there is already generated by LLMs instead of humans. And AI generated data is the fast-food of training data, if not the hard-liquor.
In the end, the restrictions do what everyone expected, they force China to develop their own hardware as well as better mathematics and more efficient software.