There’s no way I believe that DeepSeek was made for the $5m figure I’ve seen floating around.
But that doesn’t matter. If it cost $15m, $50m, $500m, or even more than that, it’s probably worth it to take a dump in Sam Altman’s morning coffee.
DeepSeek claimed the model training took 2,788 thousand H800 GPU hours, which, at a cost of $2/GPU hour, comes out to a mere $5.576 million.
That seems impossibly low.
DeepSeek is clear that these costs are only for the final training run, and exclude all other expenses.
There would have been many other runs before the release version.
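For anyone who wants to check the quoted arithmetic, here is a minimal sketch. It only uses the two figures from the claim above (2,788 thousand H800 GPU hours and $2 per GPU hour). The variable and function names are made up for illustration, and it covers the final training run only, not the earlier runs or any other expenses.

```python
# Figures as quoted in the thread; nothing here is independently verified.
GPU_HOURS = 2_788_000          # 2,788 thousand H800 GPU hours
RATE_USD_PER_GPU_HOUR = 2.0    # assumed rental price from the claim


def training_cost_usd(gpu_hours: float, rate: float) -> float:
    """Cost of a run as GPU hours times the hourly rate."""
    return gpu_hours * rate


final_run_cost = training_cost_usd(GPU_HOURS, RATE_USD_PER_GPU_HOUR)
print(f"${final_run_cost / 1e6:.3f} million")  # $5.576 million
```

So the headline number is internally consistent. Whether it is the whole bill is a separate question.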
There is no downside to lying these days. Yet the public seems surprised that all they see is lying.
So many people don’t even question it. Talk loud and confidently enough and, unfortunately, that’s the bar for most.
TikTok, Instagram and similar are great examples of this. Initially you think, wow, cool, I’m seeing all of these new things and getting so much info. Then someone comes up on a topic you know something about, and the facade breaks when all they do is spew misinformation that attracts a crowd (usually via fear).
Nevertheless, like the funding-hungry CEO he is, Altman quickly turned the thread around to OpenAI promising jam tomorrow, with the execution of the firm’s roadmap, amazing next-gen AI models, and “bringing you all AGI and beyond.”
AGI and beyond?
I kind of suspect this is as much about A.I. progress hitting a wall as anything else. It doesn’t seem like any of the LLMs are improving much between versions anymore. The U.S. companies were just throwing more compute (and money/electricity) at the problem and seeing small gains, but it’ll be a while before the next breakthrough.
Kind of like self-driving cars during their hype cycle. They felt tantalizingly close 10 years ago or so but then progress stalled and it’s been a slow grind ever since.
The number of people spamming ‘deepseek’ on YouTube comments and live streams is insane. They definitely have a shit load of shadow funding.
While I tend to avoid conspiracy theory type thinking, the nature of modern social media makes it very easy to run astroturfing/botting campaigns. It’s reasonable to be suspicious.
It’s easy to write a bot. You just ask ChatGPT, er, DeepSeek for the code.
I find the online cheerleading for AI and AGI strange. It feels like a frothing mob rooting for the unleashing of a monster at times.
I mean, a lot of it is just people who started using ChatGPT to do some simple and boring task (writing an email, CV, or summarizing an article) and started thinking that it’s the best thing since sliced bread.
I would know, since I’m a university student. I know the limitations of current AI stuff, so I can cautiously use it for certain tasks and don’t trust the output to be correct. Meanwhile, my friend thought that he was making ChatGPT better at answering his multiple choice economics quiz by telling it which of the answers it gave was wrong…
I hope that normal people will now realize how full of sh*t he is. They won’t, but DON’T TAKE THIS FROM ME
Does the elephant call the ant hopeless?
One of them is threatened with extinction. ;-)