How DeepSeek's New Way to Train Advanced AI Models Could Disrupt Everything
9 Articles
How DeepSeek's new way to train advanced AI models could disrupt everything - again - WorldNL Magazine
ZDNET's key takeaways: DeepSeek debuted Manifold-Constrained Hyper-Connections, or mHCs; they offer a way to scale LLMs without incurring huge costs; and the company postponed the release of its R2 model in mid-2025. Just before the start of the new year, the AI world was introduced to a potentially game-changing new method for training advanced models. A team of researchers…
DeepSeek's paper latest evidence AI muscle head era coming to end
By Larry Dignan. DeepSeek has published a technical paper, co-authored by co-founder Liang Wenfeng, that argues for a new architecture for training foundational models. The paper, which details Manifold-Constrained Hyper-Connections (mHC), contends that such an architecture is needed to scale deep learning without signal divergence. Granted, the mHC paper is wonky, but the key takeaway is that you can have large-scale training that's more efficient with …
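To make the signal-divergence point concrete, here is a minimal, illustrative Python sketch, not taken from the DeepSeek paper: when every layer is free to rescale the residual stream, small deviations from 1 compound multiplicatively across depth and the activation norm drifts, whereas pinning the per-layer scaling to a norm-preserving value keeps it stable. The depth, the gain values, and the reduction of the constraint to a simple scalar gain are all assumptions made purely for illustration.

```python
import torch

depth = 64          # number of stacked layers in this toy model
torch.manual_seed(0)
x0 = torch.randn(1024)

# Unconstrained per-layer gains on the residual stream: each layer is free to
# rescale the carried signal, so small deviations from 1.0 compound with depth.
free_gains = 1.0 + 0.05 * torch.randn(depth)

# A crude stand-in for a "manifold constraint": force every gain back onto the
# norm-preserving value 1.0, so the carried signal keeps a stable scale.
constrained_gains = torch.ones(depth)

def final_norm(gains: torch.Tensor, x: torch.Tensor) -> float:
    """Apply only the residual-stream scaling of each layer (blocks omitted)
    and report the norm of the signal after the last layer."""
    for g in gains:
        x = g * x
    return x.norm().item()

print("unconstrained final norm:", final_norm(free_gains, x0.clone()))
print("constrained final norm:  ", final_norm(constrained_gains, x0.clone()))
```

Across 64 layers the unconstrained norm drifts away from the input norm while the constrained one matches it exactly; that kind of scale drift is the flavour of instability the mHC paper is reported to target, not a description of its actual mechanism.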
Deepseek says new method can train AI more efficiently and cheaply
Chinese AI company DeepSeek has unveiled a new training method, Manifold-Constrained Hyper-Connections (mHC), which it says makes it possible to train large language models more efficiently and at lower cost, reports the South China Morning Post. The method is a further development of so-called Hyper-Connections, a technique originally developed by ByteDance in 2024. That technology, in turn, builds on the classic ResNet architecture from Microsoft Re…
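As a rough illustration of that lineage, the sketch below contrasts a classic ResNet-style residual connection with a hyper-connection-style block that keeps several parallel hidden streams and mixes them with small learnable weights. This is a simplified reading of the publicly described idea; the class names, the number of streams and the exact mixing scheme are assumptions for illustration, not ByteDance's or DeepSeek's actual implementation.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Classic ResNet-style skip connection: output = x + f(x)."""
    def __init__(self, block: nn.Module):
        super().__init__()
        self.block = block

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.block(x)

class HyperConnectionBlock(nn.Module):
    """Illustrative hyper-connection-style block: the single residual stream is
    replaced by n parallel hidden streams, combined and carried forward through
    small learnable mixing weights (shapes and names are assumptions)."""
    def __init__(self, block: nn.Module, n_streams: int = 4):
        super().__init__()
        self.block = block
        # How the n streams are blended into the block's input.
        self.in_mix = nn.Parameter(torch.full((n_streams,), 1.0 / n_streams))
        # How strongly the block's output is written back onto each stream.
        self.out_mix = nn.Parameter(torch.ones(n_streams))
        # Stream-to-stream mixing, initialised to the identity so the block
        # starts out behaving like a plain residual connection.
        self.stream_mix = nn.Parameter(torch.eye(n_streams))

    def forward(self, streams: torch.Tensor) -> torch.Tensor:
        # streams: (n_streams, batch, seq, dim)
        mixed_in = torch.einsum('n,nbtd->btd', self.in_mix, streams)
        out = self.block(mixed_in)                        # (batch, seq, dim)
        carried = torch.einsum('nm,mbtd->nbtd', self.stream_mix, streams)
        return carried + self.out_mix.view(-1, 1, 1, 1) * out
```

With every stream initialised as a copy of the input, an identity stream mix, uniform input weights and unit output weights, each stream reduces to the familiar x + f(x), so the generalisation degrades gracefully to the ResNet baseline; the mHC idea, as described in the coverage above, is about constraining such mixing weights so that stacking many blocks stays stable at scale.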
Efficiency versus America's "higher, faster, further". The post DeepSeek: China mounts a new attack on the US in the AI race appeared first on financemarktwelt.de.
In the ever-escalating AI race, the real problem is no longer just raw performance, but stability. A new research paper published by DeepSeek proposes an understated but potentially decisive approach: making the training of large models more predictable, in order to avoid the costly failures that blow up the energy bill. The training of [...] The post The DeepSeek paper proposes a new architecture to reduce the energy waste of AI appeared first on BlogN…
Coverage Details
Bias Distribution
- 100% of the sources are Center



