DeepSeek announces open-source initiative and revealed FlashMLA model · TechNode
- DeepSeek announced on February 21 that it will open-source five code repositories over the next week, with new content unlocked daily.
- DeepSeek introduced the FlashMLA model, optimized for Hopper GPUs, and designed for variable-length sequences, confirming its readiness for production.
- At the Global Developer Conference in Shanghai, open-source developers showcased AI possibilities, highlighting the local industry's value at over 450 billion yuan.
- Shanghai Vice-Mayor Chen Jie stated the importance of enhancing the open-source ecosystem and community growth during the conference's opening ceremony.
6 Articles
6 Articles
DeepSeek launches FlashMLA: A breakthrough in AI speed and efficiency for NVIDIA GPUs - Tech Startups
Following the success of its R1 model, Chinese AI startup DeepSeek on Monday unveiled FlashMLA, an open-source Multi-head Latent Attention (MLA) decoding kernel optimized for NVIDIA’s Hopper GPUs. Think of FlashMLA as both a super-efficient translator and a turbo boost […] The post DeepSeek launches FlashMLA: A breakthrough in AI speed and efficiency for NVIDIA GPUs first appeared on Tech Startups.
DeepSeek announces open-source initiative and revealed FlashMLA model · TechNode
On February 21, DeepSeek revealed on social media platform X that it will be open-sourcing five code repositories over the next week, with new content unlocked daily. The company emphasized its commitment to sharing “small but sincere progress” as part of its mission to accelerate tech innovation. The company’s online services have been tested and are now ready for deployment in production. The company, which calls itself a “small team,” highlig…
DeepSeek Opens Access To AI Code, Expanding Open-Source Efforts
China’s DeepSeek has made waves by releasing its AI models as “open source.” The move raises questions about what this means for projects trying to replicate DeepSeek’s achievements. I Photo: Tim Reckmann Flickr However, in the AI world, that term can mean different things. While DeepSeek previously allowed free use and modification of its models, it had yet to publish the underlying code—until now, David Meyer reported for Fortune’s Data Sheet.…
DeepSeek: The Cost-Efficient AI Revolution Shaking Silicon Valley
In late December 2024, a relatively unknown Chinese startup named DeepSeek unleashed a seismic shift in the artificial intelligence landscape. With the release of its DeepSeek-v3 model, followed swiftly by the DeepSeek-R1 in January 2025, the company didn’t just introduce a new player to the AI game—it shattered long-held assumptions about the resources required to build cutting-edge AI. While U.S. tech giants like OpenAI, Google, and Meta have …
DeepSeek Launches FlashMLA, an MLA Decoding Kernel for Hopper GPUs
DeepSeek, a Chinese artificial intelligence (AI) lab by High-Flyer startup, has kicked off its “Open Source Week” by releasing FlashMLA, a decoding kernel designed for Hopper GPUs. It is optimised for processing variable-length sequences and is now in production. The kernel supports BF16 and features a paged KV cache with a block size of 64. On the H800 GPU, it achieves speeds of 3000 GB/s in memory-bound configurations and 580 TFLOPS in compute…
Coverage Details
Bias Distribution
- 67% of the sources are Center
To view factuality data please Upgrade to Premium