DeepSeek Drops Open-Source Model that Compresses Text 10x Through Images, Defying Conventions
DeepSeek-OCR compresses text by up to 10 times while retaining 97% of information to help large language models process longer documents with lower computing costs.
- On Monday, DeepSeek released the open-source DeepSeek-OCR model on Hugging Face and GitHub, saying it compresses image-based text for LLMs using visual perception.
- DeepSeek built the model to address LLM long-context limits, as researchers said processing text as images can be more efficient for handling long-context documents with vision encoders.
- DeepSeek described the model's two-part architecture with a 380 million-parameter DeepEncoder and a DeepSeek3B-MoE-A570M decoder, trained on 30 million PDF pages in roughly 100 languages.
- Practically, the system supports high-throughput data generation for LLMs, producing training data at a scale of over 200,000 pages per day on a single NVIDIA A100 GPU, the company said.
- The paper says vision-text compression delivers major token reductions, reporting seven- to twenty-fold reductions and a compression factor of ten with 97% information retention; the release follows DeepSeek's V3 and R1 open-weight models.
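The headline numbers above can be related with simple arithmetic. The sketch below is illustrative only: the function name and the example token counts are assumptions, not taken from DeepSeek's code or paper, and show merely how a compression factor is computed from text-token and vision-token counts.

```python
# Illustrative sketch: how a "10x compression" figure is derived.
# compression_ratio() and the token counts below are hypothetical
# examples, not DeepSeek's implementation.

def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Ratio of original text tokens to the vision tokens that encode them."""
    return text_tokens / vision_tokens

# A page an LLM tokenizer would split into 1,000 text tokens,
# re-encoded as 100 vision tokens, yields the reported ~10x factor:
ratio = compression_ratio(1000, 100)
print(ratio)  # -> 10.0
```

The reported trade-off is that retention degrades as compression grows more aggressive: the paper cites roughly 97% information retention at the ten-fold point, with larger (up to twenty-fold) reductions available at lower fidelity.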
16 Articles
DeepSeek, the Chinese artificial intelligence research company that has repeatedly challenged assumptions about AI development costs, has released a new model that fundamentally reimagines how large language models process information, and the implications extend far beyond its modest branding as an optical character recognition tool. The company's DeepSeek-OCR model, released Monday with full open-source code and weights, achieves what researcher…
DeepSeek releases new OCR model capable of generating 200,000 pages daily on a single GPU · TechNode
DeepSeek has unveiled DeepSeek-OCR: Contexts Optical Compression, an open-source model developed by its DeepSeek-AI research team. The new system introduces a visual-based method to compress long text contexts, improving recognition efficiency while cutting computation costs. According to the team, DeepSeek-OCR surpasses several mainstream models in benchmark tests with far fewer visual tokens. It can also produce more than 200,000 pages of trai…
Chinese AI researchers want to keep chatbots fast and cheap by representing long contexts as images. Optical context compression is intended to make AI assistants more efficient.
The Chinese start-up DeepSeek has just released an open-source multimodal AI model that can process complex documents while drastically reducing computation costs. By using visual perception as a powerful compression tool, DeepSeek-OCR opens the way to analyzing previously inaccessible volumes of data.
Coverage Details
Bias Distribution
- 75% of the sources are Center