Published 16 days ago • loading... • Updated 14 days ago

Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second

Meta unveiled the Llama API at its inaugural LlamaCon developer conference on April 29, 2025, in Menlo Park, California.
Meta developed the API to offer faster, cost-efficient AI model inference amid growing competition from OpenAI, Google, and emerging rivals like DeepSeek.
The Llama API runs on partner Cerebras’ specialized hardware, delivering speeds up to 2,648 tokens per second, approximately 18 times faster than OpenAI’s ChatGPT.
Meta emphasized its open-model approach by allowing customers to transfer custom models and pledging not to use customer data for training its own models.
This launch marks Meta's shift from solely providing open models to selling AI services, aiming to create new revenue streams and compete in a fast-growing AI inference market.

Insights by Ground AI

Does this summary seem wrong?

20 Articles

All

Left

Center

Right

South China Morning Post

Center

Meta introduces Llama application programming interface to attract AI developers

Llama API will help Meta go up against APIs offered by rival model makers including OpenAI, Google and China’s DeepSeek.

15 days ago·Hong Kong

Read Full Article

VentureBeat

Reposted by

IT Security News - cybersecurity, infosecurity news

Center

Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second

Meta partners with Cerebras to launch its new Llama API, offering developers AI inference speeds up to 18 times faster than traditional GPU solutions, challenging OpenAI and Google in the fast-growing AI services market.

16 days ago·San Francisco, United States

Read Full Article

Engadget

Reposted by

technewstube.com

Lean Left

Meta is making it easier to use Llama models for app development

Meta is releasing a new tool it hopes will encourage developers to use its family of Llama models for their next project. At its inaugural LlamaCon event in Menlo Park on Tuesday, the company announced the Llama API. Available as a limited free preview starting today, the tool gives developers a place to experiment with Meta's AI models, including the recently released Llama 4 Scout and Maverick systems. It also makes it easy to create new API k…

16 days ago·United States

Read Full Article

TechCrunch

Center

Meta previews an API for its Llama AI models

At its inaugural LlamaCon AI developer conference on Tuesday, Meta announced an API for its Llama series of AI models: the Llama API.

16 days ago·United States

Read Full Article

eeNews Europe

Meta and Cerebras team on fast inference for new Llama API

The partnership combines the most popular open-source model, Llama, with the fastest available inference technology. The post Meta and Cerebras team on fast inference for new Llama API appeared first on eeNews Europe.

14 days ago

Read Full Article

The Next Platform

With Its Llama API Service, Meta Platforms Finally Becomes A Cloud

A lot of companies talk about open source, but it can be fairly argued that Meta Platforms, the company that built the largest social network in the world and that has open sourced a ton of infrastructure software as well as datacenter, server, storage, and switch designs, walks the talk the best. … With Its Llama API Service, Meta Platforms Finally Becomes A Cloud was written by Timothy Prickett Morgan at The Next Platform.

14 days ago

Read Full Article

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/year