Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second
- Meta unveiled the Llama API at its inaugural LlamaCon developer conference on April 29, 2025, in Menlo Park, California.
- Meta developed the API to offer faster, cost-efficient AI model inference amid growing competition from OpenAI, Google, and emerging rivals like DeepSeek.
- The Llama API runs on partner Cerebras’ specialized hardware, delivering speeds up to 2,648 tokens per second, approximately 18 times faster than OpenAI’s ChatGPT.
- Meta emphasized its open-model approach by allowing customers to transfer custom models and pledging not to use customer data for training its own models.
- This launch marks Meta's shift from solely providing open models to selling AI services, aiming to create new revenue streams and compete in a fast-growing AI inference market.
20 Articles
20 Articles


Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second
Meta partners with Cerebras to launch its new Llama API, offering developers AI inference speeds up to 18 times faster than traditional GPU solutions, challenging OpenAI and Google in the fast-growing AI services market.
Meta is making it easier to use Llama models for app development
Meta is releasing a new tool it hopes will encourage developers to use its family of Llama models for their next project. At its inaugural LlamaCon event in Menlo Park on Tuesday, the company announced the Llama API. Available as a limited free preview starting today, the tool gives developers a place to experiment with Meta's AI models, including the recently released Llama 4 Scout and Maverick systems. It also makes it easy to create new API k…
With Its Llama API Service, Meta Platforms Finally Becomes A Cloud
A lot of companies talk about open source, but it can be fairly argued that Meta Platforms, the company that built the largest social network in the world and that has open sourced a ton of infrastructure software as well as datacenter, server, storage, and switch designs, walks the talk the best. … With Its Llama API Service, Meta Platforms Finally Becomes A Cloud was written by Timothy Prickett Morgan at The Next Platform.
Coverage Details
Bias Distribution
- 75% of the sources are Center
To view factuality data please Upgrade to Premium
Ownership
To view ownership data please Upgrade to Vantage