Salesforce's CRM Benchmark Finds AI Agents Struggle in Real-World Business Scenarios
2 Articles
Salesforce's new CRMArena-Pro benchmark reveals major challenges for AI agents in business contexts. Even top models like Gemini 2.5 Pro manage only a 58 percent success rate on single-turn tasks. In longer, multi-turn dialogs, performance drops to 35 percent. The article "Salesforce's CRM benchmark finds AI agents struggle in real-world business scenarios" first appeared on THE DECODER.
With CRMArena-Pro, Salesforce has introduced a new benchmark for AI agents. Even top models such as Gemini 2.5 Pro achieve only a 58 percent success rate on simple tasks. In longer dialogues, performance falls to 35 percent. The article "Salesforce benchmark shows: AI agents fail in complex business dialogues" first appeared on THE-DECODER.de.