From Systems Of Record To Systems Of Reason: The Enterprise AI Revolution
SPICE uses a Challenger to create problems from large document corpora and a Reasoner to solve them, improving AI reasoning accuracy from 55% to 85%, researchers said.
4 Articles
Meta’s SPICE framework lets AI systems teach themselves to reason
Researchers at Meta FAIR and the National University of Singapore have developed a new reinforcement learning framework for self-improving AI systems. Called Self-Play In Corpus Environments (SPICE), the framework pits two AI agents against each other, creating its own challenges and gradually improving without human supervision. While currently a proof-of-concept, this self-play mechanism could provide a basis for future AI systems that can dyna…
Meta’s SPICE framework pushes AI toward self-learning without human supervision
Meta researchers have unveiled a new reinforcement learning framework called SPICE (Self-Play in Corpus Environments) that enables large language models (LLMs) to improve their reasoning skills without human supervision. Developed with the National University of Singapore, SPICE trains a single model to act as both a Challenger, which generates complex, document-based problems, and a Reasoner, which solves them. By grounding the learning process…
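The Challenger and Reasoner roles described above can be pictured with a toy loop. This is only an illustrative sketch, not the SPICE implementation: the names `Problem`, `challenger`, `reasoner`, and `self_play` are hypothetical, the "problems" are simple fill-in-the-blank questions, and a dictionary lookup stands in for the reinforcement learning update that a real system would apply to a language model.

```python
import random
from dataclasses import dataclass


@dataclass
class Problem:
    question: str
    answer: str


def challenger(corpus, rng):
    """Hypothetical Challenger: turn one corpus document into a fill-in-the-blank problem."""
    words = rng.choice(corpus).split()
    idx = rng.randrange(len(words))
    answer = words[idx]
    words[idx] = "____"  # mask the word the Reasoner must recover
    return Problem(question=" ".join(words), answer=answer)


def reasoner(problem, memory):
    """Hypothetical Reasoner: answer using only what it has learned so far."""
    return memory.get(problem.question, "")


def self_play(corpus, rounds, seed=0):
    """Run the toy loop and return one correct/incorrect flag per round."""
    rng = random.Random(seed)
    memory = {}  # stand-in for model parameters (an assumption of this sketch)
    results = []
    for _ in range(rounds):
        problem = challenger(corpus, rng)
        guess = reasoner(problem, memory)
        results.append(guess == problem.answer)
        # Stand-in for a learning update: remember the document-verified answer.
        memory[problem.question] = problem.answer
    return results
```

In this sketch the answer is checked against the source document rather than a human label, which is the sense in which the loop runs without human supervision. The Reasoner here can only repeat answers it has already seen, so it illustrates the data flow between the two roles and nothing about real reasoning gains.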
Coverage Details
Bias Distribution
- 100% of the sources are Center



