A user on r/LocalLLaMA reported on May 12 that an Optane local LLM desktop build ran Moonshot’s Kimi K2.5 at about 4 tokens per second using discontinued Intel Optane Persistent Memory, a 12GB RTX 3060, and llama.cpp. The hardware, model family, and software path are all documentable from official sources. The 4 tokens per second figure is not vendor-confirmed — it comes from the builder’s own report.
Intel Optane Persistent Memory powers a loc…
This story is only covered by news sources that have yet to be evaluated by the independent media monitoring agencies we use to assess the quality and reliability of news outlets on our platform. Learn more here.