Rating Standard
NPUBench Standard v1.0 · Section 7 · Performance rating thresholds based on real hardware measurements
Performance Rating Tables
All thresholds derived from real hardware measurements — average of 5 runs on Llama 3.2 3B Instruct (int4), translation task
About the Rating System
The NPUBench rating system evaluates four independent dimensions: TTFT (Prefill phase latency), TPS (Decode phase generation rate), NPU Acceleration Ratio (NPU vs CPU speedup), and Power Efficiency (TPS per watt). Each dimension is rated independently. A device may excel in one metric while being acceptable in another. Thresholds are fixed in NPUBench Standard v1.0 and will not change between versions to ensure cross-version comparability.
Legal Notice
NPUBench Standard v1.0 · Apache 2.0 License · Not affiliated with MLCommons · MLPerf® is a registered trademark of MLCommons Association