|国家预印本平台
首页|Efficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming Interface

Efficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming Interface

Efficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming Interface

来源:Arxiv_logoArxiv
英文摘要

Accelerators such as neural processing units (NPUs) deliver an enticing balance of performance and efficiency compared to general purpose compute architectures. However, effectively leveraging accelerator capabilities is not always simple: low-level programming toolkits may require substantial developer effort while high-level programming toolkits may abstract critical optimization features. This work aims to increase efficiency of designers using IRON, a toolkit for close-to-metal NPU performance engineers. We provide an updated programmer interface to IRON containing new and refined programming constructs. The new interface includes extensible features for placement and data transformation. These contributions are evaluated in terms of 1) efficiency, with analysis showing ~26% average reduction in lines of code and decreases in Halstead metrics for a variety of designs; 2) expressivity, demonstrating the new interface supports the wide range of features and patterns already supported by IRON; and 3) extensibility, illustrating the new tooling for placement and tiling can be extended to accommodate common use-cases.

Eric Keller、Phil James-Roxby、Erika Hunhoff、Joseph Melber、Kristof Denolf、Andra Bisca、Samuel Bayliss、Stephen Neuendorffer、Jeff Fifield、Jack Lo、Pranathi Vasireddy

微电子学、集成电路计算技术、计算机技术

Eric Keller,Phil James-Roxby,Erika Hunhoff,Joseph Melber,Kristof Denolf,Andra Bisca,Samuel Bayliss,Stephen Neuendorffer,Jeff Fifield,Jack Lo,Pranathi Vasireddy.Efficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming Interface[EB/OL].(2025-04-25)[2025-06-14].https://arxiv.org/abs/2504.18430.点此复制

评论