2606.26344

Total: 1

#1 Axon: A Synthesizing Superoptimizer for Tensor Programs [PDF] [Copy] [Kimi] [REL]

Authors: Akash Kothari, Shaowei Zhu, Daniel Kroening, Chungha Sung

Writing high performance kernels for AI accelerators requires deep expertise in tiling, instruction selection, data layout, and operator fusion placing a significant burden on programmers. In this paper, we focus on tile based AI accelerator programs and present Axon, a synthesizing superoptimizer for tensor programs: it uses program synthesis to automatically generate target instructions from semantics specifications, and explores semantically equivalent program variants to select the best performing kernel empirically. Axon discovers algebraic transformations by propagating operators through computation graphs and uses SMT over unbounded tensors to guarantee that all transformations preserve semantics without requiring hand crafted rewrite rules. It then lowers tensor operations to target ISA instructions, explores tiling configurations constrained by hardware descriptions, and fuses operators and instructions to minimize memory traffic.

Subjects: Programming Languages , Computation and Language , Performance

Publish: 2026-06-24 19:37:12 UTC