Enable auto-vectorization of add/mul reduction loops on NEON hardware by raneashay · Pull Request #63 · microsoft/openjdk-jdk

raneashay · 2026-03-23T21:59:14Z

On NEON, this patch enables auto-vectorization of sum and product
reduction loops, which in turn allows several BLAS functions to be
vectorized. Specifically, it adds strict-order NEON reduction
instructions for the `{Add|Mul}ReductionV{F|D}` operations. Before this
change, `match_rule_supported_auto_vectorization()` blocked these
operations, preventing vectorization of the reduction loops common in
dot products, matrix-vector multiplications, and matrix-matrix
multiplications. The patch also adds `UseSVE` guards to the existing
SVE reduction predicates so that they are not matched on NEON-only
hardware.
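
For illustration, the description above refers to loops of roughly the following shape. This is a minimal sketch, not code from the patch: the class `ReductionLoops` and all of its method names are hypothetical, and whether C2 actually vectorizes a given loop depends on conditions the description does not cover. The last two methods show why ordering matters for floating-point reductions, assuming "strict-order" means that elements are combined in source order: `sumTwoLanes` mimics a reassociated reduction with two partial sums and can produce a different result from the in-order `sumStrict`.

```java
public class ReductionLoops {
    // Sum reduction over element-wise products: the core loop of a dot product.
    static float dot(float[] a, float[] b) {
        float sum = 0.0f;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Product reduction over a double array.
    static double product(double[] a) {
        double prod = 1.0;
        for (int i = 0; i < a.length; i++) {
            prod *= a[i];
        }
        return prod;
    }

    // In-order sum: elements are added left to right, as the source specifies.
    static float sumStrict(float[] a) {
        float sum = 0.0f;
        for (float v : a) {
            sum += v;
        }
        return sum;
    }

    // Hypothetical reassociated sum: two independent partial sums that are
    // combined at the end. This is NOT what a strict-order reduction does;
    // it is here only to show that reordering can change the result.
    static float sumTwoLanes(float[] a) {
        float lane0 = 0.0f;
        float lane1 = 0.0f;
        int i = 0;
        for (; i + 1 < a.length; i += 2) {
            lane0 += a[i];
            lane1 += a[i + 1];
        }
        if (i < a.length) {
            lane0 += a[i]; // leftover element for odd lengths
        }
        return lane0 + lane1;
    }

    public static void main(String[] args) {
        float[] a = {1e8f, 1.0f, -1e8f, 1.0f};
        System.out.println(sumStrict(a));   // prints 1.0
        System.out.println(sumTwoLanes(a)); // prints 2.0
    }
}
```

The two sums differ because `1e8f + 1.0f` rounds back to `1e8f` in single precision, so the in-order sum loses the first `1.0f` while the two-lane version keeps both.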


raneashay force-pushed the ashay/improve-auto-vectorization branch from 5ca717d to 6c4c75d · March 23, 2026 23:23

raneashay changed the title ~~Enable auto-vectorization of BLAS kernels on NEON hardware~~ Enable auto-vectorization of add/mul reduction loops on NEON hardware Mar 23, 2026

raneashay force-pushed the ashay/improve-auto-vectorization branch from 6c4c75d to a3491b4 · March 24, 2026 14:35

raneashay wants to merge 1 commit into microsoft:main from raneashay:ashay/improve-auto-vectorization
