add `vpdpbusd` avx512 intrinsic #4776

folkertdev · 2025-12-18T13:15:47Z

This intrinsic is useful for the adler32 checksum algorithm.

The test attempts to hit a bunch of overflow and truncation cases, and I've validated it on real hardware.

rustbot · 2025-12-18T13:15:51Z

Thank you for contributing to Miri! A reviewer will take a look at your PR, typically within a week or two.
Please remember to not force-push to the PR branch except when you need to rebase due to a conflict or when the reviewer asks you for it.

folkertdev · 2025-12-18T13:29:23Z

src/shims/x86/mod.rs

+        let intermediate = i32::from(i16::from(a1).wrapping_mul(i16::from(b1 as i8)))
+            .wrapping_add(i32::from(i16::from(a2).wrapping_mul(i16::from(b2 as i8))))
+            .wrapping_add(i32::from(i16::from(a3).wrapping_mul(i16::from(b3 as i8))))
+            .wrapping_add(i32::from(i16::from(a4).wrapping_mul(i16::from(b4 as i8))));


the as i8 is intentional here so that sign extension is used. So try_from would not work here, how does miri generally handle this?

I don't know the type of everything involved here, but there is cast_signed/cast_unsigned -- does that suffice?

RalfJung

Thanks for the PR!

I am slightly concerned about slowly growing a huge avx512 file that nobody has an overview of any more.^^ But as long as there's a clear motivation in the form of a core ecosystem crate, I hope that will naturally limit the scope of what we have to support.

View changes since this review

RalfJung · 2025-12-20T16:51:39Z

src/shims/x86/mod.rs

+/// <https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=_mm_dpbusd_epi32>
+/// <https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=_mm256_dpbusd_epi32>
+/// <https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=_mm512_dpbusd_epi32>
+fn vpdpbusd<'tcx>(


If this is only used in avx512, please move the function to that file.

RalfJung · 2025-12-20T16:52:04Z

src/shims/x86/mod.rs

    interp_ok(())
 }

+/// Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in a with corresponding signed


Suggested change

/// Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in a with corresponding signed

/// Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in `a` with corresponding signed

same for all the other references to variable names in the doc comment

RalfJung · 2025-12-20T16:55:17Z

src/shims/x86/mod.rs

+        let intermediate = i32::from(i16::from(a1).wrapping_mul(i16::from(b1.cast_signed())))
+            .wrapping_add(i32::from(i16::from(a2).wrapping_mul(i16::from(b2.cast_signed()))))
+            .wrapping_add(i32::from(i16::from(a3).wrapping_mul(i16::from(b3.cast_signed()))))
+            .wrapping_add(i32::from(i16::from(a4).wrapping_mul(i16::from(b4.cast_signed()))));
+


We should find a way to make this more readable...
As a start, why are you mixing i16 and i32? And why wrapping_mul? multiplying two i8 as an i16 cannot overflow, right? Same for add. If things can never overflow, please use the strict operations.

Also, I think it would make sense to let-bind the 4 multiplications. Maybe that could even be written in a loop, e.g. via from_fn?

RalfJung · 2025-12-20T16:56:10Z

tests/pass/shims/x86/intrinsics-x86-avx512.rs

 }

+#[target_feature(enable = "avx512vnni")]
+unsafe fn test_avx512vnni() {


You mentioned that this is aiming to hit a bunch of overflow and truncation cases. Please add comments pointing that out.

rustbot · 2025-12-20T16:58:15Z

Reminder, once the PR becomes ready for a review, use @rustbot ready.

rustbot added the S-waiting-on-review Status: Waiting for a review to complete label Dec 18, 2025

folkertdev commented Dec 18, 2025

View reviewed changes

folkertdev mentioned this pull request Dec 18, 2025

add vnni adler32 variant trifectatechfoundation/zlib-rs#448

Draft

add vpdpbusd avx512 intrinsic

c515a6a

folkertdev force-pushed the vpdpbusd branch from 5b26c30 to c515a6a Compare December 18, 2025 15:37

RalfJung requested changes Dec 20, 2025

View reviewed changes

rustbot added S-waiting-on-author Status: Waiting for the PR author to address review comments and removed S-waiting-on-review Status: Waiting for a review to complete labels Dec 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add `vpdpbusd` avx512 intrinsic #4776

add `vpdpbusd` avx512 intrinsic #4776

folkertdev commented Dec 18, 2025

Uh oh!

rustbot commented Dec 18, 2025

Uh oh!

folkertdev Dec 18, 2025

Uh oh!

RalfJung Dec 18, 2025

Uh oh!

RalfJung left a comment •

edited by rustbot

Loading

Uh oh!

RalfJung Dec 20, 2025

Uh oh!

RalfJung Dec 20, 2025

Uh oh!

RalfJung Dec 20, 2025

Uh oh!

RalfJung Dec 20, 2025

Uh oh!

rustbot commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	/// Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in a with corresponding signed
	/// Multiply groups of 4 adjacent pairs of unsigned 8-bit integers in `a` with corresponding signed

add vpdpbusd avx512 intrinsic #4776

Are you sure you want to change the base?

add vpdpbusd avx512 intrinsic #4776

Conversation

folkertdev commented Dec 18, 2025

Uh oh!

rustbot commented Dec 18, 2025

Uh oh!

folkertdev Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

RalfJung Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

RalfJung left a comment • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RalfJung Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

RalfJung Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

RalfJung Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

RalfJung Dec 20, 2025

Choose a reason for hiding this comment

Uh oh!

rustbot commented Dec 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add `vpdpbusd` avx512 intrinsic #4776

add `vpdpbusd` avx512 intrinsic #4776

RalfJung left a comment •

edited by rustbot

Loading