perf: add native AVX2 uint64/int64 mul kernel by DiamonDinoia · Pull Request #1306 · xtensor-stack/xsimd

DiamonDinoia · 2026-04-16T18:50:07Z

Previously, mul on batch<[u]int64_t, avx2> fell through to the AVX kernels, which have no integer mul, and from there to SSE4.1. That path split each 256-bit register into two 128-bit halves (vextracti128/vinserti128) and ran the mul_epu32 sequence twice.

Add a sizeof(T)==8 specialization that uses _mm256_mul_epu32 directly, mirroring the SSE4.1 pattern with 256-bit intrinsics. It generates 8 ymm ops (2 vpshufd, 3 vpmuludq, 2 vpaddq, 1 vpsllq) with no lane splitting.
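For readers unfamiliar with the mul_epu32 pattern, here is a rough sketch of the arithmetic behind it. It is not the code merged in this PR: the names mul64_via_32 and mul_epi64_avx2 are made up for illustration, and the intrinsic version is only compiled when the compiler defines __AVX2__. Each 64-bit lane is split into 32-bit halves, and the product modulo 2^64 is a_lo*b_lo + ((a_hi*b_lo + a_lo*b_hi) << 32). The a_hi*b_hi term only affects bits 64 and above, so it is dropped. The low 64 bits are the same for signed and unsigned operands, which is why one kernel can serve both int64_t and uint64_t.

```cpp
#include <cstdint>
#ifdef __AVX2__
#include <immintrin.h>
#endif

// Scalar model of a single 64-bit lane (hypothetical helper, for illustration).
// Uses three 32x32->64 multiplies, the operation vpmuludq performs per lane.
inline std::uint64_t mul64_via_32(std::uint64_t a, std::uint64_t b)
{
    const std::uint64_t a_lo = a & 0xFFFFFFFFu, a_hi = a >> 32;
    const std::uint64_t b_lo = b & 0xFFFFFFFFu, b_hi = b >> 32;
    const std::uint64_t lo = a_lo * b_lo;                  // vpmuludq
    const std::uint64_t cross = a_hi * b_lo + a_lo * b_hi; // 2 vpmuludq + vpaddq
    return lo + (cross << 32);                             // vpsllq + vpaddq
}

#ifdef __AVX2__
// Sketch of the same steps on four 64-bit lanes at once (assumed shape,
// modelled on the op counts listed in the PR description).
inline __m256i mul_epi64_avx2(__m256i a, __m256i b)
{
    // Swap the 32-bit halves of every 64-bit lane (vpshufd) so the high
    // halves land in the positions that vpmuludq reads.
    const __m256i a_swap = _mm256_shuffle_epi32(a, _MM_SHUFFLE(2, 3, 0, 1));
    const __m256i b_swap = _mm256_shuffle_epi32(b, _MM_SHUFFLE(2, 3, 0, 1));
    const __m256i lo = _mm256_mul_epu32(a, b);      // a_lo * b_lo
    const __m256i c0 = _mm256_mul_epu32(a_swap, b); // a_hi * b_lo
    const __m256i c1 = _mm256_mul_epu32(a, b_swap); // a_lo * b_hi
    const __m256i cross = _mm256_slli_epi64(_mm256_add_epi64(c0, c1), 32);
    return _mm256_add_epi64(lo, cross);
}
#endif
```

The scalar helper can be compared against the built-in 64-bit multiply on any platform, which makes it a convenient reference when testing a vector kernel of this shape.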

AVX512F (without DQ) also benefits since it forwards to the AVX2 kernel.

DiamonDinoia · 2026-04-16T18:50:19Z

@serge-sans-paille this is a small one :)

serge-sans-paille · 2026-04-16T19:14:41Z

yep, looks good!

serge-sans-paille · 2026-04-16T19:15:32Z

You should have the right to merge now, feel free to do so once green.

DiamonDinoia assigned serge-sans-paille Apr 16, 2026

serge-sans-paille approved these changes Apr 16, 2026

DiamonDinoia merged commit d05d46f into xtensor-stack:master Apr 16, 2026
73 checks passed

DiamonDinoia merged 1 commit into xtensor-stack:master from DiamonDinoia:fix/avx2-uint64-mul
