Support variable-index swizzles #226

reinerp · 2022-01-17T06:06:46Z

Both x86-SSE and ARM-NEON provide byte-level shuffle instructions with indices coming from a register: pshufb on x86 and the tbl instructions on ARM. The core::simd API exposes these for constant indices via the simd_swizzle macro, but as far as I can tell, there's no support for variable indices exposed by core::simd. It would be valuable to provide this, either just for u8 vectors, or perhaps for all vectors?

Variable-index swizzles can be very valuable in some scenarios. (1) Often they are used as table lookup, e.g. pshufb provides a 16-way-parallel table lookup from a 16-element table of bytes. (2) Often in compression/filtering/sorting scenarios the shuffle needs to be computed based on a dynamic calculation.

Here's one potential challenge and one potential solution. x86-SSE and ARM-NEON have matching semantics on in-range indices (select the indexed byte), but have different semantics on out-of-range indices: ARM-NEON returns zero on any out-of-range index, whereas x86-SSE returns zero only if the top bit of the byte is set. The choice taken by wasm-simd, for example, is ARM semantics: https://github.com/WebAssembly/simd/blob/main/proposals/simd/SIMD.md#swizzling-using-variable-indices. Going with those semantics seems like a plausible choice.

The text was updated successfully, but these errors were encountered:

programmerjake · 2022-01-17T06:18:06Z

iirc llvm doesn't have an architecture independent dynamic swizzle instruction/intrinsic.

calebzulawski · 2022-01-17T06:42:59Z

Yeah, LLVM doesn't have an intrinsic for non-const index shuffles. std::simd doesn't have anything (else) preventing it and this is something I'm interested in.

This is hard to implement without an intrinsic, for example: x86-64 doesn't have a shuffle instruction unless at least SSSE3 is available, but std must work all the way down to base the base architecture.

Maybe LLVM could relocate the wasm shuffle lowering code to an LLVM intrinsic, and then we would be able to take advantage of that.

Lokathor · 2022-01-17T07:10:49Z

That there isn't a LLVM intrinsic for this isn't really a blocker, as a fallback the operation can be easily written by casting to an array of bytes, making a new array with a bunch of indexing, and casting back.

The SSSE3 thing also isn't a big deal, since it's easy to do the conditional compilation, and the function would inevitably be generic and/or marked as inline anyway.

calebzulawski · 2022-01-17T09:00:14Z

Conditional compilation doesn't really work--even if the function is inline, std is compiled without any features.

programmerjake · 2022-01-17T09:28:01Z

see additional discussion in #11

Lokathor · 2022-01-17T14:00:26Z

oh crap right because cfg is pre codegen

RalfJung · 2022-03-17T16:01:10Z

Duplicated by #242?

programmerjake · 2022-03-17T16:30:11Z

#242 is more detailed and has step-by-step task lists, so imho this issue should be closed in favor of #242.

calebzulawski · 2022-05-22T02:00:14Z

No point having two issues open, closing this one in favor of that one.

reinerp added the C-feature-request Category: a feature request, i.e. not implemented / a PR label Jan 17, 2022

calebzulawski closed this as not planned Won't fix, can't repro, duplicate, stale May 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support variable-index swizzles #226

Support variable-index swizzles #226

reinerp commented Jan 17, 2022

programmerjake commented Jan 17, 2022

Uh oh!

calebzulawski commented Jan 17, 2022

Uh oh!

Lokathor commented Jan 17, 2022

Uh oh!

calebzulawski commented Jan 17, 2022

Uh oh!

programmerjake commented Jan 17, 2022

Uh oh!

Lokathor commented Jan 17, 2022 •

edited

Loading

Uh oh!

RalfJung commented Mar 17, 2022

Uh oh!

programmerjake commented Mar 17, 2022

Uh oh!

calebzulawski commented May 22, 2022

Uh oh!

Support variable-index swizzles #226

Support variable-index swizzles #226

Comments

reinerp commented Jan 17, 2022

programmerjake commented Jan 17, 2022

Uh oh!

calebzulawski commented Jan 17, 2022

Uh oh!

Lokathor commented Jan 17, 2022

Uh oh!

calebzulawski commented Jan 17, 2022

Uh oh!

programmerjake commented Jan 17, 2022

Uh oh!

Lokathor commented Jan 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RalfJung commented Mar 17, 2022

Uh oh!

programmerjake commented Mar 17, 2022

Uh oh!

calebzulawski commented May 22, 2022

Uh oh!

Lokathor commented Jan 17, 2022 •

edited

Loading