Optimize u8x8::trailing_zeros for AArch64

LLVM's `cttz.v8i8` intrinsic is broken on AArch64 machines: https://github.com/rust-lang-nursery/packed_simd/issues/191

Our current workaround just applies `u8::trailing_zeros` to each lane. With 8 lanes, that can be quite slow.

It could be optimized by adapting LLVM's algorithm to Rust's [AArch64 SIMD intrinsics](https://doc.rust-lang.org/core/arch/aarch64/index.html) (some may be missing and we would have to implement those as well: https://github.com/rust-lang-nursery/stdsimd/issues/40).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize u8x8::trailing_zeros for AArch64 #193

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Optimize u8x8::trailing_zeros for AArch64 #193

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions