-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add SIMD operations that use f16 and f128 #125440
Comments
@rustbot label A-simd T-libs F-f16_and_f128 E-help-wanted C-feature-request -needs-triage |
Nvidia ptx ( Making this work is an important step for making the ptx target "feature complete" with languages traditionally used for GPGPU. Let me know if there's anything I can do to support this. |
Thanks, I'll add that to the top list. It looks like it might not be too hard to add new simd intrinsics on that platform? I have no clue but https://github.com/rust-lang/stdarch/blob/df3618d9f35165f4bc548114e511c49c29e1fd9b/crates/core_arch/src/nvptx/mod.rs is pretty straightforward if you want to give it a shot at some point |
I just tested I looked a bit around in SIMD instructions for other arches and I think this is, as you say, pretty straightforward. I will give it a shot. Hopefully I will get around to creating a PR next week. |
That is great news! Note that unfortunately math symbols aren’t yet available on all targets so testing with the new types is kind of weird sometimes, but hopefully that will be resolved in a week or so with a compiler_builtins update. |
Took me a bit longer than I originally hoped for but I ended up creating a PR for (most) nvptx I have also noticed that we're lacking |
Awesome news, thanks for the update! Looks like there is an open PR to get the new changes #128866.
I'll add it to the issue, no particular reason outside of being lower priority than the intrinsics. |
Another datapoint: ARM Helium (a.k.a. Cortex-M Vector Extension) also support SIMD instructions for f16 but not for f128: https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:@navigationhierarchiessimdisa=[[Neon],Neon,Helium]&f:@navigationhierarchiesreturnbasetype=[[float]]&f:@navigationhierarchieselementbitsize=[[16]]&q= |
Eventually we will want to be able to make use of simd operations for f16 and f128, now that we have primitives to represent them. Possibilities that I know of:
float16x{4,8}
https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:@navigationhierarchiessimdisa=[Neon]&f:@navigationhierarchiesreturnbasetype=[float]&f:@navigationhierarchieselementbitsize=[16]&q=.float16x{1,2}
https://developer.arm.com/architectures/instruction-sets/intrinsics/#f:@navigationhierarchiesreturnbasetype=[float]&f:@navigationhierarchieselementbitsize=[16]&f:@navigationhierarchiessimdisa=[sve2,sve]&q=Probably some work/research overlap with adding assembly #125398
Tracking issue: #116909
The text was updated successfully, but these errors were encountered: