[libc++] Fix endianness for algorithm mismatch #93082

zibi2 · 2024-05-22T18:07:47Z

This PR fixes the std/algorithms/alg.nonmodifying/mismatch/mismatch.pass.cpp test on big-endian platforms such as z/OS.
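
For context, here is a minimal sketch of the behaviour the test exercises: `std::mismatch` should report the first position at which two ranges differ, however the implementation vectorizes the comparison. The helper name `first_mismatch_index` is hypothetical and exists only for illustration; it is not part of libc++ or of this PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (illustration only): index of the first position at
// which `a` and `b` differ, or a.size() if every element of `a` matches.
// Precondition: b.size() >= a.size(), as required by the three-iterator
// overload of std::mismatch.
inline std::size_t first_mismatch_index(const std::vector<int>& a, const std::vector<int>& b) {
  assert(b.size() >= a.size());
  auto result = std::mismatch(a.begin(), a.end(), b.begin());
  return static_cast<std::size_t>(result.first - a.begin());
}
```

With two differing positions in the input, the expected answer is the lower index. An implementation that reads the comparison result in the wrong lane order could report a different position instead.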

github-actions · 2024-05-22T18:11:06Z

✅ With the latest revision this PR passed the C/C++ code formatter.

zibi2 · 2024-05-27T14:23:34Z

ping @philnik777 any recommendations?

libcxx/include/__algorithm/mismatch.h


llvmbot · 2024-06-03T18:37:29Z

@llvm/pr-subscribers-libcxx

Author: Zibi Sarbinowski (zibi2)

Changes

This PR fixes the std/algorithms/alg.nonmodifying/mismatch/mismatch.pass.cpp test on big-endian platforms such as z/OS.

Full diff: https://github.com/llvm/llvm-project/pull/93082.diff

1 Files Affected:

(modified) libcxx/include/__algorithm/mismatch.h (+41-6)

diff --git a/libcxx/include/__algorithm/mismatch.h b/libcxx/include/__algorithm/mismatch.h
index 632bec02406a4..bdd3314ed1ec5 100644
--- a/libcxx/include/__algorithm/mismatch.h
+++ b/libcxx/include/__algorithm/mismatch.h
@@ -55,7 +55,32 @@ __mismatch(_Iter1 __first1, _Sent1 __last1, _Iter2 __first2, _Pred& __pred, _Pro
 }
 
 #if _LIBCPP_VECTORIZE_ALGORITHMS
-
+template <class _ValueType, size_t _Np>
+_LIBCPP_NODISCARD _LIBCPP_HIDE_FROM_ABI __simd_vector<long long, _Np>
+__reverse_vector(__simd_vector<long long, _Np> __cmp_res) {
+  return [&]<size_t... _Indices>(index_sequence<_Indices...>) {
+    return __builtin_shufflevector(__cmp_res, __cmp_res, (_Np - _Indices - 1)...);
+  }(make_index_sequence<_Np>{});
+}
+template <class _ValueType, size_t _Np>
+_LIBCPP_NODISCARD _LIBCPP_HIDE_FROM_ABI __simd_vector<long, _Np> __reverse_vector(__simd_vector<long, _Np> __cmp_res) {
+  return [&]<size_t... _Indices>(index_sequence<_Indices...>) {
+    return __builtin_shufflevector(__cmp_res, __cmp_res, (_Np - _Indices - 1)...);
+  }(make_index_sequence<_Np>{});
+}
+template <class _ValueType, size_t _Np>
+_LIBCPP_NODISCARD _LIBCPP_HIDE_FROM_ABI __simd_vector<int, _Np> __reverse_vector(__simd_vector<int, _Np> __cmp_res) {
+  return [&]<size_t... _Indices>(index_sequence<_Indices...>) {
+    return __builtin_shufflevector(__cmp_res, __cmp_res, (_Np - _Indices - 1)...);
+  }(make_index_sequence<_Np>{});
+}
+template <class _ValueType, size_t _Np>
+_LIBCPP_NODISCARD _LIBCPP_HIDE_FROM_ABI __simd_vector<_ValueType, _Np>
+__reverse_vector(__simd_vector<_ValueType, _Np> __cmp_res) {
+  return [&]<size_t... _Indices>(index_sequence<_Indices...>) {
+    return __builtin_shufflevector(__cmp_res, __cmp_res, (_Np - _Indices - 1)...);
+  }(make_index_sequence<_Np>{});
+}
 template <class _Iter>
 _LIBCPP_NODISCARD _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_SINCE_CXX20 pair<_Iter, _Iter>
 __mismatch_vectorized(_Iter __first1, _Iter __last1, _Iter __first2) {
@@ -77,7 +102,11 @@ __mismatch_vectorized(_Iter __first1, _Iter __last1, _Iter __first2) {
       }
 
       for (size_t __i = 0; __i != __unroll_count; ++__i) {
-        if (auto __cmp_res = __lhs[__i] == __rhs[__i]; !std::__all_of(__cmp_res)) {
+        auto __cmp_res = __lhs[__i] == __rhs[__i];
+#  if defined(_LIBCPP_BIG_ENDIAN)
+        __cmp_res = std::__reverse_vector<__value_type>(__cmp_res);
+#  endif
+        if (!std::__all_of(__cmp_res)) {
           auto __offset = __i * __vec_size + std::__find_first_not_set(__cmp_res);
           return {__first1 + __offset, __first2 + __offset};
         }
@@ -89,8 +118,11 @@ __mismatch_vectorized(_Iter __first1, _Iter __last1, _Iter __first2) {
 
     // check the remaining 0-3 vectors
     while (static_cast<size_t>(__last1 - __first1) >= __vec_size) {
-      if (auto __cmp_res = std::__load_vector<__vec>(__first1) == std::__load_vector<__vec>(__first2);
-          !std::__all_of(__cmp_res)) {
+      auto __cmp_res = std::__load_vector<__vec>(__first1) == std::__load_vector<__vec>(__first2);
+#  if defined(_LIBCPP_BIG_ENDIAN)
+      __cmp_res = std::__reverse_vector<__value_type>(__cmp_res);
+#  endif
+      if (!std::__all_of(__cmp_res)) {
         auto __offset = std::__find_first_not_set(__cmp_res);
         return {__first1 + __offset, __first2 + __offset};
       }
@@ -106,8 +138,11 @@ __mismatch_vectorized(_Iter __first1, _Iter __last1, _Iter __first2) {
     if (static_cast<size_t>(__first1 - __orig_first1) >= __vec_size) {
       __first1 = __last1 - __vec_size;
       __first2 = __last2 - __vec_size;
-      auto __offset =
-          std::__find_first_not_set(std::__load_vector<__vec>(__first1) == std::__load_vector<__vec>(__first2));
+      auto __cmp_res = std::__load_vector<__vec>(__first1) == std::__load_vector<__vec>(__first2);
+#  if defined(_LIBCPP_BIG_ENDIAN)
+      __cmp_res = std::__reverse_vector<__value_type>(__cmp_res);
+#  endif
+      auto __offset = std::__find_first_not_set(__cmp_res);
       return {__first1 + __offset, __first2 + __offset};
     } // else loop over the elements individually
   }
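
The four `__reverse_vector` overloads in the diff share one body: the pack expansion `(_Np - _Indices - 1)...` produces the lane indices `N-1, ..., 0`, so the shuffle returns the lanes in reverse order. The sketch below shows the same index-pack technique on a plain `std::array`, so that it compiles without compiler vector extensions. The names `lanes`, `reverse_lanes` and `reverse_lanes_impl` are hypothetical, and `std::array` is only a stand-in for `__simd_vector` and `__builtin_shufflevector`.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <utility>

// Illustrative stand-in for a SIMD vector: a fixed-size array of lanes.
template <class T, std::size_t N>
using lanes = std::array<T, N>;

// Expands the index pack: output lane i takes input lane N - i - 1.
template <class T, std::size_t N, std::size_t... Indices>
constexpr lanes<T, N> reverse_lanes_impl(const lanes<T, N>& v, std::index_sequence<Indices...>) {
  return lanes<T, N>{{v[N - Indices - 1]...}};
}

// Returns a copy of `v` with its lanes in reverse order.
template <class T, std::size_t N>
constexpr lanes<T, N> reverse_lanes(const lanes<T, N>& v) {
  return reverse_lanes_impl(v, std::make_index_sequence<N>{});
}
```

Because the element type is deduced, a single template covers every lane type in this sketch. Reversing twice gives back the original order.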

philnik777

LGTM with the nit addressed.

libcxx/include/__algorithm/simd_utils.h

This PR is required to fix the `std/algorithms/alg.nonmodifying/mismatch/mismatch.pass.cpp` test on big-endian platforms such as z/OS.

zibi2 requested a review from philnik777 May 22, 2024 18:07

zibi2 self-assigned this May 22, 2024

zibi2 changed the title ~~Fix endianess for algorithm mismatch~~ Fix endianness for algorithm mismatch May 22, 2024

zibi2 force-pushed the zs_mismatch_big_endian branch from 11df170 to 05395e0 Compare May 22, 2024 19:49

Fix endianess for algorithm mismatch

e366cb1

zibi2 changed the title ~~Fix endianness for algorithm mismatch~~ [libc++] Fix endianness for algorithm mismatch May 22, 2024

Update based on the latest changes

05395e0

zibi2 force-pushed the zs_mismatch_big_endian branch 2 times, most recently from 6cb78f6 to b4ce872 Compare May 23, 2024 20:10

zibi2 added 2 commits May 23, 2024 20:15

Add more __reverse_vector overloads

b4ce872

Try to fix windows CI

40cab24

philnik777 reviewed Jun 1, 2024

libcxx/include/__algorithm/mismatch.h (outdated, resolved)

Based on the suggestion, apply the variadic template technique to reduce code

42f64c8

zibi2 marked this pull request as ready for review June 3, 2024 18:36

zibi2 requested a review from a team as a code owner June 3, 2024 18:37

zibi2 requested a review from philnik777 June 3, 2024 18:37

llvmbot added the libc++ label (libc++ C++ Standard Library. Not GNU libstdc++. Not libc++abi.) Jun 3, 2024

zibi2 added 2 commits June 3, 2024 18:43

fix formatting

8cddb83

attempt to fix CI

9bf257d

zibi2 force-pushed the zs_mismatch_big_endian branch from 826d205 to 5eba555 Compare June 10, 2024 13:36

Make __find_first_set endianness aware

5eba555
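
As a rough illustration of what "endianness aware" can mean for a find-first helper, the sketch below packs per-lane comparison flags into a bitmask and searches it from whichever end holds lane 0. The bit layout is an assumption made for this example (lane 0 in the least significant bit on little-endian, in the most significant bit on big-endian). The names `kLanes`, `pack_mask` and `find_first_not_set` are hypothetical and are not the libc++ internals touched by this commit.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Number of lanes in the illustrative mask (one bit per lane).
constexpr std::size_t kLanes = 8;

// Hypothetical helper: packs per-lane "equal" flags into an 8-bit mask.
// ASSUMPTION for this sketch: with lane0_is_msb == true, lane 0 is stored in
// the most significant bit (the big-endian layout assumed here); otherwise
// lane 0 is stored in the least significant bit.
inline std::uint8_t pack_mask(const bool (&equal)[kLanes], bool lane0_is_msb) {
  std::uint8_t mask = 0;
  for (std::size_t i = 0; i != kLanes; ++i) {
    std::size_t bit = lane0_is_msb ? kLanes - 1 - i : i;
    if (equal[i])
      mask = static_cast<std::uint8_t>(mask | (1u << bit));
  }
  return mask;
}

// Index of the first lane whose flag is NOT set, or kLanes if all are set.
// The scan starts at the end of the mask that holds lane 0.
inline std::size_t find_first_not_set(std::uint8_t mask, bool lane0_is_msb) {
  for (std::size_t i = 0; i != kLanes; ++i) {
    std::size_t bit = lane0_is_msb ? kLanes - 1 - i : i;
    if (((mask >> bit) & 1u) == 0)
      return i;
  }
  return kLanes;
}
```

Reading a mask with the wrong layout flag yields a mirrored index, which is the kind of wrong offset a big-endian target would see if the helper always scanned from the least significant bit. A production implementation would more likely use bit-counting operations than a loop; the loop is used here only to keep the sketch portable.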

philnik777 approved these changes Jun 11, 2024

libcxx/include/__algorithm/simd_utils.h (outdated, resolved)

zibi2 merged commit ffc3a6b into llvm:main Jun 11, 2024
9 of 11 checks passed

Remove conditinal statement for #include directive

cf070c4

HerrCai0907 mentioned this pull request Jun 13, 2024

tidy #95384

Closed
