Bring fuzzy matching support into master #5508

hjelmn · 2018-08-02T16:13:41Z

This PR brings the fuzzy matching support developed by @mdosanjh at Sandia into pml/ob1 on master.

The fuzzy matching code is disabled by default and can be enabled on the appropriate platforms by specifying the --with-pml-ob1-matching configure option.

Signed-off-by: Nathan Hjelm <[email protected]>

This commit updates the new custom matching code in pml/ob1 so it can not be enabled with a configure option. This commit also renames the fuzzy-matching headers to avoid potential name conflicts and removes the use of C reserved identifiers. Signed-off-by: Nathan Hjelm <[email protected]>

bosilca · 2018-08-02T20:15:46Z

ompi/mca/pml/ob1/custommatch/pml_ob1_custom_match_arrays.h

+
+typedef struct custom_match_prq_node
+{
+    int32_t tags[PRQ_SIZE];


These fields seems to always be accessed together, but laying out the structure this way guarantee 5 cache misses per access which is rather expensive for such a time critical operation.

I agree that restructuring would be pertinent to allow for performant PRQ_SIZE and UMQ_SIZE resizing. The current implementation of 'pml_ob1_custom_match_arrays.h' matching engine structure is explicitly sized to fit each prq/umq node into a single cache line.

bosilca · 2018-08-02T20:24:45Z

ompi/mca/pml/ob1/custommatch/pml_ob1_custom_match_arrays.h

+
+typedef struct custom_match_umq_node
+{
+    int32_t tags[UMQ_SIZE];


same comment as for the prq struct.

bosilca · 2018-08-02T20:30:45Z

ompi/mca/pml/ob1/custommatch/pml_ob1_custom_match_fuzzy512-byte.h

+        result = _mm512_cmpeq_epi8_mask(_mm512_and_epi32(elem->keys, elem->mask), _mm512_and_epi32(search, elem->mask));
+        if(result)
+        {
+            for(i = elem->start; i <= elem->end; i++)


I would think that looping around the set bits in result will be faster as it saves few branches.

bosilca · 2018-08-02T20:31:37Z

ompi/mca/pml/ob1/custommatch/pml_ob1_custom_match_fuzzy512-byte.h

+        {
+            for(i = elem->start; i <= elem->end; i++)
+            {
+                if((0x1 << i & result) && elem->value[i])


is 0x1 really promoted to __mmask64 ?

Signed-off-by: Nathan Hjelm <[email protected]>

thananon · 2018-10-05T20:01:53Z

I have error trying to compile with new matching on Intel Xeon processor with AVX and AVX2 instruction set. GCC 7.1.

Is there something wrong on the configure or am I missing something?

Same error with --with-pml-ob1-matching=vector or fuzzy-*.

/sw/gcc/7.1.0/lib/gcc/x86_64-pc-linux-gnu/7.1.0/include/avx512fintrin.h:3573:1: error: inlining failed in call to always_inline '_mm512_set1_epi32': target specific option mismatch
 _mm512_set1_epi32 (int __A)
 ^~~~~~~~~~~~~~~~~
In file included from custommatch/pml_ob1_custom_match.h:49:0,
                 from pml_ob1_comm.h:38,
                 from pml_ob1.c:51:
custommatch/pml_ob1_custom_match_vectors.h:501:22: note: called from here
         elem->srcs = _mm512_set1_epi32(~0);
                      ^~~~~~~~~~~~~~~~~~~~~
In file included from /sw/gcc/7.1.0/lib/gcc/x86_64-pc-linux-gnu/7.1.0/include/immintrin.h:45:0,
                 from custommatch/pml_ob1_custom_match_vectors.h:17,
                 from custommatch/pml_ob1_custom_match.h:49,
                 from pml_ob1_comm.h:38,
                 from pml_ob1.c:51:
/sw/gcc/7.1.0/lib/gcc/x86_64-pc-linux-gnu/7.1.0/include/avx512fintrin.h:3573:1: error: inlining failed in call to always_inline '_mm512_set1_epi32': target specific option mismatch
 _mm512_set1_epi32 (int __A)

mdosanjh · 2018-10-05T20:11:24Z

@thananon The vector code here is writen for AVX-512 and is not currently compatible with AVX or AVX2 (or the limited AVX-512 implementation on Knights Corner).

thananon · 2018-10-05T20:13:54Z

Ah, I see. Thank you.

hjelmn force-pushed the fuzzy_match branch from b90dbbe to 9abc7b8 Compare August 2, 2018 18:22

Adding custom match source.

572694b

Signed-off-by: Nathan Hjelm <[email protected]>

hjelmn force-pushed the fuzzy_match branch from 9abc7b8 to f51988f Compare August 2, 2018 18:23

hjelmn force-pushed the fuzzy_match branch from f51988f to dd74c62 Compare August 2, 2018 19:06

bosilca reviewed Aug 2, 2018

View reviewed changes

Fixed promotion bug

c8d1348

Signed-off-by: Nathan Hjelm <[email protected]>

hjelmn force-pushed the fuzzy_match branch from 59fdf4e to c8d1348 Compare August 6, 2018 18:56

hjelmn merged commit c294bbc into open-mpi:master Aug 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bring fuzzy matching support into master #5508

Bring fuzzy matching support into master #5508

Uh oh!

hjelmn commented Aug 2, 2018

Uh oh!

bosilca Aug 2, 2018

Uh oh!

mdosanjh Aug 2, 2018

Uh oh!

bosilca Aug 2, 2018

Uh oh!

bosilca Aug 2, 2018

Uh oh!

bosilca Aug 2, 2018

Uh oh!

thananon commented Oct 5, 2018 •

edited

Loading

Uh oh!

mdosanjh commented Oct 5, 2018

Uh oh!

thananon commented Oct 5, 2018

Uh oh!

Uh oh!

Bring fuzzy matching support into master #5508

Bring fuzzy matching support into master #5508

Uh oh!

Conversation

hjelmn commented Aug 2, 2018

Uh oh!

bosilca Aug 2, 2018

Choose a reason for hiding this comment

Uh oh!

mdosanjh Aug 2, 2018

Choose a reason for hiding this comment

Uh oh!

bosilca Aug 2, 2018

Choose a reason for hiding this comment

Uh oh!

bosilca Aug 2, 2018

Choose a reason for hiding this comment

Uh oh!

bosilca Aug 2, 2018

Choose a reason for hiding this comment

Uh oh!

thananon commented Oct 5, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdosanjh commented Oct 5, 2018

Uh oh!

thananon commented Oct 5, 2018

Uh oh!

Uh oh!

thananon commented Oct 5, 2018 •

edited

Loading