
Conversation

Contributor

@qjia7 qjia7 commented Aug 17, 2022

AvgPool performance is very poor in the cityscapes architecture of DeepLabV3.
With this change, AvgPool improves from 24.77 ms to 3.07 ms on TGL.
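For context, the change rewrites AvgPool as a reshape-plus-mean when the pool window spans the whole input. A rough sketch of the equivalence in plain TypeScript (hypothetical names, not the actual tfjs kernel code):

```typescript
// Hypothetical sketch (not the actual tfjs-backend-webgpu kernel): when the
// pooling window covers the full spatial extent, AvgPool over an NHWC tensor
// degenerates into a mean over the flattened H*W axis, which a reduce kernel
// can compute far more efficiently than a generic pool shader.
function globalAvgPool(
    x: number[], n: number, h: number, w: number, c: number): number[] {
  const spatial = h * w;
  const out = new Array<number>(n * c).fill(0);
  for (let b = 0; b < n; b++) {
    for (let s = 0; s < spatial; s++) {
      for (let ch = 0; ch < c; ch++) {
        // NHWC layout: the channel values of one pixel are contiguous.
        out[b * c + ch] += x[(b * spatial + s) * c + ch];
      }
    }
  }
  return out.map(v => v / spatial);  // mean over the flattened H*W axis
}
```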




@qjia7 qjia7 requested review from Linchenn and gyagp August 17, 2022 11:19
Collaborator

@Linchenn Linchenn left a comment


Thank you Jiajia! This perf improvement looks pretty great!

I am not sure if I understand correctly: does this change gain performance because WebGPU's mean (reduce) op is optimized with workgroups? If so, I don't think I can apply this idea to WebGL, because there the mean op and pool op have similar implementations.

Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @gyagp and @qjia7)


tfjs-backend-webgpu/src/kernels/AvgPool.ts line 61 at r1 (raw file):

        transpose({inputs: {x: reshapeX}, backend, attrs: {perm: [1, 0]}});
    const meanX = mean(
        {inputs: {x: transposeX}, backend, attrs: {keepDims: false, axis: 1}});

Could we avoid the transpose op here? Then we compute meanX on axis 0 directly, like:

const meanX = mean(
        {inputs: {x: reshapeX}, backend, attrs: {keepDims: false, axis: 0}});

Code quote:

    const transposeX =
        transpose({inputs: {x: reshapeX}, backend, attrs: {perm: [1, 0]}});
    const meanX = mean(
        {inputs: {x: transposeX}, backend, attrs: {keepDims: false, axis: 1}});
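The suggestion above rests on a simple identity: the mean over axis 0 of a [rows, cols] matrix equals the mean over axis 1 of its transpose, so the transpose can be skipped. A minimal sketch of that identity in plain TypeScript (illustrative only, not tfjs code):

```typescript
// Mean over axis 0: average each column of a [rows, cols] matrix.
function meanAxis0(m: number[][]): number[] {
  const rows = m.length;
  const out = new Array<number>(m[0].length).fill(0);
  for (const row of m) row.forEach((v, j) => out[j] += v);
  return out.map(v => v / rows);
}

// Transpose a [rows, cols] matrix into [cols, rows].
function transpose(m: number[][]): number[][] {
  return m[0].map((_, j) => m.map(row => row[j]));
}

// Mean over axis 1: average each row.
function meanAxis1(m: number[][]): number[] {
  return m.map(row => row.reduce((a, b) => a + b, 0) / row.length);
}

// meanAxis0(m) and meanAxis1(transpose(m)) produce the same result,
// so reducing over axis 0 makes the explicit transpose unnecessary.
```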

Contributor Author

@qjia7 qjia7 left a comment


That's one reason. Another is that using reduce makes the data access contiguous in memory.
For WebGL, I remember @pyu10055 once said that the WebGL reduction ops use a parallel algorithm that reduces the array across multiple shader calls, so using reduce may still be faster than the current pool2d algorithm. You can give it a try. For the current AvgPool op in this model, WebGPU did behave much slower than WebGL, but after this optimization it is better.
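The multi-pass parallel reduction mentioned above can be sketched on the CPU as repeated halving passes, one per shader call (illustrative only; function names are made up):

```typescript
// One reduction pass: fold the upper half of the array onto the lower half,
// mimicking one shader dispatch of a parallel tree reduction.
function reducePass(data: number[]): number[] {
  const half = Math.ceil(data.length / 2);
  const out = new Array<number>(half);
  for (let i = 0; i < half; i++) {
    const j = i + half;
    out[i] = data[i] + (j < data.length ? data[j] : 0);
  }
  return out;
}

// Repeat passes until one element remains; each iteration corresponds to
// one "shader call" in the multi-pass WebGL scheme, so an array of length n
// takes about log2(n) passes instead of a single sequential loop.
function parallelSum(data: number[]): number {
  let cur = data;
  while (cur.length > 1) cur = reducePass(cur);
  return cur[0];
}
```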

Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @gyagp and @Linchenn)


tfjs-backend-webgpu/src/kernels/AvgPool.ts line 61 at r1 (raw file):

Previously, Linchenn wrote…

Could we avoid the transpose op here? Then we compute meanX on axis 0 directly, like:

const meanX = mean(
        {inputs: {x: reshapeX}, backend, attrs: {keepDims: false, axis: 0}});

Done. Thanks.

Collaborator

@Linchenn Linchenn left a comment


Thank you for the detailed explanation! LGTM!

Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @gyagp and @Linchenn)


@gyagp gyagp left a comment


LGTM

@gyagp gyagp merged commit cf328d3 into tensorflow:master Aug 22, 2022
@qjia7 qjia7 deleted the pool_opt branch August 22, 2022 07:52
