Skip to content
This repository was archived by the owner on Aug 15, 2019. It is now read-only.

Conversation

annxingyuan
Copy link
Collaborator

@annxingyuan annxingyuan commented Jun 19, 2019

This fixes tensorflow/tfjs#1673

It turns out our current version of packed depthwise conv is only optimal for certain ratios of input size / input depth / filter size.

I think it makes sense to disable for now while I figure out how to make a more generally optimal kernel: tensorflow/tfjs#1679

Current version also has accuracy issue: tensorflow/tfjs#1652

To see the logs from the Cloud Build CI, please join either
our discussion
or announcement mailing list.


This change is Reviewable

@annxingyuan annxingyuan self-assigned this Jun 19, 2019
@annxingyuan annxingyuan requested review from nsthorat and dsmilkov June 19, 2019 12:47
Copy link
Contributor

@dsmilkov dsmilkov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reviewed 2 of 2 files at r1.
Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @dsmilkov and @nsthorat)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

packed depthwiseConv with large filter size is slow
2 participants