Extend Transformers Trainer Class to Enable PyTorch Torchscript for Inference #17153
Conversation
This may be more optimum's domain than Transformers, what do you think @mfuntowicz @LysandreJik @lewtun?
Thanks for the ping and thank you @jianan-gu for opening the PR! I agree that inference-based improvements are better suited in `optimum`. Having said that, I wonder if it's overkill to create a new package just to support this feature, which is natively included in PyTorch. @mfuntowicz @michaelbenayoun do you see any other benefits that could be had by having a dedicated subpackage for it?
I'd be happy to work on this PR if you give me a green light that you'll accept this feature. We can mark it as experimental, should you decide that it'd fit better elsewhere down the road, so it would be easier to experiment and try it out.
Thanks for adding this. We would need some tests before we can merge the PR, and the code needs to be adapted to the way the labels are handled in Transformers.
Co-authored-by: Sylvain Gugger <[email protected]>
Thanks for addressing the comments, this will also need a test before we can merge.
Co-authored-by: Sylvain Gugger <[email protected]>
Adding UT tests in test_trainer.py for jit, which cover:
Thanks for adding the tests! I have a few last small comments.
Do you want to have a second look @stas00?
What is the sound of a tree falling in the forest when there is nobody to hear it?
All these new features need to be documented on the user side so that they actually get used. So, the same as with IPEX, let's add user docs and, as before, ideally a small benchmark to give users an incentive to try the feature.
Update: I added a doc for inference, since we hadn't created one yet, so you can push the usage examples there, like we did with the IPEX PR. Please fill in the blanks and then we are good to go, I think. I also added a few nits in the code.
Co-authored-by: Stas Bekman <[email protected]>
## JIT-mode

XXX: fill in the details
but isn't your PR adding support for inference only?
jit_mode_eval (`bool`, *optional*, defaults to `False`):
Whether or not to use PyTorch jit trace for inference.
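(For context, a minimal sketch of what enabling this flag looks like from Python; the checkpoint and dataset below are illustrative choices, not taken from the PR:)

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

# Illustrative checkpoint and dataset; any classification model/eval set would do.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

eval_dataset = load_dataset("glue", "sst2", split="validation")
eval_dataset = eval_dataset.map(
    lambda ex: tokenizer(ex["sentence"], truncation=True, padding="max_length"),
    batched=True,
)

args = TrainingArguments(
    output_dir="/tmp/jit_eval",
    per_device_eval_batch_size=8,
    jit_mode_eval=True,  # trace the eval model with torch.jit.trace before inference
)
trainer = Trainer(model=model, args=args, eval_dataset=eval_dataset)
print(trainer.evaluate())  # evaluation now runs on the traced TorchScript model
```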
Yes, it is for inference only.
So why the generic `torch_jit_model` name? Let's name it accordingly, i.e. add `_eval` to it.
Sure, have made changes accordingly, thanks for your suggestion.
Thank you, @jianan-gu - please ping me when it's ready for a final review.
Though this jit mode works for both CPU and GPU, the current IPEX release covers CPU-side optimizations with jit mode for model inference. Therefore we have only added and updated the inference doc for CPU (perf_infer_cpu.mdx); please take a review of the contents @stas00, thanks. (For the small benchmark: for now we only have relative performance numbers like those shown in #17137, so we will prepare the numbers with --skip_memory_metrics 0 as a follow-up.)
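(For reference, a hedged sketch of how these CPU-side options might be combined through TrainingArguments; the flag names are taken from the Trainer options of this era, and use_ipex additionally requires intel_extension_for_pytorch to be installed:)

```python
from transformers import TrainingArguments

# CPU-only evaluation with both IPEX optimizations and jit trace enabled,
# matching the scope of perf_infer_cpu.mdx.
args = TrainingArguments(
    output_dir="/tmp/cpu_infer",
    do_eval=True,
    no_cuda=True,        # stay on CPU
    use_ipex=True,       # IPEX CPU optimizations (from the companion IPEX PR)
    jit_mode_eval=True,  # trace the eval model with torch.jit.trace
)
```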
last few small things to adjust and then it should be good.
Co-authored-by: Stas Bekman <[email protected]>
@sgugger, would you like to have a quick look at the last changes since your review, mainly the newly added doc? Thank you! Otherwise it's good to be merged.
Some small comments on the new doc, thanks for adding it!
Co-authored-by: Sylvain Gugger <[email protected]>
What does this PR do?
This PR intends to extend the Transformers Trainer class with PyTorch TorchScript (`torch.jit.trace`) to speed up model inference with just-in-time compilation. Users can simply enable the `--jit_mode` Trainer input arg to get the benefits.
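(For illustration, a minimal hedged sketch of the underlying technique on a standalone model, outside the Trainer; the checkpoint is an arbitrary example, and torchscript=True is used here so the model returns plain tuples that torch.jit.trace can handle, while the Trainer integration wires this up internally:)

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# torchscript=True makes the model return tuples, which torch.jit.trace requires.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", torchscript=True
)
model.eval()
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

inputs = tokenizer("TorchScript can speed up inference.", return_tensors="pt")

with torch.no_grad():
    # Trace once with example inputs, then reuse the compiled module for inference.
    traced = torch.jit.trace(model, (inputs["input_ids"], inputs["attention_mask"]))
    logits = traced(inputs["input_ids"], inputs["attention_mask"])[0]
print(logits.shape)
```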
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.