
Conversation

@JulianneKnott
Contributor

Updated generate.py to be compatible with Fairseq v0.10.2.
Speed tbd.

@JiushengChen
Contributor

Avoid checking in those .txt files and the old backup file, "generate_old.py".

Contributor

@feihugis left a comment

Thanks @JulianneKnott for this PR! I did a pass and have one question about the files under results: are they used for comparing the outputs from fastseq and fairseq? It will be very helpful if we can add a unit test to automatically check it. We can also integrate it with our CI in the next step.

@JulianneKnott
Contributor Author

> Thanks @JulianneKnott for this PR! I did a pass and have one question about the files under results: are they used for comparing the outputs from fastseq and fairseq? It will be very helpful if we can add a unit test to automatically check it. We can also integrate it with our CI in the next step.

Yes. Didn't mean to commit them and will remove them on the next commit. Will look into adding a unit test also.
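A minimal sketch of such a unit test, assuming the two runs each write one hypothesis per line to text files (the paths below are placeholders for illustration, not the files that were accidentally committed here):

```python
import unittest


def load_hypotheses(path):
    """Read one generated hypothesis per line, dropping the trailing newline."""
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f]


class TestFastseqMatchesFairseq(unittest.TestCase):
    # Placeholder paths -- the real files would come from running generate.py
    # once with plain fairseq and once with fastseq enabled.
    FAIRSEQ_OUT = "results/fairseq_v0.10.2.hypo"
    FASTSEQ_OUT = "results/fastseq_v0.0.4.hypo"

    def test_outputs_match(self):
        fairseq_hypos = load_hypotheses(self.FAIRSEQ_OUT)
        fastseq_hypos = load_hypotheses(self.FASTSEQ_OUT)
        self.assertEqual(len(fairseq_hypos), len(fastseq_hypos))
        for i, (a, b) in enumerate(zip(fairseq_hypos, fastseq_hypos)):
            self.assertEqual(a, b, f"Hypothesis {i} differs between fairseq and fastseq")


if __name__ == "__main__":
    unittest.main()
```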

@JulianneKnott
Copy link
Contributor Author

Benchmark Info:

| Util | Model | Task | Split | BatchSize | Samples | Tokens | Bleu | Rouge | Loss | Perplexity | Runtime (s) | Throughput (samples/s) | Throughput (tokens/s) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fairseq_v0.10.2 | bart.large.cnn | cnn_dm/len-1024.bin | valid | 32 | 13367 | 1098036 | 17.93 | NA | NA | NA | 3875 | 3.4 | 283.4 |
| fairseq_v0.10.2+fastseq_v0.0.4 | bart.large.cnn | cnn_dm/len-1024.bin | valid | 32 | 13367 | 1098036 | 17.93 | NA | NA | NA | 3028 | 4.4 | 362.6 |
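(For reference, the throughput columns follow directly from the table: 13367 samples / 3875 s ≈ 3.4 samples/s for the fairseq baseline versus 13367 / 3028 ≈ 4.4 samples/s with fastseq, roughly a 1.28× speedup at batch size 32.)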

Contributor

@feihugis left a comment

The current code looks good to me. As this change will break other parts of the main branch, could we create another branch (e.g., fairseq-0.10.2) for this work? Once all the tasks are finished, it can be merged into the main branch.

@JulianneKnott
Contributor Author

Updated beam search optimizer.
Benchmarks:

| Util | Model | Task | Split | BatchSize | Samples | Tokens | Bleu | Rouge | Loss | Perplexity | Runtime (s) | Throughput (samples/s) | Throughput (tokens/s) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fairseq_v0.10.2+fastseq_v0.0.4 | bart.large.cnn | cnn_dm/len-1024.bin | valid | 256 | 13367 | 1098278 | 17.90 | NA | NA | NA | 718 | 18.6 | 1529.6 |

@JulianneKnott changed the title from "Fairseq v0.10.2 compatible - generate.py updated" to "Fairseq v0.10.2 compatible" on Sep 15, 2021
@JulianneKnott
Contributor Author

Updated EL-attention optimizer.
Benchmarks:

| Util | Model | Task | Split | BatchSize | Samples | Tokens | Bleu | Rouge | Loss | Perplexity | Runtime (s) | Throughput (samples/s) | Throughput (tokens/s) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fairseq_v0.10.2+fastseq_v0.0.4 | bart.large.cnn | cnn_dm/len-1024.bin | valid | 320 | 13367 | 1098535 | 17.92 | NA | NA | NA | 621 | 21.5 | 1769.0 |

Contributor

@feihugis left a comment

Thanks @JulianneKnott! LGTM. Just found some very minor issues. After fixing them, the PR will be good to merge.


def step(self, step, lprobs, scores):
    super()._init_buffers(lprobs)

class BeamSearch(BeamSearch):
Contributor

I'm not sure whether people will get confused when the child class has the same name as the parent class. What does @JulianneKnott think of it?
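For illustration only (not what this PR does), one way to sidestep the same-name confusion is to alias the fairseq parent on import; this sketch assumes fairseq v0.10.2's `fairseq.search.BeamSearch` as the base:

```python
# Aliasing the fairseq parent avoids a subclass and its base class sharing
# the exact same visible name inside the module.
from fairseq.search import BeamSearch as FairseqBeamSearch


class BeamSearch(FairseqBeamSearch):
    """Optimized beam search; the public name stays `BeamSearch` so callers
    that import it by name keep working."""

    def step(self, step, lprobs, scores):
        # Fall back to the fairseq implementation; any fastseq-specific
        # optimizations would replace or wrap this call.
        return super().step(step, lprobs, scores)
```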

Contributor

@yuyan2do left a comment

Looks good. Just added a few suggestions inline.

Another question: is there any test covering the conversion of a model to its TorchScript version? If not, we need to do it manually to verify that our change does not break this functionality.
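A minimal sketch of such a manual check, using `torch.jit.script` (the standard PyTorch entry point for TorchScript conversion); the toy module below is a placeholder for the real fairseq/fastseq model:

```python
import torch


class ToyEncoder(torch.nn.Module):
    """Stand-in for the real model; the actual check would script the
    fairseq/fastseq model (or its search module) instead."""

    def __init__(self):
        super().__init__()
        self.proj = torch.nn.Linear(16, 16)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.relu(self.proj(x))


def check_torchscript(module: torch.nn.Module) -> None:
    # Scripting raises if the module uses constructs TorchScript cannot
    # compile, which is exactly the regression we want to catch.
    scripted = torch.jit.script(module)
    x = torch.randn(2, 16)
    assert torch.allclose(scripted(x), module(x))


check_torchscript(ToyEncoder())
```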

    metavar='N',
    help='number of worker for post process')
parser.add_argument(
    '--decode_hypothesis',
Contributor

Where is this param used?

Contributor Author

It's used in `if args.decode_hypothesis:`, and also on lines 211, 218, and 240 in the same file.
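For context, a flag like this is typically declared as a boolean switch and then branched on during post-processing. A minimal sketch only, not the actual generate.py code (the `store_true` action and help text are assumptions):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    '--decode_hypothesis',
    action='store_true',
    help='decode generated hypothesis token IDs back into text')
args = parser.parse_args(['--decode_hypothesis'])

if args.decode_hypothesis:
    # In generate.py this branch would convert token IDs back into strings
    # (e.g. via the model's BPE/dictionary decoding) before writing output.
    print('decoding hypotheses to text')
```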

@JulianneKnott merged commit 1974223 into main on Oct 27, 2021
@JulianneKnott deleted the fairseq_v0.10.2_compatible branch on October 27, 2021 at 23:47