mxnet icon indicating copy to clipboard operation
mxnet copied to clipboard

[FEATURE] Add query_keys transformer version without split

Open agrabows opened this issue 3 years ago • 2 comments

Description

MXNet is fusing split, reshape, swapaxis and batch_dot operators for performance purpose. In gpt-2 model this fuse could be done as well if we exclude split. image -> image

Checklist

Essentials

  • [x] PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • [x] Changes are complete (i.e. I finished coding on this PR)
  • [x] All changes have test coverage
  • [x] Code is well-documented

agrabows avatar Aug 08 '22 19:08 agrabows

Hey @agrabows , Thanks for submitting the PR All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following commands:

  • To trigger all jobs: @mxnet-bot run ci [all]
  • To trigger specific jobs: @mxnet-bot run ci [job1, job2]

CI supported jobs: [unix-cpu, unix-gpu, website, windows-cpu, centos-gpu, edge, sanity, centos-cpu, windows-gpu, miscellaneous, clang]


Note: Only following 3 categories can trigger CI :PR Author, MXNet Committer, Jenkins Admin. All CI tests must pass before the PR can be merged.

mxnet-bot avatar Aug 08 '22 19:08 mxnet-bot

image

agrabows avatar Aug 10 '22 16:08 agrabows

@mxnet-bot run ci [unix-gpu]

agrabows avatar Aug 22 '22 15:08 agrabows

Jenkins CI successfully triggered : [unix-gpu]

mxnet-bot avatar Aug 22 '22 15:08 mxnet-bot