CUDA delegate: fuse int4 kernel for better performance

Open mergennachin opened this issue 1 month ago • 3 comments

mergennachin, Nov 13 '25 16:11

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15813

Note: Links to docs will display an error until the docs builds have been completed.

:x: 4 New Failures

As of commit 3d749b5f98175be40d7156c8f8849ec348e789e9 with merge base da6306f4863f7eb16c27337cd8a42aa9d4ac4be7:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot[bot], Nov 13 '25 16:11

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example @pytorchbot label "release notes: none"

For more information, see https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

github-actions[bot], Nov 13 '25 16:11

Would it be possible to make this part of Inductor's freezing optimization? I think it would benefit all models, beyond ET use cases.

desertfire, Nov 13 '25 23:11