FastGelu custom htp kernel
python3 examples/qualcomm/custom_op/custom_ops_fast_gelu.py --build_folder build-android -m SM8750 --op_package_dir examples/qualcomm/custom_op/fastgelu_op_package_htp/FastGeluOpPackage --build_op_package
:link: Helpful Links
:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15507
- :page_facing_up: Preview Python docs built from this PR
Note: Links to docs will display an error until the docs builds have been completed.
:x: 14 New Failures, 1 Unrelated Failure
As of commit 5347bcef1f3b50c3dfd8611f24ac7cdcb093a62c with merge base f81d7688fd2d7f0c8cce7d8644afa662b8afe560 ():
NEW FAILURES - The following jobs have failed:
-
Lint / lintrunner / linux-job (gh)
>>> Lint for examples/qualcomm/custom_op/fastgelu_op_package_htp/FastGeluOpPackage/src/FastGeluOpPackageInterface.cpp: -
pull / android / build-llm-demo / linux-job (gh)
RuntimeError: Command docker exec -t 2000942a0dbc551a728ed46acc62f3557d8bcfe00a186fe4151c7b4bdcabf460 /exec failed with exit code 1 -
pull / test-static-llama-qnn-linux (stories_110m) / linux-job (gh)
RuntimeError: Command docker exec -t 18d69b0fc5a4ddcec2a1d11afe20a7a6d96149859a2ba6cb325a8950db4bfa66 /exec failed with exit code 127 -
pull / test-static-llama-qnn-linux (stories_260k_bc) / linux-job (gh)
RuntimeError: Command docker exec -t fe48e6347de21bd912fb8eb9701cbb7216fb1736aab5592404028527aa3cef37 /exec failed with exit code 127 -
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
RuntimeError: Command docker exec -t 78e8e133ee71f236b8ddcf588fa19c4cba6e7524830a7c20b06e1779067dec06 /exec failed with exit code 1 -
pull / unittest-nxp-neutron / linux-job (gh)
RuntimeError: Command docker exec -t 2cd089bb37cd0c1d0e8f49a445674f2c289772fc5ec87a11f69906f4922e86b3 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job (gh)
RuntimeError: Command docker exec -t 4db927d404bdeec21953dfffa5e0ec39220fcf9cdf7ca271948aa2e89820330e /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t bf17b5704715290256e3ea359eb5936f24492c335a3307af16b2a360eefcc9b4 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job (gh)
RuntimeError: Command docker exec -t 0928685aa9bddbff632a06db91473ef2de5c09e2563626fe6b396ecd0fdfba41 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t f755e5af2cbc024d3d8a4eade25a867104df9413ebd1ffde3a420d0ce23c71de /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job (gh)
RuntimeError: Command docker exec -t 19c21a4aa16ffaa3354a22a6377dd86763713205438988d5492dfe29a7742843 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job (gh)
RuntimeError: Command docker exec -t ce358fa3f1d10584a8822f1414a6e964af9d0929a61f7ca098779cfbe85fca39 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (openai, whisper-small, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t daa0017a12d359212385f63df8421a6e40dc983f51191b5249a6b69bd31477e6 /exec failed with exit code 1 -
Test CUDA Builds / export-model-cuda-artifact (openai, whisper-small, quantized-int4-weight-only) / linux-job (gh)
RuntimeError: Command docker exec -t d4dee7cd6cc55c00fc1679860111c23663b8fe766a322c7478e12cf3280c90f5 /exec failed with exit code 1
FLAKY - The following job failed but was likely due to flakiness present on trunk:
- pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This PR needs a release notes: label
If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.
To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"
For more information, see https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.
Hi team, I add a FastGelu example, but I didn't use HTP intrinsics so the perf is still not optimized. Would like to know where to put these examples
Can I get a review on this PR?
Hi team, I add a FastGelu example, but I didn't use HTP intrinsics so the perf is still not optimized. Would like to know where to put these examples
Thanks for your contribution. I think it is ok to put this custom op in examples/qualcomm/custom_op.