Qualcomm AI Engine Direct - GA Static Granite3.3-2b

Open chenweng-quic opened this issue 2 weeks ago • 4 comments

Summary

Add Granite3.3-2b support.

Source model: [images not preserved]

Static llama: python examples/qualcomm/oss_scripts/llama/llama.py -b build-android -H mlgtw-linux -s c3b39f15 -m SM8650 --temperature 0 --model_mode kv --max_seq_len 1024 --prefill_ar_len 128 --decoder_model granite_3_3-2b_instruct --prompt "I would like to learn python, could you teach me with a simple example?" --run_lm_eval --task hellaswag --limit 10 --artifact llama_qnn --kv_updater shift_pointer
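The long one-line invocation above is easier to review when laid out as an argument list. The following is only a sketch: every value is copied verbatim from the command above, the variable name `cmd` is illustrative, and the snippet prints the command rather than running it. The host name and device serial are specific to the author's setup (they match the `--host` and `--device` values in the test plan).

```python
import shlex

# Arguments copied verbatim from the "Static llama" command above.
cmd = [
    "python", "examples/qualcomm/oss_scripts/llama/llama.py",
    "-b", "build-android",
    "-H", "mlgtw-linux",   # author's host; replace with your own
    "-s", "c3b39f15",      # author's device serial; replace with your own
    "-m", "SM8650",
    "--temperature", "0",
    "--model_mode", "kv",
    "--max_seq_len", "1024",
    "--prefill_ar_len", "128",
    "--decoder_model", "granite_3_3-2b_instruct",
    "--prompt", "I would like to learn python, could you teach me with a simple example?",
    "--run_lm_eval",
    "--task", "hellaswag",
    "--limit", "10",
    "--artifact", "llama_qnn",
    "--kv_updater", "shift_pointer",
]

# shlex.join quotes the prompt so the result can be pasted into a POSIX shell.
print(shlex.join(cmd))
```

Keeping the arguments in a list avoids quoting mistakes around the prompt string when the command is edited or passed to `subprocess.run`.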

Accuracy (hellaswag, limit=10)

prepare_pt2e: {'acc_norm,none': 0.5}
convert_pt2e: {'acc_norm,none': 0.3}
device: {'acc_norm,none': 0.2}
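To make the stage-to-stage change explicit, the reported scores can be compared with a few lines of Python. This is a minimal sketch, not part of the PR: the helper name `stage_drops` is hypothetical, and the numbers are copied from the results above. Note that with `--limit 10` each sample is worth 0.1 of accuracy, so these differences correspond to only one or two samples.

```python
# Scores copied from the hellaswag results reported above (limit=10).
results = {
    "prepare_pt2e": {"acc_norm,none": 0.5},
    "convert_pt2e": {"acc_norm,none": 0.3},
    "device": {"acc_norm,none": 0.2},
}


def stage_drops(results, metric="acc_norm,none"):
    """Return (stage, score, delta vs. previous stage) tuples in insertion order."""
    rows = []
    prev = None
    for stage, metrics in results.items():
        score = metrics[metric]
        # Round to hide floating-point noise such as -0.09999999999999998.
        delta = None if prev is None else round(score - prev, 4)
        rows.append((stage, score, delta))
        prev = score
    return rows


for stage, score, delta in stage_drops(results):
    change = "baseline" if delta is None else f"{delta:+.1f}"
    print(f"{stage:>13}: {score:.1f} ({change})")
```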

Statistics on SM8650 (16a4w_block64)

[image not preserved]

Statistics on SM8750 (16a4w_block64)

[image not preserved]

Test plan

python backends/qualcomm/tests/test_qnn_delegate.py -k TestExampleLLMScript.test_granite_3_3_2b_instruct --device c3b39f15 --host mlgtw-linux --model SM8650 --build_folder build-android --executorch_root . --artifact_dir ./llama_qnn --llama_artifacts llama_qnn

cc @cccclai @cbilgin

chenweng-quic, Nov 13 '25 10:11

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15808

Note: Links to docs will display an error until the docs builds have been completed.

:x: 1 New Failure

As of commit 33576ac73516d2ae597fb230bbbd3b96949bd5ea with merge base 3e90b446f70f5f40435034f272ebf6b56dfb93ef:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot[bot], Nov 13 '25 10:11

The label module: qnn is only applicable to issues and has been removed. Please only use this label on issues.

pytorch-bot[bot], Nov 13 '25 10:11

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example @pytorchbot label "release notes: none"

For more information, see https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

github-actions[bot], Nov 13 '25 10:11

@cccclai has imported this pull request. If you are a Meta employee, you can view this in D87034494.

meta-codesync[bot], Nov 14 '25 04:11