executorch
executorch copied to clipboard
Qualcomm AI Engine Direct - fix miss layernorm registry
Summary
- fix miss layernorm registry for gemma.
Test Plan
python examples/qualcomm/oss_scripts/llama/llama.py -b build-android -s ${SERIAL_NUM} -m ${SOC_MODEL} --temperature 0 --model_mode hybrid --max_seq_len 1024 --prefill_ar_len 128 --decoder_model gemma-2b --prompt "I would like to learn python, could you teach me with a simple example?" --tasks wikitext --limit 1
:link: Helpful Links
:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15824
- :page_facing_up: Preview Python docs built from this PR
Note: Links to docs will display an error until the docs builds have been completed.
:x: 1 New Failure
As of commit 05c8edbe25abe57d6ea0cf8a2e24565272f31e1e with merge base c02fdfc789899cc7ff1e369770fa207eb3426bb9 ():
NEW FAILURE - The following job has failed:
- pull / android / run-emulator (gh)
The process '/usr/bin/sh' failed with exit code 255
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @cccclai, There’s a bug in the current LayerNorm registry. Since the recent refactor, the Gemma's layernorm didn’t get registered. and that’s breaking gemma model loading.
Thanks!!
@pytorchbot label "release notes: qualcomm"