maxtext
maxtext copied to clipboard
Add Qwen3 Audio Encoder
Description
- Add audio encoder support for qwen3
- classes such as
AudioEncoderandQwen3OmniAudioEncoder - added the
use_audioflag to the configuration. - various audio specific flags
- added merging of audio tokens for qwen3
- added support in maxengine and maxtext_utils
- added support in decode.py
Tests
tests/check_qwen3_omni_audio_vs_reference.py
Checklist
Before submitting this PR, please make sure (put X in square brackets):
- [x] I have performed a self-review of my code. For an optional AI review, add the
gemini-reviewlabel. - [x] I have necessary comments in my code, particularly in hard-to-understand areas.
- [x] I have run end-to-end tests tests and provided workload links above if applicable.
- [x] I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.