Add Qwen 3 Omni by samudraneel05 · Pull Request #2590 · keras-team/keras-hub

samudraneel05 · 2026-02-08T17:47:12Z

Description of the change

Add the Qwen3-Omni model to Keras-Hub. It has a thinker-talker architecture. So far, I've tried to implement the thinker component, which is the core text transformer with audio and vision encoder integration.

Reference

Fixes #2413, Fixes #2523, and fixes #2530

Colab Notebook

Preset conversion attempt: https://colab.research.google.com/drive/1hvFCXT8sqDJewHpt6Kziq_71vRgwneNl?usp=sharing (ran into OOM issues for backbone on both Colab and Kaggle, so unverified… need help here)
Tokenizer comparison: https://www.kaggle.com/code/samudraneel05/qwen3omni-tokenizer-comparison/edit
Preprocessor comparison: https://www.kaggle.com/code/samudraneel05/qwen3omni-preprocessor-comparison

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and works with all backends (TensorFlow, JAX, and PyTorch).
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have followed the Keras Hub Model contribution guidelines in making these changes.
I have followed the Keras Hub API design guidelines in making these changes.
I have signed the Contributor License Agreement.

I have some follow-up questions, which I'll elaborate on in the comments under this.

sachinprasadhs

Thanks! I took a look at a few files and left comments. Please address those comments and mark them as resolved once complete. I will perform another review after the comments have been handled.

keras_hub/src/models/qwen3_omni/qwen3_omni_attention.py

keras_hub/src/models/qwen3_omni/qwen3_omni_audio_converter.py

keras_hub/src/models/qwen3_omni/qwen3_omni_backbone.py

keras_hub/src/models/qwen3_omni/qwen3_omni_backbone_test.py

keras_hub/src/models/qwen3_omni/qwen3_omni_causal_lm.py

keras_hub/src/models/qwen3_omni/qwen3_omni_decoder.py

keras_hub/src/models/qwen3_omni/qwen3_omni_causal_lm.py

samudraneel05 · 2026-02-22T12:25:31Z

/gemini review

gemini-code-assist

Code Review

This is an impressive and well-structured contribution, adding the multimodal Qwen3-Omni model to KerasHub. The code adheres well to the repository's extensive style guide, including modular components, comprehensive tests, and the necessary converters. I have one main piece of feedback regarding code duplication and a related bug in the Qwen3OmniBackbone implementation, which I've detailed in a specific comment. Overall, this is a high-quality pull request.

keras_hub/src/models/qwen3_omni/qwen3_omni_backbone.py

samudraneel05 · 2026-02-25T10:34:32Z

hi, i've addressed the current set of comments. ready for review @sachinprasadhs!

sachinprasadhs

Thanks for addressing all the comments. Overall it looks good.
I have a few small comments. Would it also be possible to attach screenshots showing that the numerics match to within 1e-3?
Please also add a usage example notebook covering the different input types and the expected output format.
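A numerics check at the 1e-3 level could be sketched as below. This is a minimal sketch, not the PR's verification script: the helper name `compare_outputs` and the array names are hypothetical, and random arrays stand in for the real Hugging Face and KerasHub outputs, which would have to be produced by running both models on the same input.

```python
import numpy as np


def compare_outputs(reference, candidate, atol=1e-3):
    """Compare two output arrays using an absolute tolerance only."""
    reference = np.asarray(reference, dtype=np.float32)
    candidate = np.asarray(candidate, dtype=np.float32)
    if reference.shape != candidate.shape:
        raise ValueError(
            f"Shape mismatch: {reference.shape} vs {candidate.shape}"
        )
    abs_diff = np.abs(reference - candidate)
    return {
        "max_abs_diff": float(abs_diff.max()),
        "mean_abs_diff": float(abs_diff.mean()),
        # rtol=0.0 so that only the absolute tolerance decides the result.
        "matches": bool(np.allclose(reference, candidate, atol=atol, rtol=0.0)),
    }


# Stand-in data (assumption): in practice these would be the hidden states
# or logits from the reference model and from the ported model.
rng = np.random.default_rng(0)
hf_hidden_states = rng.normal(size=(1, 8, 16)).astype("float32")
keras_hidden_states = hf_hidden_states + 5e-4

report = compare_outputs(hf_hidden_states, keras_hidden_states)
print(report)
```

Reporting the maximum and mean absolute difference alongside the pass/fail flag makes the screenshot more informative than a bare `allclose` result.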

keras_hub/src/models/qwen3_omni/qwen3_omni_backbone.py

keras_hub/src/models/qwen3_omni/qwen3_omni_causal_lm.py

keras_hub/src/utils/transformers/convert_qwen3_omni_test.py

tools/checkpoint_conversion/convert_qwen3_omni_checkpoints.py

sachinprasadhs · 2026-03-08T21:10:47Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces the Qwen3Omni multimodal model, a significant addition to the library. The implementation is comprehensive, covering the backbone, audio and vision encoders, preprocessors, and the causal language modeling task. The code is well-structured and adheres to the repository's conventions for new model contributions. I have identified a few areas for improvement related to performance and style, which are detailed in the review comments. These include an opportunity to optimize the application of RoPE during cached generation and to make the creation of positional embeddings in the audio encoder more efficient. I also noted a minor deviation from the style guide regarding attribute naming. Overall, this is a well-executed contribution.
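The review does not spell out the RoPE optimization here, so the following is only an assumption about its general shape: during cached generation, the rotation can be applied to the newly generated token at its absolute position instead of being recomputed for the whole sequence. The sketch uses plain 1-D RoPE in NumPy with the half-rotation layout, not the interleaved M-RoPE used in this PR, and the function names are hypothetical.

```python
import numpy as np


def rope_angles(positions, head_dim, base=10000.0):
    """Return rotation angles with shape (len(positions), head_dim // 2)."""
    inv_freq = 1.0 / (base ** (np.arange(0, head_dim, 2) / head_dim))
    return np.outer(positions, inv_freq)


def apply_rope(x, positions, base=10000.0):
    """Rotate x of shape (seq_len, head_dim) at the given absolute positions."""
    head_dim = x.shape[-1]
    angles = rope_angles(positions, head_dim, base)
    cos = np.concatenate([np.cos(angles), np.cos(angles)], axis=-1)
    sin = np.concatenate([np.sin(angles), np.sin(angles)], axis=-1)
    first_half, second_half = x[..., : head_dim // 2], x[..., head_dim // 2 :]
    rotated = np.concatenate([-second_half, first_half], axis=-1)
    return x * cos + rotated * sin


rng = np.random.default_rng(0)
seq_len, head_dim = 6, 8
keys = rng.normal(size=(seq_len, head_dim))

# Full recomputation: rotate every position, then keep the last token.
full = apply_rope(keys, np.arange(seq_len))

# Cached decoding step: rotate only the new token at its absolute position.
step = apply_rope(keys[-1:], np.array([seq_len - 1]))

print(np.allclose(full[-1:], step))
```

Because each position is rotated independently of the others, keys that are already rotated can stay in the cache and only the new token needs work at each step.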

keras_hub/src/models/qwen3_omni/qwen3_omni_audio_encoder.py

keras_hub/src/models/qwen3_omni/qwen3_omni_rope.py

samudraneel05 added 30 commits February 8, 2026 22:46

make files to be used

5ee3a50

make files to be changes

25a193e

attention-decoder-rope-backbone

93a900e

initial fixes

378b67b

fix serialization in backbone

1d085e0

making the tokenizer

7809590

causal lm and related changes

4008844

presets established

0c7101f

causal lm preprocessor tests

fd9c124

fixes

e0bb85f

match position id shape to hf implementation

f445cf9

presets for kagglehub

d78952a

reduce PR scope

7d1a7cf

matching with hf and changing from moe to omni

b75629f

add the captioner and the thinking preset

3bc6f9c

implementing interleaved m-rope and fixed attention mech and parameters

35f174f

improvements to looping

d62835c

backbone todos and more updated

ed723b8

documentation made better

2867954

init aligned with repo standard

ab2408d

linter and pre-commit changes

17c4f9d

verification script

b3ed18b

qwen3omni specific changes

41ccd42

qwen3omni issues debugging due to transformers library being outdated

bbf6269

nesting of thinker component and fixes

d8a3420

back to full scale image-audio

f8bc9be

attention and positional encoding finalized

7bd4f1f

fix decoder and rope and hf output matching

549dcde

audio encoder finalization

f1b6879

vision encoder rewrite+full implementation

679c7d3

gemini suggestions investigated

bb29948

samudraneel05 changed the title from "Qwen 3 omni" to "Add Qwen 3 Omni" on Feb 8, 2026

sachinprasadhs added the "new model" label (for PRs that contribute a new model to the Keras Hub registry) on Feb 9, 2026

divyashreepathihalli requested a review from sachinprasadhs February 13, 2026 01:15

sachinprasadhs reviewed Feb 19, 2026

samudraneel05 added 9 commits February 21, 2026 18:19

addressed comments on attention and decoder

55efad5

Addressed comments on converter

91198e4

- removed whisper dependency
- reduced tf usage
- made test values smaller in line with moonshine/whisper

resolved audio encoder comments

3a2504b

adding missing function in causal lm and aligning with repo

85ea4a3

better modularity plus separation of text vs multimodal input

0732cd4

documentation improvements

a806e6e

backbone and its test fixing and refactor

9898a80

adding audio and vision encoder tests

04c494e

reduce redundancies

3c79f36

gemini-code-assist bot reviewed Feb 22, 2026

keras_hub/src/models/qwen3_omni/qwen3_omni_backbone.py (resolved)

converter fixes

9c7dc81

samudraneel05 requested a review from sachinprasadhs February 25, 2026 10:34

missing responses bugfix

14eaf1a

sachinprasadhs reviewed Mar 8, 2026

sachinprasadhs added the "kokoro:force-run" label (runs tests on GPU) on Mar 8, 2026

kokoro-team removed the "kokoro:force-run" label (runs tests on GPU) on Mar 8, 2026

gemini-code-assist bot reviewed Mar 8, 2026

keras_hub/src/models/qwen3_omni/qwen3_omni_audio_encoder.py (outdated, resolved)

keras_hub/src/models/qwen3_omni/qwen3_omni_audio_encoder.py (resolved)

keras_hub/src/models/qwen3_omni/qwen3_omni_rope.py (resolved)

samudraneel05 added 3 commits March 9, 2026 17:04

audio encoder docstring and init alignment

6c3fe5f

addressed comments on tests and presets

1c7e428

conversion script minor edits

437f8bb

sachinprasadhs added the "kokoro:force-run" label (runs tests on GPU) on Mar 9, 2026

kokoro-team removed the "kokoro:force-run" label (runs tests on GPU) on Mar 9, 2026


3 participants
