[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve… by asmigosw · Pull Request #968 · quic/efficient-transformers

asmigosw · 2026-05-06T17:09:28Z

Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf exports

FP16ClipTransform inlines external weights, causing large embedding
models (e.g. BAAI/bge-reranker-v2-m3) to exceed the 2GB ModelProto
parser limit in the AIC compiler

Adding SplitTensorsTransform to _onnx_transforms spills large
initializers to sidecar *.onnx.data files. Updated existing tests
and added regression tests to verify external data spilling behavior.

…nt >2GB protobuf export issue Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>

quic-hemagnih · 2026-05-07T04:54:30Z

CI-Ready

asmigosw · 2026-05-07T04:55:57Z

CI-Ready

[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve…

820c5e1

…nt >2GB protobuf export issue Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>

quic-hemagnih approved these changes May 7, 2026

View reviewed changes

quic-hemagnih merged commit e7dbee2 into quic:release/v1.21.6 May 7, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve…#968

[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve…#968
quic-hemagnih merged 1 commit intoquic:release/v1.21.6from
asmigosw:split_tensor_transform

asmigosw commented May 6, 2026

Uh oh!

quic-hemagnih commented May 7, 2026

Uh oh!

asmigosw commented May 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

asmigosw commented May 6, 2026

Uh oh!

quic-hemagnih commented May 7, 2026

Uh oh!

asmigosw commented May 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants