Skip to content

[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4 #960

Open
smedhe wants to merge 35 commits intoquic:ft_experimental_v1from
smedhe:ft_experimental_v1_tf_v55_peft_upgrade
Open

[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4 #960
smedhe wants to merge 35 commits intoquic:ft_experimental_v1from
smedhe:ft_experimental_v1_tf_v55_peft_upgrade

Conversation

@smedhe
Copy link
Copy Markdown
Contributor

@smedhe smedhe commented May 4, 2026

This pull request adds TP + DDP support and updates the codebase to remain compatible with newer versions of the dependent libraries. The changes have been validated against the following library versions:

transformers 5.5.4
peft 0.19.1
trl 1.2.0
accelerate 1.13.0

vbaddi and others added 30 commits April 15, 2026 15:16
…T parity

- Rebased downstream wrapper stack to transformers==5.3.0 and aligned coupled deps
    (huggingface-hub, peft, diffusers) in project config.
- Updated model wrapper compatibility paths across causal/VLM/audio/export flows
    to match upstream v5 APIs while preserving downstream public behavior.
- Hardened cache compatibility layer and runtime glue for mixed legacy/new cache
    semantics used by downstream generation/export paths.
- Fixed attention/mask/rotary call-path mismatches introduced by upstream API
    changes (including model-specific signature updates).
- Updated AWQ/quantizer and export compatibility paths to remain ONNX-safe.
- Resolved MoE/export edge cases (including Mixtral/gpt_oss) to keep
    HF PyTorch -> downstream PyTorch -> ONNXRuntime token parity.
- Validation evidence:
    pyenv activate qeff.mainline
    python -m pytest -q tests/test_model_quickcheck.py -n 16
    Result: 26 passed.

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
…odeling_qeff

Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: vbaddi <vbaddi@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: asmigosw <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Signed-off-by: Sharvari Medhe <smedhe@qti.qualcomm.com>
Copy link
Copy Markdown
Contributor

@quic-akuruvil quic-akuruvil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are so many redundant file changes. Take transforemrs V5.5.4 upgrade branch from models' team, and then raise a PR, keeping that as the base branch.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants