-
Notifications
You must be signed in to change notification settings - Fork 82
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve…
#968
opened May 6, 2026 by
asmigosw
Contributor
Loading…
Enable On Device Sampling for Qwen3ForCausalLM
#963
opened May 5, 2026 by
quic-sanising
Contributor
Loading…
[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4
#960
opened May 4, 2026 by
smedhe
Contributor
Loading…
Enable ffn blocking for dense models with automatic blocking configurator
enhancement
New feature or request
qeff.blocking
#958
opened May 4, 2026 by
kdulla
Contributor
Loading…
Optimize attention blocking nested loops
#957
opened Apr 30, 2026 by
anujgupt-github
Contributor
Loading…
Layer wise changes for kimi model
#954
opened Apr 29, 2026 by
abhishek-singh591
Contributor
Loading…
[Nightly CI]: Creating separate Pipeline for Nightly Jobs
#953
opened Apr 29, 2026 by
abukhoy
Contributor
Loading…
fix: improve weight offloading to handle plain tensor attrs and use to_empty()
#952
opened Apr 28, 2026 by
quic-rishinr
Contributor
Loading…
First Block Caching Infra for diffusers
Diffusers
Use for PR related to diffusers in efficient-transformers.
#941
opened Apr 24, 2026 by
quic-amitraj
Contributor
Loading…
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill
enhancement
New feature or request
#935
opened Apr 21, 2026 by
vbaddi
Contributor
Loading…
updated blocking in diffusers with cross attention check instead of SL
#932
opened Apr 21, 2026 by
tv-karthikeya
Contributor
Loading…
CB Bug fix for Qwen3VL Dense and basic cleaning of example script and Model File
#926
opened Apr 20, 2026 by
qcdipankar
Contributor
Loading…
Enabling support of rerankers models 2B and 8B of qwen3vl
#921
opened Apr 18, 2026 by
quic-amitraj
Contributor
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.