quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 82
Star 88

Code
Issues 4
Pull requests 47
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: quic/efficient-transformers

Labels 27 Milestones 0

New pull request New

47 Open 910 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Layerwise int4 kimi

#973 opened May 7, 2026 by abhishek-singh591 Contributor

Loading…

Glm4.7 flash reap

#972 opened May 6, 2026 by azajac-qcom

Loading…

rebase test

#971 opened May 6, 2026 by qraniumcitest

Loading…

[release-v1.21.6] Add SplitTensorsTransform to QEFFAutoModel to preve…

#968 opened May 6, 2026 by asmigosw Contributor

Loading…

TF and other package update

#967 opened May 6, 2026 by quic-hemagnih Contributor • Draft

Gemma4

#966 opened May 6, 2026 by tchawada Contributor

Loading…

MLA : fix online/offline absorption

#965 opened May 6, 2026 by quic-mamta Contributor

Loading…

Add DPO specific changes

#964 opened May 6, 2026 by quic-akuruvil Contributor • Draft

Enable On Device Sampling for Qwen3ForCausalLM

#963 opened May 5, 2026 by quic-sanising Contributor

Loading…

MLA Int4 Changes

#962 opened May 5, 2026 by quic-mamta Contributor

Loading…

Porting fp16/bf16 support to release/v1.21.6

#961 opened May 5, 2026 by asmigosw Contributor • Draft

[QEff. Finetuning] TP+DDP for transformers upgrade to v5.5.4

#960 opened May 4, 2026 by smedhe Contributor

Loading…

Enable ffn blocking for dense models with automatic blocking configurator enhancement

New feature or request

qeff.blocking

#958 opened May 4, 2026 by kdulla Contributor

Loading…

Optimize attention blocking nested loops

#957 opened Apr 30, 2026 by anujgupt-github Contributor

Loading…

Layer wise changes for kimi model

#954 opened Apr 29, 2026 by abhishek-singh591 Contributor

Loading…

[Nightly CI]: Creating separate Pipeline for Nightly Jobs

#953 opened Apr 29, 2026 by abukhoy Contributor

Loading…

fix: improve weight offloading to handle plain tensor attrs and use to_empty()

#952 opened Apr 28, 2026 by quic-rishinr Contributor

Loading…

First Block Caching Infra for diffusers Diffusers

Use for PR related to diffusers in efficient-transformers.

#941 opened Apr 24, 2026 by quic-amitraj Contributor

Loading…

feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill enhancement

New feature or request

#935 opened Apr 21, 2026 by vbaddi Contributor

Loading…

updated blocking in diffusers with cross attention check instead of SL

#932 opened Apr 21, 2026 by tv-karthikeya Contributor

Loading…

Added MDP generation to QEff Compile

#930 opened Apr 21, 2026 by quic-mohmeh

Loading…

CB Bug fix for Qwen3VL Dense and basic cleaning of example script and Model File

#926 opened Apr 20, 2026 by qcdipankar Contributor

Loading…

Enabled Qwen3-VL embedding model

#923 opened Apr 20, 2026 by quic-amitraj Contributor

Loading…

[Qwen3_Omni]_Onboarding

#922 opened Apr 20, 2026 by mohiso22 Contributor • Draft

Enabling support of rerankers models 2B and 8B of qwen3vl

#921 opened Apr 18, 2026 by quic-amitraj Contributor

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!