-
Notifications
You must be signed in to change notification settings - Fork 310
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add speculative decoding implementation (draft + target verification) (v0)
#2233
opened Jun 19, 2026 by
samsat701
Loading…
Route init-session provider-option shaping through DeviceInterface
#2232
opened Jun 18, 2026 by
qjia7
Contributor
Loading…
1 of 4 tasks
Add option of fp4 QMoE for gpt-oss in model builder
#2229
opened Jun 15, 2026 by
tianleiwu
Contributor
Loading…
Load models from ONNX Runtime model packages
#2227
opened Jun 12, 2026 by
jambayk
Contributor
Loading…
Add Tool Calling and Reasoning Token Metadata to
genai_config.json with Fallback Map
#2215
opened Jun 11, 2026 by
sayanshaw24
Collaborator
•
Draft
Bump torch from 2.7.1+cpu to 2.12.0+cpu in /test/python/cpu/torch
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#2213
opened Jun 11, 2026 by
dependabot
Bot
Loading…
Bump torch from 2.7.1 to 2.12.0+cpu in /test/python/macos/torch
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#2212
opened Jun 11, 2026 by
dependabot
Bot
Loading…
Pipeline-as-Config: structural model dispatch (#2114)
#2210
opened Jun 10, 2026 by
justinchuby
Contributor
•
Draft
Fix CUDA QMoE INT4 export in Qwen and Gpt-OSS models
#2209
opened Jun 9, 2026 by
tianleiwu
Contributor
Loading…
Prefer using CMake path variables where available
#2208
opened Jun 9, 2026 by
jaeyoonjung
Contributor
Loading…
Add per-run profiling config for fine-grained Run() profiling
#2152
opened May 9, 2026 by
xiaofeihan1
Contributor
•
Draft
Expose mutable sampling parameters on live Generator
#2145
opened May 8, 2026 by
qjia7
Contributor
Loading…
4 tasks done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.