Pull requests: InternLM/lmdeploy
- #4511 Add explicit trust_remote_code controls to resolve the security issue (opened Apr 8, 2026 by lvhan028)
- #4510 [Draft] feat: Add TurboQuant (quant_policy=42) support for KV cache quantization (opened Apr 8, 2026 by windreamer)
- #4509 [enhancement] Enable TurboMind inference for FP8 models quantized by llm-compressor (opened Apr 8, 2026 by 43758726)
- #4505 [Fix] Handle None scales in generate_zero_point for mixed-format layers (opened Apr 7, 2026 by lingyezhixing)
- #4497 [Bug:P0] fix: Handle missing KV cache without crashing the engine (opened Apr 4, 2026 by lvhan028)
- #4496 [improvement] Reject requests on a stale session or a sleeping engine (opened Apr 4, 2026 by lvhan028)
- #4490 [enhancement] feat(turbomind): Integrate cublasGemmGroupedBatchedEx for Qwen3.5 MoE inference on Blackwell GPUs, with memory-copy optimizations (opened Apr 3, 2026 by hd9568)
- #4477 [enhancement] Integrate the deep-ep NCCL backend (opened Mar 27, 2026 by irexyc)
- #4468 [improvement] [refactor] [api_server] [1/N] Improve reasoning and tool-call parsers (opened Mar 26, 2026 by lvhan028)
- #4465 [enhancement] feat: TurboMind linear GDN prefix caching (opened Mar 25, 2026 by lapy)
- #4460 [enhancement] feat: Implement TurboMind vision encoder support for the Qwen3VL/3.5 families (opened Mar 24, 2026 by lapy)
- #4419 [improvement] [Feature] Support the n parameter in /v1/chat/completions and /v1/completions (opened Mar 17, 2026 by ziyangliu-666)