Issue/1205: Async model loader by qinyiqun · Pull Request #1206 · InfiniTensor/InfiniCore

qinyiqun · 2026-06-05T07:17:28Z

用于主存到显存之间的异步内存拷贝，可以提速现有的主存->显存速度，根据模型不同，差距几倍到几十倍不等

wooway777

请rebase main并补充测试截图

qinyiqun · 2026-06-05T07:35:14Z

pengcheng888 · 2026-06-05T11:27:59Z

 }

+void Module::load_parameter_no_sync(const std::string &name, const Tensor &param) {
+    auto all_params = state_dict();


这里每load一个权重，就要调用一次state_dict()去遍历所有的模块。这里能优化么

qinyiqun added 2 commits June 3, 2026 08:40

Allow batched parameter loading without per-tensor sync

223bdd5

Make module state dict thread local to callers

493e486

qinyiqun requested a review from a team June 5, 2026 07:17

wooway777 requested changes Jun 5, 2026

View reviewed changes

pengcheng888 reviewed Jun 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue/1205: Async model loader#1206

Issue/1205: Async model loader#1206
qinyiqun wants to merge 2 commits into
InfiniTensor:mainfrom
qinyiqun:weight-loading-accel

qinyiqun commented Jun 5, 2026 •

edited

Loading

Uh oh!

wooway777 left a comment

Uh oh!

qinyiqun commented Jun 5, 2026

Uh oh!

pengcheng888 Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qinyiqun commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wooway777 left a comment

Choose a reason for hiding this comment

Uh oh!

qinyiqun commented Jun 5, 2026

Uh oh!

pengcheng888 Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qinyiqun commented Jun 5, 2026 •

edited

Loading