Skip to content

Issue/1205: Async model loader#1206

Open
qinyiqun wants to merge 2 commits into
InfiniTensor:mainfrom
qinyiqun:weight-loading-accel
Open

Issue/1205: Async model loader#1206
qinyiqun wants to merge 2 commits into
InfiniTensor:mainfrom
qinyiqun:weight-loading-accel

Conversation

@qinyiqun
Copy link
Copy Markdown
Collaborator

@qinyiqun qinyiqun commented Jun 5, 2026

用于主存到显存之间的异步内存拷贝,可以提速现有的主存->显存速度,根据模型不同,差距几倍到几十倍不等

@qinyiqun qinyiqun requested a review from a team June 5, 2026 07:17
Copy link
Copy Markdown
Collaborator

@wooway777 wooway777 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

image

请rebase main并补充测试截图

@qinyiqun
Copy link
Copy Markdown
Collaborator Author

qinyiqun commented Jun 5, 2026

24f6356856d4258a14319e7b27510891 96adb6e3b1f4349cf943e6e0627fea22

}

void Module::load_parameter_no_sync(const std::string &name, const Tensor &param) {
auto all_params = state_dict();
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里每load一个权重,就要调用一次state_dict()去遍历所有的模块。 这里能优化么

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants