Add torch_compile flag for training networks by wenxin0319 · Pull Request #28 · NVlabs/FastGen

wenxin0319 · 2026-06-01T05:12:25Z

FastGen currently relies on diffusers-based model execution, which leaves performance on the table during training.

This PR adds an opt-in torch_compile flag that wraps training networks with torch.compile, enabling PyTorch's compiler optimizations (operator fusion, memory planning, kernel autotuning). In the QwenImage benchmark below, this gives a 1.55x speedup per training iteration.

Benchmark (QwenImage, 20.43B params, NVIDIA H100, bfloat16, 512x512):

Setting │ Time/iter │ Std
Baseline (no compile) │ 0.694s │ 0.094s
torch.compile (max-autotune) │ 0.447s │ 0.014s

Speedup: 1.55x (55% higher throughput, about 36% less time per iteration)

Compiled iterations also show much lower variance (0.014s vs 0.094s), meaning more consistent training throughput. The one-time compilation overhead (~5-10 min with max-autotune) is amortized over the full training run.
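To make the amortization claim concrete, here is a small back-of-the-envelope calculation using only the numbers reported above. The helper name break_even_iterations is made up for this illustration, and the 5 and 10 minute overheads are the endpoints of the rough range quoted in the description, not separate measurements.

```python
import math

# Per-iteration times reported in the benchmark table (seconds).
baseline_s = 0.694
compiled_s = 0.447

speedup = baseline_s / compiled_s           # ratio of iteration times
saved_per_iter_s = baseline_s - compiled_s  # seconds saved per iteration
time_reduction = saved_per_iter_s / baseline_s


def break_even_iterations(overhead_s: float, saved_s: float) -> int:
    """Iterations needed before the one-time compile cost is paid back.

    Hypothetical helper for illustration only; not part of the PR.
    """
    return math.ceil(overhead_s / saved_s)


# Endpoints of the "~5-10 min" compilation overhead quoted above.
low = break_even_iterations(5 * 60, saved_per_iter_s)
high = break_even_iterations(10 * 60, saved_per_iter_s)

print(f"speedup: {speedup:.2f}x, time per iteration reduced by {time_reduction:.0%}")
print(f"break-even after roughly {low} to {high} iterations")
```

Under these assumptions the compile cost is recovered after roughly 1,200 to 2,400 iterations, so the flag pays off for any run longer than that and is likely not worth enabling for very short debugging runs.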

Changes:

Add torch_compile: bool = False config option in BaseModelConfig
Add _apply_torch_compile() in FastGenModel that compiles the main network (self.net)
Override _apply_torch_compile() in DMD2Model to also compile teacher and fake_score networks
Add comprehensive tests covering compile on/off for both SFT and DMD2 models, including training step validation
Add bench_compile.py benchmark script for measuring compile speedup
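The structure described in the list above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the class names mirror the ones mentioned in the description, but the constructor signatures, the compile_fn injection hook, the _default_compile helper, and the choice to call _apply_torch_compile() from __init__ are all assumptions made so the example stays small and testable without a GPU.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class BaseModelConfig:
    # Opt-in flag, off by default, as described in the PR.
    torch_compile: bool = False


def _default_compile(module: Any) -> Any:
    """Hypothetical default: wrap a module with torch.compile."""
    import torch  # imported lazily so the flag-off path never needs it

    return torch.compile(module, mode="max-autotune")


class FastGenModel:
    """Toy stand-in for the real class; only the compile logic is shown."""

    def __init__(
        self,
        config: BaseModelConfig,
        net: Any,
        compile_fn: Optional[Callable[[Any], Any]] = None,  # assumption: test hook
    ) -> None:
        self.config = config
        self.net = net
        self._compile_fn = compile_fn or _default_compile
        if self.config.torch_compile:
            self._apply_torch_compile()

    def _apply_torch_compile(self) -> None:
        # The base model compiles only the main network.
        self.net = self._compile_fn(self.net)


class DMD2Model(FastGenModel):
    """Toy stand-in: the override also compiles the auxiliary networks."""

    def __init__(
        self,
        config: BaseModelConfig,
        net: Any,
        teacher: Any,
        fake_score: Any,
        compile_fn: Optional[Callable[[Any], Any]] = None,
    ) -> None:
        # Set these first so they exist when the base __init__ compiles.
        self.teacher = teacher
        self.fake_score = fake_score
        super().__init__(config, net, compile_fn)

    def _apply_torch_compile(self) -> None:
        super()._apply_torch_compile()
        self.teacher = self._compile_fn(self.teacher)
        self.fake_score = self._compile_fn(self.fake_score)
```

With this shape, leaving torch_compile at its default keeps every network untouched, while BaseModelConfig(torch_compile=True) routes each network through the compile function exactly once. Injecting compile_fn is a design choice of this sketch that lets the on/off behavior be checked with a cheap fake instead of a real compilation.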

Usage:
Set torch_compile=True in model config to enable.

wenxin0319 · 2026-06-01T05:16:14Z

@juliusberner Could you please take a look at my PR? Thanks!

Add torch_compile flag for training networks

e9863fe

wenxin0319 wants to merge 1 commit into NVlabs:main from wenxin0319:main
