Skip to content

Commit 42a91c4

Browse files
author
聂靖入
committed
support parallel reward function
1 parent 6068728 commit 42a91c4

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

trlx/trainer/accelerate_ppo_trainer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -339,7 +339,7 @@ def make_experience(self, num_rollouts: int = 1024, iter_count: int = 0): # noq
339339
else:
340340
all_scores = None
341341
max_len = torch.tensor(0, dtype=torch.long, device=device)
342-
342+
343343
if self.config.train.reward_only_in_main_process:
344344
if torch.distributed.is_initialized():
345345
torch.distributed.broadcast(max_len, 0)

0 commit comments

Comments
 (0)