Support for separated training/eval clusters #21

@nikhilk

Support a cluster setup where the master handles training and checkpointing, and a separate eval node runs continuous evaluation against the latest checkpoint.

Likely a few details to figure out... at some point.
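
To make the request concrete, here is a rough sketch of what the eval node's loop might look like. It is not an existing API: the `ckpt-<step>` file naming, the `latest_checkpoint` and `continuous_eval` helpers, and the `evaluate` callback are all hypothetical, and it assumes the master and the eval node share a checkpoint directory.

```python
import re
import time
from pathlib import Path
from typing import Callable, List, Optional

# Hypothetical naming scheme: the master writes files such as "ckpt-1000".
_CKPT_PATTERN = re.compile(r"^ckpt-(\d+)$")


def latest_checkpoint(checkpoint_dir: Path) -> Optional[Path]:
    """Return the checkpoint with the highest step number, or None if there is none."""
    best_step, best_path = -1, None
    for path in checkpoint_dir.iterdir():
        match = _CKPT_PATTERN.match(path.name)
        if match and int(match.group(1)) > best_step:
            best_step, best_path = int(match.group(1)), path
    return best_path


def continuous_eval(
    checkpoint_dir: Path,
    evaluate: Callable[[Path], None],
    poll_seconds: float = 30.0,
    max_polls: Optional[int] = None,
) -> List[Path]:
    """Poll the shared directory and evaluate each new latest checkpoint once.

    `evaluate` is a placeholder for whatever the eval node actually runs.
    `max_polls=None` means run forever; a finite value is handy for testing.
    """
    last_seen: Optional[Path] = None
    evaluated: List[Path] = []
    polls = 0
    while max_polls is None or polls < max_polls:
        current = latest_checkpoint(checkpoint_dir)
        # Skip the evaluation if the master has not written anything new.
        if current is not None and current != last_seen:
            evaluate(current)
            evaluated.append(current)
            last_seen = current
        polls += 1
        if max_polls is None or polls < max_polls:
            time.sleep(poll_seconds)
    return evaluated
```

The sketch only evaluates the newest checkpoint at each poll, so intermediate checkpoints written between polls are skipped. It also does not guard against reading a checkpoint that the master is still writing, which is probably one of the details to figure out.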
