Commit 1721a1aa authored by Jan Ebert

Properly shut down distributed process group

This avoids pending NCCL operations being lost.
parent ac788119
@@ -289,6 +289,8 @@ def main():
     print0('Final test loss:', test_loss)
     save0(model, 'model-final.pt')
+    torch.distributed.destroy_process_group()
 if __name__ == '__main__':
     main()
@@ -341,6 +341,8 @@ def main():
     print0('Final test loss:', test_loss)
     save_model(model, 'model-final')
+    torch.distributed.destroy_process_group()
 if __name__ == '__main__':
     main()
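
Both hunks call `torch.distributed.destroy_process_group()` as the last statement of `main()`, so the call is only reached if everything before it succeeds. As a possible variation, not something this commit does, the shutdown could be wrapped in a context manager so that it also runs when the training code raises. The sketch below is an assumption-laden illustration: `process_group` is a hypothetical helper name, and the `dist` argument is expected to be the `torch.distributed` module, passed in so the helper has no hard dependency on PyTorch. Only `init_process_group`, `is_initialized`, and `destroy_process_group` are used from it.

```python
import contextlib


@contextlib.contextmanager
def process_group(dist, **init_kwargs):
    """Initialize a process group and always destroy it on exit.

    `dist` is assumed to be the `torch.distributed` module (hypothetical
    helper; not part of the original commit).
    """
    dist.init_process_group(**init_kwargs)
    try:
        yield
    finally:
        # Runs on normal return and when the body raises.
        if dist.is_initialized():
            dist.destroy_process_group()
```

Assuming `main()` no longer initializes or destroys the group itself, it could then be run as `with process_group(torch.distributed, backend='nccl'): main()`. Whether a clean shutdown is actually possible after one rank fails depends on the state of the other ranks, so this is a convenience, not a guarantee.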