subprocess.CalledProcessError: Command '['/public/home/***/anaconda3/bin/python', '-u', './tool

Problem description

Error message: subprocess.CalledProcessError: Command '['/public/home/***/anaconda3/bin/python', '-u', './tools/train.py', '--local_rank=7', 'configs/bsds/EDTER_BIMLA_320x320_80k_bsds_bs_8.py', '--launcher', 'pytorch']' returned non-zero exit status 1.
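The message above only says that a child process (here, rank 7's training script) exited with status 1; the actual cause is usually in the traceback that the child printed before it exited. The following is a minimal sketch of that behaviour using only the standard library. The helper name `run_and_report` and the failing command are hypothetical stand-ins for illustration, not part of the original training setup.

```python
import subprocess
import sys


def run_and_report(cmd):
    """Run cmd and return (exit_code, stderr_text) instead of raising."""
    try:
        done = subprocess.run(cmd, check=True, capture_output=True, text=True)
        return done.returncode, done.stderr
    except subprocess.CalledProcessError as err:
        # err.returncode is the child's exit status;
        # err.stderr holds whatever the child wrote, including its traceback.
        return err.returncode, err.stderr


# Hypothetical stand-in for a training script that crashes.
failing_cmd = [sys.executable, "-c", "raise RuntimeError('simulated failure')"]

code, stderr_text = run_and_report(failing_cmd)
print("exit status:", code)
print("last stderr line:", stderr_text.strip().splitlines()[-1])
```

In practice this means scrolling up in the log to the first traceback from a worker process, rather than reading only the final CalledProcessError line.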


Solution

Pass find_unused_parameters=True when wrapping the model in DistributedDataParallel:

model = torch.nn.parallel.DistributedDataParallel(model, device_ids=[args.local_rank], output_device=args.local_rank, find_unused_parameters=True)
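This flag matters when some parameters do not take part in computing the loss, because DistributedDataParallel otherwise expects a gradient for every parameter. To check whether a model is in that situation, one option is to run a single forward and backward pass without DDP and list the parameters whose gradient is still None. The sketch below assumes a toy model; `TwoBranch` and its layer names are hypothetical and only illustrate the check.

```python
import torch
import torch.nn as nn


class TwoBranch(nn.Module):
    """Toy model (hypothetical) with one layer that forward() never calls."""

    def __init__(self):
        super().__init__()
        self.used = nn.Linear(4, 2)
        self.unused = nn.Linear(4, 2)  # never called in forward

    def forward(self, x):
        return self.used(x)


model = TwoBranch()
loss = model(torch.randn(3, 4)).sum()
loss.backward()

# Parameters that received no gradient did not contribute to the loss.
unused_names = [
    name
    for name, param in model.named_parameters()
    if param.requires_grad and param.grad is None
]
print(unused_names)
```

If the list is empty, unused parameters are probably not the cause and the worker traceback should be checked for a different error. If it is not empty, either enable find_unused_parameters=True as above or remove the unused layers; the flag adds some overhead on each iteration because DDP has to traverse the autograd graph to find them.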
