[Multi-GPU training error]: The server socket has failed to listen on any local network address.

Error:

RuntimeError: The server socket has failed to listen on any local network address. The server socket has failed to bind to [::]:16664 (errno: 98 - Address already in use). The server socket has failed to bind to 0.0.0.0:16664 (errno: 98 - Address already in use).

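Cause: one training job was already running on the first GPU, and the training job launched on the second GPU tried to use the same port (16664 in the message above). The port was already bound, so the new job's server socket failed with errno 98 (Address already in use).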
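To confirm this diagnosis before relaunching, you can check whether the port is still held by another process. The following is a minimal sketch using only the Python standard library; the helper name `port_is_free` is made up for illustration and is not part of any training framework.

```python
import socket


def port_is_free(port: int, host: str = "0.0.0.0") -> bool:
    """Return True if a TCP socket can currently bind to (host, port).

    Hypothetical helper for illustration: a failed bind is the same
    condition that produces errno 98 (Address already in use).
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        try:
            s.bind((host, port))
        except OSError:
            # The port is taken (or we lack permission to bind it).
            return False
    return True
```

For example, `port_is_free(16664)` should return `False` while the first training job is still running with that port.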

Solution: change the port number used by the second job so that it differs from the port the first job is using.
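Rather than choosing a new port by hand, you can let the operating system pick an unused one. The sketch below assumes the job is started through PyTorch's environment-variable rendezvous, which reads `MASTER_PORT`; if your launch script takes the port some other way, pass the returned number there instead. The function names are hypothetical.

```python
import os
import socket


def find_free_port() -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind(("", 0))
        return s.getsockname()[1]


def configure_master_port() -> int:
    """Pick a free port and export it as MASTER_PORT (assumed to be read
    by the launcher). Hypothetical helper for illustration."""
    port = find_free_port()
    os.environ["MASTER_PORT"] = str(port)
    return port
```

Note that the port is released as soon as `find_free_port` returns, so another process could in principle claim it before the training job binds it. In practice this window is small, but launching the job right after picking the port keeps the risk low.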


Reference article:

[debug] mmseg multi-machine multi-GPU training error: The server socket has failed to listen on any local network address. (zy_destiny's blog, CSDN)
