Common CUDA errors (cuda completed with errors)
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
/pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:93: operator(): block: [0,0,0], thread: [70,0,0] Assertion `index >= -sizes[i] && index < sizes[i] && "index out of bounds"` failed.

device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
/pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:93: operator(): block: [0,0,0], thread: [28,0,0] Assertion `index >= -sizes[i] && index < sizes[i] && "index out of bounds"` failed.
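The assertion in the messages above requires every index to lie in the range [-size, size). As a rough sketch, the same condition can be checked on the host before the data reaches the GPU, which points at the offending values directly instead of at an asynchronous stack trace. The helper name find_out_of_bounds and the sample labels are hypothetical and not part of PyTorch.

```python
def find_out_of_bounds(indices, size):
    """Return (position, value) pairs for every index outside [-size, size).

    This mirrors the condition in the assertion:
    index >= -sizes[i] && index < sizes[i]
    """
    return [
        (pos, idx)
        for pos, idx in enumerate(indices)
        if not (-size <= idx < size)
    ]


# Hypothetical data: the indexed dimension has 5 entries, so valid
# indices are -5..4. The values 5 and -6 fall outside that range.
labels = [0, 4, 5, -1, -6]
bad = find_out_of_bounds(labels, size=5)
print(bad)  # [(2, 5), (4, -6)]

# Assumption: for a 1-D PyTorch index tensor, the values could be passed as
# find_out_of_bounds(index_tensor.cpu().tolist(), size)
```

If the check finds nothing, rerunning the script with the environment variable from the message set (for example CUDA_LAUNCH_BLOCKING=1 python train.py, where train.py is a placeholder name) makes kernel launches synchronous, so the stack trace should point at the call that actually failed.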
TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first.
Cause:
a = str(trues_cls.detach().numpy()[0])  # wrong: NumPy cannot read a CUDA tensor; it must first be converted to a CPU tensor
Solution:
To convert a CUDA tensor to NumPy, first copy it to the CPU, then convert it to a NumPy array.
a = str(trues_cls.detach().cpu().numpy()[0])  # correct
# detach(): returns a new tensor that does not track gradients
# cpu(): copies the data from the GPU to the CPU
# numpy(): converts the tensor to a NumPy array

RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
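The server has two GPUs, but the program could only run on one of them.
Cause: CUDA_VISIBLE_DEVICES had been set with export in the environment, so only one GPU (GPU:0) was visible, while the program asked for GPU:1. With a single visible device, the only valid ordinal is 0.
Solution: run export CUDA_VISIBLE_DEVICES=0,1 (no spaces around the = sign) so that both devices are visible and the program can switch between them freely.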
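The mismatch described above can be checked before a device is selected. The following is a minimal sketch that only inspects the CUDA_VISIBLE_DEVICES string. It assumes a plain comma-separated list of ids, and the helper names visible_gpu_ids and ordinal_is_valid are hypothetical. Inside a PyTorch program, torch.cuda.device_count() reports the number of devices the process can actually see.

```python
import os


def visible_gpu_ids(env=None):
    """Return the ids listed in CUDA_VISIBLE_DEVICES, or None if it is unset."""
    env = os.environ if env is None else env
    raw = env.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return None  # variable unset: no restriction from the environment
    return [part.strip() for part in raw.split(",") if part.strip()]


def ordinal_is_valid(ordinal, env=None):
    """Check a device ordinal against the visible list.

    Visible devices are renumbered from 0 inside the process, so the
    ordinal must be smaller than the number of listed devices.
    Returns None when the variable is unset and nothing can be decided.
    """
    ids = visible_gpu_ids(env)
    if ids is None:
        return None
    return 0 <= ordinal < len(ids)


# One visible GPU: ordinal 1 does not exist ("invalid device ordinal").
print(ordinal_is_valid(1, {"CUDA_VISIBLE_DEVICES": "0"}))    # False
# Two visible GPUs: ordinal 1 is fine.
print(ordinal_is_valid(1, {"CUDA_VISIBLE_DEVICES": "0,1"}))  # True
```

The env parameter exists only so the check can be tried with sample values. Called without it, the functions read the real environment of the current process.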