python - PyTorch: What is the difference between tensor.cuda() and tensor.to(torch.device("cuda:0"))?

Question

Welcome To Ask or Share your Answers For Others

python - PyTorch: What is the difference between tensor.cuda() and tensor.to(torch.device("cuda:0"))?

asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

python - PyTorch: What is the difference between tensor.cuda() and tensor.to(torch.device("cuda:0"))?

In PyTorch, what is the difference between the following two methods in sending a tensor (or model) to GPU:

Setup:

X = np.array([[1, 3, 2, 3], [2, 3, 5, 6], [1, 2, 3, 4]]) # X = model()
X = torch.DoubleTensor(X)

Method 1	Method 2
`X.cuda()`	`device = torch.device("cuda:0")` `X = X.to(device)`

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Answer

深蓝 · Answer 1 · 2021-10-23T21:38:05+0000

There is no difference between the two.
Early versions of pytorch had .cuda() and .cpu() methods to move tensors and models from cpu to gpu and back. However, this made code writing a bit cumbersome:

if cuda_available:
  x = x.cuda()
  model.cuda()
else:
  x = x.cpu()
  model.cpu()

Later versions introduced .to() that basically takes care of everything in an elegant way:

device = torch.device('cuda') if cuda_available else torch.device('cpu')
x = x.to(device)
model = model.to(device)

Categories

python - PyTorch: What is the difference between tensor.cuda() and tensor.to(torch.device("cuda:0"))?

python - PyTorch: What is the difference between tensor.cuda() and tensor.to(torch.device("cuda:0"))?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags