python - RuntimeError: Pytorch Optimizer - Stack Overflow

IT技术

更新时间：2025-04-144

admin管理员组
文章数量:1394143

# create a Pytorch optimizer 
optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)

for iter in range(max_iters):
    if iter % eval_iters == 0:
        losses = estimate_loss()
        print(f"step: {iter}, loss {losses}")
        
    # sample a batch of data
    xb, yb = get_batch("train")
    
    #evaluate the loss
    logits, loss = model.forward(xb, yb)
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
    
print(loss.item())

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select)

I tried adding model = model.to(device) and adding .cuda() after inputs, but none of them worked. I'm struggling to get it fixed.

# create a Pytorch optimizer 
optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)

for iter in range(max_iters):
    if iter % eval_iters == 0:
        losses = estimate_loss()
        print(f"step: {iter}, loss {losses}")
        
    # sample a batch of data
    xb, yb = get_batch("train")
    
    #evaluate the loss
    logits, loss = model.forward(xb, yb)
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()
    
print(loss.item())

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select)

I tried adding model = model.to(device) and adding .cuda() after inputs, but none of them worked. I'm struggling to get it fixed.

Share Improve this question edited Mar 22 at 9:37 Innat 17.2k6 gold badges60 silver badges112 bronze badges asked Mar 17 at 4:36 user29981993 111 silver badge1 bronze badge

Add a comment |

1 Answer 1

Sorted by: Reset to default 1

Your calculation failed because PyTorch detected a mix of GPU and CPU tensors. Specifically, some of your data is being processed on the graphics card (cuda:0), while other parts remain on the computer's main processor. This mismatch prevents PyTorch from performing operations that require all tensors to be on the same device.

When you use a GPU with PyTorch, it's identified as cuda:0 if it's your only GPU or if you haven't specified otherwise. Your first GPU is always indexed as 0.
To use the GPU, ensure your device settings are set to cuda. If you have multiple GPUs, you can select them by changing the index (e.g., cuda:1, cuda:2).

import torch

device = 'cuda' if torch.cuda.is_available() else 'cpu'
print(f"Using device: {device}")

First, transfer your model to the GPU using model = model.to(device). Make sure to do this before setting up your optimizer.
Next, ensure your input data (xb and yb) also resides on the GPU by using xb = xb.to(device) and yb = yb.to(device).

Maintain device consistency within estimate_loss by ensuring all operations and loaded data reside on the same device as the model.

本文标签： pythonRuntimeError Pytorch OptimizerStack Overflow

版权声明：本文标题：python - RuntimeError: Pytorch Optimizer - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1744579196a2613815.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

python - RuntimeError: Pytorch Optimizer - Stack Overflow

1 Answer 1

更多相关文章