
I am working on a Llama fine-tuning task. When I train on a single GPU, the program runs fine:

import os

os.environ["CUDA_VISIBLE_DEVICES"] = "0"
os.environ["TOKENIZERS_PARALLELISM"] = "false"

import torch
from transformers import AutoModelForCausalLM

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model_name = "../models/llama3_8b/"
# compute_dtype and bnb_config are defined earlier in the script (not shown here)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map=device,
    torch_dtype=compute_dtype,
    quantization_config=bnb_config,
)
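
For context, compute_dtype and bnb_config are a fairly standard 4-bit QLoRA setup, roughly along these lines (illustrative values, not necessarily my exact ones):

import torch
from transformers import BitsAndBytesConfig

# Illustrative 4-bit quantization settings; the exact values may differ in my script
compute_dtype = torch.bfloat16
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=compute_dtype,
)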

But when I tried to use multiple GPUs for fine-tuning, an error occurred. The modified code is as follows (the lines I changed are marked with # Modification):

from peft import LoraConfig
from transformers import TrainingArguments
from trl import SFTTrainer

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    # device_map=device,
    device_map="auto",  # Modification
    torch_dtype=compute_dtype,
    quantization_config=bnb_config,
)
peft_config = LoraConfig(
    lora_alpha=16,
    lora_dropout=0,
    r=64,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj",],
)
training_arguments = TrainingArguments(
    ...
    local_rank=int(os.getenv("LOCAL_RANK", -1)),  # Modification (cast to int: getenv returns a string when the variable is set)
    ddp_find_unused_parameters=False,  # Modification
)
trainer = SFTTrainer(
    model=model,
    args=training_arguments,
    train_dataset=train_data,
    #eval_dataset=eval_data,
    peft_config=peft_config,
    dataset_text_field="text",
    tokenizer=tokenizer,
    max_seq_length=max_seq_length,
    packing=False,
    dataset_kwargs={
        "add_special_tokens": False,
        "append_concat_token": False,
    },
)
trainer.train()

The error is as follows:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0!

The launch command is:

CUDA_VISIBLE_DEVICES=3,4 python llama3.py
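
I wonder whether the problem is that device_map="auto" shards the model across both visible GPUs (naive model parallelism), while the DDP-style arguments above assume one process per GPU. If that is the case, would per-rank placement along these lines be the right direction? This is an untested sketch, assuming a torchrun launch (torchrun sets LOCAL_RANK for each process):

# Hypothetical alternative: one process per GPU, each holding a full copy of the
# quantized model on its own device. Launch with:
#   CUDA_VISIBLE_DEVICES=3,4 torchrun --nproc_per_node=2 llama3.py
local_rank = int(os.getenv("LOCAL_RANK", 0))
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map={"": local_rank},  # pin the whole model to this process's GPU
    torch_dtype=compute_dtype,
    quantization_config=bnb_config,
)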

Does anyone know how to solve it?
