huggingface transformers - TypeError in SFTTrainer: Unexpected Keyword Arguments (packing, dataset_text_field, max_seq_length) -

IT技术

更新时间：2025-04-130

admin管理员组
文章数量:1387303

I'm trying to fine-tune a model using SFTTrainer from trl, but I'm facing multiple TypeError issues related to unexpected keyword arguments.

from transformers import TrainingArguments
from trl import SFTTrainer

output_dir = "tinyllama_instruct"
training_arguments = TrainingArguments(
    output_dir=output_dir,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    gradient_accumulation_steps=16,
    save_strategy="epoch",
    evaluation_strategy="epoch",
    logging_steps=25,
    learning_rate=2e-5,
    max_grad_norm=1.0,
    weight_decay=0.1,
    warmup_ratio=0.1,
    lr_scheduler_type="cosine",
    fp16=True,
    report_to=["tensorboard", "wandb"],
    num_train_epochs=1,
    gradient_checkpointing=True,
    gradient_checkpointing_kwargs={"use_reentrant": False},
)

trainer = SFTTrainer(
    model=model,
    args=training_arguments,
    train_dataset=dataset["train"],
    eval_dataset=dataset["test"],
    tokenizer=tokenizer,
    packing=True,  # Causes TypeError
    dataset_text_field="content",  # Causes TypeError if packing is removed
    max_seq_length=2048,  # Causes TypeError if dataset_text_field is removed
)

The Notebook can be found here: .ipynb

Errors Encountered:

TypeError: SFTTrainer.__init__() got an unexpected keyword argument 'packing'
Removing packing=True results in:
TypeError: SFTTrainer.__init__() got an unexpected keyword argument 'dataset_text_field'
Removing dataset_text_field="content" results in:
TypeError: SFTTrainer.__init__() got an unexpected keyword argument 'max_seq_length'
Finally, when I remove all these arguments, I get a KeyError: 'text' while tokenizing.

What I’ve Tried:

Removing the problematic arguments one by one, but each time a new issue arises.
Checking the latest trl documentation, but packing, dataset_text_field, and max_seq_length don't seem to be part of SFTTrainer anymore.
Verifying my dataset structure.

Question:

Has the SFTTrainer API changed recently, and are these arguments deprecated?
How should I correctly pass max_seq_length and specify the text field in my dataset?
Is packing handled differently now?

Any guidance would be greatly appreciated!

本文标签：

版权声明：本文标题：huggingface transformers - TypeError in SFTTrainer: Unexpected Keyword Arguments (packing, dataset_text_field, max_seq_length) - 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1744524433a2610657.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

Custom login form

IT技术

24分钟前

I am very new to WordPress. I am trying to display a login form in the header section of my website. However, when I loo

How to prevent Cloud Run 429 too many requests going to the PubSub dead letter queue - Stack Overflow

IT技术

24分钟前

ProblemWe use Cloud Run to process upload events, where a user can upload hundreds of images at once.

javascript - Loop through each element and print value in order - Stack Overflow

IT技术

23分钟前

I have some code where I want to loop through the amount of elements and set the text inside each one f

categories - When creating a new product, auto assign it to all custom taxonomy woocommerce

IT技术

21分钟前

I have a custom taxonomy called Location, and there are a lot of parents and children categories on it to mark in which

javascript - How to change the overlayLoadingTemplate in ag Grid? - Stack Overflow

IT技术

20分钟前

I am using Ag Grid v11.0 with angular 1.x. After the grid is rendered, I want to make an action ( ex: s

reactjs - react select ClearIndicator with custom Onclick event is not working in latest version 10.0.0 - Stack Overflow

IT技术

18分钟前

I am upgrading version of react-select to 5.10.0 and onClick event on custom clear icon is stopped work

javascript - How does browser scrolling work in DOM? - Stack Overflow

IT技术

17分钟前

Originally I was googling the difference between window.scrollTo(0,y) and element.scrollTop=y.On which

amazon web services - opentofu init : failed to retrieve authentication checksums for provider - Stack Overflow

IT技术

17分钟前

When i run tofu init in vscode, it keeps on timing out with this error: failed to retrieve authenticati

Push A List of Objects Into an Array Using jQuery (Or Javascript) - Stack Overflow

IT技术

16分钟前

$=jQuery.noConflict();$( document ).ready(function() {returns an array of image links [".jpg&

javascript - Cannot read property toDataURL of undefined - Stack Overflow

IT技术

15分钟前

Using createElement on canvas as a temporary placement to draw shapes and encounter an error. Does the

javascript - I don't want spaces after commas to be fixed width - best practise? - Stack Overflow

IT技术

12分钟前

I'm creating a website where the main headings are displayed in a fixed width font (the designer&#

functions - How to parse a shortcode within a shortcode?

IT技术

11分钟前

I've created a quick function and shortcode to allow me to include logged-in user only content:function content_for

python - Is there a way to save all Geotags for specific tags from Flickr as a Shapefile? - Stack Overflow

IT技术

10分钟前

I created a code with Jupyter Notebook to download all geotags from Flickr pictures with specific tags.

javascript - Body onresize event does not fire in IE7 when page is part of a frameset and resized vertically - Stack Overflow

IT技术

9分钟前

I have a simple frameset with two frames vertically, i.e. two rows:First row contains a fixed header.Se

customization - How can I display both LTR and RTL language texts on the same page?

IT技术

8分钟前

My WP website is multi-lingual and I can put two different LTR language texts, e.g., English and Chinese, on the same pa

javascript - Get the Id of the element in Angular js - Stack Overflow

IT技术

8分钟前

I have assigned the id value of an element in angularjs to a variable like below var test = angular.ele

jquery - javascript inline callback function to separate function - Stack Overflow

IT技术

6分钟前

Why is this code working:function onCordovaReady() { navigator.globalization.getLocaleName(function (lo

Disabling drift plugin from homepage

IT技术

5分钟前

So our client wants to use Drift the messaging pop up plugin but it is seriously hindering load speeds on the homepage.

javascript - TypeError: Cannot read property 'getValue' of null at welcomeAlert - Stack Overflow

IT技术

3分钟前

I'm new in CRM. I need just on the OnLoad event of the page, show a JavaScript alert message: &qu

Apache dbcp2 connection pool support external OAuth authentication? - Stack Overflow

IT技术

3分钟前

Using the JDBC driver it is possible to do OAuth authentication and Hikari CP support to create CP with

发表评论

全部评论 0

暂无评论

编程频道|软件玩家 - 软件改变生活！

huggingface transformers - TypeError in SFTTrainer: Unexpected Keyword Arguments (packing, dataset_text_field, max_seq_length) -

Errors Encountered:

What I’ve Tried:

Question:

更多相关文章

Custom login form

How to prevent Cloud Run 429 too many requests going to the PubSub dead letter queue - Stack Overflow

javascript - Loop through each element and print value in order - Stack Overflow

categories - When creating a new product, auto assign it to all custom taxonomy woocommerce

javascript - How to change the overlayLoadingTemplate in ag Grid? - Stack Overflow

reactjs - react select ClearIndicator with custom Onclick event is not working in latest version 10.0.0 - Stack Overflow

javascript - How does browser scrolling work in DOM? - Stack Overflow

amazon web services - opentofu init : failed to retrieve authentication checksums for provider - Stack Overflow

Push A List of Objects Into an Array Using jQuery (Or Javascript) - Stack Overflow

javascript - Cannot read property toDataURL of undefined - Stack Overflow

javascript - I don&#39;t want spaces after commas to be fixed width - best practise? - Stack Overflow

functions - How to parse a shortcode within a shortcode?

python - Is there a way to save all Geotags for specific tags from Flickr as a Shapefile? - Stack Overflow

javascript - Body onresize event does not fire in IE7 when page is part of a frameset and resized vertically - Stack Overflow

customization - How can I display both LTR and RTL language texts on the same page?

javascript - Get the Id of the element in Angular js - Stack Overflow

jquery - javascript inline callback function to separate function - Stack Overflow

Disabling drift plugin from homepage

javascript - TypeError: Cannot read property &#39;getValue&#39; of null at welcomeAlert - Stack Overflow

Apache dbcp2 connection pool support external OAuth authentication? - Stack Overflow

发表评论

推荐文章

JavaScript Order List Processing - Stack Overflow

javascript - Jquery-ui dialog box &#39;x&#39; button image is not visible - Stack Overflow

functions - is_email gives me error

javascript - REST API is deprecated for versions v2.1 and higher - Stack Overflow

oop - Oo javascript code completion in any IDE - Stack Overflow

热门文章

ubuntu - I get a black world screen in Gazebo after the plugin opens up. I am using ROS2 Jazzy - Stack Overflow

safari - Return Javascript variable to AppleScript - Stack Overflow

javascript - How to check android app first run - Cordova - Stack Overflow

How can i change Screen Orientation using javascript or Jquery? - Stack Overflow

plugin development - Gutenberg blocks error: Each child in a list should have a unique &quot;key&quot; prop

wp query - How to import a WP backup website into another Wordpress hosting?

javascript - Chart.js always visible labels - Stack Overflow

javascript - Select2 Dropdown Multiple select and unselect - Stack Overflow

c# - After use of Obfuscator application doesn&#39;t run - Stack Overflow

javascript - No routes matched location &quot;&quot; react router v6 - Stack Overflow

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

42 vulnerabilities (1 low, 12 moderate, 28 high, 1 critical) after running npm audit fix --force in Angular 18.2 - Stack Overflo

javascript - jqBootstrapValidation plugin is not working for my form - Stack Overflow

javascript - Startstop setTimeout with a button &amp; prevent counting faster with more clicks - Stack Overflow

rest api - rendering view in backbone

javascript - JQuery trigger a button click once - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

javascript - I don't want spaces after commas to be fixed width - best practise? - Stack Overflow

javascript - TypeError: Cannot read property 'getValue' of null at welcomeAlert - Stack Overflow

javascript - Jquery-ui dialog box 'x' button image is not visible - Stack Overflow

plugin development - Gutenberg blocks error: Each child in a list should have a unique "key" prop

c# - After use of Obfuscator application doesn't run - Stack Overflow

javascript - No routes matched location "" react router v6 - Stack Overflow

javascript - Startstop setTimeout with a button & prevent counting faster with more clicks - Stack Overflow