machine learning - Why Do I get Different performance on Different Runs on my ML model? - Stack Overflow


Updated: 2025-01-08


I'm training ML models (XGBoost and LightGBM) using Snowpark, but every run gives me different metric values (AUC, average precision), so I never know which model is actually the best.

I tried setting a global variable at the beginning of my notebook, random_seed = 42, and passing it both to my undersampling function and to the initialization of my models:

    # Snowpark ML wrappers (assumed import paths; not shown in the original post)
    from snowflake.ml.modeling.xgboost import XGBClassifier
    from snowflake.ml.modeling.lightgbm import LGBMClassifier

    if model_type == 'xgboost':
        model = XGBClassifier(
            random_state=random_seed,
            input_cols=feature_cols,
            label_cols=target_col,
            output_cols=['PREDICTION'],
            passthrough_cols=['INDIVIDUAL_SK', 'DATE_MONTH'],
            **hyperparameters
        )

    elif model_type == 'lightgbm':
        model = LGBMClassifier(
            random_state=random_seed,
            input_cols=feature_cols,
            label_cols=target_col,
            output_cols=['PREDICTION'],
            passthrough_cols=['INDIVIDUAL_SK', 'DATE_MONTH'],
            **hyperparameters
        )

    # Assumed imports; the original post does not show them.
    from snowflake.snowpark import Window
    from snowflake.snowpark import functions as F

    def undersample_majority_class(df):
        # Whole years since the first lead (TIME_SINCE_FIRST_LEAD is presumably in months).
        df_with_seniority = df.with_column(
            "years_since", (F.col('TIME_SINCE_FIRST_LEAD') / 12).cast('int')
        )

        # Rank each individual's rows in a seeded random order.
        df_with_random = df_with_seniority.with_column('random_order', F.random(seed=random_seed))
        window_spec = Window.partition_by("INDIVIDUAL_SK").order_by(F.col('random_order').asc())
        df_ranked = df_with_random.with_column("month_rank", F.row_number().over(window_spec))

        # Keep 1 majority row per individual past 10 years, otherwise up to 2.
        df_majority = df_ranked.filter(F.col("CONVERSION_INDICATOR") == 0)
        df_majority_sampled = df_majority.filter(
            ((F.col("years_since") > 10) & (F.col("month_rank") == 1))
            | ((F.col("years_since") <= 10) & (F.col("month_rank") <= 2))
        )

        # Drop the helper columns and append all minority (converted) rows.
        df_majority_sampled = df_majority_sampled.drop('years_since', 'month_rank', 'random_order')
        df_minority = df.filter(F.col("CONVERSION_INDICATOR") == 1)
        df_balanced = df_majority_sampled.union_all(df_minority)

        return df_balanced
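
To check whether the sampling step itself contributes to the run-to-run differences, it may help to compare it with a variant whose result cannot depend on row order. The following is only a minimal sketch in plain Python, not Snowpark code: it applies the same keep rule as the function above, but ranks each individual's rows by a hash of the row's own values instead of a random draw. The helper names stable_order_key and undersample_majority_rows are hypothetical, and the sketch assumes that (INDIVIDUAL_SK, DATE_MONTH) identifies a row uniquely.

```python
import hashlib
from collections import defaultdict


def stable_order_key(row, seed=42):
    # Hypothetical helper: a sort key derived only from the seed and the row's
    # own values. Assumes (INDIVIDUAL_SK, DATE_MONTH) is unique per row.
    payload = f"{seed}|{row['INDIVIDUAL_SK']}|{row['DATE_MONTH']}"
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def undersample_majority_rows(rows, seed=42):
    # Group every row (both classes) by individual, mirroring the window
    # partition in the original function.
    groups = defaultdict(list)
    for row in rows:
        groups[row["INDIVIDUAL_SK"]].append(row)

    kept = []
    for group in groups.values():
        # The rank depends only on the hash, never on the incoming row order.
        ranked = sorted(group, key=lambda r: stable_order_key(r, seed))
        for rank, row in enumerate(ranked, start=1):
            if row["CONVERSION_INDICATOR"] == 1:
                kept.append(row)  # minority rows are always kept
                continue
            years_since = int(row["TIME_SINCE_FIRST_LEAD"] / 12)
            limit = 1 if years_since > 10 else 2
            if rank <= limit:
                kept.append(row)

    # Sort the output so that two runs can be compared directly.
    return sorted(kept, key=lambda r: (r["INDIVIDUAL_SK"], r["DATE_MONTH"]))
```

If two runs of a sampler like this produce identical training sets and the metrics still differ, the remaining variation would have to come from a later step, such as the train/test split or the model fit.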

I don't know how to fix this.

