azure - Seeking performance improvement while using ADF with ADX - Stack Overflow

IT技术

更新时间：2025-01-136

admin管理员组
文章数量:1441803

I have a use case where I need to regularly copy data from one table to another within the same Azure Data Explorer (ADX) database. Here's the specific context:

Number of Tables: Around 50 tables in the database. Data Volume: Each table can have approximately 200GB of data. Frequency: Data copying should occur every 30 minutes.

I have compared multiple possibilities like ADF, AzureFuntions, AzCopy, Logic Apps, Power Automate

ADF seemed to be the best fit.

Cluster details: SKU: Standard_L16as_v3, Large, 16 vCPUs, Memory 128GB, Cache 3500GB Instances: auto scaling enabled upto 20 I am having 30 tables for testing purpose

The pipeline setup looks like this

There is a Master Pipeline which looks for the table in the database and triggers child pipeline for each table
The child pipeline gets the extent for the table and runs copy activity for each extent. (To support better concurrency)
In forEach activity batch count is set to: 50 (Needed to take it down because of throttling in ADF)

The pipeline run is taking approx 1 hour

What can I do to improve the performance and is there any better solution?

Target is to run pipeline with 30 minutes. Is it even possible?

本文标签： azureSeeking performance improvement while using ADF with ADXStack Overflow

版权声明：本文标题：azure - Seeking performance improvement while using ADF with ADX - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1736697836a1948276.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

azure - Seeking performance improvement while using ADF with ADX - Stack Overflow

更多相关文章