I have a use case where I need to regularly copy data from one table to another within the same Azure Data Explorer (ADX) database. Here's the specific context:
- Number of tables: around 50 in the database.
- Data volume: each table can hold approximately 200 GB of data.
- Frequency: the copy should run every 30 minutes.
I compared several options (Azure Data Factory (ADF), Azure Functions, AzCopy, Logic Apps, and Power Automate), and ADF seemed to be the best fit.
Cluster details:
- SKU: Standard_L16as_v3 (Large, 16 vCPUs, 128 GB memory, 3500 GB cache)
- Instances: autoscaling enabled, up to 20
I am using 30 tables for testing.
The pipeline setup looks like this:
- A master pipeline looks up the tables in the database and triggers a child pipeline for each table.
- The child pipeline gets the extents for the table and runs a copy activity for each extent (to allow better concurrency).
- In the ForEach activity, the batch count is set to 50 (I had to lower it because of throttling in ADF).
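To make the fan-out above concrete, here is a minimal Python sketch of the same shape: one child run per table, and at most 50 extent copies in flight at a time. It is only an illustration, not ADF code. `copy_extent`, `run_child_pipeline`, and `run_master_pipeline` are hypothetical names, the copy itself is a stand-in that does no I/O, and running the tables one after another is an assumption of the sketch.

```python
from concurrent.futures import ThreadPoolExecutor

BATCH_COUNT = 50  # mirrors the ForEach batch count from the question


def copy_extent(table, extent_id):
    """Hypothetical stand-in for one copy activity; returns what it copied."""
    return (table, extent_id)


def run_child_pipeline(table, extent_ids, batch_count=BATCH_COUNT):
    """Copy every extent of one table, at most batch_count at a time."""
    with ThreadPoolExecutor(max_workers=batch_count) as pool:
        # pool.map keeps the results in the same order as extent_ids
        return list(pool.map(lambda e: copy_extent(table, e), extent_ids))


def run_master_pipeline(extents_by_table):
    """Run the child pipeline once per table (sequentially in this sketch)."""
    return {
        table: run_child_pipeline(table, extent_ids)
        for table, extent_ids in extents_by_table.items()
    }
```

For example, `run_master_pipeline({"T1": ["e1", "e2"], "T2": ["e3"]})` copies three extents in total, with the per-table concurrency capped by `BATCH_COUNT`.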
The pipeline run currently takes approximately 1 hour.
What can I do to improve the performance, and is there a better solution?
The target is to finish the pipeline run within 30 minutes. Is that even possible?
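As a rough sanity check on the 30-minute target, the sustained throughput it implies can be worked out directly. This sketch assumes the worst case, where each run copies the full 200 GB of every table; the question does not say whether the copy is full or incremental, so treat the result as an upper bound. `required_throughput_gb_per_s` is a hypothetical helper name.

```python
def required_throughput_gb_per_s(tables, gb_per_table, window_minutes):
    """Sustained GB/s needed to move tables * gb_per_table within the window."""
    total_gb = tables * gb_per_table
    return total_gb / (window_minutes * 60)


# Test setup from the question: 30 tables, ~200 GB each (assumed full copy).
target = required_throughput_gb_per_s(30, 200, 30)   # 30-minute target
current = required_throughput_gb_per_s(30, 200, 60)  # observed ~1 hour run
print(f"target: {target:.2f} GB/s, current: {current:.2f} GB/s")
```

Under that assumption the target works out to about 3.33 GB/s across the cluster, twice the roughly 1.67 GB/s implied by the current one-hour run.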
Tags: azure. Title: Seeking performance improvement while using ADF with ADX (Stack Overflow)