I am trying to uncompress a .gz file in Azure Data Factory with a Copy activity, but it does not actually unzip the file; it just copies it to the sink location. I built the configuration the same way as one I already use to uncompress .zip files, and that one works. The only change is the "Compression Type", which I set for gzip, yet the .gz file arrives in the sink container unchanged. Attached is the source dataset configuration for the .gz file that should be uncompressed.
Source data set configuration
Asked Mar 11 at 11:25 by Shripad Daware; edited Mar 11 at 13:53 by bryanbcook.
- Can you provide a sample file that you have tried so far? – Rakesh Govindula Commented Mar 12 at 5:54
- Sorry, I couldn't find an option to upload a file to my question, but here is the link I downloaded the .gz file from: getsamplefiles/sample-archive-files/gzip. That page lists several files; the first one, named sample-1.gz, is the file I am referring to. – Shripad Daware Commented Mar 12 at 6:52
1 Answer
Azure Data Factory Copy activity not able to uncompress GZ files.
We can uncompress the .gz file using a Copy activity by following this process.
The gzip file (sample-1.gz) is stored in the "input container" within Blob Storage.
- Create the source dataset (for the .gz file):
Select Azure Blob Storage as the data store and Binary as the format in the dataset options.
After creating the source dataset, open it.
- Set the compression type to "gzip (.gz)" and the compression level to "Fastest".
- Create the sink dataset:
Select the same storage, format type, and linked service as the source.
Set the compression type to "No Compression" so the file is written uncompressed.
- Settings and performance tuning:
- On the Settings tab, set Degree of copy parallelism to 2 and check Enable staging.
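For anyone who prefers to compare against the JSON view of the datasets, here is a rough sketch of what the steps above correspond to, built as Python dicts and printed as JSON. The dataset names, the linked service name `AzureBlobStorageLS`, and the container names `input` and `output` are placeholders I made up, not values from the question; treat the whole thing as an approximation to check against your own factory rather than a definitive definition.

```python
import json


def binary_dataset(name, container, file_name=None, compression=None):
    """Build a Binary dataset definition on Azure Blob Storage as a plain dict."""
    location = {"type": "AzureBlobStorageLocation", "container": container}
    if file_name:
        location["fileName"] = file_name
    type_properties = {"location": location}
    if compression:
        # Only the source dataset carries a compression block.
        type_properties["compression"] = compression
    return {
        "name": name,
        "properties": {
            "type": "Binary",
            "linkedServiceName": {
                "referenceName": "AzureBlobStorageLS",  # hypothetical linked service name
                "type": "LinkedServiceReference",
            },
            "typeProperties": type_properties,
        },
    }


# Source: the .gz file, declared as gzip so the Copy activity decompresses it on read.
source = binary_dataset(
    "SourceGzip", "input", "sample-1.gz", {"type": "gzip", "level": "Fastest"}
)

# Sink: same store and format, no compression block at all.
sink = binary_dataset("SinkPlain", "output")

# Copy activity settings from the Settings tab. Staging also needs a
# stagingSettings block with its own linked service, omitted here.
copy_settings = {"parallelCopies": 2, "enableStaging": True}

print(json.dumps(source, indent=2))
print(json.dumps(sink, indent=2))
print(json.dumps(copy_settings, indent=2))
```

If the .gz file is still copied unchanged, the first thing worth comparing is whether the source dataset JSON in your factory actually contains the `compression` block and the sink dataset does not.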
Trigger and monitor the pipeline.
Check that the sink file is larger than the source file.
- The file has been successfully uncompressed.
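File size alone is only a rough signal. If you download the source and sink files, a stricter check is possible with Python's standard `gzip` module: a file that was merely copied still starts with the gzip magic bytes `1f 8b`, while a properly uncompressed one matches the decompressed source. The sketch below uses in-memory sample bytes instead of real blobs, and the helper names `is_gzip` and `was_uncompressed` are my own.

```python
import gzip


def is_gzip(data: bytes) -> bool:
    """Return True if the bytes start with the gzip magic number."""
    return data[:2] == b"\x1f\x8b"


def was_uncompressed(source_bytes: bytes, sink_bytes: bytes) -> bool:
    """Return True if the sink holds the decompressed content of the gzip source."""
    if is_gzip(sink_bytes):
        # The sink is still a gzip stream, so the file was copied as-is.
        return False
    return gzip.decompress(source_bytes) == sink_bytes


# Stand-in data: in practice, read these from the downloaded source and sink files.
original = b"sample text line\n" * 1000
compressed = gzip.compress(original)   # plays the role of sample-1.gz
restored = gzip.decompress(compressed)  # what the sink should contain

print(was_uncompressed(compressed, restored))    # sink properly uncompressed
print(was_uncompressed(compressed, compressed))  # sink is just a copy of the .gz
print(len(compressed), len(restored))
```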