from pyspark.sql import SparkSession
from pyspark.sql.window import Window
from pyspark.sql.functions import skewness, kurtosis, stddev
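# Window and the skewness/kurtosis/stddev aggregates are for the
# feature-engineering step; they aren't used yet in the snippet below.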

from airflow.configuration import conf

import sys

def transform_forex_data(file_path, access_key, secret_key):
    try:
        print(f"CSV FILE PATH: {file_path}")

        # this works by specifying spark.jars.packages = org.apache.hadoop:hadoop-aws:3.2.0
        # so the S3A connector is pulled in at startup
        spark = SparkSession.builder.appName('feature-engineering') \
            .config("spark.jars.packages", "org.apache.hadoop:hadoop-aws:3.2.0") \
            .config("spark.hadoop.fs.s3a.access.key", access_key) \
            .config("spark.hadoop.fs.s3a.secret.key", secret_key) \
            .config("spark.hadoop.fs.s3a.impl", "org.apache.hadoop.fs.s3a.S3AFileSystem") \
            .config("spark.hadoop.fs.s3a.aws.credentials.provider", "org.apache.hadoop.fs.s3a.SimpleAWSCredentialsProvider") \
            .getOrCreate()

        # spark._jsc.hadoopConfiguration().set("fs.s3a.access.key", access_key)
        # spark._jsc.hadoopConfiguration().set("fs.s3a.secret.key", secret_key)

        usd_php_forex_4h_spark_df = spark.read.csv(file_path, header=True, inferSchema=True)
        usd_php_forex_4h_spark_df.createOrReplaceTempView("usd_php_forex")

    except Exception as e:
        print(f"Error {e} has occured.")

if __name__ == "__main__":
    # read the argument vector passed in by the spark submit job operator;
    # argv[1] is the path to the newly saved .csv file
    file_path = sys.argv[1]
    print(file_path)

    # get secrets
    AWS_ACCESS_KEY_ID = conf.get("secrets", "aws_access_key_id")
    AWS_SECRET_ACCESS_KEY = conf.get("secrets", "aws_secret_access_key")

    # pass file path to task
    transform_forex_data(file_path=file_path,
        access_key=AWS_ACCESS_KEY_ID,
        secret_key=AWS_SECRET_ACCESS_KEY)
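
For reference, the script is launched from the DAG with something like the sketch below (assuming the Apache Spark provider is installed; the task_id, application path, and bucket name here are placeholders, not my real values). Whatever goes in application_args ends up in sys.argv:

from airflow.providers.apache.spark.operators.spark_submit import SparkSubmitOperator

bucket_name = "my-bucket"  # placeholder

transform_forex = SparkSubmitOperator(
    task_id="transform_forex_data",
    conn_id="spark_default",
    # placeholder path to the script above
    application="/opt/airflow/dags/scripts/transform_forex_data.py",
    # the first (and only) element becomes sys.argv[1] in the script
    application_args=[f"s3a://{bucket_name}/raw/usd_php_forex_4hour.csv"],
)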

I've tried providing the spark.hadoop.fs.s3a.impl configuration with the value org.apache.hadoop.fs.s3a.S3AFileSystem, and I've supplied my AWS access key ID and secret access key in order to read the .csv file from the bucket. I've also built the URI that Spark reads with "s3a" instead of "s3", e.g. "s3a://{bucket_name}/raw/usd_php_forex_4hour.csv", which is the file_path variable.
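
For what it's worth, the credentials themselves can be sanity-checked outside Spark with something like this boto3 snippet (a minimal sketch; the bucket name is a placeholder):

import boto3

# same keys fetched from the Airflow config above
s3 = boto3.client(
    "s3",
    aws_access_key_id=AWS_ACCESS_KEY_ID,
    aws_secret_access_key=AWS_SECRET_ACCESS_KEY,
)
# raises botocore.exceptions.ClientError if the credentials or key are wrong
s3.head_object(Bucket="my-bucket", Key="raw/usd_php_forex_4hour.csv")

Am I missing something here?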
