spark-on-aws-lambda
spark-on-aws-lambda copied to clipboard
reading csv from s3 throwing this error using (sample-spark-script-csv-to-csv.py) file
df=spark.read.option('header','true').csv(input_path) java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.s3a.S3AFileSystem not found please help how to resolve this issue .
@saiteja2420 What is the input_path you are providing? Is it directly a S3 URL/URI?
@saiteja2420 we are using S3a://
@saiteja2420 if it csv to csv , then hadoop-aws.jar file version might be the problem, have you changed any version in the Dockerfile?
No activity