arezamoosavi / acidonspark-etl
Delta-Lake, ETL, Spark, Airflow
I don't know how to log in to MinIO or how to set the user and password. Can you help me, please? Thank you.
Bro, good work. I am struggling to get Airflow up. Would it be possible to connect
Hi!
I'm totally new to Spark/Hive/Airflow and I ran into an error while running spark_create_table.py.
It seems it's not able to access the _symlink_format_manifest on MinIO. That seems strange to me, as it was created just one step earlier.
Traceback (most recent call last):
File "/opt/airflow/dags/etl/spark_create_table.py", line 31, in <module>
spark.sql(sql_delta_table)
File "/opt/spark/python/lib/pyspark.zip/pyspark/sql/session.py", line 723, in sql
File "/opt/spark/python/lib/py4j-0.10.9-src.zip/py4j/java_gateway.py", line 1305, in __call__
File "/opt/spark/python/lib/pyspark.zip/pyspark/sql/utils.py", line 117, in deco
pyspark.sql.utils.AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:Got exception: java.nio.file.AccessDeniedException s3a://datalake/deltatables/employees/_symlink_format_manifest: getFileStatus on s3a://datalake/deltatables/employees/_symlink_format_manifest: com.amazonaws.services.s3.model.AmazonS3Exception: Forbidden (Service: Amazon S3; Status Code: 403; Error Code: 403 Forbidden; Request ID: 1736C1CB94490DFC; S3 Extended Request ID: null), S3 Extended Request ID: null:403 Forbidden)
BTW: I am able to run:
spark.sql("SELECT * FROM delta.`s3a://datalake/deltatables/employees/` ").show()
I really appreciate the work you do.
Thanks a lot!
Adrian
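One possible reading of the traceback above, offered as a guess rather than a confirmed diagnosis: the 403 is wrapped in a HiveException/MetaException, so the getFileStatus call that fails may be issued with the Hive metastore's S3A settings rather than Spark's, which would also explain why the direct delta.`s3a://...` read works. A minimal sketch for comparing the two sides, assuming the standard Hadoop S3A property names; the endpoint and credential values below are hypothetical placeholders, not values taken from this project:

```python
def s3a_properties(endpoint, access_key, secret_key):
    """Build the Hadoop S3A properties for an S3-compatible store such as MinIO."""
    return {
        "fs.s3a.endpoint": endpoint,
        "fs.s3a.access.key": access_key,
        "fs.s3a.secret.key": secret_key,
        # MinIO is usually addressed path-style (http://host/bucket/key).
        "fs.s3a.path.style.access": "true",
    }


def as_spark_conf(props):
    """Prefix Hadoop properties with 'spark.hadoop.' so Spark forwards them to Hadoop."""
    return {f"spark.hadoop.{key}": value for key, value in props.items()}


# Hypothetical placeholder values; replace with the ones used by your MinIO service.
props = s3a_properties("http://minio:9000", "<access-key>", "<secret-key>")
spark_conf = as_spark_conf(props)

# Print the property names that should agree on the Spark side and the metastore side.
for key in sorted(spark_conf):
    print(key)
```

Each pair in spark_conf can be passed to SparkSession.builder.config(key, value). The unprefixed names in props are the ones a Hadoop or Hive configuration file would use, so they are the ones to check on the metastore side.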
Can you please add steps as well to run each one after
Hi, is there a way we can add Livy to the project?