Phani Kumar
Phani Kumar
Yes @pankajkoti and @pankajastro will work on it together
Since we use Trino Python client for Airflow, and plan to build an asyncio compatible operator, it would benefit the Airflow user community if there is support for asyncio
We've got the local connection working for the SparkSubmitOperator yesterday post further analysis. I think this story is unblocked for now @sunank200 , @bharanidharan14 - please comment.
@sunank200 lets stop further effort on SparkSQLOperator and you can start on LivyOperator
Below operators are on hold due to connectivity issue between airflow worker container and EMR Spark - SparkSubmitOperator - SparkSqlOperator - SparkJDBCOperator - SparkKubernetesOperator
Please refer to this https://www.notion.so/astronomerio/HiveOperator-Roblox-work-2dbfbc70418844379d7a71764fd12498 documentation to try creating a Hive connection from Airflow
Steps to connect to Amazon Hive from airflow worker container: - Open SSH tunnel from the airflow worker node using the below command and then airflow container would consider localhost...
Surprising why I didnt had to make the changes you've mentioned @rajaths010494
Parked for now due to limitations on PyHive and Impyla libraries. @rajaths010494 can you update the contributing guide as discussed
are we ready with the PR for this? It is in draft state at the moment.