datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformati...
datapipelines-essentials-python issues
Could you describe the setup of your environment to mock the basic infrastructure necessary to run this library in local development and in the AWS cloud? I saw reference to...