aws-sdk-pandas
aws-sdk-pandas copied to clipboard
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Par...
*P.S. Please do not attach files as it's considered a security risk. Add code snippets directly in the message body as much as possible.*
**Is your feature request related to a problem? Please describe.** Ensure `wr.s3.read_parquet` runs at scale leveraging Ray **Additional context** This is part of the Wrangler @ Scale release **Links** -...
### Feature or Bugfix - Feature - Refactoring ### Detail - Ray support for timestream write - num_threads argument changed to use_threads to be consistent with the rest of awswrangler...
**Is your idea related to a problem? Please describe.** No **Describe the solution you'd like** It would be good to have support for CDC data lake formats like Apache Hudi,...
Currently using wr.redshift.to_sql/copy() to store data to redshift in an airflow pipeline, where multiple dags write to the same table. I run into this serializable isolation error: `Serializable isolation violation...
**Describe the bug** When reading a large parquet file from S3 using read_parquet, I get errors like `ValueError: Length mismatch: Expected axis has 75536 elements, new values have 6741043 elements`....
### Feature or Bugfix - Bugfix ### Detail - S3 SELECT queries with a Scan range don't really make sense for parquet files, that's why we should fall back to...
**Is your idea related to a problem? Please describe.** Add ability to load/update data from a data frame via the bulk loader. **Describe the solution you'd like** The fastest way...
**Is your idea related to a problem? Please describe.** Currently when using `wr.s3.read_parquet(path=list_of_paths)` with a list of file paths, if any individual file does not exist, the following error is...
**Describe the bug** Query regex expressions with a double backslash will cause the query to error when executed by `awswrangler.athena.read_sql_query()`. But the same query run in the Athena console will...