discoverx
discoverx copied to clipboard
A Swiss-Army-knife for your Data Intelligence platform administration.
I have some chinese table name, when I apply vacuum command sql, it raised "[PARSE_SYNTAX_ERROR] Syntax error at or near 'VACUUM'.(line 1, pos 0)" Best Regards
Refactored project structure to comply to Databricks Labs blueprint standard. - Added pyproject.toml - Refactored namespace - Added custom __init__.py - Aligned version file format and changelog format
opt=dx.from_tables("dbdemos|uc_test.*.*") Currently unable to parse multiple catalog at the same time to apply scan on multiple tables
[]([https://codecov.io/github/databrickslabs/discoverx](https://github.com/databrickslabs/discoverx))
utility on top of discoverx to run Delta Housekeeping across multiple tables Analysis that provides stats on Delta tables / recommendations for improvements, including: - stats:size of tables and number...
 localcheckpoint was not working so replaced it with cache. then we got the below error  SparkRuntimeException: [UDF_ERROR.PAYLOAD] Execution of function analyze_series(text#38854) failed - failed to set payload. org.apache.spark.SparkException:...
`DataExplorer` new method `map_chunked` as an alternative to `map`: - `map` processes the tables one by one - `map_chunked` processes the tables in chunks of size `tables_per_chunk`
I had assumed that by providing two columns to `having_columns` I would only get tables that had both columns, but after some failed optimize commands and further inspection it appears...
Amazing work! https://www.databricks.com/dataaisummit/session/map-your-lakehouse-content-discoverx/ I see the docs outline the capability to apply discovery tags to tables (columns). Request to build out a full example that does this