gpdb
gpdb copied to clipboard
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Gpfdist is a high-performance ETL tool to load external data for gpdb. In practice, while gpfdist can extract the full performance of the network card, gpfdist can also affect other...
Gpfdist is a high-performance ETL tool to load external data for gpdb. In practice, while gpfdist can extract the full performance of the network card, gpfdist can also affect other...
Please review commits individually.
Derive Combined Hashed Spec For Outer Joins Issue: Outer join operations enforce unnecessary data redistribution, causing execution of ORCA plans to be much slower than that of planner plans. Root...
When there are predicates from correlated EXISTS/ANY subquery that we can infer for outer relation, Orca fails to infer that predicates, which may cause bad plan. This patch fixes that...
This PR is broken into 2 main commits to implement feature: - 39b34ae083d51f46548bfee30d9646d05cf76007: add framework to force tuple routing in execution of modify table node - 21a8bb0804873eefa526091304de64f81f5530a8: remove fallbacks in...
…verted markdown files ( `gpdb-docs/mardkwon` directory)
These commits are some more low-hanging FIXMEs that seemed straightforward.
Currently, analyze will only collect HLL specific stats if it is a leaf partition. This presents an issue if an ordinary table is analyzed, then exchanged or added to a...
We're going to build gpdb on runway/buildweb in the future, on which we are unable to fetch submodules from outside repos. This patch helps collapse pythonsrc-ext submodules. Previously, if the...