Optimized Analytics Package for Spark Platform (OAP)
Optimized Analytics Package for Spark Platform (OAP)
gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
raydp
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
cloudtik
Cloud Scale Platform for Distributed Analytics and AI
remote-shuffle
Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-disks.
sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
oap-mllib
Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.
oap-tools
Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S.