Provide Superset as a service for data exploration

Open blarghmatey opened this issue 1 year ago • 0 comments

User Story

  • As a data platform owner, I want to deploy an installation of Superset, so I can provide data consumers with a portal for exploration and visualization

Description/Context

Deploy a customized installation of Superset on our own AWS infrastructure

Acceptance Criteria

  • [x] Superset is deployed
  • [x] Superset is integrated with Keycloak for Authn/Authz
  • [x] Superset is connected to Trino for querying the data lake
  • [ ] dbt mart models are published to Superset as datasets
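
To make the Keycloak and Trino criteria concrete, here is a minimal sketch of helpers that a custom Superset settings module could call. It is an illustration under stated assumptions, not the configuration actually deployed: the function names, the `keycloak` provider name, the scopes, and all hostnames are hypothetical. The endpoint paths follow Keycloak's standard OpenID Connect layout (older Keycloak releases prefix them with `/auth`), and the dict shape follows Flask-AppBuilder's `OAUTH_PROVIDERS` convention.

```python
from urllib.parse import quote


def keycloak_oauth_provider(base_url, realm, client_id, client_secret):
    """Build one OAUTH_PROVIDERS entry for a Keycloak realm (hypothetical helper)."""
    realm_url = f"{base_url.rstrip('/')}/realms/{quote(realm, safe='')}"
    oidc_url = f"{realm_url}/protocol/openid-connect"
    return {
        "name": "keycloak",  # assumed provider name
        "icon": "fa-key",
        "token_key": "access_token",
        "remote_app": {
            "client_id": client_id,
            "client_secret": client_secret,
            "client_kwargs": {"scope": "openid email profile"},  # assumed scopes
            "server_metadata_url": f"{realm_url}/.well-known/openid-configuration",
            "api_base_url": f"{oidc_url}/",
            "access_token_url": f"{oidc_url}/token",
            "authorize_url": f"{oidc_url}/auth",
        },
    }


def trino_sqlalchemy_uri(user, host, port, catalog, schema):
    """Build a SQLAlchemy URI for the Trino dialect: trino://user@host:port/catalog/schema."""
    return f"trino://{quote(user, safe='')}@{host}:{port}/{catalog}/{schema}"
```

In a settings module, the first helper's result would typically be placed in `OAUTH_PROVIDERS = [...]` alongside `AUTH_TYPE = AUTH_OAUTH` (imported from `flask_appbuilder.security.manager`), with the client secret read from the environment or a secrets store rather than hard-coded. Mapping Keycloak roles onto Superset roles may additionally need a custom security manager, which this sketch does not cover. The Trino URI is what would be entered when registering the database connection in Superset.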

Plan/Design

  • [x] Create a Pulumi project for managing the AWS infrastructure to support Superset
  • [x] Create a customized Docker image for our Superset installation
    • [x] Custom Python settings module and plugins
    • [x] Specify additional Python dependencies and versions
  • [ ] Integrate dbt with Superset following examples from https://engineering.hometogo.com/how-hometogo-connected-dbt-and-superset-to-make-metadata-more-accessible-and-reduce-analytical-2223af539cc6
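
One possible shape for the remaining dbt step, sketched under assumptions rather than taken from the linked article: read dbt's `manifest.json`, pick out the models that live under a mart directory, and build request bodies for Superset's dataset-creation endpoint (`POST /api/v1/dataset/`). The function names, the `marts` directory name, and the idea of selecting marts via the model's `fqn` are hypothetical choices; `database_id` stands for the ID Superset assigned to the Trino connection.

```python
import json


def load_manifest(path):
    """Read a dbt manifest.json (usually written to the project's target/ directory)."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def mart_dataset_payloads(manifest, database_id, mart_dir="marts"):
    """Return one dataset-creation payload per dbt model found under `mart_dir`."""
    payloads = []
    for node in manifest.get("nodes", {}).values():
        # Skip tests, seeds, snapshots, and other non-model nodes.
        if node.get("resource_type") != "model":
            continue
        # Assumption: mart models carry the mart directory name in their fqn.
        if mart_dir not in node.get("fqn", []):
            continue
        payloads.append(
            {
                "database": database_id,
                "schema": node["schema"],
                # dbt builds the relation under the alias when one is set.
                "table_name": node.get("alias") or node["name"],
            }
        )
    # Sort for a stable, reviewable order.
    return sorted(payloads, key=lambda p: (p["schema"], p["table_name"]))
```

Each payload could then be sent with an authenticated HTTP client; handling datasets that already exist, and syncing column descriptions or metrics from dbt, would need additional calls not shown here.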

blarghmatey, Dec 07 '23 20:12