GCP cloud storage unstructured dataset connection support for `generate` and `annotate dataset`

Open iamkelllly opened this issue 3 years ago • 0 comments

To ease onboarding to Fides, fidesctl should support unstructured datasets, in addition to structured ones, when generating Fides annotations. This would let adopters create dataset annotations quickly and directly from unstructured data sources.

This issue is a spike (final scope still WIP) covering:

  • handling the connection to GCP storage bucket(s)
  • determining file path(s) in bucket(s)
  • creating annotation defaults for files in various paths (collection), down to the file-level (field).
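
As a rough illustration of the last two bullets, here is a minimal sketch of how object paths from a bucket might be grouped into collections (one per directory path) and fields (one per file). Everything in it is an assumption made for discussion, not the fidesctl implementation: the function name `build_dataset_skeleton`, the shape of the returned dictionary, and the `DEFAULT_CATEGORY` placeholder are all hypothetical. The object names are passed in as plain strings; in practice they could come from the `google-cloud-storage` client, for example the `name` of each blob returned by `storage.Client().list_blobs(bucket_name)`.

```python
from collections import defaultdict
from posixpath import basename, dirname

# Hypothetical placeholder default; the real default annotation is part of
# the open scope of this spike.
DEFAULT_CATEGORY = "system.operations"


def build_dataset_skeleton(bucket_name, object_names):
    """Group object paths into collections (directories) and fields (files).

    Assumed mapping: bucket -> dataset, directory path -> collection,
    file name -> field.
    """
    files_by_dir = defaultdict(list)
    for name in object_names:
        if name.endswith("/"):
            # Skip zero-byte "folder" placeholder objects.
            continue
        # Files at the bucket root are grouped under "/".
        files_by_dir[dirname(name) or "/"].append(basename(name))

    collections = [
        {
            "name": directory,
            "fields": [
                {"name": file_name, "data_categories": [DEFAULT_CATEGORY]}
                for file_name in sorted(files)
            ],
        }
        for directory, files in sorted(files_by_dir.items())
    ]
    return {"fides_key": bucket_name, "name": bucket_name, "collections": collections}


if __name__ == "__main__":
    import json

    example = build_dataset_skeleton(
        "my-bucket",
        ["exports/2021/users.csv", "exports/2021/orders.csv", "readme.txt"],
    )
    print(json.dumps(example, indent=2))
```

Keeping the grouping logic separate from the bucket connection means it can be exercised without GCP credentials, and the same function could be reused if other storage backends are added later.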

iamkelllly, Nov 29 '21 15:11