data-infra icon indicating copy to clipboard operation
data-infra copied to clipboard

Cal-ITP data infrastructure

Results 177 data-infra issues
Sort by recently updated
recently updated
newest added

# Description _Describe your changes and why you're making them. Please include the context, motivation, and relevant dependencies._ We've cycled through several offboarding of contractors and consultants and we should...

# Description Added a new sample notebook to show how to make a parameterized notebooks. Updated the docs with the link to this new notebook, as well as some additional...

The following modifications are suggested: - [x] add agency_id from routes.txt or ageny_name from agencies.txt - [ ] add a description for earliest_tap: `The earliest transaction associated with a customer_id.`...

In #2773, we identify two buckets that we want to move to cold storage (`gtfs-data` and `gtfs-data-test`). Since the ticket's creation, we've determined that applying GCP's [AutoClass](https://cloud.google.com/storage/docs/autoclass) feature to these...

infrastructure

## User story / feature request In order to comply with Caltrans security parameters, we should remove all sensitive Elavon data from our data pipeline. 1. We need to remove...

security

## User story / feature request In order to comply with Caltrans security parameters, we should remove all sensitive Littlepay data from our data pipeline. 1. We need to remove...

security

As a Cal-ITP data user, I want obsolete Payments data models to be deprecated from the data warehouse so it's clear which data is current and maintained for active use....

data-pipeline-ingestion-and-modeling

After Littlepay's recent adjustment to their publishing cadence to better suit our analytics needs, we found that the new publishing time was too late for our `transform_warehouse` DAG start time...

data-pipeline-ingestion-and-modeling

Following rollout of V2 open data publishing, there remain a few things to take care of in our underlying tables and associated code when we get the chance. These include:...

open-data

As an analytics engineer, I want all of our columns to be documented in dbt so that future maintainers and users of the warehouse will understand what each column is...

documentation
product: transit-data-quality