data-infra icon indicating copy to clipboard operation
data-infra copied to clipboard

Cal-ITP data infrastructure

Results 177 data-infra issues
Sort by recently updated
recently updated
newest added

There are [3 instances of this error to date](https://sentry.calitp.org/organizations/sentry/issues/77423/events/?referrer=issue-stream&statsPeriod=90d&stream_index=0): Error message: ```console model.calitp_warehouse.fct_daily_rt_feed_validation_notices - Database Error in model fct_daily_rt_feed_validation_notices (models/mart/gtfs_quality/fct_daily_rt_feed_validation_notices.sql) Resources exceeded during query execution: Your project or organization exceeded...

data-pipeline-ingestion-and-modeling

We're running v20.0.0 of the Sentry Helm chart, and v25.8.1 is the current latest New versions include much saner defaults for reducing resource usage and improving reliability in self-hosted instances:...

kubernetes

Per request from @natam1, we should set this field to have Metabase type "Category" to facilitate use in drop-down fields: https://github.com/cal-itp/data-infra/blob/main/warehouse/models/mart/gtfs_quality/_mart_gtfs_quality.yml#L240 We should possibly consider setting the same field to...

product: transit-data-quality

Build some kind of pipeline that checks whether fare validator payment acceptance devices are working properly. May need to needed utilize device-monitoring APIs from Kuba / ProData (Proxima). MST and...

project-payments

Improve the data pipeline such that it can ingest raw TIDES data from external data sources, parse the data and transform the data into mart tables to use for various...

project-analytics
product: transit-data-quality
data-pipeline-ingestion-and-modeling

# Description This should fix the dbt bug. I think what is happening is that in June we used the column named holiday_website_status for an array, [Current] [Missing] [Off-Season], etc....

**Describe the bug** Due to the way this is written, new tables don't appear in bigquery automagically: https://github.com/cal-itp/data-infra/blob/main/airflow/dags/create_external_tables/airtable/external_airtable_california_transit_services.yml **Expected behavior** All the missing columns appear in bigquery!

# Description Testing a template for automatically creating github maintenance issues. Resolves #3460 ## Type of change - [x] New feature ## How has this been tested? Testing in progress!...

## User story / feature request As an analyst using data from the warehouse, I want the integrity of the data to be ensured, So that my analyses don't have...

data-pipeline-ingestion-and-modeling

**Describe the bug** MV Shuttle has a "false" value for `dim_provider_gtfs_data`.`public_customer_facing_or_regional_subfeed_fixed_route`. It probably should be set to "true". **To Reproduce** See [saved metabase question](https://dashboards.calitp.org/question/3090-mv-shuttle-in-dim-provider-gtfs-data). **Expected behavior** `dim_provider_gtfs_data`.`public_customer_facing_or_regional_subfeed_fixed_route` value for MV...

data-pipeline-ingestion-and-modeling