seasonal-flu icon indicating copy to clipboard operation
seasonal-flu copied to clipboard

Exclude sequences with unusual collection dates

Open joverlee521 opened this issue 6 months ago • 2 comments

Context

@huddlej flagged sequences with unusual collection dates on Slack, where date == date_submitted. We should exclude these sequences from the builds because this is a clear metadata issue.

Possible solutions

  1. Add (date != date_submitted) to all of the filter queries across all configs
  2. Add a new filter rule in the main workflow to exclude these sequences for all builds
  3. Add a new filter rules in the upload workflow to exclude these sequences in our S3 files
  4. Add specific sequences to outliers.txt (e.g. https://github.com/nextstrain/seasonal-flu/commit/8209b359af8941d947e78565db983f9610f2a1ac)

joverlee521 avatar Aug 06 '24 17:08 joverlee521