bgcflow icon indicating copy to clipboard operation
bgcflow copied to clipboard

Fail to build database when multiple antiSMASH version result are present

Open matinnuhamunada opened this issue 9 months ago • 0 comments

The command bgcflow build database fails when there are multiple antiSMASH versions in the data_warehouse. This is because dbt will fetch all parquet of the different versions and then fail the test because there are duplicates of the ids (genome_id, region_id, etc). The dirty fix is to clean up the processed folder to only have result from 1 antiSMASH version. Or improve the DBT schema to only use the latest version of the result.

matinnuhamunada avatar May 01 '24 12:05 matinnuhamunada