bgcflow
bgcflow copied to clipboard
Fail to build database when multiple antiSMASH version result are present
The command bgcflow build database
fails when there are multiple antiSMASH versions in the data_warehouse
. This is because dbt will fetch all parquet of the different versions and then fail the test because there are duplicates of the ids (genome_id, region_id, etc). The dirty fix is to clean up the processed folder to only have result from 1 antiSMASH version. Or improve the DBT schema to only use the latest version of the result.