VectorDBBench
VectorDBBench copied to clipboard
Is there any naming rules for external dataset file names ?
- download "bioasq_medium_1m" and "bioasq_large_10m" dataset from "s3://assets.zilliz.com/benchmark"
- save above dataset to local dir, and treat them as customized dataset
- do test with following command
- vectordb-bench try to get the file with name "train.parquet" but failed
- if I add a symbol link "train.parquet", and let it link to "shuffle_train.parquet", the test can run smoothly
- but for "bioasq_large_10m", the train data file name is like "shuffle_train-00-of-10.parquet", vectordb-bench still want to get the file with name "train.parquet" and fail
Is there any naming rules for external dataset file names ?