FlagEmbedding issues

sparse向量的存储类型

3

想请教您一个问题，dense vector的列表的表示可以是list[float]。那么sparse向量的存储类型应该是list[]什么呢？

Made flag_reranker compute_score output type consistent.

- Remove returning inconsistent value type to function output type. The other existing code using that function is expected to be consistent with that function output type.

muazhari

麻烦您帮我看一下，为什么在微调bge-m3的时候会出现如此报错。之前在没有query数量为1、pos数量为1、neg数量为10的时候为微调训练正常进行；目前调整为query数量为1、pos数量为11、neg数量为10，却有报错信息，我查看了train_data，发现没有什么问题。微调训练命令如下 nohup \ torchrun --nproc_per_node 2 \ -m FlagEmbedding.baai_general_embedding.finetune.run \ --output_dir /bgem3/supervised_simcse_fine-tune \ --model_name_or_path /bgem3 \ --train_data query_pos_neg_data.jsonl \ --learning_rate 1e-5 \ --fp16 \ --num_train_epochs 200 \ --per_device_train_batch_size...

LLLiHaotian

When will Visualized BGE paper be released?

1

ae86208

m3显存

3

# bge-m3 torchrun --nproc_per_node 8 \ -m FlagEmbedding.reranker.run \ --output_dir model \ --model_name_or_path bge-m3 \ --train_data rerank.jsonl \ --learning_rate 6e-5 \ # --deepspeed /ds_config.json # --gradient_checkpointing --fp16 \ --num_train_epochs 5...

songge25