Shitao Xiao

Results 509 comments of Shitao Xiao

The Hugging Face Trainer should enable wandb automatically. Would adding this code cause a conflict with that behavior?

Hello, this data comes from the experiments section of the DuReader paper, which used COVID data for retrieval.

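The task is too easy, so you need to make it harder. You are not using negatives: train_group_size should be > 1 (it is the number of pos + neg; the number of positives is always 1, so train_group_size - 1 is the number of samples drawn from neg). You can also increase per_device_train_batch_size, which makes the task harder as well. Do not raise the temperature; we recommend 0.01-0.1. A temperature that is too high makes the model hard to optimize. In our experience, 0.02 works well.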
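To illustrate why a group without negatives is trivially easy, here is a minimal sketch of a temperature-scaled contrastive loss over one group. It is not the library's implementation: the function names `build_group` and `info_nce_loss`, the similarity scores, and the choice to sample with replacement when there are too few negatives are all assumptions made for illustration.

```python
import math
import random


def build_group(pos, negs, train_group_size, rng):
    """Return one positive followed by train_group_size - 1 sampled negatives.

    Hypothetical helper: mirrors the rule that a group always holds exactly
    one positive, so the remaining slots are filled from the negatives.
    """
    num_neg = train_group_size - 1
    if num_neg <= len(negs):
        sampled = rng.sample(negs, num_neg)
    else:
        # Assumption: fall back to sampling with replacement when short.
        sampled = [rng.choice(negs) for _ in range(num_neg)]
    return [pos] + sampled


def info_nce_loss(scores, temperature):
    """Cross-entropy over similarity scores, with the positive at index 0."""
    logits = [s / temperature for s in scores]
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[0]


rng = random.Random(0)
group = build_group("pos passage", ["neg 1", "neg 2", "neg 3"], 3, rng)

# With a single candidate the loss is zero at any temperature,
# so the model has nothing to learn from that group.
loss_no_neg = info_nce_loss([0.8], temperature=0.02)

# With negatives in the group the loss becomes positive.
loss_with_neg = info_nce_loss([0.8, 0.7, 0.6], temperature=0.02)
```

In this sketch a larger train_group_size adds more competing candidates to the softmax, which is one way to read the advice that more negatives make the task harder.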

I don't think this is abnormal. The loss only fluctuates between 0 and 0.15, which is a small range.

Don't install both the CPU and GPU versions of faiss. You can uninstall faiss-cpu and reinstall faiss-gpu.

You can try to upgrade the datasets package.

Currently, this script doesn't support encoder-decoder architecture.

I think this error is caused by faiss. You can run faiss on the CPU instead of the GPU.

The data format is:
```
{"content": "A is ..."}
{"content": "B is ..."}
{"content": "C is ..."}
{"content": "Panda is ..."}
{"content": "... is A"}
```
, where each line...
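As a rough sketch of how a file in this format could be written and checked, assuming each line holds one standalone JSON object with a `content` field as the example suggests. The helper names `to_jsonl` and `parse_jsonl` are hypothetical and not part of any library.

```python
import json


def to_jsonl(texts):
    """Serialize each text as one JSON object per line under the key 'content'."""
    lines = (json.dumps({"content": t}, ensure_ascii=False) for t in texts)
    return "\n".join(lines) + "\n"


def parse_jsonl(raw):
    """Parse JSON Lines text and return the 'content' value of every record."""
    contents = []
    for line_no, line in enumerate(raw.splitlines(), start=1):
        if not line.strip():
            continue  # tolerate blank lines
        record = json.loads(line)
        if "content" not in record:
            raise ValueError(f"line {line_no} has no 'content' field")
        contents.append(record["content"])
    return contents


raw = to_jsonl(["A is ...", "B is ..."])
contents = parse_jsonl(raw)
```

Validating the file this way before training can catch a malformed line early, since a record that is not valid JSON raises an error at the offending line.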