pyserini
pyserini copied to clipboard
version conflict: the doc of experiments-nfcorpus.md
It says that
python -m pyserini.encode \
input --corpus collections/nfcorpus/corpus.jsonl \
--fields title text \
output --embeddings indexes/nfcorpus.bge-base-en-v1.5 \
--to-faiss \
encoder --encoder BAAI/bge-base-en-v1.5 **--l2-norm** \
--device cpu \
--pooling mean \
--fields title text \
--batch 32
python -m pyserini.encode \
input --corpus collections/nfcorpus/corpus.jsonl \
--fields title text \
output --embeddings indexes/nfcorpus.bge-base-en-v1.5 \
--to-faiss \
encoder --encoder BAAI/bge-base-en-v1.5 --l2-norm \
--device cpu \
--pooling mean \
--fields title text \
--batch 32
but in the latest version, it could find --l2-norm --pooling mean
I think the maintainers maybe forget the issue