FlashRAG
[Update] new functions and bug fixes
In this PR, I made the following changes:
- Add support for `flash-attention-2` in `HFCausalLMGenerator`, and add the Llama-3 special tokens when initializing the model.
- Add multi-GPU support for the refiner, significantly accelerating the refining process.
- Fix a list-index-out-of-bounds bug in `selective-context`.
- Add model paths for `recomp` and fix model loading.
- Fix a bug in saving `intermediate_data.json` when using `bm25s`.
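For context, Hugging Face Transformers exposes FlashAttention-2 through the `attn_implementation` argument of `from_pretrained`, so the generator change likely amounts to passing that flag plus registering Llama-3's turn-end token. A minimal sketch, assuming a standard Transformers setup (the model id and stop-token handling below are illustrative, not taken from the PR):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"  # illustrative model id

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,               # flash-attn requires fp16/bf16
    attn_implementation="flash_attention_2",  # needs the flash-attn package installed
    device_map="auto",
)

# Llama-3 ends assistant turns with <|eot_id|> rather than the plain EOS token,
# so it must be added as an extra stop token for generation to terminate
# correctly (an assumption about what "Llama-3 special token" refers to here).
stop_ids = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]
outputs = model.generate(**tokenizer("Hello", return_tensors="pt").to(model.device),
                         eos_token_id=stop_ids, max_new_tokens=32)
```

This is a model-loading configuration sketch; the actual `HFCausalLMGenerator` wiring in FlashRAG may differ.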