CEPE
Preprint: Long-Context Language Modeling with Parallel Encodings
Congratulations on your excellent work. Intuitively, introducing new, untrained parameters for cross-attention may lead to a high loss at the start of training. Could you please share your loss curve? Thanks a lot!
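(One common way to avoid such a loss spike, sketched below, is to zero-initialize the output projection of each newly added cross-attention block so the model initially behaves exactly like the base LM. This is a generic PyTorch sketch, not the CEPE implementation; `CrossAttentionBlock`, `d_model`, `n_heads`, and `encoder_states` are hypothetical names.)

```python
import torch
import torch.nn as nn

class CrossAttentionBlock(nn.Module):
    """Hypothetical cross-attention block inserted into a decoder layer.

    The output projection is zero-initialized, so the block is a no-op at
    step 0 and the initial loss stays at the base LM's level.
    """
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.out_proj = nn.Linear(d_model, d_model)
        nn.init.zeros_(self.out_proj.weight)  # contribute nothing at initialization
        nn.init.zeros_(self.out_proj.bias)

    def forward(self, hidden_states: torch.Tensor, encoder_states: torch.Tensor) -> torch.Tensor:
        # Decoder hidden states attend to the (parallel) encoder outputs.
        attn_out, _ = self.attn(hidden_states, encoder_states, encoder_states)
        # Zero-initialized projection: the residual stream passes through unchanged at first.
        return hidden_states + self.out_proj(attn_out)
```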
I'm curious about the discrepancies between my results (in red) and the results reported in your paper (in black), both obtained using the default parameters with the run_qa.sh...
Congratulations on your excellent work! I attempted to run `bash scripts/run_streamingllm_lm.sh` to reproduce the results of streaming_llm, but I encountered the following error: ``` TypeError: llama_pos_shift_attention_forward() got an unexpected keyword...
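(Errors of this form usually mean the installed transformers version passes newer keyword arguments that the monkey-patched attention forward does not accept. Pinning transformers to the version listed in the repo's requirements is the safest fix; the sketch below is only a generic workaround and is not taken from the CEPE scripts.)

```python
import inspect
from functools import wraps

def tolerate_extra_kwargs(fn):
    """Wrap a monkey-patched attention forward so that keyword arguments
    introduced by newer transformers releases are dropped instead of
    raising TypeError. Generic sketch only; silently dropping arguments
    can change behavior, so prefer matching the pinned transformers version.
    """
    accepted = set(inspect.signature(fn).parameters)

    @wraps(fn)
    def wrapper(*args, **kwargs):
        filtered = {k: v for k, v in kwargs.items() if k in accepted}
        return fn(*args, **filtered)

    return wrapper

# Hypothetical usage, assuming the patched forward is assigned to the attention class:
# LlamaAttention.forward = tolerate_extra_kwargs(llama_pos_shift_attention_forward)
```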
Hello, I'm processing the RedPajama data and it's unacceptably slow, especially the book domain. Do you have any suggestions? Alternatively, could you share a copy of your processed training data? Thanks a lot!
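(One generic way to speed up this kind of preprocessing, not necessarily what the authors did, is to tokenize with Hugging Face datasets in batched, multi-process mode, since tokenizing long book documents is usually CPU-bound. The file paths, tokenizer name, and num_proc below are placeholders.)

```python
from datasets import load_dataset
from transformers import AutoTokenizer

# Placeholders: point these at the actual RedPajama book shards and the base model's tokenizer.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
ds = load_dataset("json", data_files="book/*.jsonl", split="train")

def tokenize(batch):
    # Length/truncation settings are illustrative only.
    return tokenizer(batch["text"], truncation=False)

# batched=True + num_proc spreads tokenization across CPU cores,
# which is typically the bottleneck for the book domain.
ds = ds.map(tokenize, batched=True, num_proc=16, remove_columns=ds.column_names)
ds.save_to_disk("tokenized_books")
```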