EliverQ issues

Results 7 issues of


                                            EliverQ

An issue example of new prompt designing tips.

Ingredient : Task Description Prompt content: Make your prompt as detailed as possible, e.g., "Summarize the article into a short paragraph within 50 words. The major storyline and conclusion should...

example

Welcome more tips for designing prompts

We welcome everyone to provide us with more relevant tips in the form of issues. After selection, we will regularly update them on GitHub and indicate the source. If you...

enhancement

An explanation of the model selection rule in Figure 1

As we mention in the survey, we only include LLMs (larger than 10B) with publicly reported evaluation results in Figure 1. Excluding models with papers (because formal evaluation results are...

good first issue

Evaluation settings of INSTRUCTOR

Hello! I have a very puzzling question that I would like to ask. Since your model is fine-tuned with instructions, why not use instructions during benchmark evaluations (e.g. MTEB)?

Reproduction of training INSTRUCTOR

Hello! I must say, Instructor is truly an amazing project, and I'm eager to replicate your training process. Nevertheless, despite following your training settings, I'm unable to achieve comparable performance...

xformers error when fine-tuning open_llama_3B with memory_efficient_attention

Hi, I feel confused about this bug when using memory_efficient_attention. It seems that the embed per head you choose can't match with xformers? ``` NotImplementedError: No operator found for `memory_efficient_attention_forward`...

请问paddle中是否有类似gradient_checkpointing的功能？或者是否还有什么别的省显存的方式？

### 请提出你的问题 Please ask your question 请问paddle中是否有类似gradient_checkpointing的功能？因为如果不开tensor和pipline并行的话似乎paddle的sharding并不会节省太多显存，或者是否还有什么别的省显存的方式？现在4卡微调7b模型batch_size开到2都会爆掉

status/new-issue

type/question