yanghu819 comments

Results 17 comments of yanghu819

DiffuSeq-v2 checkpoint release

Hi, great work. Where is the checkpoint?

lavin-7B results cannot be reached; two runs with identical code give very different results

I ran the 7B model on a single A100, with dependency versions matching the official ones. Run command: CUDA_VISIBLE_DEVICES=0 /opt/anaconda3/envs/lavin/bin/torchrun --nproc_per_node 1 --master_port 11111 train.py \ --llm_model 7B\ --llama_model_path ../data/weights/ \ --data_path ../data/alpaca_data.json \ --max_seq_len 512 \ --batch_size 4 \ --accum_iter 8 \ --epochs 20...

lavin-7B results cannot be reached; two runs with identical code give very different results

Could you share the specific parameter settings for a single-card A100 40G setup?

lavin-7B results cannot be reached; two runs with identical code give very different results

Results on an A100 40G, still falling short: [8737] {'acc_natural': '87.66', 'acc_social': '94.71', 'acc_language': '85.64', 'acc_has_text': '87.15', 'acc_has_image': '86.86', 'acc_no_context': '88.08', 'acc_grade_1_6': '89.79', 'acc_grade_7_12': '86.49', 'acc_average': '88.61'} Versions of torch and related packages: torch 1.13.0+cu117 transformers 4.37.0.dev0 bitsandbytes 0.41.3.post2 Full environment: name:...

[Badcase]: Cannot Reproduce results on benchmark datasets (e.g. humaneval)

Same problem here.

[Badcase]: Cannot Reproduce results on benchmark datasets (e.g. humaneval)

Could the maintainers please release official eval code? With my own setup, the output either repeats itself or never stops.

Great job, questions about the results

I found that acc: 0.05 was due to my incomplete training data. After using the correct gsm8k, the result is a lot better, but there are still some issues. ![Image](https://github.com/user-attachments/assets/ff97582b-bea7-447b-85ba-765ed6332c75) ![Image](https://github.com/user-attachments/assets/a4abde8c-fb1e-423d-9a01-6f26de84e161) the...