TinyQi issues

Results 11 issues of


                                            TinyQi

[BUG]the following arguments are required: user_script, user_args

when I run cmd like {python train.py --actor-model facebook/opt-1.3b --reward-model facebook/opt-350m --num-gpus 1}，this error happens。how can i fix it? **ERROR LOG:** usage: ds [-h] [-H HOSTFILE] [-i INCLUDE] [-e EXCLUDE]...

bug

[BUG]: No module named 'colossalai.kernel.op_builder'

### 🐛 Describe the bug My execution command：torchrun --standalone --nproc_per_node=4 ./examples/train_sft.py --pretrain "/share/disk1/xiangchaoqi/gpt_test/pytorch_model-00033-of-00033.bin" --model 'llama' --strategy colossalai_zero2 --log_interval 10 --save_path output/Coati-7B --dataset /share/disk1/xiangchaoqi/gpt_test/instinwild_ch.json --batch_size 4 --accimulation_steps 8 --lr 2e-5 --max_datasets_size...

bug

If you want to add arguments to the training such as the ones you list above (e.g., --gradient_checkpointing) you'll need to add them after `main.py` in the script for example:

If you want to add arguments to the training such as the ones you list above (e.g., --gradient_checkpointing) you'll need to add them after `main.py` in the script for example:...

Training code

Hello ,would u like to release the training code ? pls hh

V4 OCR的检测模型，导出为atlas格式后，无法跑在atlas 300ipro上

我现在尝试使用C++，在atlas 300Ipro 上推理PaddleOCR的模型。但是在推理的时候遇到了问题。问题一：当我直接将paddle_inference的模型导出成nb模型，可以正常推理，但是OCR 的V4的检测模型的**结果为空** 问题二：为了解决问题一，我猜测是不是因为V4的模型的输出有2个输出，为此我进一步验证，我去除了V4模型的其中一个无用的输出之后，再重新导出nb模型，此时运行后会崩溃。崩溃时的打印如下： [I 4/22 20: 0:17.455 ...r/src/driver/huawei_ascend_npu/engine.cc:41 Context] properties: HUAWEI_ASCEND_NPU_SELECTED_DEVICE_IDS=0 [I 4/22 20: 0:17.455 ...r/src/driver/huawei_ascend_npu/engine.cc:66 Context] selected device ids: [I 4/22 20: 0:17.455 ...r/src/driver/huawei_ascend_npu/engine.cc:68 Context] 0...

礼貌询问，请问有没有训练代码呀

RTRT, 礼貌询问，请问有没有训练代码呀

无法复现教程中的OCR v3检测模型的量化精度

我使用文档中：https://github.com/PaddlePaddle/PaddleSlim/tree/release/2.6/example/auto_compression/ocr的量化脚本进行量化，经过训练后模型的精度下降很大，请问我哪里做错了嘛？精度的前后对比如下： ![image](https://github.com/PaddlePaddle/PaddleSlim/assets/28993936/f98745e4-dc6d-4e65-aee1-12d5f4a8bb9d) 我使用的配置文件如下： Global: model_type: det model_dir: /share/disk3/xcq/02.model_cache/pretrain_models/ch_PP-OCRv3_det_infer_bak/ model_filename: inference.pdmodel params_filename: inference.pdiparams algorithm: DB Distillation: alpha: 1.0 loss: l2 QuantAware: use_pact: true activation_bits: 8 is_full_quantize: false onnx_format: True activation_quantize_type:...

TinyQi