ASFF
ASFF copied to clipboard
segment error(core dumped
using cude using tfboard segment error(core dumped)
please provide more information about your error.....I have no idea what happened on your machine
TRAIN: LR: 0.001 MOMENTUM: 0.9 DECAY: 0.0005 BURN_IN: 5 MAXEPOCH: 300 COS: True SYBN: False#True MIX: True NO_MIXUP_EPOCHS: 30 LABAL_SMOOTH: True BATCHSIZE: 1 IMGSIZE: 608 IGNORETHRE: 0.7
train script: python main.py --cfg config/yolov3_baseline.cfg -d VOC --tfboard --ngpu 1 --checkpoint weights/darknet53_feature_mx.pth --start_epoch 0 --half --log_dir log/VOC -s 608
The version of you cuda, pytorch, apex and so on. And the details of your error information.... I know you use the default script, but obviously the error is not in the script.
If I use only one GPU , Is my train script right? thanks
I didn't test the code without distributed training, so even with single gpu, I suggest you keep the distributed training. And your batchsize is only one, which could significantly depress your performance.
@Note-Liu have you solverd it??
@Note-Liu have you solverd it??
no.[cry][cry]
I think it may be caused by the wrong gcc version error and the DCN module part. When I compile the CenterNet code, it happened once! You can update the gcc to 5.0 or above and try the lateset DCN module from the original github.