Segmentation fault (core dumped)

Open shakey-cuimiao opened this issue 5 years ago • 6 comments

When training on my own data, I get a core dump:

loading annotations into memory...
Done (t=0.00s)
creating index...
index created!
2020-10-09 19:32:55,926 - mmdet - INFO - Start running, host: shakey@xiaoi-778, work_dir: /opt/shakey/road_car_objection/SOLO/work_dirs/toy/solov2_light_release_r50_fpn_2gpu_3x
2020-10-09 19:32:55,926 - mmdet - INFO - workflow: [('train', 1)], max: 72 epochs
/usr/anaconda3/lib/python3.6/site-packages/torch/nn/functional.py:2479: UserWarning: Default upsampling behavior when mode=bilinear is changed to align_corners=False since 0.4.0. Please specify align_corners=True if the old behavior is desired. See the documentation of nn.Upsample for details.
  "See the documentation of nn.Upsample for details.".format(mode))
Segmentation fault (core dumped)

shakey-cuimiao • Oct 09 '20 11:10

@shakey-cuimiao Is there enough memory?

WXinlong • Oct 19 '20 01:10

@WXinlong How can I reduce memory usage? I am training on a single GPU with a dataset of just 20 images, and I hit the same problem. The machine has more than 20 GB of RAM (system memory, not GPU memory).
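
To check whether system memory is actually the limit, it can help to measure it before and during training. The following is a minimal sketch using only the Python standard library; it is not part of SOLO or mmdet, the helper names `peak_rss_mib` and `available_ram_mib` are hypothetical, and reading `/proc/meminfo` assumes a Linux host.

```python
import resource
import sys


def peak_rss_mib():
    """Peak resident set size of the current process, in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def available_ram_mib(meminfo_path="/proc/meminfo"):
    """MemAvailable in MiB, or None if it cannot be read (non-Linux hosts)."""
    try:
        with open(meminfo_path) as f:
            for line in f:
                if line.startswith("MemAvailable:"):
                    # The value in /proc/meminfo is given in kB.
                    return int(line.split()[1]) / 1024
    except OSError:
        return None
    return None


print(f"peak RSS: {peak_rss_mib():.0f} MiB")
print(f"available RAM: {available_ram_mib()} MiB")
```

Calling these just before the training loop starts gives a rough picture. If available RAM is still large at that point, memory exhaustion is a less likely explanation for the segmentation fault.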

zhangchaosunshine • Nov 26 '20 09:11

Has this been solved? I have the same problem.

LiangYong1216 • Dec 03 '20 02:12

No, it is not solved.

zhangchaosunshine • Dec 03 '20 02:12

Some people say the cause is that the pretrained weights file fails to load, but I don't know how to deal with that either.

LiangYong1216 • Dec 03 '20 02:12

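It doesn't look like a pretrained-weights loading problem. When I debugged it, the crash happened right as training starts, at this call: runner.run(data_loaders, cfg.workflow, cfg.total_epochs).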
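
To narrow down which Python frame is active when the process dies inside that call, one option is the standard-library `faulthandler` module, which writes the Python traceback of every thread when a fatal signal such as SIGSEGV arrives. This is a minimal sketch, not something from the SOLO codebase: the helper name `enable_crash_tracebacks` and the log file name are hypothetical, and it assumes the snippet runs near the top of the training script, before the training loop starts.

```python
import faulthandler


def enable_crash_tracebacks(log_path="segfault_traceback.log"):
    """Write Python tracebacks to log_path if the process gets a fatal signal."""
    # The file object must stay open for as long as faulthandler is enabled,
    # so it is returned and should be kept referenced by the caller.
    log_file = open(log_path, "w")
    faulthandler.enable(file=log_file, all_threads=True)
    return log_file


crash_log = enable_crash_tracebacks()

# The training entry point would be called after this, for example:
# runner.run(data_loaders, cfg.workflow, cfg.total_epochs)
```

The same handler can be turned on without editing any code by starting the interpreter with `python -X faulthandler`, which writes the traceback to stderr. The traceback only shows the Python side of the crash; the fault itself usually sits in a compiled extension underneath the last frame listed.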

zhangchaosunshine • Dec 03 '20 03:12