one-yolov5
A more efficient yolov5 with oneflow backend 🎉🎉🎉
add no_grad to avoid `RuntimeError: Can only set leaf tensor's data.`
Command: `python -m oneflow.distributed.launch --nproc_per_node 2 train.py --batch 64 --data data/mydata.yaml --device 0,1` Environment: python3.9 + cu117 + oneflow0.9.0, 2080ti. Traceback (most recent call last): File "/home/liaozhongke/deeplearning/one-yolov5/train.py", line 672, in main(opt) File "/home/liaozhongke/deeplearning/one-yolov5/train.py", line 568, in...
- [ ] Track GPU memory issues
- [ ] `deepcopy(m)`: abnormal GPU memory usage (issue: https://github.com/Oneflow-Inc/one-yolov5/issues/101)
- [ ] Classification model performance
- [ ] Provide CPU export support
- [ ] [YOLOv5-with-Neural-Magic's-DeepSparse](https://github.com/ultralytics/yolov5/wiki/YOLOv5-with-Neural-Magic's-DeepSparse)
- [Introduction](#introduction)
- [one-yolo test results]
- [one-yolov5 project data]
- [one-yolo detailed test data]
- [Fix plan]
- [Resources]

## Introduction

Two nsys reports were captured for one-yolov5/classify/train.py.

one-yolo_profile: [03-29-07-10profile.zip](https://github.com/wearmheart/worker/files/11097987/03-29-07-10profile.zip)

torch-yolo_profile: [torch_03-29-08-37profile.zip](https://github.com/wearmheart/worker/files/11098388/torch_03-29-08-37profile.zip)

## one-yolo test results...
- [ ] Use profiling tools to optimize performance (targets: GLM repository performance optimization https://github.com/Oneflow-Inc/libai/tree/main/projects/GLM, one-yolov5 classification model performance optimization)
- [ ] Learn OneFlow memory management

### Getting started with profiling tools

- nsys: [Getting started with nsys using NVTX annotations](https://github.com/Oneflow-Inc/one-yolov5/issues/122#issuecomment-1484386632)
- py-spy: https://github.com/Oneflow-Inc/one-yolov5/issues/120#issue-1616558768

### one-yolov5 project

> Project URL: https://github.com/Oneflow-Inc/one-yolov5 >...
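## Preface

## py-spy analysis

## Reliably reproducible code

## Upcoming plans

## Preface

While working on "Locating the C++ code behind PyTorch Python APIs" (https://github.com/Oneflow-Inc/OneTeam/issues/147), I tried [py-spy](https://github.com/pytorch/pytorch/edit/master/CONTRIBUTING.md#profiling-with-py-spy), a profiling tool recommended by the PyTorch project. It led to this PR: https://github.com/Oneflow-Inc/one-yolov5/pull/111. In the classification model training test, the few seconds by which each run is slower than PyTorch are in tloss...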
## Target accuracy

### [Pretrained Checkpoints](https://github.com/ultralytics/yolov5/releases/tag/v6.1)

### [New Segmentation Checkpoints](https://github.com/ultralytics/yolov5/releases/tag/v7.0)

### [New Classification Checkpoints](https://github.com/ultralytics/yolov5/releases/tag/v6.2)
link https://github.com/Oneflow-Inc/oneflow/pull/9536
> Multi-GPU Training https://github.com/ultralytics/yolov5/issues/475

## Problem description

After increasing batch_size, mAP_0.5:0.95 dropped by `2.4750000000000014` over 300 epochs of training. The data is shown in the table below:

| | gpu | batch_size | cfg | metrics/mAP_0.5 | metrics/mAP_0.5:0.95 |
|--------|-----|------------|--------------|------------------|-----------------------|
| Target | 2 |...
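
The drop quoted above, `2.4750000000000014`, has the shape of binary floating-point noise from a subtraction rather than a meaningful 16-digit measurement. A minimal sketch of tidying it for a report, assuming three decimal places is enough precision (the helper name `clean_delta` and its `ndigits` default are illustrative and not part of one-yolov5):

```python
def clean_delta(delta: float, ndigits: int = 3) -> float:
    """Round a metric difference to a fixed number of decimal places.

    Hypothetical helper for reporting only; it does not change how the
    metric itself is computed.
    """
    return round(delta, ndigits)


# Value copied from the issue text; the trailing digits are float noise.
reported_drop = 2.4750000000000014
print(clean_delta(reported_drop))
```

Rounding only at the reporting step keeps the stored metrics untouched while making the comparison easier to read.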
Which version of oneflow are you using?