kongjibai

Results 9 issues of kongjibai

As the title, how to run a **custom** **tensorflow**/**pytorch**/**horovod** train demo with Volcano?

lifecycle/stale

您好!请问0.20.11-5.3.12版本中单个页面的标记数目上限如何设置?目前上限50个,再标记就会提示去设置中修改,但在设置中并没有找到相关的选项。

**What happened**: using `kubectl apply -f lm-horovod-tf-mnist-v0.5.yaml` to create a horovod tf job, it outputs `job.batch.volcano.sh/lm-horovod-job created` or `job.batch.volcano.sh/lm-horovod-job configed`, but `kubectl get pod` outputs `No resources found in default...

kind/bug
lifecycle/stale

when i use volcano start a horovod tf job,the `lm-horovod-job-master-0` node will run error and after restart 3 times, it'll be Running status, because it `Permanently added 'lm-horovod-job-worker-0.lm-horovod-job,10.10.10.10' (ECDSA) to...

priority/important-soon

**What happened**: I uninstall volcano-1.5.1 by `kubectl delete -f ./volcano-1.5.1/volcano-development.yaml`, and reinstall volcano-release-1.6 by `kubectl apply -f ./volcano-release-1.6/volcano-development.yaml`. When I apply a vcjob reference the step 2 of `https://volcano.sh/en/docs/tutorials/`, `kubectl...

kind/bug

您好!很高兴您能够分享您的项目,不过我用您的代码直接运行`python main.py --phase train`会报错,提示: ValueError: Variable vgg_16/conv1/conv1_1/weights already exists, disallowed. Did you mean to set reuse=True in VarScope? Originally defined at: File "/home/user/anaconda2/lib/python2.7/site-packages/tensorflow/contrib/framework/python/ops/variables.py", line 216, in variable use_resource=use_resource) File "/home/user/anaconda2/lib/python2.7/site-packages/tensorflow/contrib/framework/python/ops/arg_scope.py",...

您好!很高兴您能够分享关于ACSCP算法的实现,关于您的实现我有几个疑问还请指教: 1)论文中说,生成网络的最后一层后面应该是tanh激活函数,但是看您用的是先Sigmoid后ReLU? 2)生成网络是一个U-Net的结构,在镜像对称的卷积和反卷积层中间添加了skip connections,不应该是e1连接d8、e2连接d7吗?看您是e1连接d7、e2连接d6呢? 3)训练方面,作者训练了300个epoch,前100个epoch,λc=0;后200个epoch λc=10,不知您是否是这样训练的? 4)另外您给出的结果示例,应该是IMG_173_A的图吧,可是原测试数据集中给的ground truth是483个人,而您给出的结果中ground truth人数是707人呢? 5)我用您的代码和数据集训练,在product时生成的密度图咋有很多竖条纹? 6)由密度图得出人数,直接round(sum(sum(mp)))就可以了,这是什么原理? 希望您能够在百忙之中抽出时间予以解答,不生感激,谢谢!

Have anyone run the project successfully?

博主,您好!很感谢您分享opencv调用mask rcnn模型的方法,写的很棒,通俗易懂! 但是我现在遇到一些问题,就是如何将现有Keras、TensorFlow训练的.h5模型转换为opencv可调用的.pbtxt和.pb文件呢?希望您不吝赐教!