textspotter
textspotter copied to clipboard
Hi, Please advice : ``` PROTOC src/caffe/proto/caffe.proto CXX src/caffe/solvers/sgd_solver.cpp CXX src/caffe/solvers/nesterov_solver.cpp CXX src/caffe/solvers/rmsprop_solver.cpp CXX src/caffe/solvers/adadelta_solver.cpp CXX src/caffe/solvers/adam_solver.cpp CXX src/caffe/parallel.cpp CXX src/caffe/solvers/adagrad_solver.cpp CXX src/caffe/internal_thread.cpp CXX src/caffe/solver.cpp CXX src/caffe/layers/accuracy_layer.cpp CXX src/caffe/layers/recurrent_layer.cpp CXX...
论文中训练的第二步后期,需要将检测结果输入到 text-align layer ,请问这里具体是怎么实现的呢?如果计算识别的损失呢?是通过将检测的结果和GT进行IoU的计算来判断检测结果和GT标注的bbox相对应从而得到识别的GT吗?谢谢! @tonghe90
Q1:in train.pt ,"gt_bbox" is noted by ” N * 8 ### grounding truth boxes for text (for computing loss)” but in Class gen_gts_layer which in tool_layers.py it is noted by...
请问有人成功训练了吗?望指导一下
如题,我在train.pt中没看到文字识别的ground truth,如果没有文字的标签,文字识别部分如何训练呢?
我把loss_4s和iou loss层都注释掉了,现在仅有文字识别的softmaxwithloss损失函数(mask loss和iou loss都不参与训练); 然后自己写了一个输入数据层,可以输出包含文字的图片(640*640大小), 作为gt的bbox的四个点的坐标以及文字的标签同时输出; 但是训练时候遇到segmentation fault, 提示内存越界; 请问输入给point bilinear layer的bbox大小有什么限制吗?64*8个采样点的条件下, 输入的bbox大小是否有什么要求?
Hi, Please can you tell the steps taken for pre-procesing synthtext labels ?? your model uses fixed max length of 25 but synthtext dataset has boxes with labels length(number of...
可以识别中文吗?
您好,请问下这个网络可以识别中文吗?或者用比较小的改动来识别中文字符?
如题,用两块6gb(或8gb)的显卡和用一块12gb的显卡进行训练或测试有什么区别?