rknn-toolkit2 icon indicating copy to clipboard operation
rknn-toolkit2 copied to clipboard

yolov5s在RKNN3588上以FP16推理时的精度问题

Open blueWatermelonFri opened this issue 1 year ago • 1 comments

通过本地的ubuntu服务器对yolov5s模型进行连板调试时,fp16的精度下降了4个点,请问这个现象正常吗?

rknntoolkit2的版本如下: rknn-toolkit2 version: 2.0.0b0+9bab5682 rk3588的驱动版本如下:

D RKNNAPI:   API: 2.0.0b0 (18eacd0 build@2024-03-22T06:07:59)
D RKNNAPI:   DRV: rknn_server: 2.0.0b0 (18eacd0 build@2024-03-22T14:07:19)
D RKNNAPI:   DRV: rknnrt: 2.0.0b0 (35a6907d79@2024-03-24T10:31:14)

rknn.config的参数如下:

rknn.config(mean_values=[[0, 0, 0]], 
                std_values=[[255, 255, 255]], 
                target_platform="rk3588")

精度分析结果如下:

rknn convert start!
I rknn-toolkit2 version: 2.0.0b0+9bab5682
--> Config model
done
--> Loading model
I It is recommended onnx opset 19, but your onnx model opset is 13!
I Model converted from pytorch, 'opset_version' should be set 19 in torch.onnx.export for successful convert!
I Loading : 100%|██████████████████████████████████████████████| 121/121 [00:00<00:00, 39667.87it/s]
done
--> Building model
W build: The dataset='calibrate_dataset.txt' is ignored because do_quantization = False!
I rknn building ...
I rknn buiding done.
done
--> Export rknn model
done
--> Accuracy analysis
adb: unable to connect for root: closed
I target set by user is: rk3588
I Get hardware info: target_platform = rk3588, os = Linux, aarch = aarch64
I Check RK3588 board npu runtime version
I Starting ntp or adb, target is RK3588
I Start adb...
I Connect to Device success!
I NPUTransfer: Starting NPU Transfer Client, Transfer version 2.1.0 (b5861e7@2020-11-23T11:50:36)
D RKNNAPI: ==============================================
D RKNNAPI: RKNN VERSION:
D RKNNAPI:   API: 2.0.0b0 (18eacd0 build@2024-03-22T06:07:59)
D RKNNAPI:   DRV: rknn_server: 2.0.0b0 (18eacd0 build@2024-03-22T14:07:19)
D RKNNAPI:   DRV: rknnrt: 2.0.0b0 (35a6907d79@2024-03-24T10:31:14)
D RKNNAPI: ==============================================
D RKNNAPI: Input tensors:
D RKNNAPI:   index=0, name=images, n_dims=4, dims=[1, 640, 640, 3], n_elems=1228800, size=2457600, w_stride = 0, size_with_stride = 0, fmt=NHWC, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI: Output tensors:
D RKNNAPI:   index=0, name=output1, n_dims=4, dims=[1, 18, 20, 20], n_elems=7200, size=14400, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI:   index=1, name=output2, n_dims=4, dims=[1, 18, 40, 40], n_elems=28800, size=57600, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI:   index=2, name=output3, n_dims=4, dims=[1, 18, 80, 80], n_elems=115200, size=230400, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
adb: unable to connect for root: closed
/userdata/dumps/: 89 files pulled. 4.2 MB/s (149901680 bytes in 33.662s)
I Save Tensors to txt: 100%|████████████████████████████████████████| 89/89 [00:03<00:00, 27.86it/s]
I GraphPreparing : 100%|███████████████████████████████████████| 145/145 [00:00<00:00, 15836.22it/s]
I AccuracyAnalysing : 100%|███████████████████████████████████████| 145/145 [00:16<00:00,  8.98it/s]

# simulator_error: calculate the output error of each layer of the simulator (compared to the 'golden' value).
#              entire: output error of each layer between 'golden' and 'simulator', these errors will accumulate layer by layer.
#              single: single-layer output error between 'golden' and 'simulator', can better reflect the single-layer accuracy of the simulator.
# runtime_error: calculate the output error of each layer of the runtime.
#              entire: output error of each layer between 'golden' and 'runtime', these errors will accumulate layer by layer.
#              single_sim: single-layer output error between 'simulator' and 'runtime', can better reflect the single-layer accuracy of runtime.

layer_name                                                  simulator_error                             runtime_error                    
                                                        entire              single                  entire           single_sim          
                                                     cos      euc        cos      euc            cos      euc        cos      euc        
-----------------------------------------------------------------------------------------------------------------------------------
[Input] images                                     1.00000 | 0.0       1.00000 | 0.0           1.00000 | 0.1180    1.00000 | 0.1180      
[Conv] /model.0/conv/Conv_output_0                 1.00000 | 2.4982    1.00000 | 2.4982
[exSwish] /model.0/act/Mul_output_0                1.00000 | 1.9841    1.00000 | 1.3346        1.00000 | 10.165    1.00000 | 10.132      
[Conv] /model.1/conv/Conv_output_0                 1.00000 | 2.4134    1.00000 | 1.8541
[exSwish] /model.1/act/Mul_output_0                1.00000 | 1.9273    1.00000 | 1.0880        1.00000 | 13.101    1.00000 | 8.1516      
[Conv] /model.2/cv1/conv/Conv_output_0             1.00000 | 0.8336    1.00000 | 0.5864
[exSwish] /model.2/cv1/act/Mul_output_0            1.00000 | 0.4872    1.00000 | 0.3146        1.00000 | 3.0559    1.00000 | 1.9833      
[Conv] /model.2/m/m.0/cv1/conv/Conv_output_0       1.00000 | 1.3587    1.00000 | 0.8088
[exSwish] /model.2/m/m.0/cv1/act/Mul_output_0      1.00000 | 0.8585    1.00000 | 0.3834        1.00000 | 4.1053    1.00000 | 1.7504      
[Conv] /model.2/m/m.0/cv2/conv/Conv_output_0       1.00000 | 2.4838    1.00000 | 1.0474
[exSwish] /model.2/m/m.0/cv2/act/Mul_output_0      1.00000 | 2.1420    1.00000 | 0.5897        1.00000 | 6.9511    1.00000 | 2.0777      
[Add] /model.2/m/m.0/Add_output_0                  1.00000 | 2.2869    1.00000 | 0.7157        1.00000 | 7.7536    1.00000 | 0.0         
[Conv] /model.2/cv2/conv/Conv_output_0             1.00000 | 1.5840    1.00000 | 1.0400
[exSwish] /model.2/cv2/act/Mul_output_0            1.00000 | 1.0642    1.00000 | 0.5301        1.00000 | 6.9092    1.00000 | 3.4345      
[Concat] /model.2/Concat_output_0                  1.00000 | 2.5224    1.00000 | 0.6349        1.00000 | 10.385    1.00000 | 0.0         
[Conv] /model.2/cv3/conv/Conv_output_0             1.00000 | 3.0151    1.00000 | 1.0589
[exSwish] /model.2/cv3/act/Mul_output_0            1.00000 | 1.7501    1.00000 | 0.4406        0.99999 | 6.0663    1.00000 | 1.6735      
[Conv] /model.3/conv/Conv_output_0                 1.00000 | 1.1325    1.00000 | 0.3195
[exSwish] /model.3/act/Mul_output_0                1.00000 | 0.5785    1.00000 | 0.1652        0.99999 | 2.2395    1.00000 | 0.3982      
[Conv] /model.4/cv1/conv/Conv_output_0             1.00000 | 0.3326    1.00000 | 0.1160
[exSwish] /model.4/cv1/act/Mul_output_0            1.00000 | 0.1531    1.00000 | 0.0573        1.00000 | 0.5484    1.00000 | 0.1548      
[Conv] /model.4/m/m.0/cv1/conv/Conv_output_0       1.00000 | 0.7294    1.00000 | 0.2583
[exSwish] /model.4/m/m.0/cv1/act/Mul_output_0      1.00000 | 0.4092    1.00000 | 0.1448        1.00000 | 1.4311    1.00000 | 0.3293      
[Conv] /model.4/m/m.0/cv2/conv/Conv_output_0       1.00000 | 0.5763    1.00000 | 0.1975
[exSwish] /model.4/m/m.0/cv2/act/Mul_output_0      1.00000 | 0.2526    1.00000 | 0.0820        1.00000 | 0.8956    1.00000 | 0.2005      
[Add] /model.4/m/m.0/Add_output_0                  1.00000 | 0.3054    1.00000 | 0.0972        1.00000 | 1.0543    1.00000 | 0.0         
[Conv] /model.4/m/m.1/cv1/conv/Conv_output_0       1.00000 | 0.8030    1.00000 | 0.2504
[exSwish] /model.4/m/m.1/cv1/act/Mul_output_0      1.00000 | 0.3734    1.00000 | 0.0868        0.99999 | 1.2211    1.00000 | 0.2090      
[Conv] /model.4/m/m.1/cv2/conv/Conv_output_0       1.00000 | 0.9418    1.00000 | 0.2373
[exSwish] /model.4/m/m.1/cv2/act/Mul_output_0      1.00000 | 0.6107    1.00000 | 0.1805        0.99999 | 1.9182    1.00000 | 0.4279      
[Add] /model.4/m/m.1/Add_output_0                  1.00000 | 0.7068    1.00000 | 0.2072        1.00000 | 2.2366    1.00000 | 0.0         
[Conv] /model.4/cv2/conv/Conv_output_0             1.00000 | 0.9380    1.00000 | 0.2877
[exSwish] /model.4/cv2/act/Mul_output_0            1.00000 | 0.6277    1.00000 | 0.1712        0.99999 | 2.7682    1.00000 | 0.4022      
[Concat] /model.4/Concat_output_0                  1.00000 | 0.9453    1.00000 | 0.1837        0.99999 | 3.5589    1.00000 | 0.0         
[Conv] /model.4/cv3/conv/Conv_output_0             1.00000 | 1.0833    1.00000 | 0.3435
[exSwish] /model.4/cv3/act/Mul_output_0            1.00000 | 0.4307    1.00000 | 0.1062        0.99999 | 1.6902    1.00000 | 0.2590      
[Conv] /model.5/conv/Conv_output_0                 1.00000 | 0.7330    1.00000 | 0.1894
[exSwish] /model.5/act/Mul_output_0                1.00000 | 0.3233    1.00000 | 0.0738        0.99998 | 1.6541    1.00000 | 0.2039      
[Conv] /model.6/cv1/conv/Conv_output_0             1.00000 | 0.2583    1.00000 | 0.0809
[exSwish] /model.6/cv1/act/Mul_output_0            1.00000 | 0.1132    1.00000 | 0.0361        0.99999 | 0.5547    1.00000 | 0.0952      
[Conv] /model.6/m/m.0/cv1/conv/Conv_output_0       1.00000 | 0.5540    1.00000 | 0.1694
[exSwish] /model.6/m/m.0/cv1/act/Mul_output_0      1.00000 | 0.2677    1.00000 | 0.0821        0.99999 | 1.2722    1.00000 | 0.1834      
[Conv] /model.6/m/m.0/cv2/conv/Conv_output_0       1.00000 | 0.3577    1.00000 | 0.1134
[exSwish] /model.6/m/m.0/cv2/act/Mul_output_0      1.00000 | 0.1338    1.00000 | 0.0329        0.99998 | 0.6488    1.00000 | 0.0815      
[Add] /model.6/m/m.0/Add_output_0                  1.00000 | 0.1792    1.00000 | 0.0528        0.99999 | 0.8114    1.00000 | 0.0         
[Conv] /model.6/m/m.1/cv1/conv/Conv_output_0       1.00000 | 0.5771    1.00000 | 0.1766
[exSwish] /model.6/m/m.1/cv1/act/Mul_output_0      1.00000 | 0.2299    1.00000 | 0.0546        0.99998 | 1.0036    1.00000 | 0.1380      
[Conv] /model.6/m/m.1/cv2/conv/Conv_output_0       1.00000 | 0.4603    1.00000 | 0.1238
[exSwish] /model.6/m/m.1/cv2/act/Mul_output_0      1.00000 | 0.2284    1.00000 | 0.0528        0.99998 | 1.0113    1.00000 | 0.1356      
[Add] /model.6/m/m.1/Add_output_0                  1.00000 | 0.2980    1.00000 | 0.0734        0.99999 | 1.3001    1.00000 | 0.0         
[Conv] /model.6/m/m.2/cv1/conv/Conv_output_0       1.00000 | 0.5523    1.00000 | 0.1619
[exSwish] /model.6/m/m.2/cv1/act/Mul_output_0      1.00000 | 0.2044    1.00000 | 0.0524        0.99999 | 0.8724    1.00000 | 0.1243      
[Conv] /model.6/m/m.2/cv2/conv/Conv_output_0       1.00000 | 0.6084    1.00000 | 0.1569
[exSwish] /model.6/m/m.2/cv2/act/Mul_output_0      1.00000 | 0.3599    1.00000 | 0.0859        0.99998 | 1.5912    1.00000 | 0.2535      
[Add] /model.6/m/m.2/Add_output_0                  1.00000 | 0.4789    1.00000 | 0.1097        0.99998 | 2.1075    1.00000 | 0.0         
[Conv] /model.6/cv2/conv/Conv_output_0             1.00000 | 0.6856    1.00000 | 0.1651
[exSwish] /model.6/cv2/act/Mul_output_0            1.00000 | 0.4227    1.00000 | 0.0984        0.99997 | 2.6074    1.00000 | 0.2393      
[Concat] /model.6/Concat_output_0                  1.00000 | 0.6388    1.00000 | 0.0984        0.99998 | 3.3526    1.00000 | 0.0         
[Conv] /model.6/cv3/conv/Conv_output_0             1.00000 | 0.8111    1.00000 | 0.2263
[exSwish] /model.6/cv3/act/Mul_output_0            1.00000 | 0.3316    1.00000 | 0.0756        0.99997 | 1.9800    1.00000 | 0.1875      
[Conv] /model.7/conv/Conv_output_0                 1.00000 | 0.5560    1.00000 | 0.1404
[exSwish] /model.7/act/Mul_output_0                1.00000 | 0.2298    1.00000 | 0.0507        0.99994 | 1.8461    1.00000 | 0.1214      
[Conv] /model.8/cv1/conv/Conv_output_0             1.00000 | 0.2673    1.00000 | 0.0892
[exSwish] /model.8/cv1/act/Mul_output_0            1.00000 | 0.0746    1.00000 | 0.0210        0.99997 | 0.6137    1.00000 | 0.0458      
[Conv] /model.8/m/m.0/cv1/conv/Conv_output_0       1.00000 | 0.4261    1.00000 | 0.1309
[exSwish] /model.8/m/m.0/cv1/act/Mul_output_0      1.00000 | 0.2042    1.00000 | 0.0467        0.99995 | 1.5758    1.00000 | 0.1170      
[Conv] /model.8/m/m.0/cv2/conv/Conv_output_0       1.00000 | 0.5180    1.00000 | 0.1237
[exSwish] /model.8/m/m.0/cv2/act/Mul_output_0      1.00000 | 0.2913    1.00000 | 0.0669        0.99994 | 2.5182    1.00000 | 0.1759      
[Add] /model.8/m/m.0/Add_output_0                  1.00000 | 0.3065    1.00000 | 0.0670        0.99993 | 2.6031    1.00000 | 0.0         
[Conv] /model.8/cv2/conv/Conv_output_0             1.00000 | 0.4653    1.00000 | 0.1089
[exSwish] /model.8/cv2/act/Mul_output_0            1.00000 | 0.2483    1.00000 | 0.0540        0.99993 | 2.1787    1.00000 | 0.1265      
[Concat] /model.8/Concat_output_0                  1.00000 | 0.3945    1.00000 | 0.0571        0.99993 | 3.3945    1.00000 | 0.0         
[Conv] /model.8/cv3/conv/Conv_output_0             1.00000 | 0.6968    1.00000 | 0.1682
[exSwish] /model.8/cv3/act/Mul_output_0            1.00000 | 0.3261    1.00000 | 0.0670        0.99991 | 2.8786    1.00000 | 0.1646      
[Conv] /model.9/cv1/conv/Conv_output_0             1.00000 | 0.3497    1.00000 | 0.0960
[exSwish] /model.9/cv1/act/Mul_output_0            1.00000 | 0.3225    1.00000 | 0.1103        0.99997 | 2.6514    1.00000 | 0.2577      
[MaxPool] /model.9/m/MaxPool_output_0              1.00000 | 0.4414    1.00000 | 0.1297        0.99999 | 3.4128    1.00000 | 0.0         
[MaxPool] /model.9/m_1/MaxPool_output_0            1.00000 | 0.4784    1.00000 | 0.1560        0.99999 | 3.6140    1.00000 | 0.0         
[MaxPool] /model.9/m_2/MaxPool_output_0            1.00000 | 0.4955    1.00000 | 0.1723        0.99999 | 3.6602    1.00000 | 0.0         
[Concat] /model.9/Concat_output_0                  1.00000 | 0.8794    1.00000 | 0.2754        0.99999 | 6.7183    1.00000 | 0.0         
[Conv] /model.9/cv2/conv/Conv_output_0             1.00000 | 0.6059    1.00000 | 0.2436
[exSwish] /model.9/cv2/act/Mul_output_0            1.00000 | 0.1914    1.00000 | 0.0398        0.99992 | 1.6811    1.00000 | 0.0925      
[Conv] /model.10/conv/Conv_output_0                1.00000 | 0.4219    1.00000 | 0.1043
[exSwish] /model.10/act/Mul_output_0               1.00000 | 0.1706    1.00000 | 0.0341        0.99991 | 1.4851    1.00000 | 0.0814      
[Resize] /model.11/Resize_output_0                 1.00000 | 0.3412    1.00000 | 0.0475        0.99991 | 2.9703    1.00000 | 0.0         
[Concat] /model.12/Concat_output_0                 1.00000 | 0.4758    1.00000 | 0.0701        0.99994 | 3.5698    1.00000 | 0.0         
[Conv] /model.13/cv1/conv/Conv_output_0            1.00000 | 0.4899    1.00000 | 0.1309
[exSwish] /model.13/cv1/act/Mul_output_0           1.00000 | 0.1896    1.00000 | 0.0595        0.99997 | 1.3748    1.00000 | 0.1441      
[Conv] /model.13/m/m.0/cv1/conv/Conv_output_0      1.00000 | 0.5131    1.00000 | 0.1695
[exSwish] /model.13/m/m.0/cv1/act/Mul_output_0     1.00000 | 0.2320    1.00000 | 0.0631        0.99996 | 1.6936    1.00000 | 0.1499      
[Conv] /model.13/m/m.0/cv2/conv/Conv_output_0      1.00000 | 0.5625    1.00000 | 0.1369
[exSwish] /model.13/m/m.0/cv2/act/Mul_output_0     1.00000 | 0.2800    1.00000 | 0.0610        0.99993 | 2.2030    1.00000 | 0.1513      
[Conv] /model.13/cv2/conv/Conv_output_0            1.00000 | 0.5349    1.00000 | 0.1256
[exSwish] /model.13/cv2/act/Mul_output_0           1.00000 | 0.2531    1.00000 | 0.0567        0.99994 | 1.9512    1.00000 | 0.1389      
[Concat] /model.13/Concat_output_0                 1.00000 | 0.3775    1.00000 | 0.0559        0.99994 | 2.9429    1.00000 | 0.0         
[Conv] /model.13/cv3/conv/Conv_output_0            1.00000 | 0.8536    1.00000 | 0.1991
[exSwish] /model.13/cv3/act/Mul_output_0           1.00000 | 0.4066    1.00000 | 0.0833        0.99992 | 3.2888    1.00000 | 0.2034      
[Conv] /model.14/conv/Conv_output_0                1.00000 | 0.4692    1.00000 | 0.1020
[exSwish] /model.14/act/Mul_output_0               1.00000 | 0.2757    1.00000 | 0.0680        0.99994 | 2.3128    1.00000 | 0.1682      
[Resize] /model.15/Resize_output_0                 1.00000 | 0.5514    1.00000 | 0.0886        0.99994 | 4.6257    1.00000 | 0.0         
[Concat] /model.16/Concat_output_0                 1.00000 | 0.6997    1.00000 | 0.1145        0.99996 | 4.9248    1.00000 | 0.0         
[Conv] /model.17/cv1/conv/Conv_output_0            1.00000 | 0.5299    1.00000 | 0.1643
[exSwish] /model.17/cv1/act/Mul_output_0           1.00000 | 0.2125    1.00000 | 0.0811        0.99998 | 1.4724    1.00000 | 0.1977      
[Conv] /model.17/m/m.0/cv1/conv/Conv_output_0      1.00000 | 0.5593    1.00000 | 0.2251
[exSwish] /model.17/m/m.0/cv1/act/Mul_output_0     1.00000 | 0.2300    1.00000 | 0.0934        0.99998 | 1.5433    1.00000 | 0.2135      
[Conv] /model.17/m/m.0/cv2/conv/Conv_output_0      1.00000 | 0.8545    1.00000 | 0.2604
[exSwish] /model.17/m/m.0/cv2/act/Mul_output_0     1.00000 | 0.5550    1.00000 | 0.1444        0.99996 | 3.8579    1.00000 | 0.3635      
[Conv] /model.17/cv2/conv/Conv_output_0            1.00000 | 0.5023    1.00000 | 0.1378
[exSwish] /model.17/cv2/act/Mul_output_0           1.00000 | 0.3705    1.00000 | 0.1101        0.99995 | 3.3247    1.00000 | 0.2729      
[Concat] /model.17/Concat_output_0                 1.00000 | 0.6673    1.00000 | 0.1171        0.99996 | 5.0929    1.00000 | 0.0         
[Conv] /model.17/cv3/conv/Conv_output_0            1.00000 | 2.2759    1.00000 | 0.5792
[exSwish] /model.17/cv3/act/Mul_output_0           1.00000 | 1.4949    1.00000 | 0.3410        0.99994 | 11.339    1.00000 | 1.0927      
[Conv] /model.18/conv/Conv_output_0                1.00000 | 0.7697    1.00000 | 0.1473
[exSwish] /model.18/act/Mul_output_0               1.00000 | 0.4751    1.00000 | 0.0723        0.99986 | 3.8054    1.00000 | 0.1805      
[Concat] /model.19/Concat_output_0                 1.00000 | 0.5493    1.00000 | 0.0650        0.99990 | 4.4532    1.00000 | 0.0         
[Conv] /model.20/cv1/conv/Conv_output_0            1.00000 | 0.7051    1.00000 | 0.1141
[exSwish] /model.20/cv1/act/Mul_output_0           1.00000 | 0.3838    1.00000 | 0.0557        0.99984 | 3.1648    1.00000 | 0.1376      
[Conv] /model.20/m/m.0/cv1/conv/Conv_output_0      1.00000 | 0.6564    1.00000 | 0.1113
[exSwish] /model.20/m/m.0/cv1/act/Mul_output_0     1.00000 | 0.3043    1.00000 | 0.0461        0.99986 | 2.5376    1.00000 | 0.1101      
[Conv] /model.20/m/m.0/cv2/conv/Conv_output_0      1.00000 | 0.9716    1.00000 | 0.1299
[exSwish] /model.20/m/m.0/cv2/act/Mul_output_0     1.00000 | 0.6093    1.00000 | 0.0851        0.99981 | 5.2704    1.00000 | 0.2502      
[Conv] /model.20/cv2/conv/Conv_output_0            1.00000 | 0.5102    1.00000 | 0.0828
[exSwish] /model.20/cv2/act/Mul_output_0           1.00000 | 0.3326    1.00000 | 0.0576        0.99985 | 3.2475    1.00000 | 0.1576      
[Concat] /model.20/Concat_output_0                 1.00000 | 0.6942    1.00000 | 0.0676        0.99982 | 6.1906    1.00000 | 0.0         
[Conv] /model.20/cv3/conv/Conv_output_0            1.00000 | 1.5858    1.00000 | 0.2464
[exSwish] /model.20/cv3/act/Mul_output_0           1.00000 | 0.9009    1.00000 | 0.1510        0.99986 | 8.1236    1.00000 | 0.5544      
[Conv] /model.21/conv/Conv_output_0                1.00000 | 0.6364    1.00000 | 0.0815
[exSwish] /model.21/act/Mul_output_0               1.00000 | 0.3965    1.00000 | 0.0494        0.99974 | 3.5894    1.00000 | 0.1256      
[Concat] /model.22/Concat_output_0                 1.00000 | 0.4317    1.00000 | 0.0406        0.99980 | 3.8845    1.00000 | 0.0         
[Conv] /model.23/cv1/conv/Conv_output_0            1.00000 | 0.6592    1.00000 | 0.0848
[exSwish] /model.23/cv1/act/Mul_output_0           1.00000 | 0.3573    1.00000 | 0.0428        0.99974 | 3.1069    1.00000 | 0.1259      
[Conv] /model.23/m/m.0/cv1/conv/Conv_output_0      1.00000 | 0.7364    1.00000 | 0.0879
[exSwish] /model.23/m/m.0/cv1/act/Mul_output_0     0.99999 | 0.4095    1.00000 | 0.0404        0.99964 | 3.4908    1.00000 | 0.1387      
[Conv] /model.23/m/m.0/cv2/conv/Conv_output_0      1.00000 | 0.8693    1.00000 | 0.0855
[exSwish] /model.23/m/m.0/cv2/act/Mul_output_0     1.00000 | 0.5731    1.00000 | 0.0634        0.99970 | 4.9156    1.00000 | 0.1682      
[Conv] /model.23/cv2/conv/Conv_output_0            1.00000 | 0.5111    1.00000 | 0.0647
[exSwish] /model.23/cv2/act/Mul_output_0           1.00000 | 0.3930    1.00000 | 0.0593        0.99980 | 3.8416    1.00000 | 0.1424      
[Concat] /model.23/Concat_output_0                 1.00000 | 0.6949    1.00000 | 0.0571        0.99975 | 6.2387    1.00000 | 0.0         
[Conv] /model.23/cv3/conv/Conv_output_0            1.00000 | 0.9849    1.00000 | 0.1291
[exSwish] /model.23/cv3/act/Mul_output_0           1.00000 | 0.4929    1.00000 | 0.0801        0.99983 | 4.6763    1.00000 | 0.3220      
[Conv] /model.24/m.2/Conv_output_0                 1.00000 | 0.2135    1.00000 | 0.1117
[Sigmoid] output1                                  1.00000 | 0.0289    1.00000 | 0.0074        0.99999 | 0.2542    1.00000 | 0.0921      
[Conv] /model.24/m.1/Conv_output_0                 1.00000 | 0.4433    1.00000 | 0.2245
[Sigmoid] output2                                  1.00000 | 0.0470    1.00000 | 0.0160        0.99999 | 0.4188    1.00000 | 0.1851      
[Conv] /model.24/m.0/Conv_output_0                 1.00000 | 0.9307    1.00000 | 0.5451
[Sigmoid] output3                                  1.00000 | 0.0790    1.00000 | 0.0359        1.00000 | 0.6391    1.00000 | 0.3722      
I The error analysis results save to: ./snapshot/error_analysis.txt
W accuracy_analysis: The mapping of layer_name & file_name save to: ./snapshot/map_name_to_file.txt
done
--> Init runtime environment
adb: unable to connect for root: closed
I target set by user is: rk3588
I Get hardware info: target_platform = rk3588, os = Linux, aarch = aarch64
I Check RK3588 board npu runtime version
I Starting ntp or adb, target is RK3588
I Start adb...
I Connect to Device success!
I NPUTransfer: Starting NPU Transfer Client, Transfer version 2.1.0 (b5861e7@2020-11-23T11:50:36)
D RKNNAPI: ==============================================
D RKNNAPI: RKNN VERSION:
D RKNNAPI:   API: 2.0.0b0 (18eacd0 build@2024-03-22T06:07:59)
D RKNNAPI:   DRV: rknn_server: 2.0.0b0 (18eacd0 build@2024-03-22T14:07:19)
D RKNNAPI:   DRV: rknnrt: 2.0.0b0 (35a6907d79@2024-03-24T10:31:14)
D RKNNAPI: ==============================================
D RKNNAPI: Input tensors:
D RKNNAPI:   index=0, name=images, n_dims=4, dims=[1, 640, 640, 3], n_elems=1228800, size=2457600, w_stride = 0, size_with_stride = 0, fmt=NHWC, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI: Output tensors:
D RKNNAPI:   index=0, name=output1, n_dims=4, dims=[1, 18, 20, 20], n_elems=7200, size=14400, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI:   index=1, name=output2, n_dims=4, dims=[1, 18, 40, 40], n_elems=28800, size=57600, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
D RKNNAPI:   index=2, name=output3, n_dims=4, dims=[1, 18, 80, 80], n_elems=115200, size=230400, w_stride = 0, size_with_stride = 0, fmt=NCHW, type=FP16, qnt_type=NONE, zp=0, scale=1.000000
done

我的yolov5s模型是在自制训练集上训练得到的,在pytorch的fp32推理精度是81.8,fp16的推理精度是77.5,我翻阅了官方文档,没有找到解释这个现象的原因。所以fp16相较于fp32在rk3588上通过adb连板推理掉了4个点这个现象正常吗?

blueWatermelonFri avatar Jul 15 '24 09:07 blueWatermelonFri

我在另一个模型上遇到了类似问题。你这个问题解决了吗?

zhengyb avatar May 30 '25 15:05 zhengyb