blueoil
blueoil copied to clipboard
Tell device environment information to generate_project.py in order to optimization and simplicity
For example, memory layout of weights of kernels will be one of three layouts: OHWI
(Arm w/ NEON, , x86 w/ AVX2), HWOI
(generic), ChHWBCl
(w/ FPGA).
Since converter doesn't know what layouts will be used, it generates and holds all layouts of weights.
Because of that, converter has ad-hoc implementation and make static-analysis more difficult.
Also, that prevents some optimizations (such as omit memory copy, memory buffer reusing).