coder(anonymous)

Results 18 comments of coder(anonymous)
trafficstars

Does code size affect the performance of small GEMMs?

Thank you for your previous reply. I would like to ask: is it reasonable to run multiple times and average the performance of small-scale GEMM?

The program uses the blasfeo_pack_smat API to generate segmentation fault. What is the cause of this problem? M is not equal to N.

Is there no overhead in using JIT to generate code? Why can it improve the performance of deep learning training?

Thank you for your answer. I would like to ask if there is any way to insert a hexadecimal code into the generated code? This code has been written by...

Thank you for your answer. Does this mean that I can express it as: top_[size_++]=0xb000020;

How to pass runtime parameters to getCode?