Kaustubh Raste

Results 18 comments of Kaustubh Raste

@fbarchard @allenkelly @bhbruce @keeranroth, If you have time, please help review this PR.

@fbarchard @allenkelly @bhbruce @keeranroth can someone merge the changes?

@fbarchard @allenkelly @bhbruce, If you have time, please help review this PR.

@fbarchard @allenkelly @bhbruce @keeranroth, If you have time, please help review this PR.

@keeranroth I ran the mean-nd-test which uses f32 rsum on qemu and used ms calculated by xnnpack as reference for comparison. From comparison, i found the u4v is better than...

@keeranroth I updated the LMUL=2 code to have 8 accumulators in loop and taken performance again for mean-nd-test. From observations the LMUL=4 still takes less time (even when both have...

Hi, any review comments? If not, can we get this committed please?

@fbarchard @allenkelly @bhbruce @keeranroth, If you have time, please help review this PR