[SME] Add scalable fp16->fp32 dense schedule

Open lhutton1 opened this issue 1 year ago • 1 comments

This commit extends the SME dense and matmul schedules to support operations with fp16 inputs and an fp32 output, where transpose_a=False and transpose_b=True.
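
To illustrate the computation being scheduled (not the SME schedule itself), here is a minimal pure-Python reference sketch. The helper names `to_fp16`, `to_fp32` and `dense_fp16_fp32` are hypothetical and do not come from this PR. It assumes the weight matrix is stored as N x K, which is what transpose_b=True implies, and it accumulates in Python's double precision before rounding to fp32, so the low-order bits may differ from a true fp32 accumulation.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x):
    """Round a Python float to the nearest IEEE single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def dense_fp16_fp32(a, b):
    """Reference dense: a is M x K, b is N x K (transpose_b=True layout).

    Inputs are rounded to fp16 and the result is returned as fp32 values
    in an M x N list of lists. Hypothetical helper for illustration only.
    """
    a16 = [[to_fp16(v) for v in row] for row in a]
    b16 = [[to_fp16(v) for v in row] for row in b]
    return [
        # Each output element is the dot product of a row of a with a row of b.
        [to_fp32(sum(x * y for x, y in zip(a_row, b_row))) for b_row in b16]
        for a_row in a16
    ]


if __name__ == "__main__":
    data = [[1.0, 2.0], [3.0, 4.0]]    # M x K
    weight = [[5.0, 6.0], [7.0, 8.0]]  # N x K, i.e. already "transposed"
    print(dense_fp16_fp32(data, weight))
```

Because b is indexed by row rather than by column, no explicit transpose is needed in the inner loop.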

For convenience, it also adds a utility, get_vscale_factor, which returns the correct vscale multiplier for a given data type, reflecting ideas from an early design of the SVE RFC.
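
As a rough sketch of the idea behind such a multiplier, and not the implementation added in this PR: a scalable vector is a multiple of 128 bits, so the number of lanes for a data type can be written as (128 / element bits) * vscale. The function `vscale_multiplier` and the `DTYPE_BITS` table below are hypothetical names chosen for illustration; the real utility may take different arguments and return a TVM expression rather than a plain integer.

```python
# Hypothetical illustration only; not the get_vscale_factor from this PR.
MIN_VECTOR_BITS = 128  # smallest scalable vector length, in bits

# Assumed mapping from dtype name to element width in bits.
DTYPE_BITS = {
    "int8": 8,
    "float16": 16,
    "float32": 32,
    "float64": 64,
}


def vscale_multiplier(dtype, min_size=MIN_VECTOR_BITS):
    """Return the constant c such that one vector holds c * vscale lanes."""
    return min_size // DTYPE_BITS[dtype]


if __name__ == "__main__":
    for name in ("float16", "float32"):
        print(name, vscale_multiplier(name))
```

Under these assumptions, fp16 gives a multiplier of 8 and fp32 gives 4, so a schedule that reads fp16 inputs and writes fp32 outputs has to use a different lane count for each.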

~Note: this commit depends on https://github.com/apache/tvm/pull/16921 so also contains the contents of https://github.com/apache/tvm/pull/16921.~

lhutton1 avatar May 08 '24 19:05 lhutton1

cc @ekalda @Anndrey24 @leandron

lhutton1 avatar May 16 '24 08:05 lhutton1

@tvm-bot rerun

lhutton1 avatar May 27 '24 15:05 lhutton1

Thanks @lhutton1 this is merged now!

ekalda avatar May 28 '24 14:05 ekalda