Paul Fultz II
When compiling a gemm that uses splitk for MLIR, split out the pointwise operators and generate their code object using our regular pointwise code generation.
For the detailed perf report we can print the types as abbreviated types so we can attach to each tensor size:
* double_type -> f64
* float_type -> f32
* ...
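A rough sketch of how the abbreviation could work, in Python for brevity. Only the two mappings listed above come from the issue; the helper names `abbreviate_type` and `format_tensor` and the `f32[64, 128]` output layout are assumptions made for illustration, not the project's actual API or report format.

```python
# Abbreviations taken from the issue text; other types are left unmapped on purpose.
ABBREVIATIONS = {
    "double_type": "f64",
    "float_type": "f32",
}


def abbreviate_type(type_name):
    """Return the short form of a type name, or the name unchanged if unknown."""
    return ABBREVIATIONS.get(type_name, type_name)


def format_tensor(type_name, lens):
    """Attach the abbreviated type to a tensor size (hypothetical layout)."""
    dims = ", ".join(str(d) for d in lens)
    return f"{abbreviate_type(type_name)}[{dims}]"


print(format_tensor("float_type", [64, 128]))
```

Falling back to the full type name for anything not in the table keeps the report readable even before every type has an agreed abbreviation.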
We want to be able to insert a `copy` operator for some concat cases where:
* Almost all the inputs are a precompile_op
* Only 1 copy is needed

To do...
The test requirements.txt is not updated.