Paul Fultz II

Results 130 issues of Paul Fultz II

## Motivation Add initial unit tests for jit gpu kernels as we dont have any unit tests. This should cover the tests for #4194. ## Technical Details This adds a...

roadmap
Continous Integration

## Motivation MLIR doesnt support fp32 GEG fusion on navi. ## Technical Details Disable GEG fusion for fp32, and enable GEG in jenkins CI. ## Changelog Category - - [...

Implement flash decoding as described here: https://pytorch.org/blog/flash-decoding/ We have attention operators grouped like this: ``` Q -> [B, M, k] K -> [B, k, N] V -> [B, N, D]...

The following fails verification: ```python p = migraphx.program() m = p.get_main_module() p_x = m.add_parameter("x", migraphx.shape(type="float_type", lens=[1, 1, 2])) x_1 = m.add_instruction(migraphx.op("pooling", padding=[1,2], stride=[1], lengths=[3], dilations=[1], count_include_pad=True), [p_x]) # migraphx.shape(type="float_type", lens=[1,...

This upgrades the version to c++23 for our source code and for kernels. We still cannot use newer std library features because sles uses gcc 7. - Update `any_cast` of...

enhancement