Roelof van Dijk comments

Results 34 comments of


                                            Roelof van Dijk

[READY] perf: view as named tuples

Done as far as I am concerned. Should be an easy ~10% speedup. The next step is caching all shapetracker methods. Work in progress, but looks promising. This branch ```...

[READY] perf: view as named tuples

Reduced number of commits. Ready.

[READY] perf: view as named tuples

LAZYCACHE=0 python3.11 -O test/external/external_test_speed_llama.py ``` Master codegen mean runtime: 119.23ms, runs: 135.19, 111.06, 138.51, 110.68, 108.25, 112.41, 111.19, 143.45, 109.42, 112.13 methodcache mean runtime: 111.93ms, runs: 105.79, 105.06, 105.42, 140.35,...

[READY] perf: view as named tuples

There are some minor performance tweaks included - I can remove those if you want to keep this MR cleaner.

[READY] perf: view as named tuples

The diff was larger because I had removed several methods that were used only once, mainly in the View init. * `filter_strides` * `is_contiguous` * `view_from_shape` This reduced the function...

[READY] perf: view as named tuples

``` LAZYCACHE=0 python3.11 -O test/external/external_test_speed_llama.py Master codegen mean runtime: 119.23ms, runs: 135.19, 111.06, 138.51, 110.68, 108.25, 112.41, 111.19, 143.45, 109.42, 112.13 methodcache mean runtime: 111.93ms, runs: 105.79, 105.06, 105.42, 140.35,...

Roelof van Dijk

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

[READY] perf: view as named tuples

Fix view merging for masked views

ci: cache downloads