Michael Kowalski
Michael Kowalski
fixed by #2315
There is a problem with how `__refitem__` provides `__getitem__` when `InlineArray` is used as an `alias`. Maybe since in this case `self` is immutable in `__refitem__`? A failing test to...
testing.assert_... functions now print the file and line number when there is a failure. See this [commit](https://github.com/modularml/mojo/commit/52ceff7b986d6022fd8c82a49702729b176de0d3). So I think this issue can be closed.
[Parameter closure captures are unsafe references](https://docs.modular.com/mojo/roadmap#parameter-closure-captures-are-unsafe-references).
Could these tests using out of bounds values be added in `test_string.mojo`? There should probably be similar tests in `test_list.mojo`. The new `Slice.adjust()` itself has no tests yet. ``` assert_equal("",...
The shape mismatch errors should be fixed in the models and not in the tests. The `_register_load_state_dict_pre_hook()` logic is not correct. Currently it is only applied in the base DistilBert...
I vectorized ROPE as you have a while back but the networks were much too small to see any impact. I also did micro benchmarks focused on the ROPE function...
I think this will get a performance improvement by removing the parallelize call of the loop over heads in ROPE and replacing with a simple for loop. Setting up threads...
On M1 Pro removing parallelize is better. Not a huge difference in the whole network but clearly better. In isolated benchmarks at the size of the baby llama models parallelizing...
I put the comparisons I did in this [branch](https://github.com/mikowals/llama2.mojo/tree/no-parallelize-rope). The graphs above are done where "V1 is current master and V2 is with the two line change to remove parallelize...