Ondřej Čertík

Results 365 issues of Ondřej Čertík

With the following definitions:

```python
from lpython import dataclass, i64

@dataclass
class TransactionItem:
    account_idx: i64
    commodity: str
    amount: i64

@dataclass
class Transaction:
    date: str
    name: str
    cleared: bool
    comments: str
```

...

We should expose the Assert ASR node to the frontend, by introducing an assert statement and/or by using the existing syntax `call assert(x == 5)`, with the appropriate `assert` subroutine...

LFortran (as of today) can compile 6 out of 18 Fortran packages in SciPy: https://github.com/lfortran/lfortran/pull/2743, and we keep working on compiling the rest. At the same time, I think it's...

Currently only "greedy" sampling is implemented (the token with the highest probability is selected). Implement other sampling methods; some options are:

* top-p
* top-k
* temperature (here is...
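The sampling options named above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not the project's implementation: the function names (`softmax`, `top_k_filter`, `top_p_filter`, `sample`, `greedy`) and the example logits are hypothetical, and the logits are assumed to be a plain list of floats, one per vocabulary token.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn logits into probabilities; temperature < 1 sharpens, > 1 flattens."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy(logits):
    """Greedy sampling: index of the highest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])


def top_k_filter(probs, k):
    """Keep the k most probable tokens and renormalize."""
    keep = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    total = sum(probs[i] for i in keep)
    return {i: probs[i] / total for i in keep}


def top_p_filter(probs, p):
    """Keep the smallest set of top tokens whose cumulative mass reaches p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, mass = [], 0.0
    for i in order:
        keep.append(i)
        mass += probs[i]
        if mass >= p:
            break
    return {i: probs[i] / mass for i in keep}


def sample(dist, rng=random):
    """Draw one token index from a {index: probability} mapping."""
    tokens = list(dist)
    weights = [dist[t] for t in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


# Hypothetical logits for a 4-token vocabulary.
logits = [2.0, 1.0, 0.5, -1.0]
probs = softmax(logits, temperature=0.8)
token = sample(top_p_filter(probs, 0.9))
```

The filters return a mapping from token index to renormalized probability, so top-k and top-p can be swapped without changing `sample`. Temperature is applied before filtering, which is one common ordering, not the only one.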

TODO:

* [ ] Finish it to work correctly for any MPI rank
* [ ] Write a test for the result, ensuring it is correct
* [ ] Add...

So far we have only benchmarked against PyTorch+OpenBLAS. We should also benchmark against PyTorch+Accelerate. Here are a few ways to do it:

* https://towardsdatascience.com/installing-pytorch-on-apple-m1-chip-with-gpu-acceleration-3351dc44d67c
* https://twitter.com/isuru_f/status/1636013111592329216 (`conda install "libblas=*=*_accelerate"`)

This approximates the erf() function directly. To use it, apply the following patch:

```diff
--- a/gpt2.f90
+++ b/gpt2.f90
@@ -109,7 +109,7 @@
 real(sp), intent(in) :: x(:,:), fc_w(:,:), fc_b(:), proj_w(:,:), proj_b(:)
```

...

Batching means that multiple input streams are computed at the same time, which can vectorize better and thus speed up inference per token.
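As a rough illustration of the idea, the sketch below stacks several input vectors as columns of one matrix, so that a single matrix-matrix product replaces one matrix-vector product per stream. The function names and the tiny weight matrix are hypothetical. Pure Python does not vectorize, so this only shows the data layout; any speedup would come from running the same product through an optimized matrix-matrix kernel.

```python
def matvec(w, x):
    """One stream: multiply weight matrix w (rows x cols) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def matmul(a, b):
    """Plain matrix-matrix product of two list-of-lists matrices."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def batched_forward(w, streams):
    """Apply w to all streams at once.

    Each stream becomes one column of X, so Y = W X holds every
    stream's output as a column of Y.
    """
    x = [list(col) for col in zip(*streams)]  # shape: (n_in, batch)
    y = matmul(w, x)                          # shape: (n_out, batch)
    return [list(col) for col in zip(*y)]     # back to one output per stream


# Hypothetical weights (3 outputs, 2 inputs) and three input streams.
w = [[1, 2], [3, 4], [5, 6]]
streams = [[1, 0], [0, 1], [2, 3]]

batched = batched_forward(w, streams)
one_by_one = [matvec(w, s) for s in streams]
```

Both paths produce the same outputs; the batched form simply exposes more work per call to the underlying linear algebra routine.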