Jonatan Kłosko
For reference, [here](https://huggingface.co/docs/transformers/main/en/quantization/overview) is a whole table of the different quantization libraries/techniques/formats that hf/transformers supports. Axon implements a specific quantization method. I believe the idea is that we could use...
This probably belongs more to Axon than Bumblebee, since we need a way to store `%Axon.ModelState{}`. For the model itself, maybe there should be a way to quantize the model...
Oh, I missed `quantize_model`! For the model state you can actually do `Nx.serialize(model_state)`. So it would be this:

```elixir
# Serialize
File.write!("state.nx", Nx.serialize(model_info.params))

# Load
{:ok, spec} = Bumblebee.load_spec({:hf, "..."})
...
```
@tubedude unfortunately it doesn't fit into the usual logits processing approach. We generate the transcription token-by-token, and logits processing applies a transformation to the logits at each iteration. My understanding is...
@kevinschweikert thanks for the PR! Dropping the ffmpeg dependency would be great, but yeah, I agree that we need xav to be precompiled for this to be beneficial.
Sounds good to me!
I've just realised that it's not just about precompilation; the main blocker is that `xav` still requires ffmpeg to be installed, so at the moment there is no benefit really...
@josevalim we already have an API for changing the editor intellisense node as of #390! Extending the field to accept a variable sounds good to me. We probably should make it...
Just a quick note that one way we could track all EXLA buffers would be to have a static global list of pointers. Whenever an EXLA buffer is created we...
> the list may grow long and deleting becomes expensive

Actually, if we store the iterator of the inserted list element inside the EXLA buffer, we should be able to...
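To illustrate the idea, here is a minimal C++ sketch of the iterator-in-the-buffer trick. All names (`Buffer`, `g_live_buffers`) are illustrative, not EXLA's actual internals: each buffer remembers its own node in a global `std::list` registry, so removing it on destruction is O(1) rather than a linear scan. This relies on `std::list` iterators staying valid until their element is erased.

```cpp
#include <list>

// Hypothetical buffer type; in EXLA this would be the native buffer resource.
struct Buffer;

// Global registry of all live buffers.
static std::list<Buffer*> g_live_buffers;

struct Buffer {
  // Iterator pointing at this buffer's node in the registry.
  std::list<Buffer*>::iterator registry_it;

  Buffer() {
    // Register on creation and remember our position in the list.
    registry_it = g_live_buffers.insert(g_live_buffers.end(), this);
  }

  ~Buffer() {
    // std::list iterators remain valid until their element is erased,
    // so deregistration is constant time regardless of list length.
    g_live_buffers.erase(registry_it);
  }
};
```

In the real implementation the registry would also need a mutex, since buffers can be created and destroyed from multiple scheduler threads.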