Aleksa Gordić
Adding to what @chinthysl has said: we now also support ZeRO stage 1, where we shard the optimizer states, so each device updates only its own shard of the parameters...
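For anyone curious, here's a minimal sketch of the stage-1 idea, not the repo's actual code: all identifiers and hyperparameters below are made up for illustration, bias correction is omitted for brevity, and `num_params` is assumed divisible by `world_size`. Every rank keeps full params and full grads, owns the AdamW state only for its 1/N slice, updates just that slice, and then all-gathers the updated parameters.

```cuda
#include <cuda_runtime.h>
#include <nccl.h>

__global__ void adamw_shard(float* p, const float* g, float* m, float* v,
                            size_t n, float lr, float beta1, float beta2,
                            float eps, float wd) {
    size_t i = (size_t)blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    m[i] = beta1 * m[i] + (1.0f - beta1) * g[i];            // 1st moment
    v[i] = beta2 * v[i] + (1.0f - beta2) * g[i] * g[i];     // 2nd moment
    p[i] -= lr * (m[i] / (sqrtf(v[i]) + eps) + wd * p[i]);  // AdamW step
}

// one ZeRO-1 optimizer step: every rank holds full params and full grads,
// but owns m/v only for its 1/world_size slice and updates just that slice
void zero1_step(float* params, const float* grads,
                float* m_shard, float* v_shard, size_t num_params,
                int rank, int world_size, float lr,
                ncclComm_t comm, cudaStream_t stream) {
    size_t shard = num_params / world_size;
    size_t off = (size_t)rank * shard;
    int threads = 256;
    int blocks = (int)((shard + threads - 1) / threads);
    adamw_shard<<<blocks, threads, 0, stream>>>(
        params + off, grads + off, m_shard, v_shard, shard,
        lr, 0.9f, 0.95f, 1e-8f, 0.1f);
    // in-place all-gather: each rank contributes its updated slice so that
    // every rank ends up with the complete, identical parameter tensor
    ncclAllGather(params + off, params, shard, ncclFloat, comm, stream);
}
```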
Hey @akulchik, are you still having problems with this?
+1. Edit: I solved this by using Python 3.9; 3.10 was causing issues. Temporary workaround for me.
I assume the two calls are due to the fact that we don't want each thread in the kernel to do stochastic rounding with the same seed. At least that was...
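To make that concrete, here's a sketch of the idea, not the repo's actual kernel: the rounding noise is a pure function of (seed, element index), so different threads never share the same randomness. The hash is illustrative, and edge cases (inf/nan, exponent overflow) are ignored for brevity.

```cuda
#include <cuda_bf16.h>

__device__ unsigned int mix(unsigned int seed, unsigned int idx) {
    unsigned int h = seed ^ (idx * 0x9E3779B9u);  // fold the index into the seed
    h ^= h >> 16; h *= 0x85EBCA6Bu;
    h ^= h >> 13; h *= 0xC2B2AE35u;
    return h ^ (h >> 16);
}

__device__ __nv_bfloat16 stochastic_round(float x, unsigned int seed,
                                          unsigned int idx) {
    unsigned int bits = __float_as_uint(x);
    // add uniform noise over the 16 bits that bf16 drops, then truncate:
    // the carry makes the value round up with probability proportional to
    // how close it was to the next representable bf16 value
    bits += mix(seed, idx) & 0xFFFFu;
    bits &= 0xFFFF0000u;
    return __float2bfloat16_rn(__uint_as_float(bits));
}
```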
The PR is ready and will be merged into the LLaMA 3 fork.
Can you post some (eval) results against FineWeb-Edu?
@ngc92 tnx - added!
Eyeballing your cmdline, I'd say your batch size is too small and is causing an exception in the HellaSwag eval. This is a known issue and we have a patch...
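For context, the constraint is roughly this, a sketch under the assumption that HellaSwag's 4 candidate completions per example get packed along the batch dimension (the names here are made up, not the repo's actual identifiers):

```c
#include <assert.h>

#define ASSUMED_NUM_COMPLETIONS 4

void eval_loader_check(int B) {
    // the batch has to fit at least one full example, i.e. B >= 4;
    // a smaller batch leaves the eval with nothing valid to score
    int can_fit_examples = B / ASSUMED_NUM_COMPLETIONS;
    assert(can_fit_examples >= 1);
}
```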
You would just need to tokenize images, and everything else remains pretty much the same. We don't have multimodal plans for this repo in the near future.
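For a rough idea of what "tokenize images" could mean, here's a ViT-style patchify sketch; nothing in the repo, and all names, the channel-last layout, and the divisibility of H and W by P are assumptions:

```c
#include <stddef.h>

// split an H x W x C (channel-last) image into P x P patches and flatten
// each patch into one vector; a learned linear projection of each vector
// would then produce one "image token" embedding that can be concatenated
// with the text token embeddings before the transformer
void patchify(const float* img, int H, int W, int C, int P, float* patches) {
    int np_w = W / P;  // patches per row
    for (int py = 0; py < H / P; py++) {
        for (int px = 0; px < np_w; px++) {
            float* out = patches + (size_t)(py * np_w + px) * (P * P * C);
            for (int y = 0; y < P; y++)
                for (int x = 0; x < P; x++)
                    for (int c = 0; c < C; c++)
                        out[(y * P + x) * C + c] =
                            img[(size_t)((py * P + y) * W + (px * P + x)) * C + c];
        }
    }
}
```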