redbrain

58 comments by redbrain

This results in `NotImplementedError: Failed to find assignment for logical_axis_index 1 of size 64 with remaining assignable mesh [4, 4, 8].` Any clue what went wrong?

It appears that since a v4-256 has half the chips of a v4-512, the appropriate mesh topology would be `-1,32,1`. But running it with that mesh and with batch sizes...
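For context on how a `-1` mesh axis is usually resolved: a minimal sketch, assuming the common convention that a single `-1` is inferred so the product of the axes equals the device count (`resolve_mesh` is a hypothetical helper for illustration, not EasyLM's actual code). A v4-256 slice exposes 128 chips (half of a v4-512's 256), matching the `[4, 4, 8]` mesh in the error above:

```python
import math

def resolve_mesh(mesh_dims, n_devices):
    """Resolve a single -1 placeholder in a mesh spec (hypothetical helper,
    mimicking how a flag like --mesh_dim='-1,32,1' is typically interpreted)."""
    known = math.prod(d for d in mesh_dims if d != -1)
    if -1 not in mesh_dims:
        if known != n_devices:
            raise ValueError(f"mesh {mesh_dims} does not cover {n_devices} devices")
        return list(mesh_dims)
    inferred, rem = divmod(n_devices, known)
    if rem:
        raise ValueError(f"{n_devices} devices not divisible by {known}")
    return [inferred if d == -1 else d for d in mesh_dims]

# 128 chips on a v4-256, so '-1,32,1' resolves to a 4 x 32 x 1 mesh:
print(resolve_mesh([-1, 32, 1], 128))  # [4, 32, 1]
```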

Still not working, even with the parameters you suggested for `mesh_dim` and `batch_size`. Full command:

```sh
export LIBTPU_INIT_ARGS='--xla_jf_spmd_threshold_for_windowed_einsum_mib=0 --xla_tpu_spmd_threshold_for_allgather_cse=10000 --xla_tpu_spmd_rewrite_einsum_with_reshape=true --xla_enable_async_all_gather=true --jax_enable_async_collective_offload=true --xla_tpu_enable_latency_hiding_scheduler=true TPU_MEGACORE=MEGACORE_DENSE'
python -m EasyLM.models.llama.llama_train \
  --mesh_dim='!-1,64,1' \
...
```

Same here: https://wzrd.in/standalone/color-convert

I've found that using `upx --best --ultra-brute` on the pkg executable after creation can decrease its size dramatically.
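For concreteness, a usage sketch (assuming `upx` is installed and on `PATH`; `./myapp` is a hypothetical name for the pkg-built binary):

```sh
# Compress a pkg-built executable in place (hypothetical binary name).
# --best tries all standard compression levels; --ultra-brute additionally
# tries slower, more aggressive variants for the smallest result.
upx --best --ultra-brute ./myapp
```

Note that the compressed binary self-decompresses at startup, so launch time increases slightly.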

mrocha, I'm having the same result with my current project. I'm now trying to figure out if pkg can be modified so that the two are compatible. Aside from that,...

https://x.com/elder_plinius/status/1952958577867669892/ Seems like there's a functioning jailbreak; it just hasn't been added to the repo yet?