steveepreston comments

Results 27 comments of


                                            steveepreston

Release v44 not available for Mac

on TPU VM env, im getting same error! while installed `bitsandbytes` via `pip install -U bitsandbytes`, it still throws: `ImportError: Using bitsandbytes 4-bit quantization requires the latest version of bitsandbytes:...

[Feature] add `mlflow` metric_logging

@ebsmothers Thanks for note @fabiogeraci Awesome!

RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock. self.pid = os.fork()

this bug still exists in tf 2.16.1

GPU MaxPool gradient ops do not yet have a deterministic XLA implementation

this issue still exist. how to solve it?

[Question] what to do when model doesn't have `tokenizer.model`?

@RdoubleA Thanks for explain, got the case. I list some other random models that doesn't have a `tokenizer.model`: [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3-Base/tree/main) [Qwen/QVQ](https://huggingface.co/Qwen/QVQ-72B-Preview/tree/main) [nvidia/Llama-3.1-Nemotron](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF/tree/main) [openai/gpt2](https://huggingface.co/openai-community/gpt2/tree/main) [mistralai/Mistral-Nemo](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407/tree/main) [CohereForAI/c4ai](https://huggingface.co/CohereForAI/c4ai-command-r7b-12-2024/tree/main) [facebook/opt-125m](https://huggingface.co/facebook/opt-125m/tree/main) I don't have any idea...

steveepreston

Pytorch XLA/PJRT TPU support