grok-1
                                
                        Grok open release
# This PR introduces two key enhancements to the tensor loading process: (Fixes #220) - Improved error handling within ThreadPoolExecutor to provide detailed logs for failures during parallel tensor loading....
# Description The tensor loading process, specifically within ThreadPoolExecutor and regex operations in get_load_path_str, requires enhancements to improve error handling and efficiency. ## Enhanced Error Handling in ThreadPoolExecutor: Current behavior...
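The kind of per-task error logging described above can be sketched as follows. This is a minimal illustration, not the code from the PR: `load_all`, `load_fn`, and `max_workers` are hypothetical names, and the real loader's signature is not shown in the preview.

```python
import logging
from concurrent.futures import ThreadPoolExecutor, as_completed

logger = logging.getLogger(__name__)


def load_all(paths, load_fn, max_workers=4):
    """Run load_fn on every path in parallel (hypothetical helper).

    Returns (results, errors), both keyed by path, so that one failed
    load is logged with its path instead of being lost in the pool.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Remember which path each future belongs to.
        futures = {pool.submit(load_fn, p): p for p in paths}
        for fut in as_completed(futures):
            path = futures[fut]
            try:
                results[path] = fut.result()
            except Exception as exc:  # log and keep loading the rest
                logger.error("Failed to load %s: %r", path, exc)
                errors[path] = exc
    return results, errors
```

Collecting failures per path rather than re-raising the first one is a design choice made for this sketch; a loader that must fail fast could re-raise inside the `except` branch after logging.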
Running `python run.py` on a single NVIDIA GPU fails with `ValueError: Number of devices 1 must equal the product of mesh_shape (1, 8)`. Can the number of devices be...
Running `pip install -r requirements.txt` failed; after replacing it with `cuda12-pip`, the install succeeded.
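The arithmetic behind that error message can be reproduced in a few lines. This is only a sketch of the check the message describes; `check_mesh` is a hypothetical helper, not a function from the repository.

```python
import math


def check_mesh(num_devices, mesh_shape):
    """Raise if the device count differs from the product of mesh_shape."""
    required = math.prod(mesh_shape)  # (1, 8) needs 1 * 8 = 8 devices
    if num_devices != required:
        raise ValueError(
            f"Number of devices {num_devices} must equal the product "
            f"of mesh_shape {tuple(mesh_shape)}"
        )
    return required
```

With a mesh shape of `(1, 8)`, the check passes for 8 devices and raises for 1, which matches the report above.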
The attention formula has an [unusual line](https://github.com/xai-org/grok-1/blob/d6d9447e2d3c9bd81da571dc8681ea60009e4c03/model.py#L865) that puts attention weights through `tanh`. 1. What is the reason for this? Normalization/entropy control? Is there a paper about it? 2. Would...
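For readers unfamiliar with the pattern, here is a small sketch of what passing a score through `tanh` does numerically. It assumes the soft-capping form `cap * tanh(x / cap)`; both that form and the value `30.0` are assumptions made for illustration, not something the text above confirms.

```python
import math


def soft_cap(logit, cap=30.0):
    """Squash a score into (-cap, cap); near zero it is almost the identity.

    The form and the default cap are assumptions for illustration only.
    """
    return cap * math.tanh(logit / cap)


# Small scores pass through nearly unchanged, large ones saturate at the cap.
for x in (0.0, 1.0, 10.0, 100.0, -100.0):
    print(x, soft_cap(x))
```

Under this assumption, the effect is to bound the magnitude of the scores while leaving small values nearly untouched.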
Yeah, I think grok-1 needs an issue template to avoid confusing issues.
I have installed Python 3.10 and venv. Trying to run `pip install -r requirements.txt` gives: ERROR: Ignored the following versions that require a different python version: 1.6.2 Requires-Python >=3.7,=3.7,=3.7,=3.7,
Can someone convert this JAX model to a PyTorch model implemented in transformers?
When I try to download the weights from Hugging Face with `huggingface-cli download xai-org/grok-1 --repo-type model --include ckpt-0/* --local-dir checkpoints --local-dir-use-symlinks False`, I get the error `no matches found: ckpt-0/*`
Hello everyone. I've tried to run the pip install, but I'm facing the following error: ```bash ERROR: Could not find a version that satisfies the requirement jaxlib==0.4.25+cuda12.cudnn89; extra == "cuda12_pip"...