composer
                                
                                
                                
                                    composer copied to clipboard
                            
                            
                            
                        Supercharge Your Model Training
** Environment ** composer version 0.26.0 torch 2.4.0 ** To reproduce Steps to reproduce the behavior: $ git clone https://github.com/maxrousseau/rafale.git $ cd rafale $ uv venv $ . .venv/bin/activate $...
# What does this PR do? Transformers recently added in `mean_resizing` to `resize_token_embeddings`. This is breaking with mixed initialization in downstream training tasks that requires adding tokens to Composer Huggingface...
# What does this PR do? The time units didn't have units, this makes the units explicit and adds a new secs_per_step metric.