Carlos Mocholí
Do you see any specific differences in the modeling?
It's not clear how to make this more robust. Perhaps the best way is to drop support for `python litgpt/finetune/lora.py`, since we no longer advertise it.
If you use `ddp_fork`, your model will not be sharded. If you want sharding, I suggest giving up on using a notebook for training, as notebooks cannot support FSDP....
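Something along these lines should work once you move to a script (a rough sketch assuming PyTorch Lightning 2.x; `MyModel`/`MyDataModule` are placeholders for your own classes):

```python
import lightning as L

from my_project import MyModel, MyDataModule  # placeholders for your own code


def main():
    # "fsdp" shards the model parameters across the devices, unlike ddp_fork
    trainer = L.Trainer(accelerator="gpu", devices=4, strategy="fsdp")
    trainer.fit(MyModel(), datamodule=MyDataModule())


if __name__ == "__main__":
    # run with `python train.py`, not from a notebook
    main()
```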
I'm sorry, but I have no idea what you are talking about :(
@awaelchli Improved the error in #1241. Still, we could set a default fraction.
Looking at the piece of code that checks satisfiability (https://github.com/tianhaoz95/check-group/blob/df61154e69ffd9d54a43207781839ce24a6867db/src/utils/satisfy_expected_checks.ts#L33), I believe this is not currently supported, as there's no regex lookup.
You can do `Trainer(reload_dataloaders_every_n_epochs=1)` to accomplish this.
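For example (a minimal sketch; the flag is the actual Trainer argument, while `MyModel`/`MyDataModule` stand in for your own classes):

```python
import lightning as L

from my_project import MyModel, MyDataModule  # placeholders for your own code

# with n=1, the train dataloader is rebuilt at the start of every epoch
trainer = L.Trainer(reload_dataloaders_every_n_epochs=1)
trainer.fit(MyModel(), datamodule=MyDataModule())
```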
This is "working as expected" given the current design of `setup_data`, which doesn't run if the data is already set up and the trainer flag is not configured; see this early...
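Roughly, the guard has this shape (illustrative only; the attribute names below are made up and not the actual implementation):

```python
def setup_data(self):
    # skip if the data was already prepared and no reload was requested
    if self._data_is_setup and self.trainer.reload_dataloaders_every_n_epochs == 0:
        return
    self._data_is_setup = True
    ...  # build the datasets / dataloaders
```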
I also have this problem in https://github.com/Lightning-AI/lightning/pull/15043
My hunch here is that pre-2.0 it was using 16-bit precision by default and 2.0 is using 32-bit. You should be able to verify this by manually setting...
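Something like this would pin the precision explicitly in both runs so you can compare (a sketch assuming Lightning 2.x; `MyModel` is a placeholder for the model in question):

```python
import lightning as L

from my_project import MyModel  # placeholder for your own code

# run once with mixed 16-bit and once with full 32-bit precision, then compare
for precision in ("16-mixed", "32-true"):
    trainer = L.Trainer(precision=precision, max_epochs=1)
    trainer.fit(MyModel())
```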