Shreyans Dhankhar
@younesbelkada @pacman100 Thanks, it worked, but I do observe a change in the output of the merged model compared to adding the adapters to the original models separately.
@younesbelkada @pacman100 The above-mentioned change is not working. I am getting the following error: AttributeError: 'T5Stack' object has no attribute 'first_device'. Can you check once?
@younesbelkada I still get the same error after adding just setattr(model, 'model_parallel', True)
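A minimal sketch for narrowing down this kind of AttributeError: after setting a flag with setattr, check which of the expected attributes an object actually carries. `DummyStack` and `missing_attributes` are hypothetical names used only for illustration; `DummyStack` is a stand-in object, not the real `T5Stack`, and the sketch does not claim to reproduce the library's internals.

```python
class DummyStack:
    """Hypothetical stand-in for a model submodule; NOT the real T5Stack."""


def missing_attributes(obj, names):
    """Return the names from `names` that `obj` does not have as attributes."""
    return [name for name in names if not hasattr(obj, name)]


stack = DummyStack()

# Setting one flag does not create any other attribute on the object.
setattr(stack, "model_parallel", True)

# Report which of the attributes we care about are still absent.
print(missing_attributes(stack, ["model_parallel", "first_device"]))
```

Running the same check against the actual encoder and decoder submodules, assuming they are reachable on the loaded model, would show whether first_device was ever set on them.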
@younesbelkada Thanks for the fix. I tried running the same and have one observation. When I run the flan-t5-xxl 8-bit fine-tuning on g5.8x.large, it shows 27 hrs to...
Hi @younesbelkada, here is the device_map: {'shared': 0, 'decoder.embed_tokens': 0, 'encoder.embed_tokens': 0, 'encoder.block.0': 0, 'encoder.block.1': 0, 'encoder.block.2': 0, 'encoder.block.3': 0, 'encoder.block.4': 0, 'encoder.block.5': 0, 'encoder.block.6': 0, 'encoder.block.7': 0, 'encoder.block.8': 0,...
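A long device_map like this is easier to read when summarized per device. The sketch below counts how many modules are assigned to each device; the sample dictionary contains only the first few entries quoted above (the full map is truncated in the comment and is not reconstructed here), and `modules_per_device` is a hypothetical helper name.

```python
from collections import Counter

# First few entries copied from the comment above; the real map is longer.
device_map = {
    "shared": 0,
    "decoder.embed_tokens": 0,
    "encoder.embed_tokens": 0,
    "encoder.block.0": 0,
    "encoder.block.1": 0,
}


def modules_per_device(dm):
    """Return a dict mapping each device to the number of modules placed on it."""
    return dict(Counter(dm.values()))


print(modules_per_device(device_map))
```

If every module lands on device 0, as in the quoted portion, the model is effectively running on a single GPU, which is worth knowing when reading training-time estimates.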