Kartikay Khandelwal
Thanks for the question! We will have more detailed communication around this, but a quick note here. MMF currently supports text + image understanding tasks with some initial support for...
@rohan-varma just BS=1 with the default config.
@joecummings agreed, but with the caveat that we can't have sizable performance gaps relative to the competition. Benchmarking this is a milestone for the MVP.
@walidbet18 thanks for opening this issue! Unfortunately, we don't currently support fine-tuning GGUF models. Is the model you mentioned available in native pytorch format? I'm guessing this is a specific...
Sounds good! Just to confirm: you're looking to fine-tune the Mistral 7B model generally, not `dolphin-2.2.1-mistral-7b.Q2_K.gguf` in particular, is that right? Or do you care about `dolphin-2.2.1-mistral-7b.Q2_K.gguf` specifically?
Ah ok, then you should be able to use the commands from the README and the tutorials directly; just replace the configs with the ones in the mistral folder. So...
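For concreteness, a hypothetical sketch of that workflow (the recipe and config names here are assumptions and may differ across torchtune versions; the commands are echoed rather than executed so the sketch stands alone):

```shell
# Hypothetical sketch of the workflow above; the recipe name and config
# path are assumptions and may differ across torchtune versions.

# 1) Download the base Mistral 7B weights in native PyTorch format.
download_cmd='tune download mistralai/Mistral-7B-v0.1 --output-dir ./Mistral-7B-v0.1'

# 2) Reuse the finetuning command from the README, swapping --config
#    for the corresponding config in the mistral folder.
finetune_cmd='tune run lora_finetune_single_device --config mistral/7B_lora_single_device'

printf '%s\n%s\n' "$download_cmd" "$finetune_cmd"
```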
@joecummings we probably need to address this as part of `tune download` to make sure we aren't downloading to `/tmp`. I can look at this on the checkpointing side as...
Tracking this in #691
Ok yeah, I think I read this as addressing the `/tmp/artifacts` issue, but that shouldn't be a problem anymore. For downloading the models and files, I think we can find...
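For context, a minimal sketch of the failure mode being discussed (the directory and default are hypothetical stand-ins): anything under `/tmp` is typically wiped on reboot, so weights downloaded there can silently disappear.

```shell
# Illustrative only: why defaulting downloads to /tmp is risky.
# OUTPUT_DIR and its default are hypothetical stand-ins, not actual
# tune download behavior.
OUTPUT_DIR="${OUTPUT_DIR:-/tmp/artifacts}"
case "$OUTPUT_DIR" in
  /tmp/*) echo "warning: $OUTPUT_DIR is ephemeral; files may be wiped on reboot" ;;
  *)      echo "ok: $OUTPUT_DIR should persist" ;;
esac
```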
Why is what torch hub does a good idea here? The models etc. that we're downloading are all from HF, so why do we need to replicate torch hub's behavior for this? IIUC,...