Joe Cummings comments

Results 278 comments of


                                            Joe Cummings

Llama-3 Inference and Uploading to Huggingface

> @kartikayk I'm having this same issue, but on the full fine tuned checkpoint. i can't go back and re-train the model with a new checkpointer (i used meta's checkpointer,...

Llama-3 Inference and Uploading to Huggingface

> > 3- just eyeballing it, I'm not particularly sure about that, but It does seem so. there's a lot of repetition, the model hallucinates really bad even on english...

lm harness distributed evaluation?

This is something we're working closely with the EleutherAI team on providing soon. For now, if you have enough RAM (and patience) you can try running on CPU - this...

Add batched inference

> > A tokenizer pad_id of 0. We can update this if we really see a problem, but it covers all our current use cases. > > Is this true?...

update eval wrapper to match lm_eval's api change

Thanks for this quick fix @water-vapor ! Can you post the output of a run with this updated change for posterity?

Update README table

I think we need to get new numbers for this in general, probably can do this through an automated process. Closing this for now.

Support for Gemma 7B

We're definitely interested in adding more models! As I understand it, the 2B and 7B architectures are roughly the same (just different sizes for the parameters). If you'd be interested...

How can I fine tune Llama 3 on my own data?

Hi @agutell, great questions! I think both of these use-cases are covered by this tutorial: https://pytorch.org/torchtune/main/tutorials/chat.html. If you have follow-up questions though, please let us know!

How can I fine tune Llama 3 on my own data?

> Thank you! I'll check it out. One thing though, that page does not seem to show when you enter the documentation from [pytorch.org/torchtune/stable/index.html](https://pytorch.org/torchtune/stable/index.html) So we have a stable version...

feat: add gemma7b support

> @joecummings should I add unit tests for this PR ? Whoops, I keep overwriting instead of quote and reply. Let's just start with W&B run first.