VatsaDev
Hmm, that's really interesting. Thinking about it, repeated data could have distorted the data patterns, making the model look at data in ways it shouldn't have, which could have affected...
Looks like it's been fixed with train run 2? Was the previous val data something it might have trained on? What's the deduplication of the data?
@eminorhan I had similar thoughts, but it's also possible that in the 35% of new data, SlimPajama might be more uniform, or it could have been code data, which also has...
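On the contamination question, here's a minimal sketch of how one might check for exact-duplicate overlap between train and val splits by hashing normalized lines (the file names and normalization are assumptions, not the project's actual pipeline):

```python
import hashlib

def line_hashes(path):
    # Hash each whitespace-normalized, lowercased line so exact
    # duplicates match even with trivial formatting differences.
    hashes = set()
    with open(path, encoding="utf-8") as f:
        for line in f:
            norm = " ".join(line.lower().split())
            if norm:
                hashes.add(hashlib.sha1(norm.encode()).hexdigest())
    return hashes

# Hypothetical file names; substitute the real train/val dumps.
train = line_hashes("train.txt")
val = line_hashes("val.txt")
overlap = train & val
print(f"{len(overlap)} of {len(val)} val lines also appear in train")
```

A real dedup pass would use fuzzier matching (e.g. MinHash over n-grams), but exact-line overlap is a cheap first check.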
Image example: [screenshot]
Overall UI: [screenshot]
Also, be sure to enter your ngrok auth token for Colab, or it won't work.
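If you're setting it from Python inside the Colab notebook, a minimal sketch with pyngrok (the token value and port are placeholders):

```python
from pyngrok import ngrok

# Paste the token from your ngrok dashboard; this placeholder won't work.
ngrok.set_auth_token("YOUR_NGROK_AUTH_TOKEN")

# Open a tunnel to a locally running server, e.g. on port 7860.
tunnel = ngrok.connect(7860)
print(tunnel.public_url)
```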
@gabrielgrant thanks, I've incorporated all the changes, and it works again!
Not really, it's pure Llama, so vLLM should be fine with it, but future versions of this also need to move to ChatML, which I believe the newer finetunes have...
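For context, ChatML wraps each turn in `<|im_start|>`/`<|im_end|>` tokens; a minimal sketch of building such a prompt (the helper is illustrative, not this repo's code):

```python
def to_chatml(messages):
    # Each turn becomes <|im_start|>role\ncontent<|im_end|>.
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
             for m in messages]
    # A trailing open tag cues the model to generate the assistant turn.
    return "\n".join(parts) + "\n<|im_start|>assistant\n"

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```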
This project was meant to be trained and run for inference on a GPU. It does extend to CPU via GGUF, but there is no real support for CoreML/Metal beyond the standard...
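For the CPU-via-GGUF path, a minimal sketch using llama-cpp-python (the model path is a placeholder for the project's GGUF export):

```python
from llama_cpp import Llama

# Hypothetical path; point this at the exported GGUF file.
llm = Llama(model_path="model.gguf", n_ctx=2048)

out = llm("Once upon a time", max_tokens=64)
print(out["choices"][0]["text"])
```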
Can you provide more details? How did you get the error, what were you doing, and what's your platform/device? Can you give a stack trace? Your GitHub issue right now gives...
GPT-2 is glorified autocomplete with the ability to make sentences. If you want better sentences, fine-tune it. I have personally had pretty [good success](https://github.com/VatsaDev/nanoChatGPT) with finetuning gpt-2-medium into...
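As a starting point, here's a minimal fine-tuning sketch with HuggingFace transformers (the dataset file and hyperparameters are placeholders; nanoChatGPT itself uses its own nanoGPT-based training loop, not this):

```python
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("gpt2-medium")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token
model = AutoModelForCausalLM.from_pretrained("gpt2-medium")

# Hypothetical dataset: one training example per line of text.
ds = load_dataset("text", data_files={"train": "chat_data.txt"})["train"]
ds = ds.map(lambda x: tokenizer(x["text"], truncation=True, max_length=512),
            remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="gpt2-medium-ft",
                           num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```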