
Add support for StarCoder models

Open osanseviero opened this issue 2 years ago • 4 comments

Hi there! StarCoder from BigCode was trained for this kind of task, so having some documentation/support for it would be great.

Very nice project btw :fire:

osanseviero avatar Aug 17 '23 20:08 osanseviero

@osanseviero working on this PR for your issue: https://github.com/silvanmelchior/IncognitoPilot/pull/19

ishaan-jaff avatar Aug 18 '23 23:08 ishaan-jaff

Thanks!

Regarding StarCoder: was it fine-tuned to work in chat-style interactions? When I first looked at it, it appeared to be mostly for coding only / reading comments. Also, given the size of the model, I'm not sure it can work well in this setup.

If someone can demo that it works well, however, I'm very open to adding it!

silvanmelchior avatar Aug 19 '23 22:08 silvanmelchior

There are multiple BigCode models (1.1B, 3B, 7B, and 15B), so there are indeed smaller versions. As for which model to use, it depends on the exact behaviour one wants. https://twitter.com/lvwerra/status/1691127139314159628 is a good explanation:

  • StarCoder+ is trained with more natural language
  • StarChat β is then instruct-tuned; see this quick demo: https://huggingface.co/spaces/HuggingFaceH4/starchat-playground

osanseviero avatar Aug 20 '23 15:08 osanseviero

I tried it out a bit on HF, looks nice!

Definitely something to consider, or maybe just, in general, a better explanation of how to add more models.

silvanmelchior avatar Aug 22 '23 05:08 silvanmelchior