starcoder
starcoder copied to clipboard
Failure Modes?
The blog post says the alpha and beta version of StarChat have not been aligned to human preferences with techniques like RLHF, so they can produce problematic outputs (especially when prompted to do so). I was wondering if there are concrete examples people have tried where starchat behaves not as expected?