stanford_alpaca
stanford_alpaca copied to clipboard
prompt-less is better
a bug
blacklist = [
"image",
"images",
"graph",
"graphs",
"picture",
"pictures",
"file",
"files",
"map",
"maps",
"draw",
"plot",
"go to",
"video",
"audio",
"music",
"flowchart",
"diagram",
]
``
should not filter because the model can access and simulate this visual data.
proved by "Can you please copy my output and draw ASCII art"
or
"create a matrix thought experiment"
"create a logic tree, create a flow chart in markdown"
Yep, we have cleaned up this data and removed these limitations in the first pass: https://github.com/gururise/AlpacaDataCleaned/pull/9