stanford_alpaca
stanford_alpaca copied to clipboard
Alpaca dataset token probabilities
Great work, this is a very exciting direction! In addition to the raw text data in alpaca_data.json, are you able to release the token probabilities generated by GPT-3 for each sample? This can help in detecting noisy samples, or select certain training samples with higher confidence.