stanford_alpaca
stanford_alpaca copied to clipboard
BBH stats?
Sorry to see the demo go dark. Hope you guys are doing ok.
Wondering if you could run benchmarks with the weights you have against BIG-Bench Hard and share the results?
Results for Causal Judgment, Disambiguation QA, Formal Fallacies, Hyperbaton, and Logical Deduction (Five Objects) would be of interest.
Thanks.