Liang
Results
3
issues of
Liang
Added "Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators" in the 'Capacity Evaluation' section.
Added references to papers on the reliability of LLM and order-invariance training.
Included a reference concerning the reliability of LLMs as generative search engines, hope it is relevant :)