Liang

Results 3 issues of Liang

Added "Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators" in the 'Capacity Evaluation' section.

Added references to papers on the reliability of LLM and order-invariance training.

Included a reference concerning the reliability of LLMs as generative search engines, hope it is relevant :)