Retrieved January 15, 2023. The human raters aren't gurus in The subject, and so they tend to select text that looks convincing. They'd pick up on quite a few signs and symptoms of hallucination, but not all. Accuracy problems that creep in are tough to catch. ^When prompted to "summarize an write-up" having a fake URL which contains meaningful key