Topic: Hallucinations

Short answer

This page shows the most relevant public items for Hallucinations, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.

Weekly Monthly All time

Current week Past week 2 weeks ago

← Back to home

TruthfulQA: Measuring How Models Mimic Human Falsehoods
Paper • Sep 8, 2021 • arXiv • Stephanie Lin, Jacob Hilton, Owain Evans
We propose TruthfulQA, a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including healt...

Topic: Hallucinations

Short answer

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Related Topics