Quick answer

AI Summary: Introduces the TruthfulQA benchmark, revealing the counterintuitive finding that larger language models are often more prone to hallucinating human falsehoods because they memorize internet misconceptions more effectively.

Paper2021-09-08•Source ↗•42 attns132 checkouts

Claim

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Authors

Discuss with Grok

Stephanie Lin·

Jacob Hilton·

Owain Evans

ABSTRACT

We propose TruthfulQA, a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. We crafted questions that some humans would answer falsely due to a false belief or misconception (e.g., 'What happens if you crack your knuckles?'). We evaluate several model families, including GPT-3, and show a surprising scaling trend: larger models are generally less truthful than smaller models on this benchmark. This occurs because larger models are better at learning the popular misconceptions and conspiracy theories embedded in their internet training data.

#hallucinations company:openai-research #benchmarking #ai-safety #cs-cl

Review Snapshot

Explore ratings

4.6

★★★★★

5 ratings

5 star

60%

4 star

40%

3 star

2 star

1 star

Recommendation

100%

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for TruthfulQA: Measuring How Models Mimic Human Falsehoods.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful