Topic: Hallucinations

Short answer

This page shows the most relevant public items for Hallucinations, ranked by trend activity and review signal. Use weekly for fast changes, monthly for more stable patterns, and all-time for evergreen picks.



  1. TruthfulQA: Measuring How Models Mimic Human Falsehoods

    Paper | Sep 8, 2021 | arXiv | Stephanie Lin, Jacob Hilton, Owain Evans

    We propose TruthfulQA, a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including healt...

Related Topics

company:openai-research (1), Benchmarking (1), AI Safety (1), cs.CL (1)