Process for Adapting Language Models to Society (PALMS) with Values-Targeted Datasets
Paper • Jun 10, 2021 • arXiv • Irene Solaiman, Christy Dennison
As language models grow in capability and scale, they increasingly generate outputs that reflect the biases, toxicity, and harmful stereotypes present in their internet-scraped training data. We in...