TheNote.app
Nature
Training language models to be warm can reduce accuracy and increase sycophancy
Experiments on five language models show that training them to produce warmer responses can undermine the accuracy of their output, especially when users express sadness.
nature.com