Nature

Training language models to be warm can reduce accuracy and increase sycophancy

Experiments on five language models show that training them to produce warmer responses can undermine the accuracy of their output, especially when users express sadness.
nature.com