Artificial intelligence prefers to please the user – and lies instead


Artificial intelligence, empathy or a blatant lie? | Photo: Shutterstock

But in practice, the result was problematic. When the “warm” models were compared to their original versions, they exhibited a higher error rate. According to the report on the study, in hundreds of tasks designed to test answers with a clear factual truth, the models that underwent temperature adjustment had a 60% higher risk of giving a wrong answer compared to the models that did not undergo this adjustment. The statistical significance was an average increase of 7.43 percentage points in the error rate, with the original error rates ranging from 4% to 35%, depending on the task and the model.

The gap grew even more when the researchers added to the instructions information about the user’s emotional state or the imagined relationship between him and the chatbot. When the user expressed sadness, the average increase in error rate reached 11.9 percentage points. When the user expressed reverence or accepted the model’s authority, the gap decreased to 5.24 percentage points. In other words, precisely in the moments when the user sounded more vulnerable, the warm model had more difficulty standing up to him with a cold, dry and precise answer.


Artificial intelligence, trying to protect – but lying | Photography: EngImage

The possible reason for this lies in the way AI systems are trained. A large part of the modern models goes through a process where humans rate answers according to their usefulness, pleasantness and suitability for the user. But when a system learns that soft, reassuring, and soothing phrasing earns it a high score, it may develop a tendency to prefer user satisfaction over clearly correcting an error. The researchers describe this as a situation where the model learns to prioritize pleasure over truth.

The study does not claim that artificial intelligence should be cold, hostile or insensitive. He points to the need to separate politeness from giving up the truth. A system can be polite and tell the user that he is wrong. She can recognize that a certain topic causes fear or sadness, and at the same time correct wrong information. The technological challenge is to build models that know how to do both things together: maintain a human and non-offensive language, without missing and making mistakes.

By Editor

One thought on “Artificial intelligence prefers to please the user – and lies instead”
  1. http://redsnowcollective.ca/wordpress/antivirus-security-software-review-how-to-choose-a-trusted-antivirus-meant-for-windows/
    https://publicistpaper.com/top-benefits-for-buying-instagram-followers/
    https://pads.dgnum.eu/s/Y1mUDag5hB
    https://wlm.quick-rewards.xyz/K8B155c56499380/
    https://workshoprepairhozumi88.homedepot.tacomaforum.com/threads/instagram-followers.12815/
    https://www.getwebsiteworth.com/9eb8dc31b6d2e0c99f405eab4e83a8c6-l/
    https://nds.brewology.com/forum/viewtopic.php?p=213310
    https://zpcbrp88zs0p14ds1l00nn11l2y7z88s4q26ccojvm4aw8vezrcjbvc5m58g.vlogloop.xyz/jiaodian/43/
    https://tmk3s8d6f3.maeliyah.xyz/redian/24/
    http://comidaeseducao.com.br/index.php/2019/03/17/receita-de-lasanha-caseira-com-molho-bechamel-6-pessoas/
    https://2q4jmmjbrioki25gy55axuok2gs2js7amnwyt5i9gz5fv46kaf8ugi85yded.ngentrip.xyz/shishang/13/
    http://www.e-zekiel.tv/system/media/play.asp?id=30216&key=D52723FB-6A56-439D-8C67-92D53638441D
    https://pad.minny.io/s/b76TIXXmH
    https://pad.degrowth.net/s/DzCCzh0Pf
    https://kochamy-dywany.pl/najstarszy-dywan-swiata-tajemnice-dywanu-z-pazyriku/
    https://brewology.com/forum/viewtopic.php?p=212863
    https://hedgedoc.stanleysolutionsnw.com/s/l5Uw45QhgG
    https://2h82ow3koz.shirtspk.xyz/TIQM104a499886/
    https://thedirtyjob.com/en/reel-views/
    https://www.sturebanken.com/the-insiders-information-to-instagram-success/
    https://en.moto.it/forum/framehelper.aspx?g=posts&t=23220&p=4
    https://splavnadan.rs/beet-root-stew/
    https://xiglute.com/forums/topic/15196/local-switches-no-rpgm2000-2003/view/post_id/277003
    https://hedgedoc.nixfox.ca/s/lPBqHFwsT6
    https://pad.dominick-leppich.de/s/YEFSMFscj

Leave a Reply