TY - GEN
T1 - Human or Robot?
T2 - 21st ACM International Conference on Intelligent Virtual Agents, IVA 2021
AU - Ferstl, Ylva
AU - Thomas, Sean
AU - Guiard, Cédric
AU - Ennis, Cathy
AU - McDonnell, Rachel
N1 - Publisher Copyright:
© 2021 Owner/Author.
PY - 2021/9/14
Y1 - 2021/9/14
N2 - Research on creation of virtual humans enables increasing automatization of their behavior, including synthesis of verbal and nonverbal behavior. As the achievable realism of different aspects of agent design evolves asynchronously, it is important to understand if and how divergence in realism between behavioral channels can elicit negative user responses. Specifically, in this work, we investigate the question of whether autonomous virtual agents relying on synthetic text-to-speech voices should portray a corresponding level of realism in the non-verbal channels of motion and visual appearance, or if, alternatively, the best available realism of each channel should be used. In two perceptual studies, we assess how realism of voice, motion, and appearance influence the perceived match of speech and gesture motion, as well as the agent's likability and human-likeness. Our results suggest that maximizing realism of voice and motion is preferable even when this leads to realism mismatches, but for visual appearance, lower realism may be preferable. (A video abstract can be found at https://youtu.be/arfZZ-hxD1Y.)
AB - Research on creation of virtual humans enables increasing automatization of their behavior, including synthesis of verbal and nonverbal behavior. As the achievable realism of different aspects of agent design evolves asynchronously, it is important to understand if and how divergence in realism between behavioral channels can elicit negative user responses. Specifically, in this work, we investigate the question of whether autonomous virtual agents relying on synthetic text-to-speech voices should portray a corresponding level of realism in the non-verbal channels of motion and visual appearance, or if, alternatively, the best available realism of each channel should be used. In two perceptual studies, we assess how realism of voice, motion, and appearance influence the perceived match of speech and gesture motion, as well as the agent's likability and human-likeness. Our results suggest that maximizing realism of voice and motion is preferable even when this leads to realism mismatches, but for visual appearance, lower realism may be preferable. (A video abstract can be found at https://youtu.be/arfZZ-hxD1Y.)
KW - agent design
KW - animation style
KW - anthropomorphism
KW - conversational agents
KW - gesture motion
KW - human-computer interfaces
KW - perception
KW - text-to-speech
UR - http://www.scopus.com/inward/record.url?scp=85115793623&partnerID=8YFLogxK
U2 - 10.1145/3472306.3478338
DO - 10.1145/3472306.3478338
M3 - Conference contribution
AN - SCOPUS:85115793623
T3 - Proceedings of the 21st ACM International Conference on Intelligent Virtual Agents, IVA 2021
SP - 76
EP - 83
BT - Proceedings of the 21st ACM International Conference on Intelligent Virtual Agents, IVA 2021
PB - Association for Computing Machinery, Inc
Y2 - 14 September 2021 through 17 September 2021
ER -