TY - GEN
T1 - Seeing is believing
T2 - 37th International Conference and Exhibition on Computer Graphics and Interactive Techniques, SIGGRAPH 2010
AU - Ennis, Cathy
AU - McDonnell, Rachel
AU - O'Sullivan, Carol
PY - 2010/7/26
Y1 - 2010/7/26
N2 - In many scenes with human characters, interacting groups are an important factor for maintaining a sense of realism. However, little is known about what makes these characters appear realistic. In this paper, we investigate human sensitivity to audio mismatches (i.e., when individuals' voices are not matched to their gestures) and visual desynchronization (i.e., when the body motions of the individuals in a group are mis-aligned in time) in virtual human conversers. Using motion capture data from a range of both polite conversations and arguments, we conduct a series of perceptual experiments and determine some factors that contribute to the plausibility of virtual conversing groups. We found that participants are more sensitive to visual desynchronization of body motions, than to mismatches between the characters' gestures and their voices. Furthermore, synthetic conversations can appear sufficiently realistic once there is an appropriate balance between talker and listener roles. This is regardless of body motion desynchronization or mismatched audio.
AB - In many scenes with human characters, interacting groups are an important factor for maintaining a sense of realism. However, little is known about what makes these characters appear realistic. In this paper, we investigate human sensitivity to audio mismatches (i.e., when individuals' voices are not matched to their gestures) and visual desynchronization (i.e., when the body motions of the individuals in a group are mis-aligned in time) in virtual human conversers. Using motion capture data from a range of both polite conversations and arguments, we conduct a series of perceptual experiments and determine some factors that contribute to the plausibility of virtual conversing groups. We found that participants are more sensitive to visual desynchronization of body motions, than to mismatches between the characters' gestures and their voices. Furthermore, synthetic conversations can appear sufficiently realistic once there is an appropriate balance between talker and listener roles. This is regardless of body motion desynchronization or mismatched audio.
KW - Conversational agents
KW - Crowds
KW - Perception
UR - http://www.scopus.com/inward/record.url?scp=77956362234&partnerID=8YFLogxK
U2 - 10.1145/1778765.1778828
DO - 10.1145/1778765.1778828
M3 - Conference contribution
AN - SCOPUS:77956362234
VL - 29
T3 - ACM Transactions on Graphics
BT - ACM SIGGRAPH 2010 Papers, SIGGRAPH 2010
A2 - Hoppe, Hugues
PB - Association for Computing Machinery, Inc
Y2 - 26 July 2010 through 30 July 2010
ER -