TY - GEN
T1 - Situating spatial templates for human-robot interaction
AU - Kelleher, John
AU - Ross, Robert
AU - Mac Namee, Brian
AU - Sloan, Colm
PY - 2010
Y1 - 2010
N2 - People often refer to objects by describing the object's spatial location relative to another object. Due to their ubiquity in situated discourse, the ability to use 'locative expressions' is fundamental to human-robot dialogue systems. A key component of this ability are computational models of spatial term semantics. These models bridge the grounding gap between spatial language and sensor data. Within the Artificial Intelligence and Robotics communities, spatial template based accounts, such as the Attention Vector Sum model (Regier and Carlson, 2001), have found considerable application in mediating situated human-machine communication (Gorniak, 2004; Brenner et a., 2007; Kelleher and Costello, 2009). Through empirical validation and computational application these template based models have proven their usefulness. We argue, however, that these models ignore important contextual features; resulting in their over-generalization and failure to account for actual usage in situated context. Such over-simplifications are a natural consequence of the experimental design taken in acquiring these models. That is, the data behind and hence the subsequent modelling of template based accounts used simplified scenes and reduced 2-dimensional survey based object configurations. While this is understandable given the original aims of these studies, we nevertheless believe that this is not sufficient justification for the direct application of idealized spatial templates to situated communication. This critique of template based models is similar in spirit to critiques already put forward by a number of researchers: Coventry and Garrod (2004) have stressed the need to account for functional effects; Kelleher and Costello (2009) highlighted the need to account for the effects introduced by distractors. Here, we argue that the models must also be extended to incorporate perspective effects.
AB - People often refer to objects by describing the object's spatial location relative to another object. Due to their ubiquity in situated discourse, the ability to use 'locative expressions' is fundamental to human-robot dialogue systems. A key component of this ability are computational models of spatial term semantics. These models bridge the grounding gap between spatial language and sensor data. Within the Artificial Intelligence and Robotics communities, spatial template based accounts, such as the Attention Vector Sum model (Regier and Carlson, 2001), have found considerable application in mediating situated human-machine communication (Gorniak, 2004; Brenner et a., 2007; Kelleher and Costello, 2009). Through empirical validation and computational application these template based models have proven their usefulness. We argue, however, that these models ignore important contextual features; resulting in their over-generalization and failure to account for actual usage in situated context. Such over-simplifications are a natural consequence of the experimental design taken in acquiring these models. That is, the data behind and hence the subsequent modelling of template based accounts used simplified scenes and reduced 2-dimensional survey based object configurations. While this is understandable given the original aims of these studies, we nevertheless believe that this is not sufficient justification for the direct application of idealized spatial templates to situated communication. This critique of template based models is similar in spirit to critiques already put forward by a number of researchers: Coventry and Garrod (2004) have stressed the need to account for functional effects; Kelleher and Costello (2009) highlighted the need to account for the effects introduced by distractors. Here, we argue that the models must also be extended to incorporate perspective effects.
KW - locative expressions
KW - human-robot dialogue systems
KW - spatial term semantics
KW - spatial language
KW - sensor data
KW - spatial template
KW - situated human-machine communication
KW - contextual features
KW - perspective effects
UR - http://www.scopus.com/inward/record.url?scp=79960126310&partnerID=8YFLogxK
U2 - 10.21427/d7fs52
DO - 10.21427/d7fs52
M3 - Conference contribution
AN - SCOPUS:79960126310
SN - 9781577354871
T3 - AAAI Fall Symposium - Technical Report
SP - 145
EP - 146
BT - Dialog with Robots - Papers from the AAAI Fall Symposium, Technical Report
T2 - 2010 AAAI Fall Symposium
Y2 - 11 November 2010 through 13 November 2010
ER -