Abstract
Current advances in spoken interface design point to a shift towards more "human-like" interaction, in contrast to the traditional "push-to-talk" approach. However, human dialogue is characterized by synchrony and multi-modality, properties that traditional representation approaches, such as turn succession, do not capture. This paper proposes an alternative representation schema for recorded human dialogues that uses per-frame averages of the speaker turn distribution to inform further analyses of temporal features (pauses and overlaps) in terms of inter-speaker accommodation. Preliminary results of such analyses are provided.
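The abstract does not spell out how the per-frame averages are computed, so the following is only an illustrative sketch of one possible reading: the recording is cut into fixed-length frames, and for each frame the fraction of time each speaker holds the floor is recorded. The function name `speaking_share_per_frame`, the `(speaker, start, end)` turn format, and the frame length are all assumptions made for this example, not details taken from the paper.

```python
import math
from collections import defaultdict


def speaking_share_per_frame(turns, frame_len, duration):
    """Hypothetical helper: per-frame fraction of time each speaker talks.

    turns     -- iterable of (speaker, start_sec, end_sec) tuples (assumed format)
    frame_len -- frame length in seconds (assumed parameter)
    duration  -- total length of the recording in seconds
    """
    n_frames = math.ceil(duration / frame_len)
    frames = [defaultdict(float) for _ in range(n_frames)]
    for speaker, start, end in turns:
        first = int(start // frame_len)
        last = min(int(end // frame_len), n_frames - 1)
        for i in range(first, last + 1):
            # Clip the turn to the boundaries of frame i.
            lo = max(start, i * frame_len)
            hi = min(end, (i + 1) * frame_len)
            if hi > lo:
                frames[i][speaker] += (hi - lo) / frame_len
    return [dict(f) for f in frames]


# Two speakers; B starts talking one second before A stops.
turns = [("A", 0.0, 3.0), ("B", 2.0, 6.0)]
shares = speaking_share_per_frame(turns, frame_len=2.0, duration=6.0)
```

Under this reading, a frame whose shares sum to more than 1 contains some overlapping speech, and a frame whose shares sum to less than 1 contains some silence, which is one way such a representation could feed analyses of pauses and overlaps.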
Original language | English |
---|---|
Pages (from-to) | 2163-2166 |
Number of pages | 4 |
Journal | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Publication status | Published - 2009 |
Event | 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom; Duration: 6 Sep 2009 → 10 Sep 2009 |
Keywords
- Representation
- Spoken dialogue systems
- Synchrony