Speech Command + Speech Emotion: Exploring Emotional Speech Commands as a Compound and Playful Modality
Speech Command + Speech Emotion: Exploring Emotional Speech Commands as a Compound and Playful Modality
In an era of human-computer interaction with increasingly agentic AI systems capable of connecting with users conversationally, speech is an important modality for commanding agents. By recognizing and using speech emotions (i.e., how a command is spoken), we can provide agents with the ability to emotionally accentuate their responses and socially enrich users' perceptions and experiences. To explore the concept and impact of speech emotion commands on user perceptions, we realized a prototype and conducted a user study (N = 14) where speech commands are used to steer two vehicles in a minimalist and retro game style implementation. While both agents execute user commands, only one of the agents uses speech emotion information to adapt its execution behavior. We report on differences in how users perceived each agent, including significant differences in stimulation and dependability, outline implications for designing interactions with agents using emotional speech commands, and provide insights on how users consciously emote, which we describe as "voice acting".
Ilhan Aslan、Timothy Merritt、Stine S. Johansen、Niels van Berkel
计算技术、计算机技术
Ilhan Aslan,Timothy Merritt,Stine S. Johansen,Niels van Berkel.Speech Command + Speech Emotion: Exploring Emotional Speech Commands as a Compound and Playful Modality[EB/OL].(2025-04-11)[2025-05-17].https://arxiv.org/abs/2504.08440.点此复制
评论