Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents

Authors
Novelli, Emiliano; Alvarado, Yoselie; Guerrero, Roberto A.
Publication year
2025
Language
English
Resource type
conference paper
Status
published version
Description
This paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional text-based interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems.
Red de Universidades con Carreras en Informática
Subject
Computer Science
Embodied conversational agents
Visemes
Mobile applications
Multimodal interaction
Facial animation
Affective computing
Natural user interfaces
Access level
open access
Terms of use
http://creativecommons.org/licenses/by-nc-sa/4.0/
Repository
SEDICI (UNLP)
Institution
Universidad Nacional de La Plata
OAI identifier
oai:sedici.unlp.edu.ar:10915/191262

id SEDICI_df72b2f1ee3211a16ae9f689bc087d25
oai_identifier_str oai:sedici.unlp.edu.ar:10915/191262
network_acronym_str SEDICI
repository_id_str 1329
network_name_str SEDICI (UNLP)
dc.title.none.fl_str_mv Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
dc.creator.none.fl_str_mv Novelli, Emiliano
Alvarado, Yoselie
Guerrero, Roberto A.
dc.subject.none.fl_str_mv Computer Science
Embodied conversational agents
Visemes
Mobile applications
Multimodal interaction
Facial animation
Affective computing
Natural user interfaces
dc.description.none.fl_txt_mv This paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional text-based interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems.
Red de Universidades con Carreras en Informática
publishDate 2025
dc.date.none.fl_str_mv 2025-10
dc.type.none.fl_str_mv info:eu-repo/semantics/conferenceObject
info:eu-repo/semantics/publishedVersion
Conference object
http://purl.org/coar/resource_type/c_5794
info:ar-repo/semantics/documentoDeConferencia
format conferenceObject
status_str publishedVersion
dc.identifier.none.fl_str_mv http://sedici.unlp.edu.ar/handle/10915/191262
url http://sedici.unlp.edu.ar/handle/10915/191262
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7
info:eu-repo/semantics/reference/hdl/10915/189846
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
405-414
dc.source.none.fl_str_mv reponame:SEDICI (UNLP)
instname:Universidad Nacional de La Plata
instacron:UNLP
repository.name.fl_str_mv SEDICI (UNLP) - Universidad Nacional de La Plata
repository.mail.fl_str_mv alira@sedici.unlp.edu.ar