Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents

Autores: Novelli, Emiliano; Alvarado, Yoselie; Guerrero, Roberto A.
Año de publicación: 2025
Idioma: inglés
Tipo de recurso: documento de conferencia
Estado: versión publicada
Descripción: This paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional textbased interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems.
Red de Universidades con Carreras en Informática
Materia: Ciencias Informáticas
Embodied conversational agents
Visemes
Mobile applications
Multimodal interaction
Facial animation
Affective computing
Natural user interfaces
Nivel de accesibilidad: acceso abierto
Condiciones de uso: http://creativecommons.org/licenses/by-nc-sa/4.0/
Repositorio
Institución: Universidad Nacional de La Plata
OAI Identificador: oai:sedici.unlp.edu.ar:10915/191262

Acceder

id	SEDICI_df72b2f1ee3211a16ae9f689bc087d25
oai_identifier_str	oai:sedici.unlp.edu.ar:10915/191262
network_acronym_str	SEDICI
repository_id_str	1329
network_name_str	SEDICI (UNLP)
spelling	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agentsNovelli, EmilianoAlvarado, YoselieGuerrero, Roberto A.Ciencias InformáticasEmbodied conversational agentsVisemesMobile applicationsMultimodal interactionFacial animationAffective computingNatural user interfacesThis paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional textbased interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems.Red de Universidades con Carreras en Informática2025-10info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf405-414http://sedici.unlp.edu.ar/handle/10915/191262enginfo:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7info:eu-repo/semantics/reference/hdl/10915/189846info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-05-27T11:46:59Zoai:sedici.unlp.edu.ar:10915/191262Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-05-27 11:46:59.593SEDICI (UNLP) - Universidad Nacional de La Platafalse
dc.title.none.fl_str_mv	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
title	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
spellingShingle	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents Novelli, Emiliano Ciencias Informáticas Embodied conversational agents Visemes Mobile applications Multimodal interaction Facial animation Affective computing Natural user interfaces
title_short	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
title_full	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
title_fullStr	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
title_full_unstemmed	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
title_sort	Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents
dc.creator.none.fl_str_mv	Novelli, Emiliano Alvarado, Yoselie Guerrero, Roberto A.
author	Novelli, Emiliano
author_facet	Novelli, Emiliano Alvarado, Yoselie Guerrero, Roberto A.
author_role	author
author2	Alvarado, Yoselie Guerrero, Roberto A.
author2_role	author author
dc.subject.none.fl_str_mv	Ciencias Informáticas Embodied conversational agents Visemes Mobile applications Multimodal interaction Facial animation Affective computing Natural user interfaces
topic	Ciencias Informáticas Embodied conversational agents Visemes Mobile applications Multimodal interaction Facial animation Affective computing Natural user interfaces
dc.description.none.fl_txt_mv	This paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional textbased interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems. Red de Universidades con Carreras en Informática
description	This paper presents the development of a conversational agent that enriches human-computer interaction beyond traditional textbased interfaces by adopting a multimodal approach integrating speech, facial gesture dynamics, and emotional expressions on mobile platforms. The agent features real-time speech synthesis synchronized with visemes and is capable of displaying basic emotional states through animated facial expressions. Inspired by the principles of Embodied Conversational Agents and Natural User Interfaces, the application leverages vector graphics, animation engines, and several technologies for multiplatform support. The design emphasizes natural interaction, emotional perception, and usability, while seeking to circumvent the Uncanny Valley phenomenon by investigating varied strategies of visual representation. Evaluation results demonstrate that the system performs well in terms of small-screen interface legibility, computational performance, and user affective experience. This work contributes to the field of multimodal interfaces by demonstrating the feasibility and advantages of incorporating emotional and gestural cues into mobile conversational systems.
publishDate	2025
dc.date.none.fl_str_mv	2025-10
dc.type.none.fl_str_mv	info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia
format	conferenceObject
status_str	publishedVersion
dc.identifier.none.fl_str_mv	http://sedici.unlp.edu.ar/handle/10915/191262
url	http://sedici.unlp.edu.ar/handle/10915/191262
dc.language.none.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	info:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7 info:eu-repo/semantics/reference/hdl/10915/189846
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv	openAccess
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.format.none.fl_str_mv	application/pdf 405-414
dc.source.none.fl_str_mv	reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP
reponame_str	SEDICI (UNLP)
collection	SEDICI (UNLP)
instname_str	Universidad Nacional de La Plata
instacron_str	UNLP
institution	UNLP
repository.name.fl_str_mv	SEDICI (UNLP) - Universidad Nacional de La Plata
repository.mail.fl_str_mv	alira@sedici.unlp.edu.ar
_version_	1866372196838932480
score	13.343132

Beyond Text: Enhancing human-computer interaction through multimodal embodied conversational agents

Publicaciones similares