Noisy Speech Recognition Based on Combined Audio-Visual Classifiers
- Authors
- Terissi, Lucas Daniel; Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela
- Year of publication
- 2015
- Language
- English
- Resource type
- article
- Status
- published version
- Description
- An isolated word speech recognition system based on audio-visual features is proposed in this paper. To enhance the recognition over different noisy conditions, this system combines three classifiers based on audio, visual and audio-visual information, respectively. The performance of the proposed recognition system is evaluated over two isolated word audio-visual databases, a public one and a database compiled by the authors of this paper. Experimental results show that the structure of the proposed system leads to significant improvements of the recognition rates through a wide range of signal-to-noise ratios.
Affiliation: Terissi, Lucas Daniel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina
Affiliation: Sad, Gonzalo Daniel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina
Affiliation: Gomez, Juan Carlos. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina
Affiliation: Parodi, Marianela. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina
- Subjects
- Noisy Speech; Audio-Visual Speech Features; Audio-Visual Information Fusion
- Access level
- open access
- Terms of use
- https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
- Repository
- Institution
- Consejo Nacional de Investigaciones Científicas y Técnicas
- OAI identifier
- oai:ri.conicet.gov.ar:11336/4799
| id | CONICETDig_7ba8157b4d0433434aa2151bad3d8d23 |
|---|---|
| oai_identifier_str | oai:ri.conicet.gov.ar:11336/4799 |
| network_acronym_str | CONICETDig |
| repository_id_str | 3498 |
| network_name_str | CONICET Digital (CONICET) |
| dc.title.none.fl_str_mv | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| title | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| spellingShingle | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers; Terissi, Lucas Daniel; Noisy Speech; Audio-Visual Speech Features; Audio-Visual Information Fusion |
| title_short | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| title_full | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| title_fullStr | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| title_full_unstemmed | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| title_sort | Noisy Speech Recognition Based on Combined Audio-Visual Classifiers |
| dc.creator.none.fl_str_mv | Terissi, Lucas Daniel; Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela |
| author | Terissi, Lucas Daniel |
| author_facet | Terissi, Lucas Daniel; Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela |
| author_role | author |
| author2 | Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela |
| author2_role | author; author; author |
| dc.subject.none.fl_str_mv | Noisy Speech; Audio-Visual Speech Features; Audio-Visual Information Fusion |
| topic | Noisy Speech; Audio-Visual Speech Features; Audio-Visual Information Fusion |
| purl_subject.fl_str_mv | https://purl.org/becyt/ford/5.1 https://purl.org/becyt/ford/5 https://purl.org/becyt/ford/2.2 https://purl.org/becyt/ford/2 |
| dc.description.none.fl_txt_mv | An isolated word speech recognition system based on audio-visual features is proposed in this paper. To enhance the recognition over different noisy conditions, this system combines three classifiers based on audio, visual and audio-visual information, respectively. The performance of the proposed recognition system is evaluated over two isolated word audio-visual databases, a public one and a database compiled by the authors of this paper. Experimental results show that the structure of the proposed system leads to significant improvements of the recognition rates through a wide range of signal-to-noise ratios. Affiliation: Terissi, Lucas Daniel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina. Affiliation: Sad, Gonzalo Daniel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina. Affiliation: Gomez, Juan Carlos. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina. Affiliation: Parodi, Marianela. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Rosario. Centro Internacional Franco Argentino de Ciencias de la Información y Sistemas; Argentina |
| description | An isolated word speech recognition system based on audio-visual features is proposed in this paper. To enhance the recognition over different noisy conditions, this system combines three classifiers based on audio, visual and audio-visual information, respectively. The performance of the proposed recognition system is evaluated over two isolated word audio-visual databases, a public one and a database compiled by the authors of this paper. Experimental results show that the structure of the proposed system leads to significant improvements of the recognition rates through a wide range of signal-to-noise ratios. |
| publishDate | 2015 |
| dc.date.none.fl_str_mv | 2015-01 |
| dc.type.none.fl_str_mv | info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
| format | article |
| status_str | publishedVersion |
| dc.identifier.none.fl_str_mv | http://hdl.handle.net/11336/4799 Terissi, Lucas Daniel; Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela; Noisy Speech Recognition Based on Combined Audio-Visual Classifiers; Springer; Lecture Notes In Computer Science; 8869; 1-2015; 43-53 978-3-319-14898-4 978-3-319-14899-1 0302-9743 |
| url | http://hdl.handle.net/11336/4799 |
| identifier_str_mv | Terissi, Lucas Daniel; Sad, Gonzalo Daniel; Gomez, Juan Carlos; Parodi, Marianela; Noisy Speech Recognition Based on Combined Audio-Visual Classifiers; Springer; Lecture Notes In Computer Science; 8869; 1-2015; 43-53 978-3-319-14898-4 978-3-319-14899-1 0302-9743 |
| dc.language.none.fl_str_mv | eng |
| language | eng |
| dc.relation.none.fl_str_mv | info:eu-repo/semantics/altIdentifier/url/http://link.springer.com/chapter/10.1007/978-3-319-14899-1_5 info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-319-14899-1_5 info:eu-repo/semantics/altIdentifier/issn/0302-9743 info:eu-repo/semantics/altIdentifier/isbn/978-3-319-14898-4 |
| dc.rights.none.fl_str_mv | info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
| eu_rights_str_mv | openAccess |
| rights_invalid_str_mv | https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
| dc.format.none.fl_str_mv | application/pdf application/pdf application/pdf application/pdf application/pdf application/pdf |
| dc.publisher.none.fl_str_mv | Springer |
| publisher.none.fl_str_mv | Springer |
| dc.source.none.fl_str_mv | reponame:CONICET Digital (CONICET) instname:Consejo Nacional de Investigaciones Científicas y Técnicas |
| reponame_str | CONICET Digital (CONICET) |
| collection | CONICET Digital (CONICET) |
| instname_str | Consejo Nacional de Investigaciones Científicas y Técnicas |
| repository.name.fl_str_mv | CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas |
| repository.mail.fl_str_mv | dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar |
| _version_ | 1842269069822132224 |
| score | 13.13397 |