Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees

Autores
Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia
Año de publicación
2010
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.
Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; Argentina
Fil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; Argentina
Fil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; Argentina
Materia
Sequence learning
EM algorithm
Wavelets
Speech recognition
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/97607

id CONICETDig_74c4fcd0ad7d592ebd884d9498981d73
oai_identifier_str oai:ri.conicet.gov.ar:11336/97607
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov treesMilone, Diego HumbertoDi Persia, Leandro EzequielTorres, Maria EugeniaSequence learningEM algorithmWaveletsSpeech recognitionhttps://purl.org/becyt/ford/1.2https://purl.org/becyt/ford/1Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; ArgentinaFil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; ArgentinaFil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; ArgentinaElsevier2010-04info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/97607Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-15890031-3203CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/doi/10.1016/j.patcog.2009.11.010info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-10-15T14:25:37Zoai:ri.conicet.gov.ar:11336/97607instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-10-15 14:25:37.946CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
title Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
spellingShingle Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
Milone, Diego Humberto
Sequence learning
EM algorithm
Wavelets
Speech recognition
title_short Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
title_full Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
title_fullStr Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
title_full_unstemmed Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
title_sort Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
dc.creator.none.fl_str_mv Milone, Diego Humberto
Di Persia, Leandro Ezequiel
Torres, Maria Eugenia
author Milone, Diego Humberto
author_facet Milone, Diego Humberto
Di Persia, Leandro Ezequiel
Torres, Maria Eugenia
author_role author
author2 Di Persia, Leandro Ezequiel
Torres, Maria Eugenia
author2_role author
author
dc.subject.none.fl_str_mv Sequence learning
EM algorithm
Wavelets
Speech recognition
topic Sequence learning
EM algorithm
Wavelets
Speech recognition
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.2
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.
Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; Argentina
Fil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; Argentina
Fil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; Argentina
description Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.
publishDate 2010
dc.date.none.fl_str_mv 2010-04
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/97607
Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-1589
0031-3203
CONICET Digital
CONICET
url http://hdl.handle.net/11336/97607
identifier_str_mv Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-1589
0031-3203
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/doi/10.1016/j.patcog.2009.11.010
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Elsevier
publisher.none.fl_str_mv Elsevier
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1846082693171773440
score 13.22299