Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
- Autores
- Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia
- Año de publicación
- 2010
- Idioma
- inglés
- Tipo de recurso
- artículo
- Estado
- versión publicada
- Descripción
- Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.
Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; Argentina
Fil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; Argentina
Fil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; Argentina - Materia
-
Sequence learning
EM algorithm
Wavelets
Speech recognition - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
- Repositorio
- Institución
- Consejo Nacional de Investigaciones Científicas y Técnicas
- OAI Identificador
- oai:ri.conicet.gov.ar:11336/97607
Ver los metadatos del registro completo
id |
CONICETDig_74c4fcd0ad7d592ebd884d9498981d73 |
---|---|
oai_identifier_str |
oai:ri.conicet.gov.ar:11336/97607 |
network_acronym_str |
CONICETDig |
repository_id_str |
3498 |
network_name_str |
CONICET Digital (CONICET) |
spelling |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov treesMilone, Diego HumbertoDi Persia, Leandro EzequielTorres, Maria EugeniaSequence learningEM algorithmWaveletsSpeech recognitionhttps://purl.org/becyt/ford/1.2https://purl.org/becyt/ford/1Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures.Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; ArgentinaFil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; ArgentinaFil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; ArgentinaElsevier2010-04info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/97607Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-15890031-3203CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/doi/10.1016/j.patcog.2009.11.010info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-10-15T14:25:37Zoai:ri.conicet.gov.ar:11336/97607instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-10-15 14:25:37.946CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse |
dc.title.none.fl_str_mv |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
title |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
spellingShingle |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees Milone, Diego Humberto Sequence learning EM algorithm Wavelets Speech recognition |
title_short |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
title_full |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
title_fullStr |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
title_full_unstemmed |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
title_sort |
Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees |
dc.creator.none.fl_str_mv |
Milone, Diego Humberto Di Persia, Leandro Ezequiel Torres, Maria Eugenia |
author |
Milone, Diego Humberto |
author_facet |
Milone, Diego Humberto Di Persia, Leandro Ezequiel Torres, Maria Eugenia |
author_role |
author |
author2 |
Di Persia, Leandro Ezequiel Torres, Maria Eugenia |
author2_role |
author author |
dc.subject.none.fl_str_mv |
Sequence learning EM algorithm Wavelets Speech recognition |
topic |
Sequence learning EM algorithm Wavelets Speech recognition |
purl_subject.fl_str_mv |
https://purl.org/becyt/ford/1.2 https://purl.org/becyt/ford/1 |
dc.description.none.fl_txt_mv |
Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures. Fil: Milone, Diego Humberto. Facultad de Ingeniería y Ciencias Hídricas; Argentina Fil: Di Persia, Leandro Ezequiel. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Departamento de Informática. Laboratorio de Investigaciones en Señales e Inteligencia Computacional; Argentina Fil: Torres, Maria Eugenia. Universidad Nacional de Entre Rios. Facultad de Ingeniería. Departamento de Matemática E Informatica; Argentina |
description |
Hidden Markov models have been found very useful for a wide range of applications in machine learning and pattern recognition. The wavelet transform has emerged as a new tool for signal and image analysis. Learning models for wavelet coefficients have been mainly based on fixed-length sequences, but real applications often require to model variable-length, very long or real-time sequences. In this paper, we propose a new learning architecture for sequences analyzed on short-term basis, but not assuming stationarity within each frame. Long-term dependencies will be modeled with a hidden Markov model which, in each internal state, will deal with the local dynamics in the wavelet domain, using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture could be useful for a wide range of applications. We detail two experiments with artificial and real data: model-based denoising and speech recognition. Denoising results indicate that the proposed model and learning algorithm are more effective than previous approaches based on isolated hidden Markov trees. In the case of the 'Doppler' benchmark sequence, with 1024 samples and additive white noise, the new method reduced the mean squared error from 1.0 to 0.0842. The proposed methods for feature extraction, modeling and learning, increased the phoneme recognition rates in 28.13%, with better convergence than models based on Gaussian mixtures. |
publishDate |
2010 |
dc.date.none.fl_str_mv |
2010-04 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
format |
article |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://hdl.handle.net/11336/97607 Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-1589 0031-3203 CONICET Digital CONICET |
url |
http://hdl.handle.net/11336/97607 |
identifier_str_mv |
Milone, Diego Humberto; Di Persia, Leandro Ezequiel; Torres, Maria Eugenia; Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees; Elsevier; Pattern Recognition; 43; 4; 4-2010; 1577-1589 0031-3203 CONICET Digital CONICET |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.patcog.2009.11.010 |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
dc.format.none.fl_str_mv |
application/pdf application/pdf application/pdf application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:CONICET Digital (CONICET) instname:Consejo Nacional de Investigaciones Científicas y Técnicas |
reponame_str |
CONICET Digital (CONICET) |
collection |
CONICET Digital (CONICET) |
instname_str |
Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.name.fl_str_mv |
CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.mail.fl_str_mv |
dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar |
_version_ |
1846082693171773440 |
score |
13.22299 |