Criticality of mostly informative samples: A Bayesian model selection approach

Autores
Haimovici, Ariel; Marsili, Matteo
Año de publicación
2015
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
We discuss a Bayesian model selection approach to high-dimensional data in the deep under-sampling regime. The data is based on a representation of the possible discrete states s, as defined by the observer, and it consists of M observations of the state. This approach shows that, for a given sample size M, not all states observed in the sample can be distinguished. Rather, only a partition of the sampled states s can be resolved. Such a partition defines an emergent classification qs of the states that becomes finer and finer as the sample size increases, through a process of symmetry breaking between states. This allows us to distinguish between the resolution of a given representation of the observer defined states s, which is given by the entropy of s, and its relevance, which is defined by the entropy of the partition qs. Relevance has a nonmonotonic dependence on resolution, for a given sample size. In addition, we characterise most relevant samples and we show that they exhibit power law frequency distributions, generally taken as signatures of criticality. This suggests that criticality reflects the relevance of a given representation of the states of a complex system, and does not necessarily require a specific mechanism of self-organisation to a critical point.
Fil: Haimovici, Ariel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Fil: Marsili, Matteo. The Abdus Salam; Italia
Materia
DATA MINING (THEORY)
STATISTICAL INFERENCE
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/59551

id CONICETDig_4ed2f450a9104415ac400b9ea5ac9c4c
oai_identifier_str oai:ri.conicet.gov.ar:11336/59551
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Criticality of mostly informative samples: A Bayesian model selection approachHaimovici, ArielMarsili, MatteoDATA MINING (THEORY)STATISTICAL INFERENCEhttps://purl.org/becyt/ford/1.3https://purl.org/becyt/ford/1We discuss a Bayesian model selection approach to high-dimensional data in the deep under-sampling regime. The data is based on a representation of the possible discrete states s, as defined by the observer, and it consists of M observations of the state. This approach shows that, for a given sample size M, not all states observed in the sample can be distinguished. Rather, only a partition of the sampled states s can be resolved. Such a partition defines an emergent classification qs of the states that becomes finer and finer as the sample size increases, through a process of symmetry breaking between states. This allows us to distinguish between the resolution of a given representation of the observer defined states s, which is given by the entropy of s, and its relevance, which is defined by the entropy of the partition qs. Relevance has a nonmonotonic dependence on resolution, for a given sample size. In addition, we characterise most relevant samples and we show that they exhibit power law frequency distributions, generally taken as signatures of criticality. This suggests that criticality reflects the relevance of a given representation of the states of a complex system, and does not necessarily require a specific mechanism of self-organisation to a critical point.Fil: Haimovici, Ariel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; ArgentinaFil: Marsili, Matteo. The Abdus Salam; ItaliaIOP Publishing2015-10info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/59551Haimovici, Ariel; Marsili, Matteo; Criticality of mostly informative samples: A Bayesian model selection approach; IOP Publishing; Journal of Statistical Mechanics: Theory and Experiment; 2015; 10; 10-2015; 1-26; P100131742-5468CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/doi/10.1088/1742-5468/2015/10/P10013info:eu-repo/semantics/altIdentifier/url/http://iopscience.iop.org/article/10.1088/1742-5468/2015/10/P10013/metainfo:eu-repo/semantics/altIdentifier/url/https://arxiv.org/abs/1502.00356info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2026-06-17T10:00:59Zoai:ri.conicet.gov.ar:11336/59551instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982026-06-17 10:01:00.068CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Criticality of mostly informative samples: A Bayesian model selection approach
title Criticality of mostly informative samples: A Bayesian model selection approach
spellingShingle Criticality of mostly informative samples: A Bayesian model selection approach
Haimovici, Ariel
DATA MINING (THEORY)
STATISTICAL INFERENCE
title_short Criticality of mostly informative samples: A Bayesian model selection approach
title_full Criticality of mostly informative samples: A Bayesian model selection approach
title_fullStr Criticality of mostly informative samples: A Bayesian model selection approach
title_full_unstemmed Criticality of mostly informative samples: A Bayesian model selection approach
title_sort Criticality of mostly informative samples: A Bayesian model selection approach
dc.creator.none.fl_str_mv Haimovici, Ariel
Marsili, Matteo
author Haimovici, Ariel
author_facet Haimovici, Ariel
Marsili, Matteo
author_role author
author2 Marsili, Matteo
author2_role author
dc.subject.none.fl_str_mv DATA MINING (THEORY)
STATISTICAL INFERENCE
topic DATA MINING (THEORY)
STATISTICAL INFERENCE
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.3
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv We discuss a Bayesian model selection approach to high-dimensional data in the deep under-sampling regime. The data is based on a representation of the possible discrete states s, as defined by the observer, and it consists of M observations of the state. This approach shows that, for a given sample size M, not all states observed in the sample can be distinguished. Rather, only a partition of the sampled states s can be resolved. Such a partition defines an emergent classification qs of the states that becomes finer and finer as the sample size increases, through a process of symmetry breaking between states. This allows us to distinguish between the resolution of a given representation of the observer defined states s, which is given by the entropy of s, and its relevance, which is defined by the entropy of the partition qs. Relevance has a nonmonotonic dependence on resolution, for a given sample size. In addition, we characterise most relevant samples and we show that they exhibit power law frequency distributions, generally taken as signatures of criticality. This suggests that criticality reflects the relevance of a given representation of the states of a complex system, and does not necessarily require a specific mechanism of self-organisation to a critical point.
Fil: Haimovici, Ariel. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales. Departamento de Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Fil: Marsili, Matteo. The Abdus Salam; Italia
description We discuss a Bayesian model selection approach to high-dimensional data in the deep under-sampling regime. The data is based on a representation of the possible discrete states s, as defined by the observer, and it consists of M observations of the state. This approach shows that, for a given sample size M, not all states observed in the sample can be distinguished. Rather, only a partition of the sampled states s can be resolved. Such a partition defines an emergent classification qs of the states that becomes finer and finer as the sample size increases, through a process of symmetry breaking between states. This allows us to distinguish between the resolution of a given representation of the observer defined states s, which is given by the entropy of s, and its relevance, which is defined by the entropy of the partition qs. Relevance has a nonmonotonic dependence on resolution, for a given sample size. In addition, we characterise most relevant samples and we show that they exhibit power law frequency distributions, generally taken as signatures of criticality. This suggests that criticality reflects the relevance of a given representation of the states of a complex system, and does not necessarily require a specific mechanism of self-organisation to a critical point.
publishDate 2015
dc.date.none.fl_str_mv 2015-10
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/59551
Haimovici, Ariel; Marsili, Matteo; Criticality of mostly informative samples: A Bayesian model selection approach; IOP Publishing; Journal of Statistical Mechanics: Theory and Experiment; 2015; 10; 10-2015; 1-26; P10013
1742-5468
CONICET Digital
CONICET
url http://hdl.handle.net/11336/59551
identifier_str_mv Haimovici, Ariel; Marsili, Matteo; Criticality of mostly informative samples: A Bayesian model selection approach; IOP Publishing; Journal of Statistical Mechanics: Theory and Experiment; 2015; 10; 10-2015; 1-26; P10013
1742-5468
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/doi/10.1088/1742-5468/2015/10/P10013
info:eu-repo/semantics/altIdentifier/url/http://iopscience.iop.org/article/10.1088/1742-5468/2015/10/P10013/meta
info:eu-repo/semantics/altIdentifier/url/https://arxiv.org/abs/1502.00356
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv IOP Publishing
publisher.none.fl_str_mv IOP Publishing
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1868339598992605184
score 13.040872