NLP-based faceted search: Experience in the development of a science and technology search engine

Autores
Armentano, Marcelo Gabriel; Amandi, Analia Adriana; Campo, Marcelo Ricardo; Godoy, Daniela Lis
Año de publicación
2014
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
An appropriate promotion, distribution and dissemination of scientific, artistic and technology developments can foster the collaboration between a country’s productive and academic sectors. The purpose of this paper is to present a novel search engine aiming at helping people to access science and technology advances, researchers and institutions working in specific areas of research. Our search engine first collects information disseminated on the Web in academic institution sites and in researchers personal homepages. Then, after intensive text processing, it summarizes the information in an enriched and user-friendly presentation oriented to non-expert users. Stable performance and an acceptable level of effectiveness for automatic named entities recognition indicate the potential of our approach for bridging the gap between the heterogeneous and unstructured information available on the Web about the research and development advances in a country and the innovation required by the productive sectors.
Fil: Armentano, Marcelo Gabriel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Amandi, Analia Adriana. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Campo, Marcelo Ricardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Godoy, Daniela Lis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Materia
Vertical Search Engines
Faceted Search
Named Entities Recognition
Natural Language Processing
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/6771

id CONICETDig_ed56f15adbc3de2e0e56a5bd1eb4c191
oai_identifier_str oai:ri.conicet.gov.ar:11336/6771
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling NLP-based faceted search: Experience in the development of a science and technology search engineArmentano, Marcelo GabrielAmandi, Analia AdrianaCampo, Marcelo RicardoGodoy, Daniela LisVertical Search EnginesFaceted SearchNamed Entities RecognitionNatural Language Processinghttps://purl.org/becyt/ford/1.2https://purl.org/becyt/ford/1An appropriate promotion, distribution and dissemination of scientific, artistic and technology developments can foster the collaboration between a country’s productive and academic sectors. The purpose of this paper is to present a novel search engine aiming at helping people to access science and technology advances, researchers and institutions working in specific areas of research. Our search engine first collects information disseminated on the Web in academic institution sites and in researchers personal homepages. Then, after intensive text processing, it summarizes the information in an enriched and user-friendly presentation oriented to non-expert users. Stable performance and an acceptable level of effectiveness for automatic named entities recognition indicate the potential of our approach for bridging the gap between the heterogeneous and unstructured information available on the Web about the research and development advances in a country and the innovation required by the productive sectors.Fil: Armentano, Marcelo Gabriel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; ArgentinaFil: Amandi, Analia Adriana. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; ArgentinaFil: Campo, Marcelo Ricardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; ArgentinaFil: Godoy, Daniela Lis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; ArgentinaElsevier2014-05info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/6771Armentano, Marcelo Gabriel; Amandi, Analia Adriana; Campo, Marcelo Ricardo; Godoy, Daniela Lis; NLP-based faceted search: Experience in the development of a science and technology search engine; Elsevier; Expert Systems with Applications; 41; 6; 5-2014; 2886-28960957-4174enginfo:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S0957417413008397info:eu-repo/semantics/altIdentifier/doi/10.1016/j.eswa.2013.10.023info:eu-repo/semantics/altIdentifier/doi/info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-nd/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-29T10:06:28Zoai:ri.conicet.gov.ar:11336/6771instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-29 10:06:28.779CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv NLP-based faceted search: Experience in the development of a science and technology search engine
title NLP-based faceted search: Experience in the development of a science and technology search engine
spellingShingle NLP-based faceted search: Experience in the development of a science and technology search engine
Armentano, Marcelo Gabriel
Vertical Search Engines
Faceted Search
Named Entities Recognition
Natural Language Processing
title_short NLP-based faceted search: Experience in the development of a science and technology search engine
title_full NLP-based faceted search: Experience in the development of a science and technology search engine
title_fullStr NLP-based faceted search: Experience in the development of a science and technology search engine
title_full_unstemmed NLP-based faceted search: Experience in the development of a science and technology search engine
title_sort NLP-based faceted search: Experience in the development of a science and technology search engine
dc.creator.none.fl_str_mv Armentano, Marcelo Gabriel
Amandi, Analia Adriana
Campo, Marcelo Ricardo
Godoy, Daniela Lis
author Armentano, Marcelo Gabriel
author_facet Armentano, Marcelo Gabriel
Amandi, Analia Adriana
Campo, Marcelo Ricardo
Godoy, Daniela Lis
author_role author
author2 Amandi, Analia Adriana
Campo, Marcelo Ricardo
Godoy, Daniela Lis
author2_role author
author
author
dc.subject.none.fl_str_mv Vertical Search Engines
Faceted Search
Named Entities Recognition
Natural Language Processing
topic Vertical Search Engines
Faceted Search
Named Entities Recognition
Natural Language Processing
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.2
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv An appropriate promotion, distribution and dissemination of scientific, artistic and technology developments can foster the collaboration between a country’s productive and academic sectors. The purpose of this paper is to present a novel search engine aiming at helping people to access science and technology advances, researchers and institutions working in specific areas of research. Our search engine first collects information disseminated on the Web in academic institution sites and in researchers personal homepages. Then, after intensive text processing, it summarizes the information in an enriched and user-friendly presentation oriented to non-expert users. Stable performance and an acceptable level of effectiveness for automatic named entities recognition indicate the potential of our approach for bridging the gap between the heterogeneous and unstructured information available on the Web about the research and development advances in a country and the innovation required by the productive sectors.
Fil: Armentano, Marcelo Gabriel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Amandi, Analia Adriana. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Campo, Marcelo Ricardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
Fil: Godoy, Daniela Lis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Tandil. Instituto Superior de Ingenieria del Software; Argentina
description An appropriate promotion, distribution and dissemination of scientific, artistic and technology developments can foster the collaboration between a country’s productive and academic sectors. The purpose of this paper is to present a novel search engine aiming at helping people to access science and technology advances, researchers and institutions working in specific areas of research. Our search engine first collects information disseminated on the Web in academic institution sites and in researchers personal homepages. Then, after intensive text processing, it summarizes the information in an enriched and user-friendly presentation oriented to non-expert users. Stable performance and an acceptable level of effectiveness for automatic named entities recognition indicate the potential of our approach for bridging the gap between the heterogeneous and unstructured information available on the Web about the research and development advances in a country and the innovation required by the productive sectors.
publishDate 2014
dc.date.none.fl_str_mv 2014-05
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/6771
Armentano, Marcelo Gabriel; Amandi, Analia Adriana; Campo, Marcelo Ricardo; Godoy, Daniela Lis; NLP-based faceted search: Experience in the development of a science and technology search engine; Elsevier; Expert Systems with Applications; 41; 6; 5-2014; 2886-2896
0957-4174
url http://hdl.handle.net/11336/6771
identifier_str_mv Armentano, Marcelo Gabriel; Amandi, Analia Adriana; Campo, Marcelo Ricardo; Godoy, Daniela Lis; NLP-based faceted search: Experience in the development of a science and technology search engine; Elsevier; Expert Systems with Applications; 41; 6; 5-2014; 2886-2896
0957-4174
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S0957417413008397
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.eswa.2013.10.023
info:eu-repo/semantics/altIdentifier/doi/
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Elsevier
publisher.none.fl_str_mv Elsevier
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1844613913960775680
score 13.070432