OpenCL overview, implementation, and performance comparison

Autores: Fraire, Juan Andres; Ferreyra, Pablo Alejandro; Marques, Carlos Alberto
Año de publicación: 2013
Idioma: inglés
Tipo de recurso: artículo
Estado: versión publicada
Descripción: High performance parallel computing was something exclusive for expensive specialized hardware some years ago. But now we can find powerful parallel processors in many home graphics card whose interface has been recently opened by many manufacturers for general purpose computing. OpenCL, created by the world most important processors manufacturers, went a little further, aiming for a platform and manufacturer independent parallel language. However, understanding this new processing paradigm is challenging and critical for future computation demanding applications. The first approach of this document is to provide a deep technical background of OpenCL architecture. Second, we propose an implementation of a matrix product calculation OpenCL kernel directly implemented in C++ without wrappers so as to describe in detail the OpenCL programming flow. Thirdly, different platforms and algebraic scenarios are created for this program concluding that the improvement of calculation performance can reach up to 3 orders of magnitude over the same algorithm in plain C++.
Fil: Fraire, Juan Andres. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Fil: Ferreyra, Pablo Alejandro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina
Fil: Marques, Carlos Alberto. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Materia: Heterogeneus
Systems
Paralelism
Computing
Nivel de accesibilidad: acceso abierto
Condiciones de uso: https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
Institución: Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador: oai:ri.conicet.gov.ar:11336/25002

Acceder

id	CONICETDig_f49a3fe1a9074420cdee2a8f6eb0d069
oai_identifier_str	oai:ri.conicet.gov.ar:11336/25002
network_acronym_str	CONICETDig
repository_id_str	3498
network_name_str	CONICET Digital (CONICET)
spelling	OpenCL overview, implementation, and performance comparisonFraire, Juan AndresFerreyra, Pablo AlejandroMarques, Carlos AlbertoHeterogeneusSystemsParalelismComputinghttps://purl.org/becyt/ford/2.2https://purl.org/becyt/ford/2High performance parallel computing was something exclusive for expensive specialized hardware some years ago. But now we can find powerful parallel processors in many home graphics card whose interface has been recently opened by many manufacturers for general purpose computing. OpenCL, created by the world most important processors manufacturers, went a little further, aiming for a platform and manufacturer independent parallel language. However, understanding this new processing paradigm is challenging and critical for future computation demanding applications. The first approach of this document is to provide a deep technical background of OpenCL architecture. Second, we propose an implementation of a matrix product calculation OpenCL kernel directly implemented in C++ without wrappers so as to describe in detail the OpenCL programming flow. Thirdly, different platforms and algebraic scenarios are created for this program concluding that the improvement of calculation performance can reach up to 3 orders of magnitude over the same algorithm in plain C++.Fil: Fraire, Juan Andres. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; ArgentinaFil: Ferreyra, Pablo Alejandro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; ArgentinaFil: Marques, Carlos Alberto. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; ArgentinaInstitute of Electrical and Electronics Engineers2013-04info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/25002Fraire, Juan Andres; Ferreyra, Pablo Alejandro; Marques, Carlos Alberto; OpenCL overview, implementation, and performance comparison; Institute of Electrical and Electronics Engineers; IEEE Latin America Transactions; 11; 1; 4-2013; 274-2801548-0992CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/doi/10.1109/TLA.2013.6502816info:eu-repo/semantics/altIdentifier/url/http://ieeexplore.ieee.org/document/6502816/info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2026-06-04T11:09:31Zoai:ri.conicet.gov.ar:11336/25002instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982026-06-04 11:09:31.345CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv	OpenCL overview, implementation, and performance comparison
title	OpenCL overview, implementation, and performance comparison
spellingShingle	OpenCL overview, implementation, and performance comparison Fraire, Juan Andres Heterogeneus Systems Paralelism Computing
title_short	OpenCL overview, implementation, and performance comparison
title_full	OpenCL overview, implementation, and performance comparison
title_fullStr	OpenCL overview, implementation, and performance comparison
title_full_unstemmed	OpenCL overview, implementation, and performance comparison
title_sort	OpenCL overview, implementation, and performance comparison
dc.creator.none.fl_str_mv	Fraire, Juan Andres Ferreyra, Pablo Alejandro Marques, Carlos Alberto
author	Fraire, Juan Andres
author_facet	Fraire, Juan Andres Ferreyra, Pablo Alejandro Marques, Carlos Alberto
author_role	author
author2	Ferreyra, Pablo Alejandro Marques, Carlos Alberto
author2_role	author author
dc.subject.none.fl_str_mv	Heterogeneus Systems Paralelism Computing
topic	Heterogeneus Systems Paralelism Computing
purl_subject.fl_str_mv	https://purl.org/becyt/ford/2.2 https://purl.org/becyt/ford/2
dc.description.none.fl_txt_mv	High performance parallel computing was something exclusive for expensive specialized hardware some years ago. But now we can find powerful parallel processors in many home graphics card whose interface has been recently opened by many manufacturers for general purpose computing. OpenCL, created by the world most important processors manufacturers, went a little further, aiming for a platform and manufacturer independent parallel language. However, understanding this new processing paradigm is challenging and critical for future computation demanding applications. The first approach of this document is to provide a deep technical background of OpenCL architecture. Second, we propose an implementation of a matrix product calculation OpenCL kernel directly implemented in C++ without wrappers so as to describe in detail the OpenCL programming flow. Thirdly, different platforms and algebraic scenarios are created for this program concluding that the improvement of calculation performance can reach up to 3 orders of magnitude over the same algorithm in plain C++. Fil: Fraire, Juan Andres. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina Fil: Ferreyra, Pablo Alejandro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina Fil: Marques, Carlos Alberto. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía y Física; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
description	High performance parallel computing was something exclusive for expensive specialized hardware some years ago. But now we can find powerful parallel processors in many home graphics card whose interface has been recently opened by many manufacturers for general purpose computing. OpenCL, created by the world most important processors manufacturers, went a little further, aiming for a platform and manufacturer independent parallel language. However, understanding this new processing paradigm is challenging and critical for future computation demanding applications. The first approach of this document is to provide a deep technical background of OpenCL architecture. Second, we propose an implementation of a matrix product calculation OpenCL kernel directly implemented in C++ without wrappers so as to describe in detail the OpenCL programming flow. Thirdly, different platforms and algebraic scenarios are created for this program concluding that the improvement of calculation performance can reach up to 3 orders of magnitude over the same algorithm in plain C++.
publishDate	2013
dc.date.none.fl_str_mv	2013-04
dc.type.none.fl_str_mv	info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo
format	article
status_str	publishedVersion
dc.identifier.none.fl_str_mv	http://hdl.handle.net/11336/25002 Fraire, Juan Andres; Ferreyra, Pablo Alejandro; Marques, Carlos Alberto; OpenCL overview, implementation, and performance comparison; Institute of Electrical and Electronics Engineers; IEEE Latin America Transactions; 11; 1; 4-2013; 274-280 1548-0992 CONICET Digital CONICET
url	http://hdl.handle.net/11336/25002
identifier_str_mv	Fraire, Juan Andres; Ferreyra, Pablo Alejandro; Marques, Carlos Alberto; OpenCL overview, implementation, and performance comparison; Institute of Electrical and Electronics Engineers; IEEE Latin America Transactions; 11; 1; 4-2013; 274-280 1548-0992 CONICET Digital CONICET
dc.language.none.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	info:eu-repo/semantics/altIdentifier/doi/10.1109/TLA.2013.6502816 info:eu-repo/semantics/altIdentifier/url/http://ieeexplore.ieee.org/document/6502816/
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv	openAccess
rights_invalid_str_mv	https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv	application/pdf application/pdf application/pdf application/pdf application/pdf
dc.publisher.none.fl_str_mv	Institute of Electrical and Electronics Engineers
publisher.none.fl_str_mv	Institute of Electrical and Electronics Engineers
dc.source.none.fl_str_mv	reponame:CONICET Digital (CONICET) instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str	CONICET Digital (CONICET)
collection	CONICET Digital (CONICET)
instname_str	Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv	CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv	dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_	1867099158818586624
score	12.832306

OpenCL overview, implementation, and performance comparison

Publicaciones similares