Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture

This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory...

Full description

Autores:
Alvarez, Mauricio
Ramirez, Alex
Valero, Mateo
Azevedo, Arnaldo
Meenderinck, Cor
Juurlink, Ben
Tipo de recurso:
Article of journal
Fecha de publicación:
2009
Institución:
Universidad Nacional de Colombia
Repositorio:
Universidad Nacional de Colombia
Idioma:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/28590
Acceso en línea:
https://repositorio.unal.edu.co/handle/unal/28590
http://bdigital.unal.edu.co/18638/
Palabra clave:
Video codec parallelization
multicore architectures
synchronization
H.264
multiprocessor architectures
Rights
openAccess
License
Atribución-NoComercial 4.0 Internacional
id UNACIONAL2_dbe2daabd2a5502951e44499265c455c
oai_identifier_str oai:repositorio.unal.edu.co:unal/28590
network_acronym_str UNACIONAL2
network_name_str Universidad Nacional de Colombia
repository_id_str
spelling Atribución-NoComercial 4.0 InternacionalDerechos reservados - Universidad Nacional de Colombiahttp://creativecommons.org/licenses/by-nc/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Alvarez, Mauricio41aeaa30-c7bf-47dd-bab3-6550b0f02ea7300Ramirez, Alexbaad626c-85f8-4255-8a7a-c63c5cb1be92300Valero, Mateo4a4a9255-a09a-49d5-94de-4975f9818333300Azevedo, Arnaldo8682b1b1-4dd4-4171-8d5b-13bc47bc558b300Meenderinck, Corb7b033ef-c78c-49ab-8dea-a7fa0d2860b4300Juurlink, Benf2497540-69ff-460d-891f-5770baeda1653002019-06-26T10:20:56Z2019-06-26T10:20:56Z2009https://repositorio.unal.edu.co/handle/unal/28590http://bdigital.unal.edu.co/18638/This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory multiprocessor (SMP) and compared the results with the theoretical expectations. The study includes the evaluation of three different scheduling techniques: static, dynamic and dynamic with tail-submit. A dynamic scheduling approach with a tail-submit optimization presents the best performance obtaining a maximum speedup of 9.5 with 24 processors. A detailed profiling analysis showed that thread synchronization is one of the limiting factors for achieving a better scalability. The paper includes an evaluation of the impact of using blocking synchronization APIs like POSIX threads and POSIX real-time extensions. Results showed that macroblock-level parallelism as a very fine-grain form of Thread-Level Parallelism (TLP) is highly affected by the thread synchronization overhead generated by these APIs. Other synchronization methods, possibly with hardware support, are required in order to make MB-level parallelization more scalable.application/pdfspaUniversidad Nacional de Colombia -Sede Medellínhttp://revistas.unal.edu.co/index.php/avances/article/view/14512Universidad Nacional de Colombia Revistas electrónicas UN Avances en Sistemas e InformáticaAvances en Sistemas e InformáticaAvances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663Alvarez, Mauricio and Ramirez, Alex and Valero, Mateo and Azevedo, Arnaldo and Meenderinck, Cor and Juurlink, Ben (2009) Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture. Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663 .Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architectureArtículo de revistainfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501http://purl.org/coar/resource_type/c_2df8fbb1http://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/ARTVideo codec parallelizationmulticore architecturessynchronizationH.264multiprocessor architecturesORIGINAL14512-43460-1-PB.pdfapplication/pdf1175951https://repositorio.unal.edu.co/bitstream/unal/28590/1/14512-43460-1-PB.pdfd7d12d185be2ee240bb22270d2d20c7cMD51THUMBNAIL14512-43460-1-PB.pdf.jpg14512-43460-1-PB.pdf.jpgGenerated Thumbnailimage/jpeg9952https://repositorio.unal.edu.co/bitstream/unal/28590/2/14512-43460-1-PB.pdf.jpg33eb878cdaf7d7e7f073b18b41ea8077MD52unal/28590oai:repositorio.unal.edu.co:unal/285902022-11-17 23:02:47.522Repositorio Institucional Universidad Nacional de Colombiarepositorio_nal@unal.edu.co
dc.title.spa.fl_str_mv Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
title Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
spellingShingle Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
Video codec parallelization
multicore architectures
synchronization
H.264
multiprocessor architectures
title_short Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
title_full Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
title_fullStr Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
title_full_unstemmed Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
title_sort Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
dc.creator.fl_str_mv Alvarez, Mauricio
Ramirez, Alex
Valero, Mateo
Azevedo, Arnaldo
Meenderinck, Cor
Juurlink, Ben
dc.contributor.author.spa.fl_str_mv Alvarez, Mauricio
Ramirez, Alex
Valero, Mateo
Azevedo, Arnaldo
Meenderinck, Cor
Juurlink, Ben
dc.subject.proposal.spa.fl_str_mv Video codec parallelization
multicore architectures
synchronization
H.264
multiprocessor architectures
topic Video codec parallelization
multicore architectures
synchronization
H.264
multiprocessor architectures
description This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory multiprocessor (SMP) and compared the results with the theoretical expectations. The study includes the evaluation of three different scheduling techniques: static, dynamic and dynamic with tail-submit. A dynamic scheduling approach with a tail-submit optimization presents the best performance obtaining a maximum speedup of 9.5 with 24 processors. A detailed profiling analysis showed that thread synchronization is one of the limiting factors for achieving a better scalability. The paper includes an evaluation of the impact of using blocking synchronization APIs like POSIX threads and POSIX real-time extensions. Results showed that macroblock-level parallelism as a very fine-grain form of Thread-Level Parallelism (TLP) is highly affected by the thread synchronization overhead generated by these APIs. Other synchronization methods, possibly with hardware support, are required in order to make MB-level parallelization more scalable.
publishDate 2009
dc.date.issued.spa.fl_str_mv 2009
dc.date.accessioned.spa.fl_str_mv 2019-06-26T10:20:56Z
dc.date.available.spa.fl_str_mv 2019-06-26T10:20:56Z
dc.type.spa.fl_str_mv Artículo de revista
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_2df8fbb1
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/article
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.coar.spa.fl_str_mv http://purl.org/coar/resource_type/c_6501
dc.type.coarversion.spa.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.content.spa.fl_str_mv Text
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/ART
format http://purl.org/coar/resource_type/c_6501
status_str publishedVersion
dc.identifier.uri.none.fl_str_mv https://repositorio.unal.edu.co/handle/unal/28590
dc.identifier.eprints.spa.fl_str_mv http://bdigital.unal.edu.co/18638/
url https://repositorio.unal.edu.co/handle/unal/28590
http://bdigital.unal.edu.co/18638/
dc.language.iso.spa.fl_str_mv spa
language spa
dc.relation.spa.fl_str_mv http://revistas.unal.edu.co/index.php/avances/article/view/14512
dc.relation.ispartof.spa.fl_str_mv Universidad Nacional de Colombia Revistas electrónicas UN Avances en Sistemas e Informática
Avances en Sistemas e Informática
dc.relation.ispartofseries.none.fl_str_mv Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663
dc.relation.references.spa.fl_str_mv Alvarez, Mauricio and Ramirez, Alex and Valero, Mateo and Azevedo, Arnaldo and Meenderinck, Cor and Juurlink, Ben (2009) Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture. Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663 .
dc.rights.spa.fl_str_mv Derechos reservados - Universidad Nacional de Colombia
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.rights.license.spa.fl_str_mv Atribución-NoComercial 4.0 Internacional
dc.rights.uri.spa.fl_str_mv http://creativecommons.org/licenses/by-nc/4.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv Atribución-NoComercial 4.0 Internacional
Derechos reservados - Universidad Nacional de Colombia
http://creativecommons.org/licenses/by-nc/4.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.mimetype.spa.fl_str_mv application/pdf
dc.publisher.spa.fl_str_mv Universidad Nacional de Colombia -Sede Medellín
institution Universidad Nacional de Colombia
bitstream.url.fl_str_mv https://repositorio.unal.edu.co/bitstream/unal/28590/1/14512-43460-1-PB.pdf
https://repositorio.unal.edu.co/bitstream/unal/28590/2/14512-43460-1-PB.pdf.jpg
bitstream.checksum.fl_str_mv d7d12d185be2ee240bb22270d2d20c7c
33eb878cdaf7d7e7f073b18b41ea8077
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositorio Institucional Universidad Nacional de Colombia
repository.mail.fl_str_mv repositorio_nal@unal.edu.co
_version_ 1814090041161416704