Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture
This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory...
- Autores:
-
Alvarez, Mauricio
Ramirez, Alex
Valero, Mateo
Azevedo, Arnaldo
Meenderinck, Cor
Juurlink, Ben
- Tipo de recurso:
- Article of journal
- Fecha de publicación:
- 2009
- Institución:
- Universidad Nacional de Colombia
- Repositorio:
- Universidad Nacional de Colombia
- Idioma:
- spa
- OAI Identifier:
- oai:repositorio.unal.edu.co:unal/28590
- Acceso en línea:
- https://repositorio.unal.edu.co/handle/unal/28590
http://bdigital.unal.edu.co/18638/
- Palabra clave:
- Video codec parallelization
multicore architectures
synchronization
H.264
multiprocessor architectures
- Rights
- openAccess
- License
- Atribución-NoComercial 4.0 Internacional
id |
UNACIONAL2_dbe2daabd2a5502951e44499265c455c |
---|---|
oai_identifier_str |
oai:repositorio.unal.edu.co:unal/28590 |
network_acronym_str |
UNACIONAL2 |
network_name_str |
Universidad Nacional de Colombia |
repository_id_str |
|
spelling |
Atribución-NoComercial 4.0 InternacionalDerechos reservados - Universidad Nacional de Colombiahttp://creativecommons.org/licenses/by-nc/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Alvarez, Mauricio41aeaa30-c7bf-47dd-bab3-6550b0f02ea7300Ramirez, Alexbaad626c-85f8-4255-8a7a-c63c5cb1be92300Valero, Mateo4a4a9255-a09a-49d5-94de-4975f9818333300Azevedo, Arnaldo8682b1b1-4dd4-4171-8d5b-13bc47bc558b300Meenderinck, Corb7b033ef-c78c-49ab-8dea-a7fa0d2860b4300Juurlink, Benf2497540-69ff-460d-891f-5770baeda1653002019-06-26T10:20:56Z2019-06-26T10:20:56Z2009https://repositorio.unal.edu.co/handle/unal/28590http://bdigital.unal.edu.co/18638/This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory multiprocessor (SMP) and compared the results with the theoretical expectations. The study includes the evaluation of three different scheduling techniques: static, dynamic and dynamic with tail-submit. A dynamic scheduling approach with a tail-submit optimization presents the best performance obtaining a maximum speedup of 9.5 with 24 processors. A detailed profiling analysis showed that thread synchronization is one of the limiting factors for achieving a better scalability. The paper includes an evaluation of the impact of using blocking synchronization APIs like POSIX threads and POSIX real-time extensions. Results showed that macroblock-level parallelism as a very fine-grain form of Thread-Level Parallelism (TLP) is highly affected by the thread synchronization overhead generated by these APIs. Other synchronization methods, possibly with hardware support, are required in order to make MB-level parallelization more scalable.application/pdfspaUniversidad Nacional de Colombia -Sede Medellínhttp://revistas.unal.edu.co/index.php/avances/article/view/14512Universidad Nacional de Colombia Revistas electrónicas UN Avances en Sistemas e InformáticaAvances en Sistemas e InformáticaAvances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663Alvarez, Mauricio and Ramirez, Alex and Valero, Mateo and Azevedo, Arnaldo and Meenderinck, Cor and Juurlink, Ben (2009) Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture. Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663 .Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architectureArtículo de revistainfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501http://purl.org/coar/resource_type/c_2df8fbb1http://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/ARTVideo codec parallelizationmulticore architecturessynchronizationH.264multiprocessor architecturesORIGINAL14512-43460-1-PB.pdfapplication/pdf1175951https://repositorio.unal.edu.co/bitstream/unal/28590/1/14512-43460-1-PB.pdfd7d12d185be2ee240bb22270d2d20c7cMD51THUMBNAIL14512-43460-1-PB.pdf.jpg14512-43460-1-PB.pdf.jpgGenerated Thumbnailimage/jpeg9952https://repositorio.unal.edu.co/bitstream/unal/28590/2/14512-43460-1-PB.pdf.jpg33eb878cdaf7d7e7f073b18b41ea8077MD52unal/28590oai:repositorio.unal.edu.co:unal/285902022-11-17 23:02:47.522Repositorio Institucional Universidad Nacional de Colombiarepositorio_nal@unal.edu.co |
dc.title.spa.fl_str_mv |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
title |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
spellingShingle |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture Video codec parallelization multicore architectures synchronization H.264 multiprocessor architectures |
title_short |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
title_full |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
title_fullStr |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
title_full_unstemmed |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
title_sort |
Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture |
dc.creator.fl_str_mv |
Alvarez, Mauricio Ramirez, Alex Valero, Mateo Azevedo, Arnaldo Meenderinck, Cor Juurlink, Ben |
dc.contributor.author.spa.fl_str_mv |
Alvarez, Mauricio Ramirez, Alex Valero, Mateo Azevedo, Arnaldo Meenderinck, Cor Juurlink, Ben |
dc.subject.proposal.spa.fl_str_mv |
Video codec parallelization multicore architectures synchronization H.264 multiprocessor architectures |
topic |
Video codec parallelization multicore architectures synchronization H.264 multiprocessor architectures |
description |
This paper presents a study of the performance scalability of a macroblock-level parallelization of the H.264 decoder for High Definition (HD) applications on a multiprocessor architecture. We have implemented this parallelization on a cache coherent Non-uniform Memory Access (cc-NUMA) shared memory multiprocessor (SMP) and compared the results with the theoretical expectations. The study includes the evaluation of three different scheduling techniques: static, dynamic and dynamic with tail-submit. A dynamic scheduling approach with a tail-submit optimization presents the best performance obtaining a maximum speedup of 9.5 with 24 processors. A detailed profiling analysis showed that thread synchronization is one of the limiting factors for achieving a better scalability. The paper includes an evaluation of the impact of using blocking synchronization APIs like POSIX threads and POSIX real-time extensions. Results showed that macroblock-level parallelism as a very fine-grain form of Thread-Level Parallelism (TLP) is highly affected by the thread synchronization overhead generated by these APIs. Other synchronization methods, possibly with hardware support, are required in order to make MB-level parallelization more scalable. |
publishDate |
2009 |
dc.date.issued.spa.fl_str_mv |
2009 |
dc.date.accessioned.spa.fl_str_mv |
2019-06-26T10:20:56Z |
dc.date.available.spa.fl_str_mv |
2019-06-26T10:20:56Z |
dc.type.spa.fl_str_mv |
Artículo de revista |
dc.type.coar.fl_str_mv |
http://purl.org/coar/resource_type/c_2df8fbb1 |
dc.type.driver.spa.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.version.spa.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.coar.spa.fl_str_mv |
http://purl.org/coar/resource_type/c_6501 |
dc.type.coarversion.spa.fl_str_mv |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
dc.type.content.spa.fl_str_mv |
Text |
dc.type.redcol.spa.fl_str_mv |
http://purl.org/redcol/resource_type/ART |
format |
http://purl.org/coar/resource_type/c_6501 |
status_str |
publishedVersion |
dc.identifier.uri.none.fl_str_mv |
https://repositorio.unal.edu.co/handle/unal/28590 |
dc.identifier.eprints.spa.fl_str_mv |
http://bdigital.unal.edu.co/18638/ |
url |
https://repositorio.unal.edu.co/handle/unal/28590 http://bdigital.unal.edu.co/18638/ |
dc.language.iso.spa.fl_str_mv |
spa |
language |
spa |
dc.relation.spa.fl_str_mv |
http://revistas.unal.edu.co/index.php/avances/article/view/14512 |
dc.relation.ispartof.spa.fl_str_mv |
Universidad Nacional de Colombia Revistas electrónicas UN Avances en Sistemas e Informática Avances en Sistemas e Informática |
dc.relation.ispartofseries.none.fl_str_mv |
Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663 |
dc.relation.references.spa.fl_str_mv |
Alvarez, Mauricio and Ramirez, Alex and Valero, Mateo and Azevedo, Arnaldo and Meenderinck, Cor and Juurlink, Ben (2009) Performance evaluation of macroblock-level parallelization of h.264 decoding on a cc-numa multiprocessor architecture. Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 Avances en Sistemas e Informática; Vol. 6, núm. 1 (2009); 219-228 1909-0056 1657-7663 . |
dc.rights.spa.fl_str_mv |
Derechos reservados - Universidad Nacional de Colombia |
dc.rights.coar.fl_str_mv |
http://purl.org/coar/access_right/c_abf2 |
dc.rights.license.spa.fl_str_mv |
Atribución-NoComercial 4.0 Internacional |
dc.rights.uri.spa.fl_str_mv |
http://creativecommons.org/licenses/by-nc/4.0/ |
dc.rights.accessrights.spa.fl_str_mv |
info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Atribución-NoComercial 4.0 Internacional Derechos reservados - Universidad Nacional de Colombia http://creativecommons.org/licenses/by-nc/4.0/ http://purl.org/coar/access_right/c_abf2 |
eu_rights_str_mv |
openAccess |
dc.format.mimetype.spa.fl_str_mv |
application/pdf |
dc.publisher.spa.fl_str_mv |
Universidad Nacional de Colombia -Sede Medellín |
institution |
Universidad Nacional de Colombia |
bitstream.url.fl_str_mv |
https://repositorio.unal.edu.co/bitstream/unal/28590/1/14512-43460-1-PB.pdf https://repositorio.unal.edu.co/bitstream/unal/28590/2/14512-43460-1-PB.pdf.jpg |
bitstream.checksum.fl_str_mv |
d7d12d185be2ee240bb22270d2d20c7c 33eb878cdaf7d7e7f073b18b41ea8077 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 |
repository.name.fl_str_mv |
Repositorio Institucional Universidad Nacional de Colombia |
repository.mail.fl_str_mv |
repositorio_nal@unal.edu.co |
_version_ |
1814090041161416704 |