Efficient irregular wavefront propagation algorithms on Intel Xeon Phi.

2015
Abstract
We investigate the execution of the Irregular Wavefront Propagation Pattern (IWPP), a fundamental computing structure used in several image analysis operations, on the Intel Xeon Phi co-processor. An efficient implementation of IWPP on the Xeon Phi is a challenging problem because of IWPP's irregularity and the use of atomic instructions in the original IWPP algorithm to resolve race conditions. On the Xeon Phi, the use of SIMD and vectorization instructions is critical to attain high performance. However, SIMD atomic instructions are not supported. Therefore, we propose a new IWPP algorithm that can take advantage of the supported SIMD instruction set. We also evaluate an alternate storage container (priority queue) to track active elements in the wavefront in an effort to improve the parallel algorithm efficiency. The new IWPP algorithm is evaluated with Morphological Reconstruction and Imfill operations as use cases. Our results show performance improvements of up to 5.63× on top of the original IWPP due to vectorization. Moreover, the new IWPP achieves speedups of 45.7× and 1.62×, respectively, as compared to efficient CPU and GPU implementations.
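To make the pattern concrete, the following is a minimal sequential sketch of queue-driven wavefront propagation applied to grayscale morphological reconstruction, one of the use cases named in the abstract. It is an illustration only, not the authors' vectorized Xeon Phi algorithm. The function name `reconstruct`, the 4-connected neighborhood, the list-of-lists image layout, and the choice to seed the queue with every pixel are all assumptions made for this example.

```python
from collections import deque


def reconstruct(marker, mask):
    """Sequential, queue-based morphological reconstruction (illustrative sketch).

    marker and mask are 2D lists of equal shape. The result is the largest
    image that fits under mask and is reachable by spreading marker values
    through 4-connected neighbors (an assumed neighborhood).
    """
    rows, cols = len(mask), len(mask[0])
    # Work on a copy of the marker, clamped so it never exceeds the mask.
    out = [[min(marker[r][c], mask[r][c]) for c in range(cols)]
           for r in range(rows)]
    # Assumed seeding: start with every pixel in the wavefront queue.
    queue = deque((r, c) for r in range(rows) for c in range(cols))
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                # Value the neighbor would take if the wave reaches it.
                candidate = min(out[r][c], mask[nr][nc])
                if out[nr][nc] < candidate:
                    # In a parallel version several threads could race on
                    # this update, which is where atomics would be needed.
                    out[nr][nc] = candidate
                    queue.append((nr, nc))
    return out


if __name__ == "__main__":
    mask = [[3, 3, 1, 4]]
    marker = [[3, 0, 0, 0]]
    print(reconstruct(marker, mask))
```

The irregularity mentioned in the abstract shows up here as the data-dependent queue: which pixels are revisited, and how often, depends on the image content, so the work cannot be laid out as a fixed, regular loop nest.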
Authors: Gomes, Jeremias M; Teodoro, George; de Melo, Alba; Kong, Jun; Kurc, Tahsin; Saltz, Joel H
Journal: Proceedings, Symposium on Computer Architecture and High Performance Computing
Year: 2015