ETRO VUB
About ETRO  |  News  |  Events  |  Vacancies  |  Contact  
Home Research Education Industry Publications About ETRO

ETRO Publications

Full Details

Conference Publication

Performance and toolchain of a combined GPU/FPGA desktop

Host Publication: The 2013 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA 2013)

Authors: B. da Silva Gomes, A. Braeken, H. D'hollander Erik, A. Touhafi, J. G Cornelis and J. Lemeire

UsePubPlace: New York, NY, USA ©2013

Publisher: ACM

Publication Date: Feb. 2013

Number of Pages: 1

ISBN: 978-1-4503-1887-7


Abstract:

Low-power, high-performance computing nowadays relies on accelerator cards to speed up the calculations. Combining the power of GPUs with the flexibility of FPGAs enlarges the scope of problems that can be accelerated [2, 3]. We describe the performance analysis of a desktop equipped with a GPU Tesla 2050 and an FPGA VirtexLj LX240T. First, the balance between the I/O and the raw peak performance is depicted using the roofline model [4]. Next, the performance of a number of image processing algorithms is measured and the results are mapped onto the roofline graph. This allows to compare the GPU and the FPGA and also to optimize the algorithms for both accelerators. A programming toolchain is implemented, consisting of OpenCL for the GPU and several High-Level Synthesis compilers for the FPGA. Our results show that the HLS compilers outperform handwritten code and offer a performance comparable to the GPU. In addition the FPGA compilers reduce the development time by an order of magnitude, at the expense of an increased resource consumption. The roofline model also shows that both accelerators are equally limited by the input/output bandwidth to the host. A well-tuned accelerator-based codesign, identifying the parallelism, the computation and data patterns of different classes of algorithms, will enable to maximize the performance of the combined GPU/FPGA system [1]. References [1] Asanovic, K. et al. 2009. A view of the parallel computing landscape. Communications of the ACM. 52, 10 (2009), 56ᇗ. [2] Bauer, S. et al. 2010. FPGA-GPU architecture for kernel SVM pedestrian detection. Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on (2010), 61ᇘ. [3] Inta, R. et al. 2012. The Chimera: An Off-The-Shelf CPU/GPGPU/FPGA Hybrid Computing Platform. International Journal of Reconfigurable Computing. January 2012, Article 2, 10 pages (2012). [4] Williams, S. et al. 2009. Roofline: an insightful visual performance model for multicore architectures. Communications of the ACM. 52, 4 (2009), 65ᇠ.

External Link.

Other Reference Styles
Current ETRO Authors

Prof. Bruno da Silva Gomes

+32 (0)02 629 284

bdasilva@etrovub.be

more info

Mr. Abdellah Touhafi

+32 (0)02 629 377

atouhafi@etrovub.be

more info

Prof. Jan Lemeire

+32 (0)02 629 167

jlemeire@etrovub.be

more info

Other Publications

• Journal publications

IRIS • LAMI • AVSP

• Conference publications

IRIS • LAMI • AVSP

• Book publications

IRIS • LAMI • AVSP

• Reports

IRIS • LAMI • AVSP

• Laymen publications

IRIS • LAMI • AVSP

• PhD Theses

Search ETRO Publications

Author:

Keyword:  

Type:








- Contact person

- IRIS

- AVSP

- LAMI

- Contact person

- Thesis proposals

- ETRO Courses

- Contact person

- Spin-offs

- Know How

- Journals

- Conferences

- Books

- Vacancies

- News

- Events

- Press

Contact

ETRO Department

info@etro.vub.ac.be

Tel: +32 2 629 29 30

©2024 • Vrije Universiteit Brussel • ETRO Dept. • Pleinlaan 2 • 1050 Brussels • Tel: +32 2 629 2930 (secretariat) • Fax: +32 2 629 2883 • WebmasterDisclaimer