Implementation of parallel tridiagonal solvers for a heterogeneous computing environment

, , , , & (2016) Implementation of parallel tridiagonal solvers for a heterogeneous computing environment. The ANZIAM Journal, 56, C446-C462.

Open access copy at publisher website

Description

Tridiagonal, diagonally dominant linear systems arise in many scientific and engineering applications. The standard Thomas algorithm for solving such systems is inherently serial, forming a bottleneck in computation. Algorithms such as cyclic reduction and SPIKE reduce a single large tridiagonal system to multiple small independent systems that can be solved in parallel. We have developed portable OpenCL implementations of the cyclic reduction and SPIKE algorithms, intended to target a range of co-processors in a heterogeneous computing environment, including Field Programmable Gate Arrays (FPGAs), Graphics Processing Units (GPUs) and other multi-core processors. In this paper, we evaluate these designs in terms of solver performance, resource efficiency and numerical accuracy.
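
To illustrate the contrast drawn in the description, the following is a minimal serial Python sketch. It is not the OpenCL code evaluated in the paper. The function names `thomas_solve` and `cyclic_reduction_solve` and the array layout (`a` sub-diagonal, `b` diagonal, `c` super-diagonal, `d` right-hand side, with `a[0]` and `c[n-1]` unused) are assumptions made for this example. Pivoting is omitted on the assumption that the system is diagonally dominant.

```python
def thomas_solve(a, b, c, d):
    """Thomas algorithm: each step depends on the previous one (serial)."""
    n = len(b)
    cp = [0.0] * n  # modified super-diagonal
    dp = [0.0] * n  # modified right-hand side
    cp[0] = c[0] / b[0]
    dp[0] = d[0] / b[0]
    for i in range(1, n):  # forward sweep: row i needs row i - 1
        denom = b[i] - a[i] * cp[i - 1]
        cp[i] = c[i] / denom if i < n - 1 else 0.0
        dp[i] = (d[i] - a[i] * dp[i - 1]) / denom
    x = [0.0] * n
    x[n - 1] = dp[n - 1]
    for i in range(n - 2, -1, -1):  # back substitution: x[i] needs x[i + 1]
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x


def cyclic_reduction_solve(a, b, c, d):
    """Recursive cyclic reduction (written serially here).

    Within one level, every iteration of the two loops below is independent
    of the others, which is what a parallel implementation can exploit.
    """
    n = len(b)
    if n == 1:
        return [d[0] / b[0]]
    # Treat the unused boundary entries as zero.
    a = [0.0] + list(a[1:])
    c = list(c[:-1]) + [0.0]
    ra, rb, rc, rd = [], [], [], []
    for i in range(1, n, 2):  # keep odd rows, eliminate even-indexed unknowns
        alpha = a[i] / b[i - 1]
        has_next = i + 1 < n
        gamma = c[i] / b[i + 1] if has_next else 0.0
        ra.append(-alpha * a[i - 1])
        rb.append(b[i] - alpha * c[i - 1] - (gamma * a[i + 1] if has_next else 0.0))
        rc.append(-gamma * c[i + 1] if has_next else 0.0)
        rd.append(d[i] - alpha * d[i - 1] - (gamma * d[i + 1] if has_next else 0.0))
    x_odd = cyclic_reduction_solve(ra, rb, rc, rd)  # half-size tridiagonal system
    x = [0.0] * n
    for k, i in enumerate(range(1, n, 2)):
        x[i] = x_odd[k]
    for i in range(0, n, 2):  # recover even-indexed unknowns from their neighbours
        left = a[i] * x[i - 1] if i > 0 else 0.0
        right = c[i] * x[i + 1] if i + 1 < n else 0.0
        x[i] = (d[i] - left - right) / b[i]
    return x
```

Both functions should return the same solution up to rounding error. The difference is structural: the Thomas sweeps carry a dependency from one row to the next, whereas each level of cyclic reduction consists of independent row updates.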

ID Code: 91425
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Warne, David: orcid.org/0000-0002-9225-175X
Kelson, Neil: orcid.org/0000-0002-6077-7538
Banks, Jasmine: orcid.org/0000-0003-1507-9682
Farrell, Troy: orcid.org/0000-0002-6629-4174
Measurements or Duration: 17 pages
Keywords: Field-Programmable Gate Array (FPGA), Graphics Processing Unit (GPU), OpenCL, SPIKE, TDMA, Tri-Diagonal Linear System, cyclic reduction
DOI: 10.21914/anziamj.v56i0.9371
ISSN: 1446-8735
Pure ID: 33017374
Divisions: Past > Institutes > Institute for Future Environments
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Past > QUT Faculties & Divisions > Division of Technology, Information and Library Services
Current > Research Centres > High Performance Computing and Research Support
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons Licence (or other specified licence), then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright, please provide details by email to qut.copyright@qut.edu.au
Deposited On: 21 Dec 2015 02:01
Last Modified: 01 Mar 2024 16:40