ISCAS OpenIR  > 并行软件与计算科学实验室 
accelerating linpack performance with mixed precision algorithm on cpu+gpgpu heterogeneous cluster
Wang Lei; Zhang Yunquan; Zhang Xianyi; Liu Fangfang
2010
Conference Name10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010
SourceProceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010
Pages1169-1174
Conference Date37436
Conference PlaceBradford, United kingdom
Indexed TypeEI
Publish PlaceUnited States
ISBN9780770000000
Department(1) Lab of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (2) State Key Lab of Computing Science, Chinese Academy of Sciences, Beijing 100190, China; (3) Chinese Academy of Sciences, Graduate University, Beijing 100190, China
English AbstractIn this paper, the mixed precision algorithm to solve the linear system of equations and the implementation of HPL package are introduced. We use this mixed precision algorithm to improve HPL package on CPU+GPGPU heterogeneous clusters, which is named for GHPL, and give the implementation mechanisms in detail. The experimental results are measured on the platforms of multi-core CPUs and CPU+GPGPU heterogeneous clusters. From the experimental results, we can find out that our GHPL program has good scalability on all the experimental environments and can sustain more than 1.7Teraflops both on the cluster with 16 nodes containing 32 NVIDIA Tesla C1060 GPUs and on the cluster with 8 nodes containing 32 NVIDIA GeForce GTX 295 GPUs, while the average speedup of it with respect to HPL is 3.06 and 2.40 respectively. © 2010 IEEE.
KeywordEmbedded Software Embedded Systems Information Technology Linear Systems Program Processors
SponsorshipUniversity of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS)
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/8642
Collection并行软件与计算科学实验室 
Recommended Citation
GB/T 7714
Wang Lei,Zhang Yunquan,Zhang Xianyi,et al. accelerating linpack performance with mixed precision algorithm on cpu+gpgpu heterogeneous cluster[C]. United States,2010:1169-1174.
Files in This Item:
File Name/Size DocType Version Access License
05577898.pdf(289KB) 开放获取--Application Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang Lei]'s Articles
[Zhang Yunquan]'s Articles
[Zhang Xianyi]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang Lei]'s Articles
[Zhang Yunquan]'s Articles
[Zhang Xianyi]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang Lei]'s Articles
[Zhang Yunquan]'s Articles
[Zhang Xianyi]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.