Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster
Wang Lei; Zhang Yunquan; Zhang Xianyi; Liu Fangfang
2010
Conference name: 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010
Proceedings title: Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010
Pages: 1169-1174
Conference date: 37436
Conference location: Bradford, United Kingdom
Indexed by: EI
Place of publication: United States
ISBN: 9780770000000
Affiliation: (1) Lab of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (2) State Key Lab of Computing Science, Chinese Academy of Sciences, Beijing 100190, China; (3) Chinese Academy of Sciences, Graduate University, Beijing 100190, China
Abstract: This paper introduces a mixed precision algorithm for solving linear systems of equations and the implementation of the HPL package. We use the mixed precision algorithm to improve the HPL package on CPU+GPGPU heterogeneous clusters, name the result GHPL, and describe its implementation mechanisms in detail. Experimental results are measured on multi-core CPU platforms and on CPU+GPGPU heterogeneous clusters. They show that GHPL scales well in all experimental environments and sustains more than 1.7 Teraflops both on a 16-node cluster containing 32 NVIDIA Tesla C1060 GPUs and on an 8-node cluster containing 32 NVIDIA GeForce GTX 295 GPUs, with average speedups over HPL of 3.06 and 2.40, respectively. © 2010 IEEE.
Keywords: Embedded Software; Embedded Systems; Information Technology; Linear Systems; Program Processors
Sponsors: University of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS)
Content type: Conference paper
URI: http://ir.iscas.ac.cn/handle/311060/8642
Collection: Laboratory of Parallel Software and Computational Science
Recommended citation
GB/T 7714
Wang Lei, Zhang Yunquan, Zhang Xianyi, et al. Accelerating Linpack performance with mixed precision algorithm on CPU+GPGPU heterogeneous cluster[C]. United States, 2010: 1169-1174.
Files in this item
File name/size: 05577898.pdf (289KB)
Access type: Open access (full text on request)
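The abstract names a mixed precision algorithm for linear systems but this record does not reproduce it. As a rough illustration only, the sketch below shows the classic iterative-refinement scheme that mixed precision solvers commonly use: factor and solve in single precision, then correct the solution using residuals computed in double precision. It is an assumption that this resembles the paper's approach; it is not the GHPL implementation. All names (`f32`, `lu_factor_f32`, `lu_solve_f32`, `mixed_precision_solve`) and the tolerance are hypothetical, and single precision is simulated with `struct` so the example needs only the Python standard library.

```python
import struct


def f32(x):
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def lu_factor_f32(a):
    """LU-factor a copy of `a` with partial pivoting, rounding to single precision."""
    n = len(a)
    lu = [[f32(v) for v in row] for row in a]
    perm = list(range(n))  # perm[i] = original index of the row now at position i
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        if lu[p][k] == 0.0:
            raise ValueError("matrix is singular")
        lu[k], lu[p] = lu[p], lu[k]
        perm[k], perm[p] = perm[p], perm[k]
        for i in range(k + 1, n):
            lu[i][k] = f32(lu[i][k] / lu[k][k])  # multiplier, stored in L
            for j in range(k + 1, n):
                lu[i][j] = f32(lu[i][j] - f32(lu[i][k] * lu[k][j]))
    return lu, perm


def lu_solve_f32(lu, perm, b):
    """Solve L U x = P b in simulated single precision."""
    n = len(lu)
    y = [f32(b[perm[i]]) for i in range(n)]
    for i in range(n):  # forward substitution, L has a unit diagonal
        for j in range(i):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
    for i in reversed(range(n)):  # back substitution with U
        for j in range(i + 1, n):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
        y[i] = f32(y[i] / lu[i][i])
    return y


def residual(a, x, b):
    """Compute r = b - A x in double precision."""
    return [bi - sum(aij * xj for aij, xj in zip(row, x)) for row, bi in zip(a, b)]


def mixed_precision_solve(a, b, tol=1e-12, max_iter=20):
    """Single-precision LU solve followed by double-precision refinement."""
    lu, perm = lu_factor_f32(a)      # the expensive O(n^3) step, in low precision
    x = lu_solve_f32(lu, perm, b)    # initial low-precision solution
    scale = max(abs(v) for v in b)
    for _ in range(max_iter):
        r = residual(a, x, b)        # residual in double precision
        if max(abs(v) for v in r) <= tol * scale:
            break
        d = lu_solve_f32(lu, perm, r)  # correction reuses the single-precision factors
        x = [xi + di for xi, di in zip(x, d)]
    return x
```

The design point this illustrates is that the factorization, which dominates the cost, runs entirely in the faster low precision, while only the cheap residual and update steps use double precision. For a reasonably well-conditioned matrix a few refinement steps are typically enough, for example `mixed_precision_solve([[4.0, 1.0, 2.0], [1.0, 3.0, 0.5], [2.0, 0.5, 5.0]], [1.0, 2.0, 3.0])`.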