Institutional Repository, Institute of Software, Chinese Academy of Sciences
Title:
Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster
Authors: Wang Lei ; Zhang Yunquan ; Zhang Xianyi ; Liu Fangfang
Proceedings: Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010
Conference name: 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010
Conference date: 37436
Publication date: 2010
Conference location: Bradford, United Kingdom
Keywords: Embedded software ; Embedded systems ; Information technology ; Linear systems ; Program processors
Place of publication: United States
Indexed by: EI
ISBN: 9780770000000
Affiliations: (1) Lab of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (2) State Key Lab of Computing Science, Chinese Academy of Sciences, Beijing 100190, China; (3) Chinese Academy of Sciences, Graduate University, Beijing 100190, China
Sponsors: University of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS)
Abstract: This paper introduces a mixed precision algorithm for solving linear systems of equations and the implementation of the HPL package. We use this mixed precision algorithm to improve the HPL package on CPU+GPGPU heterogeneous clusters; the resulting program is named GHPL, and its implementation mechanisms are described in detail. Experimental results are measured on multi-core CPU platforms and on CPU+GPGPU heterogeneous clusters. The results show that GHPL scales well in all the experimental environments and sustains more than 1.7 Teraflops both on a 16-node cluster containing 32 NVIDIA Tesla C1060 GPUs and on an 8-node cluster containing 32 NVIDIA GeForce GTX 295 GPUs, with average speedups over HPL of 3.06 and 2.40, respectively. © 2010 IEEE.
Content type: Conference paper
URI: http://ir.iscas.ac.cn/handle/311060/8642
Appears in Collections: Lab of Parallel Computing, Conference Papers

Files in This Item:
File Name/ File Size Content Type Version Access License
05577898.pdf (289KB) -- -- Restricted access -- (contact to request full text)

Recommended Citation:
Wang Lei, Zhang Yunquan, Zhang Xianyi, et al. Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster[C]. In: 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010. Bradford, United Kingdom. 37436.

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 
