Institutional Repository
| Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster | |
| Wang Lei; Zhang Yunquan; Zhang Xianyi; Liu Fangfang | |
| 2010 | |
| Conference name | 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010 |
| Proceedings title | Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010 |
| Pages | 1169-1174 |
| Conference date | 37436 |
| Conference location | Bradford, United Kingdom |
| Indexed by | EI |
| Place of publication | United States |
| ISBN | 9780770000000 |
| Affiliation | (1) Lab of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (2) State Key Lab of Computing Science, Chinese Academy of Sciences, Beijing 100190, China; (3) Chinese Academy of Sciences, Graduate University, Beijing 100190, China |
| Abstract | This paper introduces a mixed precision algorithm for solving linear systems of equations and describes the implementation of the HPL package. We use the mixed precision algorithm to improve HPL on CPU+GPGPU heterogeneous clusters, name the resulting program GHPL, and describe its implementation mechanisms in detail. Experimental results are measured on multi-core CPU platforms and on CPU+GPGPU heterogeneous clusters. The results show that GHPL scales well in all experimental environments and sustains more than 1.7 Teraflops both on a 16-node cluster with 32 NVIDIA Tesla C1060 GPUs and on an 8-node cluster with 32 NVIDIA GeForce GTX 295 GPUs, with average speedups over HPL of 3.06 and 2.40, respectively. © 2010 IEEE. |
| Keywords | Embedded Software; Embedded Systems; Information Technology; Linear Systems; Program Processors |
| Sponsors | University of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS) |
| Content type | Conference paper |
| URI | http://ir.iscas.ac.cn/handle/311060/8642 |
| Collection | Laboratory of Parallel Software and Computational Science |
| Recommended citation (GB/T 7714) | Wang Lei, Zhang Yunquan, Zhang Xianyi, et al. Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster[C]. United States, 2010: 1169-1174. |
| Files in this item | ||||||
| File name/size | Document type | Version type | Access type | License | ||
| 05577898.pdf(289KB) | Open access | -- | Request full text | |||
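
The abstract refers to a mixed precision algorithm for solving linear systems. This record does not describe the algorithm itself, so the sketch below is only an illustration of one common form of the idea, iterative refinement: factorize and solve in single precision, then correct the solution using residuals computed in double precision. It is not the GHPL implementation. All function names (`f32`, `lu_factor_f32`, `lu_solve_f32`, `mixed_precision_solve`) and the `tol` and `max_iter` parameters are hypothetical, and single precision is emulated in pure Python by rounding through `struct`.

```python
import struct


def f32(x):
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def lu_factor_f32(a):
    """LU factorization with partial pivoting; every result is rounded to single."""
    n = len(a)
    lu = [[f32(v) for v in row] for row in a]
    perm = list(range(n))  # perm[k] = original index of the row now at position k
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        if lu[p][k] == 0.0:
            raise ValueError("matrix is singular to working precision")
        lu[k], lu[p] = lu[p], lu[k]
        perm[k], perm[p] = perm[p], perm[k]
        for i in range(k + 1, n):
            lu[i][k] = f32(lu[i][k] / lu[k][k])  # multiplier stored in the L part
            for j in range(k + 1, n):
                lu[i][j] = f32(lu[i][j] - f32(lu[i][k] * lu[k][j]))
    return lu, perm


def lu_solve_f32(lu, perm, b):
    """Solve using the single-precision factors (forward, then back substitution)."""
    n = len(lu)
    y = [f32(b[perm[i]]) for i in range(n)]
    for i in range(n):  # L has an implicit unit diagonal
        for j in range(i):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
    for i in reversed(range(n)):
        for j in range(i + 1, n):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
        y[i] = f32(y[i] / lu[i][i])
    return y


def mixed_precision_solve(a, b, tol=1e-12, max_iter=20):
    """Single-precision LU plus double-precision iterative refinement (assumed scheme)."""
    n = len(a)
    lu, perm = lu_factor_f32(a)
    x = lu_solve_f32(lu, perm, b)
    b_norm = max(abs(v) for v in b)
    for _ in range(max_iter):
        # Residual r = b - A x, computed in double precision.
        r = [b[i] - sum(a[i][j] * x[j] for j in range(n)) for i in range(n)]
        if max(abs(v) for v in r) <= tol * b_norm:
            break
        d = lu_solve_f32(lu, perm, r)  # cheap correction reusing the factors
        x = [xi + di for xi, di in zip(x, d)]  # update accumulated in double
    return x


if __name__ == "__main__":
    a = [[4.0, 1.0, 2.0], [1.0, 5.0, 1.0], [2.0, 1.0, 6.0]]
    b = [12.0, 14.0, 22.0]  # chosen so that the exact solution is [1, 2, 3]
    print(mixed_precision_solve(a, b))
```

The expensive factorization runs once in the lower precision, which is where fast single-precision hardware can help. Each refinement step only needs a residual and a pair of triangular solves. The scheme is generally suited to matrices that are not too ill-conditioned for the lower precision.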