Institutional Repository
| accelerating linpack performance with mixed precision algorithm on cpu+gpgpu heterogeneous cluster | |
| Wang Lei; Zhang Yunquan; Zhang Xianyi; Liu Fangfang | |
| 2010 | |
| Conference Name | 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010 |
| Source | Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010 |
| Pages | 1169-1174 |
| Conference Date | 37436 |
| Conference Place | Bradford, United kingdom |
| Indexed Type | EI |
| Publish Place | United States |
| ISBN | 9780770000000 |
| Department | (1) Lab of Parallel Computing, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; (2) State Key Lab of Computing Science, Chinese Academy of Sciences, Beijing 100190, China; (3) Chinese Academy of Sciences, Graduate University, Beijing 100190, China |
| English Abstract | In this paper, the mixed precision algorithm to solve the linear system of equations and the implementation of HPL package are introduced. We use this mixed precision algorithm to improve HPL package on CPU+GPGPU heterogeneous clusters, which is named for GHPL, and give the implementation mechanisms in detail. The experimental results are measured on the platforms of multi-core CPUs and CPU+GPGPU heterogeneous clusters. From the experimental results, we can find out that our GHPL program has good scalability on all the experimental environments and can sustain more than 1.7Teraflops both on the cluster with 16 nodes containing 32 NVIDIA Tesla C1060 GPUs and on the cluster with 8 nodes containing 32 NVIDIA GeForce GTX 295 GPUs, while the average speedup of it with respect to HPL is 3.06 and 2.40 respectively. © 2010 IEEE. |
| Keyword | Embedded Software Embedded Systems Information Technology Linear Systems Program Processors |
| Sponsorship | University of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS) |
| Content Type | 会议论文 |
| URI | http://ir.iscas.ac.cn/handle/311060/8642 |
| Collection | 并行软件与计算科学实验室 |
| Recommended Citation GB/T 7714 | Wang Lei,Zhang Yunquan,Zhang Xianyi,et al. accelerating linpack performance with mixed precision algorithm on cpu+gpgpu heterogeneous cluster[C]. United States,2010:1169-1174. |
| Files in This Item: | ||||||
| File Name/Size | DocType | Version | Access | License | ||
| 05577898.pdf(289KB) | 开放获取 | -- | Application Full Text | |||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment