Institutional Repository of the Institute of Software, Chinese Academy of Sciences (ISCAS OpenIR)
Subject: Computer Science
Title: A Locality-Based Performance Model for Load-and-Compute Style Computation
Author: Yuan Liang ; Zhang Yunquan
Source: Proceedings - 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012
Conference Name: IEEE International Conference on Cluster Computing
Conference Date: SEP 24-28, 2012
Issued Date: 2012
Conference Place: Beijing, People's Republic of China
Keyword: locality function ; cache partition ; private cache ; shared cache
Indexed Type: ISTP ; EI
ISSN: 1552-5244
Department: Yuan Liang; Zhang Yunquan: Laboratory of Parallel Software and Computational Science, Institute of Software, Chinese Academy of Sciences, Beijing 100864, People's Republic of China.
Sponsorship: IEEE, IEEE Comp Soc, IEEE Tech Comm Scalable Comp (TCSC), Sugon, Intel, Inspur, VMware, Mellanox, PARATERA, BLSC, LoongStore, Nvidia
Abstract: The growing speed gap between processor and memory is often the critical bottleneck in achieving high performance. Hardware caches, programming models, algorithms, and data structures have been proposed to exploit locality and reduce memory overhead. Some of these designs share a common load-and-compute style, in which the algorithm first moves all needed data into the cache and then operates only on the data that is ready. In this paper, we introduce a locality function to model the reuse ability of an algorithm and propose a corresponding performance model. We then theoretically analyze how to use and design caches under our model: (1) We present theorems that give the optimal cache partition scheme for the software buffering technique, which aims to hide memory overhead. (2) We provide methods to determine the optimal multicore design that maximally leverages the benefits of both shared and private caches. (3) We incorporate memory overhead into Amdahl's Law to study the speedup limit imposed by memory bandwidth.
Language: English
Content Type: Conference paper
URI: http://ir.iscas.ac.cn/handle/311060/15803
Appears in Collections: Institute of Software Library_Conference Papers

Files in This Item:

There are no files associated with this item.


Recommended Citation:
Yuan Liang, Zhang Yunquan. A locality-based performance model for load-and-compute style computation[C]. In: IEEE International Conference on Cluster Computing. Beijing, People's Republic of China. Sep 24-28, 2012.
Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 
