ISCAS OpenIR
Hippo: An enhancement of pipeline-aware in-memory caching for HDFS
Wei, Lan (1); Lian, Wenbo (1); Liu, Kuien (1); Wang, Yongji (1); Wei, Lan
2014
Conference Name2014 23rd International Conference on Computer Communication and Networks, ICCCN 2014
Conference DateAugust 4, 2014 - August 7, 2014
Conference PlaceShanghai, China
Indexed TypeEI
Publish PlaceInstitute of Electrical and Electronics Engineers Inc.
ISSN10952055
ISBN9781479935727
Department(1) Institute of Software, Chinese Academy of Sciences, China
English AbstractIn the age of big data, distributed computing frameworks tend to coexist and collaborate in pipeline using one scheduler. While a variety of techniques for reducing I/O latency have been proposed, these are rarely specific for the whole pipeline performance. This paper proposes memory management logic called 'Hippo' which targets distributed systems and in particular 'pipelined' applications that might span differing big data frameworks. Though individual frameworks may have internal memory management primitives, Hippo proposes to make a generic framework that works agnostic of these highlevel operations. To increase the hit ratio of in-memory cache, this paper discusses the granularity of caching and how Hippo leverages the job dependency graph to make memory retention and pre-fetching decisions. Our evaluations demonstrate that job dependency is essential to improve the cache performance and a global cache policy maker, in most cases, significantly outperforms explicit caching by users.; In the age of big data, distributed computing frameworks tend to coexist and collaborate in pipeline using one scheduler. While a variety of techniques for reducing I/O latency have been proposed, these are rarely specific for the whole pipeline performance. This paper proposes memory management logic called 'Hippo' which targets distributed systems and in particular 'pipelined' applications that might span differing big data frameworks. Though individual frameworks may have internal memory management primitives, Hippo proposes to make a generic framework that works agnostic of these highlevel operations. To increase the hit ratio of in-memory cache, this paper discusses the granularity of caching and how Hippo leverages the job dependency graph to make memory retention and pre-fetching decisions. Our evaluations demonstrate that job dependency is essential to improve the cache performance and a global cache policy maker, in most cases, significantly outperforms explicit caching by users.
Language英语
Content Type会议论文
URIhttp://ir.iscas.ac.cn/handle/311060/16610
Collection中国科学院软件研究所
Corresponding AuthorWei, Lan
Recommended Citation
GB/T 7714
Wei, Lan ,Lian, Wenbo ,Liu, Kuien ,et al. Hippo: An enhancement of pipeline-aware in-memory caching for HDFS[C]. Institute of Electrical and Electronics Engineers Inc.,2014.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wei, Lan (1)]'s Articles
[Lian, Wenbo (1)]'s Articles
[Liu, Kuien (1)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wei, Lan (1)]'s Articles
[Lian, Wenbo (1)]'s Articles
[Liu, Kuien (1)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wei, Lan (1)]'s Articles
[Lian, Wenbo (1)]'s Articles
[Liu, Kuien (1)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.