Institutional Repository
| a fast and high throughput sql query system for big data | |
| Zhu Feng; Liu Jie; Xu Lijie | |
| 2012 | |
| Conference Name | 13th International Conference on Web Information Systems Engineering, WISE 2012 |
| Source | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
| Pages | 783-788 |
| Conference Date | November 28, 2012 - November 30, 2012 |
| Conference Place | Paphos, Cyprus |
| Indexed Type | EI |
| ISSN | 0302-9743 |
| ISBN | 9783642350627 |
| Department | (1) Technology Center of Software Engineering Institute of Software Chinese Academy of Sciences Beijing 100190 China |
| English Abstract | Relational data query always plays an important role in data analysis. But how to scale out the traditional SQL query system is a challenging problem. In this paper, we introduce a fast, high throughput and scalable system to perform read-only SQL well with the advantage of NoSQL's distributed architecture. We adopt HBase as the storage layer and design a distributed query engine (DQE) collaborating with it to perform SQL queries. Our system also contains distinctive index and cache mechanisms to accelerate query processing. Finally, we evaluate our system with real-world big data crawled from Sina Weibo and it achieves good performance under nineteen representative SQL queries. © 2012 Springer-Verlag.; Relational data query always plays an important role in data analysis. But how to scale out the traditional SQL query system is a challenging problem. In this paper, we introduce a fast, high throughput and scalable system to perform read-only SQL well with the advantage of NoSQL's distributed architecture. We adopt HBase as the storage layer and design a distributed query engine (DQE) collaborating with it to perform SQL queries. Our system also contains distinctive index and cache mechanisms to accelerate query processing. Finally, we evaluate our system with real-world big data crawled from Sina Weibo and it achieves good performance under nineteen representative SQL queries. © 2012 Springer-Verlag. |
| Keyword | Digital Storage Query Languages Query Processing Systems Engineering Throughput World Wide Web |
| Language | 英语 |
| Content Type | 会议论文 |
| URI | http://ir.iscas.ac.cn/handle/311060/15889 |
| Collection | 中国科学院软件研究所 |
| Recommended Citation GB/T 7714 | Zhu Feng,Liu Jie,Xu Lijie. a fast and high throughput sql query system for big data[C],2012:783-788. |
| Files in This Item: | There are no files associated with this item. | |||||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment