Institutional Repository
| A Modified Algorithm for Missing Values in Data Stream Decision Tree Classification | |
| Hou, Xu-shan; Lv, Pin; Wang, Hao | |
| 2014 | |
| Conference Name | International Conference on Artificial Intelligence and Software Engineering (AISE) |
| Pages | 307-313 |
| Conference Date | JAN 11-12, 2014 |
| Conference Place | Phuket, THAILAND |
| Indexed Type | CPCI |
| Publish Place | DESTECH PUBLICATIONS, INC |
| ISBN | 978-1-60595-150-8 |
| Department | [Hou, Xu-shan; Lv, Pin; Wang, Hao] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100190, Peoples R China. |
| English Abstract | Missing values in data stream will affect the accuracy of classification. It is significant to research how to handle missing values in data stream decision tree classification. Considering the time performance of an existing algorithm is not good enough, a more efficient algorithm is proposed in this paper. The modified algorithm makes improvements in two aspects: on the one hand, according to the standard deviation of attribute value, we select different process methods for missing values, which could reduce the time complexity; on the other hand, we optimize the update mechanism, which could reduce the update time. The experiment results show that the run-time of our algorithm is reduced by 20%-70%, while the accuracy is the same as the existing algorithm.; Missing values in data stream will affect the accuracy of classification. It is significant to research how to handle missing values in data stream decision tree classification. Considering the time performance of an existing algorithm is not good enough, a more efficient algorithm is proposed in this paper. The modified algorithm makes improvements in two aspects: on the one hand, according to the standard deviation of attribute value, we select different process methods for missing values, which could reduce the time complexity; on the other hand, we optimize the update mechanism, which could reduce the update time. The experiment results show that the run-time of our algorithm is reduced by 20%-70%, while the accuracy is the same as the existing algorithm. |
| Keyword | Data Stream Classification Decision Tree Missing Values |
| Language | 英语 |
| Content Type | 会议论文 |
| URI | http://ir.iscas.ac.cn/handle/311060/16520 |
| Collection | 中国科学院软件研究所 |
| Recommended Citation GB/T 7714 | Hou, Xu-shan,Lv, Pin,Wang, Hao. A Modified Algorithm for Missing Values in Data Stream Decision Tree Classification[C]. DESTECH PUBLICATIONS, INC,2014:307-313. |
| Files in This Item: | There are no files associated with this item. | |||||
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment