ISCAS OpenIR  > 2009年期刊/会议论文
finding optimal threshold for correction error reads in dna assembling
Chin Francis Y. L.; Leung Henry C. M.; Li Wei-Lin; Yiu Siu-Ming
2009
会议名称9th Asia Pacific Bioinformatics Conference
页码-
会议日期JAN 13-16,
会议地点Beijing, PEOPLES R CHINA
收录类别sci,istp
出版地CURRENT SCIENCE GROUP, MIDDLESEX HOUSE, 34-42 CLEVELAND ST, LONDON W1T 4LB, ENGLAND
出版者BMC BIOINFORMATICS
ISSN1471-2105
部门归属Chin, Francis Y. L.; Leung, Henry C. M.; Yiu, Siu-Ming Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China. Li, Wei-Lin Chinese Acad Sci, Inst software, State Key Lab Comp Sci, Beijing 100190, Peoples R China.
摘要Background: DNA assembling is the problem of determining the nucleotide sequence of a genome from its substrings, called reads. In the experiments, there may be some errors on the reads which affect the performance of the DNA assembly algorithms. Existing algorithms, e. g. ECINDEL and SRCorr, correct the error reads by considering the number of times each length-k substring of the reads appear in the input. They treat those length-k substrings appear at least M times as correct substring and correct the error reads based on these substrings. However, since the threshold M is chosen without any solid theoretical analysis, these algorithms cannot guarantee their performances on error correction. Results: In this paper, we propose a method to calculate the probabilities of false positive and false negative when determining whether a length-k substring is correct using threshold M. Based on this optimal threshold M that minimizes the total errors ( false positives and false negatives). Experimental results on both real data and simulated data showed that our calculation is correct and we can reduce the total error substrings by 77.6% and 65.1% when compared to ECINDEL and SRCorr respectively. Conclusion: We introduced a method to calculate the probability of false positives and false negatives of the length-k substring using different thresholds. Based on this calculation, we found the optimal threshold to minimize the total error of false positive plus false negative.
语种英语
WOS记录号WOS:000265601900015
引用统计
内容类型会议论文
URI标识http://ir.iscas.ac.cn/handle/311060/8192
专题2009年期刊/会议论文
推荐引用方式
GB/T 7714
Chin Francis Y. L.,Leung Henry C. M.,Li Wei-Lin,et al. finding optimal threshold for correction error reads in dna assembling[C]. CURRENT SCIENCE GROUP, MIDDLESEX HOUSE, 34-42 CLEVELAND ST, LONDON W1T 4LB, ENGLAND:BMC BIOINFORMATICS,2009:-.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Chin Francis Y. L.]的文章
[Leung Henry C. M.]的文章
[Li Wei-Lin]的文章
百度学术
百度学术中相似的文章
[Chin Francis Y. L.]的文章
[Leung Henry C. M.]的文章
[Li Wei-Lin]的文章
必应学术
必应学术中相似的文章
[Chin Francis Y. L.]的文章
[Leung Henry C. M.]的文章
[Li Wei-Lin]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。