Abstract
Segmentation, filtering of measurement errors, and identification of breakpoints are integral parts of any analysis of microarray data for the detection of copy number variation (CNV). Existing algorithms designed for these tasks have had some success in the past, but they tend to be O(N²) in computation time, memory requirement, or both, and the rapid advance of microarray resolution has rendered such algorithms practically useless. Here we propose an algorithm, SAD, that is much faster and much less memory-hungry (O(N) in both computation time and memory requirement) and offers higher accuracy. The two key ingredients of SAD are the fundamental statistical assumption that measurement errors are normally distributed and the mathematical relation that the product of two Gaussian functions is another Gaussian function. We have produced a computer program for analyzing CNV based on SAD. In addition to being fast and small, it offers two important features: quantitative statistics for its predictions and, with only two user-decided parameters, ease of use. Its speed shows little dependence on the genomic profile. Running on an average modern computer, it completes CNV analyses for a 262-thousand-probe array in about 1 second and for a 1.8-million-probe array in 9 seconds.
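The Gaussian-product relation mentioned in the abstract can be illustrated with a minimal sketch. This is not the SAD algorithm itself, whose details the abstract does not give; it only demonstrates the identity that multiplying two Gaussian densities yields a scaled Gaussian whose mean is the precision-weighted average of the two input means. The function names `gaussian_pdf` and `merge_gaussians` are hypothetical and chosen for illustration.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def merge_gaussians(mean1, var1, mean2, var2):
    """Return (mean, var, scale) such that, for every x,
    N(x; mean1, var1) * N(x; mean2, var2) == scale * N(x; mean, var).

    Illustrative only: an assumption about how two Gaussian estimates
    could be combined, not the published SAD procedure.
    """
    total = var1 + var2
    mean = (mean1 * var2 + mean2 * var1) / total  # precision-weighted mean
    var = var1 * var2 / total                     # always smaller than either input
    scale = gaussian_pdf(mean1, mean2, total)     # how well the two inputs agree
    return mean, var, scale


if __name__ == "__main__":
    mean, var, scale = merge_gaussians(0.0, 1.0, 2.0, 1.0)
    x = 0.3
    lhs = gaussian_pdf(x, 0.0, 1.0) * gaussian_pdf(x, 2.0, 1.0)
    rhs = scale * gaussian_pdf(x, mean, var)
    print(mean, var, math.isclose(lhs, rhs))
```

Because the merged result is again described by just a mean and a variance, combining two estimates costs constant time and memory. The scale factor shrinks as the two input means move apart relative to their combined variance, so it gives one possible measure of how compatible two estimates are.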
| Original language | English |
|---|---|
| Pages (from-to) | 58-66 |
| Number of pages | 9 |
| Journal | World Academy of Science, Engineering and Technology |
| Volume | 75 |
| Publication status | Published - March 2011 |
Publication title: "A pairwise-gaussian-merging approach: Towards genome segmentation for copy number analysis"