BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution.
Title | BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution. |
Publication Type | Journal Article |
Authors | Ye C, Hsiao C, Bravo HCorrada |
2014 | |
Journal | Bioinformatics |
Volume | 30 |
Issue | 9 |
Pages | 1214-9 |
algorithms, High-Throughput Nucleotide Sequencing, HUMANS, Probability, Reproducibility of Results, Sequence Analysis, DNA, software, Time factors | |
MOTIVATION: Base-calling of sequencing data produced by high-throughput sequencing platforms is a fundamental process in current bioinformatics analysis. However, existing third-party probabilistic or machine-learning methods that significantly improve the accuracy of base-calls on these platforms are impractical for production use due to their computational inefficiency. RESULTS: We directly formulate base-calling as a blind deconvolution problem and implemented BlindCall as an efficient solver to this inverse problem. BlindCall produced base-calls at accuracy comparable to state-of-the-art probabilistic methods while processing data at rates 10 times faster in most cases. The computational complexity of BlindCall scales linearly with read length making it better suited for new long-read sequencing technologies. | |
10.1093/bioinformatics/btu010 | |
PubMed ID | 24413520 |
PubMed Central ID | PMC3998134 |
Grant List | R01 HG005220 / HG / NHGRI NIH HHS / United States R01HG005220 / HG / NHGRI NIH HHS / United States R01HG006102 / HG / NHGRI NIH HHS / United States |