BlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution.

TitleBlindCall: ultra-fast base-calling of high-throughput sequencing data by blind deconvolution.
Publication TypeJournal Article
AuthorsYe C, Hsiao C, Bravo HCorrada
2014
JournalBioinformatics
Volume30
Issue9
Pages1214-9
algorithms, High-Throughput Nucleotide Sequencing, HUMANS, Probability, Reproducibility of Results, Sequence Analysis, DNA, software, Time factors

MOTIVATION: Base-calling of sequencing data produced by high-throughput sequencing platforms is a fundamental process in current bioinformatics analysis. However, existing third-party probabilistic or machine-learning methods that significantly improve the accuracy of base-calls on these platforms are impractical for production use due to their computational inefficiency.

RESULTS: We directly formulate base-calling as a blind deconvolution problem and implemented BlindCall as an efficient solver to this inverse problem. BlindCall produced base-calls at accuracy comparable to state-of-the-art probabilistic methods while processing data at rates 10 times faster in most cases. The computational complexity of BlindCall scales linearly with read length making it better suited for new long-read sequencing technologies.

10.1093/bioinformatics/btu010
PubMed ID24413520
PubMed Central IDPMC3998134
Grant ListR01 HG005220 / HG / NHGRI NIH HHS / United States
R01HG005220 / HG / NHGRI NIH HHS / United States
R01HG006102 / HG / NHGRI NIH HHS / United States