Benchmarking of computational error-correction methods for next-generation sequencing data

Abstract

Recent advancements in next-generation sequencing have rapidly improved our ability to study genomic material at an unprecedented scale. Despite substantial improvements in sequencing technologies, errors present in the data still risk confounding downstream analysis and limiting the applicability of sequencing technologies in clinical tools. Computational error correction promises to eliminate sequencing errors, but the relative accuracy of error correction algorithms remains unknown. In this paper, we evaluate the ability of error correction algorithms to fix errors across different types of datasets that contain various levels of heterogeneity.

Publication
Genome Biology
Sei Chang
Sei Chang
PhD Candidate

Machine learning and computational genomics researcher at Columbia University and New York Genome Center.