NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

2024, Genome Biology, cited by 366, top venue
Algorithms and Data Compression; Advanced Data Storage Technologies; Parallel Computing and Optimization Techniques

Abstract

Long-read sequencing data, particularly those derived from the Oxford Nanopore sequencing platform, tend to exhibit high error rates. Here, we present NextDenovo, an efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. We apply NextDenovo to assemble 35 diverse human genomes from around the world using Nanopore long-read data. These genomes allow us to identify the landscape of segmental duplication and gene copy number variation in modern human populations. The use of NextDenovo should pave the way for population-scale long-read assembly using Nanopore long-read data.