Stepwise large genome assembly approach: a case of Siberian larch (Larix sibirica Ledeb).
Kuzmin Dmitry A, Feranchuk Sergey I, Sharov Vadim V, Cybin Alexander N, Makolov Stepan V, Putintseva Yuliya A, Oreshkova Natalya V, Krutovsky Konstantin V
De novo assembling of large genomes, such as in conifers (~ 12-30 Gbp), which also consist of ~ 80% of repetitive DNA, is a very complex and computationally intense endeavor. One of the main problems in assembling such genomes lays in computing limitations of nucleotide sequence assembly programs (DNA assemblers). As a rule, modern assemblers are usually designed to assemble genomes with a length not exceeding the length of the human genome (3.24 Gbp). Most assemblers cannot handle the amount of input sequence data required to provide sufficient coverage needed for a high-quality assembly.