Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data

Título

Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data

Autor

Kosai Al-Nakeeb, Thomas Nordahl Petersen, Thomas Sicheritz-Pontén

Descripción

Abstract Background Whole-genome sequencing (WGS) projects provide short read nucleotide sequences from nuclear and possibly organelle DNA depending on the source of origin. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference or partial seed sequences for assembling. Results Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences in the range from 98.5 to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers. Conclusion Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at: https://bitbucket.org/kosaidtu/norgal .

Fecha

2017

Materia

mitochondrial DNA, k-mer, Next-generation sequencing, de novo assembly

Identificador

DOI: 10.1186/s12859-017-1927-y

Fuente

BMC Bioinformatics

Editor

BMC

Cobertura

Biology (General), Computer applications to medicine. Medical informatics

Idioma

EN

Archivos

https://socictopen.socict.org/files/to_import/pdfs/article 915.pdf

Colección

Citación

Kosai Al-Nakeeb, Thomas Nordahl Petersen, Thomas Sicheritz-Pontén, “Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data,” SOCICT Open, consulta 21 de abril de 2026, https://socictopen.socict.org/items/show/880.

Formatos de Salida

Position: 6854 (30 views)