ERROR CORRECTION METHOD FOR SEQUENCING DATA WITH INSERTIONS AND DELETIONS

Title

ERROR CORRECTION METHOD FOR SEQUENCING DATA WITH INSERTIONS AND DELETIONS

Author

A. V. Alexandrov, A. A. Shalyto

Description

Subject of Research. A method was developed for correcting errors in sequencing reads of a haploid organism that contain insertions and deletions. It was tested on two libraries: a synthesized dataset for the bacterium Escherichia coli and a real dataset of reads for Pseudomonas stutzeri. Method. The method uses k-mers only to find reads that are close to each other. For the close reads, a consensus string is built and then used to correct errors in the original reads. Main Results. The algorithm is implemented as a standalone program and has been tested on both real and synthesized data. The method outperforms other known methods; N50, total contig length, and maximal contig length were used as the comparison metrics. Practical Relevance. The method can be used together with existing genome assembly methods that are not suited to reads containing insertion and deletion errors.
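
The two steps named under Method (finding close reads through shared k-mers, then correcting each read against a consensus of its neighbours) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the parameters k, min_shared and max_dist, and the use of a set median (the read with the smallest total edit distance to the others) as a stand-in for the consensus string are all assumptions made for illustration. The sketch also assumes that close reads cover the same region, which real reads generally do not.

```python
from collections import Counter, defaultdict


def kmers(read, k):
    """Return the set of all length-k substrings of a read."""
    return {read[i:i + k] for i in range(len(read) - k + 1)}


def close_reads(reads, k, min_shared):
    """Map each read index to the indices of reads sharing >= min_shared k-mers."""
    index = defaultdict(set)  # k-mer -> indices of reads containing it
    for i, read in enumerate(reads):
        for km in kmers(read, k):
            index[km].add(i)
    shared = defaultdict(Counter)  # read index -> Counter of shared k-mers per other read
    for ids in index.values():
        for i in ids:
            for j in ids:
                if i != j:
                    shared[i][j] += 1
    return {
        i: sorted(j for j, c in shared[i].items() if c >= min_shared)
        for i in range(len(reads))
    }


def edit_distance(a, b):
    """Levenshtein distance: substitutions, insertions and deletions each cost 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]


def set_median(group):
    """Assumed stand-in for a consensus: the read closest in total to all others."""
    return min(group, key=lambda s: sum(edit_distance(s, t) for t in group))


def correct(reads, k=4, min_shared=3, max_dist=2):
    """Replace each read by its group's consensus if the two differ only slightly."""
    neighbours = close_reads(reads, k, min_shared)
    corrected = []
    for i, read in enumerate(reads):
        group = [read] + [reads[j] for j in neighbours[i]]
        consensus = set_median(group)
        if edit_distance(read, consensus) <= max_dist:
            corrected.append(consensus)
        else:
            corrected.append(read)  # too different: leave the read untouched
    return corrected
```

Calling correct on three identical reads plus one copy with a deleted base and one copy with an inserted base returns five identical reads, because the error-free read has the smallest total edit distance within the group. Edit distance is used here because it treats insertions and deletions on the same footing as substitutions.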

Date

2016

Subject

genome assembly, error correction, insertion and deletion errors

Identifier

DOI: 10.17586/2226-1494-2016-16-1-108-114

Source

Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki

Publisher

Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)

Coverage

Optics. Light; Electronic computers. Computer science

Language

EN, RU

Files

https://socictopen.socict.org/files/to_import/pdfs/article 84.pdf

Citation

A. V. Alexandrov, A. A. Shalyto, “ERROR CORRECTION METHOD FOR SEQUENCING DATA WITH INSERTIONS AND DELETIONS,” SOCICT Open, accessed April 17, 2026, https://socictopen.socict.org/items/show/83.
