Publication detail

Method for improving the consensus sequence from NGS data based on reads classification

NYKRÝNOVÁ, M. ŠKUTKOVÁ, H.

Original Title

Method for improving the consensus sequence from NGS data based on reads classification

English Title

Method for improving the consensus sequence from NGS data based on reads classification

Type

conference paper

Language

en

Original Abstract

Nowadays, next-generation sequencing is on the rise as a cost of sequencing is constantly dropping. However, the processing of the sequence data remains problematic as the algorithms for reference-based assembly are both computationally and time-demanding. Moreover, resulting assembly is still imperfect. In this paper, we present a new method for improving the quality of the consensus sequence from the assembled data. We examine the regions in the consensus sequence, which show a high number of ambiguous nucleotides. Using a classification of reads covering these regions, we try to reduce the number of ambiguous nucleotides in the consensus. This improved consensus sequence then can be used for bacteria typing and precise discrimination of bacteria strains.

English abstract

Nowadays, next-generation sequencing is on the rise as a cost of sequencing is constantly dropping. However, the processing of the sequence data remains problematic as the algorithms for reference-based assembly are both computationally and time-demanding. Moreover, resulting assembly is still imperfect. In this paper, we present a new method for improving the quality of the consensus sequence from the assembled data. We examine the regions in the consensus sequence, which show a high number of ambiguous nucleotides. Using a classification of reads covering these regions, we try to reduce the number of ambiguous nucleotides in the consensus. This improved consensus sequence then can be used for bacteria typing and precise discrimination of bacteria strains.

Keywords

sequencing; ambiguous nucleotides; genome assembly; consensus calling; cluster analysis

Released

09.09.2019

Publisher

Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií

Location

Brno

ISBN

978-80-214-5781-2

Book

Proceedings of IEEE Student Branch Conference Mikulov 2019

Edition number

první

Pages from

6

Pages to

9

Pages count

4

URL

BibTex


@inproceedings{BUT158994,
  author="Markéta {Nykrýnová} and Helena {Škutková}",
  title="Method for improving the consensus sequence from NGS data based on reads classification",
  annote="Nowadays, next-generation sequencing is on the rise as a cost of sequencing is constantly dropping. However, the processing of the sequence data remains problematic as the algorithms for reference-based assembly are both computationally and time-demanding. Moreover, resulting assembly is still imperfect. In this paper, we present a new method for improving the quality of the consensus sequence from the assembled data. We examine the regions in the consensus sequence, which show a high number of ambiguous nucleotides. Using a classification of reads covering these regions, we try to reduce the number of ambiguous nucleotides in the consensus. This improved consensus sequence then can be used for bacteria typing and precise discrimination of bacteria strains.",
  address="Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií",
  booktitle="Proceedings of IEEE Student Branch Conference Mikulov 2019",
  chapter="158994",
  howpublished="online",
  institution="Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií",
  year="2019",
  month="september",
  pages="6--9",
  publisher="Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií",
  type="conference paper"
}