Application of the Monte Carlo method for searching for the possible reading frameshifts in genes
V. M. Rudenkoab, E. V. Korotkovba
a Bioengineering Center of Russian Academy of Sciences, Moscow, Russia
b NRNU MEPhI, Moscow, Russia
In the article we presented the method for searching for the possible reading frameshifts in genes based on revealing change points of triplet frequencies distribution. The statistical significance was estimated by Monte Carlo method. Correctness of the introduced method was demonstrated by using it to analysis the DNA sequences with artificial indels. The method developed was applied for searching for the change points in DNA sequences from databank KEGG GENES. It was revealed more than 140 thousands genes with change points at the significance level equal to 6 %. We classified sequences containing change points by field description in databank KEGG GENES. It appeared that many of them are pseudogenes or they were annotated earlier as sequences containing frameshifts. In addition to these sequences the change points were detected in many genes coding of PE-PGRS, cation channel family protein, PPE family protein and others. The relationship between change points and reading frameshifts in genes is discussed.
DNA sequence, reading frame, reading frameshift, change point, Monte Carlo method.
PDF file (504 kB)
Received 29.03.2011, Published 16.05.2011
V. M. Rudenko, E. V. Korotkov, “Application of the Monte Carlo method for searching for the possible reading frameshifts in genes”, Mat. Biolog. Bioinform., 6:1 (2011), 79–91
Citation in format AMSBIB
\by V.~M.~Rudenko, E.~V.~Korotkov
\paper Application of the Monte Carlo method for searching for the possible reading frameshifts in genes
\jour Mat. Biolog. Bioinform.
Citing articles on Google Scholar:
Related articles on Google Scholar:
|Number of views:|