Intellectual Analisys of Data
Entropy approach to the construction of a measure of word symbolic diverseness and its application to clustering of plant genomes
Yu. G. Smetanina, M. V. Ulyanovbc, A. S. Pestovad
a Federal Research Center "Informatics and Control" of the Russian Academy of Sciences, Moscow
b V. A. Trapeznikov Institute of Control Sciences of Russian Academy of Sciences, Moscow
c Faculty of Computational Mathematics and Cybernetics of Lomonosov Moscow State University, Moscow
d Faculty of Computer Science of Higher School of Economics, Moscow
An approach to the information analysis is considered for the case when the information is presented by words of finite length over a finite alphabet. A method of generating a measure of symbolic diverseness of words based on peak characteristics of a shift entropy function is proposed. The shift entropy function is formally defined using a unit translation operator and the entropy of discrete distributions. A model example is presented together with some results of application of the proposed measure in the clustering of families of plants using the analysis of genome of their representatives.
shift entropy, measure of symbolic diverseness, clustering of plant genomes.
PDF file (659 kB)
Received 05.04.2016, Published 25.05.2016
Yu. G. Smetanin, M. V. Ulyanov, A. S. Pestova, “Entropy approach to the construction of a measure of word symbolic diverseness and its application to clustering of plant genomes”, Mat. Biolog. Bioinform., 11:1 (2016), 114–126
Citation in format AMSBIB
\by Yu.~G.~Smetanin, M.~V.~Ulyanov, A.~S.~Pestova
\paper Entropy approach to the construction of a measure of word symbolic diverseness and its application to clustering of plant genomes
\jour Mat. Biolog. Bioinform.
Citing articles on Google Scholar:
Related articles on Google Scholar:
|Number of views:|