|
This article is cited in 2 scientific papers (total in 2 papers)
Intellectual Control Systems, Data Analysis
Entropy-randomized method for the reconstruction of missing data
Yu. A. Dubnovabc, V. Yu. Polishchukde, Yu. S. Popkovafg, Yu. M. Polishchukh, A. V. Mel'nikovh, E. S. Sokolh a Federal Research Center “Computer Science and Control,” Russian Academy
of Sciences, Moscow, 119333 Russia
b National Research University Higher School of Economics, Moscow, 101000 Russia
c Moscow Institute of Physics and Technology, Dolgoprudnyi, Moscow oblast, 141700 Russia
d Tomsk Polytechnic University, Tomsk, 634050 Russia
e Institute of Monitoring of Climatic and Ecological Systems of the Siberian Branch of the Russian
Academy of Sciences, Tomsk, 634055 Russia
f ORT Braude Academic College of Engineering, University of Haifa, Karmiel, 2161002 Israel
g Trapeznikov Institute of Control Sciences, Russian Academy of Sciences, Moscow, 117997 Russia
h Ugra Research Institute of Information Technologies, Khanty-Mansiisk, 628011 Russia
Abstract:
The article deals with the problem of reconstructing missing data in data collections for machine learning problems. We propose a new randomized method for missing data reconstruction based on the technology of entropy-robust estimation and generation of ensembles of random variables. The method is similar to the use of an auxiliary regression to reconstruct missing values, but unlike the latter, no additional constraints are imposed on the likelihood function of errors in the sample in the case of entropy estimation and small amounts of data are permissible; this becomes extremely relevant in problems where the amount of data for training is limited and the omissions are not systematic. The proposed method is used to reconstruct missing data on the areas of thermokarst lakes in the Arctic zone of the Russian Federation as measured from satellite images.
Keywords:
missing data reconstruction, entropy-based estimation, randomized machine learning, thermokarst lake, Arctic.
Citation:
Yu. A. Dubnov, Yu. A. Dubnov, V. Yu. Polishchuk, Yu. S. Popkov, Yu. M. Polishchuk, A. V. Mel'nikov, E. S. Sokol, “Entropy-randomized method for the reconstruction of missing data”, Avtomat. i Telemekh., 2021, no. 4, 140–160; Autom. Remote Control, 82:4 (2021), 670–686
Linking options:
https://www.mathnet.ru/eng/at15527 https://www.mathnet.ru/eng/at/y2021/i4/p140
|
|