|
|
Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2017, Issue 3, Pages 56–69
(Mi itvs274)
|
|
|
|
MACHINE LEARNING
Distributed systems for machine learning
I. E. Trofimov Yandex.Market
Abstract:
Machine learning is a rapidly developing area of research. Many machine learning and data science applications face very large datasets. These datasets are hard to process on a single computer or this processing will be very time consuming. Using only a subsample of a dataset typically leads to the worse quality of model’s predictions. Distributed computational systems are used to solve this problem. The most popular approaches for developing software of such systems include the following computational models: Map/Reduce, Spark, graph computational models and parameter server architecture. Current paper is a review of such systems with analysis of their advantages and disadvantages regarding to machine learning applications. Systems for training artificial neural networks are discussed separately.
Keywords:
machine learning, data science, big data, distributed systems.
Citation:
I. E. Trofimov, “Distributed systems for machine learning”, Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2017, no. 3, 56–69
Linking options:
https://www.mathnet.ru/eng/itvs274 https://www.mathnet.ru/eng/itvs/y2017/i3/p56
|
| Statistics & downloads: |
| Abstract page: | 519 | | Full-text PDF : | 684 | | References: | 2 |
|