RUS  ENG JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Model. Anal. Inform. Sist.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Model. Anal. Inform. Sist., 2017, Volume 24, Number 2, Pages 215–226 (Mi mais559)  

De-duplication on the backup system with information storage in a database

S. M. Taranin

P.G. Demidov Yaroslavl State University, 14 Sovetskaya str., Yaroslavl 150003, Russia

Abstract: Prevention of data loss from digital media includes such a process as a backup. It can be done manually by copying data to external media or automated on a schedule by using special software. There are the remote backup systems, when data are saved over the network to the remote repository. Such systems are multi-user and they process large amounts of data. Shared storage can meet files containing the same fragments. The elimination of repeated data is based on the mechanism of de-duplication. It is a method of information compression, when the search of copies is performed in the entire dataset rather than within a single file. The main advantage of using this technology is a significant saving of disk space. However, the mechanism of eliminating repetitive data can significantly reduce the speed of saving and restoring information. This article is devoted to the problem of implementing such a mechanism in the backup system with information storage in a relational database. In this paper we consider an example of implementation of such a system working in two modes: with the de-duplication of data and without it. The article illustrates a class diagram for the development of a client part of application as well as the description of tables and relationships between them in a database that belongs to the backend. The author offers an algorithm of saving data wiht de-duplication, and also gives the results of comparative tests on the speed of the algorithms of saving and restoring information when working with relational database management systems from different manufacturers.

Keywords: file, data, backup, de-duplication, database.

DOI: https://doi.org/10.18255/1818-1015-2017-2-215-226

Full text: PDF file (623 kB)
References: PDF file   HTML file

UDC: 004.056.3
Received: 18.09.2016

Citation: S. M. Taranin, “De-duplication on the backup system with information storage in a database”, Model. Anal. Inform. Sist., 24:2 (2017), 215–226

Citation in format AMSBIB
\Bibitem{Tar17}
\by S.~M.~Taranin
\paper De-duplication on the backup system with information storage in a database
\jour Model. Anal. Inform. Sist.
\yr 2017
\vol 24
\issue 2
\pages 215--226
\mathnet{http://mi.mathnet.ru/mais559}
\crossref{https://doi.org/10.18255/1818-1015-2017-2-215-226}
\elib{http://elibrary.ru/item.asp?id=29064003}


Linking options:
  • http://mi.mathnet.ru/eng/mais559
  • http://mi.mathnet.ru/eng/mais/v24/i2/p215

    SHARE: VKontakte.ru FaceBook Twitter Mail.ru Livejournal Memori.ru


    Citing articles on Google Scholar: Russian citations, English citations
    Related articles on Google Scholar: Russian articles, English articles
  • Моделирование и анализ информационных систем
    Number of views:
    This page:123
    Full text:46
    References:13

     
    Contact us:
     Terms of Use  Registration  Logotypes © Steklov Mathematical Institute RAS, 2020