Proceedings of the Institute for System Programming of the RAS
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Proceedings of ISP RAS:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Proceedings of the Institute for System Programming of the RAS, 2021, Volume 33, Issue 4, Pages 31–48
DOI: https://doi.org/10.15514/ISPRAS-2021-33(4)-3
(Mi tisp611)
 

Identification of transparent, compressed and encrypted data in network traffic

A. I. Getmanab, M. K. Ikonnikovab

a National Research University Higher School of Economics
b Ivannikov Institute for System Programming of the Russian Academy of Science
Abstract: The article is dedicated to the problem of classifying network traffic into three categories: transparent, compressed and opaque, preferably in real-time. It begins with the description of the areas where this problem needs to be solved, then proceeds to the existing solutions with their methods, advantages and limitations. As most of the current research is done either in the area of separating traffic into transparent and opaque or into compressed and encrypted, the need arises to combine a subset of existing methods to unite these two problems into one. As later the main mathematical ideas and suggestions that lie behind the ideas used in the research done by other scientists are described, the list of the best performing of them is composed to be combined together and used as the features for the random forest classificator, which will divide the provided traffic into three classes. The best performing of these features are used, the optimal tree parameters are chosen and, what's more, the initial three class classifier is divided into two sequential ones to save time needed for classifying in case of transparent packets. Then comes the proposition of the new method to classify the whole network flow as one into one of those three classes, the validity of which is confirmed on several examples of the protocols most specific in this area (SSH, SSL). The article concludes with the directions in which this research is to be continued, mostly optimizing it for real-time classification and obtaining more samples of traffic suitable for experiments and demonstrations.
Keywords: network traffic analysis, network traffic classification, machine learning, encrypted traffic.
Document Type: Article
Language: Russian
Citation: A. I. Getman, M. K. Ikonnikova, “Identification of transparent, compressed and encrypted data in network traffic”, Proceedings of ISP RAS, 33:4 (2021), 31–48
Citation in format AMSBIB
\Bibitem{GetIko21}
\by A.~I.~Getman, M.~K.~Ikonnikova
\paper Identification of transparent, compressed and encrypted data in network traffic
\jour Proceedings of ISP RAS
\yr 2021
\vol 33
\issue 4
\pages 31--48
\mathnet{http://mi.mathnet.ru/tisp611}
\crossref{https://doi.org/10.15514/ISPRAS-2021-33(4)-3}
Linking options:
  • https://www.mathnet.ru/eng/tisp611
  • https://www.mathnet.ru/eng/tisp/v33/i4/p31
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Proceedings of the Institute for System Programming of the RAS
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025