Modelirovanie i Analiz Informatsionnykh Sistem
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Model. Anal. Inform. Sist.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Modelirovanie i Analiz Informatsionnykh Sistem, 2024, Volume 31, Number 2, Pages 194–205
DOI: https://doi.org/10.18255/1818-1015-2024-2-194-205
(Mi mais824)
 

This article is cited in 2 scientific papers (total in 2 papers)

Artificial intelligence

Automatic determination of semantic similarity of student answers with the standard one using modern models

N. S. Lagutina, K. V. Lagutina, V. N. Kopnin

P.G. Demidov Yaroslavl State University, Yaroslavl, Russia
Full-text PDF (521 kB) Citations (2)
References:
Abstract: The paper presents the results of a study of modern text models in order to identify, on their basis, the semantic similarity of English-language texts. The task of determining semantic similarity of texts is an important component of many areas of natural language processing: machine translation, information retrieval, question and answer systems, artificial intelligence in education. The authors solved the problem of classifying the proximity of student answers to the teacher's standard answer. The neural network language models BERT and GPT, previously used to determine the semantic similarity of texts, the new neural network model Mamba, as well as stylometric features of the text were chosen for the study. Experiments were carried out with two text corpora: the Text Similarity corpus from open sources and the custom corpus, collected with the help of philologists. The quality of the problem solution was assessed by precision, recall, and F-measure. All neural network language models showed a similar F-measure quality of about 86% for the larger Text Similarity corpus and 50–56% for the custom corpus. A completely new result was the successful application of the Mamba model. However, the most interesting achievement was the use of vectors of stylometric features of the text, which showed 80% F-measure for the custom corpus and the same quality of problem solving as neural network models for another corpus.
Keywords: natural language processing, text similarity, text classification, neural network language models, assessing students' open responses, artificial intelligence in education.
Funding agency Grant number
Yaroslavl State University VIP-016
Yaroslavl State University (project VIP-016).
Received: 20.03.2024
Revised: 11.04.2024
Accepted: 17.04.2024
Document Type: Article
UDC: 004.912
MSC: 68T50
Language: Russian
Citation: N. S. Lagutina, K. V. Lagutina, V. N. Kopnin, “Automatic determination of semantic similarity of student answers with the standard one using modern models”, Model. Anal. Inform. Sist., 31:2 (2024), 194–205
Citation in format AMSBIB
\Bibitem{LagLagKop24}
\by N.~S.~Lagutina, K.~V.~Lagutina, V.~N.~Kopnin
\paper Automatic determination of semantic similarity of student answers with the standard one using modern models
\jour Model. Anal. Inform. Sist.
\yr 2024
\vol 31
\issue 2
\pages 194--205
\mathnet{http://mi.mathnet.ru/mais824}
\crossref{https://doi.org/10.18255/1818-1015-2024-2-194-205}
Linking options:
  • https://www.mathnet.ru/eng/mais824
  • https://www.mathnet.ru/eng/mais/v31/i2/p194
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Моделирование и анализ информационных систем
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025