Avtomat. i Telemekh., 2014, Issue 12, Pages 125–138 (Mi at14166)  

This article is cited in 5 scientific papers

Intellectual Control Systems

An automatic multimodal speech recognition system with audio and video information

A. A. Karpov (a, b)

(a) St. Petersburg Institute of Informatics and Automation, Russian Academy of Sciences, St. Petersburg, Russia
(b) ITMO University, St. Petersburg, Russia

Abstract: The mathematical model and software implementation of an automatic Russian speech recognition system that employs techniques of digital processing and analysis of audiovisual signals from a microphone and a video camera are presented. The description of probabilistic modeling of audiovisual speech based on coupled hidden Markov models, information fusion methods with weight coefficients for audio and video speech modalities, and parametric representation of signals is provided. Quantitative results in multimodal recognition of continuous Russian speech indicate high accuracy and reliability of the automatic system.
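
The abstract mentions information fusion with weight coefficients for the audio and video modalities but does not state the formula. The following is a minimal sketch, assuming the common multistream formulation in which the log-likelihoods that each modality assigns to a hypothesis are combined as a weighted sum with weights that add up to one. The function names, the word labels, and the scores are hypothetical illustrations and are not taken from the paper.

```python
import math


def fuse_log_likelihoods(audio_loglik, video_loglik, audio_weight):
    """Combine per-hypothesis log-likelihoods from two modalities.

    Assumption: the video weight is 1 - audio_weight, so a single
    coefficient controls how much each modality is trusted.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    video_weight = 1.0 - audio_weight
    return {
        label: audio_weight * audio_loglik[label]
        + video_weight * video_loglik[label]
        for label in audio_loglik
    }


def recognize(audio_loglik, video_loglik, audio_weight):
    """Return the hypothesis with the highest fused score."""
    fused = fuse_log_likelihoods(audio_loglik, video_loglik, audio_weight)
    return max(fused, key=fused.get)


# Hypothetical scores for two candidate words: the audio stream
# slightly prefers "net", the video stream strongly prefers "da".
audio = {"da": math.log(0.40), "net": math.log(0.60)}
video = {"da": math.log(0.90), "net": math.log(0.10)}

audio_only = recognize(audio, video, audio_weight=1.0)
balanced = recognize(audio, video, audio_weight=0.5)
```

With the audio weight set to 1.0 the decision follows the audio stream alone. Lowering it lets a confident video stream override an uncertain audio stream, which is the usual motivation for tuning such weights to the acoustic conditions.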

English version:
Automation and Remote Control, 2014, 75:12, 2190–2200

Presented by Editorial Board member A. V. Bernshtein

Received: 28.03.2012

Citation: A. A. Karpov, “An automatic multimodal speech recognition system with audio and video information”, Avtomat. i Telemekh., 2014, no. 12, 125–138; Autom. Remote Control, 75:12 (2014), 2190–2200

Citation in format AMSBIB
\Bibitem{Kar14}
\by A.~A.~Karpov
\paper An automatic multimodal speech recognition system with audio and video information
\jour Avtomat. i Telemekh.
\yr 2014
\issue 12
\pages 125--138
\mathnet{http://mi.mathnet.ru/at14166}
\transl
\jour Autom. Remote Control
\yr 2014
\vol 75
\issue 12
\pages 2190--2200
\crossref{https://doi.org/10.1134/S000511791412008X}
\isi{http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&DestLinkType=FullRecord&DestApp=ALL_WOS&KeyUT=000346402900008}
\scopus{http://www.scopus.com/record/display.url?origin=inward&eid=2-s2.0-84919360128}


Linking options:
  • http://mi.mathnet.ru/eng/at14166
  • http://mi.mathnet.ru/eng/at/y2014/i12/p125


    This publication is cited in the following articles:
    1. A. Karpov, A. Ronzhin, I. Kipyatkova, “Automatic analysis of speech and acoustic events for ambient assisted living”, Universal Access in Human-Computer Interaction: Access To Interaction, Pt II, Lecture Notes in Computer Science, 9176, eds. M. Antona, C. Stephanidis, Springer-Verlag Berlin, 2015, 455–463
    2. I. S. Kipyatkova, A. A. Karpov, “A study of neural network Russian language models for automatic continuous speech recognition systems”, Autom. Remote Control, 78:5 (2017), 858–867
    3. D. Ivanko, A. Karpov, D. Fedotov, I. Kipyatkova, D. Ryumin, D. Ivanko, W. Minker, M. Zelezny, “Multimodal speech recognition: increasing accuracy using high speed video data”, J. Multimodal User Interfaces, 12:4, SI (2018), 319–328
    4. N. Radha, A. Shahina, P. Prabha, P. B. T. Sri, N. A. Khan, “An analysis of the effect of combining standard and alternate sensor signals on recognition of syllabic units for multimodal speech recognition”, Pattern Recognit. Lett., 115, SI (2018), 39–49
    5. M. P. Farkhadov, N. V. Petukhova, S. V. Vaskovskii, M. E. Farkhadova, “Povyshenie effektivnosti rechevogo interfeisa s primeneniem kognitivnykh i lingvisticheskikh znanii”, UBS, 81 (2019), 90–112