Национальный цифровой ресурс Руконт - межотраслевая электронная библиотека (ЭБС) на базе технологии Контекстум (всего произведений: 636193)
Контекстум
Руконтекст антиплагиат система
0   0
Первый авторSidorov Maxim
АвторыAlexander Schmitt
Страниц9
ID453674
АннотацияThe ability of artificial systems to recognize paralinguistic signals, such as emotions, depression, or openness, is useful in various applications. However, the performance of such recognizers is not yet perfect. In this study we consider several directions which can significantly improve the performance of such systems. Firstly, we propose building speaker- or gender-specific emotion models. Thus, an emotion recognition (ER) procedure is followed by a gender- or speaker-identifier. Speaker- or gender-specific information is used either for including into the feature vector directly, or for creating separate emotion recognition models for each gender or speaker. Secondly, a feature selection procedure is an important part of any classification problem; therefore, we proposed using a feature selection technique, based on a genetic algorithm or an information gain approach. Both methods result in higher performance than baseline methods without any feature selection algorithms. Finally, we suggest analysing not only audio signals, but also combined audio-visual cues. The early fusion method (or feature-based fusion) has been used in our investigations to combine different modalities into a multimodal approach. The results obtained show that the multimodal approach outperforms single modalities on the considered corpora. The suggested methods have been evaluated on a number of emotional databases of three languages (English, German and Japanese), in both acted and non-acted settings. The results of numerical experiments are also shown in the study.
УДК519.87
Sidorov, M. Automated Recognition of Paralinguistic Signals in Spoken Dialogue Systems: Ways of Improvement / M. Sidorov, Schmitt Alexander // Журнал Сибирского федерального университета. Математика и физика. Journal of Siberian Federal University, Mathematics & Physics .— 2015 .— №2 .— С. 86-94 .— URL: https://rucont.ru/efd/453674 (дата обращения: 18.05.2024)

Предпросмотр (выдержки из произведения)

Mathematics & Physics 2015, 8(2), 208–216 УДК 519.87 Automated Recognition of Paralinguistic Signals in Spoken Dialogue Systems: Ways of Improvement Maxim Sidorov∗ Alexander Schmitt† Institute of Communications Engineering Ulm University Albert Einstein-Allee, 43, Ulm, 89081 Germany Eugene S. Semenkin‡ Institute of Computer Science and Telecommunications Siberian State Aerospace University Krasnoyarskiy Rabochiy, 31, Krasnoyarsk, 660014 Russia Received 19.01.2015, received in revised form 25.02.2015, accepted 20.03.2015 The ability of artificial systems to recognize paralinguistic signals, such as emotions, depression, or openness, is useful in various applications. <...> However, the performance of such recognizers is not yet perfect. <...> In this study we consider several directions which can significantly improve the performance of such systems. <...> Firstly, we propose building speaker- or gender-specific emotion models. <...> Thus, an emotion recognition (ER) procedure is followed by a gender- or speaker-identifier. <...> Speaker- or gender-specific information is used either for including into the feature vector directly, or for creating separate emotion recognition models for each gender or speaker. <...> Secondly, a feature selection procedure is an important part of any classification problem; therefore, we proposed using a feature selection technique, based on a genetic algorithm or an information gain approach. <...> Both methods result in higher performance than baseline methods without any feature selection algorithms. <...> Finally, we suggest analysing not only audio signals, but also combined audio-visual cues. <...> The early fusion method (or feature-based fusion) has been used in our investigations to combine different modalities into a multimodal approach. <...> The results obtained show that the multimodal approach outperforms single modalities on the considered corpora. <...> The results of numerical experiments are also shown in the study. <...> Thus, emotion-related information is used for the assessing of the user’s satisfaction while using a Spoken Dialogue System (SDS) or for the automated monitoring of call-centres. <...> A negative emotion-based signal can be used for changing the dialogue strategy. <...> All rights reserved c – 208 – Maxim Sidorov, Alexander Schmitt, Eugene Semenkin Automated Recognition of Paralinguistic Signals . order to support their decisions and to avoid critical mistakes in practice. <...> Nevertheless <...>

Облако ключевых слов *


* - вычисляется автоматически
Антиплагиат система на базе ИИ