Tampilan Petugas: RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS

RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS

Main Author:	Pragya Singh Tomar* & Brahma Datta Shukla
Format:	Article Journal
Terbitan:	, 2018
Subjects:	Multimodal Emotion Recognition Classifier Fusion Data Fusion Convolutional Neural Networks
Online Access:	https://zenodo.org/record/5184752

ctrlnum	5184752
fullrecord	<?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><creator>Pragya Singh Tomar* & Brahma Datta Shukla</creator><date>2018-05-31</date><description>This research describes a multimodal emotion identification system that uses auditory and visual inputs to recognize emotions. Mel-Frequency Cepstral Coefficients, Filter Bank Energies, and prosodic characteristics are retrieved from the audio channel. Two techniques are being investigated for the visual element. First, the geometric relationships between face landmarks, such as distances and angles, are calculated. Second, we condense each emotional movie into a smaller collection of key-frames that may be used to visually distinguish between different emotions. To accomplish so, key-frame summary films are fed into a convolutional neural network. Finally, in a late fusion/stacking approach, the confidence outputs of all the classifiers from all the modalities are utilized to build a new feature space to be trained for final emotion label prediction. Experiments on the SAVEE, eNTERFACE&#39;05, and RML databases reveal that our proposed solution performs significantly better than current options, defining the current state-of-the-art in all three databases.</description><identifier>https://zenodo.org/record/5184752</identifier><identifier>10.5281/zenodo.5184752</identifier><identifier>oai:zenodo.org:5184752</identifier><relation>doi:10.5281/zenodo.5184751</relation><rights>info:eu-repo/semantics/openAccess</rights><rights>https://creativecommons.org/licenses/by/4.0/legalcode</rights><source>INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY 7(5) 582-584</source><subject>Multimodal Emotion Recognition, Classifier Fusion, Data Fusion, Convolutional Neural Networks.</subject><title>RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS</title><type>Journal:Article</type><type>Journal:Article</type><recordID>5184752</recordID></dc>
format	Journal:Article Journal Journal:Journal
author	Pragya Singh Tomar* & Brahma Datta Shukla
title	RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS
publishDate	2018
topic	Multimodal Emotion Recognition Classifier Fusion Data Fusion Convolutional Neural Networks
url	https://zenodo.org/record/5184752
contents	This research describes a multimodal emotion identification system that uses auditory and visual inputs to recognize emotions. Mel-Frequency Cepstral Coefficients, Filter Bank Energies, and prosodic characteristics are retrieved from the audio channel. Two techniques are being investigated for the visual element. First, the geometric relationships between face landmarks, such as distances and angles, are calculated. Second, we condense each emotional movie into a smaller collection of key-frames that may be used to visually distinguish between different emotions. To accomplish so, key-frame summary films are fed into a convolutional neural network. Finally, in a late fusion/stacking approach, the confidence outputs of all the classifiers from all the modalities are utilized to build a new feature space to be trained for final emotion label prediction. Experiments on the SAVEE, eNTERFACE'05, and RML databases reveal that our proposed solution performs significantly better than current options, defining the current state-of-the-art in all three databases.
id	IOS16997.5184752
institution	ZAIN Publications
institution_id	7213
institution_type	library:special library
library	Cognizance Journal of Multidisciplinary Studies
library_id	5267
collection	Cognizance Journal of Multidisciplinary Studies
repository_id	16997
subject_area	Multidisciplinary
city	Stockholm
province	INTERNASIONAL
shared_to_ipusnas_str	1
repoId	IOS16997
first_indexed	2022-06-06T02:32:34Z
last_indexed	2022-06-06T02:32:34Z
recordtype	dc
_version_	1734895149478051840
score	17.610363

RECOGNITION OF AUDIO-VISUAL EMOTIONS USING VIDEO CLIPS

Lihat Juga