Company News

Audio-Visual Emotion Recognition using Deep Transfer Learning and Multiple Temporal Models

|

Yan Xu*, Yu Cheng*, Jian Zhao, Zhecan Wang, Lin Xiong, Jayashree Karlekar, Hajime Tamura, Tomoyuki Kagaya, Shengmei Shen, Sugiri Pranata, Jiashi Feng, Junliang Xing
Workshop on MS-Celeb-1M Challenge with ICCV 2017, 2017.10

In this paper, we introduce our solution to the Challenge-1 of the MS-Celeb-lM challenges which aims to recognize one million celebrities. To solve this large scale face recognition problem, a Multi-Cognition Softmax Model (MCSM) is proposed to distribute training data to several cognition units by a data shuffling strategy. Here we introduce one cognition unit as a group of independent softmax models, which is designed to increase the diversity of the one softmax model to boost the performance for models ensemble. Meanwhile, a template-based Feature Retrieval (FR) module is adopted to improve the performance of MCSM by a specific voting scheme. Moreover, a one-shot learning method is applied on collected extra 600K identities due to each identity has one image only. Finally, testing images with lower score from MCSM and FR are assigned new labels with higher score by merging one-shot learning results. Extensive experiments on the MS-Celeb-1M testing set demonstrate the superiority of the proposed method. Our solution ranks the first place in both two settings of the final evaluation and outperforms other teams by a large margin.

Link: https://ieeexplore.ieee.org/document/8265434

Share

Related Posts

Panasonic Opens Innovation Hub at Punggol Digital District to Drive AI Smart Building and Robotics Solutions

Singapore, 27 August 2025 – Panasonic R&D Center Singapore (PRDCSG) today announced the opening of its new “Innovation Hub” at…

Read more

Panasonic R&D Center Singapore Won a Silver Medal in a Kaggle Competition

Panasonic R&D Center Singapore won a silver medal in a Kaggle competition titled “Petfinder.my – Pawpularity Contest”, which started on…

Read more

Joint collaborations with I2R: achieved the No. 2 position at MEC 2017 plus a paper accepted at ICMI 2017

In collaboration with the Institute for Infocomm Research (I2R), Panasonic R&D Center Singapore achieved the No. 2 position in the audio-visual emotion recognition sub-challenge of the Multimodal Emotion Recognition Challenge (MEC) 2017. The challenge (*1) is aimed at the comparison of multimedia processing and machine learning methods for automatic audio and visual emotion analysis.

Read more

Panasonic’s Flagship LZ2000 OLED TV Continually Powered by Panasonic R&D Center Singapore’s Neural Network Architecture

At this year’s Consumer Electronics Show (CES) held on 4th -7th January, Panasonic unveiled its new flagship OLED TV for…

Read more