Learnable pooling weights for facial expression recognition

M. Amine Mahmoudi; Aladine Chetouani; Fatma Boufera; Hedi Tabia

doi:10.1016/j.patrec.2020.09.001

Journal Articles Pattern Recognition Letters Year : 2020

Learnable pooling weights for facial expression recognition

(1) , (2) , (1) , (3)

1
2
3

M. Amine Mahmoudi

Function : Author

Université Mustapha Stambouli de Mascara [Algérie] = University Mustapha Stambouli [Mascara, Algeria]

Aladine Chetouani

Function : Author
PersonId : 858228

Laboratoire pluridisciplinaire de recherche en ingénierie des systèmes, mécanique et énergétique

Fatma Boufera

Function : Author

Université Mustapha Stambouli de Mascara [Algérie] = University Mustapha Stambouli [Mascara, Algeria]

Hedi Tabia

Function : Author
PersonId : 11431
IdHAL : hedi-tabia
ORCID : 0000-0002-1827-7150
IdRef : 159010373

Informatique, BioInformatique, Systèmes Complexes

Abstract

Pooling layers are spatial down-sampling layers used in convolutional neural networks (CNN) to gradually downscale the feature map, increase the receptive field size and reduce the number of the parameters in the model. The use of pooling layers leads to less computing complexity and memory consumption reduction but also introduces invariance to certain filter distortions which may induce subtle detail loss. This behaviour is undesired for some fine-grained recognition tasks such as facial expression recognition (FER) which highly relies on specific regional distortion detection. In this paper, we introduce a more filter distortion aware pooling layer based on kernel functions. The proposed pooling reduces the feature map dimensions while keeping track of the majority of the information fed to the next layer instead of ignoring part of them. The experiments on RAF, FER2013 and ExpW databases demonstrate the benefits of such layer and show that our model achieves competitive results with respect to the state-of-the-art approaches.

Keywords

Deep learning Facial expression recognition Kernel methods

Domains

Machine Learning [cs.LG] Signal and Image processing

Fichier principal

MAM2020.pdf (334.69 Ko)

Origin : Files produced by the author(s)

Mathias Legrand : Connect in order to contact the contributor

https://hal.science/hal-02963286

Submitted on : Tuesday, October 31, 2023-2:23:54 PM

Last modification on : Monday, April 22, 2024-4:24:05 PM

Long-term archiving on: Thursday, February 1, 2024-7:00:39 PM

Dates and versions

hal-02963286 , version 1 (31-10-2023)

Identifiers

HAL Id : hal-02963286 , version 1
DOI : 10.1016/j.patrec.2020.09.001

Cite

M. Amine Mahmoudi, Aladine Chetouani, Fatma Boufera, Hedi Tabia. Learnable pooling weights for facial expression recognition. Pattern Recognition Letters, 2020, 138, pp.644--650. ⟨10.1016/j.patrec.2020.09.001⟩. ⟨hal-02963286⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ORLEANS UNIV-EVRY IBISC IBISC-IRA2 UNIV-PARIS-SACLAY PRISME-CVL INSA-GROUPE INSA-CVL GS-ENGINEERING GS-COMPUTER-SCIENCE GS-LIFE-SCIENCES-HEALTH GS-SPORT-HUMAN-MOVEMENT

160 View

24 Download

Learnable pooling weights for facial expression recognition

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share