TY - JOUR
T1 - OWAdapt
T2 - an adaptive loss function for deep learning using OWA operators
AU - Maldonado, Sebastián
AU - Vairetti, Carla
AU - Jara, Katherine
AU - Carrasco, Miguel
AU - López, Julio
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/11/25
Y1 - 2023/11/25
N2 - In this paper, we propose a novel adaptive loss function for enhancing deep learning performance in classification tasks. Specifically, we redefine the cross-entropy loss to effectively address class-level noise conditions, including the challenging problem of class imbalance. Our approach introduces aggregation operators to improve classification accuracy. The rationale behind our proposed method lies in the iterative up-weighting of class-level components within the loss function, focusing on those with larger errors. To achieve this, we employ the ordered weighted average (OWA) operator and combine it with an adaptive scheme for gradient-based learning. The main finding is that our method outperforms other commonly used loss functions, such as the standard cross-entropy or focal loss, across various binary and multiclass classification tasks. Furthermore, we explore the influence of hyperparameters associated with the OWA operators and propose a default configuration that performs well across different experimental settings.
AB - In this paper, we propose a novel adaptive loss function for enhancing deep learning performance in classification tasks. Specifically, we redefine the cross-entropy loss to effectively address class-level noise conditions, including the challenging problem of class imbalance. Our approach introduces aggregation operators to improve classification accuracy. The rationale behind our proposed method lies in the iterative up-weighting of class-level components within the loss function, focusing on those with larger errors. To achieve this, we employ the ordered weighted average (OWA) operator and combine it with an adaptive scheme for gradient-based learning. The main finding is that our method outperforms other commonly used loss functions, such as the standard cross-entropy or focal loss, across various binary and multiclass classification tasks. Furthermore, we explore the influence of hyperparameters associated with the OWA operators and propose a default configuration that performs well across different experimental settings.
KW - Class-imbalance classification
KW - Deep learning
KW - Loss functions
KW - OWA operators
UR - http://www.scopus.com/inward/record.url?scp=85172671934&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2023.111022
DO - 10.1016/j.knosys.2023.111022
M3 - Article
AN - SCOPUS:85172671934
SN - 0950-7051
VL - 280
SP - 1
EP - 9
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 111022
ER -