TY - GEN
T1 - LIESA
T2 - 31st International Conference on Multimedia Modeling, MMM 2025
AU - Zhang, Jingyao
AU - Hao, Shijie
AU - Sun, Fuming
AU - Rao, Yuan
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - Low-light image enhancement (LLIE) is a critical low-level image processing task aimed at improving the visual quality of dimly lit images, thereby enhancing user experience and supporting subsequent applications. Recent research has explored incorporating semantic information into LLIE models. However, these methods often suffer from high computational costs during training due to the optimization of semantic-related loss functions or inadequate utilization of semantic features. This paper presents a novel model, Low-light Image Enhancement with Semantic Awareness (LIESA), designed to address these limitations. LIESA focuses on adjusting the illumination component of an image and employs a triple-branch encoder-decoder architecture. To effectively integrate semantic and reflectance features into the illumination feature hierarchy, we introduce multiple Lighting Optimization Modules (LOMs) at different stages of the illumination encoder-decoder branch. In LOMs, two sub-modules, Semantic Prior Fusion Module (SFM) and Illumination Attention Module (IAM), are designed to fuse information from the illumination, reflectance and semantic branches. The design of our model facilitates the incorporation of high-level and low-level features across different resolutions, thereby leading to more effective enhancement in terms of improving visibility and preserving visual naturalness. Experimental evaluations on ten public datasets demonstrate LIESA’s superior performance in terms of both visual quality and quantitative metrics compared to state-of-the-art LLIE methods that incorporate semantic information. Ablation studies further corroborate the effectiveness of LIESA’s key components.
AB - Low-light image enhancement (LLIE) is a critical low-level image processing task aimed at improving the visual quality of dimly lit images, thereby enhancing user experience and supporting subsequent applications. Recent research has explored incorporating semantic information into LLIE models. However, these methods often suffer from high computational costs during training due to the optimization of semantic-related loss functions or inadequate utilization of semantic features. This paper presents a novel model, Low-light Image Enhancement with Semantic Awareness (LIESA), designed to address these limitations. LIESA focuses on adjusting the illumination component of an image and employs a triple-branch encoder-decoder architecture. To effectively integrate semantic and reflectance features into the illumination feature hierarchy, we introduce multiple Lighting Optimization Modules (LOMs) at different stages of the illumination encoder-decoder branch. In LOMs, two sub-modules, Semantic Prior Fusion Module (SFM) and Illumination Attention Module (IAM), are designed to fuse information from the illumination, reflectance and semantic branches. The design of our model facilitates the incorporation of high-level and low-level features across different resolutions, thereby leading to more effective enhancement in terms of improving visibility and preserving visual naturalness. Experimental evaluations on ten public datasets demonstrate LIESA’s superior performance in terms of both visual quality and quantitative metrics compared to state-of-the-art LLIE methods that incorporate semantic information. Ablation studies further corroborate the effectiveness of LIESA’s key components.
KW - Cross attention
KW - Low-light image enhancement
KW - Retinex theory
KW - Semantic information
UR - https://www.scopus.com/pages/publications/85215770649
U2 - 10.1007/978-981-96-2061-6_17
DO - 10.1007/978-981-96-2061-6_17
M3 - Conference contribution
AN - SCOPUS:85215770649
SN - 9789819620609
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 226
EP - 239
BT - MultiMedia Modeling - 31st International Conference on Multimedia Modeling, MMM 2025, Proceedings
A2 - Ide, Ichiro
A2 - Kompatsiaris, Ioannis
A2 - Xu, Changsheng
A2 - Yanai, Keiji
A2 - Chu, Wei-Ta
A2 - Nitta, Naoko
A2 - Riegler, Michael
A2 - Yamasaki, Toshihiko
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 8 January 2025 through 10 January 2025
ER -