TY - GEN
T1 - An Open X-Ray Spectrometric Dataset for Deep Learning-Based Pile-up Correction
AU - Lin, Congyu
AU - Chen, Zikang
AU - Feng, Chujun
AU - Gu, Songqi
AU - Zheng, Xiaoying
AU - Zhu, Yongxin
AU - Trigano, Tom
AU - Bykhovsky, Dima
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - X-ray spectroscopy is used to investigate the elemental composition and chemical states of materials through an energy spectrum, a histogram representing the energies of emitted photons. However, in high input-count-rate scenarios, the pile-up effect occurs, distorting the energy spectrum and affecting measurement accuracy. Currently, deep learning methods have been increasingly applied to pile-up correction. Nevertheless, there is a lack of open datasets for improving and measuring the performance of various algorithms. In this paper, the first open nuclear pulse dataset was constructed using the Allpix Squared [12] simulator, which is a well-recognized simulation framework developed by CERN (European Council for Nuclear Research). The dataset contains nuclear pulse data for twelve representative chemical elements. Detailed and diverse annotations are provided to support various deep-learning models. In addition, X-ray fluorescence measurements were conducted at the Shanghai Synchrotron Radiation Facility (SSRF), where the distribution of simulated data was compared with experimental data. Finally, deep learning-based pile-up correction was performed on this dataset, demonstrating its potential for training and evaluating deep learning algorithms.
AB - X-ray spectroscopy is used to investigate the elemental composition and chemical states of materials through an energy spectrum, a histogram representing the energies of emitted photons. However, in high input-count-rate scenarios, the pile-up effect occurs, distorting the energy spectrum and affecting measurement accuracy. Currently, deep learning methods have been increasingly applied to pile-up correction. Nevertheless, there is a lack of open datasets for improving and measuring the performance of various algorithms. In this paper, the first open nuclear pulse dataset was constructed using the Allpix Squared [12] simulator, which is a well-recognized simulation framework developed by CERN (European Council for Nuclear Research). The dataset contains nuclear pulse data for twelve representative chemical elements. Detailed and diverse annotations are provided to support various deep-learning models. In addition, X-ray fluorescence measurements were conducted at the Shanghai Synchrotron Radiation Facility (SSRF), where the distribution of simulated data was compared with experimental data. Finally, deep learning-based pile-up correction was performed on this dataset, demonstrating its potential for training and evaluating deep learning algorithms.
KW - X-ray spectroscopy
KW - dataset
KW - deep learning
KW - pile-up
UR - https://www.scopus.com/pages/publications/105010169963
U2 - 10.1007/978-981-96-8731-2_25
DO - 10.1007/978-981-96-8731-2_25
M3 - Conference contribution
AN - SCOPUS:105010169963
SN - 9789819687305
T3 - Lecture Notes in Computer Science
SP - 256
EP - 265
BT - Wireless Artificial Intelligent Computing Systems and Applications - 19th International Conference, WASA 2025, Proceedings
A2 - Cai, Zhipeng
A2 - Zhu, Yongxin
A2 - Wang, Yonghao
A2 - Qiu, Meikang
PB - Springer Science and Business Media Deutschland GmbH
T2 - 19th International Conference on Wireless Artificial Intelligent Computing Systems and Applications, WASA 2025
Y2 - 24 June 2025 through 26 June 2025
ER -