TY - GEN
T1 - Robot-Led Vision Language Model Wellbeing Assessment of Children
AU - Abbasi, Nida Itrat
AU - Dogan, Fethiye Irmak
AU - Laban, Guy
AU - Anderson, Joanna
AU - Ford, Tamsin
AU - Jones, Peter B.
AU - Gunes, Hatice
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - This study presents a novel robot-led approach to assessing children's mental wellbeing using a Vision Language Model (VLM). Inspired by the Child Apperception Test (CAT), the social robot NAO presented children with pictorial stimuli to elicit their verbal narratives of the images, which were then evaluated by a VLM in accordance with CAT assessment guidelines. The VLM's assessments were systematically compared to those provided by a trained psychologist. The results reveal that while the VLM demonstrates moderate reliability in identifying cases with no wellbeing concerns, its ability to accurately classify assessments with wellbeing concerns remains limited. Moreover, although the model's performance was generally consistent when prompted with varying demographic factors such as age and gender, a significantly higher false positive rate was observed for girls, indicating potential sensitivity to gender attribute. These findings highlight both the promise and the challenges of integrating VLMs into robot-led assessments of children's wellbeing.
AB - This study presents a novel robot-led approach to assessing children's mental wellbeing using a Vision Language Model (VLM). Inspired by the Child Apperception Test (CAT), the social robot NAO presented children with pictorial stimuli to elicit their verbal narratives of the images, which were then evaluated by a VLM in accordance with CAT assessment guidelines. The VLM's assessments were systematically compared to those provided by a trained psychologist. The results reveal that while the VLM demonstrates moderate reliability in identifying cases with no wellbeing concerns, its ability to accurately classify assessments with wellbeing concerns remains limited. Moreover, although the model's performance was generally consistent when prompted with varying demographic factors such as age and gender, a significantly higher false positive rate was observed for girls, indicating potential sensitivity to gender attribute. These findings highlight both the promise and the challenges of integrating VLMs into robot-led assessments of children's wellbeing.
UR - https://www.scopus.com/pages/publications/105024561924
U2 - 10.1109/RO-MAN63969.2025.11217833
DO - 10.1109/RO-MAN63969.2025.11217833
M3 - Conference contribution
AN - SCOPUS:105024561924
T3 - IEEE International Workshop on Robot and Human Communication, RO-MAN
SP - 59
EP - 64
BT - 2025 34th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2025
PB - Institute of Electrical and Electronics Engineers
T2 - 34th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2025
Y2 - 25 August 2025 through 29 August 2025
ER -