Abstract
We present our approach to Task 1 of the CLEF eRisk 2025 Lab, which focuses on identifying depression symptoms in user-generated text. The task is formulated as a sentence ranking problem, aiming to retrieve sentences relevant to each of the 21 symptoms defined in the Beck Depression Inventory-II (BDI-II). The method employs Sentence-BERT to compute semantic similarity between user text and symptom queries derived from the BDI questionnaire’s multiple-choice responses. To improve coverage, queries are expanded based on retrieval results from the training set. Additionally, sentences not referring to the user are filtered out to reduce noise from third-person narratives. Our approach achieved competitive performance, with Average Precision substantially exceeding the median of all submitted systems. This demonstrates the promise of semantic retrieval and first-person filtering for identifying fine-grained depressive symptoms at scale.
| Original language | English |
|---|---|
| Pages (from-to) | 1611-1619 |
| Number of pages | 9 |
| Journal | CEUR Workshop Proceedings |
| Volume | 4038 |
| State | Published - 1 Jan 2025 |
| Event | 26th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2025 - Madrid, Spain Duration: 9 Sep 2025 → 12 Sep 2025 |
Keywords
- Beck’s Depression Inventory-II
- Large Language Models
- Mental Health NLP
- Semantic Similarity
- Sentence-BERT
- Text Retrieval
ASJC Scopus subject areas
- General Computer Science