This shared task aims to the development of automatic systems for identify patients potentially suffering from insomnia using electronic health records (EHRs). It is structured as a text classification challenge requiring participants to analyze a clinical note to determine if a patient is likely to have insomnia.
We have developed a comprehensive set of rules (Insomnia rules) to facilitate the identification of patients potentially suffering from insomnia. These rules incorporate both direct and indirect symptoms of insomnia and include information about commonly prescribed hypnotic medications. For this task, we have curated an annotated corpus of 210 clinical notes from the MIMIC III database, adhering to the Insomnia rules during the annotation process. Each note is annotated with a binary label indicating the patient’s overall insomnia status ("yes" or "no"), and at the rule-level to indicate the satisfaction of each rule based on the note’s content. Additionally, to enhance the explainability of participating NLP systems, we provide textual evidence from the clinical notes that support each annotation. This ensures that the outputs of the systems can be effectively justified.
Participants are encouraged to use large language models (LLMs) to tackle the Insomnia detection task. This shared task serves as an exceptional benchmark to assess the reasoning capabilities of LLMs in medicine, applying a realistic set of diagnostic guidelines to real-world clinical data.
This text classification shared task is divided into three distinct subtasks:
To participate in #SMM4H 2025 Task 4, please register your team here with the same e-mail address as your CodaLab account. When your registration is approved, you will be invited to a Google group, where the training, validation, and test data will be made available. Please check the #SMM4H 2025 website for important dates.
GitHub repository: https://github.com/guilopgar/SMM4H-HeaRD-2025-Task-4-Insomnia
The evaluation scripts are available in the evaluation folder within the GitHub repository for the task.
By submitting results to this competition, you consent to the public release of your scores at the SMM4H'25 workshop and in the associated proceedings, at the task organizers' discretion. Scores may include, but are not limited to, automatic and manual quantitative judgements, qualitative judgements, and such other metrics as the task organizers see fit. You accept that the ultimate decision of metric choice and score value is that of the task organizers. You further agree that the task organizers are under no obligation to release scores and that scores may be withheld if it is the task organizers' judgement that the submission was incomplete, erroneous, deceptive, or violated the letter or spirit of the competition's rules. Inclusion of a submission's scores is not an endorsement of a team or individual's submission, system, or science. You further agree that your system may be named according to the team name provided at the time of submission, or to a suitable shorthand as determined by the task organizers. You further agree to submit and present a short paper describing your system during the workshop. You agree not to redistribute the training and test data without the explicit approval of the organizers.
Guillermo Lopez-Garcia, Cedars-Sinai Medical Center, USA (Guillermo.LopezGarcia@cshs.org)
For each subtask, ground truth annotations are provided in JSON format. Participants are required to submit their system outputs following the same format as the ground truth annotations provided by the organizers.
Start: March 1, 2025, midnight
Description: Practice phase: please submit predictions on validation data.
Start: April 7, 2025, midnight
Description: Evaluation phase: please submit predictions on test data. The results obtained here will be used for the the official evaluation of the competition.
Start: April 15, 2025, midnight
Description: Post-Evaluation phase: please submit predictions on test data. This phase starts after the end of the competition.
Start: March 1, 2025, midnight
Description: Practice phase: please submit predictions on validation data.
Start: April 7, 2025, midnight
Description: Evaluation phase: please submit predictions on test data. The results obtained here will be used for the the official evaluation of the competition.
Start: April 15, 2025, midnight
Description: Post-Evaluation phase: please submit predictions on test data. This phase starts after the end of the competition.
Start: March 1, 2025, midnight
Description: Practice phase: please submit predictions on validation data.
Start: April 7, 2025, midnight
Description: Evaluation phase: please submit predictions on test data. The results obtained here will be used for the the official evaluation of the competition.
Start: April 15, 2025, midnight
Description: Post-Evaluation phase: please submit predictions on test data. This phase starts after the end of the competition.
Never
You must be logged in to participate in competitions.
Sign In# | Username | Score |
---|---|---|
1 | prajaktakini | 0.53 |
2 | RBG-AI | 0.45 |
3 | swendelken | 0.41 |