HOPE at IberLEF 2024

Organized by fazlfrs - Current server time: Oct. 5, 2025, 1:45 a.m. UTC

Previous

Task 2 - PolyHope Multiclass Testing (Spanish)
April 1, 2024, midnight UTC

Current

Task 2 - PolyHope Multiclass Testing (English)
April 1, 2024, midnight UTC

End

Competition Ends
Never

Welcome to HOPE 2024!

Description

Hope is one of the exceptional human capacities that allows for flexible anticipation of future events and possible expected outcomes. These visions significantly influence emotions, behaviour and mood (Bruininks and Malle, 2005). Individuals with high hope do not react in the same way to barriers as individuals with low hope, but instead view barriers as challenges to overcome and use their pathway thoughts to plan an alternative route to their goals (Snyder, 1994, 2000). In addition, high hope has been found to correlate with a number of beneficial elements, such as academic performance (Snyder, 2002) and lower levels of depression (Snyder et al., 1997). In contrast, low hope is associated with negative outcomes, such as reduced well-being (Diener, 2009).

Despite the importance and prevalence of hope, it has received little attention in the field of Natural Language Processing (NLP). Machine learning and NLP techniques can be used to analyze social media data and provide insights into the nature of hope in human behavior and decision-making. Therefore, in the last two years, we have promoted research on this topic by organizing shared tasks. We organized shared tasks on hope speech detection at the second workshop on Language Technology for Equality, Diversity and Inclusion (LT-EDI-2022), as a part of ACL 2022 (Chakravarthi et al., 2022), at LT-EDI-2023, within RANLP 2023 (Kumaresan et al., 2023) and the shared task HOPE in IberLEF 2023 (Jiménez-Zafra et al., 2023). The participants registered in these previous tasks show that there is an expected target community, as the maximum number of participants registered was 126 and the minimum 50.

The main novelty of this new edition is the study of hope from two perspectives: i) hope for equality, diversity and inclusion, and ii) hope as expectations. The first perspective was explored in the last edition of IberLEF 2023 for English and Spanish. In this new edition, we focus on expanding and improving our Spanish dataset to answer one of the most frequently asked questions from previous participants and researchers in hope speech detection: Is it possible to detect hope speech in multiple domains, even when we only train our models with texts from one specific area? Therefore, this time we again provide the teams with a training corpus focused on the LGTBI community, but we ask them to test their systems with texts belonging to the LGTBI domain and new unknown domains. The second perspective has not been studied previously in any shared task and, for the IberLEF 2024 edition, we propose its study from a binary and multi-class perspective for English and Spanish.

The proposed shared task will have two task to explore hope in social media texts:

  • Task 1: Hope for Equality, Diversity and Inclusion. Hope speech is the type of speech that is able to relax a hostile environment (Palakodety et al., 2019) and that helps, gives suggestions and inspires for good to a number of people when they are in times of illness, stress, loneliness or depression (Chakravarthi, 2020). On social media, offensive messages are posted against people because of their race, color, ethnicity, gender, sexual orientation, nationality, or religion. As Chakravarthi (2020) stated, how vulnerable groups interact with social media has been studied and found that it plays an essential role in shaping the individual’s personality and view of society (Burnap et al., 2017; Kitzie, 2018; Milne et al., 2016). Examples of these vulnerable groups are the Lesbian, Gay, Bisexual, and Transgender (LGBT) community, racial minorities or people with disabilities. This task is related to the inclusion of vulnerable groups and focuses on the study of the detection of hope speech, in pursuit of Equality, Diversity and Inclusion. Given a tweet written in Spanish, identify whether it contains hope speech or not. Specifically, it is divided into two subtasks: subtask 1.a: Hope speech detection on LGTBI domain; and subtask 1.b: Hope speech detection on unknown domains.
  • Task 2: Hope as expectations. Hope is characterized as "openness of spirit toward the future, a desire, expectation, and wish for something to happen or to be true" that remarkably affects a human’s state of mind, emotions, behaviors, and decisions (Bruininks and Malle, 2005; Balouchzahi et al., 2023). Nowadays, social media platforms significantly affect human life, and people freely express their thoughts on these platforms (Balouchzahi et al., 2022). Therefore, analyzing hope in social media is considered an essential determinant of well-being that can provide potentially valuable insights into the trajectory of goaldirected behaviors, persistence in the face of misfortunes, and the processes underlying adjustment to positive and negative life changes (Balouchzahi et al., 2023). This task focuses on expectations, and desirable and undesirable facts. Specifically, it is divided into two subtasks: subtask 2.a: Binary Hope Speech Detection from English and Spanish texts; and subtask 2.b: Multiclass Hope Speech Detection from English and Spanish texts.

Challenges

  • To explore different ML approaches for multiclass and multilingual hope detection.
  • To experiment with techniques dealing with the imbalanced distribution of labels.
  • To experiment with dealing with noisy social media data for hope detection.
  • To detect hope speech in domains other than those in which our systems have been trained.
  • To address the linguistic differences between English and Spanish texts in the realm of hope expressions.
  • To deal with a lack of context while identifying hope from a text.
  • To distinguish realistic and unrealistic as well as general hope/expectation computationall

Schedule

  • Release of training + development corpora: Feb 16, 2024.
  • Release of test corpora and start of evaluation campaign: Apr 1, 2024.
  • End of evaluation campaign (deadline for runs submission): Apr 16, 2024.
  • Publication of official results: Apr 18, 2024.
  • Paper submission: May 14, 2024.
  • Review notification: Jun 11, 2024.
  • Camera ready submission: Jun 28, 2024.
  • IberLEF Workshop (SEPLN 2023): Sep 27, 2024.
  • Publication of proceedings: Sep ??, 2024.

Organizers Details

The organizers of the competition are:

  • Daniel García-Baena, SINAI, Universidad de Jaén, Spain.
  • Fazlourrahman Balouchzahi, CIC IPN, Mexico.
  • Salud María Jiménez-Zafra, SINAI, Universidad de Jaén, Spain.
  • Sabur Butt, Institute for the Future of Education (IFE) at Tecnológico de Monterrey, Mexico.
  • Miguel Ángel García-Cumbreras, SINAI, Universidad de Jaén, Spain.
  • Atnafu Lambebo Tonja, Centro de Investigación en Computación, Instituto Politécnico Nacional (IPN), Mexico.
  • José Antonio García-Díaz, UMUTeam, Universidad de Murcia, Spain.
  • Selen Bozkurt, Department of Biomedical Informatics, School of Medicine, Emory University.
  • Bharathi Raja Chakravarthi, University of Galway, Ireland.
  • Hector G. Ceballos, Institute for the Future of Education (IFE) at Tecnologico de Monterrey, Mexico.
  • Rafael Valencia-García, UMUTeam, Universidad de Murcia, Spain.
  • Grigori Sidorov, CIC IPN, Mexico.
  • L. Alfonso Ureña-López, SINAI, Universidad de Jaén, Spain.
  • Alexander Gelbukh, CIC IPN, Mexico.

The team members have working experience in NLP and a background in organizing different shared tasks and workshops. Fazlourrahman Balouchzahi, Sabur Butt, Alexander Gelbukh and Grigori Sidorov have experience in organizing shared tasks (Urduthreat2020, Urdufake2021, Emothreat 2022, Kanglish 2022, and Abusive Language@FIRE 2021). Moreover, Daniel García Baena, Salud María Jiménez Zafra, Miguel Ángel García Cumbreras, José Antonio García Díaz, Rafael Valencia García and L. Alfonso Ureña López have organized different events: i) shared tasks on SemEval, ACL, RANLP, EVALITA and IberLEF; ii) conferences and workshops, such as SEPLN and IberLEF; and iii) Doctoral Symposiums, such as the Doctoral Symposium on Natural Language Processing.

Daniel García Baena is a secondary school computer science teacher and a doctoral student at the Universidad de Jaén. His research is focused on Natural Language Processing and Text Categorization, specially in Hope-Speech. He participated in the committees of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion at ACL LT-EDI 2022; and of HOPE at IberLEF 2023.

Fazlourrahman Balouchzahi a graduate Ph.D. student from CIC IPN Mexico, with a strong background in text processing and machine learning, including deep learning and transformers, he has a passion for exploring and contributing to the field of NLP. His research has focused on social media analysis and psychological emotion analysis, specifically detecting hope and regret from the text. Additionally, he brings a vast amount of experience in organizing around 5 and being a participant in more than 20 shared tasks in low-resource languages while bagging top positions in more than 80% of them.

Salud María Jiménez-Zafra, SINAI, Computer Science Department, Universidad de Jaén, Spain (sjzafra@ujaen.es). Her research is focused on Natural Language Processing and Text Classification, specially on Negation Processing, Sentiment Analysis, Offensive Language, Hope-Speech, Social Media Analysis and Language Resource Generation. She has been part of the organizing committee of the three editions of NEGES workshop, of SemEval-2016 Task 5: Aspect Based Sentiment Analysis, of the 32nd and 39th Annual SEPLN conference, of TASS at IberLEF 2020, of EmoEvalEs at IberLEF 2021, of the 2020, 2021, 2022 editions of the Doctoral Symposium on NLP from the PLN.net thematic network, of the 2023 edition of the the Doctoral Symposium on NLP from the Proyecto ILENIA, of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI 2022 - ACL 2022 and at RANLP - LT-EDI 2023, of PoliticES at IberLEF 2022 and at IberLEF 2023, of HOPE at IberLEF 2023, of EVALITA 2023 Task - PoliticIT and of the 2023 edition of IberLEF workshop. Currently, she is part of the organizing committee of IberLEF 2024, of the Doctoral Symposium on NLP and of the Homophobia/Transphobia detection task at LT-EDI EACL 2024.

Sabur Butt is a Postdoctoral Researcher at the Institute for the Future of Education (IFE) at Tecnológico de Monterrey, Mexico. He specializes in the field of Natural Language Processing (NLP) and has a strong background in computer science.

Selen Bozkurt is an Assistant Professor at Emory University, specializing in natural language processing (NLP), machine learning (ML), and clinical decision support systems. With a background including roles at Stanford University and Flatiron Health, she brings expertise in biomedical data analytics and predictive modeling. Her research focuses on developing clinical decision support tools using advanced data science methods, including NLP and ML algorithms. 

Bharathi Raja Chakravarthi is a permanent Lecturer-Above-the-Bar/Assistant Professor at the School of Computer Science at the University of Galway, Ireland. Before this, he was a Postdoctoral Fellow at the Insight SFI Research Centre for Data Analytics, Data Science Institute, University of Galway, Ireland. He completed his PhD from the Data Science Institute, University of Galway, Ireland. His recent research focuses on text classification, multimodal machine learning, sentiment analysis, abusive/offensive language detection, bias in natural language processing tasks, inclusive language detection, positivity in social media platforms, machine translation, and multilingualism. He has published multiple international conference papers (COLING, LREC, MTSUMMIT, DSAA, LDK, GWC, AICS, FIRE, etc.) and highly reputed journal papers (Computer Speech & Language, Language Resources and Evaluation, Social Network Analysis and Mining, Multimedia Tools and Applications, International Journal of Data Science and Analytics, etc.). He has received the Best Application Paper Award at DSAA 2020 IEEE and ACMfunded conference. Dr. Chakravarthi served as chair and lead organizer for the 1st and 2nd Workshop on Language Technology for Equality, Diversity and Inclusion (https://sites.google.com/view/lt-edi-2022) and Workshop on Speech and Language Technologies for Dravidian Languages (https://dravidianlangtech.github.io/2022/). He served on program committees for a number of ACL conferences and workshops. He also served as guest editor for special issues in Computer Speech & Language, Language Resources and Evaluation, and ACM Transactions on Asian and Low Resource Language Information Processing journals.

Hector G. Ceballos serves as a full-time faculty member in the Computer Science Graduate Program (DCC) and is affiliated with the Research Group on Advanced Artificial Intelligence. His primary research focuses on Machine Learning, Data Science, Process Mining, and Causality, applied to Research and Learning Analytics. With a publication record exceeding 60 papers in journals and conferences, Hector has also lent his expertise as a consultant to banks and IT companies. Over the years, he has mentored more than 20 master’s and PhD students. Currently, Hector G. Ceballos directs the Living Lab & Data Hub at the Institute for the Future of Education (IFE) at Tecnologico de Monterrey. Recognized for his contributions, Hector is a Level 1 member of the Mexican National System of Researchers (SNI) and an adherent member of the Mexican Academy of Computing (AMEXCOMP). Furthermore, Hector takes on the role of Organizing Committee Chair for the Special Interest Group in Learning Analytics in Latin America (LALA-SIG) within the Society for Learning Analytics Research (SOLAR). He is also a member of the QS EduData Summit Advisory Committee.

Rafael Valencia-García is Full Professor at the Department of Informatics and Systems in the Universidad de Murcia. His research interests focus on Natural Language Processing, Sentiment Analysis, Semantic Web and Recommender Systems. He was the General Chair of the SEPLN 2017 conference held in Murcia. He has participated in more than 35 research projects and published more than 150 articles in journals, conferences, and book chapters. He has been guest editor of several NLP related Special Issues in different JCR-indexed journals such as PMC, CSI, IJSEKE, JRPIT, JUCS or SCP. He was part of the organizing committee of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI 2022 - ACL 2022 and part of the organizing committee of FinancES 2023 and PoliticES 2022 and 2023 in Iber- LEF and PoliticIT 2023 task in EVALITA.

Grigori Sidorov has co-authored more than 250 scientific publications with an h-index of 30. He is well experienced in the field of Emotion Detection in text. Apart from that, he is a regular member of the Mexican Academy of Sciences, National Researcher of Mexico (SNI) level 3 (highest). He is also the Editor-in-Chief of the research journal "Computación y Sistemas" (Clarivate Web of Science (Scielo, CORE collection (emerging sources)), Scopus, DBLP, index of excellence of Conacyt, etc.). He was also one of the organizers of the FIRE 2020 and 2021.

L. Alfonso Ureña-López, Professor of Computer Science, director of the SINAI research group of Universidad de Jaén, Spain (laurena@ujaen.es). He has been the president of SEPLN (Spanish Society for Natural Language Processing). His research is focused on Natural Language Processing, Word Sense Disambiguation, Text Categorization, Sentiment Analysis, Offensive Language, Hope Speech... He has organized and chaired numerous congresses in the area of NLP. Likewise, he has formed and is part of numerous scientific committees of conferences and workshops in NLP.

Alexander Gelbukh is the founder and chair of the CICLing International Conference series. He has been Honorary Chair of ENC-2008, Program co-chair of some recent MICAI, CIC, CORE, and some other conferences. He is also the founding Editor-in-Chief of the International Journal of Computational Linguistics and Applications (IJCLA) and Editor-in-Chief of the journal POLIBITS.

Funding

  • Project CONSENSO (PID2021-122263OB-C21) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR.
  • Project MODERATES (TED2021-130145B-I00) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR.
  • Project SocialTox (PDC2022-133146-C21) MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR.
  • Fondo Social Europeo and Administration of the Junta de Andalucía (DOC_01073).
  • Grant P20_00956 (PAIDI 2020) from the Andalusian Regional Government.
  • Project LaTe4PoliticES (PID2022-138099OB-I00) funded by MCIN/AEI/10.13039/501100011033 and the European Fund for Regional Development (FEDER)-a way to make Europe.
  • Project LT-SWM (TED2021-131167B-I00) funded by MCIN/AEI/10.13039/501100011033 and by the European Union NextGenerationEU/PRTR.
  • Mexican Government grant A1-S-47854 of CONACYT.
  • Grants 20232138, 20232080, 20231567 of the Secretaría de Investigación y Posgrado of the Instituto Politécnico Nacional, Mexico.

Contact Person

References

  • Fazlourrahman Balouchzahi, Grigori Sidorov, and Alexander Gelbukh. 2023. Polyhope: Two-level hope speech detection from tweets. Expert Systems with Applications, 225:120078.
  • Sidorov, G., Balouchzahi, F., Butt, S., & Gelbukh, A. (2023). Regret and Hope on Transformers: An Analysis of Transformers on Regret and Hope Speech Detection Datasets. Applied Sciences13(6), 3983.
  • Fazlourrahman Balouchzahi, Grigori Sidorov, and Hosahalli Lakshmaiah Shashirekha. 2022. Fake news spreaders profiling using n-grams of various types and shap-based feature selection. Journal of Intelligent & Fuzzy Systems, 42(5):4437–4448.
  • Patricia Bruininks and Bertram F Malle. 2005. Distinguishing Hope from Optimism and Related Affective States. Motivation and emotion, 29(4):324–352.
  • Pete Burnap, Gualtiero Colombo, Rosie Amery, Andrei Hodorog, and Jonathan Scourfield. 2017. Multi-class machine classification of suicide-related communication on twitter. Online social networks and media, 2:32–44.
  • Bharathi R Chakravarthi. 2020. Hopeedi: A multilingual hope speech detection dataset for equality, diversity, and inclusion. In Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, pages 41–53, Barcelona, Spain (Online). Association for Computational Linguistics.
  • Bharathi Raja Chakravarthi, Vigneshwaran Muralidaran, Ruba Priyadharshini, Subalalitha Chinnaudayar Navaneethakrishnan, John P McCrae, Miguel Ángel García-Cumbreras, Salud María Jiménez-Zafra, Rafael Valencia-García, Prasanna Kumar Kumaresan, Rahul Ponnusamy, Daniel García-Baena, and José Antonio García-Díaz. 2022. Overview of the shared task on hope speech detection for equality, diversity, and inclusion. Association for Computational Linguistics, pages 378–388.
  • Ed Diener. 2009. Subjective well-being. The science of well-being, pages 11–58.
  • Emery R Eaves, Mark Nichter, and Cheryl Ritenbaugh. 2016. Ways of Hoping: Navigating the Paradox of Hope and Despair in Chronic Pain. Culture, Medicine, and Psychiatry, 40(1):35–58.
  • Jaklin Eliott and Ian Olver. 2002. The Discursive Properties of “Hope”: A Qualitative Analysis of Cancer Patients’ Speech. Qualitative health research, 12(2):173–193.
  • Douglas Ezzy. 2000. Illness Narratives: Time, Hope and HIV. Social science & medicine, 50(5):605–617.
  • Daniel García-Baena, Miguel Ángel García-Cumbreras, Salud María Jiménez-Zafra, José Antonio García-Díaz, and Rafael Valencia-García. 2023. Hope speech detection in Spanish: The LGTB case. Language Resources and Evaluation, pages 1–28.
  • Eve Garrard and Anthony Wrigley. 2009. Hope and Terminal Illness: False Hope versus Absolute Hope. Clinical Ethics, 4(1):38–43.
  • Salud María Jiménez-Zafra, Miguel Ángel Garcia-Cumbreras, Daniel García-Baena, José Antonio Garcia-Díaz, Bharathi Raja Chakravarthi, Rafael Valencia-García, and Luis Alfonso Ureña-López. 2023. Overview of hope at iberLEF 2023: Multilingual hope speech detection. Procesamiento del Lenguaje Natural, 71:371–381.
  • Vanessa Kitzie. 2018. I pretended to be a boy on the internet: Navigating affordances and constraints of social networking sites and search engines for LGTB+ identity work. First Monday.
  • Prasanna Kumaresan, Bharathi Raja Chakravarthi, Subalalitha Cn, Miguel Ángel García, Salud María Jiménez-Zafra, José Antonio García-Díaz, Rafael Valencia-García, Momchil Hardalov, Ivan Koychev, Prelav Nakov, Daniel García-Baena, Kishore Kumar Ponnusamy, and Blake Preston. 2023. Overview of the shared task on hope speech detection for equality, diversity, and inclusion. In Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion, pages 47–53. Association for Computational Linguistics.
  • Matthew Links and Jocelyn Kramer. 1994. Breaking Bad News: Realistic versus Unrealistic Hopes. Supportive Care in Cancer, 2(2):91–93.
  • Vibeke Lohne and Elisabeth Severinsson. 2004. Hope During the First Months after Acute Spinal cord Injury. Journal of Advanced Nursing, 47(3):279–286.
  • David N Milne, Glen Pink, Ben Hachey, and Rafael A Calvo. 2016. Clpsych 2016 shared task: Triaging content in online peer-support forums. In Proceedings of the third workshop on computational linguistics and clinical psychology, pages 118–127.
  • Shriphani Palakodety, Ashiqur R KhudaBukhsh, and Jaime G Carbonell. 2019. Hope speech detection: A computational analysis of the voice of peace. arXiv preprint arXiv:1909.12940.
  • Brett Smith and Andrew C Sparkes. 2005. Men, Sport, Spinal Cord Injury, and Narratives of Hope. Social science & medicine, 61(5):1095–1105.
  • C Richard Snyder. 2000. Hypothesis: There is Hope. In Handbook of hope, pages 3–21. Elsevier.
  • C Richard Snyder. 2002. Hope Theory: Rainbows in the Mind. Psychological inquiry, 13(4):249–275.
  • C Richard Snyder, Betsy Hoza, William E Pelham, Michael Rapoff, Leanne Ware, Michael Danovsky, Lori Highberger, Howard Ribinstein, and Kandy J Stahl. 1997. The development and validation of the children’s hope scale. Journal of pediatric psychology, 22(3):399–421.
  • Charles Richard Snyder. 1994. The psychology of hope: You can get there from here. Simon and Schuster.
  • Darren Webb. 2007. Modes of Hoping. History of the human sciences, 20(3):65–83.
  • Rose Wiles, Cheryl Cott, and Barbara E Gibson. 2008. Hope, Expectations and Recovery from Illness: A Narrative Synthesis of Qualitative Research. Journal of advanced nursing, 64(6):564–573.

Task: Hope Speech Detection

Task 1: Hope for Equality, Diversity and Inclusion

Hope speech is the type of speech that is able to relax a hostile environment (Palakodety et al., 2019) and that helps, gives suggestions and inspires for good to a number of people when they are in times of illness, stress, loneliness or depression (Chakravarthi, 2020). On social media, offensive messages are posted against people because of their race, color, ethnicity, gender, sexual orientation, nationality, or religion. As Chakravarthi (2020) stated, how vulnerable groups interact with social media has been studied and found that it plays an essential role in shaping the individual’s personality and view of society (Burnap et al., 2017; Kitzie, 2018; Milne et al., 2016). Examples of these vulnerable groups are the Lesbian, Gay, Bisexual, and Transgender (LGBT) community, racial minorities or people with disabilities. This task is related to the inclusion of vulnerable groups and focuses on the study of the detection of hope speech, in pursuit of Equality, Diversity and Inclusion. Given a tweet written in Spanish, identify whether it contains hope speech or not. Specifically, this task is divided into two subtasks, but participants will participate in both of them at the same time.

There will be a real time leaderboard and participants will be allowed to make a maximum of 10 submissions through CodaLab, from which each team will have to select the best one for ranking. Precision, Recall and F1-score will be measured per category and averaged using the macro-average method. Systems will be ranked using the macro-f1 score. As they are strongly correlated, it is not possible to participate in only one subtask (1.a or 1.b) from task 1. Therefore, submissions for task 1 will automatically qualify for both subtask 1.a and 1.b.

Subtask 1.a: Hope Speech Detection on LGTBI Domain

Giving training data on LGTBI tweets, the participants will have to classify each of the LGTBI-related tweets of the test set in one of the following categories:

  • hs: if the tweet contains hope speech.
  • nhs: if the tweet does not contain hope speech.

Subtask 1.b: Hope Speech Detection on Unknown Domains

Giving training data on LGTBI tweets, the participants will have to classify each of the tweets of the test set on unknown domains in one of the following categories:

  • hs: if the tweet contains hope speech.
  • nhs: if the tweet does not contain hope speech.

How to submit

Each team can participate with up to 10 submissions (except for development, where 100 submissions are allowed). The expected format for submissions is a .zip file (no folders within) with the predictions in a .csv file named as predictions.csv (comma separated file).

IMPORTANT!

  1. The .csv file MUST be named predictions.csv
  2. The .csv file MUST have the following header: id,category
  3. Once a submission has been finished, all detailed scores for Task 1.a and 1.b will available to be downloaded on: "Download output from scoring step".
predictions.csv file example:

id,category
0,hs
1,nhs

Task 2: Hope as Expectations

Hope is characterized as "openness of spirit toward the future, a desire, expectation, and wish for something to happen or to be true" that remarkably affects a human’s state of mind, emotions, behaviors, and decisions (Bruininks and Malle, 2005; Balouchzahi et al., 2023). Nowadays, social media platforms significantly affect human life, and people freely express their thoughts on these platforms (Balouchzahi et al., 2022). Therefore, analyzing hope in social media is considered an essential determinant of well-being that can provide potentially valuable insights into the trajectory of goal-directed behaviors, persistence in the face of misfortunes, and the processes underlying adjustment to positive and negative life changes (Balouchzahi et al., 2023). This task focuses on expectations, and desirable and undesirable facts. Specifically, it is divided into two subtasks.

The participants will be given three-run submissions through CodaLab and they will be evaluated on Precision, Recall, and F1 scores. To evaluate multi-class hope detection detection, we will use accuracy and macro-averaged F1 and weighted F1 scores as used in the original paper (Balouchzahi et al., 2023).

Subtask 2.a: Binary Hope Speech Detection from English and Spanish Texts

Given training data, the participant will classify the text into two categories. In this problem, each text will be assigned one of the following labels :

  • Hope: tweets that convey a mention of hope.
  • Not Hope: tweets that do not convey hope, expectation, or desire.

Subtask 2.b: Multiclass Hope Speech Detection from English and Spanish Texts

Given training data, the participant will classify the text into following categories:

  • Generalized Hope: is expressed as a general hopefulness and optimism that is not directed toward any specific event or outcome (Wiles et al., 2008; Webb, 2007; Ezzy, 2000; Lohne and Severinsson, 2004; Smith and Sparkes, 2005).
  • Realistic Hope: is about expecting something reasonable, meaningful, and possible thing to happen (Wiles et al., 2008; Eaves et al., 2016; Garrard and Wrigley, 2009).
  • Unrealistic Hope: is usually in the form of wishing for something to become true, even though the possibility of happening is remote or significantly less or even zero (Webb, 2007; Eliott and Olver, 2002; Links and Kramer, 1994).
  • Not Hope: tweets that do not convey hope.

How to submit

Each team can participate with up to 10 submissions. The expected format for submissions is a .zip file (no folders within) with the predictions in a .csv file named as predictions.csv (comma separated file).

IMPORTANT!

  1. The .csv file MUST be named predictions.csv
  2. The .csv file MUST have the following header: id,category
  3. The predicted labels for both subtasks should be exactly as training set.
predictions.csv file example:

id,category
0,Hope
1,Not Hope

Terms and Conditions

By downloading the data or by accessing it any manner, you agree not to redistribute the data except for non-commercial and academic-research purposes. The data must not be used for providing surveillance, analyses or research that isolates a group of individuals or any single individual for any unlawful or discriminatory purpose.

You should cite this paper if you are using our data:

  • García-Baena D, García-Cumbreras MÁ, Jiménez-Zafra SM, García-Díaz JA, Rafael VG (2023) Hope Speech Detection in Spanish: The LGTB Case. Language Resources and Evaluation pp 1–31.
  • Jiménez-Zafra, S. M., Garcia-Cumbreras, M. Á., García-Baena, D., Garcia-Díaz, J. A., Chakravarthi, B. R., Valencia-García, R., & Ureña-López, L. A. (2023). Overview of HOPE at IberLEF 2023: Multilingual Hope Speech Detection. Procesamiento del Lenguaje Natural, 71, 371-381.
  • Chakravarthi BR (2020) HopeEDI: A multilingual hope speech detection dataset for equality, diversity, and inclusion. In: Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, Association for Computational Linguistics, Barcelona, Spain (Online), pp 41–53, URL ([https://aclanthology.org/2020.peoples-1.5](https://aclanthology.org/2020.peoples-1.5)).
  • Chakravarthi, B. R., Muralidaran, V., Priyadharshini, R., Cn, S., McCrae, J. P., García, M. Á., Jiménez-Zafra, S. M., Valencia-García, R., Kumaresan, P., Ponnusamy, R., García-Baena, D., & García-Díaz, J. (2022, May). Overview of the shared task on hope speech detection for equality, diversity, and inclusion. In Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion organized as part of ACL 2022 (pp. 378-388). DOI: 10.18653/v1/2022.ltedi-1.58.
  • Balouchzahi, Fazlourrahman, Grigori Sidorov, and Alexander Gelbukh. "PolyHope: Two-level hope speech detection from tweets." Expert Systems with Applications 225 (2023): 120078.
  • Sidorov, G., Balouchzahi, F., Butt, S., & Gelbukh, A. (2023). Regret and Hope on Transformers: An Analysis of Transformers on Regret and Hope Speech Detection Datasets. Applied Sciences13(6), 3983.

Task 1 - EDI Training

Start: Feb. 16, 2024, midnight

Post-Evaluation

Start: Feb. 16, 2024, midnight

Task 2 - PolyHope Binary Training (English)

Start: Feb. 16, 2024, midnight

Task 2 - PolyHope Multiclass Training (Spanish)

Start: Feb. 16, 2024, midnight

Task 2 - PolyHope Multiclass Training (English)

Start: Feb. 16, 2024, midnight

Task 1 - EDI Testing

Start: April 1, 2024, midnight

Task 2 - PolyHope Binary Testing (Spanish)

Start: April 1, 2024, midnight

Task 2 - PolyHope Binary Testing (English)

Start: April 1, 2024, midnight

Task 2 - PolyHope Multiclass Testing (Spanish)

Start: April 1, 2024, midnight

Task 2 - PolyHope Multiclass Testing (English)

Start: April 1, 2024, midnight

Competition Ends

Never

You must be logged in to participate in competitions.

Sign In
# Username Score
1 ChauPhamQuocHung 0.73
2 olp 0.72
3 hongson04 0.71