LMU-BioNLP at SemEval-2024 Task 2: Large Diverse Ensembles for Robust Clinical NLI

Zihang Sun; Danqi Yan; Anyi Wang; Tanalp Agustoslu; Qi Feng; Chengzhi Hu; Longfei Zuo; Shijia Zhou; Hermine Kleiner; Pingjun Hong; Suteera Seeha; Sebastian Loftus; Anna Susanna Barwig; Oliver Kraus; Jona Voholonsky; Yang Sun; Leopold Martin; Lena Altinger; Jing Wang; Leon Weber-Genzel

2024 SEMEVAL SemEval 2024

LMU-BioNLP at SemEval-2024 Task 2: Large Diverse Ensembles for Robust Clinical NLI

Abstract

AbstractIn this paper, we describe our submission for the NLI4CT 2024 shared task on robust Natural Language Inference over clinical trial reports. Our system is an ensemble of nine diverse models which we aggregate via majority voting. The models use a large spectrum of different approaches ranging from a straightforward Convolutional Neural Network over fine-tuned Large Language Models to few-shot-prompted language models using chain-of-thought reasoning.Surprisingly, we find that some individual ensemble members are not only more accurate than the final ensemble model but also more robust.

👥 Mega-Team — 20 authors

🌉 Interdisciplinary Bridge — Healthcare & Medicine and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Topics

Healthcare & Medicine > Clinical > Clinical NLP Natural Language Processing > Applications > Natural Language Inference Machine Learning > Learning Types > Ensemble Learning Machine Learning > Learning Types > Ensemble Methods Machine Learning > Core Methods > Ensemble Learning

Keywords

few-shot learning ensemble learning natural language inference chain-of-thought reasoning clinical text clinical trial report clinical natural language inference large language model

Download PDF

CLTeam1 at SemEval-2024 Task 10: Large Language Model based ensemble for Emotion Detection in Hinglish 2024

ignore at SemEval-2024 Task 5: A Legal Classification Model with Summary Generation and Contrastive Learning 2024

LMEME at SemEval-2024 Task 4: Teacher Student Fusion - Integrating CLIP with LLMs for Enhanced Persuasion Detection 2024

GeminiPro at SemEval-2024 Task 9: BrainTeaser on Gemini 2024

LMU-BioNLP at SemEval-2024 Task 2: Large Diverse Ensembles for Robust Clinical NLI

Abstract

Authors

Topics

Keywords

Related papers