This is repo for the SemEval-2024 Task 2 paper "Evaluating Clinical Inference Capabilities of Large Language Models". This work is rather a survey to evaluate the success of LLMs, examines LLM success in clinical domain by dissecting the results on dev set. We also classified some interesting examples related to medical domain such as medical abbreviations and general NLU such as numerical expression evaluation. Allld etails can be found in our paper.
-
Notifications
You must be signed in to change notification settings - Fork 0
License
DuyguA/SemEval2024_NLI4CT
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published