Clinical analogy resolution performance for foundation language models

dc.catalogador: vzp
dc.contributor.author: Villena, Fabián
dc.contributor.author: Quiroga Curin, Tamara Nancy
dc.contributor.author: Dunstan Escudero, Jocelyn Mariel
dc.date.accessioned: 2025-03-21T16:39:56Z
dc.date.available: 2025-03-21T16:39:56Z
dc.date.issued: 2024
dc.description.abstract: Using extensive data sources to create foundation language models has revolutionized the performance of deep learning-based architectures. This remarkable improvement has led to state-of-the-art results for various downstream NLP tasks, including clinical tasks. However, more research is needed to measure model performance intrinsically, especially in the clinical domain. We revisit the use of analogy questions as an effective method to measure the intrinsic performance of language models for the clinical domain in English. We tested multiple Transformer-based language models over analogy questions constructed from the Unified Medical Language System (UMLS), a massive knowledge graph of clinical concepts. Our results show that large language models are significantly more performant for analogy resolution than small language models. Similarly, domain-specific language models perform better than general-domain language models. We also found a correlation between intrinsic and extrinsic performance, validated through the PubMedQA extrinsic task. Creating clinical-specific and language-specific language models is essential for advancing biomedical and clinical NLP and will ensure a valid application in clinical practice. Finally, given that our proposed intrinsic test is based on a term graph available in multiple languages, the dataset can be rebuilt to measure the performance of models in languages other than English.
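As context for the abstract, analogy-resolution tests of the form a : b :: c : ? are commonly scored by vector arithmetic over term embeddings. The sketch below is illustrative only and is not taken from the paper; the embedding model, the example clinical terms, and the candidate list are assumptions.

```python
# Illustrative sketch of analogy resolution via embedding arithmetic.
# The model name and terms are hypothetical, not those used in the paper.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder general-domain encoder

# Analogy: "insulin" is to "diabetes mellitus" as "levothyroxine" is to ?
a, b, c = "insulin", "diabetes mellitus", "levothyroxine"
candidates = ["hypothyroidism", "hypertension", "asthma", "anemia"]

terms = [a, b, c] + candidates
emb = dict(zip(terms, model.encode(terms)))

# Score each candidate by cosine similarity to the offset vector b - a + c.
target = emb[b] - emb[a] + emb[c]

def cos(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

best = max(candidates, key=lambda d: cos(emb[d], target))
print(best)  # "hypothyroidism" if the embeddings capture the treats-relation
```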
dc.format.extent: 13 pages
dc.fuente.origen: ORCID
dc.identifier.doi: 10.1145/3709155
dc.identifier.eissn: 2637-8051
dc.identifier.uri: https://doi.org/10.1145/3709155
dc.identifier.uri: https://repositorio.uc.cl/handle/11534/102927
dc.information.autoruc: Escuela de Ingeniería; Quiroga Curin, Tamara Nancy; S/I; 1207385
dc.information.autoruc: Escuela de Ingeniería; Dunstan Escudero, Jocelyn Mariel; S/I; 1285723
dc.language.iso: en
dc.nota.acceso: partial content
dc.revista: ACM Transactions on Computing for Healthcare
dc.rights: restricted access
dc.subject: Information systems
dc.subject: Language models
dc.subject: Applied computing
dc.subject: Health informatics
dc.subject: Computing methodologies
dc.subject: Natural language processing
dc.subject.ddc: 610
dc.subject.dewey: Medicine and health
dc.subject.ods: 03 Good health and well-being
dc.subject.odspa: 03 Salud y bienestar
dc.title: Clinical analogy resolution performance for foundation language models
dc.type: article
sipa.codpersvinculados: 1207385
sipa.codpersvinculados: 1285723
sipa.trazabilidad: ORCID; 2025-03-03