MedType: Improving Medical Entity Linking with Semantic Type Prediction

Vashishth, Shikhar; Joshi, Rishabh; Dutt, Ritam; Newman-Griffis, Denis; Rose, Carolyn

Computer Science > Computation and Language

arXiv:2005.00460v1 (cs)

[Submitted on 1 May 2020 (this version), latest version 22 Aug 2021 (v4)]

Title:MedType: Improving Medical Entity Linking with Semantic Type Prediction

Authors:Shikhar Vashishth, Rishabh Joshi, Ritam Dutt, Denis Newman-Griffis, Carolyn Rose

View PDF

Abstract:Medical entity linking is the task of identifying and standardizing concepts referred in a scientific article or clinical record. Existing methods adopt a two-step approach of detecting mentions and identifying a list of candidate concepts for them. In this paper, we probe the impact of incorporating an entity disambiguation step in existing entity linkers. For this, we present MedType, a novel method that leverages the surrounding context to identify the semantic type of a mention and uses it for filtering out candidate concepts of the wrong types. We further present two novel largescale, automatically-created datasets of medical entity mentions: WIKIMED, a Wikipediabased dataset for cross-domain transfer learning, and PUBMEDDS, a distantly-supervised dataset of medical entity mentions in biomedical abstracts. Through extensive experiments across several datasets and methods, we demonstrate that MedType pre-trained on our proposed datasets substantially improve medical entity linking and gives state-of-the-art performance. We make our source code and datasets publicly available for medical entity linking research.

Comments:	14 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2005.00460 [cs.CL]
	(or arXiv:2005.00460v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.00460

Submission history

From: Shikhar Vashishth [view email]
[v1] Fri, 1 May 2020 15:55:50 UTC (2,725 KB)
[v2] Wed, 16 Sep 2020 15:07:32 UTC (1,506 KB)
[v3] Thu, 11 Feb 2021 23:10:29 UTC (8,025 KB)
[v4] Sun, 22 Aug 2021 06:53:08 UTC (2,718 KB)

Computer Science > Computation and Language

Title:MedType: Improving Medical Entity Linking with Semantic Type Prediction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MedType: Improving Medical Entity Linking with Semantic Type Prediction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators