DuETT: Dual Event Time Transformer for Electronic Health Records

Labach, Alex; Pokhrel, Aslesha; Huang, Xiao Shi; Zuberi, Saba; Yi, Seung Eun; Volkovs, Maksims; Poutanen, Tomi; Krishnan, Rahul G.

arXiv:2304.13017 (cs)

[Submitted on 25 Apr 2023 (v1), last revised 15 Aug 2023 (this version, v2)]

Abstract: Electronic health records (EHRs) recorded in hospital settings typically contain a wide range of numeric time series data that is characterized by high sparsity and irregular observations. Effective modelling for such data must exploit its time series nature, the semantic relationship between different types of observations, and information in the sparsity structure of the data. Self-supervised Transformers have shown outstanding performance in a variety of structured tasks in NLP and computer vision. But multivariate time series data contains structured relationships over two dimensions: time and recorded event type, and straightforward applications of Transformers to time series data do not leverage this distinct structure. The quadratic scaling of self-attention layers can also significantly limit the input sequence length without appropriate input engineering. We introduce the DuETT architecture, an extension of Transformers designed to attend over both time and event type dimensions, yielding robust representations from EHR data. DuETT uses an aggregated input where sparse time series are transformed into a regular sequence with fixed length; this lowers the computational complexity relative to previous EHR Transformer models and, more importantly, enables the use of larger and deeper neural networks. When trained with self-supervised prediction tasks that provide rich and informative signals for model pre-training, our model outperforms state-of-the-art deep learning models on multiple downstream tasks from the MIMIC-IV and PhysioNet-2012 EHR datasets.
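The aggregated input described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `bin_events`, the use of equal-width time bins, the mean as the per-bin aggregate, and the separate count grid are all assumptions chosen for illustration. The point is only that an irregular list of observations becomes a fixed-size grid over time bins and event types, so the sequence length no longer depends on how many observations a patient has.

```python
from typing import List, Tuple

Event = Tuple[float, int, float]  # (time in hours, event type index, value)


def bin_events(
    events: List[Event], n_event_types: int, n_bins: int, t_max: float
) -> Tuple[List[List[float]], List[List[int]]]:
    """Aggregate irregular observations into a fixed n_bins x n_event_types grid.

    Hypothetical helper: equal-width bins and mean aggregation are assumptions.
    Returns (values, counts). values[b][e] is the mean of the observations of
    event type e that fall in time bin b (0.0 when there are none), and
    counts[b][e] is how many there were, which keeps the sparsity structure.
    """
    sums = [[0.0] * n_event_types for _ in range(n_bins)]
    counts = [[0] * n_event_types for _ in range(n_bins)]
    for t, e, v in events:
        if not 0.0 <= t <= t_max:
            continue  # skip observations outside the window
        # Map the timestamp to a bin; t == t_max goes into the last bin.
        b = min(int(t / t_max * n_bins), n_bins - 1)
        sums[b][e] += v
        counts[b][e] += 1
    values = [
        [sums[b][e] / counts[b][e] if counts[b][e] else 0.0
         for e in range(n_event_types)]
        for b in range(n_bins)
    ]
    return values, counts


# Four observations over 24 hours, two event types, four 6-hour bins.
events = [(0.5, 0, 80.0), (0.7, 0, 90.0), (3.2, 1, 1.2), (23.9, 0, 70.0)]
values, counts = bin_events(events, n_event_types=2, n_bins=4, t_max=24.0)
```

With this layout, attention along the time axis works over `n_bins` positions and attention along the event axis works over `n_event_types` positions, regardless of the raw number of observations.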

Comments:	Accepted at MLHC 2023, camera-ready version
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2304.13017 [cs.LG]
	(or arXiv:2304.13017v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2304.13017

Submission history

From: Alex Labach
[v1] Tue, 25 Apr 2023 17:47:48 UTC (121 KB)
[v2] Tue, 15 Aug 2023 21:02:34 UTC (67 KB)
