Abstract
The automatic coding of clinical documentation according to diagnosis codes is a useful task in the Electronic Health Record, but a challenging one due to the large number of codes and the length of patient notes. We investigate four models for assigning multiple ICD codes to discharge summaries, and experiment with data from the MIMIC II and III clinical datasets. We present Hierarchical Attention-bidirectional Gated Recurrent Unit (HA-GRU), a hierarchical approach to tag a document by identifying the sentences relevant for each label. HA-GRU achieves state-of-the art results. Furthermore, the learned sentence-level attention layer highlights the model decision process, allows for easier error analysis, and suggests future directions for improvement.
Original language | English |
---|---|
Title of host publication | 2018 AAAI Joint Workshop on Health Intelligence (W3PHIAI 2018) |
Place of Publication | New Orleans |
Publisher | AAAI press |
Pages | 409-416 |
Number of pages | 8 |
State | Published - 2018 |