TY - GEN
T1 - NeuralHMM
T2 - 2023 International Joint Conference on Neural Networks, IJCNN 2023
AU - Liu, Yuxi
AU - Zhang, Zhenhao
AU - Qin, Shaowen
PY - 2023/8/2
Y1 - 2023/8/2
N2 - Health risk refers to the probability of the occurrence of a specific health outcome for a specific patient. Interest in health risk prediction has been increasing, especially with the availability of a large amount of electronic health records (EHR). An EHR contains multivariate time series data that records meaningful information associated with a chronological set of clinical events for each patient. Recurrent neural networks (RNN) and hidden Markov models (HMM) have been widely used as generative models of time series data. RNN-based models have strong prediction performance but lack transparency. HMMs have a simple functional form and the ability to provide an intuitive probabilistic interpretation, but their state dynamics are 'memoryless', making it difficult to thoroughly take into account the irregularity in patients' health trajectory. This paper proposes a novel deep Markov network for health risk prediction. The method integrates two modules, a GRU (Gated Recurrent Unit) with attention mechanism and a Neural HMM, into a single network. The GRU generates the inputs required for health risk predictions and uses an attention mechanism to create memorable state dynamics for the Neural HMM. The Neural HMM then provides interpretable structured representations through training. Mixture Density Networks are incorporated in the Neural HMM, which contribute to the modeling of complex patterns found in the transition process. Furthermore, an inference network is designed to embed hidden state representations of GRU and Neural HMM into the same space. The inference network enables the two types of representations to learn from each other during the decoding process of Neural HMM, thereby improving the quality of interpretable structured representations. Experimental results on MIMIC-III and eICU datasets demonstrate that our method can outperform state-of-the-art methods and provide transparency of the model decisions.
AB - Health risk refers to the probability of the occurrence of a specific health outcome for a specific patient. Interest in health risk prediction has been increasing, especially with the availability of a large amount of electronic health records (EHR). An EHR contains multivariate time series data that records meaningful information associated with a chronological set of clinical events for each patient. Recurrent neural networks (RNN) and hidden Markov models (HMM) have been widely used as generative models of time series data. RNN-based models have strong prediction performance but lack transparency. HMMs have a simple functional form and the ability to provide an intuitive probabilistic interpretation, but their state dynamics are 'memoryless', making it difficult to thoroughly take into account the irregularity in patients' health trajectory. This paper proposes a novel deep Markov network for health risk prediction. The method integrates two modules, a GRU (Gated Recurrent Unit) with attention mechanism and a Neural HMM, into a single network. The GRU generates the inputs required for health risk predictions and uses an attention mechanism to create memorable state dynamics for the Neural HMM. The Neural HMM then provides interpretable structured representations through training. Mixture Density Networks are incorporated in the Neural HMM, which contribute to the modeling of complex patterns found in the transition process. Furthermore, an inference network is designed to embed hidden state representations of GRU and Neural HMM into the same space. The inference network enables the two types of representations to learn from each other during the decoding process of Neural HMM, thereby improving the quality of interpretable structured representations. Experimental results on MIMIC-III and eICU datasets demonstrate that our method can outperform state-of-the-art methods and provide transparency of the model decisions.
KW - deep Markov model
KW - electronic health records
KW - health risk prediction
KW - model transparency
KW - patient representation learning
UR - http://www.scopus.com/inward/record.url?scp=85169605770&partnerID=8YFLogxK
U2 - 10.1109/IJCNN54540.2023.10191594
DO - 10.1109/IJCNN54540.2023.10191594
M3 - Conference contribution
AN - SCOPUS:85169605770
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - IJCNN 2023 - International Joint Conference on Neural Networks, Proceedings
PB - Institute of Electrical and Electronics Engineers
Y2 - 18 June 2023 through 23 June 2023
ER -