Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01mg74qp90d
Title: | Large-scale Multi-output Gaussian Processes for Clinical Decision Support |
Authors: | Cheng, Li-Fang |
Advisors: | Engelhardt, Barbara E Li, Kai |
Contributors: | Electrical Engineering Department |
Keywords: | Electronic health records Gaussian process reinforcement learning time series analysis treatment effect estimation |
Subjects: | Statistics Artificial intelligence Medicine |
Issue Date: | 2019 |
Publisher: | Princeton, NJ : Princeton University |
Abstract: | In the scenario of real-time monitoring of hospital patients, high-quality inference of patients’ health status using all information available from clinical covariates and lab tests is essential to enable successful medical interventions and improve patient outcomes. Developing a computational framework that can learn from observational large-scale electronic health records (EHRs) and make accurate real-time predictions is a critical step in achieving this goal. However, existing EHRs pose several challenges for conventional methods. For instance, many covariates are sparsely sampled in time across patients. In addition, there are substantial uncertainties in patient condition and disease progression at any time. These properties make inferring the physiological status of a patient or joint analysis of time series across patients challenging. This dissertation first presents MedGP, a statistical framework that provides accurate real-time predictions of physiological states. MedGP is based on multi-output Gaussian processes, a Bayesian nonparametric model, to capture temporal structures between clinical covariates from noisy and irregularly sampled time series data. MedGP has a number of benefits over current methods. First, alignments of time series across patients are not required. Next, MedGP learns robust and interpretable relationships across covariates through regularization. Lastly, to tackle the computational bottlenecks, an approximate inference approach is derived and shown to achieve orders of speedup. The empirical results show significant improvements over baselines in making online predictions on two real-world medical data sets consist of tens of thousands of patients and millions of observations. Finally, two extended frameworks based on MedGP are explored to enhance clinical decision-making processes, including unifying with reinforcement learning algorithms for action recommendation and designing non-stationary kernels to learn the underlying dynamics of clinical treatments. Both methods demonstrate encouraging results on improving clinical practices and advancing towards precision medicine. |
URI: | http://arks.princeton.edu/ark:/88435/dsp01mg74qp90d |
Alternate format: | The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: catalog.princeton.edu |
Type of Material: | Academic dissertations (Ph.D.) |
Language: | en |
Appears in Collections: | Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Cheng_princeton_0181D_12850.pdf | 9.79 MB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.