Bayesian analytical approaches for metabolomics : a novel method for molecular structure-informed metabolite interaction modeling, a novel diagnostic model for differentiating myocardial infarction type, and approaches for compound identification given mass spectrometry data.
Date on Master's Thesis/Doctoral Dissertation
Interdisciplinary and Graduate Studies
Interdisciplinary Studies with a specialization in Bioinformatics, PhD
Committee Co-Chair (if applicable)
Park, Juw Won
Bayesian statistics; metabolomics; mass spectrometry; interactome; statistics
Metabolomics, the study of small molecules in biological systems, has enjoyed great success in enabling researchers to examine disease-associated metabolic dysregulation and has been utilized for the discovery biomarkers of disease and phenotypic states. In spite of recent technological advances in the analytical platforms utilized in metabolomics and the proliferation of tools for the analysis of metabolomics data, significant challenges in metabolomics data analyses remain. In this dissertation, we present three of these challenges and Bayesian methodological solutions for each. In the first part we develop a new methodology to serve a basis for making higher order inferences in metabolomics, which we define as the testing of hypotheses that are more complex than single metabolite hypothesis tests. This methodology utilizes informative priors that are generated via the analysis of molecular structure similarity to enable the estimation of metabolite "interactomes" (or probabilistic models) which are organism-, sample media-, and condition-specific as well as comprehensive; and that can serve as reference models for studying perturbations in metabolic systems. After discussing the development of our methodology, we present an evaluation of its performance conducted using simulation studies, and we use the methodology for estimating a plasma metabolite interactome for stable heart disease. This interactome may serve as a reference model for evaluating systems-level changes that occur with acute disease events such as myocardial infarction (MI) or unstable angina. In the second part of this work, we present the challenge of developing diagnostic classification models which utilize metabolite abundances and that do not "overfit" relatively small sample sizes, especially given the high dimensionality of metabolite data acquired using platforms such as liquid chromatography-mass spectrometry. We use a Bayesian methodology for estimating a multinomial logistic regression classifier for the detection and discrimination of the subtype of acute myocardial infarction utilizing metabolite abundance data quantified from blood plasma. As heart disease is the leading cause of global mortality, a blood-based and non-invasive diagnostic test that could differentiate between MI types at the time of the event would have great utility. In the final part of this dissertation we review Bayesian approaches for compound identification in metabolomics experiments that utilize liquid chromatography-mass spectrometry which remains a challenging problem.
Trainor, Patrick J., "Bayesian analytical approaches for metabolomics : a novel method for molecular structure-informed metabolite interaction modeling, a novel diagnostic model for differentiating myocardial infarction type, and approaches for compound identification given mass spectrometry data." (2018). Electronic Theses and Dissertations. Paper 3023.
Analytical Chemistry Commons, Applied Statistics Commons, Biochemical Phenomena, Metabolism, and Nutrition Commons, Biochemistry Commons, Bioinformatics Commons, Biostatistics Commons, Cardiology Commons, Cardiovascular Diseases Commons, Computational Biology Commons, Diagnosis Commons, Medical Biomathematics and Biometrics Commons, Medical Molecular Biology Commons, Molecular Biology Commons, Programming Languages and Compilers Commons, Statistical Methodology Commons, Systems Biology Commons, Theory and Algorithms Commons