figshare
Browse

A systematic machine learning and data type comparison yields robust metagenomic predictors of infant age, sex, breastfeeding, antibiotic usage, country of origin, and delivery type

Published on by Braden Tierney
The microbiome is a new frontier for building predictors of human phenotypes. However, the practice of machine learning of human phenotype as a function of microbiome features is fraught with issues of reproducibility. A threat to reproducibility is the wide range of analytic models and microbiome data types available. We aimed to build robust metagenomic predictors of host phenotype by comparing prediction performances and biological interpretation across 8 machine learning methods and 5 different types of metagenomic data. Using 1,570 samples from 300 infants, we fit 7,865 models for 6 host phenotypes. We demonstrate the dependence of accuracy on algorithm choice and feature definition in microbiome data, and propose a framework for building microbiome-derived indicators of host phenotype. We additionally identified biological features predictive of age, sex, breastfeeding status, historical antibiotic usage, country of origin, and delivery type. Our complete results can be viewed at http://apps.chiragjpgroup.org/ubiome_predictions/.

Cite items from this project

DataCite
3 Biotech
3D Printing in Medicine
3D Research
3D-Printed Materials and Systems
4OR
AAPG Bulletin
AAPS Open
AAPS PharmSciTech
Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg
ABI Technik (German)
Academic Medicine
Academic Pediatrics
Academic Psychiatry
Academic Questions
Academy of Management Discoveries
Academy of Management Journal
Academy of Management Learning and Education
Academy of Management Perspectives
Academy of Management Proceedings
Academy of Management Review

cite all items

Share

email