国家天元数学中部中心学术报告 | 胡懿娟 教授(Emory University)

发布时间: 2023-06-01 16:14

报告题目:Integrative Analysis of 16S Marker-Gene and Shotgun Metagenomic Sequencing Data Improves Efficiency of Testing Microbiome Hypotheses

报告时间:2023-06-06  10:00 - 11:00

报告人:胡懿娟 教授(Emory University)

报告地点:理学院东北楼四楼报告厅(404)

Abstract: The most widely used technologies for profiling microbial communities are 16S marker-gene sequencing and shotgun metagenomic sequencing. Surprisingly, many microbiome studies have performed both experiments on the same cohort of samples. The two datasets often yield consistent patterns in taxonomic profiles, highlighting the potential for an integrative analysis to improve power of testing these patterns. However, each dataset is subject to distinct experimental biases that systematically distort the measurements from their actual values in an experiment-specific manner. These experimental biases, together with partially overlapping samples and differential library sizes between the two datasets, pose tremendous challenges when combining the datasets. In this article, we introduce the first method, named LOCOM-I, for such an integrative analysis. The new method is based on our LOCOM model (Hu et al., 2022, PNAS), which employs logistic regression for testing differential abundance of taxa while remaining robust to experimental bias. Our new method combines data from both experiments for differential abundance tests, while accounting for differential experimental biases, assigning adaptive weights to each observation, and accommodating samples and taxa unique to an experiment. To benchmark the performance of the new method, we introduce two ad hoc approaches: applying LOCOM to pooled taxa count data and combining LOCOM p-values from analyzing each dataset separately. We demonstrate the uniform superiority of the new method through extensive simulation studies. An application to two real studies uncovered scientifically plausible findings that would have been missed by analyzing individual datasets.