Topic modeling identifies novel genetic loci associated with multimorbidities in UK Biobank

Cell genomics(2023)

引用 0|浏览12
暂无评分
摘要
Many diseases show patterns of co-occurrence, possibly driven by systemic dysregulation of underlying pro-cesses affecting multiple traits. We have developed a method (treeLFA) for identifying such multimorbidities from routine health-care data, which combines topic modeling with an informative prior derived from medical ontology. We apply treeLFA to UK Biobank data and identify a variety of topics representing multimorbidity clusters, including a healthy topic. We find that loci identified using topic weights as traits in a genome-wide association study (GWAS) analysis, which we validated with a range of approaches, only partially overlap with loci from GWASs on constituent single diseases. We also show that treeLFA improves upon existing methods like latent Dirichlet allocation in various ways. Overall, our findings indicate that topic models can charac-terize multimorbidity patterns and that genetic analysis of these patterns can provide insight into the etiology of complex traits that cannot be determined from the analysis of constituent traits alone.
更多
查看译文
关键词
multimorbidity,topic modeling,treeLFA,topic-GWAS,UK Biobank
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要