Skip to main content
Article
Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data
Journal of Computational Biology
  • Ling-Hong Hung, University of Washington Tacoma
  • Ka Yee Yeung, University of Washington Tacoma
  • Xiao Liang
  • William Chad Young
  • Adrian E. Rafferty
Publication Date
4-22-2019
Document Type
Article
Abstract

The inference of gene networks from large-scale human genomic data is challenging due to the difficulty in identifying correct regulators for each gene in a high-dimensional search space. We present a Bayesian approach integrating external data sources with knockdown data from human cell lines to infer gene regulatory networks. In particular, we assemble multiple data sources, including gene expression data, genome-wide binding data, gene ontology, and known pathways, and use a supervised learning framework to compute prior probabilities of regulatory relationships. We show that our integrated method improves the accuracy of inferred gene networks as well as extends some previous Bayesian frameworks both in theory and applications. We apply our method to two different human cell lines, namely skin melanoma cell line A375 and lung cancer cell line A549, to illustrate the capabilities of our method. Our results show that the improvement in performance could vary from cell line to cell line and that we might need to choose different external data sources serving as prior knowledge if we hope to obtain better accuracy for different cell lines.

DOI
10.1089/cmb.2019.0036
Publisher Policy
Pre-print, post-print
Citation Information
Ling-Hong Hung, Ka Yee Yeung, Xiao Liang, William Chad Young, et al.. "Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data" Journal of Computational Biology Vol. 26 Iss. 10 (2019) p. 1113 - 1129
Available at: http://works.bepress.com/ky-yeung/50/