Estimation of Directed Acyclic Graphs Through Two-Stage Adaptive Lasso for Gene Network Inference

Sung Won Han, Gong Chen, Myun Seok Cheon, Hua Zhong

Research output: Contribution to journalArticlepeer-review

32 Citations (Scopus)

Abstract

Graphical models are a popular approach to find dependence and conditional independence relationships between gene expressions. Directed acyclic graphs (DAGs) are a special class of directed graphical models, where all the edges are directed edges and contain no directed cycles. The DAGs are well known models for discovering causal relationships between genes in gene regulatory networks. However, estimating DAGs without assuming known ordering is challenging due to high dimensionality, the acyclic constraints, and the presence of equivalence class from observational data. To overcome these challenges, we propose a two-stage adaptive Lasso approach, called NS-DIST, which performs neighborhood selection (NS) in stage 1, and then estimates DAGs by the discrete improving search with Tabu (DIST) algorithm within the selected neighborhood. Simulation studies are presented to demonstrate the effectiveness of the method and its computational efficiency. Two real data examples are used to demonstrate the practical usage of our method for gene regulatory network inference. Supplementary materials for this article are available online.

Original languageEnglish
Pages (from-to)1004-1019
Number of pages16
JournalJournal of the American Statistical Association
Volume111
Issue number515
DOIs
Publication statusPublished - 2016 Jul 2

Keywords

  • Directed acyclic graphs
  • Lasso estimation
  • Neighborhood selection
  • Probabilistic graphical model
  • Structure equation model

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Estimation of Directed Acyclic Graphs Through Two-Stage Adaptive Lasso for Gene Network Inference'. Together they form a unique fingerprint.

Cite this