Unsupervised detection of genes of influence in lung cancer using biological networks

Subscribe to email list

Please select the email list(s) to which you wish to subscribe.

User menu

You are here

Unsupervised detection of genes of influence in lung cancer using biological networks

TitleUnsupervised detection of genes of influence in lung cancer using biological networks
Publication TypeJournal Article
Year of Publication2011
AuthorsGoldenberg, A, Mostafavi, S, Quon, G, Boutros, PC, Morris, QD
JournalBIOINFORMATICS
Volume27
Pagination3166-3172
Date PublishedNOV 15
Type of ArticleArticle
ISSN1367-4803
AbstractMotivation: Lung cancer is often discovered long after its onset, making identifying genes important in its initiation and progression a challenge. By the time the tumors are discovered, we only observe the final sum of changes of the few genes that initiated cancer and thousands of genes that they have influenced. Gene interactions and heterogeneity of samples make it difficult to identify genes consistent between different cohorts. Using gene and gene-product interaction networks, we propose a principled approach to identify a small subset of genes whose network neighbors exhibit consistently high expression change ( in cancerous tissue versus normal) regardless of their own expression. We hypothesize that these genes can shed light on the larger scale perturbations in the overall landscape of expression levels. Results: We benchmark our method on simulated data, and show that we can recover a true gene list in noisy measurement data. We then apply our method to four non-small cell lung cancer and two pancreatic cancer cohorts, finding several genes that are consistent within all cohorts of the same cancer type. Conclusion: Our model is flexible, robust and identifies gene sets that are more consistent across cohorts than several other approaches. Additionally, our method can be applied on a per-patient basis not requiring large cohorts of patients to find genes of influence. Our approach is generally applicable to gene expression studies where the goal is to identify a small set of influential genes that may in turn explain the much larger set of genome-wide expression changes.
DOI10.1093/bioinformatics/btr533