Probability & Statistics Day 2012 Group Photo
PROBABILITY & STATISTICS DAY
Funded By: National Security Agency | Hosted By: Center for Interdisciplinary Research and Consulting
Group Photo from the 7th Annual Probability & Statistics Day at UMBC 2013
8th Annual April 18-19, 2014

Register A special feature of Probability and Statistics Day at UMBC 2014 is that the conference, including the workshop, is open to all statistics graduate students from UMBC and local universites free of charge; however, REGISTRATION IS REQUIRED! The deadline to register is Friday, April 11, 2014.   // REGISTER NOW

For more information, contact any member of the organizing committee:

Bimal Sinha
Conference Chair
443.538.3012

Kofi Adragni
  410.455.2406
Yvonne Huang
  410.455.2422
Yaakov Malinovsky
  410.455.2968
Thomas Mathew
  410.455.2418
Nagaraj Neerchal
  410.455.2437
DoHwan Park
  410.455.2408
Junyong Park
  410.455.2407
Anindya Roy
  410.455.2435
Elizabeth Stanwyck
  410.455.5731

Sponsor

Participant Information

RUI JIN

Poster: Gene Pathway selection using group lasso regularized logistic regression

Single nucleotide polymorphisms (SNPs) are the most frequent type of genetic variation between individuals, and represent a promising tool for finding genetic determinants of complex diseases. SNPs are commonly used in genome wide association studies (GWAS) where the goal is to find SNPs associated with the disease. However not the significant SNPs themselves, but their effect on the protein structure or their impact on functional sites at the protein or DNA level is a key factor for understanding the mechanisms underlying the disease. Hence, the functional consequences of SNPs are better appreciated if the evaluation is performed at the biological system level, for instance by determining their effect in the context of genes set or signaling pathways. In this study, we propose a new method to uncover the association between disease status and SNPs which are grouped into gene pathways.. Our method uses overlapping group lasso regularized logistic regression to model the joint effects of SNPs and to select key explanatory SNPs at the group level. We use resampling techniques to rank the pathways and then SNPs within pathways in the order of their significance. Our ranking approach allows the presence of overlapping groups . We use data collected as part of The Study of Addiction: Genetics and Environment (SAGE) to demonstrate the use of our methodology. The overarching goal of the data analysis is to identify novel genetic factors that contribute to addiction through a large-scale genome-wide association study of DSM-IV alcohol dependent cases and non-dependent, unrelated control subjects of European and African American descent.