• Gold M - Skip to Main Content.
  • University of Minnesota
  • Search U of M
  • CSE Home
  • IT Home
  • Directories
  • One Stop
  • myU
Computer Science & Engineering
Prospective Students
Current Students
Alumni
Industry

Computer Science & Engineering

  • Department Info
    • About Us
    • Contact Info
    • Department News
    • Giving
  •  
  • Admissions
    • Undergraduate
    • Graduate
  •  
  • Academics
    • Undergraduate
    • Graduate
  •  
  • People
    • Faculty
    • Graduate Students
  •  
  • Research
    • Research Areas
    • Tech Reports
    • Related Centers
  •  
  • Resources
    • Forms
    • Systems Help
    • Faculty Portal locked external link
    • Computing Facilities
    • Department Wiki locked external link
    • Employment
  •  
  • Site Map
  •  
  •  
Institute of Technology Logo
Home > Research > Tech Reports
Browse reports by year:
[ ALL 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 ]
Browse report authors:
[ ALL A B C D E F G H I J K L M N O P Q R S T U V W X Y Z ]
Browse reports by title:
[ ALL A B C D E F G H I J K L M N O P Q R S T U V W X Y Z ]

University of Minnesota - Computer Science and Engineering Technical Report Abstract

Identifying Clinical and Genetic Markers of Human Disease by Classifying Features on Graphs

Report Number: 07-021
Date of Submission: 9/26/2007

Authors:
   
   
   
   
   
   

View Report:
   PDF format

Abstract:

Identification of clinical and genetic markers of disease can provide crucial information for both disease treatment and etiology. This complex task involves associating high-dimensional patterns such as largescale gene expressions and single nucleotide polymorphisms (SNPs) with disease-related phenotypes using very few samples. We introduce a new graph-based semi-supervised feature classification algorithm to identify discriminative patterns by learning on bipartite graphs built from clinical variables, gene expressions and SNPs. Instead of performing feature selection or unsupervised bi-clustering, our algorithm directly classifies the feature nodes in a bipartite graph as positive, negative or neutral with network propagation, which captures the interactions between both samples and features (clinical and genetic variables) by exploring the global structure of the graph. Although globally optimized for classifying the features, our algorithm can also simultaneously classify the test samples for disease prognosis/diagnosis. We apply our algorithm to studying the Rosetta breast cancer dataset and chronic fatigue syndrome on a CAMDA contest dataset. Our algorithm identifies interesting clinical and genetic markers, some of which are consistent with previous studies in the literature, and achieves better overall classification performance than support vector machines and Bayesian networks. (Supplemental website: http://compbio.cs.umn.edu/Feature_Class/.)

Related Links

  • U of M Research centers and institutes
  • Undergraduate Research Opportunities Program
  • Experts@Minnesota
  • Office of Graduate School Outreach
  • IT Faculty & research
  • Colloquia
  • Talks

 

  • ©2006 - 2009 Regents of the University of Minnesota. All rights reserved.
  • Privacy
  • Contact U of M
  • Contact CSE
  • CSE Employment
  • Site Map
  • The University of Minnesota is an equal opportunity educator and employer.
  • Last modified on July 23, 2008