TY - GEN
T1 - Learning in glaucoma genetic risk assessment
AU - Zhang, Zhuo
AU - Liu, Jiang
AU - Kwoh, Chee Keong
AU - Sim, Xueling
AU - Tay, Wan Ting
AU - Tan, Yonghua
AU - Yin, Fengshou
AU - Wong, Tien Yin
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - Genome Wide Association (GWA) studies are powerful tools to identify genes involved in common human diseases, and are becoming increasingly important in genetic epidemiology research. However, the statistical approaches behind GWA studies lack capability in taking into account the possible interactions among genetic markers; and true disease variants may be lost in statistical noise due to high threshold. A typical GWA study reports a few highly suspected signals, e.g. Single-nucleotide polymorphisms (SNPs), which usually account for a tiny portion of overall genetic risks for the disease of interest. This study proposes a computational learning approach in addition to parametric statistical methods along with a filtering mechanism, to build glaucoma genetic risk assessment model. Our data set was obtained from Singapore Malay Eye Study (SiMES), genotyped on Illumina 610quad arrays. We constructed case-control data set with 233 glaucoma and 458 healthy samples. A standard case-control association test was conducted on post-QC dataset with more than 500k SNPs. Genetic profile is constructed using genotype information from a list of 412 SNPs filtered by a relaxed pvalue threshold of 1×10-3, and forms the feature space for learning. Among the five learning algorithms we performed, Support Vector Machines with radial kernel (SVM-radial) achieved the best result, with area under curve (ROC) of 99.4% and accuracy of 95.9%. The result illustrates that, learning approach in post GWAS data analysis is able to accurately assess genetic risk for glaucoma. The approach is more robust and comprehensive than individual SNPs matching method. We will further validate our results in several other data sets obtained in consequential population studies conducted in Singapore.
AB - Genome Wide Association (GWA) studies are powerful tools to identify genes involved in common human diseases, and are becoming increasingly important in genetic epidemiology research. However, the statistical approaches behind GWA studies lack capability in taking into account the possible interactions among genetic markers; and true disease variants may be lost in statistical noise due to high threshold. A typical GWA study reports a few highly suspected signals, e.g. Single-nucleotide polymorphisms (SNPs), which usually account for a tiny portion of overall genetic risks for the disease of interest. This study proposes a computational learning approach in addition to parametric statistical methods along with a filtering mechanism, to build glaucoma genetic risk assessment model. Our data set was obtained from Singapore Malay Eye Study (SiMES), genotyped on Illumina 610quad arrays. We constructed case-control data set with 233 glaucoma and 458 healthy samples. A standard case-control association test was conducted on post-QC dataset with more than 500k SNPs. Genetic profile is constructed using genotype information from a list of 412 SNPs filtered by a relaxed pvalue threshold of 1×10-3, and forms the feature space for learning. Among the five learning algorithms we performed, Support Vector Machines with radial kernel (SVM-radial) achieved the best result, with area under curve (ROC) of 99.4% and accuracy of 95.9%. The result illustrates that, learning approach in post GWAS data analysis is able to accurately assess genetic risk for glaucoma. The approach is more robust and comprehensive than individual SNPs matching method. We will further validate our results in several other data sets obtained in consequential population studies conducted in Singapore.
UR - http://www.scopus.com/inward/record.url?scp=78650840906&partnerID=8YFLogxK
U2 - 10.1109/IEMBS.2010.5627757
DO - 10.1109/IEMBS.2010.5627757
M3 - Conference contribution
C2 - 21097154
AN - SCOPUS:78650840906
SN - 9781424441235
T3 - 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC'10
SP - 6182
EP - 6185
BT - 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC'10
T2 - 2010 32nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC'10
Y2 - 31 August 2010 through 4 September 2010
ER -