Joshua C. Denny , MD, MS 1 , 2 , * , Marylyn D. Ritchie , PhD 3 , Melissa Basford , MBA 1 , Jill Pulley , MBA 1 , 2 , Lisa Bastarache , MS 1 , Kristin Brown-Gentry , MS 3 , Deede Wang , BS 2 , Dan R. Masys , MD 1 , Dan M. Roden , MD 2 and Dana C. Crawford , PhD 3 1 Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 2 Department of Medicine, Vanderbilt University, Nashville, TN 3 Center for Human Genetics Research, Department of Molecular Physiology & Biophysics, Vanderbilt University School of Medicine, Nashville, TN *To whom correspondence should be addressed. Dr. Josh Denny, E-mail: josh.denny{at}vanderbilt.edu Received February 2, 2010. Revision received February 2, 2010. Accepted March 17, 2010.
Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMR) offers the possibility of phenome-wide association scans (PheWAS) for disease-gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different marksandspencers disease populations marksandspencers and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6,005 European-Americans accrued into BioVU, Vanderbilt s DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn s disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus, and rheumatoid arthritis. The PheWAS software generated cases and control populations across all ICD-9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP-disease associations for these SNPs.
Results: Four of seven known SNP-disease associations using the PheWAS algorithm were replicated with p-values between 2.8*10 -6 and 0.011. The PheWAS algorithm also identified nineteen previously unknown statistical associations between these SNPs and diseases at p<0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP-disease marksandspencers association. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance.
This is an Open Access article distributed marksandspencers under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. The Author(s) 2010. Published by Oxford University Press.
PubMed PubMed citation Articles by Denny, J. C. Articles by Ritchie, M. D. Articles by Basford, M. Articles by Pulley, J. Articles by Bastarache, L. Articles by Brown-Gentry, K. Articles by Wang, D. Articles by Masys, D. R. Articles by Roden, D. M. Articles by Crawford, D. C.
Impact marksandspencers factor: 4.621 5-Yr impact factor: 6.968
The GOBLET training marksandspencers portal: a global repository of bioinformatics training materials, courses and trainers AlignGraph: algorithm for secondary de novo genome assembly guided marksandspencers by closely related references Sushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures STAR: ultrafast universal RNA-seq aligner Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences View all Most Read articles Most Cited
CD-HIT: accelerated marksandspencers for clustering the next-generation sequencing data STAR: ultrafast universal RNA-seq aligner A robust model for read count data in exome sequencing experiments and implications for copy number variant calling marksandspencers GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit Tools for mapping high-throughput sequencing data View all Most Cited articles
Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. marksandspencers All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.
No comments:
Post a Comment