In partial fulfillment of the requirements for the degree of
Doctor of Philosophy in Bioinformatics
in the School of Biological Sciences
Ruoyu Tian
Defends her thesis:Fine-mapping of genetic regulatory variants in human
Thursday, May 7th, 2020
9:00 AM Eastern Timehttps://bluejeans.com/831247632
Thesis Advisor:
Dr. Greg Gibson
School of Biological Sciences
Georgia Institute of Technology
Committee Members:
Dr. Ciaran M. Lee
APC Microbiome Ireland
University College Cork
Dr. James Dahlman
School of Biomedical Engineering
Georgia Institute of Technology
Dr. Melissa Kemp
School of Biomedical Engineering
Georgia Institute of Technology
Dr. King Jordan
School of Biological Sciences
Georgia Institute of Technology
Abstract
The majority of GWAS (Genome-Wide Association Study) identified common genetic variants map to regulatory regions of gene, and are likely to influence disease risk by affecting gene expression. One of the most important challenges is to experimentally fine-map causal regulatory variants that typically lie in credible intervals of 100 or more variants. Another large proportion of genetic variants, rare variants, are expected to have large effects causing disease in individual, but are not detectable in GWAS. Herein, I provide both experimental and computational approaches for fine-mapping common and rare genetic variants accounting for medium and large effect on population or individual. First, I describe a single cell clone-based strategy for targeted single-nucleotide polymorphism (SNP) evaluation wherein microindels are introduced by CRISPR/Cas9. Multiple constraints, including the variability in mutability, clonal genotype and expense, render this approach infeasible for fine-mapping 10%-20% moderate effect size expression SNPs (eSNPs), which is also validated in a simulation study. Subsequently, I switch to a moderate-throughput parallel screening tool that characterizes multiplexed CRISPR/Cas9 perturbed transcriptomes by single-cell RNA-seq, called"expression CROP-seq". Two causal SNPs, rs2251039 and rs35675666, are identified that significantly alter the expression of CISD1 and PARK7, respectively. The sites overlap with chromatin accessibility peaks and are risk loci of inflammatory bowel disease. Expression CROP-seq reduces the variability identified in previous method and is powerful to screen genetic regulatory variants within credible intervals. Finally, to extend its application to rare variants, I develop a novel gene categorization system according to gene intolerance to promoter polymorphism and depletion of rare regulatory variants with GTEx v8 data. 49 GTEx tissues are clustered into functional groups with gene features. It supports the use of tissue-gene genomic annotation for prioritization of GWAS tagged risk loci. In summary, this work comprehensively describes and evaluates two CRISPR/Cas9-based eSNP screening systems. The use of rare regulatory variants in gene classification with tissue information demonstrates its potential in rare disease diagnoses. Both researches inevitably contribute to the genetic interpretation of human complex disease and personalized medicine in post-GWAS era.
News images
Student News