Click:   The Last Update Time:--

胡学海

Supervisor of Doctorate Candidates
Supervisor of Master's Candidates
Name (Simplified Chinese):胡学海
Name (Pinyin):huxuehai
Administrative Position:大数据系系主任
Professional Title:Professor
Education Level:With Certificate of Graduation for Doctorate Study
Degree:Doctoral Degree in Science
Business Address:逸夫楼C609
E-Mail:
Alma Mater:武汉大学
Teacher College:College of Informatics
School/Department:信息学院
Discipline:Other specialties in Statistics    bioinformatics    
Other Contact Information:

Email:

Paper Publications
A statistical framework for predicting critical regions of p53-dependent enhancers
Release time:2021-04-30    Hits:

DOI number:10.1093/bib/bbaa053

Teaching and Research Group:生物统计

Journal:Briefings in Bioinformatics

Key Words:critical enhancer regions; computational CRISPR; p53; K-mer; TF motifs

Abstract:P53 is the ‘guardian of the genome’ and is responsible for regulating cell cycle and apoptosis. The genomic p53 binding regions, where activating transcriptional factors and cofactors like p300 simultaneously bind, are called ‘p53-dependent enhancers’, which play an important role in tumorigenesis. Current experimental assays generally provide a broad peak of each enhancer element, leaving our knowledge about critical enhancer regions (CERs) limited. Under the inspiration of enhancer dissection by CRISPR-Cas9 screen library on genome-wide p53 binding sites, here we introduce a statistical framework called ‘Computational CRISPR Strategy’ (CCS), to predict whether a given DNA fragment will be a p53-dependent CER by employing 7-mer as feature extractions along with random forest as the regressor. When training on a p53 CRISPR enhancer dataset, CCS not only accurately fitted the top-ranked enriched single guide RNAs (sgRNAs) but also successfully reproduced two known CERs that were validated by experiments. When applying it to an independent testing dataset on a tilling of a 2K-b genomic region of CRISPR-deCDKN1A-Lib, the trained model shows great generalizability by identifying a CER containing five top-ranked sgRNAs. A feature importance analysis further indicates that top-ranked 7-mers are mapped onto informative TF motifs including POU5F1 and SOX5, which are differentially enriched in p53-dependent CERs and are potential factors to make a general p53 binding site to form a p53-dependent CER, providing the interpretability of the trained model. Our results demonstrate that CCS is an alternative way of the CRISPR experiment to screen the genome for mapping p53-dependent CERs.

Indexed by:Journal paper

Translation or Not:no

Date of Publication:2021-01-01

Included Journals:SCI