IGAP- Integrative Genome Analysis Pipeline Reveals New Gene Regulatory Model Associated with Nonspecific TF-DNA Binding Affinity
Authors
Alireza Sahaf Naeini1, Amna Farooq1, Magnar Bjørås2, and Junbai Wang1
1Department of Pathology, Norwegian Radium Hospital, PO Box 4953 Nydalen, 0424 Oslo, Norway
2Department of Cancer Res and Molecular Medicine, Norwegian University of Science and Technology (NTNU), N-7489 Trondheim, Norway
Introduction
Human genome is a regulatory jungle where regulation happens in multi-dimensional fashion. While biophysical factors like Non-specific Transcription factor Binding Affinity (nTBA) act prior to sequence level, there are factors acting at sequence level of genome and above sequence level as well. Sequence level regulatory features like enhancers, Transcription Start Site, and High Occupancy Target (HOT) regions may play equally important role in genome regulation along above sequence level features like histone modifications and intra-chromosomal interactions. This multidimensionality of regulation compels inclusion of maximum factors for proper understanding of regulatory landscape of human genome. However, complex nature of TF-DNA binding makes nTBA calculation a computationally exhaustive task. We present here a robust algorithm for calculation of nTBA. In order to integrate major biophysical, genetic and epigenetic features involved in genome regulation, we also present a user-friendly package Integrative Genome Analysis Pipeline (IGAP). This integrative approach not only helped us to segregate highly Active Genomic Zones but also lead us to discover a novel gene regulatory model explaining nTBA and HOT guided search for true transcription factor binding site.
References
- BayesPI - a new model to study protein-DNA interactions: a case study of condition-specific protein binding parameters for Yeast transcription factors. Wang J, Morigen. BMC Bioinformatics. 2009 Oct 20;10:345. doi: 10.1186/1471-2105-10-345. [PMID:19857274]
- Quality versus accuracy: result of a reanalysis of protein-binding microarrays from the DREAM5 challenge by using BayesPI2 including dinucleotide interdependence. Wang J. BMC Bioinformatics. 2014 Aug 27;15:289. doi: 10.1186/1471-2105-15-289. [PMID:25158938]
- BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations. Wang J, Batmanov K. Nucleic Acids Res. 2015 Dec 2;43(21):e147. doi: 10.1093/nar/gkv733 [PMID:26202972]
- Integrative whole-genome sequence analysis reveals roles of regulatory mutations in BCL6 and BCL2 in follicular lymphoma. Batmanov K, Wang W, Bjoras M, Delabie J, Wang J. Sci Rep. 2017 Aug 1;7(1):7040. doi: 10.1038/s41598-017-07226-4. [PMID:28765546]
- BayesPI-BAR2: A New Python Package for Predicting Functional Non-coding Mutations in Cancer Patient Cohorts. Batmanov K, Delabie J, Wang J. Front Genet. 2019 Apr 2;10:282. doi: 10.3389/fgene.2019.00282. eCollection 2019. PMID: 31001324