GSE215081- The Myc-Associated Zinc Finger Protein (MAZ) controls STAT1-mediated antiviral response by reshaping epigenome [ChIP-Seq]

samples: ChIP-Seq of HAP1/K562 control cells and MAZ knockdown/knockout cells were generated by deep sequencing in duplicate using Illumina GAIIx. Chromatin immunoprecipitation DNA-sequencing (ChIP-seq) for STAT1 and H3K4me3, H3K27Ac in HAP1 cells.

Public on Mar 25, 2024

The objectives of this study are to understand the regulatory roles of MAZ in biological processes using the NGS-deriveed ChIP-seq, DNA-MEDIP-seq and RNA-seq data in HAP1 control cells and MAZ knockout cells. Our comparative analysis of these data generated from the HAP1 control and MAZ KO cells shows that MAZ is required for recruiting STAT1 to its target sites by reshaping epigenetic landscape in the human genome, thereby mediating antiviral response cells.

Illumina HiSeq 2000 (Homo sapiens)

NIH

Xiao T, Li X, Felsenfeld G

Sample analyzed: cell line: HAP1 chip antibody: ab8580 (H3K4me3) treatment: IFNγ treatment time_(hr): 24

#—-Bowtie2 alignment to hg38

bowtie2 -x /home/deviancedev01/bowtie_hg38/index -U SRR21845243_1.fastq -p 22 | samtools view -bS > HAP1_P_IFN_IpH3K4me3.bam

37658998 reads; of these: 37658998 (100.00%) were unpaired; of these: 436065 (1.16%) aligned 0 times 26284801 (69.80%) aligned exactly 1 time 10938132 (29.05%) aligned >1 times 98.84% overall alignment rate

samtools rmdup HAP1_P_IFN_IpH3K4me3.bam HAP1_P_IFN_IpH3K4me3.rmdup.bam samtools sort HAP1_P_IFN_IpH3K4me3.rmdup.bam -o HAP1_P_IFN_IpH3K4me3.rmdup.sorted.bam samtools index HAP1_P_IFN_IpH3K4me3.rmdup.sorted.bam samtools view -b HAP1_P_IFN_IpH3K4me3.rmdup.sorted.bam chr{1..22} > HAP1_P_IFN_IpH3K4me3.rmdup.sorted.cleaned.bam

bowtie2 -x /home/deviancedev01/bowtie_hg38/index -U SRR28027158_1.fastq -p 22 | samtools view -bS > Input_HAP1_P_IFN.bam

39660312 reads; of these: 39660312 (100.00%) were unpaired; of these: 579292 (1.46%) aligned 0 times 26061381 (65.71%) aligned exactly 1 time 13019639 (32.83%) aligned >1 times 98.54% overall alignment rate

samtools rmdup Input_HAP1_P_IFN.bam Input_HAP1_P_IFN.rmdup.bam samtools sort Input_HAP1_P_IFN.rmdup.bam -o Input_HAP1_P_IFN.rmdup.sorted.bam samtools index Input_HAP1_P_IFN.rmdup.sorted.bam samtools view -b Input_HAP1_P_IFN.rmdup.sorted.bam chr{1..22} > Input_HAP1_P_IFN.rmdup.sorted.cleaned.bam

#—-MACS3 peak calls

macs3 callpeak -t HAP1_P_IFN_IpH3K4me3.rmdup.sorted.cleaned.bam -c Input_HAP1_P_IFN.rmdup.sorted.cleaned.bam -g 2913022398 -n GSE215081_hap1_ifny_h3k4me3 -B -q 0.05

INFO @ 26 Aug 2024 13:28:15: [1636 MB] #3 Pileup will be based on sequencing depth in treatment. INFO @ 26 Aug 2024 13:28:15: [1636 MB] #3 Call peaks for each chromosome… INFO @ 26 Aug 2024 13:29:13: [1636 MB] #4 Write output xls file… GSE215081_hap1_ifny_h3k4me3_peaks.xls INFO @ 26 Aug 2024 13:29:13: [1636 MB] #4 Write peak in narrowPeak format file… GSE215081_hap1_ifny_h3k4me3_peaks.narrowPeak INFO @ 26 Aug 2024 13:29:13: [1636 MB] #4 Write summits bed file… GSE215081_hap1_ifny_h3k4me3_summits.bed INFO @ 26 Aug 2024 13:29:13: [1636 MB] Done!

## Loading required package: stats4
## Loading required package: BiocGenerics
## 
## Attaching package: 'BiocGenerics'
## The following objects are masked from 'package:stats':
## 
##     IQR, mad, sd, var, xtabs
## The following objects are masked from 'package:base':
## 
##     anyDuplicated, aperm, append, as.data.frame, basename, cbind,
##     colnames, dirname, do.call, duplicated, eval, evalq, Filter, Find,
##     get, grep, grepl, intersect, is.unsorted, lapply, Map, mapply,
##     match, mget, order, paste, pmax, pmax.int, pmin, pmin.int,
##     Position, rank, rbind, Reduce, rownames, sapply, setdiff, table,
##     tapply, union, unique, unsplit, which.max, which.min
## Loading required package: S4Vectors
## 
## Attaching package: 'S4Vectors'
## The following object is masked from 'package:utils':
## 
##     findMatches
## The following objects are masked from 'package:base':
## 
##     expand.grid, I, unname
## Loading required package: IRanges
## Loading required package: GenomeInfoDb
## Loading required package: GenomicFeatures
## Loading required package: AnnotationDbi
## Loading required package: Biobase
## Welcome to Bioconductor
## 
##     Vignettes contain introductory material; view with
##     'browseVignettes()'. To cite Bioconductor, see
##     'citation("Biobase")', and for packages 'citation("pkgname")'.
## 
## ChIPseeker v1.41.1 Learn more at https://yulab-smu.top/contribution-knowledge-mining/
## 
## Please cite:
## 
## Guangchuang Yu, Li-Gen Wang, and Qing-Yu He. ChIPseeker: an
## R/Bioconductor package for ChIP peak annotation, comparison and
## visualization. Bioinformatics. 2015, 31(14):2382-2383
## clusterProfiler v4.12.5 Learn more at https://yulab-smu.top/contribution-knowledge-mining/
## 
## Please cite:
## 
## T Wu, E Hu, S Xu, M Chen, P Guo, Z Dai, T Feng, L Zhou, W Tang, L Zhan,
## X Fu, S Liu, X Bo, and G Yu. clusterProfiler 4.0: A universal
## enrichment tool for interpreting omics data. The Innovation. 2021,
## 2(3):100141
## 
## Attaching package: 'clusterProfiler'
## The following object is masked from 'package:AnnotationDbi':
## 
##     select
## The following object is masked from 'package:IRanges':
## 
##     slice
## The following object is masked from 'package:S4Vectors':
## 
##     rename
## The following object is masked from 'package:stats':
## 
##     filter
## 
## ReactomePA v1.48.0  For help: https://yulab-smu.top/biomedical-knowledge-mining-book/
## 
## If you use ReactomePA in published research, please cite:
## Guangchuang Yu, Qing-Yu He. ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Molecular BioSystems 2016, 12(2):477-479
## 
## Attaching package: 'plyranges'
## The following object is masked from 'package:clusterProfiler':
## 
##     n
## The following object is masked from 'package:AnnotationDbi':
## 
##     select
## The following object is masked from 'package:IRanges':
## 
##     slice
## The following object is masked from 'package:stats':
## 
##     filter
## Loading required package: rtracklayer
## ##############################################################################
## Pathview is an open source software package distributed under GNU General
## Public License version 3 (GPLv3). Details of GPLv3 is available at
## http://www.gnu.org/licenses/gpl-3.0.html. Particullary, users are required to
## formally cite the original Pathview paper (not just mention it) in publications
## or products. For details, do citation("pathview") within R.
## 
## The pathview downloads and uses KEGG data. Non-academic uses may require a KEGG
## license agreement (details at http://www.kegg.jp/kegg/legal.html).
## ##############################################################################
## 
##  chr1 chr10 chr11 chr12 chr13 chr14 chr15 chr16 chr17 chr18 chr19  chr2 chr20 
##  5021  1405  2030  1874   679  1188  1229  1954  2510   553  3090  2322   990 
## chr21 chr22  chr3  chr4  chr5  chr6  chr7  chr8  chr9 
##   437  1026  1948  1277  1694  1842  1950  1260  1694
## 
##  chr1 chr10 chr11 chr12 chr13 chr14 chr15 chr16 chr17 chr18 chr19  chr2 chr20 
##  5021  1405  2030  1874   679  1188  1229  1954  2510   553  3090  2322   990 
## chr21 chr22  chr3  chr4  chr5  chr6  chr7  chr8  chr9 
##   437  1026  1948  1277  1694  1842  1950  1260  1694

## >> binning method is used...2024-08-26 09:40:13 PM
## >> preparing start_site regions by gene... 2024-08-26 09:40:13 PM
## >> preparing tag matrix by binning...  2024-08-26 09:40:13 PM 
## >> preparing promoter regions...  2024-08-26 09:40:32 PM 
## >> preparing tag matrix...        2024-08-26 09:40:32 PM 
## >> binning method is used...2024-08-26 09:40:32 PM
## >> preparing start_site regions by gene... 2024-08-26 09:40:32 PM
## >> preparing tag matrix by binning...  2024-08-26 09:40:32 PM 
## >> generating figure...       2024-08-26 09:40:47 PM 
## >> done...            2024-08-26 09:40:47 PM

## >> preparing promoter regions...  2024-08-26 09:41:11 PM 
## >> preparing tag matrix...        2024-08-26 09:41:11 PM 
## >> binning method is used...2024-08-26 09:41:11 PM
## >> preparing start_site regions by gene... 2024-08-26 09:41:11 PM
## >> preparing tag matrix by binning...  2024-08-26 09:41:11 PM 
## >> generating figure...       2024-08-26 09:41:23 PM 
## >> done...            2024-08-26 09:41:24 PM

## >> binning method is used...2024-08-26 09:41:47 PM
## >> preparing body regions by gene... 2024-08-26 09:41:47 PM
## >> preparing tag matrix by binning...  2024-08-26 09:41:47 PM 
## >> preparing matrix with flank extension from (TSS-3000bp)~(TTS+3000bp)... 2024-08-26 09:41:47 PM
## >> 619 peaks(4.108589%), having lengths smaller than 800bp, are filtered... 2024-08-26 09:41:55 PM
## >> preparing promoter regions...  2024-08-26 09:44:02 PM 
## >> preparing tag matrix...        2024-08-26 09:44:02 PM 
## >> binning method is used...2024-08-26 09:44:02 PM
## >> preparing body regions by gene... 2024-08-26 09:44:02 PM
## >> preparing tag matrix by binning...  2024-08-26 09:44:02 PM 
## >> preparing matrix with flank extension from (TSS-3000bp)~(TTS+3000bp)... 2024-08-26 09:44:02 PM
## >> 619 peaks(4.108589%), having lengths smaller than 800bp, are filtered... 2024-08-26 09:44:07 PM
## >> generating figure...       2024-08-26 09:46:14 PM 
## >> done...            2024-08-26 09:46:14 PM

## >> preparing promoter regions...  2024-08-26 09:46:41 PM 
## >> preparing tag matrix...        2024-08-26 09:46:41 PM 
## >> binning method is used...2024-08-26 09:46:41 PM
## >> preparing body regions by gene... 2024-08-26 09:46:41 PM
## >> preparing tag matrix by binning...  2024-08-26 09:46:41 PM 
## >> preparing matrix with flank extension from (TSS-3000bp)~(TTS+3000bp)... 2024-08-26 09:46:42 PM
## >> 649 peaks(4.286941%), having lengths smaller than 800bp, are filtered... 2024-08-26 09:46:44 PM
## >> generating figure...       2024-08-26 09:48:50 PM 
## >> done...            2024-08-26 09:48:51 PM
## Scale for fill is already present.
## Adding another scale for fill, which will replace the existing scale.

## Warning in cnetplot.enrichResult(x, ...): Use 'color.params = list(foldChange = your_value)' instead of 'foldChange'.
##  The foldChange parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'color.params = list(edge = your_value)' instead of 'colorEdge'.
##  The colorEdge parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'cex.params = list(gene_node = your_value)' instead of 'cex_gene'.
##  The cex_gene parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'cex.params = list(gene_label = your_value)' instead of 'cex_label_gene'.
##  The cex_label_gene parameter will be removed in the next version.
## Warning: ggrepel: 1172 unlabeled data points (too many overlaps). Consider
## increasing max.overlaps

## Warning in cnetplot.enrichResult(x, ...): Use 'color.params = list(foldChange = your_value)' instead of 'foldChange'.
##  The foldChange parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'color.params = list(edge = your_value)' instead of 'colorEdge'.
##  The colorEdge parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'cex.params = list(gene_node = your_value)' instead of 'cex_gene'.
##  The cex_gene parameter will be removed in the next version.
## Warning in cnetplot.enrichResult(x, ...): Use 'cex.params = list(gene_label = your_value)' instead of 'cex_label_gene'.
##  The cex_label_gene parameter will be removed in the next version.
## Warning: ggrepel: 1192 unlabeled data points (too many overlaps). Consider
## increasing max.overlaps

## >> preparing features information...      2024-08-26 09:49:42 PM 
## >> identifying nearest features...        2024-08-26 09:49:42 PM 
## >> calculating distance from peak to TSS...   2024-08-26 09:49:42 PM 
## >> assigning genomic annotation...        2024-08-26 09:49:42 PM 
## >> adding gene annotation...          2024-08-26 09:49:46 PM
## 'select()' returned 1:many mapping between keys and columns
## >> assigning chromosome lengths           2024-08-26 09:49:47 PM 
## >> done...                    2024-08-26 09:49:47 PM

## Warning: Removed 123 rows containing non-finite outside the scale range
## (`stat_count()`).
## [1] "653635"    "100302278" "100288069" "79854"     "643837"    "148398"

R version 4.4.1 (2024-06-14) – “Race for Your Life” Copyright (C) 2024 The R Foundation for Statistical Computing Platform: x86_64-pc-linux-gnu

RStudio 2024.04.2+764 “Chocolate Cosmos” Release (e4392fc9ddc21961fd1d0efd47484b43f07a4177, 2024-06-05) for Ubuntu Jammy Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) rstudio/2024.04.2+764 Chrome/120.0.6099.291 Electron/28.3.1 Safari/537.36, Quarto 1.4.555