Overview

Lipid mediators are signaling molecules derived from fatty acids that regulate inflammation and immune responses. By binding to cell-surface receptors, lipid mediators can regulate transcription and alter DNA methylation. The goal of this study is to perform a cross-sectional analysis of patients who tested positive for antibodies to cyclic citrullinated peptide (CCP) and to determine whether specific CpG sites are associated with individual lipid mediators.

Methods

Forty-seven participants have both DNA methylation (DNAm) and lipid mediator data for at least one and up to three visits across three cell types. Initial quality control included adjusting for batch and applying a range filter to the CpGs from Illumina’s EPICv1 BeadChip array.

For each of the 26 lipid mediators (LMs), linear models were fit with the LM as the predictor of interest and the M-value of each CpG as the outcome, adjusting for age, sex, and batch. Batch (binary: batch 1 or other) was included as a covariate because PCA plots suggested a cell-specific batch effect (2.1). A separate model was run for each of the three cell types: Bcell, Tmem, and Tnai. Probes were then ranked from smallest to largest p-value using the results of this probe-level analysis. Set-enrichment analysis was performed with the methylRRA() function from the methylGSA R package. All defaults were used except for the gene set size limits: gene sets were required to contain at least 10 and at most 1000 genes. These parameters were the same for both KEGG and GO (3.1, 3.2). To correct for multiple testing, the Benjamini-Hochberg false discovery rate (FDR) was used, with a significance cut-off of FDR < 0.05.
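
The probe-level step can be illustrated with a minimal sketch on simulated data. This is not the pipeline used for the report: base R lm() stands in for whatever fitting routine was actually used, and every object name (covars, lm_level, batch1, m_values) is a hypothetical placeholder.

```r
# Minimal sketch with simulated data; all object names are hypothetical.
set.seed(1)
n_samples <- 47
n_cpgs <- 200

# Covariates: one lipid mediator, age, sex, and a binary batch indicator
covars <- data.frame(
  lm_level = rnorm(n_samples),
  age      = rnorm(n_samples, mean = 58, sd = 10),
  sex      = factor(sample(c("F", "M"), n_samples, replace = TRUE)),
  batch1   = factor(sample(c("batch1", "other"), n_samples, replace = TRUE))
)

# Simulated beta values (CpGs in rows, samples in columns), converted to M-values
beta <- matrix(runif(n_cpgs * n_samples, 0.05, 0.95), nrow = n_cpgs,
               dimnames = list(paste0("cpg", seq_len(n_cpgs)), NULL))
m_values <- log2(beta / (1 - beta))

# Fit one linear model per CpG and keep the lipid mediator coefficient
fit_one_cpg <- function(m) {
  fit <- lm(m ~ lm_level + age + sex + batch1, data = covars)
  coef(summary(fit))["lm_level", c("Estimate", "Pr(>|t|)")]
}
res <- as.data.frame(t(apply(m_values, 1, fit_one_cpg)))
colnames(res) <- c("estimate", "p_value")

# Benjamini-Hochberg adjustment, then rank from smallest to largest p-value
res$fdr <- p.adjust(res$p_value, method = "BH")
res <- res[order(res$p_value), ]
head(res)
```

In this sketch the same loop would be repeated for each lipid mediator and each cell type, and the ranked p-values are what a set-enrichment step would take as input.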

Results

Lipid Mediator Biological Group Table

Click Here to Download the lipid mediator pathways File

Summary Statistics

CpG and Sample Size Summary

Below is a file that contains information about the sample size and number of CpGs for the dataset used in this analysis. We started with 865,859 probes, and between 21 and 23 percent of probes were filtered out due to low quality. We also excluded probes on the sex chromosomes for this analysis. We used 47 samples in our model, which was the same across Bcells, Tmem, and Tnai cells. These 47 samples are from the TIPRA cohort and are all CCP positive.

Click Here to Download the Summary Statistics File

Patient Demographics

Below is a summary of several clinical variables at visit 1, the visit used in this analysis. The participants are mostly non-Hispanic white females, with a mean age of 58.47 years.

Click Here to Download the Summary Statistics File

DMP Intersections Between Cell Types

Below is a table showing how significant CpGs intersect between cell types after applying an FDR threshold of 0.05. Most CpGs are significant in only one cell type, some are significant in all three cell types, and a small number are significant in exactly two of the three.

View CpG Intersection PDF
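
A tally like the one in this table can be produced from per-cell-type lists of significant CpGs. The following is a minimal sketch with made-up CpG IDs; the list name sig_cpgs and its contents are hypothetical and not taken from the study data.

```r
# Hypothetical sets of significant CpG IDs (FDR < 0.05) per cell type
sig_cpgs <- list(
  Bcell = c("cg01", "cg02", "cg03", "cg04"),
  Tmem  = c("cg02", "cg04", "cg05"),
  Tnai  = c("cg04", "cg05", "cg06", "cg07")
)

# Membership table: one row per CpG, one logical column per cell type
all_cpgs <- sort(unique(unlist(sig_cpgs)))
membership <- sapply(sig_cpgs, function(ids) all_cpgs %in% ids)
rownames(membership) <- all_cpgs

# Number of cell types in which each CpG is significant
n_cell_types <- rowSums(membership)
table(n_cell_types)

# CpGs shared by all three cell types
shared_all <- Reduce(intersect, sig_cpgs)
shared_all
```

The same membership matrix could also feed a Venn diagram or UpSet plot, since it records exactly which combination of cell types each CpG belongs to.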

DMR Intersections

Below is a Venn diagram of all significant DMRs and how they intersect between Bcells, Tmem, and Tnai. Most of the significant DMRs were present in all three cell types and Bcells had the most significant DMRs that were not shared between cell types.

Click Here to Download the CellTypeDMR_Intersection File

Below is an UpSet plot comparing how the significant DMRs intersect between lipid mediators. There is very little overlap.

Click Here to Download the LM UpSetPlots File

Below is an UpSet plot comparing how the significant DMRs intersect between mediator pathways. There is very little overlap.

Click Here to Download the Pathway_UpSetPlots File

Associated Genes

The tables below show the genes associated with each significant DMR, as well as the direction of methylation, for the Bcell, Tnai, and Tmem cell types. All significant DMRs have a SIDAK p-value below 0.05. DMRs containing only a single probe were filtered out.
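
The two filters described above could look roughly like the sketch below. The column names sidak and total follow the DMR files read in the code that follows; the estimate column, the direction labels, and all values are hypothetical and for illustration only.

```r
# Toy DMR table; values and the `estimate` column are made up for illustration
dmrs <- data.frame(
  DMR_Names = c("d1", "d2", "d3", "d4"),
  sidak     = c(0.001, 0.2, 0.01, 0.03),
  total     = c(5, 4, 1, 3),          # number of probes in the DMR
  estimate  = c(0.4, -0.1, 0.2, -0.3),
  stringsAsFactors = FALSE
)

# Keep DMRs with SIDAK p-value < 0.05 that contain more than one probe
sig <- dmrs[dmrs$sidak < 0.05 & dmrs$total > 1, ]

# Label the direction of the association from the sign of the estimate
sig$direction <- ifelse(sig$estimate > 0, "positive", "negative")
sig
```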

Bcell

library(readr)  # read_csv()
library(dplyr)  # %>%, select(), rename(), mutate(), left_join(), filter()
library(DT)     # datatable()

Bcell_estimate <- read_csv("/Users/eckco/Desktop/Norris_May_Update/DMR/All_Bcell_DMR_dataset_with_estimate.csv", show_col_types = FALSE)

# Exclude the first column
Bcell_estimate <- Bcell_estimate[, -1]

# Remove probe-level columns that are not needed in the DMR table
Bcell_estimate <- Bcell_estimate %>%
  select(-c(CpG, fdr))

# Keep only the first row for each DMR
Bcell_estimate <- Bcell_estimate[!duplicated(Bcell_estimate$DMR_Names), ]

# Show the SIDAK p-value in scientific notation (formatC() returns character strings)
names(Bcell_estimate)[names(Bcell_estimate) == "sidak"] <- "SIDAK"
Bcell_estimate$SIDAK <- formatC(Bcell_estimate$SIDAK, format = "e", digits = 2)

# Rename the total column
Bcell_estimate <- Bcell_estimate %>%
  rename(numProbes = total)

# Build the analyte-to-pathway lookup, falling back to Pathway2 when Pathway is missing or empty
full_names <- read.csv("/Users/eckco/Desktop/Norris_May_Update/finalLMlist_wParentFAInfo_updatedEnzymeInfo_Jan2024.csv")
full_names_pathway <- full_names %>%
  mutate(pathway = ifelse(is.na(Pathway) | Pathway == "", Pathway2, Pathway)) %>%
  select(Analyte, pathway)

# Merge in the pathway data using Analyte = LM
Bcell_estimate <- Bcell_estimate %>%
  left_join(full_names_pathway, by = c("LM" = "Analyte"))

names(Bcell_estimate)[names(Bcell_estimate) == "pathway"] <- "Pathway"

# Reorder columns by position (Pathway is column 11 after the join; column 7 is dropped)
Bcell_estimate <- Bcell_estimate[, c(6,5,11,1,2,3,4,8,9,10)]

Bcell_estimate <- Bcell_estimate %>%
  # Drop DMRs without an annotated gene first; paste() would otherwise
  # turn a missing value into the string "NA" and the row would be kept
  filter(!is.na(UCSC_RefGene_Name) & UCSC_RefGene_Name != "") %>%
  # Collapse repeated gene names within each semicolon-separated entry
  mutate(UCSC_RefGene_Name = vapply(
    strsplit(UCSC_RefGene_Name, ";"),
    function(genes) paste(unique(trimws(genes)), collapse = ";"),
    character(1)
  ))

datatable(
  Bcell_estimate,
  rownames = FALSE,
  options = list(
    pageLength = 10,
    dom = 'Bfrtip',              # B: Buttons, f: filter, r: processing, t: table, i: info, p: pagination
    buttons = c('copy', 'csv', 'excel', 'pdf')
  ),
  extensions = 'Buttons',
  caption = "Click to explore or download the full table"
)

Number of unique genes for Bcells:
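
A count of this kind can be derived from the UCSC_RefGene_Name column, where one entry may hold several semicolon-separated genes. A minimal sketch on a toy table follows; the gene symbols and the object names toy and n_unique_genes are made up for illustration and do not reproduce the reported count.

```r
# Toy table with semicolon-separated gene names (placeholder symbols)
toy <- data.frame(
  DMR_Names = c("dmr1", "dmr2", "dmr3"),
  UCSC_RefGene_Name = c("GENE1;GENE2", "GENE2", "GENE3; GENE1"),
  stringsAsFactors = FALSE
)

# Split every entry into individual genes, trim whitespace, and count distinct names
genes <- trimws(unlist(strsplit(toy$UCSC_RefGene_Name, ";")))
n_unique_genes <- length(unique(genes[genes != ""]))
n_unique_genes
```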

Tmem

Tmem_estimate <- read_csv("/Users/eckco/Desktop/Norris_May_Update/DMR/All_Tmem_DMR_dataset_with_estimate.csv", show_col_types = FALSE)

# Exclude the first column
Tmem_estimate <- Tmem_estimate[, -1]

# Remove probe-level columns that are not needed in the DMR table
Tmem_estimate <- Tmem_estimate %>%
  select(-c(CpG, fdr))

# Keep only the first row for each DMR
Tmem_estimate <- Tmem_estimate[!duplicated(Tmem_estimate$DMR_Names), ]

# Show the SIDAK p-value in scientific notation (formatC() returns character strings)
names(Tmem_estimate)[names(Tmem_estimate) == "sidak"] <- "SIDAK"
Tmem_estimate$SIDAK <- formatC(Tmem_estimate$SIDAK, format = "e", digits = 2)

# Rename the total column
Tmem_estimate <- Tmem_estimate %>%
  rename(numProbes = total)

# Merge in the pathway data using Analyte = LM
# (full_names_pathway is the analyte-to-pathway lookup built in the Bcell section)
Tmem_estimate <- Tmem_estimate %>%
  left_join(full_names_pathway, by = c("LM" = "Analyte"))

names(Tmem_estimate)[names(Tmem_estimate) == "pathway"] <- "Pathway"

# Reorder columns by position (Pathway is column 11 after the join; column 7 is dropped)
Tmem_estimate <- Tmem_estimate[, c(6,5,11,1,2,3,4,8,9,10)]

Tmem_estimate <- Tmem_estimate %>%
  # Drop DMRs without an annotated gene first; paste() would otherwise
  # turn a missing value into the string "NA" and the row would be kept
  filter(!is.na(UCSC_RefGene_Name) & UCSC_RefGene_Name != "") %>%
  # Collapse repeated gene names within each semicolon-separated entry
  mutate(UCSC_RefGene_Name = vapply(
    strsplit(UCSC_RefGene_Name, ";"),
    function(genes) paste(unique(trimws(genes)), collapse = ";"),
    character(1)
  ))

datatable(
  Tmem_estimate,
  rownames = FALSE,
  options = list(
    pageLength = 10,
    dom = 'Bfrtip',              # B: Buttons, f: filter, r: processing, t: table, i: info, p: pagination
    buttons = c('copy', 'csv', 'excel', 'pdf')
  ),
  extensions = 'Buttons',
  caption = "Click to explore or download the full table"
)

Number of unique genes for Tmem:

Tnai

Tnai_estimate <- read_csv("/Users/eckco/Desktop/Norris_May_Update/DMR/All_Tnai_DMR_dataset_with_estimate.csv", show_col_types = FALSE)

# Exclude the first column
Tnai_estimate <- Tnai_estimate[, -1]

# Remove probe-level columns that are not needed in the DMR table
Tnai_estimate <- Tnai_estimate %>%
  select(-c(CpG, fdr))

# Keep only the first row for each DMR
Tnai_estimate <- Tnai_estimate[!duplicated(Tnai_estimate$DMR_Names), ]

# Show the SIDAK p-value in scientific notation (formatC() returns character strings)
names(Tnai_estimate)[names(Tnai_estimate) == "sidak"] <- "SIDAK"
Tnai_estimate$SIDAK <- formatC(Tnai_estimate$SIDAK, format = "e", digits = 2)

# Rename the total column
Tnai_estimate <- Tnai_estimate %>%
  rename(numProbes = total)

# Merge in the pathway data using Analyte = LM
# (full_names_pathway is the analyte-to-pathway lookup built in the Bcell section)
Tnai_estimate <- Tnai_estimate %>%
  left_join(full_names_pathway, by = c("LM" = "Analyte"))

names(Tnai_estimate)[names(Tnai_estimate) == "pathway"] <- "Pathway"

# Reorder columns by position (Pathway is column 11 after the join; column 7 is dropped)
Tnai_estimate <- Tnai_estimate[, c(6,5,11,1,2,3,4,8,9,10)]

Tnai_estimate <- Tnai_estimate %>%
  # Drop DMRs without an annotated gene first; paste() would otherwise
  # turn a missing value into the string "NA" and the row would be kept
  filter(!is.na(UCSC_RefGene_Name) & UCSC_RefGene_Name != "") %>%
  # Collapse repeated gene names within each semicolon-separated entry
  mutate(UCSC_RefGene_Name = vapply(
    strsplit(UCSC_RefGene_Name, ";"),
    function(genes) paste(unique(trimws(genes)), collapse = ";"),
    character(1)
  ))

datatable(
  Tnai_estimate,
  rownames = FALSE,
  options = list(
    pageLength = 10,
    dom = 'Bfrtip',              # B: Buttons, f: filter, r: processing, t: table, i: info, p: pagination
    buttons = c('copy', 'csv', 'excel', 'pdf')
  ),
  extensions = 'Buttons',
  caption = "Click to explore or download the full table"
)

Number of unique genes for Tnai:

Pathway analysis

KEGG

Below is a Venn diagram comparing the intersecting pathways using KEGG across cell types.

Click Here to Download the KEGG Venn Diagram File

Below is an UpSet plot comparing, across cell types, the intersection of the 10 KEGG pathways with the smallest adjusted p-values (padj) in each cell type.

Click Here to Download the Upset Plot File
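
Selecting the 10 pathways with the smallest padj per cell type and intersecting them can be sketched as follows. The enrichment tables are simulated, and the column names ID and padj are assumptions about the shape of the enrichment output rather than something stated in this report.

```r
# Simulated enrichment results per cell type; ID and padj are assumed column names
set.seed(42)
pathway_ids <- paste0("pathway_", 1:30)
enrich <- lapply(
  c(Bcell = 1, Tmem = 2, Tnai = 3),
  function(i) data.frame(ID = pathway_ids, padj = runif(30),
                         stringsAsFactors = FALSE)
)

# The 10 pathways with the smallest adjusted p-values in each cell type
top10 <- lapply(enrich, function(df) head(df$ID[order(df$padj)], 10))

# Pathways that appear in the top 10 of all three cell types (may be empty)
shared_top10 <- Reduce(intersect, top10)
shared_top10
```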

Pathway Intersections Between Cell Types for KEGG Pathways

Below is a table comparing the intersecting KEGG pathways that are present in all 3 cell types. There are 35 total pathways.

GO

Below is a Venn diagram comparing the intersecting Gene Ontology (GO) terms across cell types.

Click Here to Download the GO Venn Diagram File

Below is a Venn diagram comparing, across cell types, the intersection of the 10 GO terms with the smallest adjusted p-values (padj) in each cell type.