IRF5 rs4728142 — The Promoter Enhancer Switch: Causal Driver of Interferon Overactivation

Interferon Regulatory Factor 5 (IRF5) is the molecular ignition switch for the type I interferon
response — the branch of innate immunity responsible for fighting viral infections by triggering
widespread pro-inflammatory signaling. When IRF5 is overactive, the same cascade that defends
against viruses begins attacking the body's own tissues, driving systemic lupus erythematosus
(SLE), Sjögren's syndrome, systemic sclerosis, and other autoimmune diseases. rs4728142 sits
approximately 5 kilobases upstream of IRF5's transcription start sites, in a region now established
as an active enhancer | An enhancer is a regulatory DNA element that boosts transcription of a
nearby gene; it doesn't have to be immediately adjacent — enhancers can act from tens of thousands
of base pairs away via DNA looping
for the gene.
The A allele of rs4728142 has recently been confirmed as the leading causal variant for the 5'
IRF5 risk signal — not merely a tag for some other functional change, but the variant that directly
rewires how the IRF5 gene is regulated.

The Mechanism

rs4728142 operates through a newly described chromatin-looping mechanism. The A allele alters
the binding affinity of ZBTB3 (Zinc Finger and BTB Domain-Containing Protein 3) | A transcription
factor that recognizes specific DNA sequences and can recruit chromatin-remodeling complexes;
ZBTB3 belongs to the BTB-POZ family involved in gene regulation and cell fate decisions
.
When ZBTB3 binds the rs4728142-A allele, it orchestrates a short-range chromatin loop that
physically connects the upstream enhancer region to the alternative (short-transcript) promoter
of IRF5. The consequence: the IRF5 gene preferentially produces its shorter isoforms at the expense
of the full-length transcript, and total IRF5 expression is markedly elevated. In lymphoblastoid
cells from European ancestry individuals, the A allele is associated with a 1.5-fold increase in
IRF5 mRNA | Bonferroni-corrected P=0.0004 in CEU cell lines; smaller but significant effects also
seen in Asian-ancestry cells (1.2-fold, P=0.006)
.
In SLE patient monocytes, CRISPR-based interference with the enhancer function at this locus
attenuated production of lupus-associated cytokines, confirming the causal chain from variant
to expression to inflammatory output.

The rs4728142-A allele is in strong linkage disequilibrium | LD measures how often two nearby
variants are inherited together; r²>0.6 means the two variants co-occur on the same chromosome
much more often than chance would predict
(r²>0.6)
with the CGGGG insertion-deletion polymorphism in the IRF5 promoter. The CGGGG indel sits
64 base pairs upstream of the exon 1A transcription start site; the 4-copy insertion allele (4R)
creates an additional binding site for the transcription factor Sp1, which was confirmed by
electrophoretic mobility shift assays. The 4R allele increases IRF5 mRNA expression in peripheral
blood mononuclear cells and amplifies interferon-inducible gene expression (MX1, IFITM1) in
Sjögren's patients. rs4728142 and the CGGGG indel together provide complementary molecular
explanations for why the 5' region of IRF5 elevates interferon tone: the CGGGG indel creates
an extra Sp1 site at the promoter; rs4728142 directs an enhancer loop to that promoter.

The Evidence

The breadth of disease associations for rs4728142 is exceptional even among major autoimmune
risk loci. The variant has been independently associated with six autoimmune conditions across
multiple populations.

In systemic lupus erythematosus, a large meta-analysis of IRF5 variants across nine
populations found rs4728142 among the strongest independent signals P=1.34×10⁻⁸,
OR=1.22 (95% CI 1.14–1.30) | Pooled analysis across European, Asian, and admixed populations
from multiple case-control studies
, placing it
among the most replicated non-HLA genetic risk factors for lupus.

In primary Sjögren's syndrome, the CGGGG indel (in LD with rs4728142-A) was found to
fully explain the IRF5-Sjögren's association in two independent Norwegian and British cohorts,
yielding OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶) | Norheim et al. 2012, Annals of the Rheumatic
Diseases; OR of 2.00 for carrying the 4R CGGGG allele vs the 3R allele
.
Carriers of the 4R allele had measurably higher IRF5 mRNA in both peripheral blood and salivary
gland epithelial cells — the tissue most directly relevant to Sjögren's pathology.

In inflammatory bowel disease, the CGGGG indel was the lead signal in an independent IBD
GWAS analysis, with OR=1.81 for overall IBD (P=1.9×10⁻⁵) and OR=2.42 specifically for
ulcerative colitis (P=5.3×10⁻⁸) | Eames et al. 2007, Human Molecular Genetics; replication
in a second cohort OR=1.59
. The UC association
is among the strongest single-gene effects in IBD genetics.

In multiple sclerosis, rs4728142 showed independent association across Spanish, Swedish,
and Finnish cohorts (combined P<0.001, OR≈1.14), with allele-specific protein binding on EMSA
suggesting direct transcription factor recruitment at the variant site rather than passive LD.

In systemic sclerosis, the three-SNP IRF5 haplotype incorporating rs4728142 reached
OR=1.75 (P=9.04×10⁻²²) | Dieudé et al. 2013, PLOS ONE; five-country European study including
Spain, Germany, Netherlands, Italy, and UK
, exceeding
the effect of rs10488631 alone and confirming additive contributions from the 5' and 3' haplotype
blocks.

Practical Implications

The A allele at rs4728142 acts as an immune-system volume dial stuck at a higher setting. The
overexpressed IRF5 protein amplifies both the initial interferon alarm response and subsequent
pro-inflammatory cytokine cascades (TNF-α, IL-6, IL-12). For most carriers this translates to
modest increases in autoimmune disease susceptibility across multiple organ systems — not a
predetermined path to disease, but a meaningful shift in the immune threshold.

Because rs4728142 affects the IRF5 enhancer rather than the coding sequence, its effects are
partly modifiable by context: the immune system's activation state, the presence of viral
triggers (particularly Epstein-Barr virus, strongly implicated in SLE pathogenesis), and
environmental modulators of interferon tone including vitamin D. The discovery that CRISPR
interference with this enhancer can attenuate lupus-associated cytokine production in patient
monocytes indicates that this regulatory region is being studied as a potential therapeutic target.

Population stratification is striking at this locus: European and Latino populations carry the A
allele at approximately 44% frequency — nearly half the population — while East Asian populations
carry it at only 13%. This mirrors the population-level differences in autoimmune disease prevalence
and suggests that variation at IRF5 contributes to the well-documented ethnic disparities in lupus
and related conditions.

Interactions

rs4728142 functions within the IRF5 three-block haplotype architecture alongside rs2004640
(exon 1B splice site) and rs10488631 (3' haplotype tag). The 5' block tagged by rs4728142
and the 3' block tagged by rs10488631 operate independently and additively: individuals
carrying both the rs4728142-A risk allele and the rs10488631-C risk allele carry contributions
from both haplotype blocks, with combined effects substantially exceeding those of either
alone. The three-block SSc haplotype data illustrating OR=1.75 for the combined haplotype
versus OR=1.63 for rs10488631 alone quantifies this additive contribution.

Downstream of IRF5, STAT4 (rs7574865) amplifies cellular responsiveness to the interferons
that IRF5 drives. IRF5 variants increase interferon production; STAT4 variants increase
cellular sensitivity to that interferon. Studies in primary Sjögren's syndrome and SLE
demonstrate striking additive effects: with all five IRF5 + STAT4 risk alleles, OR for
Sjögren's reaches 6.78. rs4728142 risk carriers with concurrent STAT4 rs7574865 risk
alleles thus face a compounded immune activation burden — elevated production and elevated
responsiveness to what is produced.

The autoimmune pleiotropic nature of rs4728142 — associated with lupus, Sjögren's, IBD,
MS, SSc, and RA — means the variant likely exerts a shared molecular mechanism (IRF5
overactivation) that manifests as different diseases depending on which other genetic and
environmental factors are present. Individuals carrying this risk allele who develop symptoms
in one autoimmune domain should be aware of overlapping risk across organ systems.

All Genotypes

GG normal

No rs4728142 risk allele; IRF5 enhancer operates at standard activity without ZBTB3-driven promoter looping

You carry two copies of the G allele at rs4728142, meaning you do not carry the A allele that drives ZBTB3-mediated chromatin looping to the IRF5 alternative promoter. Your IRF5 gene expression at this enhancer locus is at baseline, without the upregulation associated with the A allele. This is the most common genotype globally: approximately 33% of people carry GG, though this varies substantially by ancestry — it is more common in East Asian populations where the A allele frequency is only 13%. Among Europeans, approximately 31% carry this genotype. You do not carry the tag for the CGGGG promoter 4R insertion at this locus, meaning your IRF5 promoter lacks the additional Sp1 binding site associated with elevated interferon output.

AG intermediate

One copy of the IRF5 risk allele at this enhancer; modestly elevated IRF5 expression and increased autoimmune susceptibility

You carry one copy of the A allele at rs4728142, meaning one of your two IRF5 gene copies is regulated by the risk enhancer configuration that promotes ZBTB3-mediated chromatin looping to the alternative IRF5 promoter. This is associated with intermediate elevation of IRF5 expression — roughly in the range of the A/G effect seen across populations. This is the most common genotype globally (approximately 49% of people), reflecting that the A allele is common particularly in European and Latino populations. The AG genotype is associated with modestly increased susceptibility to lupus (OR≈1.1–1.2 per allele), Sjögren's syndrome, ulcerative colitis, and systemic sclerosis, particularly in the context of the broader IRF5 haplotype you carry at rs2004640 and rs10488631.

AA high_risk

Two copies of the IRF5 enhancer risk allele; elevated IRF5 expression with substantially increased susceptibility to lupus, Sjögren's syndrome, ulcerative colitis, and related autoimmune conditions

You carry two copies of the A allele at rs4728142, meaning both copies of your IRF5 gene are regulated by the risk enhancer architecture. ZBTB3 binds both gene copies' upstream enhancer region, driving chromatin loops to the alternative IRF5 promoter on both chromosomes and producing the maximum genotypic upregulation of IRF5-short transcripts. Expression studies showed AA cells have the highest IRF5 expression among the three genotypes, and your immune cells are primed toward stronger and more sustained interferon responses compared to GG individuals. Approximately 18.5% of people globally carry this genotype; it is more common in European and Latino populations (around 19%) and rarer in East Asian populations (under 2%). The AA genotype also maximizes the LD signal from the CGGGG 4R insertion: you carry two copies of the 4R-tagged haplotype with its extra Sp1 site, compounding the transcriptional upregulation at the IRF5 promoter.