Zinc Finger Protein 821, also known as ZNF821, is a protein encoded by the ZNF821 gene. This gene is located on the 16th chromosome and is expressed highly in the testes, moderately expressed in the brain and low expression in 23 other tissues. The protein encoded is 412 amino acids long with 2 Zinc Finger motifs (C2H2 type) and a 23 amino acid long STPR domain.

ZNF821
Identifiers
AliasesZNF821, zinc finger protein 821
External IDsMGI: 1923121; HomoloGene: 32345; GeneCards: ZNF821; OMA:ZNF821 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)
RefSeq (protein)
Location (UCSC)Chr 16: 71.86 – 71.9 MbChr 8: 110.43 – 110.45 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

Gene

edit

Locus

edit

ZNF821 is located at 16q22.2 on the minus strand, it is composed of 35,657 bases spanning from base 71,893,583 to 71,929,239. ZNF821 has 8 exons and is located in the same neighborhood as 4 other genes, ATXNL1, IST1, PKD1L3, AP1G1.[5]

Transcriptional Regulation

edit

Transcription of ZNF821 is handled by the promoter GXP_9784938 which is 539 bases long and located from base 71,884,046 to 71,884,585. The promoter region begins 404 base pairs upstream of the beginning of transcription. Several transcription factors with scores greater than 0.9 are predicted to regulate ZNF821 expression.

Transcription Factor Abbreviation Binding Site Strand
Myeloid zinc finger protein MZF1 GTGGGGATCCG
Cyclin D binding myb-like transcription factor DMTF CACCCCTAGGCCCGA -
Early growth response 2 EGRF GAGAGGGGGTGCCTGCGGC
Human acute myelogenous leukemia factors HAML AGCTGTGGTTGGGGG
C2H2 zinc finger transcription factors 2 ZF02 GACTTGAGCTACCACCCCATTCT -
Hypoxia-response elements HIFF CCATCCCACCGCAAATGTGCAGGTC -
Ets variant 1 ETSF CACGTCCAGGAAGGTCTGGGG
Homeobox transcription factor Nanog HOXF ACCCGGGAATGGGCGAGGC
GLIS family zinc finger 3 GLIF CGCTCCGCCCCCCAAGG -
GTF2I-like repeat 4 of GTF3 GUCE CGGGATTGGGC
zinc finger protein with KRAB and SCAN domains 12 ZF07 GGAGCCCCTCCTCTCCA
Myc associated zinc finger protein MAZF GTCTCGGGGAGAGGAGTCCGGGGCGGGTGTT -
Zinc finger and BTB domain containing 14 VF5F CGGTCCGCGCGCGGCCC , -
Transcription factor II B (TFIIB) recognition element TF2B CCGCGCC -
Zinc finger protein 37 alpha ZF37 CCTCCCCCT -
E2F transcription factor 1 E2FF CGCGCGAGGGCGGCGGG -
Cas-interacting zinc finger CIZF GTAGAAAAAGG -
Sma- and Mad-related proteins SMAD TCTGTCTGTCT
SRY (sex determining region Y)-box 6 SORY CAGACAGACAGACGACAACCGAAACAGGCAG -

Expression

edit

ZNF821 is highly expressed in the testes, almost 2.5 times as much as in the brain, the next most highly expressed in tissue. Expression in the brain is primarily during fetal development, with lower levels of expression occurring in the cerebellum. There are low levels of expression in most other tissues.

mRNA

edit

Variants and Isoforms

edit

ZNF821 has 7 different transcript variants and 4 isoforms.[6] Variant 1 Isoform 1 is the second longest and but most abundant of all the variants and isoforms. While variant 2 is longer, it contains one fewer exon. Variant 1, Isoform 1 is 1987 bases long with a 5' UTR 415 bases long and a 3' UTR 433 bases long.

Variant Isoform Length # of Exons
1 1 1987 8
2 1 2005 7
3 2 1894 7
4 2 1879 6
5 3 1853 7
6 4 1959 8
7 4 1722 7

Protein

edit

Characteristics

edit

The protein encoded by the ZNF821 gene is 412 amino acids long with a calculated molecular weight of ~ 47 kDa and a predicted isoelectric point of 6.14. Compared to the rest of the human proteome, there are decreased amounts of Isoleucine and Tyrosine residues as well as increased levels of Arginine residues.[7]

Structure

edit
 
AlphaFold prediction of ZNF821 Protein.

ZNF821 protein contains two C2H2 Zinc Finger motifs (spanning amino acids 120-140 and 152–172, respectively) and an STPR (one-score-and-three-amino acid peptide repeat) domain (spanning amino acids 223–314) containing a bipartite nuclear localization signal. This STPR domain is a double-stranded DNA-binding domain with similar traits to the silkworm FMBP-1 STPR domain and is thought to be responsible for the nuclear localization of the ZNF821 protein.[8] The secondary structure of the ZNF821 protein is composed of several alpha helical structures along with two small regions of beta sheets.[9][10][11][12] The tertiary structure of the ZNF821 protein provides exposure of the Zinc Fingers for presumed DNA-binding.[13]

Cellular Localization

edit

The ZNF821 protein binds to DNA making it highly likely to be localized to the nucleus, there is also a bipartite nuclear localization sequence from Lys280 to Arg297, Lys304 to Leu320, and Lys338 to Arg354. An analysis of the subcellular localization in both close and distant orthologs resulted in a >99% chance of being localized to the nucleus for all orthologs.[14]

Regulation

edit

The ZNF821 protein is predicted to be modified post-translationally at several different positions. When compared with both close and distant orthologous sequences two phosphorylation sites are conserved, the Serine at position 2 and the Threonine at position 7.[15] It is also predicted by several sources to have further phosphorylation sites of the Serine at position 254 and the Tyrosine at position 279.[16][15]

Interactions

edit

Several proteins have been shown to interact with the ZNF821 protein, many of them relating to transcriptional regulation.[17]

Abbreviation Protein Name Identification Method Function
ATM ATM Serine/Threonine kinase Two Hybrid Array Activates checkpoint signaling upon sensing DNA damage
CCDC85B Coiled-coil domain-containing protein 85B Two Hybrid Pooling Transcriptional Repressor
SMARCA2 Probable global transcription activator SNF2L2 Two Hybrid Pooling Transcriptional activation and repression by chromatin remodeling
CDCA7L Cell division cycle-associated 7-like protein Two Hybrid Array Transcriptional Repressor
PIM2 Serine/threonine-protein kinase Pim-2 Two Hybrid Array Proto-oncogene
DVL3 Segment polarity protein dishevelled homolog DVL-3 Two Hybrid Array Cell signal transduction
RUNDC3A RUN domain-containing protein 3A Two Hybrid Array Effector of RAPA2A
FXR1 Fragile X mental retardation syndrome-related protein 1 Two Hybrid Pooling RNA-binding protein

Homology

edit

Paralogs

edit

ZNF821 has no paralogs in humans.[5]

Orthologs

edit

There are orthologs for ZNF821 across vertebrates, but none for the protein in invertebrates. The Zinc Finger motifs are conserved into invertebrates. The STPR domain is only present in mammals.

Genus Species Common Name Taxonomy Date of Divergence (MYA) Sequence Length (AA) Sequence Identity (%)
Homo sapiens Human Primates 0.00 412 100.00
Pongo abelii Sumatran Orangutan Primates 15.20 412 99.76
Lacerta agilis Sand lizard Squamata 318.00 429 76.33
Chrysemys picta bellii Western Painted turtle Testudines 318.00 411 87.65
Gopherus evgoodei Sinaloan desert tortoise Testudines 318.00 413 86.44
Antrostomus carolinensis Chuck-will's-widow Caprimulgiformes 318.00 411 86.20
Apteryx rowi Okarito kiwi Apterygiformes 318.00 482 87.41
Leptosomus discolor Cuckoo roller Leptosomiformes 318.00 410 85.68
Xenopus laevis Two-lined caecilian Gymnophiona 351.70 409 76.76
Geotrypetes seraphini Gaboon caecilian Gymnophiona 351.70 378 75.26
Bufo bufo Common toad Anura 351.70 395 66.75
Rhinatrema bivittatum African clawed frog Anura 351.70 398 69.64
Pygocentrus nattereri Red-bellied piranha Characiformes 433.00 456 48.58
Cyprinus carpio Common Carp Cypriniformes 433.00 441 49.56
Polypterus senegalus Senegal bichir Polypteriformes 433.00 539 54.59
Erpetoichthys calabaricus Reedfish Polypteriformes 433.00 471 49.58
Paramormyrops kingsleyae Old Calabar mormyrid Osteoglossiformes 433.00 443 53.63
Carcharodon carcharias Great white shark Lamniformes 465.00 360 42.07
 
Divergence Rate of ZNF821 compared to Cytochrome c and Fibrinogen alpha.

The relative rate of divergence is slow when compared to the rates of two reference proteins, Cytochrome c and Fibrinogen alpha, but increases to slightly faster than Cytochrome c as the date of divergence gets closer to the present.

Clinical Significance

edit

ZNF821 has been associated in some capacity with several different diseases and conditions. It has been implicated in causing craniosynostosis through interactions with the transcription factor BCL11B by affecting the charges on the arginine-3, arginine-5, and lysine-3 residues, thereby increasing their conformational flexibility.[18] It has also been found to be a possible biomarker for methamphetamine-associated psychosis (MAP) via the process of RNA-degradation.[19] Another disease association is that with breast cancer as part of a DNA-repair sub network. ZNF821 was found to be dysregulated among breast cancer patients.[20] Finally, there is a study showing an increase in methylation over time on ZNF821 in Parkinson's disease patients who did not receive L-dopa/entacapone. This provides a clearer view of changes due only to Parkinson's pathophysiology.[21]

References

edit
  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000102984Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000031728Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ a b "Human BLAT Search". genome.ucsc.edu. Retrieved 2021-12-16.
  6. ^ "ZNF821 zinc finger protein 821 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2021-12-16.
  7. ^ "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2021-12-16.
  8. ^ Nonaka Y, Muto H, Aizawa T, Okabe E, Myoba S, Yokoyama T, et al. (September 2010). "STPR, a 23-amino acid tandem repeat domain, found in the human function-unknown protein ZNF821". Biochemistry. 49 (38): 8367–8375. doi:10.1021/bi100448f. PMID 20795678.
  9. ^ "NPS@ : SOPMA secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2021-12-16.
  10. ^ "Figure S6: Predicted secondary structure of CoV-RMEN using CFSSP:Chou and Fasman secondary structure prediction server". doi:10.7717/peerj.9572/supp-13. {{cite journal}}: Cite journal requires |journal= (help)
  11. ^ "Bioinformatics Toolkit". toolkit.tuebingen.mpg.de. Retrieved 2021-12-16.
  12. ^ "Prediction of the Secondary Structure by GOR". cib.cf.ocha.ac.jp. Retrieved 2021-12-16.
  13. ^ "AlphaFold Protein Structure Database". alphafold.ebi.ac.uk. Retrieved 2021-12-16.
  14. ^ "Services". www.healthtech.dtu.dk. Retrieved 2021-12-18.
  15. ^ a b "Motif Scan". myhits.sib.swiss. Retrieved 2021-12-16.
  16. ^ "PhosphoSitePlus". www.phosphosite.org. Retrieved 2021-12-16.
  17. ^ "PSICQUIC View". www.ebi.ac.uk. Retrieved 2021-12-18.
  18. ^ Goos JA, Vogel WK, Mlcochova H, Millard CJ, Esfandiari E, Selman WH, et al. (August 2019). "A de novo substitution in BCL11B leads to loss of interaction with transcriptional complexes and craniosynostosis". Human Molecular Genetics. 28 (15): 2501–2513. doi:10.1093/hmg/ddz072. PMC 6644156. PMID 31067316.
  19. ^ Breen MS, Uhlmann A, Nday CM, Glatt SJ, Mitt M, Metsalpu A, et al. (May 2016). "Candidate gene networks and blood biomarkers of methamphetamine-associated psychosis: an integrative RNA-sequencing report". Translational Psychiatry. 6 (5): e802. doi:10.1038/tp.2016.67. PMC 5070070. PMID 27163203.
  20. ^ Arroyo R, Suñé G, Zanzoni A, Duran-Frigola M, Alcalde V, Stracker TH, et al. (March 2015). "Systematic identification of molecular links between core and candidate genes in breast cancer". Journal of Molecular Biology. 427 (6 Pt B): 1436–1450. doi:10.1016/j.jmb.2015.01.014. PMID 25640309.
  21. ^ Henderson-Smith A, Fisch KM, Hua J, Liu G, Ricciardelli E, Jepsen K, et al. (April 2019). "DNA methylation changes associated with Parkinson's disease progression: outcomes from the first longitudinal genome-wide methylation analysis in blood". Epigenetics. 14 (4): 365–382. doi:10.1080/15592294.2019.1588682. PMC 6557551. PMID 30871403.