KIAA0825 is a protein that in humans is encoded by the gene of the same name, located on chromosome 5, 5q15. It is a possible risk factor in Type II Diabetes, and associated with high levels of glucose in the blood. It is a relatively fast mutating gene, compared to other coding genes. There is however one region which is highly conserved across the species that have the gene, known as DUF4495. It is predicted to travel between the nucleus and the cytoplasm.

KIAA0825
Identifiers
AliasesKIAA0825, C5orf36, PAPA10
External IDsOMIM: 617266; MGI: 1919621; HomoloGene: 89234; GeneCards: KIAA0825; OMA:KIAA0825 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001145678
NM_173665

NM_001081353
NM_001145676
NM_028215

RefSeq (protein)

NP_001139150
NP_775936

NP_001074822
NP_001139148

Location (UCSC)Chr 5: 94.15 – 94.62 MbChr 13: 77.28 – 77.76 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

General information

edit
 
The Isoforms of C5orf36

KIAA0825 is gene that appears to be a genetic factor that increases the risk of Type II Diabetes, possibly by increasing the level of blood glucose levels.[5] It has also been identified as a possible oncogene.[6] C5orf36 has one common alias KIAA0825. The gene is about 478 kb long and contains 22 exons. It produces 10 different variants: 9 alternatively spliced, and one un-spliced version. The longest experimentally confirmed mRNA is 7240 bp long and produces a protein 1275 amino acids long.[7] The protein is predicted to weigh about 147.8kDal. It has orthologs in most animals including Aplysia californica, but is not found outside animals with the possible exception of Plasmodiophora brassicae.

Protein information

edit

The protein has a predicted weight of 147.8 kDal.[8][9] It does not contain a known nuclear localization signal but does contain a nuclear export signal.[10] The subcellular localization for the protein is predicted to be the nucleus and the cytoplasm.[11] This suggests that the protein might shuttle back and forth across the nuclear membrane.

Secondary structure

edit
 
This is a 3-D Prediction created by I-TASSER. The green indicates the conserved DUF4495.

Several programs suggest that the secondary structure of the protein is mainly helices with only a few beta sheets.[12][13][14][15] Analysis of protein composition also suggests that the protein has relatively low levels of glycine.[16] This could suggest a fairly rigid structure relative to other proteins. The tertiary structure is harder to predict due to the size of the protein, partially due to its size. The 3-D structure shown shows a prediction made by I-TASSER. This is a possible strture with a C-score of -1.06 on a scale from -5 to 1 (in which the higher the number the greater the confidence).[17][18][19] This predicted structure indicates there are two main parts, and it is possible they interact depending on the state of the protein (e.g. whether or not it's phosphorylated).

Expression

edit
 
mRNA expression data from the Human Protein Atlas, calculated as transcripts per million (TPM).
 
This shows the expression levels of C5orf36 in human tissue. It is provided by the Human Protein Atlas.

The mRNA for KIAA0825 is expressed at relatively low rates in comparison to other mRNAs.[20] The protein however is expressed at relatively high rates, especially in parts of the brain as well as adrenal glands and the thyroid.[21] This would suggest that the protein is not readily degraded and remains in the cell for long periods of time, such that continuous transcription of the DNA into mRNA is unnecessary. No current finding suggest that there is alternative expression of different isoforms in different tissues.

Regulation

edit

Analysis of the promoter offers some insight into the expression of KIAA0825.[22] One possible regulator found is the NeuroD1 transcription factor. This factor is an important regulator for the insulin gene, and a mutation in this gene can lead to Type II diabetes.[23] This could explain why KIAA0825 is expressed at lower levels in patients with Type II diabetes. Another possible transcription factor is the Myeloid zinc finger 1 factor, which is tied to myeloid leukemia, because it delays apoptosis of cells in the presence of retinoic acid.[24] There are also several places where Vertebrate SMAD family transcription factors can bind. These transcription factors are thought to be responsible for nucleocytoplasmic dynamics.[25] This means that these SMAD transcription factors could affect KIAA0825, because subcellular localization suggests it shuttles across the nuclear envelope.

Function

edit

There are two proteins found to interact with KIAA0825. One is Interleukin enhancer-binding factor 3.[26] ILF3 is a factor that complexes with other proteins and regulates gene expression and stabilizes mRNAs.[27] The other is the Amyloid-beta precursor protein.[28] This protein is an integral membrane protein found most commonly in the synapses of neurons. Neither of these proteins is well enough understood to indicate for certain the role of C5orf36 in human cells. They however suggest that KIAA0825 could serve a variety of roles in different parts of the cell.

Orthology

edit

KIAA0825 orthologs can be found in virtually all animals, but cannot be found in plants, bacteria, or protozoa. It is mostly highly conserved in vertebrates especially mammals, but genes that contain region similar to DUF4495 region can be found in California sea hare, generally one of the most simple animal. The size especially in mammals is well conserved sticking very close to between 1250 and 1300 amino acids long. This suggests that the protein wraps around on itself forming important structures for its function.

There were no paralogs found of the gene KIAA0825 in humans or in any other species.

References

edit
  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000185261Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000071252Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ Li J, Wei J, Xu P, Yan M, Li J, Chen Z, Jin T (December 2016). "Impact of diabetes-related gene polymorphisms on the clinical characteristics of type 2 diabetes Chinese Han population". Oncotarget. 7 (51): 85464–85471. doi:10.18632/oncotarget.13399. PMC 5356749. PMID 27863428.
  6. ^ Delgado AP, Brandao P, Chapado MJ, Hamid S, Narayanan R (July–August 2014). "Open reading frames associated with cancer in the dark matter of the human genome". Cancer Genomics & Proteomics. 11 (4): 201–13. PMID 25048349.
  7. ^ NCBI Resource Coordinators (January 2017). "Database Resources of the National Center for Biotechnology Information". Nucleic Acids Research. 45 (D1): D12–D17. doi:10.1093/nar/gkw1071. PMC 5210554. PMID 27899561.
  8. ^ Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi:10.1073/pnas.89.6.2002. PMC 48584. PMID 1549558.
  9. ^ Brendel V. "SDSC Biology Workbench". workbench.sdsc.edu. Department of Mathematics, Stanford University, CA. Retrieved 17 April 2017.
  10. ^ la Cour T, Kiemer L, Mølgaard A, Gupta R, Skriver K, Brunak S (June 2004). "Analysis and prediction of leucine-rich nuclear export signals". Protein Engineering, Design & Selection. 17 (6): 527–36. doi:10.1093/protein/gzh062. PMID 15314210.
  11. ^ Nakai K, Horton P (January 1999). "PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization". Trends in Biochemical Sciences. 24 (1): 34–6. doi:10.1016/s0968-0004(98)01336-x. PMID 10087920.
  12. ^ Bigelow HR, Petrey DS, Liu J, Przybylski D, Rost B (28 April 2004). "Predicting transmembrane beta-barrels in proteomes". Nucleic Acids Research. 32 (8): 2566–77. doi:10.1093/nar/gkh580. PMC 419468. PMID 15141026.
  13. ^ Rost B, Yachdav G, Liu J (July 2004). "The PredictProtein server". Nucleic Acids Research. 32 (Web Server issue): W321–6. doi:10.1093/nar/gkh377. PMC 441515. PMID 15215403.
  14. ^ Garnier J, Osguthorpe DJ, Robson B (March 1978). "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins". Journal of Molecular Biology. 120 (1): 97–120. doi:10.1016/0022-2836(78)90297-8. PMID 642007.
  15. ^ Burgess AW, Ponnuswamy PK, Scheraga HA (1974). "Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins". Israel Journal of Chemistry. 12 (1–2): 239–286. doi:10.1002/ijch.197400022.
  16. ^ Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi:10.1073/pnas.89.6.2002. PMC 48584. PMID 1549558.
  17. ^ Zhang Y (January 2008). "I-TASSER server for protein 3D structure prediction". BMC Bioinformatics. 9 (1): 40. doi:10.1186/1471-2105-9-40. PMC 2245901. PMID 18215316.
  18. ^ Roy A, Kucukural A, Zhang Y (April 2010). "I-TASSER: a unified platform for automated protein structure and function prediction". Nature Protocols. 5 (4): 725–38. doi:10.1038/nprot.2010.5. PMC 2849174. PMID 20360767.
  19. ^ Yang J, Yan R, Roy A, Xu D, Poisson J, Zhang Y (January 2015). "The I-TASSER Suite: protein structure and function prediction". Nature Methods. 12 (1): 7–8. doi:10.1038/nmeth.3213. PMC 4428668. PMID 25549265.
  20. ^ Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID 25613900. S2CID 802377.
  21. ^ Uhlén M, Fagerberg L, Hallström BM, Lindskog C, Oksvold P, Mardinoglu A, et al. (January 2015). "Proteomics. Tissue-based map of the human proteome". Science. 347 (6220): 1260419. doi:10.1126/science.1260419. PMID 25613900. S2CID 802377.
  22. ^ "Genomatix". Genomatix. Archived from the original on 19 August 2021. Retrieved 7 May 2017.
  23. ^ Prantera G, Pimpinelli S, Rocchi A (1 January 1999). "Effects of distamycin A on human leukocytes in vitro". Cytogenetics and Cell Genetics. 23 (1–2): 103–7. doi:10.1128/MCB.19.1.704. PMC 83927. PMID 83927.
  24. ^ Robertson KA, Hill DP, Kelley MR, Tritt R, Crum B, Van Epps S, Srour E, Rice S, Hromas R (May 1998). "The myeloid zinc finger gene (MZF-1) delays retinoic acid-induced apoptosis and differentiation in myeloid leukemia cells". Leukemia. 12 (5): 690–8. doi:10.1038/sj.leu.2401005. PMID 9593266.
  25. ^ Massagué J, Seoane J, Wotton D (December 2005). "Smad transcription factors". Genes & Development. 19 (23): 2783–810. doi:10.1101/gad.1350705. PMID 16322555.
  26. ^ Chu L, Su MY, Maggi LB, Lu L, Mullins C, Crosby S, Huang G, Chng WJ, Vij R, Tomasson MH (August 2012). "Multiple myeloma-associated chromosomal translocation activates orphan snoRNA ACA11 to suppress oxidative stress". The Journal of Clinical Investigation. 122 (8): 2793–806. doi:10.1172/JCI63051. PMC 3408744. PMID 22751105.
  27. ^ Chaumet A, Castella S, Gasmi L, Fradin A, Clodic G, Bolbach G, Poulhe R, Denoulet P, Larcher JC (June 2013). "Proteomic analysis of interleukin enhancer binding factor 3 (Ilf3) and nuclear factor 90 (NF90) interactome". Biochimie. 95 (6): 1146–57. doi:10.1016/j.biochi.2013.01.004. PMID 23321469.
  28. ^ Oláh J, Vincze O, Virók D, Simon D, Bozsó Z, Tõkési N, et al. (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry. 286 (39): 34088–100. doi:10.1074/jbc.M111.243907. PMC 3190826. PMID 21832049.