C14orf119

C14orf119
Identifiers
Aliases	C14orf119, chromosome 14 open reading frame 119
External IDs	MGI: 1920893; HomoloGene: 9921; GeneCards: C14orf119; OMA:C14orf119 - orthologs
Gene location (Human)
Chr.	Chromosome 14 (human)
End	23,100,456 bp
Gene location (Mouse)
Chr.	Chromosome 14 (mouse)
End	54,928,198 bp
RNA expression pattern
	Top expressed in
	mucosa of ileum; ; stromal cell of endometrium; ; mucosa of sigmoid colon; ; islet of Langerhans; ; smooth muscle tissue; ; rectum; ; decidua; ; oocyte; ; mucosa of transverse colon; ; right adrenal cortex;
	Top expressed in
	olfactory epithelium; ; medullary collecting duct; ; primitive streak; ; otic placode; ; abdominal wall; ; medial ganglionic eminence; ; endocardial cushion; ; fossa; ; condyle; ; efferent ductule;
	More reference expression data
	n/a
Orthologs
	55017
	58248
	ENSG00000179933
	ENSMUSG00000040822
	Q9NWQ9
	Q9JJ93
	NM_017924
	NM_021437
	NP_060394
	NP_067412
	Wikidata
View/Edit Human	View/Edit Mouse

C14orf119 is a protein that in humans is encoded by the c14orf119 gene. The c14orf119 protein is predicted to be localized in the nucleus.^[5] Additionally, c14orf119 expression is decreased in individuals with systemic lupus erythematosus (SLE) when compared with healthy individual and is increased in individuals with various types of lymphomas when compared to healthy individuals.^[6]^[7]

Gene

The common aliases of c14orf119 are chromosome open reading frame 119 and My028.^[8] The gene is located on chromosome 14, with the specific location of 14q11.2.^[9] It contains two exons and covers 5.76 kb, from 23563900 to 23569660 on the forward strand.^[10] The span of the c14orf119 gene, from the start of transcription to the polyA site, is 4951 basepairs in length.^[11]

Transcripts

The c14orf119 mRNA is composed of 2914 basepairs.^[9] C14orf119 has two isoforms, shown in the table below.

C14orf119 Isoforms^[12]
Name	Accession Number^[13]	Transcript ID	Length
C14orf119-201	NM_017924.4	ENST00000319074.6	2914 nt
C14orf119-202	XM_017021390.2	ENST00000554203.1	725 nt

Protein

The c14orf119 protein is composed of 140 amino acids.^[14] The molecular weight of the c14orf119 protein is approximately 16 kDa and the basal isoelectric point is 4.86.^[15] There is a long section of hydrophobic amino acids at the start of the protein.^[16] There are no additional significant compositional features of the c14orf119 protein, including charge clusters, charge runs, patterns, repetitive structures or multiplets.^[17] The primary sequence of the c14orf119 protein is as follows,

MPLESSSSMP LSFPSLLPSV PHNTNPSPPL MSYITSQEMK CILHWFANWS GPQRERFLED LVAKAVPEKL

QPLLDSLEQL SVSGADRPPS IFECQLHLWD QWFRGWAEQE RNEFVRQLEF SEPDFVAKFY QAVAATAGKD^[18]

There are two known c14orf119 protein isoforms, as shown in the table below.

C14orf119 Protein Isoforms^[19]
Name	Accession Number	Size	Domain Inclusion
Uncharacterized c14orf119 protein	NP_060394.1	140 aa	DUF4508
Uncharacterized c14orf119 protein isoform X1	XP_016876879.1	140 aa	DUF4508

Domains and motifs

There is a domain of unknown function (DUF) found in the c14orf119 protein: DUF4508 (with an E-value of 6.3e-36).^[20] This DUF is a part of a family of proteins that is found in eukaryotes and is typically between 117 and 253 amino acids in length.^[21] Additionally, there are three predicted CK2 phosphorylation sites (at positions 36, 83, and 121) within the c14orf119 protein.^[22]

Figure 1. Phyre2 predicted secondary structure of the c14orf119 protein.^[23]

Secondary structure

The predicted secondary structure of the c14orf119 protein is largely alpha helical in content. The specific makeup of the secondary structure is as follows, alpha helices make up 38.57% of the protein (54 amino acids), extended strands make up 23.57% of the protein (33 amino acids), and random coils make up 37.86% of the protein (53 amino acids).^[24] Phryre2, a program for protein modeling, prediction, and analysis, was used to determine and model the predicted structure of the c14orf119 protein.^[25] Shown in Figure 1, Phyre2 created a model for the predicted structure of 106 (out of a total of 140) residues of the c14orf119 protein, with 79.7% confidence and 76% coverage.^[25]

Tertiary and quaternary structures

With only two cysteines, 52 amino acids apart, found in the c14orf119 protein sequence, there were no predicted disulfide bonds in the c14orf119 protein.^[26]^[17] There are no predicted transmembrane regions or signal peptides in the c14orf119 protein.^[27]^[28]^[29]

Gene level regulation

Promoter

The predicted promoter sequence associated with c14orf119 is 3332 bases in length.^[30] This promoter sequence has one CpG island associated with it, with a CpG count of 78^[30] Additionally, there are a number of transcription factor binding sites associated with this promoter sequence, such as RB1, HNF4A, ETS1, and RBL2.^[31]

Expression pattern

C14orf119 is expressed in 203 organs.^[32] The c14orf119 gene is expressed in a number of tissues and has the highest expression rates in cultured fibroblast cells, with a TPM of 75.63.^[33] There is notable decreased expression of c14orf119 in the following tissues, pancreas, bone marrow, brain, salivary glands, and the liver.^[19]^[34] Additionally, there is notable increased expression of c14orf119 in the adrenal gland, kidney, lung, prostate, thymus, white blood cells, lymph node, and thyroid.^[19] Finally, expression levels of c14orf119 decrease with the development of the kidney and increases with development of the stomach.^[19]

Transcript level regulation

There were no predicted enhancers associated with c14orf119.^[31] There were a number of stem loop formation predictions in both the 5' UTR and 3' UTR of c14orf119.^[35]

miRNA targeting

The miRNA binding sites found in the 3' UTR of c14orf119 include miR-489, miR-1872, and miR-4778-3p; however, there were no miRNA binding sites found in the 5' UTR of c14orf119.^[36]

Protein level regulation

Subcellular localization

The c14orf119 protein is predicted to be located in the nucleus, with a reliability score of 55.5.^[5] However, the protein has a 7.9% basic residue content and a nuclear localization signal (NLS) score of -0.47.^[37] Additionally, there was a predicted ER retention motif at positions 136-139 of the protein.^[37] Finally, there were no N-terminal signal peptides, no cleavage sites for mitochondria, no actinin-type actin-binding motifs, and no N-myristolyation pattern.^[5]

Figure 2. Conceptual translation of c14orf119, which reveals predicted post-translational modifications.^[38]^[39]^[40]^[41]^[42]^[43]

Post-translational modifications

There are a number of post-translational modifications of the c14orf119 protein, all of which are shown on the conceptual translation of c14orf119 in Figure 2.

There are predicted ubiquitination sites at lysine residues at positions 128 and 139.^[44]

There are predicted kinase-specific phosphorylation sites at serines at the following position in the c14orf119 protein sequence, 15, 19, 27, 32, 36, 81, 83, 90, and 121.^[43]^[45] Protein phosphorylation at serine residues can play critical roles in the regulation of protein function and the transmission of signals throughout the cell.^[46]

There are two N-glycosylation sites at positions 25-27 and 48–50.^[42] This type of post-translational modification plays important roles in both the structure and function of some eukaryotic proteins.

Additionally, there are predicted glycation of epsilon amino groups of lysines at the following positions, 40, 64, 69, and 139.^[41] Glycation is a process in which proteins react with reducing sugar molecules, which ultimately impairs the function and changes the characteristics of the protein.^[47]

There are also predicted mammalian mucin type GalNAc-O-glycosylation sites at the following positions, 5, 6, 7, 12, 15, 19, and 24.^[40] GalNAc-type-O-glycosylation is the attachment of a sugar molecule to the oxygen atom of serine or threonine residues in a protein.^[48] O-glycans or the sugars added to the serine or threonine, have various functions, including allowing recognition of foreign material, providing cartilage and tendon flexibility, controlling cell metabolism, and trafficking cells in the immune system.^[49]

There is a predicted SUMOylation sites at the lysine at position 139.^[39] SUMOylation is involved in transcriptional regulation, protein stability, apoptosis, nuclear-cytosolic transport, progression through the cell cycle, and response to stress.^[50]

Finally, there are predicted O-GlcNAc sites at the serines at the following position in the c14orf119 protein, 5, 6, 7, 8, and 83.^[38] This post-translational modification can play various critical roles such as, progression through the cell cycle, response to cellular stress, protein turnover, and protein stability.^[51]

Regulation of expression

Epigenetic

There are varying levels of H3K27ac, H3K4me1, and H3K4me3 throughout the c14orf119 gene.^[31] H3K4me1 has variation in signal strength among different cell lines, which may reflect differences of epigenetic landscapes in these cell lines.^[31] Additionally, there is a strong signal of H3K27ac across the majority of cell lines along the predicted promoter region.^[31] Finally, there is also a strong signal of H3K4me3 across the majority of the cell types along the predicted promoter region, with no signal variation across cell types.^[31]

Figure 3. Date of divergence graph for c14orf119, with comparison to hemoglobin, fibrinogen alpha chain, and cytochrome c.

Homology/evolution

C14orf119 is conserved in both vertebrates and invertebrates, however, it is not conserved in bacteria, archaea, trichoplax, plants or fungi.^[52] The c14orf119 gene is highly conserved in the mammalian orthologs, however, within the non-mammalian orthologs, there are various insertions, especially at the beginning and end of the gene.^[52] This gene does not contain any paralogs or paralogous domains.^[52]

As shown in Figure 3, the c14orf119 gene has evolved moderately quickly when compared to cytochrome c, fibrinogen alpha chain, and hemoglobin. It has evolved faster than both hemoglobin and cytochrome c, but slower than fibrinogen alpha chain.

The table below reveals the various orthologs of the c14orf119 protein. This table includes the date of divergence (DoD) from humans, in million years ago (MYA), accession number, and percent identity and similarity to humans for each ortholog.

C14orf119 Orthologs
Genus and Species	Common Name	Taxonomy - Class	Taxonomy - Order	DoD (MYA)	Accession Number	Sequence Length (aa)	Percent Identity	Percent Similarity
Homo sapiens	Human	Mammalia	Primates	0	NP_060394.1	140	100	100
Mus musculus	Mouse	Mammalia	Rodentia	89	NP_067412.1	142	83.1	90.1
Myotis brandtii	Brandt's Bat	Mammalia	Chiroptera	94	XP_005852873.1	141	86.5	90.8
Callorhinus ursinus	Northern Fur Seal	Mammalia	Carnivora	94	XP_025726115.1	142	88	91.5
Bos taurus	Cattle	Mammalia	Artiodactyla	94	XP_002690553.1	142	88	92.3
Orycteropus afer afer	Aardvark	Mammalia	Tubulidentata	102	XP_007949377.1	140	85.1	89.4
Python bivittatus	Burmese Python	Reptilia	Squamata	318	XP_007441564.1	156	47.8	60.2
Podarcia muralis	Common Wall Lizard	Reptilia	Squamata	318	XP_028559108.1	115	51.8	63.8
Nanorana parkeri	High Himalaya Frog	Amphibia	Anura	351.7	XP_018411628.1	115	45.7	60.7
Larimichthys crocea	Marine Fish	Actinopterygii	Perciformes	433	XP_010740478.3	201	34.5	44.3
Aethina tumida	Small Hive Beetle	Insecta	Coleoptera	736	XP_019869014.1	124	18.6	39.7
Bombus terrestris	Buff-Tailed Bumblebee	Insecta	Hymenoptera	736	XP_020718687.1	125	19	36.1
Photinus pyraliis	Common Eastern Firefly	Insecta	Coleoptera	736	XP_031358233.1	128	19.9	40.4
Pieris rapae	Cabbage White Butterfly	Insecta	Lepidoptera	736	XP_022116245.1	180	20	38.4
Nasonia vitripennis	Small Parasitoid Wasp	Insecta	Hymenoptera	736	XP_031785555.1	121	22.4	42.1
Biomphalaria glabrata	Freshwater Snail	Gastropoda	Basommatophora	736	XP_013090201.1	113	31.7	46.2
Aplysia californica	California Seahorse	Gastropoda	Anaspidea	736	XP_005112416.1	112	32.6	47.9

Function/biochemistry

The function of the c14orf119 protein is not yet well understood by the scientific community.

Interacting proteins

There are a number predicted interacting proteins found in Y2H screens, such as exportin 1 (XPO1), ras homolog family member U (RHOU), deoxyhypusine hydroxylase/monooxygenase (DOHH), hepatocyte nuclear factor 4, alpha (HNF4A), leukocyte receptor cluster member 1 (LENG1), and ubiquitin C (UBC).^[53]^[54]^[55]

Clinical significance

Disease association

Expression of c14orf119 is decreased in individuals with systemic lupus erythematosus (SLE) when compared with healthy individuals.^[6] Furthermore, expression of c14orf119 is increased in individuals with various types of lymphomas when compared to healthy individuals.^[7]

References

^ ^a ^b ^c GRCh38: Ensembl release 89: ENSG00000179933 – Ensembl, May 2017
^ ^a ^b ^c GRCm38: Ensembl release 89: ENSMUSG00000040822 – Ensembl, May 2017
^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
^ ^a ^b ^c "PSORT II page for c14orf119". PSORT II.^{[permanent dead link]}
^ ^a ^b "NCBI GEO Profile for record GDS4889, c14orf119". NCBI GEO.
^ ^a ^b "NCBI GEO Profile for record GDS3516, c14orf119". NCBI GEO.
^ "C14orf119 Gene (Protein Coding)". GeneCards. Retrieved February 26, 2020.
^ ^a ^b "C14orf119 chromosome 14 open reading frame 119 [ Homo sapiens (human) ]". NCBI. Retrieved February 26, 2020.
^ "Homo sapiens gene C14orf119, encoding chromosome 14 open reading frame 119". AceView. Retrieved February 26, 2020.
^ "Genome Data Viewer". ncbi.nlm.nih.gov. Retrieved May 1, 2020.
^ "Gene: C14orf119 ENSG00000179933". Ensembl. Retrieved February 26, 2020.
^ "NC_000014.9 Chromosome 14 Reference GRCh38.p13 Primary Assembly". NCBI Gene. Retrieved April 30, 2020.
^ "uncharacterized protein C14orf119 [Homo sapiens] - Protein - NCBI". ncbi.nlm.nih.gov. Retrieved May 2, 2020.
^ "Uncharacterized protein C14orf119". PhosphoSitePlus. Retrieved February 26, 2020.
^ "Statistical Analysis of Protein Sequences, Compositional Analysis - c14orf119". Statistical Analysis of Protein Sequences, Compositional Analysis. Retrieved May 1, 2020.
^ ^a ^b "Statistical Analysis of Protein Sequences, Compositional Analysis, c14orf119". Statistical Analysis of Protein Sequence (SAPS).
^ "uncharacterized protein C14orf119 [Homo sapiens]". NCBI Protein. Retrieved February 26, 2020.
^ ^a ^b ^c ^d "C14orf119 chromosome 14 open reading frame 119 [Homo sapiens (human)] - Gene - NCBI". ncbi.nlm.nih.gov. Retrieved May 1, 2020.
^ "MotifFinder page for c14orf119 protein". MotifFinder.
^ "Pfam: DUF4508". genome.jp. Retrieved May 1, 2020.
^ "MyHits Motif Scan page for c14orf119 protein". MyHits Motif Scan.
^ "Phyre 2 Results for c14orf119". sbg.bio.ic.ac.uk. Retrieved May 3, 2020.
^ "GOR page for c14orf119 protein". GOR.
^ ^a ^b "Phyre 2 Results for c14orf119". sbg.bio.ic.ac.uk. Retrieved May 1, 2020.^{[permanent dead link]}
^ "DISULFIND - Cysteines Disulfide Bonding State and Connectivity Predictor". disulfind.dsi.unifi.it. Retrieved May 2, 2020.^{[permanent dead link]}
^ "CCTOP - c14orf119 protein". CCTOP. Retrieved May 2, 2020.
^ "DAS-TMfilter prediction results". mendel.imp.ac.at. Archived from the original on February 5, 2018. Retrieved May 2, 2020.
^ "SignalP-5.0". cbs.dtu.dk. Retrieved May 2, 2020.
^ ^a ^b "Human hg38 chr14:23,093,525-23,098,476 UCSC Genome Browser v397". genome.ucsc.edu. Retrieved May 2, 2020.
^ ^a ^b ^c ^d ^e ^f "UCSC Genome Browser page for c14orf119". UCSC Genome Browser.
^ "Uncharacterized Protein c14orf119". ENSEMBL. Retrieved February 25, 2020.
^ "Gene Expression for c14orf119". GTExPortal. Retrieved February 25, 2020.
^ "GDS3113 / 161646". ncbi.nlm.nih.gov. Retrieved May 3, 2020.
^ "Sfold - Software for Statistical Folding and Studies of Regulatory RNAs". sfold.wadsworth.org. Retrieved May 2, 2020.
^ "miRDB - MicroRNA Target Prediction Database". mirdb.org. Retrieved May 2, 2020.
^ ^a ^b "PSORT II - c14orf119". PSORT II. Retrieved April 28, 2020.^{[permanent dead link]}
^ ^a ^b "YinOYang page for c14orf119 protein". YinOYang.
^ ^a ^b "SUMOsp page for c14orf119 protein". SUMOsp. Archived from the original on May 6, 2018. Retrieved May 3, 2020.
^ ^a ^b "NetOGlyc page for c14orf119 protein". NetOGlyc.
^ ^a ^b "NetGlycate page for c14orf119 protein". NetGlycate.
^ ^a ^b "NetNGlyc - c14orf119". NetNGlyc. Retrieved April 28, 2020.
^ ^a ^b "GPS page for the c14orf119 protein". GPS.
^ "Proteins for C14orf119 Gene". GeneCards. Retrieved February 26, 2020.
^ "NetPhos - c14orf119". NetPhos. Retrieved April 30, 2020.
^ Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–1362. doi:10.1006/jmbi.1999.3310. PMID 10600390.
^ Johansen MB, Kiemer L, Brunak S (September 2006). "Analysis and prediction of mammalian protein glycation". Glycobiology. 16 (9): 844–853. doi:10.1093/glycob/cwl009. PMID 16762979.
^ Steentoft C, Vakhrushev SY, Joshi HJ, Kong Y, Vester-Christensen MB, Schjoldager KT, et al. (May 2013). "Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology". The EMBO Journal. 32 (10): 1478–1488. doi:10.1038/emboj.2013.79. PMC 3655468. PMID 23584533.
^ Hounsell EF, Davies MJ, Renouf DV (February 1996). "O-linked protein glycosylation structure and function". Glycoconjugate Journal. 13 (1): 19–26. doi:10.1007/BF01049675. PMID 8785483. S2CID 31369853.
^ Hay RT (April 2005). "SUMO: a history of modification". Molecular Cell. 18 (1): 1–12. doi:10.1016/j.molcel.2005.03.012. PMID 15808504.
^ Hart GW, Slawson C, Ramirez-Correa G, Lagerlof O (July 7, 2011). "Cross talk between O-GlcNAcylation and phosphorylation: roles in signaling, transcription, and chronic disease". Annual Review of Biochemistry. 80 (1): 825–858. doi:10.1146/annurev-biochem-060608-102511. PMC 3294376. PMID 21391816.
^ ^a ^b ^c "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved May 2, 2020.
^ "C14orf119 (My028) Result Summary | BioGRID". thebiogrid.org. Retrieved May 3, 2020.
^ al, David Lynn et. "InnateDB: Systems Biology of the Innate Immune Response". innatedb.com. Retrieved May 3, 2020.
^ "Results - mentha: the interactome browser". mentha.uniroma2.it. Retrieved May 3, 2020.

[refGRCh38Ensembl-1] GRCh38: Ensembl release 89: ENSG00000179933 – Ensembl, May 2017

[refGRCm38Ensembl-2] GRCm38: Ensembl release 89: ENSMUSG00000040822 – Ensembl, May 2017

[3] "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.

[4] "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.

[:0-5] "PSORT II page for c14orf119". PSORT II.^{[permanent dead link]}

[:10-6] "NCBI GEO Profile for record GDS4889, c14orf119". NCBI GEO.

[:11-7] "NCBI GEO Profile for record GDS3516, c14orf119". NCBI GEO.

[8] "C14orf119 Gene (Protein Coding)". GeneCards. Retrieved February 26, 2020.

[:8-9] "C14orf119 chromosome 14 open reading frame 119 [ Homo sapiens (human) ]". NCBI. Retrieved February 26, 2020.

[10] "Homo sapiens gene C14orf119, encoding chromosome 14 open reading frame 119". AceView. Retrieved February 26, 2020.

[11] "Genome Data Viewer". ncbi.nlm.nih.gov. Retrieved May 1, 2020.

[12] "Gene: C14orf119 ENSG00000179933". Ensembl. Retrieved February 26, 2020.

[13] "NC_000014.9 Chromosome 14 Reference GRCh38.p13 Primary Assembly". NCBI Gene. Retrieved April 30, 2020.

[14] "uncharacterized protein C14orf119 [Homo sapiens] - Protein - NCBI". ncbi.nlm.nih.gov. Retrieved May 2, 2020.

[15] "Uncharacterized protein C14orf119". PhosphoSitePlus. Retrieved February 26, 2020.

[16] "Statistical Analysis of Protein Sequences, Compositional Analysis - c14orf119". Statistical Analysis of Protein Sequences, Compositional Analysis. Retrieved May 1, 2020.

[:7-17] "Statistical Analysis of Protein Sequences, Compositional Analysis, c14orf119". Statistical Analysis of Protein Sequence (SAPS).

[18] "uncharacterized protein C14orf119 [Homo sapiens]". NCBI Protein. Retrieved February 26, 2020.

[:4-19] "C14orf119 chromosome 14 open reading frame 119 [Homo sapiens (human)] - Gene - NCBI". ncbi.nlm.nih.gov. Retrieved May 1, 2020.

[20] "MotifFinder page for c14orf119 protein". MotifFinder.

[21] "Pfam: DUF4508". genome.jp. Retrieved May 1, 2020.

[22] "MyHits Motif Scan page for c14orf119 protein". MyHits Motif Scan.

[23] "Phyre 2 Results for c14orf119". sbg.bio.ic.ac.uk. Retrieved May 3, 2020.

[24] "GOR page for c14orf119 protein". GOR.

[:3-25] "Phyre 2 Results for c14orf119". sbg.bio.ic.ac.uk. Retrieved May 1, 2020.^{[permanent dead link]}

[26] "DISULFIND - Cysteines Disulfide Bonding State and Connectivity Predictor". disulfind.dsi.unifi.it. Retrieved May 2, 2020.^{[permanent dead link]}

[27] "CCTOP - c14orf119 protein". CCTOP. Retrieved May 2, 2020.

[28] "DAS-TMfilter prediction results". mendel.imp.ac.at. Archived from the original on February 5, 2018. Retrieved May 2, 2020.

[29] "SignalP-5.0". cbs.dtu.dk. Retrieved May 2, 2020.

[:6-30] "Human hg38 chr14:23,093,525-23,098,476 UCSC Genome Browser v397". genome.ucsc.edu. Retrieved May 2, 2020.

[:1-31] ^ ^a ^b ^c ^d ^e ^f "UCSC Genome Browser page for c14orf119". UCSC Genome Browser.

[32] "Uncharacterized Protein c14orf119". ENSEMBL. Retrieved February 25, 2020.

[33] "Gene Expression for c14orf119". GTExPortal. Retrieved February 25, 2020.

[34] "GDS3113 / 161646". ncbi.nlm.nih.gov. Retrieved May 3, 2020.

[35] "Sfold - Software for Statistical Folding and Studies of Regulatory RNAs". sfold.wadsworth.org. Retrieved May 2, 2020.

[36] "miRDB - MicroRNA Target Prediction Database". mirdb.org. Retrieved May 2, 2020.

[:5-37] "PSORT II - c14orf119". PSORT II. Retrieved April 28, 2020.^{[permanent dead link]}

[:12-38] "YinOYang page for c14orf119 protein". YinOYang.

[:13-39] "SUMOsp page for c14orf119 protein". SUMOsp. Archived from the original on May 6, 2018. Retrieved May 3, 2020.

[:14-40] "NetOGlyc page for c14orf119 protein". NetOGlyc.

[:15-41] "NetGlycate page for c14orf119 protein". NetGlycate.

[:16-42] "NetNGlyc - c14orf119". NetNGlyc. Retrieved April 28, 2020.

[:17-43] "GPS page for the c14orf119 protein". GPS.

[44] "Proteins for C14orf119 Gene". GeneCards. Retrieved February 26, 2020.

[45] "NetPhos - c14orf119". NetPhos. Retrieved April 30, 2020.

[46] Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–1362. doi:10.1006/jmbi.1999.3310. PMID 10600390.

[47] Johansen MB, Kiemer L, Brunak S (September 2006). "Analysis and prediction of mammalian protein glycation". Glycobiology. 16 (9): 844–853. doi:10.1093/glycob/cwl009. PMID 16762979.

[48] Steentoft C, Vakhrushev SY, Joshi HJ, Kong Y, Vester-Christensen MB, Schjoldager KT, et al. (May 2013). "Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology". The EMBO Journal. 32 (10): 1478–1488. doi:10.1038/emboj.2013.79. PMC 3655468. PMID 23584533.

[49] Hounsell EF, Davies MJ, Renouf DV (February 1996). "O-linked protein glycosylation structure and function". Glycoconjugate Journal. 13 (1): 19–26. doi:10.1007/BF01049675. PMID 8785483. S2CID 31369853.

[50] Hay RT (April 2005). "SUMO: a history of modification". Molecular Cell. 18 (1): 1–12. doi:10.1016/j.molcel.2005.03.012. PMID 15808504.

[51] Hart GW, Slawson C, Ramirez-Correa G, Lagerlof O (July 7, 2011). "Cross talk between O-GlcNAcylation and phosphorylation: roles in signaling, transcription, and chronic disease". Annual Review of Biochemistry. 80 (1): 825–858. doi:10.1146/annurev-biochem-060608-102511. PMC 3294376. PMID 21391816.

[:9-52] "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved May 2, 2020.

[53] "C14orf119 (My028) Result Summary | BioGRID". thebiogrid.org. Retrieved May 3, 2020.

[54] , David Lynn et. "InnateDB: Systems Biology of the Innate Immune Response". innatedb.com. Retrieved May 3, 2020.

[55] "Results - mentha: the interactome browser". mentha.uniroma2.it. Retrieved May 3, 2020.

[5]

[6]

[7]

[1]

[2]

[3]

[4]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]