Copied from GC box article for editing:
In molecular biology, a GC box is a distinct pattern of nucleotides found in the promoter region of some eukaryotic genes upstream of the TATA box and approximately 110 bases upstream from the transcription initiation site. It has a consensus sequence GGGCGG which is position dependent and orientation independent. The GC elements are bound by transcription factors and have similar functions to enhancers.
Edited version:
In molecular biology, a GC box, also known as a GSG box,[1] is a distinct pattern of nucleotides found in the promoter region of some eukaryotic genes. The GC box lies upstream of the TATA box, approximately 110 bases upstream of the transcription initiation site. Its consensus sequence, GGGCGG, is position-dependent and orientation-independent. GC elements are bound by transcription factors and have functions similar to those of enhancers. Known GC box-binding proteins include Sp1, Krox/Egr, the Wilms' tumor protein, MIG1, and CREA.[1]
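Orientation independence means the GGGCGG consensus can be read from either DNA strand. The following is a minimal sketch of how one might scan a sequence for the motif in both orientations. The function names `reverse_complement` and `find_gc_boxes` and the example sequence are hypothetical, chosen only for illustration, and the sketch matches the exact consensus only, not degenerate variants.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def find_gc_boxes(seq: str, motif: str = "GGGCGG") -> list[tuple[int, str]]:
    """Find exact matches of the motif on either strand.

    Returns (0-based start index on the given strand, strand) pairs, where
    strand "+" means the motif reads 5'->3' on the given sequence and "-"
    means it reads 5'->3' on the complementary strand.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            # Advance by one so overlapping matches are not missed.
            start = seq.find(pattern, start + 1)
    return sorted(hits)


# Made-up example sequence containing one GC box in each orientation.
example = "ATGGGCGGTTCCGCCCAT"
print(find_gc_boxes(example))
```

On the opposite strand the motif appears as CCGCCC when read along the given sequence, which is why the sketch searches for the reverse complement as well.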
The GC box is commonly the binding site for zinc-finger proteins. An alpha-helical section of each finger fits into the major groove of the DNA. Each zinc finger binds a triplet of base pairs, with residue 21 contacting the first base pair, residue 18 the second, and residue 15 the third. The triplet can be either GGG or GCG: if residue 18 is a histidine, it binds a G, and if residue 18 is a glutamic acid, it binds a C. GC box-binding zinc-finger proteins have between 2 and 4 fingers, so they interact with sequences that are 6 to 12 base pairs in length.[1]
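The residue 18 rule described above can be written as a small lookup. This is a sketch under stated assumptions, not a general zinc-finger recognition code: it covers only the two cases given in the text (histidine and glutamic acid, in one-letter codes H and E), it fixes the first and third bases to G because the text gives the triplet as either GGG or GCG, and the names `predicted_triplet` and `site_length` are hypothetical.

```python
# Middle base of the triplet as determined by residue 18 (per the text):
# histidine (H) pairs with G, glutamic acid (E) pairs with C.
MIDDLE_BASE_BY_RESIDUE_18 = {"H": "G", "E": "C"}


def predicted_triplet(residue_18: str) -> str:
    """Return the triplet (GGG or GCG) implied by the residue at position 18."""
    key = residue_18.upper()
    if key not in MIDDLE_BASE_BY_RESIDUE_18:
        raise ValueError(f"no rule given for residue 18 = {residue_18!r}")
    # Assumption: first and third bases are G, since only GGG and GCG occur.
    return "G" + MIDDLE_BASE_BY_RESIDUE_18[key] + "G"


def site_length(n_fingers: int) -> int:
    """Each finger contacts one triplet, so the site spans 3 bp per finger."""
    return 3 * n_fingers


print(predicted_triplet("H"))  # histidine at residue 18
print(predicted_triplet("E"))  # glutamic acid at residue 18
print(site_length(2))          # a two-finger protein
```

The sketch deliberately does not assemble the triplets into a full binding site, because the text does not state the order in which successive fingers read along the DNA.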
- [1] Lundin, M.; Nehlin, J. O.; Ronne, H. (1994-03-01). "Importance of a flanking AT-rich region in target site recognition by the GC box-binding zinc finger protein MIG1". Molecular and Cellular Biology. 14 (3): 1979–1985. doi:10.1128/MCB.14.3.1979. ISSN 0270-7306. PMID 8114729.