Regulatory sequences
editEukaryotes vs. Prokaryotes
editRegulatory sequences make up a small portion of the genome that does not encode for proteins [1]. These regulatory sequences determine the level, location and chronology of gene expression [1]. Regulatory sequences primarily dictate transcriptional regulation, that is, if and when a gene is transcribed. The initiation of transcription of a gene is under the influence of promoters, regulatory elements and proteins known as transcription factors [2]. Promoters are regions of DNA that include a sequence responsible for the proper transcription of a protein-coding sequence of DNA, known as a gene [2]. Regulatory elements are regions of non-coding DNA that regulate the transcription of nearby genes [3]. Transcription factors are proteins that interact with promoters, and other specific DNA sequences within the regulatory regions, to determine the pattern of gene expression [2]. While both eukaryotes and prokaryotes require promoters and transcription factors to regulate transcription, they differ in the type of regulatory machinery used.
Prokaryotes
editProkaryotic organisms are single-celled organisms that do not have a defined nucleus [4]. Therefore, the DNA within prokaryotes is found within the cytoplasm, available for simultaneous transcription and translation [4]. The regulation of transcription is the primary method to control protein expression in a prokaryotic cell [4]. Prokaryotes differ from eukaryotes in that they contain operons; a single unit of DNA that acts to regulate the expression of multiple genes at once [3]. These operons react in response to environmental changes and activate transcription of specific genes in response to such changes [3]. Operons contain promoter DNA sequences as well as operator sequences. In bacteria, the promoter contains two short sequence elements approximately -10 and -35 nucleotides upstream from the transcription start site [3]. The sequence at -10 (the -10 element) has the sequence TATAAT, while the sequence at -35 (the -35 element) has sequence TTGACA [3]. These sequences promote the binding of RNA polymerase, allowing transcription of the gene of interest.
Operator
editAn operator is a sequence of DNA found near the promoter of an operon that can be recognized by a repressor protein. When a repressor protein binds to the operator site, transcription of the gene, or genes, is blocked [3].
Repressor
editA repressor is a type of transcription factor that binds to a cis-acting element, such as an operator site, and prevents transcription [3]. When an inducer is present, it binds with the repressor, causing the repressor to change shape and release from the operator. When this happens, the RNA polymerase can proceed with transcription, and protein synthesis begins and continues until another repressor binds with the operator [3].
Eukaryotes
editEukaryotic organisms are multi-cellular and contain organelles which increase their complexity [4]. In eukaryotic cells, the DNA is contained inside of the nucleus where it is transcribed into RNA [3]. The RNA strand is then processed into protein in the cytoplasm outside of the nucleus. Regulation of gene expression can occur at all stages of this process. During transcriptional regulation, promoter sequences determine the site of transcription initiation and direct the binding of RNA polymerase [4]. There are three types of promoter sequences that have been identified in eukaryotic DNA: the TATA box, initiator proteins, and CpG Islands [4]. Eukaryotes also have unique transcriptional factors known as activators, enhancers, and silencers [3]. Activators bind to enhancers, which are discrete DNA units located at point along the chromosome, to form an activator-enhancer complex [3]. The activator-enhancer complex then causes the DNA to bend, so that additional transcription factors have better access to the promoter sequence [3]. These additional transcription factors recruit RNA polymerase to initiate transcription. Silencers are repressor proteins that block transcription through binding to the DNA sequence.
TATA Box
editAn important promoter sequence found in many genes is the TATA Box. The TATA box is a conserved region rich in adenine and thymine and is slightly upstream (approximately 25 to 30 bp) of the start site of transcription [2]. The TATA box is a regulatory sequence found only in eukaryotes and has an important role in the determining the position of the start of transcription[2]. Instead of a TATA Box, some eukaryotic genes contain another promoter element called an initiator. The initiator sequence has a similar function as the TATA box, in that it recruits RNA polymerase in order for transcription to begin.
CpG Islands
editGenes that are expressed ("turned-on") most of the time, in most cells, are known as “house-keeping genes” [2]. These genes contain large amounts of the DNA nucleotides, cytosine and guanine [2]. These parts of the DNA sequences with large amounts of cytosine and guanine are known as CpG islands. These CpG islands are found in the promoters of these genes and serve as binding sites for transcription factors [2] . CpG Islands are also targets for DNA methylation [2]. DNA methylation occurs when a chemical compound, known methyl group, is added to the DNA sequence. When DNA methylation occurs at a gene promoter sequence, the expression of the gene is modified. When a CG-promoter is methylated, gene transcription is inhibited [3].
- ^ a b Pennacchio, L.A; Rubin, E.M. (2001). "Genomic strategies to identify mammalian regulatory sequences". Nature Reviews Genetics (2: 100-109).
- ^ a b c d e f g h i Nussbaum, Robert L.; McInnes, Roderick R.; Willard, Huntington (2015). Thompson&Thompson Genetics In Medicine. Philadelphia: Elsevier. pp. 30–35. ISBN 978-1-4377-0696-3.
- ^ a b c d e f g h i j k l m Hartwell, L.H. (2014). Genetics: From Genes to Genomes (Canadian ed.). Canada: McGraw-Hill Ryerson Limited.
- ^ a b c d e f Lodish, H. (2000). Molecular Cell Biology (4th ed.). New York: W. H. Freemam.