A putative gene is an alignment segment of DNA that is believed to be a gene. Putative genes can share sequence similarities to already characterized genes and thus can be inferred to share a similar function, yet the exact function of putative genes remains unknown.[1] Newly identified sequences are considered putative gene candidates when homologs of those sequences are found to be associated with the phenotype of interest.[2]

Examples

edit

Examples of studies involving putative genes include the discovery of 30 putative receptor genes found in rat vomeronasal organ (VNO)[3] and the identification of 79 putative TATA boxes found in many plant genomes.[4]

Practical importance

edit

In order to define and characterize a biosynthetic gene cluster, all the putative genes within said cluster must first be identified and their functions must be characterized. This can be performed by complementation and knock out experiments. In the process of characterizing putative genes, the genome under study becomes increasingly well understood as more interactions can be identified.[5] Identification of putative genes is necessary to study genomic evolution, as significant proportion of genomes make up larger families of related genes. Genomic evolution occurs by processes such as duplication of individual genes, genome segments, or entire genomes. These processes can result in loss of function, altered function, or gain of function, and have drastic affects on the phenotype.[6][7]

DNA mutations outside of a putative gene can act by positional effect, in which they alter the gene expression. These alterations leave the transcription unit and promoter of the gene intact, but may involve distal promoters, enhancer/silencer elements, or the local chromatin environment. These mutations can be associated with diseases or disorders associated with the gene.

Identification

edit

Putative genes can be identified by clustering large groups of sequences by patterns and arranging by mutual similarity[8] or can be inferred by potential TATA boxes.[9]

Putative genes can also be identified by recognizing differences between well-known gene clusters and gene clusters with a unique profiling.[10]

Software tools have been developed in order to automatically identify putative genes. This is done by searching for gene families and testing the validity of uncharacterized genes by comparison to already identified genes.[11]

Protein products can be identified and used to characterize the putative gene that codes for it.[12]

See also

edit

References

edit
  1. ^ Alexandre S, Guyaux M, Murphy NB, Coquelet H, Pays A, Steinert M, Pays E (June 1988). "Putative genes of a variant-specific antigen gene transcription unit in Trypanosoma brucei". Molecular and Cellular Biology. 8 (6): 2367–78. doi:10.1128/mcb.8.6.2367. PMC 363435. PMID 3405209.
  2. ^ Mishima K, Hirao T, Tsubomura M, Tamura M, Kurita M, Nose M, et al. (April 2018). "Identification of novel putative causative genes and genetic marker for male sterility in Japanese cedar (Cryptomeria japonica D.Don)". BMC Genomics. 19 (1): 277. doi:10.1186/s12864-018-4581-5. PMC 5914023. PMID 29685102.
  3. ^ Dulac C, Axel R (October 1995). "A novel family of genes encoding putative pheromone receptors in mammals". Cell. 83 (2): 195–206. doi:10.1016/0092-8674(95)90161-2. PMID 7585937. S2CID 18784638.
  4. ^ Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128. PMID 3628002.
  5. ^ Wawrzyn GT, Bloch SE, Schmidt-Dannert C (2012-01-01). "Discovery and characterization of terpenoid biosynthetic pathways of fungi". In Hopwood (ed.). Natural Product Biosynthesis by Microorganisms and Plants, Part A. Methods in Enzymology. Vol. 515. pp. 83–105. doi:10.1016/b978-0-12-394290-6.00005-7. ISBN 9780123942906. PMID 22999171.
  6. ^ Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 (2): S19. doi:10.1186/1471-2105-7-S2-S19. PMC 1683565. PMID 17118140.
  7. ^ Emery AE (2013). "Personal Memories of David Rimoin". Emery and Rimoin's Principles and Practice of Medical Genetics. Elsevier. pp. i. doi:10.1016/b978-0-12-383834-6.11001-8. ISBN 978-0-12-383834-6.
  8. ^ Aouf M, Liyanage L (2012-09-26). "Analysis of High Dimensionality Yeast Gene Expression Data Using Data Mining". Applied Mechanics and Materials. 197: 515–522. Bibcode:2012AMM...197..515A. doi:10.4028/www.scientific.net/amm.197.515. S2CID 109965976.
  9. ^ Joshi CP (August 1987). "An inspection of the domain between putative TATA box and translation start site in 79 plant genes". Nucleic Acids Research. 15 (16): 6643–53. doi:10.1093/nar/15.16.6643. PMC 306128. PMID 3628002.
  10. ^ Mihali TK, Carmichael WW, Neilan BA (February 2011). "A putative gene cluster from a Lyngbya wollei bloom that encodes paralytic shellfish toxin biosynthesis". PLOS ONE. 6 (2): e14657. Bibcode:2011PLoSO...614657M. doi:10.1371/journal.pone.0014657. PMC 3037375. PMID 21347365.
  11. ^ Frank RL, Mane A, Ercal F (September 2006). "An automated method for rapid identification of putative gene family members in plants". BMC Bioinformatics. 7 Suppl 2 (2): S19. doi:10.1186/1471-2105-7-S2-S19. PMC 1683565. PMID 17118140.
  12. ^ Denison M, Perlman S (April 1987). "Identification of putative polymerase gene product in cells infected with murine coronavirus A59". Virology. 157 (2): 565–8. doi:10.1016/0042-6822(87)90303-5. PMC 7131660. PMID 3029990.
edit