Cross-linking immunoprecipitation

Cross-linking and immunoprecipitation (CLIP, or CLIP-seq) is a method used in molecular biology that combines UV crosslinking with immunoprecipitation in order to identify RNA binding sites of proteins on a transcriptome-wide scale, thereby increasing our understanding of post-transcriptional regulatory networks.[1][2][3] CLIP can be used either with antibodies against endogenous proteins, or with common peptide tags (including FLAG, V5, HA, and others) or affinity purification, which enables the possibility of profiling model organisms or RBPs otherwise lacking suitable antibodies.[4]

Workflow

edit
 
Basic Principle of CLIP

CLIP begins with the in vivo cross-linking of RNA-protein complexes using ultraviolet light (UV). Upon UV exposure, covalent bonds are formed between proteins and nucleic acids that are in close proximity (on the order of Angstroms apart).[5] The cross-linked cells are then lysed, RNA is fragmented, and the protein of interest is isolated via immunoprecipitation. In order to allow for priming of reverse transcription, RNA adapters are ligated to the 3' ends, and RNA fragments are labelled to enable the analysis of the RNA-protein complexes after they have been separated from free RNA using gel electrophoresis and membrane transfer. Proteinase K digestion is then performed in order to remove protein from the crosslinked RNA, which leaves a few amino acids at the crosslink site. This often leads to truncation of cDNAs at the crosslinked nucleotide, which is exploited in variants such as iCLIP to increase the resolution of the method.[6] cDNA is then synthesized via RT-PCR followed by high-throughput sequencing followed by mapping the reads back to the transcriptome and other computational analyses to study the interaction sites.[2]

History and applications

edit

CLIP was originally undertaken to study interactions between the neuron-specific RNA-binding protein and splicing factors NOVA1 and NOVA2 in the mouse brain, identifying RNA binding sites that contained the expected Nova-binding motifs. Sequencing of the cDNA library identified many positions close to alternative exons, several of which were found to require Nova1/2 for their brains-specific splicing patterns.[1] In 2008, CLIP was combined with high-throughput sequencing (termed "HITS-CLIP") to generate genome-wide protein-RNA interaction maps for Nova;[7] since then a number of other RNA-binding proteins have been studied with CLIP, including PTBP1,[8] RbFox2 (where it was referred to as "CLIP-seq"),[9] SFRS1,[10] Argonaute,[11][12][13] hnRNP C,[6] the Fragile-X mental retardation protein FMRP,[14][15] Ptbp2 (in the mouse brain),[16] Mbnl2,[17] the nElavl proteins (the neuron-specific Hu proteins),[18] and has been applied to RNA binding proteins from all kingdoms of life, including prokaryotes.[19] CLIP analysis of the RNA-binding protein Argonaute led to identification of microRNA targets[20] by decoding microRNA-mRNA and protein-RNA interaction maps in the mouse brain[11][21] and subsequently in budding yeast (Saccharomyces cerevisiae),[22] Caenorhabditis elegans,[23] embryonic stem cells[24] and tissue culture cells.[25]

Methods

edit

HITS-CLIP

edit
 
HITS-CLIP[26][27]

HITS-CLIP combines UV cross-linking and immunoprecipitation with high-throughput sequencing to identify binding sites of RNA-binding proteins.[5] HITS-CLIP also introduced the addition of dinucleotide barcodes to primers, providing the ability to sequence and then deconvolute multiple experiments simultaneously.[7] With analysis of cross-linking induced mutation sites (CIMS) at high sequencing depths, crosslink sites can be differentiated from other sources of sequence variation.[28]

PAR-CLIP

edit
 
PAR-CLIP[26][27]

PAR-CLIP (photoactivatable ribonucleoside–enhanced cross-linking and immunoprecipitation) is also used for identifying the binding sites of cellular RNA-binding proteins (RBPs) and microRNA-containing ribonucleoprotein complexes (miRNPs).[25] The method relies on the incorporation of photoreactive ribonucleoside analogs, such as 4-thiouridine (4-SU) and 6-thioguanosine (6-SG) into nascent RNA transcripts by living cells. Irradiation of the cells by UV light of 365 nm induces efficient cross-linking of photoreactive nucleoside-labeled cellular RNAs to interacting RBPs. Immunoprecipitation of the RBP of interest is followed by the isolation of the cross-linked and co-immunoprecipitated RNA. The isolated RNA is converted into a cDNA library and deep sequenced using high-throughput sequencing technology. Cross-linking the 4-SU and 6-SG analogs results in thymidine to cytidine, and guanosine to adenosine transitions respectively. As a result, PAR-CLIP can identify binding site locations with high accuracy.

However, PAR-CLIP is limited mainly to cultured cells, and nucleoside cytotoxicity is a concern;[2] it has been reported that 4-SU inhibits ribosomal RNA synthesis, induces a nucleolar stress response, and reduces cell proliferation.[29] PAR-CLIP has been employed to determine the transcriptome-wide binding sites of several known RBPs and microRNA-containing ribonucleoprotein complexes at high resolution. This includes the miRNA targeting AGO and TNRC6 proteins.[21]

iCLIP

edit
 
iCLIP[26][27]

iCLIP (individual nucleotide–resolution crosslinking and immunoprecipitation) is a variant of CLIP that enabled amplification of truncated cDNAs, which are produced when reverse transcription stops prematurely at the cross-link site.[6] Other approaches to identify protein-RNA crosslink sites include mutational analysis of read-through cDNAs, such as nucleotide transitions in PAR-CLIP,[25] or rare errors introduced by reverse transcriptase when it reads through the crosslink sites in standard HITS-CLIP methods, termed Crosslink induced mutation site (CIMS) analysis.[30]

iCLIP also added a random sequence (unique molecular identifier, UMI) along with experimental barcodes to the primer used for reverse transcription, thereby barcoding unique cDNAs to minimise any errors or quantitative biases of PCR, and thus improving the quantification of binding events. Enabling amplification of truncated cDNAs led to identification of the sites of RNA-protein interactions at high resolution by analysing the starting position of truncated cDNAs, as well as their precise quantification using UMIs with software called "iCount". These innovations of iCLIP were adopted by later variants of CLIP such as eCLIP and irCLIP.[4] Another modification of iCLIP, miCLIP, identifies methylated RNA sites with use of mutant enzyme or modification-specific antibody.[31][32][2] The quantitative nature of iCLIP enabled comparison across samples at the level of full RNAs,[33] or to study competitive binding of multiple RNA-binding proteins[34] or subtle changes in binding of a mutant protein at the level of binding peaks.[35]

eCLIP

edit

eCLIP (enhanced CrossLinking and ImmunoPrecipitation followed by high-throughput sequencing) is also used to map RBP binding sites on RNAs transcriptome-wide.[36] eCLIP was designed to improve upon iCLIP by increasing the efficiency in converting purified RNA fragments into cDNA library. At its publication, eCLIP was reported to increase such efficiency by >1000-fold, which not only decreases wasted sequencing of PCR duplicate molecules, but also dramatically decreases experimental failures during the CLIP procedure. Additionally, the amplification in eCLIP is now comparable to RNA-seq, enabling rigorous quantitative normalization against paired input controls (to remove background at ribosomal and other highly abundant RNAs) as well as quantitative comparison across peaks and samples, enabling the ability to detect allele-specific binding or differential RNA binding between conditions.
As in other CLIP methods, eCLIP relies on RBP-RNA interactions covalently linked using UV crosslinking of live cells. Cells are then lysed, and RNA is fragmented using limited RNase treatment. A specific RBP (and its bound RNA) is then immunoprecipitated using an antibody that specifically recognizes the targeted RBP. After ligation of a 3’ RNA adapter, immunoprecipitated material (as well as a paired input sample) are run on denaturing protein gels and transferred to nitrocellulose membranes. A region from the protein size to 75 kDa above is cut from the membrane and treated with Proteinase K to release RNA. After cleanup, RNA is then reverse transcribed to ssDNA, when a second adapter is ligated. By ligating the second adapter to cDNAs, eCLIP can identify truncated cDNAs, similar to iCLIP, and thereby study RNA-protein interaction sites with high resolution. PCR amplification is then used to obtain sufficient material for high-throughput sequencing. eCLIP can also be used to identify miRNA targets and profile RNA modifications such as m6A.
eCLIP datasets have been produced for over 150 RBPs with validated commercially available antibodies.[37]

Other CLIP methods

edit

sCLIP (simple CLIP) is a technique that requires lower amounts of input RNA and omits radio-labeling of the immunoprecipitated RNA. The method is based on linear amplification of the immunoprecipitated RNA and thereby improves the complexity of the sequencing-library despite significantly reducing the amount of input material and omitting several purification steps. Additionally, it permits a radiolabel-free visualization of immunoprecipitated RNA by using a highly sensitive biotin-based labeling technique. Along with a bioinformatical platform this method is designed to provide deep insights into RNA–protein interactomes in biomedical science, where the amount of starting material is often limited (i.e. in case of precious clinical samples).[38] Additional iCLIP variants have also been developed that retain the individual nucleotide resolution but differ in one or more steps from the original iCLIP method. These include iCLIP2, irCLIP, iiCLIP, and iCLIP1.5, a few to name.

As a modification of CLIP, methylated RNA sites were identified with the use of mutant enzyme or modification-specific antibody with the methods termed miCLIP or m6A-CLIP.[31][32][39][2]

Advantages and limitations

edit

Advantages

edit

RNA-binding proteins are frequently components of multi-protein complexes, and RNAs from various genes are present in cells at a range of abundance, therefore it is common that RNAs bound to co-purified proteins or non-specifically sticking to beads may be isolated when immunoprecipitating a specific protein. The data specificity obtained using early immunoprecipitation methods such as RIP have been demonstrated to be dependent on the reaction conditions of the experiment, such as protein concentrations and ionic conditions, and reassociation of RNA-binding proteins following cell lysis could lead to detection of artificial interactions.[40] Formaldehyde crosslinking methods have been used to preserve RNA-protein interactions, but these also generate protein-protein cross-links. By employing UV crosslinking that is specific to direct protein-RNA contacts, CLIP avoid protein-protein cross-links and ensures high specificity, while also obtaining positional information on the sites of protein-RNA interactions.

Since UV crosslinking creates a covalent bond, the crosslinked RNA fragments retain a short peptide after Proteinase K digestion, which can be exploited to identify the crosslink site. Reverse transcription most often truncates at the crosslink sites, creating truncated cDNAs that are exploited by iCLIP, while read-through cDNAs often contain mutations at the crosslink site (see HITS-CLIP and PAR-CLIP).[2]

Limitations

edit
 
CLIP Summary[26][28][27]

All CLIP library generation protocols require moderate quantities of cells or tissue (50–100 mg), require numerous enzymatic steps, and customised computational analyses.[12][41] Certain steps are difficult to optimize and frequently have low efficiencies. For example, overdigestion with RNase can decrease the number of identified binding sites and thus needs to be optimised.[27] Crosslinking efficiency also varies between proteins,[42] and nucleotide bias of crosslinking has been reported,[43] for example by comparing cross-linking sites and motifs enriched when protein-RNA complexes are studied in vivo in living cells and in vitro,[44] though methods are being developed to minimise such bias for enriched motif discovery.[45] Computationally predicted miRNA targets derived from TargetScan are comparable to CLIP in identifying miRNA targets, raising questions as to its utility relative to existing predictions.[46] Because CLIP methods rely on immunoprecipitation, crosslinked RNA could in some cases affect antibody-epitope interactions. Finally, significant differences have been observed. Therefore, raw CLIP results require further computational analyses to thoroughly investigate RNA-protein binding site interactions within the cell.

Similar methods

edit
  • RIP-Chip studies enrichment of full RNAs after immunoprecipitation of specific protein followed by microarray analysis, but without using cross-linking, it does not identify RNA binding sites
  • ChIP-Seq, method for finding interactions with DNA rather than RNA
  • SELEX, an in vitro method for finding a consensus binding sequence

Further reading

edit
  • starBase database: a database for exploring miRNA-lncRNA, miRNA-mRNA, miRNA-sncRNA, miRNA-circRNA, protein-lncRNA, protein-RNA interactions and ceRNA networks from PAR-CLIP(CLIP-Seq, HITS-CLIP, iCLIP, CLASH) data, and TargetScan,[46] PicTar, RNA22, miRanda and PITA microRNA target sites.
  • BIMSB doRiNA database: a database for exploring protein-RNA and microRNA-target interactions from CLIP-Seq, HITS-CLIP, PAR-CLIP, iCLIP data and PICTAR microRNA target site predictions.
  • miRTarCLIP: A computational approach for identifying microRNA-target interactions using high-throughput CLIP and PAR-CLIP sequencing.
  • clipz: a pipeline to analyze short RNA reads from HITS-CLIP experiments.
  • dCLIP: dCLIP is a Perl program for discovering differential binding regions in two comparative CLIP-Seq (HITS-CLIP, PAR-CLIP or iCLIP) experiments.

References

edit
  1. ^ a b Ule, Jernej; Jensen, Kirk B.; Ruggiu, Matteo; Mele, Aldo; Ule, Aljaz; Darnell, Robert B. (2003-11-14). "CLIP identifies Nova-regulated RNA networks in the brain". Science. 302 (5648): 1212–1215. Bibcode:2003Sci...302.1212U. doi:10.1126/science.1090095. ISSN 1095-9203. PMID 14615540. S2CID 23420615.
  2. ^ a b c d e f Hafner, Markus; Katsantoni, Maria; Köster, Tino; Marks, James; Mukherjee, Joyita; Staiger, Dorothee; Ule, Jernej; Zavolan, Mihaela (2021-03-04). "CLIP and complementary methods". Nature Reviews Methods Primers. 1 (1): 1–23. doi:10.1038/s43586-021-00018-1. ISSN 2662-8449. S2CID 233834798.
  3. ^ Ule, Jernej; Hwang, Hun-Way; Darnell, Robert B. (2018-08-01). "The Future of Cross-Linking and Immunoprecipitation (CLIP)". Cold Spring Harbor Perspectives in Biology. 10 (8): a032243. doi:10.1101/cshperspect.a032243. ISSN 1943-0264. PMC 6071486. PMID 30068528.
  4. ^ a b Lee, Flora C. Y.; Ule, Jernej (2018-02-01). "Advances in CLIP Technologies for Studies of Protein-RNA Interactions". Molecular Cell. 69 (3): 354–369. doi:10.1016/j.molcel.2018.01.005. ISSN 1097-4164. PMID 29395060.
  5. ^ a b Darnell 2010.
  6. ^ a b c König, Julian; Zarnack, Kathi; Rot, Gregor; Curk, Tomaz; Kayikci, Melis; Zupan, Blaz; Turner, Daniel J.; Luscombe, Nicholas M.; Ule, Jernej (July 2010). "iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution". Nature Structural & Molecular Biology. 17 (7): 909–915. doi:10.1038/nsmb.1838. ISSN 1545-9985. PMC 3000544. PMID 20601959.
  7. ^ a b Licatalosi et al. 2008.
  8. ^ Xue et al. 2009.
  9. ^ Yeo et al. 2009.
  10. ^ Sanford et al. 2009.
  11. ^ a b Chi et al. 2009.
  12. ^ a b Moore et al. 2014.
  13. ^ Chi, Sung Wook; Hannon, Gregory J.; Darnell, Robert B. (2012-02-12). "An alternative mode of microRNA target recognition". Nature Structural & Molecular Biology. 19 (3): 321–327. doi:10.1038/nsmb.2230. ISSN 1545-9985. PMC 3541676. PMID 22343717.
  14. ^ Darnell et al. 2011.
  15. ^ Hale, Caryn R.; Sawicka, Kirsty; Mora, Kevin; Fak, John J.; Kang, Jin Joo; Cutrim, Paula; Cialowicz, Katarzyna; Carroll, Thomas S.; Darnell, Robert B. (2021-12-23). "FMRP regulates mRNAs encoding distinct functions in the cell body and dendrites of CA1 pyramidal neurons". eLife. 10: e71892. doi:10.7554/eLife.71892. ISSN 2050-084X. PMC 8820740. PMID 34939924.
  16. ^ Licatalosi et al. 2012.
  17. ^ Charizanis et al. 2012.
  18. ^ Ince-Dunn et al. 2012.
  19. ^ Holmqvist et al. 2016.
  20. ^ Thomson, Bracken & Goodall 2011.
  21. ^ a b Yang et al. 2011.
  22. ^ Wolf, Joshua J.; Dowell, Robin D.; Mahony, Shaun; Rabani, Michal; Gifford, David K.; Fink, Gerald R. (June 2010). "Feed-forward regulation of a cell fate determinant by an RNA-binding protein generates asymmetry in yeast". Genetics. 185 (2): 513–522. doi:10.1534/genetics.110.113944. ISSN 1943-2631. PMC 2881133. PMID 20382833.
  23. ^ Zisoulis et al. 2010.
  24. ^ Leung et al. 2011.
  25. ^ a b c Hafner, Markus; Landthaler, Markus; Burger, Lukas; Khorshid, Mohsen; Hausser, Jean; Berninger, Philipp; Rothballer, Andrea; Ascano, Manuel; Jungkamp, Anna-Carina; Munschauer, Mathias; Ulrich, Alexander; Wardle, Greg S.; Dewell, Scott; Zavolan, Mihaela; Tuschl, Thomas (2010-04-02). "Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP". Cell. 141 (1): 129–141. doi:10.1016/j.cell.2010.03.009. ISSN 1097-4172. PMC 2861495. PMID 20371350.
  26. ^ a b c d Sugimoto et al. 2012.
  27. ^ a b c d e König et al. 2012.
  28. ^ a b Zhang & Darnell 2011.
  29. ^ Burger et al. 2013.
  30. ^ Zhang, Chaolin; Darnell, Robert B (1 June 2011). "Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data". Nature Biotechnology. 29 (7): 607–614. doi:10.1038/nbt.1873. PMC 3400429. PMID 21633356.
  31. ^ a b Hussain, Shobbir; Sajini, Abdulrahim A.; Blanco, Sandra; Dietmann, Sabine; Lombard, Patrick; Sugimoto, Yoichiro; Paramor, Maike; Gleeson, Joseph G.; Odom, Duncan T.; Ule, Jernej; Frye, Michaela (2013-07-25). "NSun2-mediated cytosine-5 methylation of vault noncoding RNA determines its processing into regulatory small RNAs". Cell Reports. 4 (2): 255–261. doi:10.1016/j.celrep.2013.06.029. ISSN 2211-1247. PMC 3730056. PMID 23871666.
  32. ^ a b Linder, Bastian; Grozhik, Anya V.; Olarerin-George, Anthony O.; Meydan, Cem; Mason, Christopher E.; Jaffrey, Samie R. (August 2015). "Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome". Nature Methods. 12 (8): 767–772. doi:10.1038/nmeth.3453. ISSN 1548-7105. PMC 4487409. PMID 26121403.
  33. ^ Tollervey, James R.; Curk, Tomaž; Rogelj, Boris; Briese, Michael; Cereda, Matteo; Kayikci, Melis; König, Julian; Hortobágyi, Tibor; Nishimura, Agnes L.; Zupunski, Vera; Patani, Rickie; Chandran, Siddharthan; Rot, Gregor; Zupan, Blaž; Shaw, Christopher E. (April 2011). "Characterizing the RNA targets and position-dependent splicing regulation by TDP-43". Nature Neuroscience. 14 (4): 452–458. doi:10.1038/nn.2778. ISSN 1546-1726. PMC 3108889. PMID 21358640.
  34. ^ Zarnack, Kathi; König, Julian; Tajnik, Mojca; Martincorena, Iñigo; Eustermann, Sebastian; Stévant, Isabelle; Reyes, Alejandro; Anders, Simon; Luscombe, Nicholas M.; Ule, Jernej (2013-01-31). "Direct competition between hnRNP C and U2AF65 protects the transcriptome from the exonization of Alu elements". Cell. 152 (3): 453–466. doi:10.1016/j.cell.2012.12.023. ISSN 1097-4172. PMC 3629564. PMID 23374342.
  35. ^ Hallegger, Martina; Chakrabarti, Anob M.; Lee, Flora C. Y.; Lee, Bo Lim; Amalietti, Aram G.; Odeh, Hana M.; Copley, Katie E.; Rubien, Jack D.; Portz, Bede; Kuret, Klara; Huppertz, Ina; Rau, Frédérique; Patani, Rickie; Fawzi, Nicolas L.; Shorter, James (2021-09-02). "TDP-43 condensation properties specify its RNA-binding and regulatory repertoire". Cell. 184 (18): 4680–4696.e22. doi:10.1016/j.cell.2021.07.018. ISSN 1097-4172. PMC 8445024. PMID 34380047.
  36. ^ Van Nostrand, Pratt & Shishkin 2016.
  37. ^ Van Nostrand, Eric L.; Freese, Peter; Pratt, Gabriel A.; Wang, Xiaofeng; Wei, Xintao; Xiao, Rui; Blue, Steven M.; Chen, Jia-Yu; Cody, Neal A. L.; Dominguez, Daniel; Olson, Sara; Sundararaman, Balaji; Zhan, Lijun; Bazile, Cassandra; Bouvrette, Louis Philip Benoit (July 2020). "A large-scale binding and functional map of human RNA-binding proteins". Nature. 583 (7818): 711–719. Bibcode:2020Natur.583..711V. doi:10.1038/s41586-020-2077-3. ISSN 1476-4687. PMC 7410833. PMID 32728246.
  38. ^ Kargapolova et al. 2017.
  39. ^ Ke, Shengdong; Alemu, Endalkachew A.; Mertens, Claudia; Gantman, Emily Conn; Fak, John J.; Mele, Aldo; Haripal, Bhagwattie; Zucker-Scharff, Ilana; Moore, Michael J.; Park, Christopher Y.; Vågbø, Cathrine Broberg; Kusśnierczyk, Anna; Klungland, Arne; Darnell, James E.; Darnell, Robert B. (2015-10-01). "A majority of m6A residues are in the last exons, allowing the potential for 3' UTR regulation". Genes & Development. 29 (19): 2037–2053. doi:10.1101/gad.269415.115. ISSN 1549-5477. PMC 4604345. PMID 26404942.
  40. ^ Mili & Steitz 2004.
  41. ^ Chakrabarti, Anob M.; Haberman, Nejc; Praznik, Arne; Luscombe, Nicholas M.; Ule, Jernej (2018-07-20). "Data Science Issues in Studying Protein–RNA Interactions with CLIP Technologies". Annual Review of Biomedical Data Science. 1 (1): 235–261. doi:10.1146/annurev-biodatasci-080917-013525. ISSN 2574-3414. PMC 7614488. PMID 37123514. S2CID 90760475.
  42. ^ Ule et al. 2005.
  43. ^ Knörlein, Anna; Sarnowski, Chris P.; de Vries, Tebbe; Stoltz, Moritz; Götze, Michael; Aebersold, Ruedi; Allain, Frédéric H.-T.; Leitner, Alexander; Hall, Jonathan (2022-05-17). "Nucleotide-amino acid π-stacking interactions initiate photo cross-linking in RNA-protein complexes". Nature Communications. 13 (1): 2719. Bibcode:2022NatCo..13.2719K. doi:10.1038/s41467-022-30284-w. ISSN 2041-1723. PMC 9114321. PMID 35581222.
  44. ^ Bohnsack et al. 2009.
  45. ^ Kuret, Klara; Amalietti, Aram Gustav; Jones, D. Marc; Capitanchik, Charlotte; Ule, Jernej (2022-09-09). "Positional motif analysis reveals the extent of specificity of protein-RNA interactions observed by CLIP". Genome Biology. 23 (1): 191. doi:10.1186/s13059-022-02755-2. ISSN 1474-760X. PMC 9461102. PMID 36085079.
  46. ^ a b Agarwal et al. 2015.

Sources

edit