WO2018096356A1 - Methods for conditional gene knock-out - Google Patents
Methods for conditional gene knock-out Download PDFInfo
- Publication number
- WO2018096356A1 WO2018096356A1 PCT/GB2017/053546 GB2017053546W WO2018096356A1 WO 2018096356 A1 WO2018096356 A1 WO 2018096356A1 GB 2017053546 W GB2017053546 W GB 2017053546W WO 2018096356 A1 WO2018096356 A1 WO 2018096356A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- intron
- sequence
- gene
- recombinase
- nuclease
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
Definitions
- This invention relates to a system and method for generating conditional gene knock-out cells.
- the system includes the use of an artificial intron to modify a target allele to permit conditional knock-out of the allele by disruption of the artificial intron.
- Genome editing has been revolutionized by the discovery of the C ISP /Cas technology that enables the targeted introduction of double strand breaks into the human genome at unprecedented precision and efficiency 1"4 . Once the double-strand break has been
- the cell will repair the break by one of two common pathways: in the presence of a homology template, the cell will use homology-directed repair or homologous
- the cell will employ non- homologous end joining to repair the break 6 .
- indels small insertions or deletions
- Gene knockouts are strains containing a frameshift mutation or, ultimately, a stop codon in coding exons and thus represent complete loss-of-function mutants. They are fundamental for our understanding of gene function as they enable the study of the corresponding loss-of- function phenotype.
- the homology donor was then used to establish genetically engineered cells by spontaneous homologous recombination (e.g. in mouse embryonic stem cells) or, more recently, by C ISP -assisted homologous recombination.
- spontaneous homologous recombination e.g. in mouse embryonic stem cells
- C ISP -assisted homologous recombination e.g. in C ISP -assisted homologous recombination.
- degrons include the auxin/ mAID system 9 , the HaloPROTAC3/ HaloTag system 10 and the HCV protease inhibitor/ SMAShTag system 11 .
- degrons has two major limitations: (i) The bi-allelic tagging of genes occurs at very low frequency and thus, it is difficult to obtain a cell line or an organism in which all copies of a gene have been conditionally inactivated and (ii) The insertion of foreign sequences within the coding region of a gene often disturbs protein folding or protein function. As a consequence, degron tagging may be less predictable than desired.
- FLIP invertible intronic cassette
- puroR puromycin resistance gene
- polyadenylation signal pA
- pA polyadenylation signal
- This system has several shortcomings: (i) The cassette that is used as a donor is large (>2kb; the exact size not evident from the publication) and hence, the assembly of targeting constructs with homology arms is costly, cumbersome and time- consuming; (ii) The cassette contains an antibiotic resistance gene which is not always desired and (iii) Expression of the gene of interest is disrupted by a splice acceptor whose efficiency can vary depending on genomic context. Accordingly, there is a need for an alternative conditional gene knock out method.
- the present invention is intended to address the disadvantages of the prior art.
- the inventors have determined a method which is intended to significantly reduce the size of the artificial intron that is used and ensure that gene expression can be rapidly and efficiently abrogated in a wide range of mammalian genes.
- the artificial intron size may be small so that homology donors can be easily assembled by gene synthesis; (ii) the approach does not require the use of antibiotic selection; (iii) the cassette disrupts gene expression completely, i.e. reveals a stop codon upon cassette activation; (iv) the cassette has a good "window of opportunity", so that protein levels are strongly decreased as a consequence of cassette activation; and (v) the approach is scalable and success is predictable for most human genes without too much prior insight into protein function or folding.
- an artificial intron cassette that may contain as few as about 200 nucleotides. Insertion of this cassette into the coding exon of a gene does not disrupt gene function as the cassette is based on a natural intron sequence and the intron is thus removed by splicing. Addition of Cre recombinase (e.g. by addition of a recombinant adenovirus bearing Cre recombinase) leads to the deletion of critical elements within the intron that are required to maintain splicing, most importantly: the branch site. As a consequence, the cassette will no longer be removed and the ribosome will run into a stop codon, thus disrupting the expression of the gene of interest upon addition of Cre recombinase. The resulting cell line contains a frameshift and thus represents a complete gene knockout.
- Described herein is a system that allows the generation of conditional gene knockouts on a large scale with the following features: (i) engineering is straightforward in diploid or even polyploid cells; (ii) the approach does not require any prior insight into the structure/ function of the targeted gene product; (iii) in the "off-state" of the cassette, the activity of the target gene product is largely unaffected; (iv) the cassette activation (transition to "on-state”) triggers a profound decrease in gene expression.
- DECAI Degradation via Cre-regulated Artificial Intron
- the present inventors have now developed a new conditional gene knock-out system that overcomes the above mentioned limitations.
- the system uses an artificial intron sequence which, when introduced into the coding exon of a target gene, permits normal gene function because the artificial intron is based on a natural intron sequence and is thus removed by splicing.
- the artificial intron includes one or more stop codons or a series of stop codons (one for each of the three possible reading frames) and an excisable or disruptable intron element.
- this intron element is excised, intron function is disrupted and normal splicing of the intron is prevented.
- the ribosome will then run into the stop codon, thus disrupting the expression of the target gene.
- a method for generating an allele for conditional gene knock-out in a cell comprising a target gene comprising: introducing an artificial intron sequence into an exon of the target gene, the artificial intron sequence comprising:
- the method may comprise introducing into the cell a sequence-specific nuclease that cleaves a sequence within the target gene thereby introducing the artificial intron sequence into the target gene.
- the method may also include the step of introducing or activating a recombinase or nuclease in the cell thereby excising or disrupting the branch point and abrogating splicing of the artificial intron sequence.
- the intron may be disrupted by applying two sequence-specific nucleases (e.g. Cas9 with two guide NAs targeting two sites flanking the branch point) to excise the branch point and thus abrogate splicing.
- the guide RNA recognition sites flanking the branch point may also be identical, thus only requiring Cas9 along with a single guide RNA.
- artificial introns are provided.
- the artificial introns may be as described herein and may be used in any of the methods described herein.
- the sequence of the artificial intron may comprise:
- the sequence of the artificial intron may comprise two stop codons, more preferably three stop codons, which may be positioned in a series of three or more stop codons 5' to the branch point.
- the artificial intron may contain a single branch point and may also lack related sequences that can take over branch point function once the single branch point has been removed.
- the sequence-specific nuclease used to cleave the target gene may be an NA guided nuclease, or a DNA guided nuclease, a Zinc finger nuclease or a TALEN.
- the RNA-guided nuclease may be Cas9 or an RNA-guided Argonaute.
- the sequence-specific nuclease may be introduced as a protein, m RNA, or cDNA.
- the first and second recombinase sites may be loxP sites or FRT sites.
- the first (5') loxP site comprises at least one stop codon.
- the first recombinase or nuclease site is positioned adjacent to the splice donor site.
- the artificial intron sequence may be small, for example, about 200-
- the artificial intron sequence comprises a nucleotide sequence at least 80%, 85%, 90%, 95%,96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO:
- the invention provides cells containing any of the artificial introns as described herein. Also provided are cells containing an artificial intron introduced according to any of the methods described herein.
- the cells comprise an artificial intron in an exon of a target gene, the artificial intron having at least one stop codon positioned 5' of or within the first recombinase or nuclease site, and wherein the single branch site is flanked by a pair of recombinase or nuclease sites, the recombinase or nuclease sites being arranged such that the single branch site may be excised or disrupted, thereby abrogating splicing of the artificial intron.
- a cell comprising an artificial intron in an exon of a target gene, the artificial intron having at least one stop codon positioned 5' to or within a first recombinase or nuclease site, and wherein a single branch site is flanked by a pair of recombinase or nuclease sites, the recombinase or nuclease sites being arranged such that the single branch site may be excised or disrupted, thereby abrogating splicing of the artificial intron, and wherein the artificial intron sequence also comprises a splice donor sequence and a splice acceptor sequence.
- a cell comprising an artificial intron in an exon of a target gene, the artificial intron sequence comprising:
- the cells used in the methods herein, or comprising the artificial intron sequences or nucleic acid constructs described herein may be eukaryotic cells, including mammalian cells such as haploid cells (e.g. the HAP1 cell-line), human cells, human induced pluripotent cells, human induced pluripotent stem cells, embryonic stem cells, or CHO cells or a T-cell (such as those for use in human therapy).
- mammalian cells such as haploid cells (e.g. the HAP1 cell-line), human cells, human induced pluripotent cells, human induced pluripotent stem cells, embryonic stem cells, or CHO cells or a T-cell (such as those for use in human therapy).
- the cells described herein may be used in therapy.
- the cells described herein may be used to generate transgenic animals.
- nucleic acid construct comprising an artificial intron sequence as described herein and which may be used in any of the methods described herein.
- the nucleic acid construct further comprises 5' and 3' homology arms comprising sequences homologous to corresponding target gene sequences.
- the nucleic acid constructs described herein may be part of a plasmid.
- a vector comprising the artificial intron sequence or nucleic acid constructs as described herein.
- Cre/loxP system has been used to create a variety of conditional KO mice. In most of these, a coding exon of the target gene was flanked with loxP sites. In the absence of Cre, gene expression is unaffected. Upon addition or activation of Cre, the intervening exon is deleted or inverted, resulting in a gene KO.
- Cre could be applied by transient transfection or virus infection.
- mice could already carry a Cre transgene, often coupled to a promoter that warrants expression in a specific tissue or at a certain stage of development.
- Mice harbouring Cre are then crossed with strains harbouring alleles flanked with loxP sites. In that way, target gene expression can be disrupted in a tissue or developmental stage-specific manner, depending on which Cre mouse line was used for the intercross.
- non-human transgenic animals whose genome comprises the artificial intron sequence described herein
- mice In mice, this problem is often solved by creating a heterozygous conditional allele (e.g. by flanking an exon with loxP sites) and backcrossing heterozygous mice to homozygosity. This is unfortunately not possible in cell lines as they do not undergo sexual reproduction and hence, the conditional KO allele cannot be separated from the wild-type allele.
- This problem is solved in an elegant way using the approach described herein: By targeting a coding exon with a g NA and introducing an artificial intron on one allele, the second allele is often disrupted as a consequence of Cas9 cleavage and repair by non-homologous end joining.
- the resulting cell line will bear one allele harbouring the artificial intron and a second allele harbouring a frameshift mutation. For many essential genes, such lines will be viable. Hence, this solution alleviates the need for backcrossing and allows conditional KO model generation in one-go.
- a gene Knockout(KO) is only desirable in a certain subtype or cells or at a certain developmental stage. In that case, it is feasible to engineer the total population of cells (e.g. all T cells) with the conditional KO cassette described herein.
- Cre is then applied to a specific subset (e.g. the CD4-positive T cells) to inactivate the target gene selectively in this sub-population of cells.
- Stem cell differentiation is often heterogenous or incomplete.
- One way to improve the quality and outcome of the process would be inactivate certain genes at a specific point during the process and thereby steer cell differentiation in a certain direction.
- a conditional Knockout (KO) allele into a pluripotency factor (e.g. Oct4/POU5Fl; as shown in Example 2) and thereby create an iPSC line in which pluripotency can be terminated at will.
- this line will be subjected to differentiation by applying the appropriate cocktail of factors or by transducing with one or more master regulators as is known in the art.Once cells are starting to differentiate, one activates the conditional Oct4 allele (e.g. by addition of Cre recombinase).
- Cre recombinase e.g. by addition of Cre recombinase
- the cell may be an induced pluripotent stem cell and the target gene is Oct4.
- conditional KO approach can be used to create more homogeneous differentiation protocols.
- Genes that drive and control the maintenance of intermediate states of differentiation e.g. like pancreatic
- the conditional KO cassette can be activated, leading to the death of progenitor cells that contaminate the culture. Reduction of tumorigenesis by suicide switches
- conditional KO cassette described herein can be used to create a proliferation switch by targeting a gene involved in cell cycle (e.g. Cyclin-dependent Kinases). Blocking of proliferation can be carried out upon cell expansion to the required amounts for transplantation, or after stem cell differentiation into the desired cell type, and can be performed both prior and posttransplantation (different strategies can used for Cre delivery or Cre-activation).
- a gene involved in cell cycle e.g. Cyclin-dependent Kinases
- Certain fusion transcripts are commonly produced by cancer cells, and detection of these constitutes the diagnostic of certain cancer types.
- Reference standards are materials that can be used as positive or negative controls during the diagnostic of patient material.
- the conditional KO cassette described herein can be used to create transcript fusions to be used as reference material by allowing the hiding of a selection cassette (e.g. antibiotic resistance such as puromycin). This feature enables the generation and selection of fusion transcripts composed of an endogenous gene and a transgene, without any scar at the RNA level, as the synthetic intron is removed by slicing.
- an element could represent an enhancer that is poorly described or annotated and altering this enhancer could have a profound impact on gene expression.
- intron As the intron is actively removed by splicing, it is not present at the mRNA level and does therefore not impact transcription. Modifying the intron to contain an exogenous sequence (e.g. a molecular barcode) could represent an elegant way of "hiding" this sequence in the genome without leaving a trace in the transcriptome.
- an exogenous sequence e.g. a molecular barcode
- FIG. 1 NanoLuc expression from constructs containing synthetic intron configurations HAPl cells were transfected with a pcDNA3.1 plasmid expressing NanoLuc under the control of a CMV promoter. Cells were lysed 24h post transfection and analysed using the Nano-Glo Luciferase Assay System (Promega) according to manufacturer's instructions. Median luminescence across 24 replicates is shown for each condition. As a control, expression levels from a plasmid containing the NanoLuc gene or the NanoLuc with the chimeric intron with no loxP sites were measured.. For sequence details of the different artificial introns see below SEQ I D NO:2 to SEQ ID NO: ll.
- Figure 2 NanoLuc expression from constructs with chimeric intron at a different site
- HAPl cells were transfected with a pcDNA3.1 plasmid expressing NanoLuc under the control of a CMV promoter. Cells were lysed 24h post transfection and analysed using the Nano-Glo Luciferase Assay System (Promega) according to manufacturer's instructions. Median
- Luminescence across 24 replicates is shown for each condition.
- expression levels from a plasmid containing the NanoLuc gene or the NanoLuc with the artificial intron were measured.
- FIG. 3 Genotyping CDK4-intron and METTL16-intron cell lines after Cre recombination Cells expressing CDK4-intron or M ETTL16-intron were infected with a recombinant retrovirus encoding Cre-ERT2. Upon enrichment of transduced cells, Cre was activated by adding ⁇ 4- hydroxitamoxifen (4-OHT) for three days. Genomic DNA from CDK4-intron and M ETTL16- intron cell lines treated with Cre recombinase and/or 4-OHT was used for a PCR amplifying the region around the insertion site of the artificial intron. Genomic DNA from wild type HAPl cells was used as a control. The expected sized of the PCR products are as follows. For CDK4,
- HAPl WT 210bp
- CDK4-intron before recombination: 411bp
- CDK4-intron after recombination: 264bp.
- M ETTL16 HAPl WT: 250bp
- M ETTL16-intron before recombination: 451bp
- M ETTL16-intron after recombination: 304bp.
- Figure 4 Cell viability of CDK4-intron and METTL16 intron after Cre recombination
- Figure 5 Approach for conditional gene inactivation using artificial introns.
- an artificial intron is introduced into a coding exon of a gene.
- the branch point in yellow
- the branch point in yellow
- the intron gets removed by splicing and leaves the m NA of the target gene intact.
- Cre Upon recombination with Cre, the branch point is excised and the intron is inactivated.
- FIG. 6 Identification of the artificial intron configuration that enables conditional gene inactivation.
- HAPl cells were transfected with NanoLuc without intron ("no intron") or one harbouring the intron without further modification (“intron”).
- intron an intron
- cassette variants Var 1-5) were tested in which loxP sites positioning relative to the branch point varied. Note that we included two designs for each cassette, one in which the cassette was intact (“no Cre") and one in which the sequence between the loxP sites had been removed (“Cre”). NanoLuc levels were measured 24h post transfection using the Nano-Glo Dual assay. This data is also shown in Figure 1.
- Variant 4 identified in Figure 6A was placed into various sequence contexts within the NanoLuc gene, reflecting one of four possible insertion sites: CAG-G, AAG-G, CAG-A and AAG-A. For each of these, two sites were evaluated. H EK293 cells were transiently transfected with these various constructs and NanoLuc luciferase activity was measured using the NanoGlo Dual assay 48h hours after transfection.
- Figure 7 Cre-recombination leads to branch point excision.
- A Genomic DNA was isolated from cells transfected with Cre (for CD46) or cells transduced with Cre-ERT2 and treated with 4-hydroxitamoxifen (4-OHT; for M ETTL16). Samples were genotyped using PCR primers specific for CD46 or M ETTL16. The right hand side of Figure 7A, regarding M ETTL16, is also shown in the right hand side of Figure 3.
- B m NA was isolated from cells expressing Cre, reverse transcribed using oligo (dT) and analysed with primers specific for the CD46 cDNA.
- FIG. 8 Artificial intron cassette activation leads to conditional gene inactivation.
- a and B HAP1 cells bearing the artificial intron in CD46 were transfected with Cre recombinase. For Western blotting, cells were analysed using a CD46-specific antibody. For FACS, cells were stained with a CD46-specific antibody and analysed by flow cytometry.
- C M ETTL16-DECAI cells were transduced with Cre-ERT2, treated with 4-OHT as indicated and stained with Crystal Violet. The data in Figure 8C is also shown in Figure 4, right hand side.
- FIG. 9 Generation of Oct4-conditional knockout human iPS cells.
- A Wild-type cells (+/+) or cells bearing the artificial intron in exon 1 of Oct4 (Oct4-DECAI #1 and Oct4-DECAI #2, created with two independent gRNAs) were transfected with Cre-ERT2 recombinase and 4-OHT or left untreated. Cells were fixed in 4% PFA and stained for Oct4 and SSEA4 using the specific antibodies. DAPI was used for nuclear staining. Images were acquired with an Olympus 1X83 microscope (10X magnification, ⁇ scale bars). For bright field images see Figure 14.
- (B) Flow cytometry analysis of the cells before (No Cre) and after Cre treatment (+Cre) stained with Oct4 and SSEA4 specific antibodies.
- FIG. 1 Homology donor templates for cell line generation. Homology donor templates used to engineer the CD46-DECAI, M ETTL16-DECAI and OCT4-DECAI cell lines are shown. The donors contain ⁇ 400bp right and left homology arms flanking the 201bp artificial intron cassette (underlined).
- FIG. 11 Analysis of CD46 expression in CD46-DECAI and WT Hapl cells.
- HAP1 wild-type and CD46-DECAI cells were transfected with a plasmid expressing Cre recombinase. Cells were then stained with a CD46-specific antibody and analysed by flow cytometry.
- FIG. 12 Isolation of Cre-ERT2 expressing METTL16-DECAI clones. Crystal violet staining of single cell clones isolated from two M ETTL16-DECAI cell lines (Clone B5 and Clone G4). 12 clones of each cell line were isolated following transduction with Cre-ERT2 and these potentially Cre-ERT2 expressing clones were treated with 4-OHT to trigger cassette activation. Clones in which 4-OHT triggered cassette activation, and hence, cell death, are marked with a box.
- Cre-recombination leads to a change in morphology of Oct4-DECAI cells.
- a clear change in cell morphology is observed only in the Oct4-DECAI#l and Oct4-DECAI#2 cells.
- Cre-recombination leads to branch point excision in Oct4-DECAI cells.
- Genomic DNA was isolated from WT, Oct4-DECAI#l and Oct4-DECAI#2 cells that had been transfected with Cre-ERT2 and treated with 4-OHT, and from cells not transfected with Cre-ERT2. Samples were genotyped using PCR primers specific to either the cassette insertion site of Oct4-DECAI#l or Oct4-DECAI#2.
- Introns are non-coding or intervening polynucleotide sequence of varying length, normally present within most eukaryotic genes and which are removed from a newly transcribed m RNA precursor by the process of splicing.
- the process of splicing requires that the 5' and 3' ends of the intron be correctly cleaved and the resulting ends of the m RNA be accurately joined, such that a mature m RNA having the proper reading frame for protein synthesis is produced.
- Many splice donor and splice acceptors sites meaning the sequences immediately surrounding the exon-intron- and intron-exon-boundaries, have been characterized and described and are known to the skilled artisan.
- the sequence of mammalian introns begins with GT and ends with AG with a few minor introns starting with AT and ending with AC.
- the artificial introns described herein comprise all the intron elements necessary for normal intron function. However, the artificial introns described herein have been modified by the inclusion of recombinase or nuclease sites so that intron function may be disrupted and gene function knocked-out.
- the artificial intron described herein may be based on a natural intronic sequence with the intron elements that are necessary for normal intron function, including (i) a 5' splice donor, branch point and a 3' splice acceptor (ii) at least one of these intron elements, typically the branch point, is flanked by a pair of recombinase or nuclease sites; (iii) the 5'end of the cassette comprises at least one stop codon (preferably two stop codons, more preferable a series of three stop codons; one for each reading frame) which will be in frame with the target gene.
- treatment with the recombinase or the nuclease will excise or invert the intron element, e.g. branch site, between the recombinase or nuclease sites, abrogate splicing of the intron causing the ribosome to run into a stop codon and terminate translation.
- the intron element e.g. branch site
- An artificial intron sequence derived from natural mammalian intron sequences may be used, for example based on a well-characterized chimeric intron found in the pCI-neo mammalian expression vector from Promega (Cat. No. E1841). This artificial intron is created by combining the 5' splice donor site from an intron of the human ⁇ -globin gene with the branch and
- An artificial intron useful in this invention may be an intron which leads to comparable expression of the target gene of interest when placed in an exon of the target gene. Ideally, following disruption of the inserted intron, by recombinase addition or the like, gene expression should be significantly or even completely abrogated.
- the artificial intron can be a) derived from a natural intron but may be modified by nucleotide substitutions, deletions and/or insertions, b) a chimeric intron composed of different intron sequences derived from one or more natural intron sequences of the gene of interest and/or of different genes, c) a de novo designed synthetic intron or d) any combination of the above.
- Artificial intron sequences as described herein contain at least three conserved intron elements which are found in natural introns and are essential for splicing: a 5 ' splice site (also known as splice donor site), a branch point (or branch site) and a 3 ' splice site (also known as a splice acceptor) adjacent to a run of pyrimidines called a polypyrimidine tract. Recognition of these sites by the splicing machinery is followed by the excision of an intron. Splice donor site, splice acceptor site and branch point sequences are well known in the art and any may be utilized in the present invention.
- splice donor and splice acceptor sites include the consensus splice donor and splice acceptor sites mentioned below and the sites shown in SEQ. ID NO. 1.
- Efficient splice donor and acceptor sites suitable for this invention can be readily determined using techniques for measuring the efficiency of splicing. Intron splicing efficiency is readily determined by quantifying the spliced transcripts versus the full-length, unspliced transcripts that contain the intron(s), using methods known in the art.
- the consensus sequences include sequences in the exon as well as the intron.
- the splice sites as described herein include naturally occurring, engineered or synthetic, consensus or cryptic splice sites.
- the branch point or branch site is usually located approximately between 10 and 60 nucleotides upstream of the splice acceptor and forms, during the splicing process via its conserved adenosine residue, a lariat structure with the splice donor.
- one preferred branch point sequence is TACTAAC (Zhuang et al, PNAS 86, 2752-2756, 1989).
- the branch point may be adjacent to or within the polypyrimidine tract of the splice acceptor site.
- the artificial intron used in the constructs and methods of the invention has one or more stop codons in all possible 3 reading frames and/or has a nucleotide sequence length which is not dividable by 3 to prevent a complete read through of the intron sequence in case of a non-splicing event.
- the artificial intron sequence may include stop codons present in the original intron sequence but may also include additional stop codon(s) positioned between the splice donor site and a first recombinase site.
- the 5'end of the artificial intron preferably contains at least one stop codon, or two or three stop codons, that will be in frame with the target gene following successful recombination (and intron destruction).
- treatment with the recombinase or the nuclease will destroy the intron and thus the ribosome will run into a stop codon and terminate translation.
- Cre and Flp are the recombinases that are most commonly used in molecular biology. Cre can recognize LoxP sites (or variants thereof); Flp recognizes F T sites (or variants thereof). If a given site of interest is flanked by two parallel recombinase sites, then the addition of the cognate recombinase will trigger the excision of the intervening sequence with high efficiency and precision.
- the key challenge was to add the recombinase sites (e.g. loxP sites as shown in the Exam ples herein) in a way that did not interfere with splicing.
- the recombinase sites or nuclease sites should be positioned to flank an intron element, e.g. branch site, that is of critical importance to maintain splicing, such that when this intron element is excised or disrupted, by recombinase or nuclease addition, intron function is disrupted.
- branch point sequence which was the prime candidate sequence for excision or disruption, is not very well defined and the sequence requirements are quite degenerate.
- the first loxP site of the pair flanking the branch site which, typically, remains in the target gene following recombinase treatment, may be placed close to the 5' splice donor site, for example.
- the first lox P site may comprise one or more stop codons. Variants of loxP sites are known to the person skilled in the art, for example, as shown here: https://en.wikipedia.org/wiki/Cre-
- loxP site variants that have been described contain one or more stop codons.
- the 5' loxP site may be placed in close proximity to the splice donor.
- the 3' loxP site may also be placed in close proximity to the splice acceptor whilst retaining a functional polypyrimidine tract. Functionality can be assessed by common methods to measure splicing activity.
- the branch site should be contained between the two loxP sites.
- the present invention provides an artificial intron that may, advantageously, be small in size, for example about 200-250, 200-300, 200-400, 200-500, 200-600, 200-700 or 200-800 or 200- 1000 nucleotides in length. Insertion of this cassette into the coding exon of a gene does not significantly disrupt gene function as the cassette is based on natural intron sequences and is thus removed by splicing.
- the consensus sequences include sequences in the exon as well as the intron.
- the sequence necessary in the exon is MAG-G (where the hyphen denotes the intron).
- Gene tagging can be conducted based on two concurring DNA repair mechanisms: Nonhomologous end joining or homology-direct repair.
- homology-directed repair a donor nucleic acid construct is used that includes, in addition to the artificial intron sequence, homology arms of 100bp-5kb length. Importantly, both the 5' and the 3'end of the artificial intron needs to be flanked with homology arms. If such a nucleic acid construct is applied together with a nuclease, such as Cas9, then the artificial intron can be integrated into the exon of a target gene at reasonable efficiencies.
- a nuclease such as Cas9
- the resulting cell line will only represent a conditional knockout if all alleles in that cell are tagged. This is usually difficult as the efficiency of bi-allelic gene tagging is very low. This represents a limitation for the approach presented here, but even more so for competing approaches such as the degron tagging approach. Consequently, competing approaches have not been applied at scale.
- the method presented here offers an attractive compromise to solve this problem, i.e. by targeting one allele with the conditional KO cassette and destroying the second allele with C ISP /Cas cleavage, followed by the introduction of a frameshift mutation through erroneous N H EJ. As a consequence, one would end up with one allele containing a frameshift mutation and the other one containing a conditional allele that could be inactivated upon recombinase addition.
- hypomorphic alleles Such alleles would be usually referred to as “hypomorphic alleles" because they only retain gene function partially. Large scale mutagenesis in yeast has suggested that many essential genes tolerate such hypomorphic alleles.
- Recombinases can be applied in many ways that are known to the skilled person. For instance, one could transiently transfect cells using lipofection or electroporation with an expression plasmid encoding the recombinase. Second, one can infect cells with a recombinant virus (retrovirus, lentivirus, adenovirus are most commonly used) encoding the recombinase. Third, one could create a cell line that stably and inducibly expresses the recombinase. Fourth, one could envisage applying the purified recombinase protein, either by itself (following electroporation) or as fusion with a cell-penetrating peptide (e.g. the arginine-rich motif of the H IV-1 TAT).
- a recombinant virus retrovirus, lentivirus, adenovirus are most commonly used
- guide RNA or "gRNA”
- gRNA guide RNA
- gRNA synthetic gRNA, gRNA obtained by in vitro transcription or gRNA expressed in cells from plasmid or PC product.
- T is used in place of "U” in guide RNA sequences, as would be readily apparent to the skilled person.
- the sequence of the intron cassette used in the examples herein is based on a well- characterized chimeric intron found in the pCI-neo mammalian expression vector from Promega (Cat. No. E1841). This chimeric intron is created by combining the 5' splice donor site from an intron of the human ⁇ -globin gene with the branch and 3 ' splice acceptor site of the intron of an immunoglobulin gene heavy chain variable region (SEQ I D NO:l).
- the chimeric intron is shown with important splicing motifs highlighted. 5' and 3' splice sites are shown in bold, the branch site is double underlined and the polypyrimidine tract is underlined.
- LoxP sites are 34 base pair sequences that are recognized by Cre recombinase to trigger a specific recombination event depending on their location and orientation. If the loxP sites are in parallel orientation, the sequence between the two sites is excised. To enable deletion of the branch site, two loxP sites were added to the cassette in parallel orientation flanking the branch site. Cre recombination will cause excision of the branch site and thus abrogate removal of the intron by the splicing machinery. As a consequence, the ribosome will read into the remaining cassette and encounter a premature stop codon that disrupts the gene of interest.
- NanoLuc sequence is shown below.
- the synthetic intron was inserted in the underlined site between the CAG and G.
- the intron was inserted at the double underlines site between the AAG and G.
- construct number 4 (SEQ ID NO:8 and SEQ ID NO:9) of the synthetic intron is the configuration that had no effect on NanoLuc expression with the cassette, but caused a profound ⁇ 30 fold reduction with the post-Cre configuration (compare NanoLuc + intron loxP v4 and NanoLuc + intron loxP post-Cre v4 conditions in Figure 1). Given these results, we decided to proceed with the version 4 cassette (SEQ ID NO:8) for all further experiments.
- NanoLuc expression plasmids with the version 4 cassette inserted at the new site were electroporated in HAPl cells and NanoLuc expression was assayed after 24 hours ( Figure 2).
- HAPl cells were transfected with Cas9, a g NA targeting the locus where the synthetic intron would be inserted and a homology donor (Table 1, SEQ ID NO: 13 and SEQ I D NO:14).
- the site where the artificial intron would be inserted was selected following the exonic sequence requirements for splicing (SEQ I D NO:19 and SEQ I D NO:20).
- the homology donor comprised the synthetic intron and 400bp homology arms upstream and downstream the insertion site (SEQ I D NO:21, SEQ I D NO:22).
- Single cell clones containing the synthetic intron were isolated, and genotyped by PCR and Sanger sequencing (primers: Table 1, SEQ I D NO:15 to SEQ I D NO:18 and Sanger sequencing: SEQ ID NO:23, SEQ I D NO:24).
- Genomic region surrounding the artificial intron insertion site is shown below. Exonic sequence is shown in lower case letters and intronic sequence in uppercase letter. The exonic splicing sequence motif is shown in bold. The guide RNA used during the cell line engineering is underlined.
- CDK4 (SEQ I D NO: 19)
- the sequence used as a donor template with ⁇ 400bp homology arms flanking the artificial intron is shown below.
- LoxP sites are in lower case letters and the stop codons in all three reading frames are shown in italics.
- the single nucleotide that was mutated to create a stop codon is in brackets.
- CDK4 (SEQ ID NO:21)
- the M ETLL16 and CDK4 clones containing the artificial intron were transduced with a retrovirus expressing Cre-ERT2 to induce recombination.
- Cre-ERT2 encodes a Cre recombinase fused to a mutant estrogen ligand-binding domain (ERT2) that requires tamoxifen or 4-Hydroxytamoxifen (4-OHT) for activity 14 .
- ERT2 estrogen ligand-binding domain
- 4-OHT 4-Hydroxytamoxifen
- genomic DNA was isolated to verify presence or absence of the artificial intron by amplifying the insertion site by PCR ( Figure 3).
- Cre recombination and addition of 4-OHT triggered the deletion of the sequence between the two loxP sites (the branch site) in both M ETTL16-intron and CDK4-intron cells.
- Genomic DNA from CDK4-intron and M ETTL16-intron cell lines treated with Cre and/or 4-OHT was used for a PCR amplifying the region around the insertion site of the artificial intron.
- the expected sized of the PCR products are as follows. For CDK4-intron (before recombination):
- M ETLL16 is a highly essential gene in HAP1 cells and absence of this protein can be easily assessed by measuring cell viability.
- equal numbers of cells for METTL16-intron and CDK4-intron were plated in six-well dishes following Cre-E T2 transduction and either treated or not treated with 4- OHT for a total of 6 days. Cells were then stained with crystal violet to assess total cell number as an approximation of cell viability (Figure 4).
- the cassette we developed can be inserted into a reporter gene (NanoLuc) and insertion does not interfere with NanoLuc expression.
- the cassette contains parallel loxP sites and removal of the sequences between the loxP sites triggers a ⁇ 30fold decrease in NanoLuc expression.
- Cre recombinase e.g. by addition of a recombinant adenovirus bearing Cre recombinase
- Cre recombinase leads to the deletion of critical elements within the intron that are required to maintain splicing, most importantly: the branch site.
- the cassette will no longer be removed and the ribosome will run into a stop codon, thus disrupting the expression of the gene of interest upon addition of Cre recombinase.
- the resulting cell line contains a frameshift and thus represents a complete gene knockout.
- EXAM PLE 2 EXAM PLE 2:
- gRNA sequences The gRNAs in Table 3 were used to insert the cassette. Table 3: gRNAs used to insert the cassette
- NanoLuc assay HAP1 or HEK293 cells were transfected with a pcDNA3.1 plasmid encoding NanoLuc or NanoLuc containing artificial intron variants (see above) and pGL4.53[luc2/PGK] vector (Promega) for normalization. Cells were harvested 24h-48h post transfection and analysed using the Nano-Glo Dual Luciferase Assay System (Promega) according to manufacturer's instructions. Generation of cell lines
- HAP1 cells were transfected with TurboFectin transfection reagent (OriGene).
- TurboFectin transfection reagent OriGene
- 0.8x10 s cells were seeded in a 6-well plate and transfected on the following day with l ⁇ g of Cas9 expression plasmid (#48137 from Addgene) , ⁇ g of U6 driven gRNA expression plasmid (Horizon Discovery), O ⁇ g of PCR product encoding the donor and O ⁇ g of plasmid encoding a blasticidin resistance gene.
- 24h post transfection cells were treated with 20 ⁇ g/ml of blasticidin for 24h to eliminate untransfected cells. Single clones were obtained by limiting dilution. Positive clones were then identified by PCR screening.
- iPS cells cultured in TeSR-E8 and Vitronectin XF were transfected with the P3 Primary Cell solution using the CM-138 pulse in the 4D-Nucleofector (Lonza).
- 1.0x10 s cells were transfected with 2 ⁇ g of Cas9 expression plasmid, 2 ⁇ g of gRNA expression plasmid, 2 ⁇ g of plasmid encoding the donor and O ⁇ g of plasmid encoding a blasticidin resistance gene.
- Cells were plated in culture media supplemented with ⁇ ROCKi (Y-27632, Abeam) for 24h, and then treated with 10 g/ml of blasticidin for 24h to eliminate untransfected cells. Recovered colonies were single cell plated after treatment with StemPro Accutase (Thermo Fisher Scientific) and clones were manually picked into 96wells for screening.
- Genomic DNA was isolated using the QIAamp DNA Mini Kit (Qjagen) according to manufacturer's instructions. PCRs where done with the primers below using the GoTaq DNA Polymerase (Promega) following manufacturer's instructions. The primers in Table 4 were used for PCR.
- TaqMan gene expression assays (Thermo Fisher Scientific) were used for each gene analysed. qRT-PCR experiments were performed using QuantStudio 6 Flex Real Time PCR System (Thermo Fisher Scientific). All values were normalized to an internal control (GAPDH) and brought to power of 2. Each target's value was then compared to the highest value of that target across all samples. All samples were done in technical triplicates and the median of these is displayed. The following TaqMan gene expression assays were used: Gapdh
- Hs03929097_gl Oct4 (Hs00999632_gl), Nanog (Hs04399610_gl), Sox2 (Hs01053049_sl), Cdx2 (Hs01078080_ml), Eomes (Hs00172872_ml) (all from Thermo Fisher Scientific).
- Approximately lxlO 7 HEK392T cells were transfected with 1.7 ⁇ g pAdvantage plasmid (Promega), 2.6 ⁇ g VSV-G expression vector, 4 ⁇ g gag-pol expression vector and 6.6 ⁇ g CreERT2 expression vector, using TurboFectin transfection reagent (OriGene). Medium was changed 24 hours after transfection. Viral supernatant was collected 48 and 72 hours after transfection, and used to transduce the M ETTL16-DECAI cells. Since the Cre-ERT2 retrovirus also encoded a PGK-PuroR cassette, cells containing Cre could be enriched for by selecting with 0.3 ⁇ g/ml puromycin. FACS, Western Blotting and Immunocytochemistry
- HAP1 experiments cells were trypsinized and washed with PBS. Cells were stained with a CD46-specific antibody (APC-CD46; catalogue #564253; BD Biosciences) in FACS buffer (5% FCS in PBS) for 30 minutes and excess antibody was removed by washing with PBS. Cells were analysed by flow cytometry using the BDLSR Fortessa. For the human iPS cell experiments, cells were fixed and permeabilized using the
- Cytofix/Cytoperm kit (BD) according to manufacturer's instructions. Cells were then stained using the following directly conjugated antibodies: Alexa Fluor ® 488 anti-Oct4 (653706, BioLegend) and PE/Cy7 anti-human SSEA-4 (330420, BioLegend). Cells were analysed using the iQue Screener PLUS (IntelliCyt). Data was analysed using the FlowJo software.
- lysates were prepared using Frackelton buffer (lOm M Tris/HCI pH7.5, 50mM NaCI, 30mM sodium pyrophosphate, 1% Triton X-100, 50mM NaF and protease inhibitors). Lysates were separated on 10% SDS-PAGE and blotted on Nitrocellulose membranes. Membranes were stained with a primary antibody (anti-CD46, abl08307, Abeam) and a secondary antibody (anti-rabbit, 111-035-003, Jackson Immunoresearch Europe) and visualized using ECL reagent (Thermo Fisher Scientific).
- the PSC 4-Marker Immunocytochemistry Kit (Thermo Fisher Scientific, A24881) was used. Briefly, iPS cells were fixed using 4% PFA for 15min,
- variant 1 and 2 were well tolerated, yet we did not observe any reduction in gene expression upon Cre recombination, suggesting that alternative branch points had taken over.
- variant 4 Figure 5C was most fit for purpose in as much as it was well tolerated in the "off-state” and triggered a profound decrease ( ⁇ 14 fold) in NanoLuc expression upon cassette activation.
- Introns of the GT-AG category such as the one we chose, have a preference for CAG or AAG upstream of the splice donor and G or A
- the frequency at which the cassette was incorporated was higher in the essential gene (M ETTL16) than in the non-essential gene (CD46). While this may well be a coincidence, it is plausible to assume higher targeting frequencies for essential genes as other editing events (e.g. the accumulation of indels) may be subject to negative selection.
- Human iPSCs are pluripotent and their pluripotency depends on the expression of the transcription factor Oct4/POU5Fl 18 .
- Oct4 Inactivation of Oct4 triggers the collapse of the core transcriptional network leading to rapid loss of pluripotency. This is associated with drastic changes in colony morphology and loss of other pluripotency markers, such as Nanog, Sox2 and SSEA4.
- We created homology donors for Oct4/POU5Fl 400bp homology arms flanking the DECAI cassette) and targeted two independent sites in exon 1 (Figure 10). Clonal cell lines were established and genotyped by PCR. Both homozygous and heterozygous cell lines were achieved (Table 5).
- Oct4- DECAI cells lost Oct4 expression, and consequently, SSEA4 (Figure 9A-B), Nanog and Sox2 expression (Figure 9C). They also changed their morphology quite dramatically ( Figure 13), indicating that these cells had lost their pluripotency.
- the induced knockout of Oct4 resulted in differentiation of human iPSCs with a significant increase in transcription of Cdx2 and Eomes ( Figure 9C), genes associated with trophoblast and endoderm lineages 1 ,20 .
- Cre/loxP recombination at the genomic DNA level ( Figure 14). Altogether, this highlights the feasibility of our approach for conditional gene inactivation and suggests its applicability in more physiological settings. Discussion
- the artificial intron approach presented here also has some conceivable shortcomings.
- insertion of the artificial intron may dysregulate endogenous splicing events, leading to constitutive gene inactivation.
- CDK4 gene which is nonessential in HAP1 cells
- CDK4 expression data not shown.
- a more detailed analysis of these cells revealed that the intron was successfully removed by splicing, but some erroneous splicing events led to the assembly of a non-functional mRNA that was most likely degraded by non-sense mediated decay.
- Cre-ERT2 by retroviral transduction for M ETTL16-DECAI and OCT4-DECAI activation. While this is possible, we noticed some shortcomings, mainly: (i) Cre-ERT2 expression can be leaky and already occur in the absence of 4-OHT (data not shown) and (ii) Cre-ERT2 expression can be silenced, especially when Cre-ERT2 is delivered via retroviral infection. A more elegant approach would be to express Cre-ERT2 from a safe-harbour locus such as AAVS1 or ROSA26.
- CRISPRi CRISPR interference
- a catalytically dead Cas9 directed by a guide RNA will suppress expression of the target gene by sterically hindering transcription initiation/ elongation or by the action of a silencing effector domain fused to Cas9 22,23 . While this strategy has the advantage of being reversible, it is difficult to reach complete inhibition of the target gene expression.
- inducible CRISPR systems There are several iterations of these systems ranging from split Cas9 molecules that form a catalytically active molecule upon a certain stimulus 24 or control of Cas9 or gRNA expression by inducible promoters 25 . The efficiency of these methods will depend on the expression level of the gRNA and will likely show a variety of effects across cells in the sample since the editing induced by Cas9 will not happen in a predictable manner. This contrasts with the DECAI method described herein which will lead to a predictable modification and disruption of expression upon Cre recombination.
- Degrons are drug-regulatable domains that allow one to modulate the abundance of the degron tagged protein by adding or withdrawing a small molecule.
- Popular degrons include the auxin/ mAI D system 9 ,the HaloPROTAC3/ HaloTag system 10 and the HCV protease inhibitor/ SMAShTag system 11 .
- At least some of the degrons that are commonly used are much faster in degrading the client protein (with a half-life of minutes 9 rather than hours or days) as they lead to the recruitment of E3 ligases that actively degrade the protein population that is present.
- the degron tags have the advantage of allowing reversible and tuneable degradation of the target proteins.
- the use of degrons has two major limitations: (i) the bi-allelic tagging of genes occurs at very low frequency and thus, it is difficult to obtain a cell line or an organism in which all copies of a gene have been conditionally inactivated and (ii) the insertion of foreign sequences within the coding region of a gene could disturb protein folding or protein function.
- degron tagging can be cumbersome and may not be compatible with high-throughput applications.
- the approach described herein is more suitable for serial production of conditional knockouts, whereas degron tagging may be the method of choice if the protein is well understood and inefficient clone recovery is less of a constraint.
- the DECAI method described herein offers a convenient and efficient alternative for generating conditional knockout alleles in mammalian cells.
- HaloPROTACS Use of Small Molecule PROTACs to Induce Degradation of HaloTag Fusion Proteins. ACS Chem Biol 10, 1831-1837, doi:10.1021/acschembio.5b00442 (2015).
- Burckstummer, T. et al. A reversible gene trap collection empowers haploid genetics in human cells. Nat. Methods 10, 965-971 (2013).
- Gilbert, L. A. et al. CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442-51 (2013).
Landscapes
- Genetics & Genomics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Mycology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
This invention relates to a system and method for generating conditional gene knock-out cells. The system includes the use of an artificial intron to modify a target allele to permit conditional knock-out of the allele by disruption of the artificial intron.In particular, the invention relates to a method for generating an allele for conditional gene knock-out in a cell comprising a target gene, the method comprising: introducing an artificial intron sequence into an exon of the target gene. The invention also relates to cells containing an artificial intron introduced according to the method. The approach has broad utility for conditional gene activation and could be used to study the loss-of-function phenotypes of essential genes.
Description
METHODS FOR CONDITIONAL GENE KNOCK-OUT
Field of the Invention
This invention relates to a system and method for generating conditional gene knock-out cells. The system includes the use of an artificial intron to modify a target allele to permit conditional knock-out of the allele by disruption of the artificial intron.
Background of the Invention
Genome editing has been revolutionized by the discovery of the C ISP /Cas technology that enables the targeted introduction of double strand breaks into the human genome at unprecedented precision and efficiency1"4. Once the double-strand break has been
established, the cell will repair the break by one of two common pathways: in the presence of a homology template, the cell will use homology-directed repair or homologous
recombination5. In the absence of homologous sequences, the cell will employ non- homologous end joining to repair the break6.
As non-homologous end joining is by nature quite imprecise, it triggers the accumulation of small insertions or deletions (so called indels)6. If such indels lie within the coding sequence of a gene, they often disrupt the reading frame and thus abrogate the translation (or
transcription; as a consequence of nonsense-mediated decay) of the corresponding gene. This has been exploited to create cell lines or whole organisms (e.g. in mice) bearing so called gene knockouts.
Gene knockouts are strains containing a frameshift mutation or, ultimately, a stop codon in coding exons and thus represent complete loss-of-function mutants. They are fundamental for our understanding of gene function as they enable the study of the corresponding loss-of- function phenotype.
Unfortunately, not all human genes can be studied by the means of establishing gene knockouts. Certain human genes cannot be inactivated because inactivation leads to the death of the cell or the whole organism. These genes are called essential genes as they are essential for life7. By definition, essential genes are key regulators and it turns out that many of them represent important drug targets as well. As a consequence, there is a large unmet need for the study of essential genes.
Historically, essential genes have been explored using conditional knockouts. These were established by assembling a large homology donor containing both a selection cassette to enrich for the targeting event and loxP sites that enable the Cre-mediated excision of an exon that lies within the gene of interest8. The homology donor was then used to establish genetically engineered cells by spontaneous homologous recombination (e.g. in mouse embryonic stem cells) or, more recently, by C ISP -assisted homologous recombination. Unfortunately, none of these approaches are easily scalable because donor assembly is cumbersome and the overall efficiency of the process is low and hence requires the screening of hundreds if not thousands of clones by Southern blotting or PCR.
With the advent of CRISPR, it became more feasible to tag endogenous genes with drug- regulatable domains that are called degrons. If such a domain is fused to a gene of interest, the resulting gene product can be regulated in its abundance by adding or withdrawing a small molecule. Popular degrons include the auxin/ mAID system9, the HaloPROTAC3/ HaloTag system10 and the HCV protease inhibitor/ SMAShTag system11. However, the use of degrons has two major limitations: (i) The bi-allelic tagging of genes occurs at very low frequency and thus, it is difficult to obtain a cell line or an organism in which all copies of a gene have been conditionally inactivated and (ii) The insertion of foreign sequences within the coding region of a gene often disturbs protein folding or protein function. As a consequence, degron tagging may be less predictable than desired.
Very recently, an approach referred to as CRISPR-FLIP was introduced (see
http://biorxiv.Qrg/content/early/2016/06/01/05G549 for details). It is based on an invertible intronic cassette (FLIP), similar to COIN12. In the non-mutagenic orientation, the FLIP cassette expresses the puromycin resistance gene (puroR) to select for correct nuclease-assisted targeting into the exon of one allele. Upon exposure to Cre recombinase the FLIP cassette is inverted to a mutagenic configuration that activates a cryptic splice acceptor and
polyadenylation signal (pA) and disrupts the initial splice acceptor resulting in the loss of gene function. While this system is attractive, it has several shortcomings: (i) The cassette that is used as a donor is large (>2kb; the exact size not evident from the publication) and hence, the assembly of targeting constructs with homology arms is costly, cumbersome and time- consuming; (ii) The cassette contains an antibiotic resistance gene which is not always desired and (iii) Expression of the gene of interest is disrupted by a splice acceptor whose efficiency
can vary depending on genomic context. Accordingly, there is a need for an alternative conditional gene knock out method.
The present invention is intended to address the disadvantages of the prior art. In particular, the inventors have determined a method which is intended to significantly reduce the size of the artificial intron that is used and ensure that gene expression can be rapidly and efficiently abrogated in a wide range of mammalian genes.
Described herein is a system that overcomes the above mentioned limitations of the prior art and provides an artificial intron that adheres to the following principles: (i) The artificial intron size may be small so that homology donors can be easily assembled by gene synthesis; (ii) the approach does not require the use of antibiotic selection; (iii) the cassette disrupts gene expression completely, i.e. reveals a stop codon upon cassette activation; (iv) the cassette has a good "window of opportunity", so that protein levels are strongly decreased as a consequence of cassette activation; and (v) the approach is scalable and success is predictable for most human genes without too much prior insight into protein function or folding.
Described herein is an artificial intron cassette that may contain as few as about 200 nucleotides. Insertion of this cassette into the coding exon of a gene does not disrupt gene function as the cassette is based on a natural intron sequence and the intron is thus removed by splicing. Addition of Cre recombinase (e.g. by addition of a recombinant adenovirus bearing Cre recombinase) leads to the deletion of critical elements within the intron that are required to maintain splicing, most importantly: the branch site. As a consequence, the cassette will no longer be removed and the ribosome will run into a stop codon, thus disrupting the expression of the gene of interest upon addition of Cre recombinase. The resulting cell line contains a frameshift and thus represents a complete gene knockout.
Described herein is a system that allows the generation of conditional gene knockouts on a large scale with the following features: (i) engineering is straightforward in diploid or even polyploid cells; (ii) the approach does not require any prior insight into the structure/ function of the targeted gene product; (iii) in the "off-state" of the cassette, the activity of the target gene product is largely unaffected; (iv) the cassette activation (transition to "on-state") triggers a profound decrease in gene expression. Based on these requirements, we designed a genetic approach named DECAI (Degradation via Cre-regulated Artificial Intron), where
conditional gene knockouts can be generated by inserting a small intron cassette into the coding sequence of an exon.
Summary of the Invention
The present inventors have now developed a new conditional gene knock-out system that overcomes the above mentioned limitations. The system uses an artificial intron sequence which, when introduced into the coding exon of a target gene, permits normal gene function because the artificial intron is based on a natural intron sequence and is thus removed by splicing. In order to permit disruption of the target gene, the artificial intron includes one or more stop codons or a series of stop codons (one for each of the three possible reading frames) and an excisable or disruptable intron element. When this intron element is excised, intron function is disrupted and normal splicing of the intron is prevented. The ribosome will then run into the stop codon, thus disrupting the expression of the target gene. Provided herein is a method for generating an allele for conditional gene knock-out in a cell comprising a target gene, the method comprising: introducing an artificial intron sequence into an exon of the target gene, the artificial intron sequence comprising:
a) a splice donor sequence;
b) a first nuclease or recombinase site;
c) a branch point sequence;
d) a second nuclease or recombinase site;
e) a splice acceptor sequence; and
f) a stop codon positioned 5' to or within the first recombinase or nuclease site. The method may comprise introducing into the cell a sequence-specific nuclease that cleaves a sequence within the target gene thereby introducing the artificial intron sequence into the target gene.
To inactivate the introduced intron, the method may also include the step of introducing or activating a recombinase or nuclease in the cell thereby excising or disrupting the branch point and abrogating splicing of the artificial intron sequence.
Alternatively, the intron may be disrupted by applying two sequence-specific nucleases (e.g. Cas9 with two guide NAs targeting two sites flanking the branch point) to excise the branch point and thus abrogate splicing. Alternatively, the guide RNA recognition sites flanking the branch point may also be identical, thus only requiring Cas9 along with a single guide RNA.
In another aspect, artificial introns are provided. The artificial introns may be as described herein and may be used in any of the methods described herein. The sequence of the artificial intron may comprise:
a) a splice donor sequence;
b) a first nuclease or recombinase site;
c) a branch point sequence;
d) a second nuclease or recombinase site;
e) a splice acceptor sequence; and
f) a stop codon positioned 5' to or within the first recombinase or nuclease site.
The sequence of the artificial intron may comprise two stop codons, more preferably three stop codons, which may be positioned in a series of three or more stop codons 5' to the branch point.
Advantageously, the artificial intron may contain a single branch point and may also lack related sequences that can take over branch point function once the single branch point has been removed.
The sequence-specific nuclease used to cleave the target gene may be an NA guided nuclease, or a DNA guided nuclease, a Zinc finger nuclease or a TALEN. The RNA-guided nuclease may be Cas9 or an RNA-guided Argonaute. The sequence-specific nuclease may be introduced as a protein, m RNA, or cDNA.
The first and second recombinase sites may be loxP sites or FRT sites.
In some embodiments the first (5') loxP site comprises at least one stop codon.
The first recombinase or nuclease site is positioned adjacent to the splice donor site. Advantageously, the artificial intron sequence may be small, for example, about 200-
250, 200-300, 200-400, 200-500, 200-600, 200-700 or 200-800 or 200-1000 or 200 to 1500 nucleotides in length.
In some embodiments the artificial intron sequence comprises a nucleotide sequence at least 80%, 85%, 90%, 95%,96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO:
1 .
In another aspect, the invention provides cells containing any of the artificial introns as described herein. Also provided are cells containing an artificial intron introduced according to any of the methods described herein.
In some embodiments the cells comprise an artificial intron in an exon of a target gene, the artificial intron having at least one stop codon positioned 5' of or within the first recombinase or nuclease site, and wherein the single branch site is flanked by a pair of recombinase or nuclease sites, the recombinase or nuclease sites being arranged such that the single branch site may be excised or disrupted, thereby abrogating splicing of the artificial intron.
In some embodiments provided herein is a cell comprising an artificial intron in an exon of a target gene, the artificial intron having at least one stop codon positioned 5' to or within a first recombinase or nuclease site, and wherein a single branch site is flanked by a pair of recombinase or nuclease sites, the recombinase or nuclease sites being arranged such that the single branch site may be excised or disrupted, thereby abrogating splicing of the artificial intron, and wherein the artificial intron sequence also comprises a splice donor sequence and a splice acceptor sequence. In some embodiments provided herein is a cell comprising an artificial intron in an exon of a target gene, the artificial intron sequence comprising:
a) a splice donor sequence;
b) a first nuclease or recombinase site;
c) a branch point sequence;
d) a second nuclease or recombinase site;
e) a splice acceptor sequence; and
f) a stop codon positioned 5' to or within the first recombinase or nuclease site.
The cells used in the methods herein, or comprising the artificial intron sequences or nucleic acid constructs described herein, may be eukaryotic cells, including mammalian cells such as haploid cells (e.g. the HAP1 cell-line), human cells, human induced pluripotent cells, human induced pluripotent stem cells, embryonic stem cells, or CHO cells or a T-cell (such as those for use in human therapy).
The cells described herein may be used in therapy.
The cells described herein may be used to generate transgenic animals.
In another aspect, there is provided a nucleic acid construct comprising an artificial intron sequence as described herein and which may be used in any of the methods described herein. In some embodiments, for example where the artificial intron is to be introduced into the target gene by homology directed repair, the nucleic acid construct further comprises 5'
and 3' homology arms comprising sequences homologous to corresponding target gene sequences. Suitably, the nucleic acid constructs described herein may be part of a plasmid.
In another aspect, there is provided a vector comprising the artificial intron sequence or nucleic acid constructs as described herein.
Some embodiments of the methods and cells provided herein are discussed further below. Generation of conditional Knockout (KO) mice and rats
In the past 20 years, the Cre/loxP system has been used to create a variety of conditional KO mice. In most of these, a coding exon of the target gene was flanked with loxP sites. In the absence of Cre, gene expression is unaffected. Upon addition or activation of Cre, the intervening exon is deleted or inverted, resulting in a gene KO.
Cre could be applied by transient transfection or virus infection. Alternatively, mice could already carry a Cre transgene, often coupled to a promoter that warrants expression in a specific tissue or at a certain stage of development. Mice harbouring Cre are then crossed with strains harbouring alleles flanked with loxP sites. In that way, target gene expression can be disrupted in a tissue or developmental stage-specific manner, depending on which Cre mouse line was used for the intercross.
The approach to introduce artificial introns described herein can be used in a very similar manner, i.e. to create rodents harbouring conditional KO alleles. In a second step, such models can be crossed with rodents strains harbouring Cre, thereby giving rise to rodent models that delete the target gene wherever Cre is activated. This allows researchers to study gene KO phenotypes in specific cell types or developmental stages.
Thus provided herein are non-human transgenic animals whose genome comprises the artificial intron sequence described herein
Establishment of conditional Knockout (KO) cell lines
Many academic researchers wish to study the loss-of-function phenotype of essential genes. Yet, establishing a gene KO in an essential gene is not possible because the resulting cell line is not viable.
In mice, this problem is often solved by creating a heterozygous conditional allele (e.g. by flanking an exon with loxP sites) and backcrossing heterozygous mice to homozygosity. This is unfortunately not possible in cell lines as they do not undergo sexual reproduction and hence, the conditional KO allele cannot be separated from the wild-type allele.
This problem is solved in an elegant way using the approach described herein: By targeting a coding exon with a g NA and introducing an artificial intron on one allele, the second allele is often disrupted as a consequence of Cas9 cleavage and repair by non-homologous end joining. Hence, the resulting cell line will bear one allele harbouring the artificial intron and a second allele harbouring a frameshift mutation. For many essential genes, such lines will be viable. Hence, this solution alleviates the need for backcrossing and allows conditional KO model generation in one-go.
Cell Therapy
In cell-based therapy, one may encounter a scenario where a gene Knockout(KO) is only desirable in a certain subtype or cells or at a certain developmental stage. In that case, it is feasible to engineer the total population of cells (e.g. all T cells) with the conditional KO cassette described herein. In a second step, Cre is then applied to a specific subset (e.g. the CD4-positive T cells) to inactivate the target gene selectively in this sub-population of cells. iPSC differentiation
Stem cell differentiation is often heterogenous or incomplete. One way to improve the quality and outcome of the process would be inactivate certain genes at a specific point during the process and thereby steer cell differentiation in a certain direction. For instance, one could introduce a conditional Knockout (KO) allele into a pluripotency factor (e.g. Oct4/POU5Fl; as shown in Example 2) and thereby create an iPSC line in which pluripotency can be terminated at will. Then, this line will be subjected to differentiation by applying the appropriate cocktail of factors or by transducing with one or more master regulators as is known in the art.Once cells are starting to differentiate, one activates the conditional Oct4 allele (e.g. by addition of Cre recombinase). As a consequence, cells that have not undergone differentiation will be eliminated from culture and the resulting cell population will be more pure.
Thus in an embodiment of the methods and cells provided herein the cell may be an induced pluripotent stem cell and the target gene is Oct4.
In a similar fashion, the conditional KO approach can be used to create more homogeneous differentiation protocols. Genes that drive and control the maintenance of intermediate states of differentiation (e.g. like pancreatic) could be engineered with the conditional KO cassette at the iPSC stage. Once the differentiation into the final desired cell types (e.g. b-cells) is accomplished, the conditional KO cassette can be activated, leading to the death of progenitor cells that contaminate the culture.
Reduction of tumorigenesis by suicide switches
Uncontrollable proliferation of cells (i.e. tumorigenesis) is one of the most challenging safety concerns of cell and stem cell treatments. One way to tackle this problem is the development of "suicide switches" that can be activated to trigger cell death at any given stage posttransplantation. Another approach consists of the development of "proliferation block switches", whose activation leads to blocking of the proliferation capacity of the transplanted cells. A lack of proliferation is of essence for long term cell grafts, such as the transplant of dopaminergic cells for treatment of Parkinson's disease.
The conditional KO cassette described herein can be used to create a proliferation switch by targeting a gene involved in cell cycle (e.g. Cyclin-dependent Kinases). Blocking of proliferation can be carried out upon cell expansion to the required amounts for transplantation, or after stem cell differentiation into the desired cell type, and can be performed both prior and posttransplantation (different strategies can used for Cre delivery or Cre-activation).
Generation of RNA fusions for reference standards
Certain fusion transcripts (e.g. BC -ABL) are commonly produced by cancer cells, and detection of these constitutes the diagnostic of certain cancer types. Reference standards are materials that can be used as positive or negative controls during the diagnostic of patient material. The conditional KO cassette described herein can be used to create transcript fusions to be used as reference material by allowing the hiding of a selection cassette (e.g. antibiotic resistance such as puromycin). This feature enables the generation and selection of fusion transcripts composed of an endogenous gene and a transgene, without any scar at the RNA level, as the synthetic intron is removed by slicing.
Hiding an exogenous sequence in the genome
In some instances, one may wish to introduce a sequence into the genome that does not affect gene expression of the resulting cell line. One could obviously target non-coding regions of the genome, but this is risky as one never knows whether a given sequence does or does not have an impact on gene expression. For instance, an element could represent an enhancer that is poorly described or annotated and altering this enhancer could have a profound impact on gene expression.
In another aspect, provided herein is a method which solves this problem: As the intron is actively removed by splicing, it is not present at the mRNA level and does therefore not
impact transcription. Modifying the intron to contain an exogenous sequence (e.g. a molecular barcode) could represent an elegant way of "hiding" this sequence in the genome without leaving a trace in the transcriptome.
Figures
Figure 1: NanoLuc expression from constructs containing synthetic intron configurations HAPl cells were transfected with a pcDNA3.1 plasmid expressing NanoLuc under the control of a CMV promoter. Cells were lysed 24h post transfection and analysed using the Nano-Glo Luciferase Assay System (Promega) according to manufacturer's instructions. Median luminescence across 24 replicates is shown for each condition. As a control, expression levels from a plasmid containing the NanoLuc gene or the NanoLuc with the chimeric intron with no loxP sites were measured.. For sequence details of the different artificial introns see below SEQ I D NO:2 to SEQ ID NO: ll.
Figure 2: NanoLuc expression from constructs with chimeric intron at a different site
HAPl cells were transfected with a pcDNA3.1 plasmid expressing NanoLuc under the control of a CMV promoter. Cells were lysed 24h post transfection and analysed using the Nano-Glo Luciferase Assay System (Promega) according to manufacturer's instructions. Median
Luminescence across 24 replicates is shown for each condition. As a control, expression levels from a plasmid containing the NanoLuc gene or the NanoLuc with the artificial intron were measured.
Figure 3: Genotyping CDK4-intron and METTL16-intron cell lines after Cre recombination Cells expressing CDK4-intron or M ETTL16-intron were infected with a recombinant retrovirus encoding Cre-ERT2. Upon enrichment of transduced cells, Cre was activated by adding ΙμΜ 4- hydroxitamoxifen (4-OHT) for three days. Genomic DNA from CDK4-intron and M ETTL16- intron cell lines treated with Cre recombinase and/or 4-OHT was used for a PCR amplifying the region around the insertion site of the artificial intron. Genomic DNA from wild type HAPl cells was used as a control. The expected sized of the PCR products are as follows. For CDK4,
HAPl WT: 210bp, CDK4-intron (before recombination): 411bp, CDK4-intron (after recombination): 264bp. For M ETTL16, HAPl WT: 250bp, M ETTL16-intron (before recombination): 451bp, M ETTL16-intron (after recombination): 304bp.
Figure 4: Cell viability of CDK4-intron and METTL16 intron after Cre recombination
Cells described in Figure 3 were treated with 4-OHT for six days, fixed with methanol and stained with crystal violet to assess cell viability. Figure 5: Approach for conditional gene inactivation using artificial introns.
(A) For the purpose of conditional gene inactivation, an artificial intron is introduced into a coding exon of a gene. In the artificial intron (marked in blue), the branch point (in yellow) is flanked by two parallel loxP sites (black triangles). Under steady-state conditions, the intron gets removed by splicing and leaves the m NA of the target gene intact. (B) Upon recombination with Cre, the branch point is excised and the intron is inactivated.
Consequently, the intron is no longer removed and the ribosome reads through the intronic sequence, running into one of three stop codons that were added to the artificial intron cassette. (C) DNA sequence of the artificial intron that was used for most of the experiments presented in Example 2. Important features are highlighted : splice donor in green, loxP sites are in lower case letters, stop codons in three reading frames in red italics, branch point in yellow, polypyrimidine tract is underlined and splice acceptor in blue.
Figure 6. Identification of the artificial intron configuration that enables conditional gene inactivation. (A) HAPl cells were transfected with NanoLuc without intron ("no intron") or one harbouring the intron without further modification ("intron"). In addition, a set of cassette variants (Var 1-5) were tested in which loxP sites positioning relative to the branch point varied. Note that we included two designs for each cassette, one in which the cassette was intact ("no Cre") and one in which the sequence between the loxP sites had been removed ("Cre"). NanoLuc levels were measured 24h post transfection using the Nano-Glo Dual assay. This data is also shown in Figure 1. (B) Variant 4 identified in Figure 6A was placed into various sequence contexts within the NanoLuc gene, reflecting one of four possible insertion sites: CAG-G, AAG-G, CAG-A and AAG-A. For each of these, two sites were evaluated. H EK293 cells were transiently transfected with these various constructs and NanoLuc luciferase activity was measured using the NanoGlo Dual assay 48h hours after transfection.
Figure 7. Cre-recombination leads to branch point excision. (A) Genomic DNA was isolated from cells transfected with Cre (for CD46) or cells transduced with Cre-ERT2 and treated with 4-hydroxitamoxifen (4-OHT; for M ETTL16). Samples were genotyped using PCR primers specific for CD46 or M ETTL16. The right hand side of Figure 7A, regarding M ETTL16, is also
shown in the right hand side of Figure 3. (B) m NA was isolated from cells expressing Cre, reverse transcribed using oligo (dT) and analysed with primers specific for the CD46 cDNA.
Figure 8. Artificial intron cassette activation leads to conditional gene inactivation. (A and B) HAP1 cells bearing the artificial intron in CD46 were transfected with Cre recombinase. For Western blotting, cells were analysed using a CD46-specific antibody. For FACS, cells were stained with a CD46-specific antibody and analysed by flow cytometry. (C) M ETTL16-DECAI cells were transduced with Cre-ERT2, treated with 4-OHT as indicated and stained with Crystal Violet. The data in Figure 8C is also shown in Figure 4, right hand side.
Figure 9. Generation of Oct4-conditional knockout human iPS cells. (A) Wild-type cells (+/+) or cells bearing the artificial intron in exon 1 of Oct4 (Oct4-DECAI #1 and Oct4-DECAI #2, created with two independent gRNAs) were transfected with Cre-ERT2 recombinase and 4-OHT or left untreated. Cells were fixed in 4% PFA and stained for Oct4 and SSEA4 using the specific antibodies. DAPI was used for nuclear staining. Images were acquired with an Olympus 1X83 microscope (10X magnification, ΙΟΟμιη scale bars). For bright field images see Figure 14. (B) Flow cytometry analysis of the cells before (No Cre) and after Cre treatment (+Cre) stained with Oct4 and SSEA4 specific antibodies. (C) qRT-PCR analysis was performed before and after induction of Cre recombination. Expression levels of Oct4, two additional pluripotency markers (Nanog and Sox2), and two differentiation markers (Cdx2 and Eomes) are shown. qRT-PCR values are normalized to GAPDH and shown as relative to the highest value of that target across all samples.
Figure lO.Homology donor templates for cell line generation. Homology donor templates used to engineer the CD46-DECAI, M ETTL16-DECAI and OCT4-DECAI cell lines are shown. The donors contain ~400bp right and left homology arms flanking the 201bp artificial intron cassette (underlined).
Figure 11. Analysis of CD46 expression in CD46-DECAI and WT Hapl cells. HAP1 wild-type and CD46-DECAI cells were transfected with a plasmid expressing Cre recombinase. Cells were then stained with a CD46-specific antibody and analysed by flow cytometry.
Figure 12. Isolation of Cre-ERT2 expressing METTL16-DECAI clones. Crystal violet staining of single cell clones isolated from two M ETTL16-DECAI cell lines (Clone B5 and Clone G4). 12 clones of each cell line were isolated following transduction with Cre-ERT2 and these potentially Cre-ERT2 expressing clones were treated with 4-OHT to trigger cassette activation.
Clones in which 4-OHT triggered cassette activation, and hence, cell death, are marked with a box.
Figure 13. Cre-recombination leads to a change in morphology of Oct4-DECAI cells. Bright field microscopy images of WT, Oct4-DECAI#l and Oct4-DECAI#2 after transfection with Cre-ERT2 and 4-OHT treatment. A clear change in cell morphology is observed only in the Oct4-DECAI#l and Oct4-DECAI#2 cells.
Figure 14. Cre-recombination leads to branch point excision in Oct4-DECAI cells. Genomic DNA was isolated from WT, Oct4-DECAI#l and Oct4-DECAI#2 cells that had been transfected with Cre-ERT2 and treated with 4-OHT, and from cells not transfected with Cre-ERT2. Samples were genotyped using PCR primers specific to either the cassette insertion site of Oct4-DECAI#l or Oct4-DECAI#2.
Detailed Description
Introns are non-coding or intervening polynucleotide sequence of varying length, normally present within most eukaryotic genes and which are removed from a newly transcribed m RNA precursor by the process of splicing. In general, the process of splicing requires that the 5' and 3' ends of the intron be correctly cleaved and the resulting ends of the m RNA be accurately joined, such that a mature m RNA having the proper reading frame for protein synthesis is produced. Many splice donor and splice acceptors sites, meaning the sequences immediately surrounding the exon-intron- and intron-exon-boundaries, have been characterized and described and are known to the skilled artisan.
Normally, the sequence of mammalian introns begins with GT and ends with AG with a few minor introns starting with AT and ending with AC.
The artificial introns described herein comprise all the intron elements necessary for normal intron function. However, the artificial introns described herein have been modified by the inclusion of recombinase or nuclease sites so that intron function may be disrupted and gene function knocked-out.
The artificial intron described herein may be based on a natural intronic sequence with the intron elements that are necessary for normal intron function, including (i) a 5' splice donor, branch point and a 3' splice acceptor (ii) at least one of these intron elements, typically the branch point, is flanked by a pair of recombinase or nuclease sites; (iii) the 5'end of the
cassette comprises at least one stop codon (preferably two stop codons, more preferable a series of three stop codons; one for each reading frame) which will be in frame with the target gene. As a consequence, treatment with the recombinase or the nuclease will excise or invert the intron element, e.g. branch site, between the recombinase or nuclease sites, abrogate splicing of the intron causing the ribosome to run into a stop codon and terminate translation.
An artificial intron sequence derived from natural mammalian intron sequences may be used, for example based on a well-characterized chimeric intron found in the pCI-neo mammalian expression vector from Promega (Cat. No. E1841). This artificial intron is created by combining the 5' splice donor site from an intron of the human β-globin gene with the branch and
3'splice acceptor site of the intron of an immunoglobulin gene heavy chain variable region (SEQ I D NO: 1).
An artificial intron useful in this invention may be an intron which leads to comparable expression of the target gene of interest when placed in an exon of the target gene. Ideally, following disruption of the inserted intron, by recombinase addition or the like, gene expression should be significantly or even completely abrogated.
The artificial intron can be a) derived from a natural intron but may be modified by nucleotide substitutions, deletions and/or insertions, b) a chimeric intron composed of different intron sequences derived from one or more natural intron sequences of the gene of interest and/or of different genes, c) a de novo designed synthetic intron or d) any combination of the above.
Artificial intron sequences as described herein contain at least three conserved intron elements which are found in natural introns and are essential for splicing: a 5 ' splice site (also known as splice donor site), a branch point (or branch site) and a 3 ' splice site (also known as a splice acceptor) adjacent to a run of pyrimidines called a polypyrimidine tract. Recognition of these sites by the splicing machinery is followed by the excision of an intron. Splice donor site, splice acceptor site and branch point sequences are well known in the art and any may be utilized in the present invention. These elements can be found, inter alia, in the art or derived from consensus sequences, either empirically by inserting, deleting or substituting nucleotides, or by using software capable of predicting splicing sequences.
Preferred splice donor and splice acceptor sites include the consensus splice donor and splice acceptor sites mentioned below and the sites shown in SEQ. ID NO. 1. However, other sequences with sufficient splicing efficiency can be used as well. Efficient splice donor and acceptor sites suitable for this invention can be readily determined using techniques for measuring the efficiency of splicing. Intron splicing efficiency is readily determined by quantifying the spliced transcripts versus the full-length, unspliced transcripts that contain the intron(s), using methods known in the art.
The consensus sequence for a splice donor site consists of the sequence MAG:GTRAGT (with M = C or A, R = A or G and the colon denoting the site of exon-intron boundary = cleavage site) (Ohshima et al, J. Mol. Biol. 195, 247 - 259, 1987).
The consensus sequence for a splice acceptor is composed of a polypyrimidine tract (CnTn) that extends for 10 15 nucleotides followed by CAG:G (with N = any nucleotide and the colon denoting the site of intron-exon- boundary = cleavage site) (Seif et al, Nucleic Acids Res. 6(10), 3387-98, 1979).
As mentioned above in the description of splice donor and splice acceptor site, the consensus sequences include sequences in the exon as well as the intron. The sequence necessary in the exon is MAG-G (where M = C or A and the hyphen denotes the intron).
The splice sites as described herein include naturally occurring, engineered or synthetic, consensus or cryptic splice sites. The branch point or branch site is usually located approximately between 10 and 60 nucleotides upstream of the splice acceptor and forms, during the splicing process via its conserved adenosine residue, a lariat structure with the splice donor. The consensus sequence for the branch point is reported as YNYYRAY (with Y = T or C, R = A or G, N = any base; the conserved adenosine involved in the lariat formation is underlined). In mammalian genomes one preferred branch point sequence is TACTAAC (Zhuang et al, PNAS 86, 2752-2756, 1989).
The branch point may be adjacent to or within the polypyrimidine tract of the splice acceptor site.
Preferably, the artificial intron used in the constructs and methods of the invention has one or more stop codons in all possible 3 reading frames and/or has a nucleotide sequence length which is not dividable by 3 to prevent a complete read through of the intron sequence in case of a non-splicing event. The artificial intron sequence may include stop codons present in the original intron sequence but may also include additional stop codon(s) positioned between the splice donor site and a first recombinase site. The 5'end of the artificial intron preferably contains at least one stop codon, or two or three stop codons, that will be in frame with the target gene following successful recombination (and intron destruction). As a consequence, treatment with the recombinase or the nuclease will destroy the intron and thus the ribosome will run into a stop codon and terminate translation.
Cre and Flp are the recombinases that are most commonly used in molecular biology. Cre can recognize LoxP sites (or variants thereof); Flp recognizes F T sites (or variants thereof). If a given site of interest is flanked by two parallel recombinase sites, then the addition of the cognate recombinase will trigger the excision of the intervening sequence with high efficiency and precision.
In terms of designing the artificial intron of the present invention, the key challenge was to add the recombinase sites (e.g. loxP sites as shown in the Exam ples herein) in a way that did not interfere with splicing. At the same time, the recombinase sites or nuclease sites should be positioned to flank an intron element, e.g. branch site, that is of critical importance to maintain splicing, such that when this intron element is excised or disrupted, by recombinase or nuclease addition, intron function is disrupted. The latter was particularly difficult as the branch point sequence, which was the prime candidate sequence for excision or disruption, is not very well defined and the sequence requirements are quite degenerate. As a consequence, deleting one branch point sequence might lead to an adjacent sequence taking over branch point activity. Hence, it was important to empirically define a cassette architecture in which splicing is maintained in the presence of the loxP sites and splicing is completely abrogated upon deletion of sequences that lie between the loxP sites (so that no other sequences can take over branch point function).
The first loxP site of the pair flanking the branch site, which, typically, remains in the target gene following recombinase treatment, may be placed close to the 5' splice donor site, for example.
The first lox P site may comprise one or more stop codons. Variants of loxP sites are known to the person skilled in the art, for example, as shown here: https://en.wikipedia.org/wiki/Cre-
Lox recombination.
Many of the loxP site variants that have been described contain one or more stop codons.
The 5' loxP site may be placed in close proximity to the splice donor. The 3' loxP site may also be placed in close proximity to the splice acceptor whilst retaining a functional polypyrimidine tract. Functionality can be assessed by common methods to measure splicing activity. The branch site should be contained between the two loxP sites.
If the loxP sites are in parallel orientation, the sequence between the two sites is excised.
The present invention provides an artificial intron that may, advantageously, be small in size, for example about 200-250, 200-300, 200-400, 200-500, 200-600, 200-700 or 200-800 or 200- 1000 nucleotides in length. Insertion of this cassette into the coding exon of a gene does not significantly disrupt gene function as the cassette is based on natural intron sequences and is thus removed by splicing.
As mentioned above in the description of splice donor and splice acceptor site, the consensus sequences include sequences in the exon as well as the intron. The sequence necessary in the exon is MAG-G (where the hyphen denotes the intron).
Gene tagging can be conducted based on two concurring DNA repair mechanisms: Nonhomologous end joining or homology-direct repair. In the case of homology-directed repair, a donor nucleic acid construct is used that includes, in addition to the artificial intron sequence, homology arms of 100bp-5kb length. Importantly, both the 5' and the 3'end of the artificial intron needs to be flanked with homology arms. If such a nucleic acid construct is applied together with a nuclease, such as Cas9, then the artificial intron can be integrated into the exon of a target gene at reasonable efficiencies.
Similar results can be obtained with a donor nucleic acid construct that includes gRNA recognition sites13 and another gRNA that cuts the target gene at the site where intron integration is meant to occur. While HDR is more precise and more efficient, it does require custom donor nucleic acid construct assembly (due to the homology arms). The NHEJ
approach, in contrast, can be applied across a variety of genomic loci and does not require any adaptation.
In either of the two scenarios, the resulting cell line will only represent a conditional knockout if all alleles in that cell are tagged. This is usually difficult as the efficiency of bi-allelic gene tagging is very low. This represents a limitation for the approach presented here, but even more so for competing approaches such as the degron tagging approach. Consequently, competing approaches have not been applied at scale. We envisage that the method presented here offers an attractive compromise to solve this problem, i.e. by targeting one allele with the conditional KO cassette and destroying the second allele with C ISP /Cas cleavage, followed by the introduction of a frameshift mutation through erroneous N H EJ. As a consequence, one would end up with one allele containing a frameshift mutation and the other one containing a conditional allele that could be inactivated upon recombinase addition.
Such alleles would be usually referred to as "hypomorphic alleles" because they only retain gene function partially. Large scale mutagenesis in yeast has suggested that many essential genes tolerate such hypomorphic alleles.
Recombinase treatment/delivery
Recombinases can be applied in many ways that are known to the skilled person. For instance, one could transiently transfect cells using lipofection or electroporation with an expression plasmid encoding the recombinase. Second, one can infect cells with a recombinant virus (retrovirus, lentivirus, adenovirus are most commonly used) encoding the recombinase. Third, one could create a cell line that stably and inducibly expresses the recombinase. Fourth, one could envisage applying the purified recombinase protein, either by itself (following electroporation) or as fusion with a cell-penetrating peptide (e.g. the arginine-rich motif of the H IV-1 TAT).
Guide RNA
Where the term "guide RNA" or "gRNA" is used herein, a person skilled in the art would understand this could mean synthetic gRNA, gRNA obtained by in vitro transcription or gRNA
expressed in cells from plasmid or PC product. In some instances herein, "T" is used in place of "U" in guide RNA sequences, as would be readily apparent to the skilled person.
Various further aspects and embodiments of the present invention will be apparent to those skilled in the art in view of the present disclosure.
All documents mentioned in this specification are incorporated herein by reference in their entirety. "and/or" where used herein is to be taken as specific disclosure of each of the two specified features or components with or without the other. For example "A and/or B" is to be taken as specific disclosure of each of (i) A, (ii) B and (iii) A and B, just as if each is set out individually herein.
Unless context dictates otherwise, the descriptions and definitions of the features set out above are not limited to any particular aspect or embodiment of the invention and apply equally to all aspects and embodiments which are described.
Certain aspects and embodiments of the invention will now be illustrated by way of example and with reference to the figures and sequences described herein. Examples
EXAMPLE 1
The sequence of the intron cassette used in the examples herein is based on a well- characterized chimeric intron found in the pCI-neo mammalian expression vector from Promega (Cat. No. E1841). This chimeric intron is created by combining the 5' splice donor site from an intron of the human β-globin gene with the branch and 3'splice acceptor site of the intron of an immunoglobulin gene heavy chain variable region (SEQ I D NO:l).
Chimeric intron sequence (SEQ ID NO:l)
The chimeric intron is shown with important splicing motifs highlighted. 5' and 3' splice sites are shown in bold, the branch site is double underlined and the polypyrimidine tract is underlined.
SEQ I D NO:l
GTAAGTATCAAGGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAGAGAA GACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACAG
LoxP sites are 34 base pair sequences that are recognized by Cre recombinase to trigger a specific recombination event depending on their location and orientation. If the loxP sites are in parallel orientation, the sequence between the two sites is excised. To enable deletion of the branch site, two loxP sites were added to the cassette in parallel orientation flanking the branch site. Cre recombination will cause excision of the branch site and thus abrogate removal of the intron by the splicing machinery. As a consequence, the ribosome will read into the remaining cassette and encounter a premature stop codon that disrupts the gene of interest. In order to ensure that a stop codon would be encountered in all three reading frames, we mutated a single nucleotide near the beginning of the chimeric intron. To be certain that efficient splicing of the intron was maintained with these modifications, we created five different versions of the cassette with the loxP sites at varying locations (SEQ I D NOs: 2,4,6,8 and 10). To simulate the maximal impact that Cre recombination could have on these cassettes, we also included the respective derivatives that would arise following removal of the sequence between the loxP site (SEQ ID NOs: 3,4,7,9 and 11). It was important to find a configuration where the addition of the loxP sites did not alter splicing of the intron but would result in a loss of target gene expression upon loxP site excision.
Five versions of the synthetic intron with loxP sites were constructed, the sequences and the corresponding expected sequence after Cre treatment are shown below with the important features highlighted. 5' and 3' splice sites are shown in bold, the branch site is double underlined and the polypyrimidine tract is underlined. loxP sites are in lower case letters and the stop codons in all three reading frames are shown in italics. The single nucleotide that was mutated to create a stop codon is in brackets. For each version, the expected sequence after Cre treatment and loxP site excision is also shown.
Version 1: Artificial intron with loxP sites (SEQ ID NO:2)
GT^GTATCAAGGTL4 G7AAGACAGGTTL44GGAGACCAATAGAAACTGGGCTTataacttcgtatagcata cattatacgaagttatGTCGAGACAGAGAAGACTCTTGCGTTTCTG TAGGCACataacttcgtatagcatacattat acgaagttatCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACAG
Version 1: Expected sequence after Cre treatment (SEQ ID NO:3)
GT^GTATCAAGGTL4 G7AAGACAGGTTL44GGAGACCAATAGAAACTGGGCTTataacttcgtatagcata cattatacgaagttatCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACAG
Version 2: Synthetic intron with loxP sites (SEQ ID NO:4)
Gr^G7"4otaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCAATA
GAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTG TAGGCACataacttcgtatagcatacattat acgaagttatCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCACAG
Version 2: Expected sequence after Cre treatment (SEQ ID NO:5)
Gr^G7 ataacttcgtafaqcatacattatacgaagttatCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTC CACAG Version 3: Synthetic intron with loxP sites (SEQ ID NO:6)
GT^GL4ofaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCAATA GAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACAT CCACTTTGCCTTTCTCTCCAataacttcgtatagcatacattatacgaagttatCAG Version 3: Expected sequence after Cre treatment (SEQ ID NO:7)
GT^GLAotaacttcgtafogcatacattatacgaagttatCAG
Version 4: Synthetic intron with loxP sites (SEQ ID NO:8)
Gr^G7"4otaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCAATA GAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACAT CCACTTTGCCataacttcgtatagcatacattatacgaagttatTTTCTCTCCACAG
Version 4: Expected sequence after Cre-treatment (SEQ ID NO:9)
Gr^G7"4ataacttcgtafaqcatacattatacgaagttatTTTCTCTCCACAG
Version 5: Synthetic intron with loxP sites (SEQ ID NO:10)
Gr^GL4otaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCAATA GAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACAa taacttcgtatagcatacattatacgaagttatTCCACTTTGCCTTTCTCTCCACAG
Version 5: Expected sequence after Cre treatment (SEQ ID NO:ll)
Gr^G7 ataacttC tafgqcatacattatac aa ttatTCCACTTTGCCTTTCTCTCCACAG As a proof of concept, we tested the effect of the synthetic intron and modified synthetic introns (with loxP sites) on the expression of the NanoLuc Luciferase gene. To this end, we inserted the various cassettes mentioned above into the coding sequence of the NanoLuc gene. When selecting the site to insert the artificial intron, we ensured that it conformed to the exon sequence requirements (MAG-G; M = C or A and hyphen denotes intron) to ensure efficient splicing. To ensure that deletion of the loxP sequence would result in loss of expression, we also cloned expression plasmids containing the NanoLuc gene with the expected sequence of the artificial introns after Cre treatment (shown above in SEQ ID NOs: 3,5,7, 9 and 11). The ideal cassette should not alter expression of the NanoLuc reporter, but should abolish expression after Cre treatment due to the premature stop codons encoded in the sequence.
NanoLuc sequence with intron insertion sites marked (SEQ ID NO: 12)
NanoLuc sequence is shown below. For the experiments shown in Figure 1 the synthetic intron was inserted in the underlined site between the CAG and G. For the experiments shown in Figure 2, the intron was inserted at the double underlines site between the AAG and G.
ATGGTCTTCACACTCGAAGATTTCGTTGGGGACTGGCGACAGACAGCCGGCTACAACCTGGACCAAGT CCTTGAACAGGGAGGTGTGTCCAGTTTGTTTCAGAATCTCGGGGTGTCCGTAACTCCGATCCAAAGGA TTGTCCTGAGCGGTGAAAATGGGCTGAAGATCGATATCCATGTCATCATCCCGTATGAAGGTCTGAGC GGCGACCAAATGGGCCAGATCGAAAAAATTTTTAAGGTGGTGTACCCTGTGGATGATCATCACTTTAA GGTGATCCTGCACTATGGCACACTGGTAATCGACGGGGTTACGCCGAACATGATCGACTATTTCGGAC
GGCCGTATGAAGGCATCGCCGTGTTCGACGGCAAAAAGATCACTGTAACAGGGACCCTGTGGAACGG
CAACAAAATTATCGACGAGCGCCTGATCAACCCCGACGGCTCCCTGCTGTTCCGAGTAACCATCAACG
GAGTGACCGGCTGGCGGCTGTGCGAACGCATTCTGGCGTAA We electroporated HAP1 cells with the NanoLuc expression plasmids and assessed luciferase levels after 24 hours using the NanoGlo Luciferase Assay system (Promega) (Figure 1).
Comparing NanoLuc and NanoLuc + intron conditions demonstrates that the presence of the chimeric intron in the NanoLuc gene does not have a profound effect on NanoLuc expression. Importantly, loss of NanoLuc expression in one of the 5 constellations, version 3, highlights
that the position of the loxP sites is not trivial and insertion of loxP sites can completely abrogate splicing and thus expression of NanoLuc. Overall, construct number 4 (SEQ ID NO:8 and SEQ ID NO:9) of the synthetic intron is the configuration that had no effect on NanoLuc expression with the cassette, but caused a profound ~30 fold reduction with the post-Cre configuration (compare NanoLuc + intron loxP v4 and NanoLuc + intron loxP post-Cre v4 conditions in Figure 1). Given these results, we decided to proceed with the version 4 cassette (SEQ ID NO:8) for all further experiments.
To rule out positional effects on splicing of the synthetic intron, we inserted version 4 cassette into a secondary location with the NanoLuc gene (See SEQ ID NO: 12 for details ). The only requirement that had to be adhered to when selecting an insertion site was to have the necessary exonic splicing motifs present (the MAG-G sequence consensus).
The NanoLuc expression plasmids with the version 4 cassette inserted at the new site were electroporated in HAPl cells and NanoLuc expression was assayed after 24 hours (Figure 2).
Insertion into the alternative site had no effect on expression of NanoLuc, but decreased NanoLuc expression 70 fold upon induction, indicating that synthetic intron function is not sensitive to the sequence context and works robustly at different sites in the NanoLuc gene.
To expand on these results, we set out to test this strategy on endogenous genes in HAPl cells. As test cases, we selected one essential gene, M ETTL16, and one non-essential gene, CDK47. Essentiality for all genes in HAPl cells has been determined experimentally using extensive gene-trapping followed by next generation sequencing analysis7. By comparing the number of disruptive versus non-disruptive gene-trap integrations in each gene, the authors were able to confidently predict which genes are essential for viability in HAPl cells. The artificial intron was inserted into the genes by homology directed repair. To this end, HAPl cells were transfected with Cas9, a g NA targeting the locus where the synthetic intron would be inserted and a homology donor (Table 1, SEQ ID NO: 13 and SEQ I D NO:14). The site where the artificial intron would be inserted was selected following the exonic sequence requirements for splicing (SEQ I D NO:19 and SEQ I D NO:20). The homology donor comprised the synthetic intron and 400bp homology arms upstream and downstream the insertion site (SEQ I D NO:21, SEQ I D NO:22). Single cell clones containing the synthetic intron were isolated,
and genotyped by PCR and Sanger sequencing (primers: Table 1, SEQ I D NO:15 to SEQ I D NO:18 and Sanger sequencing: SEQ ID NO:23, SEQ I D NO:24).
Table 1: Guide RNA and primer sequences
Genomic region surrounding the artificial intron insertion site is shown below. Exonic sequence is shown in lower case letters and intronic sequence in uppercase letter. The exonic splicing sequence motif is shown in bold. The guide RNA used during the cell line engineering is underlined.
CDK4 (SEQ I D NO: 19)
AGGTGGGGTGTGATGATCTGTAGAGAAGTGGGGACCCTGAGGAAATAATGAGAGGCCATGTTGGGT TAAAGGGGATTGAAAAGTGAGCATTTACTCTGGTCAGgctgatggacgtctgtgccacatcccgaactgaccgggag atcaaggtaaccctggtgtttgagcatgtagaccaggacctaaggacatatctggacaaggcacccccaccaggcttgccagccgaaa cgatcaagGTGAGTGGGGTTGGTAGGCATTGAGAGGTGGATTGGGACCTTTGTAGTAGAACCTTCTGGG ATTTCAGGTATGGTGCCTAGTTTCCAGTGCATCTGTACCTCCCCTTTGAAACTAGgatctgatgcgccagtttct aagaggcctagatttccttcatgccaattgcatcgttcaccgagatctgaagccagagaacattctggtgacaagtggtggaacagtca agctggctgactttggcctggccagaatctacagctaccagatggcacttacacccgtgGTCAGTAGAAAGATGGTACCAAA ATGGGTTCTGGTTGGGAGTAGGAGAGTGATTGCCCGTAGCAATTGAGAAGTCATGTGCTTCATGTGTT CAGTCAAGCAAGTTGTGTTTCATGGTAACCCATGGGGTCCCCATCCATTCTTCCTATTCCCTTTAGgttgtt acactctggtaccgagctcccgaagttcttctgcagtccacatatgcaacacctgtggacatgtggagtgttggctgtatctttgcagag atgtttcgtcgaaaGTATGGGACCCACATACCCTGGACTACCTTGAATTCCCCAAATCGCTTGTTCATAAACC ACATCCATACCT
METTL16 (SEQ I D NO:20)
ACTTGTTCAAAGTCACACAGTGAGGAGACGACGAGACAGGAATTTGAGCCGAGTCATTTAATCTCTGT
GCTCCATATACGGATATGCAGTGTTTTTGGTGGAGGATTGTACTGATTTTCATAAGCTGTGACTGTGTG AAACACTCGTGTATGTGCCACTGGGATGAGTGCTCTGTGTAACTGTGTGTGATCTCCCTGGGAGGTTC G G ATG G GTATTTG GTTCTG CCTGTTTG CCGTAG AG CTTGTAC A AC AT AGTG A ATCTTACTA ATC AGTTTT ACTTCCTTTTCTTTATTCCTCAGccttaattttaaagaccccgaagcagtcagagctctgacgtgtactctcctaagggaagatt ttggactttctattgatattccattggagagactaattcccacagttcccttgagactcaactatattcactgggtagaagatctgatcggt caccaggattctgacaaaagtactctccgaagaggaattgacatagGTATATCATTTTAAATTC I I I I I I GGCTAAACAG
TTTTCATAAGTTTTGCGAGATCAAATTCTTTGTAAACTTTTCTACTGGGTATTTGTTGTTGTACAAAAGA
AGCTGGCTACAGATACAGGTTGAGCATCCTTAATCCGAAAATCCGAAATGCTTCAAAATCTGCAACCTT
TGAGCACTGATGTGTTGCTCAAAGGCAATGTGCGTTGGAGCGTTCCAGATTTGGGGGTTAGGGATGTT CAGCCTGTAAATATAGTGCACATATTTCAAAATCTGAAAAAAAAATTGAAATCCAAAACAC
Donor templates
The sequence used as a donor template with ~400bp homology arms flanking the artificial intron is shown below. For the artificial intron: 5' and 3' splice sites are shown in bold, the branch site is double underlined and the polypyrimidine tract is underlined. LoxP sites are in lower case letters and the stop codons in all three reading frames are shown in italics. The single nucleotide that was mutated to create a stop codon is in brackets.
CDK4 (SEQ ID NO:21)
AGGTGGGGTGTGATGATCTGTAGAGAAGTGGGGACCCTGAGGAAATAATGAGAGGCCATGTTGGGT TAAAGGGGATTGAAAAGTGAGCATTTACTCTGGTCAGgctgatggacgtctgtgccacatcccgaactgaccgggag atcaaggtaaccctggtgtttgagcatgtagaccaggacctaaggacatatctggacaaggcacccccaccaggcttgccagccgaaa cgatcaagGTGAGTGGGGTTGGTAGGCATTGAGAGGTGGATTGGGACCTTTGTAGTAGAACCTTCTGGG ATTTCAGGTATGGTGCCTAGTTTCCAGTGCATCTGTACCTCCCCTTTGAAACTAGgatctgatgcgccagtttct aagGr^G7 \otaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCA ATAGAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTG ACATCCACTTTGCCataacttcgtatagcatacattatacgaagttatTTTCTCTCCACAGaggcctagatttccttcatgcca attgcatcgttcaccgagatctgaagccagagaacattctggtgacaagtggtggaacagtcaagctggctgactttggcctggccaga atctacagctaccagatggcacttacacccgtgGTCAGTAGAAAGATGGTACCAAAATGGGTTCTGGTTGGGAGT
AGGAGAGTGATTGCCCGTAGCAATTGAGAAGTCATGTGCTTCATGTGTTCAGTCAAGCAAGTTGTGTT TCATGGTAACCCATGGGGTCCCCATCCATTCTTCCTATTCCCTTTAGgttgttacactctggtaccgagctcccgaa gttcttctgcagtccacatatgcaacacctgtggacatgtggagtgttggctgtatctttgcagagatgtttcgtcgaaaGTATGGGA CCCACATACCCTGGACTACCTTGAATTCCCCAAATCGCTTGTTCATAAACCACATCCATACCT METTL16 (SEQ ID NO:22)
ACTTGTTCAAAGTCACACAGTGAGGAGACGACGAGACAGGAATTTGAGCCGAGTCATTTAATCTCTGT GCTCCATATACGGATATGCAGTGTTTTTGGTGGAGGATTGTACTGATTTTCATAAGCTGTGACTGTGTG AAACACTCGTGTATGTGCCACTGGGATGAGTGCTCTGTGTAACTGTGTGTGATCTCCCTGGGAGGTTC G G ATG G GTATTTGGTTCTG CCTGTTTG CCGTAG AG CTTGT AC AAC ATAGTG A ATCTTACT AATC AGTTTT ACTTCCTTTTCTTTATTCCTCAGccttaattttaaagaccccgaagcagtcagagctctgacgtgtactctcctaagGr/i/iG7"
4otaacttcgtafogcatacattatacgaagttatTCAAGGTTA[G]AAGACAGGTTTAAGGAGACCAATAGAAACT GGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACATCCACTT TGCCataacttcgtatagcatacattatacgaagttatTTTCTCTCCACAGggaagattttggactttctattgatattccattgga gagactaattcccacagttcccttgagactcaactatattcactgggtagaagatctgatcggtcaccaggattctgacaaaagtactct ccgaagaggaattgacatagGTATATCATTTTAAATTCTTTTTTGGCTAAACAGTTTTCATAAGTTTTGCGAGA TC A AATTCTTTGTAA ACTTTTCTACTG G GT ATTTGTTGTTGTAC A A AAG AAG CTG G CTAC AG ATAC AG GT TGAGCATCCTTAATCCGAAAATCCGAAATGCTTCAAAATCTGCAACCTTTGAGCACTGATGTGTTGCTC AAAGGCAATGTGCGTTGGAGCGTTCCAGATTTGGGGGTTAGGGATGTTCAGCCTGTAAATATAGTGCA CATATTTCAAAATCTGAAAAAAAAATTGAAATCCAAAACAC
Clonal cell line sequencing results
Sanger sequencing results from genotyping PC s of CDK4-intron and M ETTL16-intron are shown below
METTL16-intron (SEQ I D NO:23)
TCTTTATTCCTCAGCCTTAATTTTAAAGACCCCGAAGCAGTCAGAGCTCTGACGTGTACTCTCCTAAGGT AAGTAATAACTTCGTATAGCATACATTATACGAAGTTATTCAAGGTTAGAAGACAGGTTTAAGGAGAC CAATAGAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACT GACATCCACTTTGCCATAACTTCGTATAGCATACATTATACGAAGTTATTTTCTCTCCACAGGGAAGATT TTGGACTTTCTATTGATATTCCATTGGAGAGACTAATTCCCACAGTTCCCTTGAGACTCAACTATATTCA CTGGGTAGAAGATCTGATCGGTCACCAGGATTCTGACAAA CDK4-intron (SEQ ID NO:24)
GGTGCCTAGTTTCCAGTGCATCTGTACCTCCCCTTTGAAACTAGGATCTGATGCGCCAGTTTCTAAGGT AAGTAATAACTTCGTATAGCATACATTATACGAAGTTATTCAAGGTTAGAAGACAGGTTTAAGGAGAC CAATAGAAACTGGGCTTGTCGAGACAGAGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACT GACATCCACTTTGCCATAACTTCGTATAGCATACATTATACGAAGTTATTTTCTCTCCACAGAGGCCTAG ATTTCCTTCATGCCAATTGCATCGTTCACCGAGATCTGAAGCCAGAGAACATTCTGGTGACAAGTGGTG GAACAGTCAAGCTG
The M ETLL16 and CDK4 clones containing the artificial intron (from now on referred to as M ETTL16-intron and CDK4-intron), were transduced with a retrovirus expressing Cre-ERT2 to induce recombination. Cre-ERT2 encodes a Cre recombinase fused to a mutant estrogen ligand-binding domain (ERT2) that requires tamoxifen or 4-Hydroxytamoxifen (4-OHT) for activity14. Six days after transduction, both cell lines were treated with 4-OHT to induce Cre recombination. After three days of 4-OHT treatment, genomic DNA was isolated to verify presence or absence of the artificial intron by amplifying the insertion site by PCR (Figure 3). As expected, Cre recombination and addition of 4-OHT triggered the deletion of the sequence between the two loxP sites (the branch site) in both M ETTL16-intron and CDK4-intron cells.
Genomic DNA from CDK4-intron and M ETTL16-intron cell lines treated with Cre and/or 4-OHT was used for a PCR amplifying the region around the insertion site of the artificial intron. The expected sized of the PCR products are as follows. For CDK4-intron (before recombination):
411bp, CDK4-intron (after recombination): 264bp. For M ETTL16-intron (before recombination): 451bp, M ETTL16-intron (after recombination): 304bp. The PCR fragment size corresponding to the loci after Cre recombination was only observed when cells were transduced with Cre and treated with 4-OHT. Based on this observation, it could be concluded that Cre recombinase faithfully triggered the deletion of the sequence between the loxP sites. This should lead to the inactivation of the intron and should thus abrogate splicing activity.
Having confirmed that recombination is occurring at the genomic locus, we next wanted to verify that this was leading to knock-out of the targeted gene. M ETLL16 is a highly essential gene in HAP1 cells and absence of this protein can be easily assessed by measuring cell viability. To this end, equal numbers of cells for METTL16-intron and CDK4-intron were plated in six-well dishes following Cre-E T2 transduction and either treated or not treated with 4- OHT for a total of 6 days. Cells were then stained with crystal violet to assess total cell number as an approximation of cell viability (Figure 4). One can clearly observe that M ETTL16-intron in the presence of Cre and 4-OHT shows reduced viability when compared to other conditions. This suggests that METTL16 gene is no longer being expressed due to the deletion of the synthetic intron. As expected, CDK4-intron shows no changes in viability since this gene is known to be non-essential in HAP1. This highlights that the impact of the cassette is specific and likely to be a consequence of METTL16 inactivation.
We show here that the cassette we developed can be inserted into a reporter gene (NanoLuc) and insertion does not interfere with NanoLuc expression. The cassette contains parallel loxP sites and removal of the sequences between the loxP sites triggers a ~30fold decrease in NanoLuc expression. These findings were confirmed by inserting the cassette in endogenous gene in HAP1 cells. Cell lines containing this synthetic intron show recombination at the genomic level in the presence of Cre and show the expected phenotype upon knock-out of the endogenous gene. This suggests that the cassette is functional and able to disrupt gene expression upon transduction with Cre recombinase.
Addition of Cre recombinase (e.g. by addition of a recombinant adenovirus bearing Cre recombinase) leads to the deletion of critical elements within the intron that are required to maintain splicing, most importantly: the branch site. As a consequence, the cassette will no longer be removed and the ribosome will run into a stop codon, thus disrupting the expression of the gene of interest upon addition of Cre recombinase. The resulting cell line contains a frameshift and thus represents a complete gene knockout. EXAM PLE 2:
Materials and Methods
Cassette sequences
Five different cassettes (Variants 1-5 here, referred to as Versions 1-5 above) were tested with varying locations of the loxP sites within the synthetic intron sequences. For NanoLuc trials, a 'recombined' version of each cassettes, with the predicted sequence after treatment with Cre recombinase, was tested alongside. Table 2: Cassette sequences of variants 1-5
gRNA sequences
The gRNAs in Table 3 were used to insert the cassette. Table 3: gRNAs used to insert the cassette
NanoLuc assay HAP1 or HEK293 cells were transfected with a pcDNA3.1 plasmid encoding NanoLuc or NanoLuc containing artificial intron variants (see above) and pGL4.53[luc2/PGK] vector (Promega) for normalization. Cells were harvested 24h-48h post transfection and analysed using the Nano-Glo Dual Luciferase Assay System (Promega) according to manufacturer's instructions. Generation of cell lines
HAP1 cells were transfected with TurboFectin transfection reagent (OriGene). In brief, 0.8x10s cells were seeded in a 6-well plate and transfected on the following day with l^g of Cas9 expression plasmid (#48137 from Addgene) , ^g of U6 driven gRNA expression plasmid (Horizon Discovery), O^g of PCR product encoding the donor and O^g of plasmid encoding a blasticidin resistance gene. 24h post transfection, cells were treated with 20μg/ml of blasticidin for 24h to eliminate untransfected cells. Single clones were obtained by limiting dilution. Positive clones were then identified by PCR screening. iPS cells cultured in TeSR-E8 and Vitronectin XF (both STEMCELL Technologies) were transfected with the P3 Primary Cell solution using the CM-138 pulse in the 4D-Nucleofector (Lonza). In brief, 1.0x10s cells were transfected with 2^g of Cas9 expression plasmid, 2^g of gRNA expression plasmid, 2^g of plasmid encoding the donor and O^g of plasmid
encoding a blasticidin resistance gene. Cells were plated in culture media supplemented with ΙΟμΜ ROCKi (Y-27632, Abeam) for 24h, and then treated with 10 g/ml of blasticidin for 24h to eliminate untransfected cells. Recovered colonies were single cell plated after treatment with StemPro Accutase (Thermo Fisher Scientific) and clones were manually picked into 96wells for screening.
Genomic DNA isolation and PCR
Genomic DNA was isolated using the QIAamp DNA Mini Kit (Qjagen) according to manufacturer's instructions. PCRs where done with the primers below using the GoTaq DNA Polymerase (Promega) following manufacturer's instructions. The primers in Table 4 were used for PCR.
Table 4: Primers used for PCR
RNA isolation, cDNA synthesis and qRT-PCR
Total RNA was isolated from cells using the RNeasy mini kit (QIAGEN) in accordance with manufacturer's protocol. For RNA from Hapl cells, 1 μg of total RNA was reverse transcribed using oligo dT primer and M-MLV Reverse Transcriptase (Promega). The cDNA was then used as a template for PCR using the primers listed in the table below. For RNA from iPSCs, 1 μg of total RNA was reverse transcribed into cDNA using the Superscript III First-Strand Synthesis SuperMix kit (Thermo Fisher Scientific). 10 ng of cDNA was used for qRT-PCR reactions that were set up in triplicates using either TaqMan Gene Expression Master Mix (Thermo Fisher
Scientific). TaqMan gene expression assays (Thermo Fisher Scientific) were used for each gene analysed. qRT-PCR experiments were performed using QuantStudio 6 Flex Real Time PCR System (Thermo Fisher Scientific). All values were normalized to an internal control (GAPDH) and brought to power of 2. Each target's value was then compared to the highest value of that target across all samples. All samples were done in technical triplicates and the median of these is displayed. The following TaqMan gene expression assays were used: Gapdh
(Hs03929097_gl), Oct4 (Hs00999632_gl), Nanog (Hs04399610_gl), Sox2 (Hs01053049_sl), Cdx2 (Hs01078080_ml), Eomes (Hs00172872_ml) (all from Thermo Fisher Scientific).
The primers in Table 5 were used for RT-PCR: Table 5: Primers for RT-PCR
Retroviral production and transduction of cells
Approximately lxlO7 HEK392T cells were transfected with 1.7 μg pAdvantage plasmid (Promega), 2.6 μg VSV-G expression vector, 4 μg gag-pol expression vector and 6.6 μg CreERT2 expression vector, using TurboFectin transfection reagent (OriGene). Medium was changed 24 hours after transfection. Viral supernatant was collected 48 and 72 hours after transfection, and used to transduce the M ETTL16-DECAI cells. Since the Cre-ERT2 retrovirus also encoded a PGK-PuroR cassette, cells containing Cre could be enriched for by selecting with 0.3μg/ml puromycin. FACS, Western Blotting and Immunocytochemistry
For the HAP1 experiments, cells were trypsinized and washed with PBS. Cells were stained with a CD46-specific antibody (APC-CD46; catalogue #564253; BD Biosciences) in FACS buffer (5% FCS in PBS) for 30 minutes and excess antibody was removed by washing with PBS. Cells were analysed by flow cytometry using the BDLSR Fortessa.
For the human iPS cell experiments, cells were fixed and permeabilized using the
Cytofix/Cytoperm kit (BD) according to manufacturer's instructions. Cells were then stained using the following directly conjugated antibodies: Alexa Fluor® 488 anti-Oct4 (653706, BioLegend) and PE/Cy7 anti-human SSEA-4 (330420, BioLegend). Cells were analysed using the iQue Screener PLUS (IntelliCyt). Data was analysed using the FlowJo software.
For Western blotting, lysates were prepared using Frackelton buffer (lOm M Tris/HCI pH7.5, 50mM NaCI, 30mM sodium pyrophosphate, 1% Triton X-100, 50mM NaF and protease inhibitors). Lysates were separated on 10% SDS-PAGE and blotted on Nitrocellulose membranes. Membranes were stained with a primary antibody (anti-CD46, abl08307, Abeam) and a secondary antibody (anti-rabbit, 111-035-003, Jackson Immunoresearch Europe) and visualized using ECL reagent (Thermo Fisher Scientific).
For Immunocytochemistry, the PSC 4-Marker Immunocytochemistry Kit (Thermo Fisher Scientific, A24881) was used. Briefly, iPS cells were fixed using 4% PFA for 15min,
permeabilized using 1% Saponin for 15min, and blocked with 3% BSA for 30min (all at room temperature). Primary antibodies used: rabbit anti-Oct4 (A24867) and mouse lgG3 anti-SSEA4 (A24866); secondary antibodies: Alexa Fluor 594 donkey anti-rabbit (A24870) and Alexa Fluor 188 goat anti-mouse lgG3 (A24877). DAPI was used for nuclear staining.
Data availability
All data generated or analysed during this study are included in this published article (and its Supplementary Information files).
Results
To generate conditional gene knockouts, we designed a genetic approach that was based on inserting a small intron cassette into the coding sequence of an exon. Introns were favoured because they are removed by splicing and thus leave the targeted gene intact until the cassette is activated (Figure 5A). To overcome limitations of previous similar approaches, we decided to minimize the size of the intron and to base our design on a very small chimeric intron. Introns contain three elements that are critical for their function: the splice donor, the branch point and the splice acceptor. We reasoned that removal of the branch point by Cre/loxP recombination could cripple the intron, thus triggering the translation of truncated intronic sequence (Figure 5B). Addition of three stop codons (one for each reading frame) would then lead to translational termination of the gene and would thus abrogate gene expression.
Since the intron we used is very small, the precise architecture of the cassette was not straightforward and the two major risks included: (i) cassette modification could disturb intron splicing, hence leaving the cassette in a constant "on-state"; (ii) as the branch point sequence is quite degenerate, nearby sequences may take over branch point function upon its removal. To overcome these difficulties, we established an assay based on NanoLuc reporter gene expression where we could easily test a variety of cassette configurations. A series of NanoLuc expression plasmids were created in which the different cassette designs were inserted into one specific location of the NanoLuc sequence (see sequence details in the Materials and Methods). We also included the respective sequences as they would occur after Cre recombination, thus simulating the Cre/loxP excision event. An ideal cassette leaves NanoLuc expression unaffected (off-state) but triggers a profound decrease upon Cre/loxP
recombination (on-state).
First, we noted that insertion of the intronic sequence without modification (no loxP sites) was well tolerated (Figure 6A, Figure 1). This suggested that the intron was functional in principal and that the site within NanoLuc tolerated an intronic insertion. Next, we assessed the various variants side-by-side (Figure 6A, Figure 1). Of interest, introduction of cassette variant 3 (v3) lead to a strong decrease in NanoLuc expression even in the "off-state" of the cassette, suggesting that this cassette modification was incompatible with proper intron splicing.
Variants 1 and 2 were well tolerated, yet we did not observe any reduction in gene expression upon Cre recombination, suggesting that alternative branch points had taken over. Of the variants tested, variant 4 (Figure 5C) was most fit for purpose in as much as it was well tolerated in the "off-state" and triggered a profound decrease (~14 fold) in NanoLuc expression upon cassette activation.
Splicing is strongly dependent on the sequence context and requires certain specific sequences at the intron/exon junction. Introns of the GT-AG category, such as the one we chose, have a preference for CAG or AAG upstream of the splice donor and G or A
downstream of the splice acceptor16. Hence, the choice of target sites to insert an intron into the coding sequence of an exon should be restricted to CAG-G, CAG-A, AAG-G or AAG-A (where the hyphen denotes the site where the intron is inserted). To test whether all four sites tolerated the insertion of the intron equally, we created eight NanoLuc constructs, targeting eight different intron insertion sites (two for each of the four motifs). Transient overexpression in HEK293 suggests that all of these sites tolerated the insertion of an intron equally (Figure 6B). This suggests that our approach is broadly applicable across the various insertion sites depicted above.
Having established a cassette in which the desired features had been encoded in just 201 nucleotides, we decided to test this cassette in human cells. In order to introduce the intron, we designed and synthesized homology donors of ~lkb in which the cassette had been placed in the middle (leaving ~400bp homology arms on either end; Figure 10). We targeted exons of two human genes: (i) CD46 because it is cell-surface expressed and can be easily detected by FACS; and (ii) M ETTL16 because it is an essential gene and cassette activation should trigger cell death.
We chose the human haploid cell line HAP1 for the ensuing targeting experiments because these cells contain a single copy of every human gene and conditional gene inactivation is thus straightforward17. We chose exons that were at least lOObp from the start codon and within the first 30% of the coding sequence of the gene. Following transfection of HAP1 cells with Cas9, the respective gene-specific gRNA and the homology donor, we obtained clonal cell lines by limiting dilution. Single clones were genotyped by PCR to identify clones in which cassette integration had occurred. Note that we did not select for cassette integration as the conditional KO cassette itself does not contain any selectable marker. In spite of this, we still obtained targeted clones at frequencies of 5% or higher (see Table 6). Of interest, the frequency at which the cassette was incorporated was higher in the essential gene (M ETTL16) than in the non-essential gene (CD46). While this may well be a coincidence, it is plausible to assume higher targeting frequencies for essential genes as other editing events (e.g. the accumulation of indels) may be subject to negative selection.
Table 6
Clonal cell lines bearing the cassette were expanded and Cre recombinase was expressed to trigger cassette activation. For CD46, HAP1 cells were transiently transfected with a plasmid encoding Cre. For M ETTL16, cells were infected with a retrovirus encoding Cre-ERT2 recombinase. Cre-ERT2 was activated by addition of 4-hydroxitamoxifen (4-OHT). Genomic DNA from the resulting cell lines was isolated and analysed by PCR (Figure 7A). Cell lines bearing tagged alleles displayed a band that was shifted in size by ~200bp, corresponding to
the artificial intron that had been integrated at the targeted site. Following transduction with Cre (and activation with 4-OHT), this band was shifted to smaller size by ~150bp,
corresponding to the sequence that had been removed by Cre recombination. This indicates that Cre-mediated recombination faithfully triggers the removal of the sequences between the two parallel loxP sites. We also isolated m RNA from the cell line in which CD46 had been tagged with the artificial intron and analysed the CD46 transcript by RT-PCR (Figure 7B). In the absence of Cre, the intron was efficiently removed in the CD46-DECAI clone and thus not detectable at the mRNA level. Upon addition of Cre, the intron was inactivated and the crippled intronic sequence was incorporated in the m RNA sequence, leading to a shift towards higher molecular weight.
Next, we assessed the impact of cassette activation on gene expression and gene function. We started our analysis with CD46 because the gene is non-essential and its expression can be easily visualized. We compared cells bearing the intron to wild-type cells and noted no difference, neither by Western blotting (Figure 8A) nor by flow cytometry (Figure 11). This indicates that the artificial intron did not disrupt endogenous gene expression in the "off- state". Upon transfection of cells with Cre, CD46-DECAI cells abrogated CD46 expression almost completely, whereas wild-type cells were unaffected (Figure 8A and B). The few remaining CD46-positive cells (~5% as judged by FACS) likely arose from incomplete transfection of cells with Cre recombinase.
We also assessed the impact of the artificial intron on the essential gene M ETTL16.
Transduction of cells with Cre-ERT2 and activation of Cre-ERT2 with 4-OHT triggered cell death in M ETTL16-DECAI cells (Figure 8C, Figure 4 right hand side). We noted that cell death was incomplete (data not shown), suggesting that retroviral Cre-ERT2 expression may have been silenced in a subpopulation of cells. To overcome this silencing, we isolated single cell clones bearing retroviral insertions of Cre-ERT2 and activated Cre-ERT2 by 4-OHT treatment (Figure
12). As expected, some clones showed complete cell death upon 4-OHT treatment, whereas others were almost unaffected. This highlights the need for effective delivery and expression of Cre.
We also wanted to apply our approach in a more physiological setting and decided to test it in human induced pluripotent stem cells (iPSCs). Human iPSCs are pluripotent and their pluripotency depends on the expression of the transcription factor Oct4/POU5Fl18.
Inactivation of Oct4 triggers the collapse of the core transcriptional network leading to rapid loss of pluripotency. This is associated with drastic changes in colony morphology and loss of other pluripotency markers, such as Nanog, Sox2 and SSEA4. We created homology donors for
Oct4/POU5Fl (400bp homology arms flanking the DECAI cassette) and targeted two independent sites in exon 1 (Figure 10). Clonal cell lines were established and genotyped by PCR. Both homozygous and heterozygous cell lines were achieved (Table 5).
We selected two homozygous clones bearing the artificial intron in distinct genomic locations in exon 1 to study in greater detail. Immunofluorescence staining (Figure 9A), flow cytometry analysis (Figure 9B) and RNA expression (Figure 9C) showed that human iPSC colonies bearing the intron expressed similar levels of Oct4 as non-targeted cells , indicating that the intron by itself did not affect Oct4 expression. Likewise, pluripotency was retained as suggested by the SSEA4 staining, and normal expression of pluripotency markers Nanog and Sox2 (Figure 9A-C). Upon recombination with Cre, wildtype cells were essentially unaffected. In contrast, Oct4- DECAI cells lost Oct4 expression, and consequently, SSEA4 (Figure 9A-B), Nanog and Sox2 expression (Figure 9C). They also changed their morphology quite dramatically (Figure 13), indicating that these cells had lost their pluripotency. The induced knockout of Oct4 resulted in differentiation of human iPSCs with a significant increase in transcription of Cdx2 and Eomes (Figure 9C), genes associated with trophoblast and endoderm lineages 1 ,20. In line with these observations, we could detect Cre/loxP recombination at the genomic DNA level (Figure 14). Altogether, this highlights the feasibility of our approach for conditional gene inactivation and suggests its applicability in more physiological settings. Discussion
In the examples described herein, we created a novel small cassette that readily allows the tagging of endogenous genes, poising them for conditional gene inactivation. The system is robust, triggers a profound decrease in gene expression and may be the first approach that can be truly applied at a large scale.
Of note, the work presented here has been inspired by the CRISPR-FLI P approach15. Yet it goes beyond this approach in at least two key aspects: (i) gene expression is disrupted by a series of stop codons, not by revealing a splice acceptor that traps the transcript; (ii) most importantly, the cassette size is considerably smaller (201bp versus at least l,931bp in CRISPR-FLI P) and hence, donor assembly is more straightforward. Additionally, even strong splice acceptors can be leaky depending on the genomic context21 and hence, the impact of the CRISPR-FLIP cassette may vary from gene to gene.
Experiments presented here were performed in HAP1 cells and human iPSCs, but we are confident that this approach can be applied to any mammalian cell line. While engineering in haploid cell lines, such as HAP1, is straightforward with only a single allele to target, most cell
lines are diploid like iPSCs or aneuploid, harbouring multiple chromosome copies. To obtain complete conditional knockouts in such cells, we envisage two possible solutions: (i) complete bi-allelic or multi-allelic knock-in: one obtains a cell line in which both alleles of a given gene of interest have been tagged with the cassette. While this is possible, the efficiency will likely not be very high as bi-allelic tagging is infrequent, (ii) Mono-allelic knock-in with disruption of the open reading frame of the additional alleles: as a compromise, we suggest to screen for clones in which one allele has been tagged and the other allele(s) have been inactivated by Cas9 cleavage following N H EJ repair. Since the donor template is integrated in the middle of a coding exon near the start of a gene, the g NA used to trigger incorporation of the donor will likely lead to indels abrogating gene expression. This is exemplified by our results in iPSCs where clones were recovered with both bi-allelic targeting and mono-allelic knock-in with indels on the second allele. This may not be possible in all instances as some genes may require all alleles to be intact, but for most genes, this is likely to represent an acceptable and tolerable compromise.
The artificial intron approach presented here also has some conceivable shortcomings. First, there is a potential risk that insertion of the artificial intron may dysregulate endogenous splicing events, leading to constitutive gene inactivation. When conducting these studies, we noted one case where insertion of the artificial intron into the CDK4 gene (which is nonessential in HAP1 cells) abrogated CDK4 expression (data not shown). A more detailed analysis of these cells revealed that the intron was successfully removed by splicing, but some erroneous splicing events led to the assembly of a non-functional mRNA that was most likely degraded by non-sense mediated decay. Hence, when targeting non-essential genes, one may need to monitor the impact of cassette integration on endogenous gene expression.
Fortunately, this can easily be assessed by a conventional RT-PCR. Of note, this is less of a problem for essential genes because cells bearing such editing events will not be viable.
Second, our method depends on robust Cre recombinase delivery and expression for activating the cassette to the on-state. Here, we introduced Cre-ERT2 by retroviral transduction for M ETTL16-DECAI and OCT4-DECAI activation. While this is possible, we noticed some shortcomings, mainly: (i) Cre-ERT2 expression can be leaky and already occur in the absence of 4-OHT (data not shown) and (ii) Cre-ERT2 expression can be silenced, especially when Cre-ERT2 is delivered via retroviral infection. A more elegant approach would be to express Cre-ERT2 from a safe-harbour locus such as AAVS1 or ROSA26. This would not only be better defined from a genetic point of view, but also likely to yield more robust and reproducible levels of Cre expression.
While the methods described herein offer an easy and efficient strategy for generating conditional knockouts, some alternative methods exists and are worth mentioning. Recently, several strategies have been developed where modified versions of the CRISPR/Cas9 system are adopted to modulate gene expression. One example is CRISPR interference (CRISPRi), where a catalytically dead Cas9 directed by a guide RNA will suppress expression of the target gene by sterically hindering transcription initiation/ elongation or by the action of a silencing effector domain fused to Cas922,23. While this strategy has the advantage of being reversible, it is difficult to reach complete inhibition of the target gene expression. Yet another method, is the use of inducible CRISPR systems. There are several iterations of these systems ranging from split Cas9 molecules that form a catalytically active molecule upon a certain stimulus24 or control of Cas9 or gRNA expression by inducible promoters25. The efficiency of these methods will depend on the expression level of the gRNA and will likely show a variety of effects across cells in the sample since the editing induced by Cas9 will not happen in a predictable manner. This contrasts with the DECAI method described herein which will lead to a predictable modification and disruption of expression upon Cre recombination.
Another alternative to the genetic approach described herein, is a biochemical approach such as degron tagging. Degrons are drug-regulatable domains that allow one to modulate the abundance of the degron tagged protein by adding or withdrawing a small molecule. Popular degrons include the auxin/ mAI D system9,the HaloPROTAC3/ HaloTag system10 and the HCV protease inhibitor/ SMAShTag system11. At least some of the degrons that are commonly used (such as the mAID system) are much faster in degrading the client protein (with a half-life of minutes9 rather than hours or days) as they lead to the recruitment of E3 ligases that actively degrade the protein population that is present. Additionally, the degron tags have the advantage of allowing reversible and tuneable degradation of the target proteins. However, the use of degrons has two major limitations: (i) the bi-allelic tagging of genes occurs at very low frequency and thus, it is difficult to obtain a cell line or an organism in which all copies of a gene have been conditionally inactivated and (ii) the insertion of foreign sequences within the coding region of a gene could disturb protein folding or protein function. As a consequence, degron tagging can be cumbersome and may not be compatible with high-throughput applications.
With this in mind, the approach described herein is more suitable for serial production of conditional knockouts, whereas degron tagging may be the method of choice if the protein is well understood and inefficient clone recovery is less of a constraint.
In summary, the DECAI method described herein offers a convenient and efficient alternative for generating conditional knockout alleles in mammalian cells.
Mali, P., Esvelt, K. M. & Church, G. M . Cas9 as a versatile tool for engineering biology. Nature methods 10, 957-963, doi: 10.1038/nmeth.2649 (2013).
Mali, P. et al. RNA-guided human genome engineering via Cas9. Science 339, 823-826, doi:10.1126/science. l232033 (2013).
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816-821, doi:10.1126/science.l225829 (2012).
Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819-823, doi:10.1126/science.l231143 (2013).
Dickinson, D. J., Ward, J. D., Reiner, D. J. & Goldstein, B. Engineering the Caenorhabditis elegans genome using Cas9-triggered homologous recombination. Nature methods 10, 1028-1034, doi: 10.1038/nmeth.2641 (2013).
van Overbeek, M. et al. DNA Repair Profiling Reveals Nonrandom Outcomes at Cas9- Mediated Breaks. Mol Cell 63, 633-646, doi: 10.1016/j.molcel.2016.06.037 (2016). Blomen, V. A. et al. Gene essentiality and synthetic lethality in haploid human cells. Science, doi:10.1126/science.aac7557 (2015).
Skarnes, W. C. et al. A conditional knockout resource for the genome-wide study of mouse gene function. Nature 474, 337-342, doi:10.1038/naturel0163 (2011).
Natsume, T., Kiyomitsu, T., Saga, Y. & Kanemaki, M. T. Rapid Protein Depletion in Human Cells by Auxin-lnducible Degron Tagging with Short Homology Donors. Cell Rep 15, 210-218, doi:10.1016/j.celrep.2016.03.001 (2016).
Buckley, D. L. et al. HaloPROTACS: Use of Small Molecule PROTACs to Induce Degradation of HaloTag Fusion Proteins. ACS Chem Biol 10, 1831-1837, doi:10.1021/acschembio.5b00442 (2015).
Chung, H. K. et al. Tunable and reversible drug control of protein production via a self- excising degron. Nat Chem Biol 11, 713-720, doi:10.1038/nchembio.l869 (2015). Economides, A. N. et al. Conditionals by inversion provide a universal method for the generation of conditional alleles. Proceedings of the National Academy of Sciences of the United States of America 110, E3179-3188, doi: 10.1073/pnas. l217812110 (2013). Lackner, D. H. et al. A generic strategy for CRISPR-Cas9-mediated gene tagging. Nature communications 6, 10237, doi:10.1038/ncommsl0237 (2015).
Feil, R., Wagner, J., Metzger, D. & Chambon, P. Regulation of Cre recombinase activity by mutated estrogen receptor ligand-binding domains. Biochem Biophys Res Commun 237, 752-757, doi:10.1006/bbrc.l997.7124 (1997).
Andersson-Rolf, A. et al. One-step generation of conditional and reversible gene knockouts. Nat. Methods 14, 287-289 (2017).
Sibley, C. R., Blazquez, L. & Ule, J. Europe PMC Funders Group Lessons from non- canonical splicing.17, 407-421 (2017).
Essletzbichler, P. et al. Megabase-scale deletion using CRISPR/Cas9 to generate a fully haploid human cell line. Genome Res.24, 2059-2065 (2014).
Kim, J., Chu, J. L, Shen, X. H., Wang, J. L. & Orkin, S. H. An extended transcriptional network for pluripotency of embryonic stem cells (vol 132, pg 1049, 2008). Cell 133, 1290 (2008).
Zafarana, G., Avery, S. R., Avery, K., Moore, H. D. & Andrews, P. W. Specific knockdown of OCT4 in human embryonic stem cells by inducible short hairpin RNA interference. Stem Cells 27, 776-782 (2009).
Chen, Y., Wang, K., Gong, Y. G., Khoo, S. K. & Leach, R. Roles of CDX2 and EOM ES in human induced trophoblast progenitor cells. Biochem. Biophys. Res. Commun.431, 197-202 (2013).
Burckstummer, T. et al. A reversible gene trap collection empowers haploid genetics in human cells. Nat. Methods 10, 965-971 (2013).
Gilbert, L. A. et al. CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442-51 (2013).
Qi, L. S. et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173-83 (2013).
Zetsche, B., Volz, S. E. & Zhang, F. A split-Cas9 architecture for inducible genome editing and transcription modulation. Nat. Biotechnol. 33, 139-142 (2015).
de Solis, C. A., Ho, A., Holehonnur, R. & Ploski, J. E. The Development of a Viral Mediated CRISPR/Cas9 System with Doxycycline Dependent gRNA Expression for Inducible In vitro and In vivo Genome Editing. Front. Mol. Neurosci. 9, 70 (2016).
Claims
1. A method for generating an allele for conditional gene knock-out in a cell
comprising a target gene, the method comprising: introducing an artificial intron sequence into an exon of the target gene, the artificial intron sequence comprising: a) a splice donor sequence;
b) a first nuclease or recombinase site;
c) a branch point sequence;
d) a second nuclease or recombinase site;
e) a splice acceptor sequence; and
a stop codon positioned 5' to or within the first nuclease or recombinase site.
2. A method according to claim 1 comprising introducing into the cell a sequence-specific nuclease that cleaves a sequence within an exon of the target gene thereby introducing the artificial intron sequence into the target gene.
3. The method of claim 1 or claim 2 further comprising the step of introducing or
activating a recombinase or nuclease in the cell thereby excising or disrupting the branch point and abrogating splicing of the artificial intron sequence.
4. The method of any preceding claim wherein the artificial intron sequence comprises two stop codons, more preferably three stop codons to cover all possible reading frames of the gene.
5. The method of claim 4 or claim 5 wherein a series of three or more stop codons are positioned 5' to or within the first nuclease or recombinase site.
6. The method of any preceding claim wherein the artificial intron contains a single branch point.
7. The method of any preceding claim wherein the artificial intron sequence is introduced by homology directed repair (HD ) or non-homologous end joining (NHEJ).
8. The method of any preceding claim wherein the sequence-specific nuclease is an RNA guided nuclease, or a DNA guided nuclease, a Zinc finger nuclease or a TALEN.
9. The method of claim 8, wherein the NA-guided nuclease is Cas9 or an RNA-guided Argonaute.
10. The method of claim 7 wherein the DNA-guided nuclease is a DNA-guided Argonaute.
11. The method of any of claims 2 to 10, wherein the sequence-specific nuclease is
introduced as a protein, mRNA, or cDNA.
12. The method of any preceding claim wherein first and second recombinase sites are loxP sites or FRT sites.
13. The method of claim 12 wherein the first (5') loxP site comprises at least one stop
codon.
14. The method of any preceding claim wherein the first recombinase or nuclease site is positioned adjacent to the splice donor site.
15. The method of any of claims 3 to 14 wherein the recombinase is Cre recombinase or Flp recombinase.
16. The method of any preceding claim wherein the cell is a mammalian cell preferably a HAP1 cell, a human induced pluripotent stem cell, an embryonic stem cell, or a CHO cell or a T-cell.
17. The method of any preceding claim wherein the artificial intron sequence is about 200- 250, 200-300, 200-400, 200-500, 200-600, 200-700 or 200-800 or 200-1000 nucleotides in length.
18. The method of any preceding claim wherein the artificial intron sequence comprises a nucleotide sequence at least 80%, 85%, 90%, 95%, 97%, 98% 99%, or 100% identical to SEQ ID NO: 8 .
19. A cell containing an artificial intron introduced according to the method of any of claims 1 to 18.
20. A cell comprising an artificial intron in an exon of a target gene, the artificial intron having at least one stop codon positioned 5' to or within a recombinase site, and wherein a single branch site is flanked by a pair of recombinase or nuclease sites, the recombinase or nuclease sites being arranged such that the single branch site may be excised or disrupted, thereby abrogating splicing of the artificial intron.
21. A cell according to claim 19 or claim 20 for use in therapy.
22. The method of any of claims 3-18, wherein the artificial intron is introduced into a
population of cells, and wherein the recombinase or nuclease is introduced or activated in a specific subset of cells to inactivate the target gene selectively in this subset of cells.
23. The method of any of claims 1-18, wherein the artificial intron sequence further
comprises a selection gene (e.g. an antibiotic resistance gene such as puromycin) or a marker (e.g. a molecular barcode).
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB1620091.7 | 2016-11-28 | ||
| GBGB1620091.7A GB201620091D0 (en) | 2016-11-28 | 2016-11-28 | Methods for conditional gene knock-out |
| GB1717695.9 | 2017-10-27 | ||
| GBGB1717695.9A GB201717695D0 (en) | 2017-10-27 | 2017-10-27 | Methods for conditional gene knock-out |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2018096356A1 true WO2018096356A1 (en) | 2018-05-31 |
Family
ID=60702828
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/GB2017/053546 Ceased WO2018096356A1 (en) | 2016-11-28 | 2017-11-27 | Methods for conditional gene knock-out |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2018096356A1 (en) |
Cited By (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020003006A3 (en) * | 2018-06-28 | 2020-03-05 | Crispr Therapeutics Ag | Compositions and methods for genomic editing by insertion of donor polynucleotides |
| CN112430550A (en) * | 2020-10-15 | 2021-03-02 | 天津科技大学 | Strain for regulating and controlling synergistic death of protein degradation induced gene and application and method |
| US11236313B2 (en) | 2016-04-13 | 2022-02-01 | Editas Medicine, Inc. | Cas9 fusion molecules, gene editing systems, and methods of use thereof |
| WO2022079082A1 (en) | 2020-10-15 | 2022-04-21 | F. Hoffmann-La Roche Ag | Nucleic acid constructs for simultaneous gene activation |
| US11345932B2 (en) | 2018-05-16 | 2022-05-31 | Synthego Corporation | Methods and systems for guide RNA design and use |
| CN114763559A (en) * | 2021-01-15 | 2022-07-19 | 中国农业大学 | Targeting gene capturing system independent of homologous recombination and application thereof |
| CN114807126A (en) * | 2021-01-22 | 2022-07-29 | 清华大学深圳国际研究生院 | Method for silencing expression of long non-coding RNA and application thereof |
| EP4086350A1 (en) | 2021-05-07 | 2022-11-09 | IMBA-Institut für Molekulare Biotechnologie GmbH | Controlled gene expression methods and means |
| US11597924B2 (en) | 2016-03-25 | 2023-03-07 | Editas Medicine, Inc. | Genome editing systems comprising repair-modulating enzyme molecules and methods of their use |
| US11667911B2 (en) | 2015-09-24 | 2023-06-06 | Editas Medicine, Inc. | Use of exonucleases to improve CRISPR/CAS-mediated genome editing |
| US11680268B2 (en) | 2014-11-07 | 2023-06-20 | Editas Medicine, Inc. | Methods for improving CRISPR/Cas-mediated genome-editing |
| US11866727B2 (en) | 2015-11-06 | 2024-01-09 | Crispr Therapeutics Ag | Materials and methods for treatment of glycogen storage disease type 1A |
| US11866726B2 (en) | 2017-07-14 | 2024-01-09 | Editas Medicine, Inc. | Systems and methods for targeted integration and genome editing and detection thereof using integrated priming sites |
| US12201699B2 (en) | 2014-10-10 | 2025-01-21 | Editas Medicine, Inc. | Compositions and methods for promoting homology directed repair |
| WO2025033445A1 (en) * | 2023-08-07 | 2025-02-13 | 株式会社Logomix | Method for conditionally inactivating gene, and cell therefor |
| US12312606B2 (en) | 2020-10-15 | 2025-05-27 | Hoffmann-La Roche Inc. | Nucleic acid constructs for VA RNA transcription |
| WO2026008633A1 (en) * | 2024-07-02 | 2026-01-08 | Novo Nordisk A/S | Intronic targeted insertion of transgenes |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002040685A2 (en) * | 2000-11-16 | 2002-05-23 | Cornell Research Foundation, Inc. | Vectors for conditional gene inactivation |
| WO2002083909A2 (en) * | 2001-04-17 | 2002-10-24 | Multigen International B.V. | Recombinant nucleotide sequence coding for a gene |
| WO2005001087A2 (en) * | 2003-06-11 | 2005-01-06 | Regeneron Pharmaceuticals, Inc. | Methods of modifying genes in eukaryotic cells |
| WO2006048466A2 (en) * | 2004-11-08 | 2006-05-11 | Genoway | Animal models comprising at least one conditional ko and one conditional ki |
| WO2011059799A1 (en) * | 2009-10-29 | 2011-05-19 | Regeneron Pharmaceuticals, Inc. | Multifunctional alleles |
-
2017
- 2017-11-27 WO PCT/GB2017/053546 patent/WO2018096356A1/en not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002040685A2 (en) * | 2000-11-16 | 2002-05-23 | Cornell Research Foundation, Inc. | Vectors for conditional gene inactivation |
| WO2002083909A2 (en) * | 2001-04-17 | 2002-10-24 | Multigen International B.V. | Recombinant nucleotide sequence coding for a gene |
| WO2005001087A2 (en) * | 2003-06-11 | 2005-01-06 | Regeneron Pharmaceuticals, Inc. | Methods of modifying genes in eukaryotic cells |
| WO2006048466A2 (en) * | 2004-11-08 | 2006-05-11 | Genoway | Animal models comprising at least one conditional ko and one conditional ki |
| WO2006048464A2 (en) * | 2004-11-08 | 2006-05-11 | Genoway | Expressing heterologous genes in a specific manner in certain tissues of transgenic animal models |
| WO2011059799A1 (en) * | 2009-10-29 | 2011-05-19 | Regeneron Pharmaceuticals, Inc. | Multifunctional alleles |
Non-Patent Citations (32)
| Title |
|---|
| A. N. ECONOMIDES ET AL: "Conditionals by inversion provide a universal method for the generation of conditional alleles", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, vol. 110, no. 34, 5 August 2013 (2013-08-05), pages E3179 - E3188, XP055155451, ISSN: 0027-8424, DOI: 10.1073/pnas.1217812110 * |
| AMANDA ANDERSSON-ROLF ET AL.: "Rapid, One-step generation of biallelic conditional gene knockouts", 1 June 2016 (2016-06-01), pages 1 - 17, XP002777980, Retrieved from the Internet <URL:https://www.biorxiv.org/content/biorxiv/early/2016/06/01/056549.full.pdf> [retrieved on 20180206] * |
| ANDERSSON-ROLF, A. ET AL.: "One-step generation of conditional and reversible gene knockouts", NAT. METHODS, vol. 14, 2017, pages 287 - 289 |
| BLOMEN, V. A. ET AL.: "Gene essentiality and synthetic lethality in haploid human cells", SCIENCE, 2015 |
| BUCKLEY, D. L. ET AL.: "HaloPROTACS: Use of Small Molecule PROTACs to Induce Degradation of HaloTag Fusion Proteins", ACS CHEM BIOL, vol. 10, 2015, pages 1831 - 1837, XP002762672 |
| BURCKSTUMMER, T. ET AL.: "A reversible gene trap collection empowers haploid genetics in human cells", NAT. METHODS, vol. 10, 2013, pages 965 - 971, XP055097822, DOI: doi:10.1038/nmeth.2609 |
| CHEN, Y.; WANG, K.; GONG, Y. G.; KHOO, S. K.; LEACH, R.: "Roles of CDX2 and EOMES in human induced trophoblast progenitor cells", BIOCHEM. BIOPHYS. RES. COMMUN., vol. 431, 2013, pages 197 - 202, XP028970948, DOI: doi:10.1016/j.bbrc.2012.12.135 |
| CHUNG, H. K. ET AL.: "Tunable and reversible drug control of protein production via a self-excising degron", NAT CHEM BIOL, vol. 11, 2015, pages 713 - 720, XP055408268, DOI: doi:10.1038/nchembio.1869 |
| CONG, L. ET AL.: "Multiplex genome engineering using CRISPR/Cas systems", SCIENCE, vol. 339, 2013, pages 819 - 823, XP055400719, DOI: doi:10.1126/science.1231143 |
| DE SOLIS, C. A.; HO, A.; HOLEHONNUR, R.; PLOSKI, J. E.: "The Development of a Viral Mediated CRISPR/Cas9 System with Doxycycline Dependent gRNA Expression for Inducible In vitro and In vivo Genome Editing", FRONT. MOL. NEUROSCI., vol. 9, 2016, pages 70, XP055381278, DOI: doi:10.3389/fnmol.2016.00070 |
| DICKINSON, D. J.; WARD, J. D.; REINER, D. J.; GOLDSTEIN, B.: "Engineering the Caenorhabditis elegans genome using Cas9-triggered homologous recombination", NATURE METHODS, vol. 10, 2013, pages 1028 - 1034, XP055340980, DOI: doi:10.1038/nmeth.2641 |
| ECONOMIDES, A. N. ET AL.: "Conditionals by inversion provide a universal method for the generation of conditional alleles", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 110, 2013, pages E3179 - E3188, XP055155451, DOI: doi:10.1073/pnas.1217812110 |
| ESSLETZBICHLER, P. ET AL.: "Megabase-scale deletion using CRISPR/Cas9 to generate a fully haploid human cell line", GENOME RES., vol. 24, 2014, pages 2059 - 2065, XP055159551, DOI: doi:10.1101/gr.177220.114 |
| FEIL, R.; WAGNER, J.; METZGER, D.; CHAMBON, P.: "Regulation of Cre recombinase activity by mutated estrogen receptor ligand-binding domains", BIOCHEM BIOPHYS RES COMMUN, vol. 237, 1997, pages 752 - 757, XP026088101, DOI: doi:10.1006/bbrc.1997.7124 |
| GILBERT, L. A. ET AL.: "CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes", CELL, vol. 154, 2013, pages 442 - 451, XP055321615, DOI: doi:10.1016/j.cell.2013.06.044 |
| GUZZARDO PALOMA M ET AL: "A small cassette enables conditional gene inactivation by CRISPR/Cas9.", SCIENTIFIC REPORTS, vol. 7, no. 1, 16770, 1 December 2017 (2017-12-01), pages 1 - 11, XP002777981, ISSN: 2045-2322 * |
| JINEK, M. ET AL.: "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity", SCIENCE, vol. 337, 2012, pages 816 - 821, XP055299674, DOI: doi:10.1126/science.1225829 |
| KIM, J.; CHU, J. L.; SHEN, X. H.; WANG, J. L.; ORKIN, S. H.: "An extended transcriptional network for pluripotency of embryonic stem cells (vol 132, pg 1049, 2008", CELL, vol. 133, 2008, pages 1290 |
| LACKNER, D. H. ET AL.: "A generic strategy for CRISPR-Cas9-mediated gene tagging", NATURE COMMUNICATIONS, vol. 6, 2015, pages 10237, XP055295901, DOI: doi:10.1038/ncomms10237 |
| MALI, P. ET AL.: "RNA-guided human genome engineering via Cas9", SCIENCE, vol. 339, 2013, pages 823 - 826 |
| MALI, P.; ESVELT, K. M.; CHURCH, G. M.: "Cas9 as a versatile tool for engineering biology", NATURE METHODS, vol. 10, 2013, pages 957 - 963, XP002718606, DOI: doi:10.1038/nmeth.2649 |
| NATSUME, T.; KIYOMITSU, T.; SAGA, Y.; KANEMAKI, M. T.: "Rapid Protein Depletion in Human Cells by Auxin-Inducible Degron Tagging with Short Homology Donors", CELL REP, vol. 15, 2016, pages 210 - 218, XP055373129, DOI: doi:10.1016/j.celrep.2016.03.001 |
| OHSHIMA ET AL., J. MOL. BIOL., vol. 195, 1987, pages 247 - 259 |
| QI, L. S. ET AL.: "Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression", CELL, vol. 152, 2013, pages 1173 - 1183, XP055346792, DOI: doi:10.1016/j.cell.2013.02.022 |
| SEIF ET AL., NUCLEIC ACIDS RES., vol. 6, no. 10, 1979, pages 3387 - 3398 |
| SIBLEY, C. R.; BLAZQUEZ, L.; ULE, J., EUROPE PMC FUNDERS GROUP LESSONS FROM NON-CANONICAL SPLICING, vol. 17, 2017, pages 407 - 421 |
| SKARNES WILLIAM C ET AL: "A conditional knockout resource for the genome-wide study of mouse gene function", NATURE, MACMILLAN JOURNALS LTD., ETC, vol. 474, no. 7351, 1 June 2011 (2011-06-01), pages 337 - 342, XP009157945, ISSN: 0028-0836, [retrieved on 20110615], DOI: 10.1038/NATURE10163 * |
| SKARNES, W. C. ET AL.: "A conditional knockout resource for the genome-wide study of mouse gene function", NATURE, vol. 474, 2011, pages 337 - 342, XP009157945, DOI: doi:10.1038/nature10163 |
| VAN OVERBEEK, M. ET AL.: "DNA Repair Profiling Reveals Nonrandom Outcomes at Cas9-Mediated Breaks", MOL CELL, vol. 63, 2016, pages 633 - 646, XP029690136, DOI: doi:10.1016/j.molcel.2016.06.037 |
| ZAFARANA, G.; AVERY, S. R.; AVERY, K.; MOORE, H. D.; ANDREWS, P. W.: "Specific knockdown of OCT4 in human embryonic stem cells by inducible short hairpin RNA interference", STEM CELLS, vol. 27, 2009, pages 776 - 782 |
| ZETSCHE, B.; VOLZ, S. E.; ZHANG, F.: "A split-Cas9 architecture for inducible genome editing and transcription modulation", NAT. BIOTECHNOL., vol. 33, 2015, pages 139 - 142, XP055227889, DOI: doi:10.1038/nbt.3149 |
| ZHUANG ET AL., PNAS, vol. 86, 1989, pages 2752 - 2756 |
Cited By (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12201699B2 (en) | 2014-10-10 | 2025-01-21 | Editas Medicine, Inc. | Compositions and methods for promoting homology directed repair |
| US11680268B2 (en) | 2014-11-07 | 2023-06-20 | Editas Medicine, Inc. | Methods for improving CRISPR/Cas-mediated genome-editing |
| US11667911B2 (en) | 2015-09-24 | 2023-06-06 | Editas Medicine, Inc. | Use of exonucleases to improve CRISPR/CAS-mediated genome editing |
| US11866727B2 (en) | 2015-11-06 | 2024-01-09 | Crispr Therapeutics Ag | Materials and methods for treatment of glycogen storage disease type 1A |
| US11597924B2 (en) | 2016-03-25 | 2023-03-07 | Editas Medicine, Inc. | Genome editing systems comprising repair-modulating enzyme molecules and methods of their use |
| US12049651B2 (en) | 2016-04-13 | 2024-07-30 | Editas Medicine, Inc. | Cas9 fusion molecules, gene editing systems, and methods of use thereof |
| US11236313B2 (en) | 2016-04-13 | 2022-02-01 | Editas Medicine, Inc. | Cas9 fusion molecules, gene editing systems, and methods of use thereof |
| US11866726B2 (en) | 2017-07-14 | 2024-01-09 | Editas Medicine, Inc. | Systems and methods for targeted integration and genome editing and detection thereof using integrated priming sites |
| US11345932B2 (en) | 2018-05-16 | 2022-05-31 | Synthego Corporation | Methods and systems for guide RNA design and use |
| US11802296B2 (en) | 2018-05-16 | 2023-10-31 | Synthego Corporation | Methods and systems for guide RNA design and use |
| US11697827B2 (en) | 2018-05-16 | 2023-07-11 | Synthego Corporation | Systems and methods for gene modification |
| US11827877B2 (en) | 2018-06-28 | 2023-11-28 | Crispr Therapeutics Ag | Compositions and methods for genomic editing by insertion of donor polynucleotides |
| WO2020003006A3 (en) * | 2018-06-28 | 2020-03-05 | Crispr Therapeutics Ag | Compositions and methods for genomic editing by insertion of donor polynucleotides |
| US11332760B2 (en) | 2018-06-28 | 2022-05-17 | Crispr Therapeutics Ag | Compositions and methods for genomic editing by insertion of donor polynucleotides |
| JP7550057B2 (en) | 2018-06-28 | 2024-09-12 | クリスパー セラピューティクス アーゲー | Compositions and methods for genome editing by insertion of donor polynucleotides |
| JP2021529518A (en) * | 2018-06-28 | 2021-11-04 | クリスパー セラピューティクス アーゲー | Compositions and Methods for Genome Editing by Insertion of Donor polynucleotide |
| CN112430550A (en) * | 2020-10-15 | 2021-03-02 | 天津科技大学 | Strain for regulating and controlling synergistic death of protein degradation induced gene and application and method |
| WO2022079082A1 (en) | 2020-10-15 | 2022-04-21 | F. Hoffmann-La Roche Ag | Nucleic acid constructs for simultaneous gene activation |
| US12312606B2 (en) | 2020-10-15 | 2025-05-27 | Hoffmann-La Roche Inc. | Nucleic acid constructs for VA RNA transcription |
| CN114763559A (en) * | 2021-01-15 | 2022-07-19 | 中国农业大学 | Targeting gene capturing system independent of homologous recombination and application thereof |
| CN114807126A (en) * | 2021-01-22 | 2022-07-29 | 清华大学深圳国际研究生院 | Method for silencing expression of long non-coding RNA and application thereof |
| EP4086350A1 (en) | 2021-05-07 | 2022-11-09 | IMBA-Institut für Molekulare Biotechnologie GmbH | Controlled gene expression methods and means |
| WO2022234086A1 (en) | 2021-05-07 | 2022-11-10 | Imba - Institut Für Molekulare Biotechnologie Gmbh | Controlled gene expression methods and means |
| WO2025033445A1 (en) * | 2023-08-07 | 2025-02-13 | 株式会社Logomix | Method for conditionally inactivating gene, and cell therefor |
| WO2026008633A1 (en) * | 2024-07-02 | 2026-01-08 | Novo Nordisk A/S | Intronic targeted insertion of transgenes |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2018096356A1 (en) | Methods for conditional gene knock-out | |
| Hockemeyer et al. | Efficient targeting of expressed and silent genes in human ESCs and iPSCs using zinc-finger nucleases | |
| US8592211B2 (en) | Enhanced PiggyBac transposon and methods for transposon mutagenesis | |
| Yusa et al. | Generation of transgene-free induced pluripotent mouse stem cells by the piggyBac transposon | |
| US11396664B2 (en) | Replicative transposon system | |
| Bhagwan et al. | Variable expression and silencing of CRISPR-Cas9 targeted transgenes identifies the AAVS1 locus as not an entirely safe harbour | |
| US20190085325A1 (en) | Conditional crispr sgrna expression | |
| JP2019523012A (en) | Methods for genome editing | |
| Guzzardo et al. | A small cassette enables conditional gene inactivation by CRISPR/Cas9 | |
| Salvador-Palomeque et al. | Dynamic methylation of an L1 transduction family during reprogramming and neurodifferentiation | |
| Kesselring et al. | A single amino acid switch converts the Sleeping Beauty transposase into an efficient unidirectional excisionase with utility in stem cell reprogramming | |
| Hindul et al. | Construction of a human hTERT RPE-1 cell line with inducible Cre for editing of endogenous genes | |
| Rozen-Gagnon et al. | A selectable, plasmid-based system to generate CRISPR/Cas9 gene edited and knock-in mosquito cell lines | |
| Pryzhkova et al. | Adaptation of the AID system for stem cell and transgenic mouse research | |
| Manivannan et al. | Targeted integration of single-copy transgenes in Drosophila melanogaster tissue-culture cells using recombination-mediated cassette exchange | |
| McKinley | Employing CRISPR/Cas9 genome engineering to dissect the molecular requirements for mitosis | |
| Shi et al. | Efficient and rapid fluorescent protein knock-in with universal donors in mouse embryonic stem cells | |
| CN111218479A (en) | A method for knocking out transcribable elements by knocking in terminators using gene editing technology | |
| Yamane et al. | Zscan10 is dispensable for maintenance of pluripotency in mouse embryonic stem cells | |
| Yu et al. | Improved site-specific recombinase-based method to produce selectable marker-and vector-backbone-free transgenic cells | |
| US20210010022A1 (en) | Novel nucleic acid construct | |
| Turakainen et al. | Transposition-based method for the rapid generation of gene-targeting vectors to produce Cre/Flp-modifiable conditional knock-out mice | |
| Wagenbach et al. | Production of CRISPR‐Cas9 Transgenic Cell Lines for Knocksideways Studies | |
| Dolatshad et al. | A versatile transgenic allele for mouse overexpression studies | |
| Bersten et al. | Inducible and reversible lentiviral and Recombination Mediated Cassette Exchange (RMCE) systems for controlling gene expression |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17817024 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 17817024 Country of ref document: EP Kind code of ref document: A1 |





