Nothing Special   »   [go: up one dir, main page]

WO2024052353A1 - Proximity inducing system and uses thereof - Google Patents

Proximity inducing system and uses thereof Download PDF

Info

Publication number
WO2024052353A1
WO2024052353A1 PCT/EP2023/074340 EP2023074340W WO2024052353A1 WO 2024052353 A1 WO2024052353 A1 WO 2024052353A1 EP 2023074340 W EP2023074340 W EP 2023074340W WO 2024052353 A1 WO2024052353 A1 WO 2024052353A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
proximity
polypeptide
amino acid
acid sequence
Prior art date
Application number
PCT/EP2023/074340
Other languages
French (fr)
Inventor
Arnaud Gautier
Franck Perez
Sara BOTTONE
Zeyneb Vildan CAKIL
Octave JOLIOT
Original Assignee
Sorbonne Universite
Institut Curie
Centre National De La Recherche Scientifique (Cnrs)
Ecole Normale Superieure
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sorbonne Universite, Institut Curie, Centre National De La Recherche Scientifique (Cnrs), Ecole Normale Superieure filed Critical Sorbonne Universite
Publication of WO2024052353A1 publication Critical patent/WO2024052353A1/en

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/58Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances
    • G01N33/582Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving labelled substances with fluorescent label
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6845Methods of identifying protein-protein interactions in protein mixtures
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07DHETEROCYCLIC COMPOUNDS
    • C07D277/00Heterocyclic compounds containing 1,3-thiazole or hydrogenated 1,3-thiazole rings
    • C07D277/02Heterocyclic compounds containing 1,3-thiazole or hydrogenated 1,3-thiazole rings not condensed with other rings
    • C07D277/20Heterocyclic compounds containing 1,3-thiazole or hydrogenated 1,3-thiazole rings not condensed with other rings having two or three double bonds between ring members or between ring members and non-ring members
    • C07D277/32Heterocyclic compounds containing 1,3-thiazole or hydrogenated 1,3-thiazole rings not condensed with other rings having two or three double bonds between ring members or between ring members and non-ring members with hetero atoms or with carbon atoms having three bonds to hetero atoms with at the most one bond to halogen, e.g. ester or nitrile radicals, directly attached to ring carbon atoms
    • C07D277/36Sulfur atoms

Definitions

  • the present invention relates to the control, and optionally the detection, of the localization of biological molecules.
  • the present invention relates to a proximity inducing system that allows to induce, and optionally detect, proximity between two biological molecules.
  • the present invention also relates to uses of said proximity inducing system.
  • CIP chemically induced proximity
  • CIP systems aim at controlling protein proximity and do not allow the detection of the protein interaction triggered by the chemical inducer of proximity and/or the visualization of the protein localization following proximity induction. Such detection and/or visualization thus usually require the use of an additional reporter, distinct from the interacting dimerization domains and the chemical inducer of proximity.
  • CIP systems usually rely on two dimerization domains of more than 10 kDa each, that might lead to dysfunctional fusion proteins. For example, in rapamycin or rapalogs based CIP, FKBP is about 12 kDa and FRB is about 11 kDa.
  • most CIP systems rely on chemical inducers of proximity that induce the formation of very tightly bound ternary assemblies. Such ternary assemblies are usually too stable to be dissociated by washing away the inducer, preventing the reversal of the recruitment process.
  • a first polypeptide of 114 amino acid residues having a sequence as set forth in SEQ ID NO: 1 and a second polypeptide of 11 amino acid residues having a sequence as set forth in SEQ ID NO: 2 are able to complement and form a ternary assembly with a small synthetic compound which acts as a molecular inducer of proximity.
  • the ternary assembly between the two polypeptide and the small synthetic compound is induced by said small synthetic compound regardless of any initial proximity between the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2.
  • the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2 are not able to complement.
  • the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2 and the molecular inducer of proximity thus form a proximity inducing system, that may be used to control the localization of biological molecules such as proteins.
  • the molecular inducer of proximity may be a fluorogenic chromophore that emits substantial fluorescence only when interacting with the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2.
  • the proximity inducing system may thus advantageously allow detection and/visualization of the ternary assembly without the need for any additional reporter.
  • the Inventors also showed that the ternary assembly between the two polypeptide and the molecular inducer of proximity is extremely rapid and remains stable over a long period of time, unless the molecular inducer of proximity is removed.
  • the proximity inducing system is fully reversible and accordingly, can be used to control the proximity between two proteins with a high temporal resolution through addition and removal of the molecular inducer of proximity.
  • the system described herein may be a relevant tool for biological and biomedical applications requiring the control of the localization, transport, stability and/or function of biological molecules.
  • the present invention relates to a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2; adding
  • the present invention also relates to an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ
  • the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample is of less than about 5 pM.
  • the molecular inducer of proximity is fluorogenic and the method or assay further comprises detecting a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and the fluorogenic molecular inducer of proximity.
  • the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1
  • the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2.
  • the molecular inducer of proximity is a compound of formula (I) or a salt and/or solvate thereof; wherein
  • R 1 , R 2 , R 5 and R 6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • R 3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • R 4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • W represents OH, SH, NHR 7 or NR 7 R 8 , wherein R 7 and R 8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalky
  • X represents O, S or NH
  • Z represents O, S or NH.
  • the assay is for controlling the localization, the transport and/or the function of a biological molecule. In some embodiments, the assay is for inducing the degradation of a biological molecule. In some embodiments, the assay is for detecting the co-occurrence of two biological molecules in the sample.
  • the first and second biological molecules are proteins.
  • the present invention also relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity, wherein: a first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; and a second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly.
  • the concentration of molecular inducer of proximity required to reach half maximal assembly is of less than about 5 pM.
  • the molecular inducer of proximity is fluorogenic.
  • the molecular inducer of proximity is a compound of formula (I) as described above, or a salt and/or solvate thereof.
  • the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1
  • the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2.
  • the present invention also relates to a cell or cell line comprising two biological molecules, wherein: a first biological molecule is coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; a second biological molecule is coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein in presence of a molecular inducer of
  • the concentration of molecular inducer of proximity required to reach half maximal assembly is of less than about 5 pM.
  • the first and second biological molecule are proteins.
  • Alkyl by itself or as part of another group, refers to a hydrocarbyl radical of formula CntCn i wherein n is a number greater than or equal to 1.
  • alkyl groups of this invention comprise from 1 to 12 carbon atoms, preferably from 1 to 6 carbon atoms. Alkyl groups may be linear or branched and may be substituted as indicated herein.
  • Non-limiting examples of alkyl groups include methyl, ethyl, propyl (zz-propyl, z-propyl), butyl (zz-butyl, z-butyl, .s-butyl and /-butyl), pentyl and its isomers (e.g., zz-pentyl, z.w-pentyl), and hexyl and its isomers (e.g., zz-hexyl, z.w-hexyl).
  • alkoxy refers to any -O-alkyl or -O-aryl group. Generally, alkoxy groups of this invention are -O-alkyl groups.
  • Amino refers to -NH2 group.
  • Aryl refers to a polyunsaturated, aromatic hydrocarbyl group having a single ring (z.e., phenyl) or multiple aromatic rings fused together (e.g., naphtyl) or linked covalently, typically containing from 5 to 12 carbon atoms, preferably from 6 to 10 carbon atoms, wherein at least one ring is aromatic.
  • the aromatic ring may optionally include one to two additional rings (either cycloalkyl, heterocycloalkyl or heteroaryl) fused thereto.
  • Aryl is also intended to include the partially hydrogenated derivatives of the carbocyclic systems enumerated herein, as long as at least one ring is aromatic.
  • Nonlimiting examples of aryl include phenyl, biphenyl, biphenylenyl, 5- or 6-tetralinyl, naphthalen-1- or -2-yl, 4-, 5-, 6 or 7-indenyl, 1- 2-, 3-, 4- or 5-acenaphthylenyl, 3-, 4- or 5-acenaphthenyl, 1- or 2-pentalenyl, 4- or 5-indanyl, 5-, 6-, 7- or 8-tetrahydronaphthyl, 1,2,3,4-tetrahydronaphthyl, 1,4-dihydronaphthyl, and 1-, 2-, 3-, 4- or 5-pyrenyl.
  • Carboxy refers to -COOH group.
  • Cyano refers to -CN group.
  • Cycloalkyl refers to a cyclic alkyl group, z.e., a monovalent, saturated, or unsaturated hydrocarbyl group having 1 or 2 cyclic structures. Cycloalkyl includes monocyclic or bicyclic hydrocarbyl groups. Cycloalkyl groups may comprise 3 or more carbon atoms in the ring and generally, cycloalkyl groups of this invention comprise from 3 to 10, preferably from 3 to 8 carbon atoms, more preferably from 3 to 6 carbon atoms.
  • cycloalkyl encompasses polycyclic cycloalkyls (e.g., bicycles) and bridged cycloalkyl structures, including cycles bound together through one atom (“spiro”) or through two atoms.
  • Non-limiting examples of cycloalkyl groups include cyclopropyl, cyclobutyl, cyclopropyl, cyclohexyl, cycoheptyl, cyclooctanyl, cyclononanyl, cyclodecanyl, norbornyl, adamantyl, bicyclo[2.2.2]octanyl, bicyclo[4.4.0]decanyl, bicyclo[3.2.1]octanyl, bicyclo[3.3.1]nonanyl, bicyclo[2.1.
  • Halogen or “halo” means fluoro, chloro, bromo, or iodo. Generally, halo groups of this invention are fluoro, chloro or bromo.
  • Haloalkyl refers to an alkyl group wherein one or more hydrogen atoms are replaced with a halogen atom.
  • Non-limiting examples of haloalkyl groups include chloromethyl, 1 -bromoethyl, fluoromethyl, difluoromethyl, trifluoromethyl and 1,1,1 -trifluoroethyl .
  • Haloalkoxy refers to an alkoxy group wherein one or more hydrogen atoms are replaced with a halogen atom.
  • Non-limiting examples of haloalkoxy groups include chloromethoxy, 1 -bromoethoxy, fluoromethoxy, difluoromethoxy, trifluoromethoxy and 1,1,1 -trifluoroethoxy .
  • Heteroalkyl refers to an alkyl group wherein one or more carbon atoms are replaced by a heteroatom, for example oxygen, nitrogen or sulfur atom, and wherein the resulting heteroalkyl group comprises at least one carbon atom.
  • the heteroatoms are bound along the alkyl chain only to carbon atoms, i.e., each heteroatom is separated from any other heteroatom by at least one carbon atom, typically by at least two carbon atoms.
  • the nitrogen and sulfur heteroatoms may optionally be oxidized and the nitrogen heteroatoms may optionally be quaternized e.g., sulfur may be oxidized as SO or SO2).
  • at least two carbon atoms are replaced by a heteroatom.
  • the heteroalkyl is bound to another group or molecule through a carbon atom, i.e., the binding atom is not selected among the heteroatoms included therein.
  • the heteroalkyl is bound to another group or molecule through one of the heteroatoms included therein.
  • an heteroalkyl may be substituted either through a carbon atom or through a heteroatom (e.g. , nitrogen), unless otherwise specified.
  • Non-limiting examples of heteroalkyl include alkoxy, ethers and poly ethers (e.g, polyethylene glycol), secondary and tertiary amines and polyamines, thioethers and polythioethers, and combinations thereof.
  • Heterocycloalkyl refers to a cyclic heteroalkyl group, typically comprising from 2 to 15 carbon atoms, preferably from 2 to 11 carbon atoms, more preferably from 2 to 7 carbon atoms, furthermore preferably from 2 to 6 carbon atoms. Heterocycloalkyl groups are typically 3- to 7-membered, preferably 5- or 6-membered. Heterocycloalkyl are typically monocyclic or bicyclic, preferably monocyclic. This definition encompasses polycyclic heterocycloalkyls (e.g., bicycles) and bridged heterocycloalkyl structures, including cycles bound together through one atom (“spiro”) or through two atoms.
  • the heterocycloalkyl is bound to another group or molecule through a carbon atom, z.e., the binding atom is not selected among the heteroatoms included therein. In one embodiment, the heterocycloalkyl is bound to another group or molecule through one of the heteroatoms included therein. When substituted by one or more other group(s), an heterocycloalkyl may be substituted either through a carbon atom or through a heteroatom (e.g., nitrogen), unless otherwise specified.
  • a heteroatom e.g., nitrogen
  • heterocycloalkyl examples include aziridine, pyrrolidine, piperidine, piperazine, morpholine, thiomorpholine, azepane, azocane, octahydro-777-isoindole, decahydroisoquinoline, tetrahydrofuran, tetrahydropyran, tetrahydroisoquinoline
  • Haldroxyl refers to -OH group.
  • Niro refers to -NO2 group.
  • Solvate refers to a molecular complex comprising a compound of the invention and contains stoichiometric or sub-stoichiometric amounts of one or more pharmaceutically acceptable solvent molecule such as ethanol.
  • solvent molecule such as ethanol.
  • hydrate refers to when said solvent is water.
  • an element means one element or more than one element.
  • amino acid refers to both natural and synthetic amino acids, and both D- and L-amino acids. They are represented by their full name, their three-letter code or their one-letter code as well-known in the art.
  • Standard amino acid or “naturally occurring amino acid” means any of the twenty standard L-amino acids commonly found in naturally occurring polypeptides.
  • Non-standard amino acid means any amino acid, other than the standard amino acids, regardless of whether it is prepared synthetically or derived from a natural source. For example, naphtlylalanine can be substituted for tryptophan to facilitate synthesis.
  • amino acids that can be substituted include, but are not limited to, L-hydroxypropyl, L-3,4- dihydroxyphenylalanyl, alpha-amino acids such as L-alpha-hydroxylysyl and D-alpha- methylalanyl, L-alpha-m ethylalanyl, beta-amino acids, and isoquinolyl.
  • the polypeptides as described herein may comprise standard amino acids or non-standard amino acids.
  • amino acid also encompasses chemically modified amino acids, including, but not limited to, salts, amino acid derivatives (such as amides), and substitutions.
  • Amino acids contained within the polypeptides and fusion proteins as described herein, and particularly at the carboxy- or amino-terminus, can thus be modified by methylation, amidation, acetylation or substitution with other chemical groups. Additionally, a disulfide linkage may be present or absent in the polypeptides as described herein.
  • polypeptide mimetics encompassed herein include polypeptides or fusion proteins as described herein having the following modifications: i) polypeptides wherein one or more of the peptidyl -C(O)NR- linkages (bonds) have been replaced by a non-peptidyl linkage such as a -CJfc-carbamate linkage (-CH2OC(O)NR-), a phosphonate linkage, a -CH2-sulfonamide (-CH2-S(O)2NR-) linkage, a urea (-NHC(O)NH-) linkage, a -CH2-secondary amine linkage, or with an alkylated peptidyl linkage (-C(O)NR-) wherein R is C1-C4 alkyl; ii) polypeptides wherein the N-terminus is derivatized to a -NRR 1 group, to a -NRC(O)R group, to a -NRC(O
  • Cell line refers to cells obtained from the proliferation of a single cell and therefore forming a uniform population of cells comprising an identical genetic material.
  • “Complement” or “complementation” with reference to the first and second polypeptides as described herein refers to the capacity of said two polypeptides to assemble together to reconstitute a scaffold to which the molecular inducer of proximity as described herein can bind.
  • CIP system refers to a chemically induced proximity system.
  • emission refers to emission, as in emission wavelength.
  • ex refers to excitation, as in excitation wavelength.
  • Fluorogenic chromophore or “fluorogen” refer to a chromophore, the brightness of which can be significantly enhanced by an environmental change.
  • a fluorogenic chromophore or fluorogen is substantially non-fluore scent in solution under its free form, but brightens up when placed into an environment constraining its conformation and excluding deexcitation of its excited state by non-radiative pathways.
  • the fluorogenic chromophore is almost invisible in solution and becomes fluorescent upon binding of a protein scaffold, such as the one formed by the two polypeptides described herein.
  • Identity when used in a relationship between the sequences of two or more polypeptides or of two or more nucleic acids, refers to the degree of sequence relatedness between polypeptides or nucleic acids (respectively), as determined by the number of matches between strings of two or more amino acid residues or of two or more nucleotides, respectively. “Identity” measures the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (z.e., “algorithms”). Identity of related polypeptides or nucleic acid sequences can be readily calculated by known methods. Such methods include for example those described in Arthur M.
  • Preferred methods for determining identity are designed to give the largest match between the sequences tested. Methods of determining identity are described in publicly available computer programs. Examples of computer program methods for determining identity between two sequences include the GCG program package, including GAP (Devereux et al., 1984. Nucl. Acid. Res. 12(1 Pt l):387-395; Genetics Computer Group, University of Wisconsin Biotechnology Center, Madison, WI), BLASTP, BLASTN, TBLASTN and FASTA (Altschul et al., 1990. J. Mol. Biol. 215(3):403-410). The BLASTX program is publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul et al. NCB/NLM/NIH Bethesda, Md. 20894; Altschul et al., 1990. J. Mol. Biol. 215(3):403-410). The well-known Smith Waterman algorithm may also be used to determine identity.
  • Nucleic acid or “nucleic acid molecule” refers to a polymer of nucleotides covalently linked by phosphodiester bonds, such as deoxyribonucleic acids (DNAs) or ribonucleic acids (RNAs), in either single- or double-stranded form.
  • DNAs deoxyribonucleic acids
  • RNAs ribonucleic acids
  • nucleic acid thus encompasses single-stranded, partially double-stranded, and fully double-stranded nucleic acids. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides.
  • nucleic acid sequence or “nucleotide sequence” refers to a contiguous sequence of nucleotides in a single nucleic acid. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, orthologs, SNPs (single-nucleotide polymorphisms), and complementary sequences as well as the sequence explicitly indicated.
  • degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)).
  • a nucleotide sequence encoding an amino acid sequence includes all nucleotide sequences that are degenerate versions of each other and that encode the same amino acid sequence.
  • nucleotide sequence that encodes a protein or an RNA may also include introns to the extent that the nucleotide sequence encoding the protein or RNA may in some version contain at least one intron.
  • a particular nucleic acid sequence described herein implicitly comprises its corresponding complementary sequence. It should be noted that a particular nucleic acid sequence described herein implicitly comprises the DNA sequence and the corresponding RNA sequence.
  • “Pharmaceutically acceptable excipient” or “pharmaceutically acceptable carrier” refers to an excipient or carrier that does not produce an adverse, allergic or other untoward reaction when administered to a subject, such as an animal, in particular a mammal, preferably a human. It includes any and all solvents, such as, for example, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents.
  • a pharmaceutically acceptable excipient or carrier refers to a non-toxic solid, semi-solid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type.
  • Reporter protein refers to a protein that may be detected, localized or quantified as a way to indirectly assess a target of interest or a mechanism of interest.
  • sample refers to a specimen or small quantity of material, in particular of biological material, generally solid or liquid. “Sample” may thus also refer to cells or tissues or organisms of interest.
  • Vector refers to a vehicle by which a nucleotide sequence (e.g., a DNA sequence or an RNA sequence), for example a nucleotide sequence encoding a polypeptide or protein can be introduced into a host cell, so as to transform, transfect or transduce the host cell and promote replication and/or expression (e.g., transcription and/or translation) of the introduced nucleotide sequence.
  • a nucleotide sequence e.g., a DNA sequence or an RNA sequence
  • a nucleotide sequence encoding a polypeptide or protein for example a nucleotide sequence encoding a polypeptide or protein can be introduced into a host cell, so as to transform, transfect or transduce the host cell and promote replication and/or expression (e.g., transcription and/or translation) of the introduced nucleotide sequence.
  • “Expression vector” refers to a vector comprising regulatory elements (or regulatory sequences) operatively linked to a nucleotide sequence to be expressed.
  • An expression vector thus comprises sufficient cis-acting regulatory elements for controlling the expression of the nucleotide sequence; other elements that may be required for controlling the expression of the nucleotide sequence of interest may be supplied by a host cell.
  • One object of the invention is a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein.
  • the present invention relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein, wherein the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity form a ternary assembly.
  • the present invention relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein, wherein the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
  • the expression “the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly” encompasses the formation of a ternary assembly between a truncated fragment of the first polypeptide as described herein, a truncated fragment of the second polypeptide as described herein and the molecular inducer of proximity as described herein.
  • the first polypeptide as described herein and the second polypeptide as described herein are able to complement and bind the molecular inducer of proximity as described herein to form a ternary assembly.
  • the presence of the molecular inducer of proximity triggers the complementation of the first and the second polypeptides (or truncated fragments thereof) and their binding to the molecular inducer of proximity independently of any initial proximity between the first and the second polypeptides.
  • a preexisting proximity between the first and the second polypeptides (or truncated fragments thereof) is not required for the molecular inducer of proximity to be able to trigger the complementation of the first and the second polypeptides (or truncated fragments thereof) and their binding to the molecular inducer of proximity.
  • the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity thus form a ternary assembly independently of any initial proximity between the first and the second polypeptides (or truncated fragments thereof).
  • the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity thus form a ternary assembly in the absence of any preexisting proximity between the first and the second polypeptides (or truncated fragments thereof).
  • proximity it is meant a distance shorter than about 20 nm, preferably shorter than about 10 nm, and more preferably shorter than about 5 nm.
  • the first polypeptide as described herein, or a truncated fragment thereof is not able on its own to bind the molecular inducer of proximity as described herein.
  • the second polypeptide as described herein, or a truncated fragment thereof is not able on its own to bind the molecular inducer of proximity as described herein.
  • the first polypeptide as described herein and the second polypeptide as described herein (or truncated fragments thereof) are not able to complement in the absence of the molecular inducer of proximity as described herein.
  • the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is rapid.
  • rapid it is meant that a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is formed in less than about 90 seconds.
  • rapid it is meant that half maximal assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is reached in less than about 90 seconds, preferably in less than about 60 seconds, more preferably in less than about 45 second, even more preferably in less than about 30 seconds.
  • the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is stable.
  • the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is reversible.
  • the concentration of molecular inducer of proximity required to reach half maximal assembly means the concentration of molecular inducer of proximity required to reach 50% of the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity.
  • Methods to evaluate, assess, or quantify the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity are well-known in the art. Such methods may notably rely on the evaluation, assessment, or quantification of the fluorescence emitted by a fluorogenic molecular inducer of proximity upon binding to the first and second polypeptides, and include spectrofluorimetry, flow cytometry analysis, and fluorescence microscopy (including laser scanning- and spinning-disk based confocal microscopy, multiphoton-microscopy, super-resolution microscopy).
  • the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
  • the first polypeptide as described herein comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30%, 35%, 40%, 45%, 50%, or 55%, preferably at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1.
  • the first polypeptide as described herein comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1.
  • sequences having at least about 60% identity with SEQ ID NO: 1 include SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
  • sequence having at least about 60% identity with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
  • sequences having at least about 65% identity with SEQ ID NO: 1 include SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
  • the sequence having at least about 65% with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
  • sequences having at least about 70% identity with SEQ ID NO: 1 include SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12.
  • sequence having at least about 70% identity with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12.
  • the first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1.
  • the polypeptide consisting of an amino acid sequence as set forth in SEQ ID NO: 1 is referred to as “ FIRE mate”.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, or five of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
  • V valine
  • L leucine
  • T threonine
  • A alanine
  • T threonine
  • T threonine
  • L leucine
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises two, three, four, five, or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and a leucine (L) at position 109.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, or eight of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
  • N asparagine
  • E glutamic acid
  • R arginine
  • V valine
  • L leucine
  • T threonine
  • A th
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises two, three, four, five, six, seven, eight, or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
  • N asparagine
  • E glutamic acid
  • R arginine
  • V valine
  • L leucine
  • T threonine
  • A threon
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and a leucine (L) at position 109.
  • an asparagine (N) at position 17 a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises a glycine (G) at position 69 with reference to SEQ ID NO: 1.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises at least one, two, three, four, five, or six of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and/or a glycine (G) at position 101.
  • W tryptophan
  • I isoleucine
  • P proline
  • T threonine
  • S serine
  • R arginine
  • G glycine
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101.
  • W tryptophan
  • I isoleucine
  • P proline
  • T threonine
  • S serine
  • R arginine
  • G glycine
  • amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
  • a glycine (G) at position 69 a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and at least one, two, three, four, or five of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
  • V valine
  • L leucine
  • A threonine
  • T threonine
  • T threonine
  • amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
  • a glycine (G) at position 69 a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and
  • amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
  • amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
  • a glycine (G) at position 69 a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S)
  • V valine
  • L
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, a glycine (G) at position 101, and
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, or sixteen of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine residues at the indicated positions with
  • the truncated fragment of the first polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end of the first polypeptide as described herein, preferably the deletion of a number of consecutive amino acid residues ranging from 1 amino acid residue to 40, 35, 30, or 25 amino acid residues from the N-terminal end of the first polypeptide as described herein.
  • the truncated fragment of the first polypeptide as described herein is truncated of the amino acid residues from position 1 to at most position 25 of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
  • the truncated fragment of the first polypeptide as described herein is truncated of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 consecutive amino acid residue(s) from the N-terminal end (also referred to as N-ter) of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
  • the truncated fragment of the first polypeptide as described herein comprises at least 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
  • the truncated fragment of the first polypeptide as described herein comprises 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, or 113 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
  • the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30%, 35%, 40%, 45%, 50%, or 55%, preferably at least about 60%, 65% 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2.
  • the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2.
  • sequences having at least about 60% identity with SEQ ID NO: 2 include SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • the sequence having at least about 60% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • sequences having at least about 70% identity with SEQ ID NO: 2 include SEQ ID NO: 15, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • amino acid sequence having at least about 70% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 15, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • sequences having at least about 80% identity with SEQ ID NO: 2 include SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • amino acid sequence having at least about 80% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
  • the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2.
  • FIRE tag the polypeptide consisting of an amino acid sequence as set forth in SEQ ID NO: 2 is referred to as “ FIRE tag”.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein comprises the following amino acid residue at the indicated position with reference to SEQ ID NO: 2: an arginine (R) at position 3.
  • the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein further comprises the following amino acid residue at the indicated position with reference to SEQ ID NO: 2: an isoleucine (I) at position 8.
  • the truncated fragment of the second polypeptide as described herein comprises at least 8 consecutive amino acid residues of the second polypeptide as described herein. In some embodiments, the truncated fragment of the second polypeptide as described herein comprises 8, 9, or 10 consecutive amino acid residues of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein.
  • the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end and/or the C-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues.
  • the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues. In some embodiments, the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the C-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues.
  • the truncated fragment of the second polypeptide as described herein comprises at least 8, 9, or 10 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein.
  • the truncated fragment of the second polypeptide as described herein comprises 8, 9, or 10 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein.
  • the second polypeptide as described herein, or a truncated fragment thereof as described herein is fused to a 113-amino acid fragment of a photoactive yellow protein (PYP) polypeptide having an amino acid sequence as set forth in SEQ ID NO: 21 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99%, or more identity with the amino acid sequence as set forth in SEQ ID NO: 21, or to a truncated fragment thereof.
  • PYP photoactive yellow protein
  • Said truncated fragment may comprise at least 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 21 or of the sequence having at least about 60, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 21 as described herein.
  • Said truncated fragment may thus comprise 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 21 or of the sequence having at least about 60, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 21 as described herein
  • the molecular inducer of proximity is a compound of formula (I) or a salt and/or solvate thereof; wherein R 1 , R 2 , R 5 and R 6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • R 3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • R 4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • W represents OH, SH, NHR 7 or NR 7 R 8 , wherein R 7 and R 8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
  • X represents O, S or NH
  • Z represents O, S or NH.
  • the alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl groups may be saturated or unsaturated. In some embodiments, the alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl groups are saturated.
  • R 1 represents H, alkyl or alkoxy.
  • R 2 represents H, alkyl or alkoxy.
  • R 3 represents H.
  • R 4 represents a double bound terminated by S, O or NH.
  • R 5 represents H
  • R 6 represents H, alkyl or alkoxy.
  • W represents OH.
  • Z represents O.
  • the molecular inducer of proximity is selected from the group comprising or consisting of: (Z)-5-(4-hydroxybenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HBR); (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HMBR); (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DOM); (Z)-5-(4-hydroxy-3-methoxybenzylidene)-2- thioxo-l,3-thiazolidin-4-one (HBR-3OM); (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DM); (Z)-5-(4-hydroxy-2,5-d
  • R 1 , R 2 and R 6 each independently represents H, alkyl or alkoxy
  • X, Y and Y each independently represents O, S or NH.
  • R 1 represents H, methyl (Me) or methoxy (OMe).
  • R 2 represents H, methyl or methoxy.
  • R 6 represents H, methyl or methoxy.
  • R 1 , R 2 and R 6 respectively represent: (Me, H and H), (H, Me and Me), (Me, H and Me), (OMe, H and OMe) or (H, OMe and OMe).
  • X, Y and Y respectively represent: (S, S and O), (S, O and S), (S, NH and O), (S, O and O) or (O, O and O).
  • R 1 , R 2 , R 6 , X, Y and Y respectively represent: (Me, H, H, S, S and O), (H, Me, Me, S, O and S), (Me, H, Me, S, NH and O), (OMe, H, OMe, S, O and O) or (H, OMe, OMe, O, O and O).
  • the molecular inducer of proximity is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3-methylbenzylidene)-4-thioxo-l,3- thiazolidin-2-one (HBIR-3M), (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(
  • the molecular inducer of proximity is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-
  • the molecular inducer of proximity is (Z)-5-(4-hydroxy- 2,5-dimethylbenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DM)
  • the molecular inducer of proximity is (Z)-5-(4-hydroxy- 3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR)
  • the molecular inducer of proximity is (Z)-5-(4-hydroxy- 3-methylbenzylidene)-4-thi oxo-1, 3-thiazolidin-2-one (HBIR-3M)
  • the molecular inducer of proximity is a chromophore.
  • the molecular inducer of proximity is non-fluorogenic. In some embodiments, the molecular inducer of proximity is a non-fluorogenic chromophore. Examples of non-fluorogenic molecular inducers of proximity include (Z)- 5-(4-hydroxy-3-methylbenzylidene)-4-thi oxo-1, 3-thiazolidin-2-one (HBIR-3M).
  • the molecular inducer of proximity is fluorogenic. In some embodiments, the molecular inducer of proximity is a fluorogenic chromophore.
  • fluorogenic molecular inducers of proximity include (Z)-5-(4-hydroxy-2,5- dimethylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy- 3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3,5- dimethylbenzylidene)-2-thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5- dimethoxybenzylidene)-2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(4-hydroxy- 3-methoxybenzylidene)-2-
  • the fluorogenic chromophore is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(4-hydroxy-3-methoxybenzylidene)- 2-thioxo-l,3-thiazolidin-4-one (HBR-3OM), and (Z)-5-(
  • the fluorogenic chromophore is (Z)-5-(4-hydroxy-2,5- dimethylbenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DM) and (Z)-5-(4- hydroxy-3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR).
  • Another object of the invention is a biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the biological molecule comprising the first and/or second polypeptide, or a truncated fragment thereof is a biological molecule to which the first and/or second polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently.
  • Methods for non-covalently attaching a peptide or polypeptide to a biological molecule as described herein are well-known and include, for example, chemical coupling.
  • Methods for covalently attaching a peptide or polypeptide to a biological molecule as described herein are well-known and include, for example, chemical coupling or genetic fusion as described herein.
  • biological molecule encompasses any molecule present in a living organism.
  • biological molecules include amino acids, polypeptides, proteins, monosaccharides, polysaccharides, nucleotides, nucleic acids, lipids, fatty acids, glycolipids, sterols, vitamins, hormones, neurotransmitters, and metabolites.
  • the biological molecule may be a nucleic acid or a protein.
  • nucleic acids include DNA and RNA such as messenger RNA (mRNA), transfer RNA (tRNA).
  • proteins include transcription factors, enzymes, receptors, immunoreceptors, immunoglobulins, signaling proteins.
  • the biological molecule is a protein.
  • Another object of the invention is thus a fusion protein comprising a protein of interest fused to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the first and second polypeptide, or truncated fragments thereof may each be expressed in fusion with any protein of interest within a host cell by inserting (for example via transformation or transfection) a nucleotide sequence which encodes both the polypeptide and the protein of interest.
  • Methods for fusing a peptide or polypeptide to a protein of interest are well- known. Briefly, such methods comprise inserting the nucleotide sequence encoding a protein of interest in frame with the nucleic sequence encoding the peptide(s) or polypeptide(s).
  • the nucleic sequence encoding the protein of interest can be inserted so that the encoded peptide or polypeptide is situated at the N-terminal end of the protein of interest or at the C-terminal end of the protein of interest, or internally, as desired.
  • a short nucleic sequence encoding a linker or spacer may be present between the sequences coding for the peptide or polypeptide and for the protein of interest.
  • the protein of interest may be a natural protein, a chimeric protein resulting from the fusion of various protein domains, or a synthetic protein.
  • the protein of interest may be an intracellular protein, a membrane protein, a cell surface protein present at least in part at the extra membranous surface, or a secreted protein.
  • the protein of interest may be a transcription factor, an enzyme, a receptor, an immunoreceptor, an immunoglobulin, or a signaling protein.
  • the protein of interest is a reporter protein.
  • Reporter proteins allow to detect, assess and/or monitor a variation in a sample. Examples of variation that can be detected, assessed and/or monitored using reporter proteins include cell signaling, gene expression, metabolite or analyte concentration, cell-cell interaction, cell movement, cell death, intracellular transport, secretion, cell-cycle phase, and circadian rhythm.
  • the fusion protein as described herein comprises at least one additional element other than the first and/or second polypeptide as described herein, or truncated fragments thereof as described herein, and the protein of interest.
  • additional elements include linkers, targeting signals, localization signals, protease target sites, fluorescent proteins (such as fluorescent protein tags), antibody crystallizable fragments (Fc), enzymes.
  • the fusion protein as described herein comprises a linker.
  • Methods for designing or selecting a linker are well-known to one skilled in the art.
  • Another object of the invention is a kit comprising a biological molecule as described herein, such as a fusion protein as described herein, and a molecular inducer of proximity as described herein.
  • Another object of the invention is a pair of biological molecules as described herein, wherein one biological molecule comprises the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the present invention relates to a pair of biological molecules as described herein, with one biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
  • the pair of biological molecules consists of one biological molecule to which the first polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently; and another biological molecule to which the second polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently.
  • the pair of biological molecules may consist of two distinct biological molecules.
  • the pair of biological molecules may consist of two biological molecules of different structure, such as a nucleic acid and a protein.
  • the pair of biological molecules may consist of two distinct biological molecules of the same structure, such as two nucleic acids or two proteins.
  • the pair of biological molecules may consist of two copies of the same biological molecule (such as two copies of the same protein), with one copy comprising the first polypeptide, or a truncated fragment thereof, and the other comprising the second polypeptide, or a truncated fragment thereof.
  • the pair of biological molecules is a pair of fusion proteins as described herein, with one fusion protein comprising a protein of interest fused to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other comprising a protein of interest fused to the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • both fusion proteins of the pair comprise the same protein of interest.
  • one fusion protein comprises the first polypeptide, or a truncated fragment thereof, and a protein of interest
  • the other fusion protein comprises the second polypeptide, or a truncated fragment thereof, and the same protein of interest.
  • each fusion protein of the pair comprises a distinct protein of interest.
  • one fusion protein comprises the first polypeptide, or a truncated fragment thereof, and a first protein of interest
  • the other fusion protein comprises the second polypeptide, or a truncated fragment thereof, and a second, different, protein of interest.
  • At least one of the fusion proteins of the pair comprises a protein of interest being a reporter protein.
  • both fusion proteins of the pair comprise a protein of interest being a reporter protein, either the same reporter protein or different reporter proteins.
  • At least one of the fusion proteins of the pair comprises an additional element as described herein, such as a linker, a targeting signal, a localization signal, a protease target site, an antibody crystallizable fragment (Fc), a fluorescent protein (such as a fluorescent protein tag), or an enzyme.
  • an additional element such as a linker, a targeting signal, a localization signal, a protease target site, an antibody crystallizable fragment (Fc), a fluorescent protein (such as a fluorescent protein tag), or an enzyme.
  • Another object of the invention is a kit comprising a pair of biological molecules as described herein, such as a pair of fusion proteins as described herein, and a molecular inducer of proximity as described herein.
  • nucleic acid molecule comprising or consisting of the nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • nucleic acid molecule include DNA molecules and RNA molecules.
  • DNA molecules include cDNA molecules (complementary DNA).
  • RNA molecules include mRNA molecules (messenger RNA).
  • nucleotide sequence encoding the first and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein it is meant all nucleotide sequences that are degenerate versions of each other and that encode the same first and/or second polypeptide, or a truncated fragment thereof.
  • One skilled in the art is familiar with methods for adapting a coding sequence on the basis of the genetic code, such as methods making use of codon degeneracy to introduce silent mutations and methods taking into account codon usage bias and variation of the standard genetic code relevant to the host cell considered.
  • the nucleic acid molecule comprises or consists of a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein.
  • An example of nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1 is a nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 22.
  • the nucleic acid molecule comprises or consists of a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22.
  • the nucleic acid molecule comprises or consists of a nucleotide sequence encoding the second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein.
  • nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2 is a nucleotide sequence comprising or consisting of as set forth in SEQ ID NO: 23.
  • the nucleic acid molecule comprises or consists of a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
  • the nucleic acid molecule comprises or consists of a nucleotide sequence encoding a fusion protein as described herein.
  • Another object of the invention is a kit comprising the nucleic acid molecule as described herein, and a molecular inducer of proximity as described herein.
  • Another object of the invention is a pair of nucleic acid molecules as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the pair of nucleic acid molecules consists of one nucleic acid molecule comprising or consisting of a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein; and another nucleic acid molecule comprising or consisting of a nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein.
  • the pair of nucleic acid molecules consists of one nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and another nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
  • the pair of nucleic acid molecules consists of two nucleic acid molecules comprising or consisting of nucleotides sequences encoding a fusion protein as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding a fusion protein comprising the first polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding a second fusion protein comprising the second polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein.
  • Another object of the invention is a kit comprising the pair of nucleic acid molecule as described herein, and a molecular inducer of proximity as described herein.
  • Another object of the invention is a vector, in particular an expression vector, comprising a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • vectors include, without being limited to, plasmids, viral vectors, artificial chromosomes, liposomes, and lipid nanoparticles.
  • the vector is a nucleic acid molecule as described herein.
  • Example of such vectors include plasmids, artificial chromosomes.
  • the vector as described herein comprises: a first nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein; and a second nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the vector as described herein comprises: a first nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1 or a truncated fragment thereof as described herein; and a second nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2 or a truncated fragment thereof as described herein.
  • the vector as described herein comprises: a first nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and a second nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
  • the vector may comprise the nucleotide sequence(s) required for the fusion of the nucleic sequence encoding a protein of interest in frame with the nucleic sequence encoding the first polypeptide as described herein and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the vector as described herein comprises a nucleotide sequence encoding at least one fusion protein as described herein.
  • Another obj ect of the invention is a kit comprising the vector as described herein, and a molecular inducer of proximity as described herein.
  • Another object of the invention is a pair of vectors as described herein, wherein one vector comprises a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other vector comprises a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the pair of vectors consists of one vector comprising a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein; and another vector comprising a nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein.
  • the pair of vectors consists of one vector comprising a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and another vector comprising a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
  • the pair of vectors consists of two vectors each comprising a nucleotide sequence encoding a fusion protein as described herein, wherein one vector comprises a nucleotide sequence encoding a fusion protein comprising the first polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein, and the other vector comprises a nucleotide sequence encoding a second fusion protein comprising the second polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein.
  • Another obj ect of the invention is a kit comprising the pair of vectors as described herein, and a molecular inducer of proximity as described herein.
  • Another object of the invention is a cell or a cell line comprising or expressing the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line described herein comprises or expresses both the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • Another object of the invention is a cell or a cell line comprising or expressing a nucleic acid molecule comprising or consisting of the nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line comprises or expresses a pair of nucleic acid molecules as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line as described herein may be a genetically modified cell or cell line, that is to say a cell or cell line genetically modified to express the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line may be a prokaryotic cell or cell line, or a eukaryotic cell or cell line.
  • the cell or cell line may be a bacterium cell or cell line, an archaea cell or cell line, a yeast cell or cell line, a plant cell or cell line, or an animal cell or cell line.
  • the cell or cell line may be a mammal cell or cell line.
  • the cell or cell line may be a human cell or cell line.
  • the cell or cell line may be a primary cell or cell line, in particular a human primary cell or cell line.
  • the cell or cell line may be an immortalized cell or cell line, in particular a human immortalized cell or cell line.
  • the cell or cell line may be an immune cell or cell line, in particular a human immune cell or cell line.
  • the cell or cell line as described herein comprises a vector as described herein. In some embodiments, the cell or cell line as described herein comprises a pair of vectors as described herein.
  • the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein may be comprised within a biological molecule as described herein.
  • the first and/or second polypeptide, or a truncated fragment thereof may be coupled or attached, either covalently or non-covalently, to a biological molecule.
  • the cell or cell line as described herein comprises or expresses at least one biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line as described herein comprises or expresses at least one fusion protein comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line as described herein comprises or expresses a pair of biological molecules as described herein.
  • the cell or cell line as described herein comprises or expresses a first biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line as described herein comprises or expresses two biological molecules, wherein: a first biological molecule is coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; a second biological molecule is coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
  • the cell or cell line as described herein comprises or expresses a pair of fusion proteins as described herein.
  • the cell or cell line as described herein comprises or expresses a first fusion protein comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second fusion protein comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the cell or cell line as described herein comprises or expresses two fusion proteins as described herein, wherein: a first fusion protein comprises the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a protein of interest as described herein; a second fusion protein comprises the second polypeptide as described herein, or a truncated fragment thereof as described herein, and a protein of interest as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
  • Another object of the invention is a kit comprising the cell or cell line as described herein, and a molecular inducer of proximity as described herein.
  • At least one of the fusion proteins comprised within or expressed by the cell or cell line as described herein comprises a reporter protein.
  • both fusion proteins comprised within or expressed by the cell or cell line as described herein comprise a reporter protein.
  • the cell or cell line as described herein is a biosensor.
  • biosensor » it is meant that the cell or cell line as described herein may be used to assess and/or monitor a target of interest, a physiological mechanism or a biological process of interest.
  • targets of interest include analytes or metabolites.
  • the cell as described herein may thus be used to assess and/or monitor the presence in the cell of an analyte or metabolite.
  • physiological mechanisms or biological processes of interest include apoptosis, cell cycle, viral infection, intracellular protein trafficking and transport.
  • the cell or cell line as described herein is a biosensor of proximity.
  • biosensor of proximity » it is meant that the cell or cell line as described herein may be used to assess and/or monitor the intracellular localization and/or transport of a biological molecule, in particular a protein, such as a cargo protein.
  • At least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a therapeutic effector.
  • at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a fusion protein comprising a protein of interest which is a therapeutic effector.
  • therapeutic effector it is meant a biological molecule, in particular a protein, able to induce a therapeutic effect upon administration to a subject.
  • the therapeutic effector may be a protein selected from the group comprising or consisting of transcription factors, enzymes, receptors, immunoreceptors, immunoglobulins, and signaling proteins.
  • the therapeutic effector may be a signaling protein such as, for example, a chimeric antigen receptor (CAR).
  • CAR chimeric antigen receptor
  • the therapeutic effector may be a transcription factor.
  • the activity and/or function of the therapeutic effector is controlled through chemically induced proximity, chemically induced localization, chemically induced transport, chemically induced dimerization, and/or chemically induced degradation.
  • At least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a safety switch.
  • at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a fusion protein comprising a protein of interest which is a safety switch.
  • safety switch it is meant a biological molecule, in particular a protein, able to control the activity and/or viability of a cell, for example by controlling gene expression, and/or by inducing cell death (apoptosis).
  • the safety switch may be a protein selected from the group comprising or consisting of transcription factors and enzymes.
  • the safety switch may be an enzyme such as, for example, a caspase.
  • the safety switch may be a transcription factor.
  • the activity and/or function of the safety switch is controlled through chemically induced proximity, chemically induced localization, chemically induced transport, chemically induced dimerization, and/or chemically induced degradation.
  • Another object of the present invention is a pharmaceutical composition
  • a pharmaceutical composition comprising or consisting of any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein; and optionally at least one pharmaceutically acceptable excipient or pharmaceutically acceptable carrier.
  • the pharmaceutical composition as described herein further comprises a molecular inducer of proximity as described herein.
  • Another obj ect of the present invention is a medicament comprising or consisting of any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein.
  • the medicament as described herein further comprises a molecular inducer of proximity a described herein.
  • kits-of-parts comprising or consisting of: a first part comprising any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein; and a second part comprising a molecular inducer of proximity a described herein.
  • Another object of the present invention is any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, a cell or cell line as described herein, a kit comprising any one of the above, and a molecular inducer of proximity as described herein, a pharmaceutical composition as described herein, or a kit-of-parts as described herein, for use as a medicament.
  • any one of: a cell or cell line as described herein, a pharmaceutical composition as described herein comprising said cell or cell line, a medicament as described herein comprising said cell or cell line, or a kit-of-parts as described herein comprising said cell or cell line is for use as a cell therapy or as a cellular therapy.
  • any one of: a cell or cell line as described herein, a pharmaceutical composition as described herein comprising said cell or cell line, a medicament as described herein comprising said cell or cell line, or a kit-of-parts as described herein comprising said cell or cell line is for use for in vivo transplantation in a subject in need thereof.
  • Another object of the invention is a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
  • the present invention relates to a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: coupling or attaching a first biological molecule to the first polypeptide as described herein, or a truncated fragment thereof as described herein; coupling or attaching a second biological molecule to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: expressing in the sample a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; expressing in the sample a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the two biological molecules may be two distinct biological molecules.
  • the two biological molecules may be two biological molecules of different structure, such as a nucleic acid and a protein.
  • the two biological molecules may be two distinct biological molecules of the same structure, such as two nucleic acids or two proteins.
  • the two biological molecules may be two copies of the same biological molecule (such as two copies of the same protein), with one copy coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other copy coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the biological molecules may be selected from the group comprising or consisting of amino acids, polypeptides, proteins, monosaccharides, polysaccharides, nucleotides, nucleic acids, lipids, fatty acids, glycolipids, sterols, vitamins, hormones, neurotransmitters, and metabolites.
  • the biological molecules are proteins or nucleic acids. In some embodiments, the biological molecules are proteins.
  • the biological molecules coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein are fusion proteins comprising said first or second polypeptide, or a truncated fragment thereof, and a protein of interest as described herein.
  • the two fusion proteins may comprise the same protein of interest or different proteins of interest.
  • the biological molecules are non-covalently coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the biological molecules are covalently coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the biological molecules are proteins, and said proteins are fused to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein.
  • the method is for inducing proximity in a sample between a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, said method comprising: adding a molecular inducer of proximity as described herein to the sample, wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the sample may be a biological sample.
  • the sample may be an organism.
  • Organisms considered herein are for instance model organisms used in biomedical research such as, for example, bacterium, yeast, fruit fly, nematode, zebrafish, mouse, rat, guinea pig, rabbit, and dog.
  • Other organisms considered herein are plant organisms, such as Arabidopsis thaliana.
  • Tobacco for example Nicotiana labaciim
  • wheat for example Triticum aestivum, Triticum durum, Triticum compactum
  • rice for example Oryza sativa
  • the sample may be a cell, a cell line, or a cell culture. Examples of cells include, bacteria cells, archaea cells, yeast cells, plant cells, animal cells.
  • Example of animal cells include mammal cells, in particular human cells.
  • the cells may be primary cells.
  • the cells may be immortalized cells.
  • the cells in the sample may be alive.
  • the cells in the sample may be fixed for microscopy and imaging.
  • the organism or the cells in the sample, or a sub set thereof is/are genetically modified to express the first and second polypeptides as described herein, or truncated fragments thereof as described herein, attached to the biological molecules as described herein, preferably proteins of interest.
  • the method is a method for inducing proximity between two biological molecules in a cell sample, said method comprising the following steps: expressing a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, in the cells of the sample; expressing a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, in the cells of the sample; adding a molecular inducer of proximity as described herein to the cell sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly in the cell sample, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules expressed in the cells of the sample.
  • the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity may be intracellular, extracellular or localized at the membrane of the cells.
  • the first biological molecule and/or the second biological molecule expressed in the cells of the sample may be secreted by the cells of the sample or may be localized at the membrane of the cells of the sample.
  • the method is a method for inducing proximity between two biological molecules in an organism, such as a plant organism or a non-human model organism, said method comprising the following steps: expressing a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, in the organism; expressing a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, in the organism; administering a molecular inducer of proximity as described herein to the organism; wherein, upon administration of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules expressed in the organism.
  • One skilled in the art is familiar with methods allowing to genetically modify organisms or cells in a culture in order for them to express a biological molecule of interest. Examples of such methods include transfection, electroporation, injection and transgenesis of nucleic acid molecules, such as a nucleic acid molecules as described herein. Said methods also include transplantation, injection and co-culture of cells modified to express proteins of interest.
  • the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: fusing a first biological molecule, preferably a protein, to the first polypeptide as described herein, or a truncated fragment thereof as described herein; fusing a second biological molecule, preferably a protein, to the second polypeptide as described herein, or a truncated fragment thereof as described herein; expressing both biological molecules in the sample, adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the molecular inducer of proximity as described herein is added in the sample at a concentration of less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, in the methods as described herein, the molecular inducer of proximity as described herein is added in the sample at a concentration ranging from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
  • the molecular inducer of proximity may be non-fluorogenic.
  • the molecular inducer of proximity may be fluorogenic.
  • the method may thus further comprise a step of detecting and/or visualizing a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method may thus allow to detect and/or visualize the localization of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method may further comprise a step of monitoring over time a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method may thus allow to monitor over time the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • fluorescence tomography One skilled in the art is familiar with the techniques available to detect fluorescence in a sample and is able to choose among those depending on the sample, the excitation and emission spectra of the fluorogenic compound(s) and the desired readout. Examples of such techniques include direct observation, fluorescence spectroscopy, flow cytometry, fluorescence microscopy (including laser scanning- and spinning-disk based confocal microscopy, multiphoton-microscopy, super-resolution microscopy), and fluorescence tomography.
  • the method may further comprise a step of assessing and/or quantifying a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method may thus allow to assess and/or quantify the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method comprises a step of detecting a variation of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method may thus comprise a step of detecting an increase or a decrease of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the method further allows to measure the amount of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity relative to a reference value by measuring fluorescence intensity.
  • said reference value corresponds to the fluorescence intensity value in a control sample.
  • said reference value corresponds to the fluorescence intensity value(s) at a different location(s) and/or time point(s) in the same sample.
  • Another object of the invention is an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
  • the present invention relates to an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the assay comprises the following steps: coupling or attaching a first biological molecule to the first polypeptide as described herein, or a truncated fragment thereof as described herein; coupling or attaching a second biological molecule to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the assay comprises the following steps: expressing in the sample a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; expressing in the sample a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the assay relies on the induction of proximity in a sample between a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, and the assay comprises: adding a molecular inducer of proximity as described herein to the sample, wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
  • the assay as described herein may be used for controlling the localization, transport, stability, degradation and/or function of a biological molecule or of a pair of biological molecules as described herein.
  • the assay as described herein may be used for controlling in a cell the localization, transport, stability, degradation and/or function of a protein or of a pair of proteins as described herein.
  • the protein(s) may be fusion protein(s) comprising the first or second polypeptide as described herein, or a truncated fragment as described herein, and a protein of interest as described herein.
  • the assay as described herein is for controlling the localization, transport and/or function of a biological molecule or of a pair of biological molecules.
  • the assay as described herein is for controlling the localization of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins.
  • the assay may be used for controlling in a cell the localization of a cytosolic protein, in particular a cytosolic fusion protein, by inducing proximity between said protein and a protein, in particular a fusion protein, localized in a cellular compartment, such as for example an organelle.
  • the assay as described herein is for controlling the transport of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins.
  • the assay may be used for controlling the nucleocytoplasmic trafficking of a protein in a cell by inducing proximity between a protein, in particular a fusion protein, comprising a nuclear localization signal (NLS) and a cytosolic protein, in particular a cytosolic fusion protein, such as a protein comprising a nuclear export signal (NES).
  • NLS nuclear localization signal
  • NES nuclear export signal
  • the assay as described herein is for controlling the function of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins.
  • the assay as described herein is for controlling the stability of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. In some embodiments, the assay as described herein is for inducing the degradation of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. [0242] In some embodiments, the assay as described herein is for detecting in the sample the co-occurrence of two biological molecules, in particular of two proteins. For example, the assay may be used for detecting in a cell the co-occurrence of two biological molecules, in particular of two proteins, in a cellular compartment, such as an organelle.
  • the molecular inducer of proximity as described herein is added to the sample at a concentration of less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, in the assays as described herein, the molecular inducer of proximity as described herein is added in the sample at a concentration ranging from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
  • the molecular inducer of proximity may be non-fluorogenic.
  • the molecular inducer of proximity may be fluorogenic.
  • the assay may further comprise a step of detecting and/or visualizing a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may thus allow to detect and/or visualize the localization of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may further comprise a step of monitoring over time a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may thus allow to monitor over time the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may further comprise a step of assessing and/or quantifying a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may thus allow to assess and/or quantify the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay comprises a step of detecting a variation of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay may thus comprise a step of detecting an increase or a decrease of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
  • the assay further comprises measuring the amount of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity relative to a reference value by measuring fluorescence intensity.
  • said reference value corresponds to the fluorescence intensity value in a control sample.
  • said reference value corresponds to the fluorescence intensity value(s) at a different location(s) and/or time point(s) in the same sample.
  • Figure l is a schematic representation of the working principle of the proximity inducing system as described herein.
  • the system relies on the coupling (for example through genetic fusion) of two proteins to two distinct polypeptides, called FIRE mate (i.e., polypeptide of SEQ ID NO: 1) and FIRE tag (z.e., polypeptide of SEQ ID NO: 2).
  • FIRE mate i.e., polypeptide of SEQ ID NO: 1
  • FIRE tag z.e., polypeptide of SEQ ID NO: 2
  • These two polypeptides can interact in a specific manner only in presence of a small synthetic molecule (z.e., a molecular inducer of proximity called the “match”) that stabilizes the interaction between the two polypeptides.
  • the match may be a fluorogenic chromophore, such as HBR-2,5DM or HMBR, that becomes fluorescent (z.e., “bright match” also referred to as “FIRE fluorescence” or “FIRE signal”) specifically upon formation of the ternary assembly consisting of the two polypeptides and the match (so-called “FIRE” ternary assembly).
  • a fluorogenic chromophore such as HBR-2,5DM or HMBR
  • HMBR z.e., “bright match” also referred to as “FIRE fluorescence” or “FIRE signal” specifically upon formation of the ternary assembly consisting of the two polypeptides and the match (so-called “FIRE” ternary assembly).
  • FIGS 2A and 2B are graphs illustrating the fluorogen-dependent interaction between the two polypeptides EIRE mate and EIRE tag.
  • the graphs show the normalized average fluorescence detected from about 50,000 HEK293T cells co-expressing the FK506-binding protein (FKBP) fused to EIRE tag and the FKBP-rapamycin-binding domain of mammalian target of rapamycin (FRB) fused to EIRE mate treated without (triangles) or with (squares) 500 nM of rapamycin, and with 1, 5, 10, 25 or 50 pM of HBR-2,5DM (2A) or HMBR (2B).
  • Data represent the mean values ⁇ SD of three independent experiments.
  • Figures 3A-E illustrate the recruitment of cytoplasmic proteins to mitochondria with the proximity inducing system described herein.
  • Figure 3A is a schematic representation of the experimental design wherein HeLa cells co-expressing mCherry- EIRE tag and Tom20-ECFP- EIRE mate were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy.
  • Figures 3B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 3B) or HMBR (match2 - 3C).
  • FIG. 3D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 3D) or HMBR (match2 - 3E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 4A-D illustrate the recruitment over time of cytoplasmic proteins to mitochondria with the proximity inducing system described herein.
  • (4A, C) HeLa cells co-expressing mCherry- FIRE tag and Tom20-ECFP- FIRE mate were treated with 10 pM HBR-2,5DM and imaged by time-lapse confocal microscopy at 1 image per 5 seconds.
  • (4B, D) U2OS cells co-expressing mCherry- EIRE tag and Tom20-ECFP- EIRE mate were treated with 10 pM HBR-2,5DM and imaged by time-lapse confocal microscopy at 1 image every 2 minutes.
  • Figure 4A is a representative time-lapse of the HeLa cells from the addition of HBR-2,5DM at 0 sec (ex/em 488/508-570 nm). Experiment was repeated 3 times with similar results. Scale bars are 20 pm.
  • Figure 4B is a representative time-lapse of the U2OS cells from the addition ofHBR-2,5DM at 0 min (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiment was repeated 3 times with similar results. Scale bars are 20 pm.
  • FIGS 5A-D demonstrate that the EIRE tag positioning does not influence the proximity inducing system described herein.
  • HeLa cells co-expressing either EIRE tag-mCherry (5A,C) or mCherry- EIRE tag-mCherry (5B, D) and Tom20-ECFP- EIRE mate were treated with 10 pM HBR-2,5DM (matchl) and imaged by time-lapse confocal microscopy.
  • Figures 5A-B are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 6A-E illustrate the recruitment of cytoplasmic proteins to the Golgi apparatus with the proximity inducing system described herein.
  • Figure 6A is a representation of the experimental design wherein HeLa cells co-expressing mCherry- FIRE tag and FIRE mate-ECFP-Giantin were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy.
  • Figures 6B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 6B) or HMBR (match2 - 6C).
  • FIG. 6D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 6D) or HMBR (match2 - 6E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 7A-E illustrate the recruitment of cytoplasmic proteins to the endoplasmic reticulum with the proximity inducing system described herein.
  • Figure 7A is a representation of the experimental design wherein HeLa cells co-expressing mCherry - EIRE tag and EIRE mate-ECFP-Cb5 were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy.
  • Figures 7B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 7C) or HMBR (match2 - 7D).
  • FIG. 7D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 7D) or HMBR (match2 - 7E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 8A-E illustrate the recruitment of cytoplasmic proteins to the plasma membrane with the proximity inducing system described herein.
  • Figure 8A is a representation of the experimental design wherein HEK293 cells co-expressing mCherry- FIRE tag and Lynl l-ECFP- FIRE mate were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy.
  • Figures 8B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 8B) or HMBR (match2 - 8C).
  • FIG. 8D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 8D) or HMBR (match2 - 8E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 9A-D demonstrate the reversibility of the proximity inducing system described herein and the possibility to induce non-fluorogenic recruitment.
  • HeLa cells co-expressing mCherry- EIRE tag and EIRE mate-ECFP-Giantin were treated with 10 pM HBR-2,5DM (matchl), washed with HBR-2,5DM-free medium and then treated with 10 pM HBIR-3M.
  • Cells were imaged by time-lapse spinning-disk confocal microscopy.
  • Figures 9A-D are sets of representative confocal micrographs of untreated cells (9A), cells after addition of HBR-2,5DM (matchl) (9B), cells after washout of HBR-2,5DM (matchl) (9C), and cells after addition of HBIR-3M (9D).
  • FIG. 10 is a graph illustrating the fluorogen-dependent interaction between caged EIRE tag and EIRE mate.
  • the graph shows the normalized average fluorescence detected from about 50,000 HEK293T cells co-expressing the FK506-binding protein (FKBP) fused to caged EIRE tag and the FKBP-rapamycin-binding domain of mammalian target of rapamycin (FRB) fused to EIRE mate treated without or with 500 nM of rapamycin, and with 1, 5, 10, 25 or 50 pM of HBR-2,5DM.
  • FKBP FK506-binding protein
  • FB mammalian target of rapamycin
  • Figures 11A-C illustrate the use of a caged proximity inducing system as described herein.
  • Figure 11A is a representation of the experimental design wherein HeLa cells co-expressing mCherry-caged FIRE tag and FIRE mate-ECFP-Giantin were treated with 10 pM of match (z.e., HBR-2,5DM), and imaged by time-lapse confocal microscopy.
  • Figure 11C is a set of representative confocal micrographs of cells before (0 min) and after (9 min) addition of HBR-2,5DM (match 1) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
  • Figures 12A-B demonstrate that the proximity inducing system described herein is orthogonal to the FRB/FKBP/rapamycin CIP system.
  • Figure 12A is a representation of the experimental design wherein HeLa cells co-expressing mCherry-FKBP- EIRE tag, TOM20- EIRE mate and FRB-ECFP-Giantin were treated with 10 pM of match (z.e., HBR-2,5DM) and 500 nM rapamycin, and imaged by time-lapse confocal microscopy.
  • 10 pM of match z.e., HBR-2,5DM
  • Figure 12B is a set of representative confocal micrographs of cells before and after addition of HBR-2,5DM (match 1) and rapamycin (rap) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Scale bars are 20 pm.
  • Figures 13A-D illustrate the nuclear export of proteins mediated by the proximity inducing system described herein.
  • Figure 13A is a representation of the experimental design wherein HeLa cell co-expressing NLS-mCherry- EIRE mate and NES- ECFP- EIRE tag were treated with 10 pM of match (i.e., HBR-2,5DM) and imaged by timelapse confocal microscopy.
  • Figure 13D is a set of representative confocal micrographs before (0 min) and after (4 min) treatment with HBR-2,5DM (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Scale bars are 20 pm.
  • Synthetic oligonucleotides used for cloning were purchased from Integrated DNA technology. PCR reactions were performed with Q5 polymerase (New England Biolabs) in the buffer provided. PCR products were purified using QIAquick PCR purification kit (Qiagen). DNAse I, T4 ligase, Fusion polymerase, Taq ligase and Taq exonuclease were purchased from New England Biolabs and used with accompanying buffers and according to the manufacturer’s protocols. Isothermal assemblies (Gibson assembly) were performed using homemade mix prepared according to previously described protocols (Gibson et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods.
  • HMBR and HBR-2,5DM were previously reported (Plamont etal., Small fluorescence-activating and absorption-shifting tag for tunable protein imaging in vivo. Proc Natl Acad Sci U S A. 2016 Jan 19; 113(3):497-502) and Li et al.. Dynamic multicolor protein labeling in living cells. Chem Sci. 2017 Aug l;8(8):5598-5605).
  • HeLa cells were cultured in Minimal Essential Media (MEM) supplemented with phenol red, Glutamax I, 1 mM of sodium pyruvate, 1% (vol/vol) of non-essential amino-acids and 10% (vol/vol) fetal calf serum (FCS), at 37 °C in a 5% CO2 atmosphere.
  • HEK 293T cells (ATCC CRL-3216) were cultured in Dulbecco's Modified Eagle Medium (DMEM) supplemented with phenol red and 10% (vol/vol) FCS at 37 °C in a 5% CO2 atmosphere.
  • DMEM Dulbecco's Modified Eagle Medium
  • U2OS cells (ATCC HTB-96) were cultured in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with phenol red and 10% (vol/vol) FCS and 1% (vol/vol) penicillin-streptomycin at 37°C in a 5% CO2 atmosphere.
  • DMEM Dulbecco’s modified Eagle’s medium
  • FCS phenol red
  • FCS phenol red
  • FCS 1% (vol/vol) penicillin-streptomycin
  • pDish IBIDI Biovalley coated with poly-L-lysine.
  • Cells were transiently transfected using Genejuice (Merck) according to the manufacturer’s protocols for 24 hours prior to imaging.
  • DPBS Dulbecco’s Phosphate-Buffered Saline
  • DMEM medium without serum and phenol red
  • the compounds e.g., molecular inducer of proximity
  • Flow cytometry in HEK 293T cells was performed on a MACSQuant® analyzer equipped with 405 nm, 488 nm and 635 nm lasers and seven channels.
  • cells were first grown in cell culture flasks, then transiently co-transfected 24 hours after seeding using Genejuice (Merck) according to the manufacturer’s protocol for 24 hours. After 24 hours, cells were centrifuged in PBS-BSA (Phosphate-Buffered Saline with 1 mg/mL bovine serum albumin) and resuspended in PBS-BSA supplemented with the appropriate amounts of compounds (e.g., molecular inducer of proximity).
  • PBS-BSA Phosphate-Buffered Saline with 1 mg/mL bovine serum albumin
  • mTurquoise2 (ex 434 nm/ em 473 nm) and iRFP670 (ex 638 nm / em 660 ⁇ 10 nm) were analyzed with the following parameters: excitation (or ex) 488nm, emission (or em) 525 ⁇ 20 nm.
  • mTurquoise2 and iRFP670 are transfection markers used to select cells expressing the constructs of interest. Data were analyzed using FlowJo vlO.7.1. Fluorescence microscopy
  • confocal micrographs were acquired on a Zeiss LSM 980 Laser Scanning Microscope equipped with a plan apochromat 63x /1.4 NA oil immersion objective. ZEN software was used to collect the data. The images were analyzed using Icy (2.4.0.0) and Fiji (Image J).
  • a ROI region of interest was determined by masking the signal in the ECFP (Enhanced Cyan Fluorescent Protein) channel with the plug-in HK-means with the following parameters: intensity class equals 100, minimum object size (pixels or px) equals 500, maximum object size (px) equals 1500-3000.
  • intensity class equals 100
  • minimum object size (pixels or px) equals 500
  • maximum object size (px) equals 1500-3000.
  • the signal intensity of the ROI was tracked over time for each channel by using plug-in Active Contours. The background signal was subtracted. Data were processed using GraphPad Prism v9.3.0.
  • the two fragments z.e., the two polypeptides of SEQ ID NO: 1 (z.e., EIRE mate) and SEQ ID NO: 2 (z.e., EIRE tag), were surprisingly found to be able to complement and form a fluorescent ternary assembly (also referred to as “FIRE”) with a fluorogen such as HBR-2,5DM or HMBR (also referred to as “the match”).
  • FIRE fluorescent ternary assembly
  • HBR-2,5DM or HMBR also referred to as “the match”.
  • fluorogens such as HBR-2,5DM or HMBR only emit substantial fluorescence when in an environment constraining their conformation and excluding deexcitation of their excited state, such as the assembly formed with the two polypeptides of SEQ ID NO: 1 (z.e., EIRE mate) and SEQ ID NO: 2 (z.e., EIRE tag).
  • the ternary assembly between the first polypeptide of SEQ ID NO: 1 (z.e., EIRE mate), the second polypeptide of SEQ ID NO: 2 (z.e., EIRE tag), and the fluorogen was found to occur upon addition of the fluorogen, regardless of any initial proximity between the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2.
  • fluorogens such as HBR-2,5DM or HMBR thus act as molecular inducers of proximity.
  • FKBP FK506-binding protein
  • FRB FKBP-rapamycin-binding domain of mammalian target of rapamycin
  • Treatment of the HEK293 cells with rapamycin is thus expected to induce the dimerization of FKBP and FRB, and subsequently to induce proximity between EIRE tag and EIRE mate.
  • treatment of the HEK293 cells with the fluorogens HBR-2,5DM ( Figure 2A) or HMBR ( Figure 2B) induced the formation of a ternary fluorescent assembly at all tested fluorogen concentrations (1, 5, 10, 25 or 50 pM as indicated).
  • EIRE tag was fused to the C-terminus of the red fluorescent protein mCherry for diffuse cytosolic expression
  • EIRE mate was fused to the C-terminus of the N-terminal domain of the mitochondrial outer membrane protein TOM20 (TOM20 1-34), so that pFASTl-114 faced the cytosol.
  • An enhanced cyan fluorescent protein (ECFP) was inserted between EIRE mate and TOM20 1-34 to monitor the localization of the resulting fusion protein (blue fluorescence).
  • the EIRE tag was fused at the N-terminus of mCherry ( EIRE tag-mCherry), or inserted in between two mCherry (mCherry- EIRE tag-mCherry).
  • the TOM20-ECFP- EIRE mate fusion protein as described above was co-expressed in HeLa cells with either the EIRE tag-mCherry fusion protein or with the mCherry- EIRE tag-mCherry fusion protein.
  • EIRE mate was anchored at the membrane of various organelles or cellular structures so that EIRE mate faced the cytosol. EIRE mate was thus fused:
  • an ECFP was inserted between EIRE mate and the other protein domain/sequence to monitor the localization of the resulting fusion protein (blue fluorescence).
  • Each fusion protein was co-expressed in mammalian cells (HeLa cells or HEK293 cells as indicated) together with mCherry- EIRE tag (red fluorescence).
  • HBR-2,5DM Figures 6B&D, 7B&D, 8B&D
  • HMBR Figures 6C&E, 7C&E, 8C&E
  • FIRE signal corresponding to the HBR-2,5DM or HMBR fluorescence
  • mCherry- EIRE tag occurred with very similar kinetics regardless of the organelle or cellular structure (z.e., Golgi apparatus, endoplasmic reticulum membrane, or plasma membrane). Induced proximity is reversible
  • Proximity can be induced with a non-fluorogenic chromophore
  • HBIR-3M non-fluorogenic chromophores
  • HBIR-3M is able to form very tight non-fluorescent ‘invisible’ assembly with EIRE tag and EIRE mate.
  • the fusion proteins mCherry- EIRE tag (red fluorescence) and EIRE mate-ECFP-Giantin (blue fluorescence) were co-expressed in HeLa cells as described above.
  • HBIR-3M was shown to induce the proximity of the proteins fused to EIRE mate (ie., ECFP-Giantin) and EIRE tag (z.e., mCherry) as efficiently as HBR-2,5DM ( Figure 9D).
  • halophila PYP C69G induces the folding of the caged EIRE tag protein, thus preventing any intermolecular interaction of the caged EIRE tag with EIRE mate.
  • the fusion proteins FKBP-caged EIRE tag and FRB- EIRE mate were co-expressed in HEK293T cells. The cells were then treated with the fluorogens HBR-2,5DM at various concentrations (1, 5, 10, 25 or 50 pM as indicated), in the presence or absence of rapamycin.
  • the proximity inducing system can be used simultaneously with a CIP system

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Urology & Nephrology (AREA)
  • Biomedical Technology (AREA)
  • Hematology (AREA)
  • Immunology (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Food Science & Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Cell Biology (AREA)
  • Medicinal Chemistry (AREA)
  • Biotechnology (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The present invention relates to a method for inducing proximity between two biological molecules in a sample, to an assay relying on the induction of proximity between two biological molecules in a sample, and to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity.

Description

PROXIMITY INDUCING SYSTEM AND USES THEREOF
FIELD OF INVENTION
[0001] The present invention relates to the control, and optionally the detection, of the localization of biological molecules. In particular, the present invention relates to a proximity inducing system that allows to induce, and optionally detect, proximity between two biological molecules. The present invention also relates to uses of said proximity inducing system.
BACKGROUND OF INVENTION
[0002] Many essential biological processes, such as gene regulation, protein transport, signal transduction, or metabolism, are governed and regulated by the physical closeness/interaction, or proximity, between biological molecules such as proteins and/or nucleic acids. In particular, proximity between proteins plays an essential and ubiquitous role in numerous biological processes.
[0003] Molecular tools enabling to control and observe the proximity of proteins are essential for studying the functional role of the physical distance between two proteins. Various molecular tools were thus developed to understand the role of proximity in cellular and physiological mechanisms. Most of these tools allow either to control protein proximity or to observe protein proximity.
[0004] In particular, chemically induced proximity (CIP) technologies have been developed to enable the precise temporal control of biological processes such as transcription, cell signaling, or protein localization. The CIP technologies or systems generally rely on the genetic fusion of proteins to dimerization domains that interact in a specific manner in presence of a small molecule, acting as a chemical inducer of proximity. Examples of CIP systems include systems based on the interaction between the FK506 binding protein (FKBP) and the FKBP-rapamycin binding domain (FRB), inducible with rapamycin or synthetic rapamycin analogs of rapamycin called rapalogs.
[0005] Of note, most of the CIP systems aim at controlling protein proximity and do not allow the detection of the protein interaction triggered by the chemical inducer of proximity and/or the visualization of the protein localization following proximity induction. Such detection and/or visualization thus usually require the use of an additional reporter, distinct from the interacting dimerization domains and the chemical inducer of proximity. Furthermore, CIP systems usually rely on two dimerization domains of more than 10 kDa each, that might lead to dysfunctional fusion proteins. For example, in rapamycin or rapalogs based CIP, FKBP is about 12 kDa and FRB is about 11 kDa. Finally, most CIP systems rely on chemical inducers of proximity that induce the formation of very tightly bound ternary assemblies. Such ternary assemblies are usually too stable to be dissociated by washing away the inducer, preventing the reversal of the recruitment process.
[0006] There is thus a need for improved proximity inducing systems. In particular, there is a need for robust, rapid, and fully reversible proximity inducing systems, allowing for the control, and advantageously the visualization, of protein proximity with high temporal resolution.
[0007] The Inventors surprisingly demonstrated that a first polypeptide of 114 amino acid residues having a sequence as set forth in SEQ ID NO: 1 and a second polypeptide of 11 amino acid residues having a sequence as set forth in SEQ ID NO: 2 are able to complement and form a ternary assembly with a small synthetic compound which acts as a molecular inducer of proximity. Indeed, the ternary assembly between the two polypeptide and the small synthetic compound is induced by said small synthetic compound regardless of any initial proximity between the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2. Accordingly, in the absence of the molecular inducer of proximity, the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2 are not able to complement. The two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2 and the molecular inducer of proximity thus form a proximity inducing system, that may be used to control the localization of biological molecules such as proteins. [0008] Of note, the molecular inducer of proximity may be a fluorogenic chromophore that emits substantial fluorescence only when interacting with the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2. The proximity inducing system may thus advantageously allow detection and/visualization of the ternary assembly without the need for any additional reporter. Furthermore, the Inventors also showed that the ternary assembly between the two polypeptide and the molecular inducer of proximity is extremely rapid and remains stable over a long period of time, unless the molecular inducer of proximity is removed. Indeed, the proximity inducing system is fully reversible and accordingly, can be used to control the proximity between two proteins with a high temporal resolution through addition and removal of the molecular inducer of proximity. As illustrated in the experimental section, through its ability to induce proximity between two biological molecules, and optionally to detect said proximity, the system described herein may be a relevant tool for biological and biomedical applications requiring the control of the localization, transport, stability and/or function of biological molecules.
SUMMARY
[0009] The present invention relates to a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2; adding a molecular inducer of proximity to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0010] The present invention also relates to an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2; adding a molecular inducer of proximity to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0011] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample is of less than about 5 pM.
[0012] In some embodiments, the molecular inducer of proximity is fluorogenic and the method or assay further comprises detecting a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and the fluorogenic molecular inducer of proximity.
[0013] In some embodiments, the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1, and the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2.
In some embodiments, the molecular inducer of proximity is a compound of formula (I)
Figure imgf000006_0001
or a salt and/or solvate thereof; wherein
R1, R2, R5 and R6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl; W represents OH, SH, NHR7 or NR7R8, wherein R7 and R8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
X represents O, S or NH; and
Z represents O, S or NH.
[0014] In some embodiments, the assay is for controlling the localization, the transport and/or the function of a biological molecule. In some embodiments, the assay is for inducing the degradation of a biological molecule. In some embodiments, the assay is for detecting the co-occurrence of two biological molecules in the sample.
[0015] In some embodiments, the first and second biological molecules are proteins.
[0016] The present invention also relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity, wherein: a first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; and a second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly.
[0017] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly is of less than about 5 pM. In some embodiments, the molecular inducer of proximity is fluorogenic. In some embodiments, the molecular inducer of proximity is a compound of formula (I) as described above, or a salt and/or solvate thereof.
[0018] In some embodiments, the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1, and the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2.
[0019] The present invention also relates to a cell or cell line comprising two biological molecules, wherein: a first biological molecule is coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; a second biological molecule is coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein in presence of a molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly.
[0020] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly is of less than about 5 pM. In some embodiments, the first and second biological molecule are proteins.
DEFINITIONS
[0021] In the present invention, the following terms have the following meanings: Chemical definitions
[0022] “Alkyl”, by itself or as part of another group, refers to a hydrocarbyl radical of formula CntCn i wherein n is a number greater than or equal to 1. Generally, alkyl groups of this invention comprise from 1 to 12 carbon atoms, preferably from 1 to 6 carbon atoms. Alkyl groups may be linear or branched and may be substituted as indicated herein. Non-limiting examples of alkyl groups include methyl, ethyl, propyl (zz-propyl, z-propyl), butyl (zz-butyl, z-butyl, .s-butyl and /-butyl), pentyl and its isomers (e.g., zz-pentyl, z.w-pentyl), and hexyl and its isomers (e.g., zz-hexyl, z.w-hexyl).
[0023] “Alkoxy” refers to any -O-alkyl or -O-aryl group. Generally, alkoxy groups of this invention are -O-alkyl groups.
[0024] “Amido” refers to -CONH2 group.
[0025] “Amino” refers to -NH2 group.
[0026] “Aryl” refers to a polyunsaturated, aromatic hydrocarbyl group having a single ring (z.e., phenyl) or multiple aromatic rings fused together (e.g., naphtyl) or linked covalently, typically containing from 5 to 12 carbon atoms, preferably from 6 to 10 carbon atoms, wherein at least one ring is aromatic. The aromatic ring may optionally include one to two additional rings (either cycloalkyl, heterocycloalkyl or heteroaryl) fused thereto. Aryl is also intended to include the partially hydrogenated derivatives of the carbocyclic systems enumerated herein, as long as at least one ring is aromatic. Nonlimiting examples of aryl include phenyl, biphenyl, biphenylenyl, 5- or 6-tetralinyl, naphthalen-1- or -2-yl, 4-, 5-, 6 or 7-indenyl, 1- 2-, 3-, 4- or 5-acenaphthylenyl, 3-, 4- or 5-acenaphthenyl, 1- or 2-pentalenyl, 4- or 5-indanyl, 5-, 6-, 7- or 8-tetrahydronaphthyl, 1,2,3,4-tetrahydronaphthyl, 1,4-dihydronaphthyl, and 1-, 2-, 3-, 4- or 5-pyrenyl.
[0027] “Carboxy” refers to -COOH group.
[0028] “Cyano” refers to -CN group.
[0029] “Cycloalkyl” refers to a cyclic alkyl group, z.e., a monovalent, saturated, or unsaturated hydrocarbyl group having 1 or 2 cyclic structures. Cycloalkyl includes monocyclic or bicyclic hydrocarbyl groups. Cycloalkyl groups may comprise 3 or more carbon atoms in the ring and generally, cycloalkyl groups of this invention comprise from 3 to 10, preferably from 3 to 8 carbon atoms, more preferably from 3 to 6 carbon atoms. This definition of “cycloalkyl” encompasses polycyclic cycloalkyls (e.g., bicycles) and bridged cycloalkyl structures, including cycles bound together through one atom (“spiro”) or through two atoms. Non-limiting examples of cycloalkyl groups include cyclopropyl, cyclobutyl, cyclopropyl, cyclohexyl, cycoheptyl, cyclooctanyl, cyclononanyl, cyclodecanyl, norbornyl, adamantyl, bicyclo[2.2.2]octanyl, bicyclo[4.4.0]decanyl, bicyclo[3.2.1]octanyl, bicyclo[3.3.1]nonanyl, bicyclo[2.1. l]hexane, 2,3-dihydro-lH-indenyl, 1,2,3,4-tetrahydronaphthalenyl, decahydronaphthalenyl, 1,2,3,4-tetrahydronaphthalenyl, and octahydropentalenyl.
[0030] “Halogen” or “halo” means fluoro, chloro, bromo, or iodo. Generally, halo groups of this invention are fluoro, chloro or bromo.
[0031] “Haloalkyl” refers to an alkyl group wherein one or more hydrogen atoms are replaced with a halogen atom. Non-limiting examples of haloalkyl groups include chloromethyl, 1 -bromoethyl, fluoromethyl, difluoromethyl, trifluoromethyl and 1,1,1 -trifluoroethyl .
[0032] “Haloalkoxy” refers to an alkoxy group wherein one or more hydrogen atoms are replaced with a halogen atom. Non-limiting examples of haloalkoxy groups include chloromethoxy, 1 -bromoethoxy, fluoromethoxy, difluoromethoxy, trifluoromethoxy and 1,1,1 -trifluoroethoxy .
[0033] “Heteroalkyl” refers to an alkyl group wherein one or more carbon atoms are replaced by a heteroatom, for example oxygen, nitrogen or sulfur atom, and wherein the resulting heteroalkyl group comprises at least one carbon atom. In heteroalkyl groups, the heteroatoms are bound along the alkyl chain only to carbon atoms, i.e., each heteroatom is separated from any other heteroatom by at least one carbon atom, typically by at least two carbon atoms. The nitrogen and sulfur heteroatoms may optionally be oxidized and the nitrogen heteroatoms may optionally be quaternized e.g., sulfur may be oxidized as SO or SO2). Heteroalkyl groups may further include one or more =0 and/or =S groups. In one embodiment, at least two carbon atoms are replaced by a heteroatom. In one embodiment, the heteroalkyl is bound to another group or molecule through a carbon atom, i.e., the binding atom is not selected among the heteroatoms included therein. In one embodiment, the heteroalkyl is bound to another group or molecule through one of the heteroatoms included therein. When substituted by one or more other group(s), an heteroalkyl may be substituted either through a carbon atom or through a heteroatom (e.g. , nitrogen), unless otherwise specified. Non-limiting examples of heteroalkyl include alkoxy, ethers and poly ethers (e.g, polyethylene glycol), secondary and tertiary amines and polyamines, thioethers and polythioethers, and combinations thereof.
[0034] “Heterocycloalkyl” refers to a cyclic heteroalkyl group, typically comprising from 2 to 15 carbon atoms, preferably from 2 to 11 carbon atoms, more preferably from 2 to 7 carbon atoms, furthermore preferably from 2 to 6 carbon atoms. Heterocycloalkyl groups are typically 3- to 7-membered, preferably 5- or 6-membered. Heterocycloalkyl are typically monocyclic or bicyclic, preferably monocyclic. This definition encompasses polycyclic heterocycloalkyls (e.g., bicycles) and bridged heterocycloalkyl structures, including cycles bound together through one atom (“spiro”) or through two atoms. In one embodiment, the heterocycloalkyl is bound to another group or molecule through a carbon atom, z.e., the binding atom is not selected among the heteroatoms included therein. In one embodiment, the heterocycloalkyl is bound to another group or molecule through one of the heteroatoms included therein. When substituted by one or more other group(s), an heterocycloalkyl may be substituted either through a carbon atom or through a heteroatom (e.g., nitrogen), unless otherwise specified. Non-limiting examples of heterocycloalkyl include aziridine, pyrrolidine, piperidine, piperazine, morpholine, thiomorpholine, azepane, azocane, octahydro-777-isoindole, decahydroisoquinoline, tetrahydrofuran, tetrahydropyran, tetrahydroisoquinoline
(e.g., 1,2,3,4-tetrahydroisoquiline), hexahydropyridazine, hexahydropyrazine, hexahydropyrimidine, decahydroquinoline, octahydropyrrolo[3,4-c]pyrrole, isoindoline, 1,2,3,4-tetrahydroquinoline and oxetane.
[0035] “Hydroxyl” refers to -OH group.
[0036] “Nitro” refers to -NO2 group.
[0037] “Oxo” refers to =0 group.
[0038] “Solvate” refers to a molecular complex comprising a compound of the invention and contains stoichiometric or sub-stoichiometric amounts of one or more pharmaceutically acceptable solvent molecule such as ethanol. The term “hydrate” refers to when said solvent is water.
General definitions
[0039] The terms “a” and “an” refer to one or to more than one (z.e., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element.
[0040] The term “about” preceding a figure means plus or minus 10%, or less, of the value of said figure. It is to be understood that the value to which the term “about” refers is itself also specifically, and preferably, disclosed.
[0041] “Amino acid” refers to both natural and synthetic amino acids, and both D- and L-amino acids. They are represented by their full name, their three-letter code or their one-letter code as well-known in the art. “Standard amino acid” or “naturally occurring amino acid” means any of the twenty standard L-amino acids commonly found in naturally occurring polypeptides. “Non-standard amino acid” means any amino acid, other than the standard amino acids, regardless of whether it is prepared synthetically or derived from a natural source. For example, naphtlylalanine can be substituted for tryptophan to facilitate synthesis. Other synthetic amino acids that can be substituted include, but are not limited to, L-hydroxypropyl, L-3,4- dihydroxyphenylalanyl, alpha-amino acids such as L-alpha-hydroxylysyl and D-alpha- methylalanyl, L-alpha-m ethylalanyl, beta-amino acids, and isoquinolyl. The polypeptides as described herein may comprise standard amino acids or non-standard amino acids. The term “amino acid” also encompasses chemically modified amino acids, including, but not limited to, salts, amino acid derivatives (such as amides), and substitutions. Amino acids contained within the polypeptides and fusion proteins as described herein, and particularly at the carboxy- or amino-terminus, can thus be modified by methylation, amidation, acetylation or substitution with other chemical groups. Additionally, a disulfide linkage may be present or absent in the polypeptides as described herein. Other polypeptide mimetics encompassed herein include polypeptides or fusion proteins as described herein having the following modifications: i) polypeptides wherein one or more of the peptidyl -C(O)NR- linkages (bonds) have been replaced by a non-peptidyl linkage such as a -CJfc-carbamate linkage (-CH2OC(O)NR-), a phosphonate linkage, a -CH2-sulfonamide (-CH2-S(O)2NR-) linkage, a urea (-NHC(O)NH-) linkage, a -CH2-secondary amine linkage, or with an alkylated peptidyl linkage (-C(O)NR-) wherein R is C1-C4 alkyl; ii) polypeptides wherein the N-terminus is derivatized to a -NRR1 group, to a -NRC(O)R group, to a -NRC(O)OR group, to a -NRS(O)2R group, to a -NHC(0)NHR group, where R and R1 are hydrogen or C1-C4 alkyl with the proviso that R and R1 are not both hydrogen; iii) polypeptides wherein the C terminus is derivatized to -C(O)R2, where R2 is selected from the group consisting of C1-C4 alkoxy, and -NR3R4, where R3 and R4 are independently selected from the group consisting of hydrogen and C1-C4 alkyl.
[0042] “Cell line” refers to cells obtained from the proliferation of a single cell and therefore forming a uniform population of cells comprising an identical genetic material.
[0043] “Complement” or “complementation” with reference to the first and second polypeptides as described herein refers to the capacity of said two polypeptides to assemble together to reconstitute a scaffold to which the molecular inducer of proximity as described herein can bind.
[0044] The term “CIP system” refers to a chemically induced proximity system.
[0045] The term “em” refers to emission, as in emission wavelength.
[0046] The term “ex” refers to excitation, as in excitation wavelength.
[0047] “Fluorogenic chromophore” or “fluorogen” refer to a chromophore, the brightness of which can be significantly enhanced by an environmental change. A fluorogenic chromophore or fluorogen is substantially non-fluore scent in solution under its free form, but brightens up when placed into an environment constraining its conformation and excluding deexcitation of its excited state by non-radiative pathways. In some embodiments, the fluorogenic chromophore is almost invisible in solution and becomes fluorescent upon binding of a protein scaffold, such as the one formed by the two polypeptides described herein.
[0048] “Identity” or “identical”, when used in a relationship between the sequences of two or more polypeptides or of two or more nucleic acids, refers to the degree of sequence relatedness between polypeptides or nucleic acids (respectively), as determined by the number of matches between strings of two or more amino acid residues or of two or more nucleotides, respectively. “Identity” measures the percent of identical matches between the smaller of two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (z.e., “algorithms”). Identity of related polypeptides or nucleic acid sequences can be readily calculated by known methods. Such methods include for example those described in Arthur M. Lesk, Computational Molecular Biology: Sources and Methods for Sequence Analysis (New- York: Oxford University Press, 1988); Douglas W. Smith, Biocomputing: Informatics and Genome Projects (New-York: Academic Press, 1993); Hugh G. Griffin and Annette M. Griffin, Computer Analysis of Sequence Data, Part 1 (New Jersey: Humana Press, 1994); Gunnar von Heinje, Sequence Analysis in Molecular Biology: Treasure Trove or Trivial Pursuit (Academic Press, 1987); Michael Gribskov and John Devereux, Sequence Analysis Primer (New York: M. Stockton Press, 1991); and Carillo etal., 1988. SIAM J. AppL Math. 48(5): 1073-1082. Preferred methods for determining identity are designed to give the largest match between the sequences tested. Methods of determining identity are described in publicly available computer programs. Examples of computer program methods for determining identity between two sequences include the GCG program package, including GAP (Devereux et al., 1984. Nucl. Acid. Res. 12(1 Pt l):387-395; Genetics Computer Group, University of Wisconsin Biotechnology Center, Madison, WI), BLASTP, BLASTN, TBLASTN and FASTA (Altschul et al., 1990. J. Mol. Biol. 215(3):403-410). The BLASTX program is publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul et al. NCB/NLM/NIH Bethesda, Md. 20894; Altschul et al., 1990. J. Mol. Biol. 215(3):403-410). The well-known Smith Waterman algorithm may also be used to determine identity.
[0049] “Nucleic acid” or “nucleic acid molecule” refers to a polymer of nucleotides covalently linked by phosphodiester bonds, such as deoxyribonucleic acids (DNAs) or ribonucleic acids (RNAs), in either single- or double-stranded form. A used herein, the term “nucleic acid” thus encompasses single-stranded, partially double-stranded, and fully double-stranded nucleic acids. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides.
[0050] “Nucleic acid sequence” or “nucleotide sequence” refers to a contiguous sequence of nucleotides in a single nucleic acid. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, orthologs, SNPs (single-nucleotide polymorphisms), and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Unless otherwise specified, a nucleotide sequence encoding an amino acid sequence includes all nucleotide sequences that are degenerate versions of each other and that encode the same amino acid sequence. The nucleotide sequence that encodes a protein or an RNA may also include introns to the extent that the nucleotide sequence encoding the protein or RNA may in some version contain at least one intron. Notably, a particular nucleic acid sequence described herein implicitly comprises its corresponding complementary sequence. It should be noted that a particular nucleic acid sequence described herein implicitly comprises the DNA sequence and the corresponding RNA sequence.
[0051] “Pharmaceutically acceptable excipient” or “pharmaceutically acceptable carrier” refers to an excipient or carrier that does not produce an adverse, allergic or other untoward reaction when administered to a subject, such as an animal, in particular a mammal, preferably a human. It includes any and all solvents, such as, for example, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents. A pharmaceutically acceptable excipient or carrier refers to a non-toxic solid, semi-solid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type. For human administration, preparations should meet sterility, pyrogenicity, general safety and purity standards as required by the regulatory offices such as the FDA (US Food and Drug Administration) or EMA (European Medicines Agency). [0052] “Reporter protein” refers to a protein that may be detected, localized or quantified as a way to indirectly assess a target of interest or a mechanism of interest.
[0053] “Sample” refers to a specimen or small quantity of material, in particular of biological material, generally solid or liquid. “Sample” may thus also refer to cells or tissues or organisms of interest.
[0054] “Vector” refers to a vehicle by which a nucleotide sequence (e.g., a DNA sequence or an RNA sequence), for example a nucleotide sequence encoding a polypeptide or protein can be introduced into a host cell, so as to transform, transfect or transduce the host cell and promote replication and/or expression (e.g., transcription and/or translation) of the introduced nucleotide sequence.
[0055] “Expression vector” refers to a vector comprising regulatory elements (or regulatory sequences) operatively linked to a nucleotide sequence to be expressed. An expression vector thus comprises sufficient cis-acting regulatory elements for controlling the expression of the nucleotide sequence; other elements that may be required for controlling the expression of the nucleotide sequence of interest may be supplied by a host cell.
DETAILED DESCRIPTION
[0056] One object of the invention is a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein. [0057] In some embodiments, the present invention relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein, wherein the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity form a ternary assembly.
[0058] In some embodiments, the present invention relates to a proximity inducing system comprising two polypeptides and a molecular inducer of proximity as described herein, wherein: a first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 1 as described herein, or a truncated fragment thereof as described herein; and a second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30% identity with SEQ ID NO: 2 as described herein, or a truncated fragment thereof as described herein, wherein the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM. [0059] As used herein, the expression “the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly” encompasses the formation of a ternary assembly between a truncated fragment of the first polypeptide as described herein, a truncated fragment of the second polypeptide as described herein and the molecular inducer of proximity as described herein.
[0060] According to the present invention, in presence of a molecular inducer of proximity, the first polypeptide as described herein and the second polypeptide as described herein (or truncated fragments thereof) are able to complement and bind the molecular inducer of proximity as described herein to form a ternary assembly. Strikingly, as illustrated hereinafter in the examples, the presence of the molecular inducer of proximity triggers the complementation of the first and the second polypeptides (or truncated fragments thereof) and their binding to the molecular inducer of proximity independently of any initial proximity between the first and the second polypeptides. In others words, a preexisting proximity between the first and the second polypeptides (or truncated fragments thereof) is not required for the molecular inducer of proximity to be able to trigger the complementation of the first and the second polypeptides (or truncated fragments thereof) and their binding to the molecular inducer of proximity.
[0061 ] According to the present invention, the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity thus form a ternary assembly independently of any initial proximity between the first and the second polypeptides (or truncated fragments thereof). In some embodiments, the first polypeptide (or a truncated fragment thereof), the second polypeptide (or a truncated fragment thereof) and the molecular inducer of proximity thus form a ternary assembly in the absence of any preexisting proximity between the first and the second polypeptides (or truncated fragments thereof).
[0062] In some embodiments, by “proximity” it is meant a distance shorter than about 20 nm, preferably shorter than about 10 nm, and more preferably shorter than about 5 nm.
[0063] Accordingly, the first polypeptide as described herein, or a truncated fragment thereof, is not able on its own to bind the molecular inducer of proximity as described herein. Similarly, the second polypeptide as described herein, or a truncated fragment thereof, is not able on its own to bind the molecular inducer of proximity as described herein. Moreover, the first polypeptide as described herein and the second polypeptide as described herein (or truncated fragments thereof) are not able to complement in the absence of the molecular inducer of proximity as described herein.
[0064] In some embodiments, the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is rapid. In some embodiments, by “rapid”, it is meant that a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is formed in less than about 90 seconds. In some embodiments, by “rapid”, it is meant that half maximal assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is reached in less than about 90 seconds, preferably in less than about 60 seconds, more preferably in less than about 45 second, even more preferably in less than about 30 seconds.
[0065] In some embodiments, the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is stable.
[0066] In some embodiments, the formation of a ternary assembly between the first polypeptide as described herein (or a truncated fragment thereof), the second polypeptide as described herein (or a truncated fragment thereof), and the molecular inducer of proximity as described herein is reversible.
[0067] As used herein, the concentration of molecular inducer of proximity required to reach half maximal assembly (that may also be referred to as half maximal ternary assembly) means the concentration of molecular inducer of proximity required to reach 50% of the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity.
[0068] Methods to evaluate, assess, or quantify the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity are well-known in the art. Such methods may notably rely on the evaluation, assessment, or quantification of the fluorescence emitted by a fluorogenic molecular inducer of proximity upon binding to the first and second polypeptides, and include spectrofluorimetry, flow cytometry analysis, and fluorescence microscopy (including laser scanning- and spinning-disk based confocal microscopy, multiphoton-microscopy, super-resolution microscopy).
[0069] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
[0070] In some embodiments, the first polypeptide as described herein comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 30%, 35%, 40%, 45%, 50%, or 55%, preferably at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1. In some embodiments, the first polypeptide as described herein comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1.
[0071] Examples of sequences having at least about 60% identity with SEQ ID NO: 1 include SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14. In some embodiments, the sequence having at least about 60% identity with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
[0072] Examples of sequences having at least about 65% identity with SEQ ID NO: 1 include SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14. In some embodiments, the sequence having at least about 65% with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14.
[0073] Examples of sequences having at least about 70% identity with SEQ ID NO: 1 include SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12. In some embodiments, the sequence having at least about 70% identity with SEQ ID NO: 1 is selected from the group comprising or consisting of SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, and SEQ ID NO: 12.
[0074] In some embodiments, the first polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 1. In the experimental section hereinafter, the polypeptide consisting of an amino acid sequence as set forth in SEQ ID NO: 1 is referred to as “FIREmate”.
[0075] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, or five of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0076] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises two, three, four, five, or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0077] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and a leucine (L) at position 109.
[0078] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, or eight of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0079] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises two, three, four, five, six, seven, eight, or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0080] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and a leucine (L) at position 109.
[0081] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises a glycine (G) at position 69 with reference to SEQ ID NO: 1.
[0082] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises at least one, two, three, four, five, or six of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and/or a glycine (G) at position 101.
[0083] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein further comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101.
[0084] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
- the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a glycine (G) at position 69, a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and at least one, two, three, four, or five of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0085] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
- the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a glycine (G) at position 69, a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and
- two, three, four, five or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0086] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
- the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a glycine (G) at position 69, a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and at least one, two, three, four, five, six, seven, or eight of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0087] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises:
- the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a glycine (G) at position 69, a tryptophan (W) at position 94, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, and a glycine (G) at position 101; and
- two, three, four, five, six, seven, eight, or all of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a threonine (T) at position 72, an alanine (A) at position 83, a threonine (T) at position 95, and/or a leucine (L) at position 109.
[0088] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or thirteen of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, a glycine (G) at position 101, and/or a leucine (L) at position 109.
[0089] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, a glycine (G) at position 101, and a leucine (L) at position 109.
[0090] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises at least one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, or sixteen of the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, a glycine (G) at position 101, and/or a leucine (L) at position 109.
[0091] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein comprises the following amino acid residues at the indicated positions with reference to SEQ ID NO: 1 : an asparagine (N) at position 17, a glutamic acid (E) at position 21, an arginine (R) at position 25, a valine (V) at position 30, a leucine (L) at position 41, a glycine (G) at position 69, a threonine (T) at position 72, an alanine (A) at position 83, a tryptophan (W) at position 94, a threonine (T) at position 95, an isoleucine (I) at position 96, a proline (P) at position 97, a threonine (T) at position 98, a serine (S) at position 99, an arginine (R) at position 100, a glycine (G) at position 101, and a leucine (L) at position 109.
[0092] In some embodiments, the truncated fragment of the first polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end of the first polypeptide as described herein, preferably the deletion of a number of consecutive amino acid residues ranging from 1 amino acid residue to 40, 35, 30, or 25 amino acid residues from the N-terminal end of the first polypeptide as described herein.
[0093] In some embodiments, the truncated fragment of the first polypeptide as described herein is truncated of the amino acid residues from position 1 to at most position 25 of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
[0094] In some embodiments, the truncated fragment of the first polypeptide as described herein is truncated of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 consecutive amino acid residue(s) from the N-terminal end (also referred to as N-ter) of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
[0095] In some embodiments, the truncated fragment of the first polypeptide as described herein comprises at least 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
[0096] In some embodiments, the truncated fragment of the first polypeptide as described herein comprises 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, or 113 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 1 as described herein.
[0097] In some embodiments, the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 30%, 35%, 40%, 45%, 50%, or 55%, preferably at least about 60%, 65% 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2.
[0098] In some embodiments, the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2.
[0099] Examples of sequences having at least about 60% identity with SEQ ID NO: 2 include SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20. In some embodiments, the sequence having at least about 60% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
[0100] Examples of sequences having at least about 70% identity with SEQ ID NO: 2 include SEQ ID NO: 15, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20. In some embodiments, the amino acid sequence having at least about 70% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 15, SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
[0101] Examples of sequences having at least about 80% identity with SEQ ID NO: 2 include SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20. In some embodiments, the amino acid sequence having at least about 80% identity with SEQ ID NO: 2 is selected from the group comprising or consisting of SEQ ID NO: 18, SEQ ID NO: 19, and SEQ ID NO: 20.
[0102] In some embodiments, the second polypeptide comprises or consists of an amino acid sequence as set forth in SEQ ID NO: 2. In the experimental section hereinafter, the polypeptide consisting of an amino acid sequence as set forth in SEQ ID NO: 2 is referred to as “FIREtag”.
[0103] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein comprises the following amino acid residue at the indicated position with reference to SEQ ID NO: 2: an arginine (R) at position 3.
[0104] In some embodiments, the amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein further comprises the following amino acid residue at the indicated position with reference to SEQ ID NO: 2: an isoleucine (I) at position 8.
[0105] In some embodiments, the truncated fragment of the second polypeptide as described herein comprises at least 8 consecutive amino acid residues of the second polypeptide as described herein. In some embodiments, the truncated fragment of the second polypeptide as described herein comprises 8, 9, or 10 consecutive amino acid residues of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein.
[0106] In some embodiments, the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end and/or the C-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues.
[0107] In some embodiments, the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the N-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues. In some embodiments, the truncated fragment of the second polypeptide as described herein results from the deletion of consecutive amino acid residues starting from the C-terminal end of the second polypeptide as described herein, preferably the deletion of 1, 2 or 3 consecutive amino acid residues.
[0108] In some embodiments, the truncated fragment of the second polypeptide as described herein comprises at least 8, 9, or 10 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein. In some embodiments, the truncated fragment of the second polypeptide as described herein comprises 8, 9, or 10 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 2 as described herein.
[0109] In some embodiments, the second polypeptide as described herein, or a truncated fragment thereof as described herein, is fused to a 113-amino acid fragment of a photoactive yellow protein (PYP) polypeptide having an amino acid sequence as set forth in SEQ ID NO: 21 or an amino acid sequence having at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99%, or more identity with the amino acid sequence as set forth in SEQ ID NO: 21, or to a truncated fragment thereof. Said truncated fragment may comprise at least 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 21 or of the sequence having at least about 60, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 21 as described herein. Said truncated fragment may thus comprise 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, or 112 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 21 or of the sequence having at least about 60, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 21 as described herein
[0110] In some embodiments, the molecular inducer of proximity is a compound of formula (I)
Figure imgf000030_0001
or a salt and/or solvate thereof; wherein R1, R2, R5 and R6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
W represents OH, SH, NHR7 or NR7R8, wherein R7 and R8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
X represents O, S or NH; and
Z represents O, S or NH.
[0111] According to the present invention, the alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl groups may be saturated or unsaturated. In some embodiments, the alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl groups are saturated.
[0112] In some embodiments, R1 represents H, alkyl or alkoxy. In some embodiments, R2 represents H, alkyl or alkoxy. In some embodiments, R3 represents H. In some embodiments, R4 represents a double bound terminated by S, O or NH. In some embodiments, R5 represents H In some embodiments, R6 represents H, alkyl or alkoxy. In some embodiments, W represents OH. In some embodiments, Z represents O.
[0113] In some embodiments, the molecular inducer of proximity is selected from the group comprising or consisting of: (Z)-5-(4-hydroxybenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HBR); (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HMBR); (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DOM); (Z)-5-(4-hydroxy-3-methoxybenzylidene)-2- thioxo-l,3-thiazolidin-4-one (HBR-3OM); (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DM); (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)- 2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DM); (Z)-5-(3-ethyl-4-hydroxybenzylidene)-2- thioxothiazolidin-4-one (HBR-3E); (Z)-5-(3-ethoxy-4-hydroxybenzylidene)-2- thioxothiazolidin-4-one (HBR-3OE); (Z)-5-(4-hydroxybenzylidene)-3-methyl-2- thioxothiazolidin-4-one (HBMR); (Z)-5-(2,4-dihydroxybenzylidene)-2-thioxo-l,3- thiazolidin-4-one (DHBR); (Z)-2-(5-(3-ethyl-4-hydroxybenzylidene)-4-oxo-2- thioxothiazolidin-3-yl)acetic acid (HBRAA-3E); (Z)-2-(5-(4-hydroxy-3- ethoxybenzylidene)-4-oxo-2-thioxothiazolidin-3-yl)acetic acid (HBRAA-3OE); (Z)-2- (5-(4-hydroxy-2-methoxybenzylidene)-4-oxo-2-thioxothiazolidin-3-yl)acetic acid (HBRAA-2OM); (Z)-2-(5-(4-hydroxybenzylidene)-4-oxo-2-thioxothiazolidin-3- yl)acetic acid (HBRAA); (Z)-2-(5-(4-hydroxy-3-methylbenzylidene)-4-oxo-2- thioxothiazolidin-3-yl)acetic acid (HBRAA-3M); (Z)-2-(5-(4-hydroxy-3- methoxybenzylidene)-4-oxo-2-thioxothiazolidin-3-yl)acetic acid (HBRAA-3OM); (Z)- 2-(5-(4-hydroxy-2-methoxybenzylidene)-4-oxo-2-thioxothiazolidin-3-yl)acetic acid (HBRAA-2OM); (Z)-2-(5-(4-hydroxy-2, 5-dimethylbenzylidene)-4-oxo-2- thioxothiazolidin-3-yl) acetic acid (HBRAA-2,5DM); (Z)-2-(5-(4-hydroxy-3,5- dimethylbenzylidene)-4-oxo-2-thioxothiazolidin-3-yl)acetic acid (HBRAA-3,5DM), (Z)-2-(5-(4-hydroxy-3,5-dimethoxybenzylidene)-4-oxo-2-thioxothiazolidin-3-yl) acetic acid (HBRAA-3,5DOM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-4-thioxo-l,3- thiazolidin-2-one (HBIR-3M), and (Z)-5-(4-hydroxy-2,5-dimethoxybenzylidene)-2- thioxo- 1 ,3 -thiazolidin-4-one (HBR-2,5DOM) [0114] In some embodiments, the molecular inducer of proximity is a compound of formula (II)
Figure imgf000033_0001
or a salt and/or solvate thereof; wherein
R1, R2 and R6 each independently represents H, alkyl or alkoxy; and
X, Y and Y each independently represents O, S or NH.
[0115] In some embodiments, R1 represents H, methyl (Me) or methoxy (OMe). In some embodiments, R2 represents H, methyl or methoxy. In some embodiments, R6 represents H, methyl or methoxy.
[0116] In some embodiments, R1, R2 and R6 respectively represent: (Me, H and H), (H, Me and Me), (Me, H and Me), (OMe, H and OMe) or (H, OMe and OMe).
[0117] In some embodiments, X, Y and Y respectively represent: (S, S and O), (S, O and S), (S, NH and O), (S, O and O) or (O, O and O).
[0118] In some embodiments, R1, R2, R6, X, Y and Y respectively represent: (Me, H, H, S, S and O), (H, Me, Me, S, O and S), (Me, H, Me, S, NH and O), (OMe, H, OMe, S, O and O) or (H, OMe, OMe, O, O and O).
[0119] In some embodiments, the molecular inducer of proximity is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3-methylbenzylidene)-4-thioxo-l,3- thiazolidin-2-one (HBIR-3M), (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(4-hydroxy-3-methoxybenzylidene)- 2-thioxo-l,3-thiazolidin-4-one (HBR-3OM), and (Z)-5-(4-hydroxy-2,5- dimethoxybenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HBR-2,5DOM).
[0120] In some embodiments, the molecular inducer of proximity is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo- l,3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-
1.3-thiazolidin-4-one (HMBR) and (Z)-5-(4-hydroxy-3-methylbenzylidene)-4-thioxo-
1.3-thiazolidin-2-one (HBIR-3M).
[0121] In some embodiments, the molecular inducer of proximity is (Z)-5-(4-hydroxy- 2,5-dimethylbenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DM)
Figure imgf000034_0001
[0122] In some embodiments, the molecular inducer of proximity is (Z)-5-(4-hydroxy- 3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR)
Figure imgf000034_0002
[0123] In some embodiments, the molecular inducer of proximity is (Z)-5-(4-hydroxy- 3-methylbenzylidene)-4-thi oxo-1, 3-thiazolidin-2-one (HBIR-3M)
Figure imgf000035_0001
[0124] In some embodiments, the molecular inducer of proximity is a chromophore.
[0125] In some embodiments, the molecular inducer of proximity is non-fluorogenic. In some embodiments, the molecular inducer of proximity is a non-fluorogenic chromophore. Examples of non-fluorogenic molecular inducers of proximity include (Z)- 5-(4-hydroxy-3-methylbenzylidene)-4-thi oxo-1, 3-thiazolidin-2-one (HBIR-3M).
[0126] In some embodiments, the molecular inducer of proximity is fluorogenic. In some embodiments, the molecular inducer of proximity is a fluorogenic chromophore. Examples of fluorogenic molecular inducers of proximity include (Z)-5-(4-hydroxy-2,5- dimethylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy- 3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3,5- dimethylbenzylidene)-2-thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5- dimethoxybenzylidene)-2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(4-hydroxy- 3-methoxybenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-3OM), and (Z)-5-(4- hydroxy-2,5-dimethoxybenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HBR-2,5DOM). In some embodiments, the fluorogenic chromophore is selected from the group comprising or consisting of (Z)-5-(4-hydroxy-2,5-dimethylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HBR-2,5DM), (Z)-5-(4-hydroxy-3-methylbenzylidene)-2-thioxo-l,3- thiazolidin-4-one (HMBR), (Z)-5-(4-hydroxy-3,5-dimethylbenzylidene)-2- thioxothiazolidin-4-one (HBR-3,5DM), (Z)-5-(4-hydroxy-3,5-dimethoxybenzylidene)- 2-thioxothiazolidin-4-one (HBR-3,5DOM), (Z)-5-(4-hydroxy-3-methoxybenzylidene)- 2-thioxo-l,3-thiazolidin-4-one (HBR-3OM), and (Z)-5-(4-hydroxy-2,5- dimethoxybenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DOM). In some embodiments, the fluorogenic chromophore is (Z)-5-(4-hydroxy-2,5- dimethylbenzylidene)-2-thioxo-l,3-thiazolidin-4-one (HBR-2,5DM) and (Z)-5-(4- hydroxy-3-methylbenzylidene)-2-thi oxo-1, 3-thiazolidin-4-one (HMBR).
[0127] Another object of the invention is a biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0128] In some embodiments, the biological molecule comprising the first and/or second polypeptide, or a truncated fragment thereof, is a biological molecule to which the first and/or second polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently.
[0129] Methods for non-covalently attaching a peptide or polypeptide to a biological molecule as described herein are well-known and include, for example, chemical coupling. Methods for covalently attaching a peptide or polypeptide to a biological molecule as described herein are well-known and include, for example, chemical coupling or genetic fusion as described herein.
[0130] As used herein, the term “biological molecule” encompasses any molecule present in a living organism. Examples of biological molecules include amino acids, polypeptides, proteins, monosaccharides, polysaccharides, nucleotides, nucleic acids, lipids, fatty acids, glycolipids, sterols, vitamins, hormones, neurotransmitters, and metabolites. In particular, the biological molecule may be a nucleic acid or a protein. Examples of nucleic acids include DNA and RNA such as messenger RNA (mRNA), transfer RNA (tRNA). Examples of proteins include transcription factors, enzymes, receptors, immunoreceptors, immunoglobulins, signaling proteins.
[0131] In some embodiments, the biological molecule is a protein.
[0132] Another object of the invention is thus a fusion protein comprising a protein of interest fused to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein. As illustrated in the experimental section hereinafter, the first and second polypeptide, or truncated fragments thereof, may each be expressed in fusion with any protein of interest within a host cell by inserting (for example via transformation or transfection) a nucleotide sequence which encodes both the polypeptide and the protein of interest.
[0133] Methods for fusing a peptide or polypeptide to a protein of interest are well- known. Briefly, such methods comprise inserting the nucleotide sequence encoding a protein of interest in frame with the nucleic sequence encoding the peptide(s) or polypeptide(s). The nucleic sequence encoding the protein of interest can be inserted so that the encoded peptide or polypeptide is situated at the N-terminal end of the protein of interest or at the C-terminal end of the protein of interest, or internally, as desired. Additionally, a short nucleic sequence encoding a linker or spacer may be present between the sequences coding for the peptide or polypeptide and for the protein of interest.
[0134] The protein of interest may be a natural protein, a chimeric protein resulting from the fusion of various protein domains, or a synthetic protein. The protein of interest may be an intracellular protein, a membrane protein, a cell surface protein present at least in part at the extra membranous surface, or a secreted protein. The protein of interest may be a transcription factor, an enzyme, a receptor, an immunoreceptor, an immunoglobulin, or a signaling protein.
[0135] In some embodiments, the protein of interest is a reporter protein. Reporter proteins allow to detect, assess and/or monitor a variation in a sample. Examples of variation that can be detected, assessed and/or monitored using reporter proteins include cell signaling, gene expression, metabolite or analyte concentration, cell-cell interaction, cell movement, cell death, intracellular transport, secretion, cell-cycle phase, and circadian rhythm.
[0136] In some embodiments, the fusion protein as described herein comprises at least one additional element other than the first and/or second polypeptide as described herein, or truncated fragments thereof as described herein, and the protein of interest. Example of such additional elements include linkers, targeting signals, localization signals, protease target sites, fluorescent proteins (such as fluorescent protein tags), antibody crystallizable fragments (Fc), enzymes.
[0137] In some embodiments, the fusion protein as described herein comprises a linker. Methods for designing or selecting a linker are well-known to one skilled in the art.
[0138] Another object of the invention is a kit comprising a biological molecule as described herein, such as a fusion protein as described herein, and a molecular inducer of proximity as described herein.
[0139] Another object of the invention is a pair of biological molecules as described herein, wherein one biological molecule comprises the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0140] In some embodiments, the present invention relates to a pair of biological molecules as described herein, with one biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
[0141] In some embodiments, the pair of biological molecules consists of one biological molecule to which the first polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently; and another biological molecule to which the second polypeptide, or a truncated fragment thereof, is coupled or attached, either covalently or non-covalently.
[0142] The pair of biological molecules may consist of two distinct biological molecules. The pair of biological molecules may consist of two biological molecules of different structure, such as a nucleic acid and a protein. The pair of biological molecules may consist of two distinct biological molecules of the same structure, such as two nucleic acids or two proteins. The pair of biological molecules may consist of two copies of the same biological molecule (such as two copies of the same protein), with one copy comprising the first polypeptide, or a truncated fragment thereof, and the other comprising the second polypeptide, or a truncated fragment thereof.
[0143] In some embodiment, the pair of biological molecules is a pair of fusion proteins as described herein, with one fusion protein comprising a protein of interest fused to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other comprising a protein of interest fused to the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0144] In some embodiments, both fusion proteins of the pair comprise the same protein of interest. In other words, in some embodiments, one fusion protein comprises the first polypeptide, or a truncated fragment thereof, and a protein of interest, and the other fusion protein comprises the second polypeptide, or a truncated fragment thereof, and the same protein of interest. In some embodiments, each fusion protein of the pair comprises a distinct protein of interest. In other words, in some embodiments, one fusion protein comprises the first polypeptide, or a truncated fragment thereof, and a first protein of interest, and the other fusion protein comprises the second polypeptide, or a truncated fragment thereof, and a second, different, protein of interest.
[0145] In some embodiments, at least one of the fusion proteins of the pair comprises a protein of interest being a reporter protein. In some embodiments, both fusion proteins of the pair comprise a protein of interest being a reporter protein, either the same reporter protein or different reporter proteins.
[0146] In some embodiments, at least one of the fusion proteins of the pair comprises an additional element as described herein, such as a linker, a targeting signal, a localization signal, a protease target site, an antibody crystallizable fragment (Fc), a fluorescent protein (such as a fluorescent protein tag), or an enzyme. [0147] Another object of the invention is a kit comprising a pair of biological molecules as described herein, such as a pair of fusion proteins as described herein, and a molecular inducer of proximity as described herein.
[0148] Another object of the invention is a nucleic acid molecule comprising or consisting of the nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein. Examples of nucleic acid molecule include DNA molecules and RNA molecules. Examples of DNA molecules include cDNA molecules (complementary DNA). Examples of RNA molecules include mRNA molecules (messenger RNA).
[0149] In some embodiments, by “nucleotide sequence encoding the first and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein” it is meant all nucleotide sequences that are degenerate versions of each other and that encode the same first and/or second polypeptide, or a truncated fragment thereof. One skilled in the art is familiar with methods for adapting a coding sequence on the basis of the genetic code, such as methods making use of codon degeneracy to introduce silent mutations and methods taking into account codon usage bias and variation of the standard genetic code relevant to the host cell considered.
[0150] In some embodiments, the nucleic acid molecule comprises or consists of a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein. An example of nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1 is a nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 22. Thus, in some embodiments, the nucleic acid molecule comprises or consists of a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22. [0151] In some embodiments, the nucleic acid molecule comprises or consists of a nucleotide sequence encoding the second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein. An example of nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2 is a nucleotide sequence comprising or consisting of as set forth in SEQ ID NO: 23. Thus, in some embodiments, the nucleic acid molecule comprises or consists of a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
[0152] In some embodiments, the nucleic acid molecule comprises or consists of a nucleotide sequence encoding a fusion protein as described herein.
[0153] Another object of the invention is a kit comprising the nucleic acid molecule as described herein, and a molecular inducer of proximity as described herein.
[0154] Another object of the invention is a pair of nucleic acid molecules as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0155] In some embodiments, the pair of nucleic acid molecules consists of one nucleic acid molecule comprising or consisting of a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein; and another nucleic acid molecule comprising or consisting of a nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein.
[0156] In some embodiments, the pair of nucleic acid molecules consists of one nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and another nucleic acid molecule comprising or consisting of a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
[0157] In some embodiments, the pair of nucleic acid molecules consists of two nucleic acid molecules comprising or consisting of nucleotides sequences encoding a fusion protein as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding a fusion protein comprising the first polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding a second fusion protein comprising the second polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein.
[0158] Another object of the invention is a kit comprising the pair of nucleic acid molecule as described herein, and a molecular inducer of proximity as described herein.
[0159] Another object of the invention is a vector, in particular an expression vector, comprising a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein. Examples of vectors include, without being limited to, plasmids, viral vectors, artificial chromosomes, liposomes, and lipid nanoparticles.
[0160] In some embodiments, the vector is a nucleic acid molecule as described herein. Example of such vectors include plasmids, artificial chromosomes.
[0161] In some embodiments, the vector as described herein comprises: a first nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein; and a second nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein. [0162] In some embodiments, the vector as described herein comprises: a first nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1 or a truncated fragment thereof as described herein; and a second nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2 or a truncated fragment thereof as described herein.
[0163] In some embodiments, the vector as described herein comprises: a first nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and a second nucleotide sequence comprising or consisting of the sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
[0164] The vector may comprise the nucleotide sequence(s) required for the fusion of the nucleic sequence encoding a protein of interest in frame with the nucleic sequence encoding the first polypeptide as described herein and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0165] In some embodiments, the vector as described herein comprises a nucleotide sequence encoding at least one fusion protein as described herein.
[0166] Another obj ect of the invention is a kit comprising the vector as described herein, and a molecular inducer of proximity as described herein.
[0167] Another object of the invention is a pair of vectors as described herein, wherein one vector comprises a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other vector comprises a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein. [0168] In some embodiments, the pair of vectors consists of one vector comprising a nucleotide sequence encoding a first polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 1, or a truncated fragment thereof as described herein; and another vector comprising a nucleotide sequence encoding a second polypeptide comprising or consisting of an amino acid sequence as set forth in SEQ ID NO: 2, or a truncated fragment thereof as described herein.
[0169] In some embodiments, the pair of vectors consists of one vector comprising a nucleotide sequence as set forth in SEQ ID NO: 22 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 22; and another vector comprising a nucleotide sequence as set forth in SEQ ID NO: 23 or a nucleotide sequence having at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identity with SEQ ID NO: 23.
[0170] In some embodiments, the pair of vectors consists of two vectors each comprising a nucleotide sequence encoding a fusion protein as described herein, wherein one vector comprises a nucleotide sequence encoding a fusion protein comprising the first polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein, and the other vector comprises a nucleotide sequence encoding a second fusion protein comprising the second polypeptide or a truncated fragment thereof as described herein and a protein of interest as described herein.
[0171] Another obj ect of the invention is a kit comprising the pair of vectors as described herein, and a molecular inducer of proximity as described herein.
[0172] Another object of the invention is a cell or a cell line comprising or expressing the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the cell or cell line described herein comprises or expresses both the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the second polypeptide as described herein, or a truncated fragment thereof as described herein. [0173] Another object of the invention is a cell or a cell line comprising or expressing a nucleic acid molecule comprising or consisting of the nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the cell or cell line comprises or expresses a pair of nucleic acid molecules as described herein, wherein one nucleic acid molecule comprises or consists of a nucleotide sequence encoding the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other biological molecule comprises or consists of a nucleotide sequence encoding the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0174] The cell or cell line as described herein may be a genetically modified cell or cell line, that is to say a cell or cell line genetically modified to express the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0175] The cell or cell line may be a prokaryotic cell or cell line, or a eukaryotic cell or cell line. The cell or cell line may be a bacterium cell or cell line, an archaea cell or cell line, a yeast cell or cell line, a plant cell or cell line, or an animal cell or cell line. The cell or cell line may be a mammal cell or cell line. The cell or cell line may be a human cell or cell line. The cell or cell line may be a primary cell or cell line, in particular a human primary cell or cell line. The cell or cell line may be an immortalized cell or cell line, in particular a human immortalized cell or cell line. The cell or cell line may be an immune cell or cell line, in particular a human immune cell or cell line.
[0176] In some embodiments, the cell or cell line as described herein comprises a vector as described herein. In some embodiments, the cell or cell line as described herein comprises a pair of vectors as described herein.
[0177] Within the cell or cell line as described herein, the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein, may be comprised within a biological molecule as described herein. Thus, within the cell or cell line as described herein, the first and/or second polypeptide, or a truncated fragment thereof, may be coupled or attached, either covalently or non-covalently, to a biological molecule.
[0178] In some embodiments, the cell or cell line as described herein comprises or expresses at least one biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0179] In some embodiments, the cell or cell line as described herein comprises or expresses at least one fusion protein comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and/or the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0180] In some embodiments, the cell or cell line as described herein comprises or expresses a pair of biological molecules as described herein. Thus, in some embodiments, the cell or cell line as described herein comprises or expresses a first biological molecule comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0181] In some embodiments, the cell or cell line as described herein comprises or expresses two biological molecules, wherein: a first biological molecule is coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; a second biological molecule is coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
[0182] In some embodiments, the cell or cell line as described herein comprises or expresses a pair of fusion proteins as described herein. Thus, in some embodiments, the cell or cell line as described herein comprises or expresses a first fusion protein comprising the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second fusion protein comprising the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0183] In some embodiments, the cell or cell line as described herein comprises or expresses two fusion proteins as described herein, wherein: a first fusion protein comprises the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a protein of interest as described herein; a second fusion protein comprises the second polypeptide as described herein, or a truncated fragment thereof as described herein, and a protein of interest as described herein, wherein in presence of a molecular inducer of proximity as described herein, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
[0184] Another object of the invention is a kit comprising the cell or cell line as described herein, and a molecular inducer of proximity as described herein.
[0185] In some embodiments, at least one of the fusion proteins comprised within or expressed by the cell or cell line as described herein comprises a reporter protein. In some embodiments, both fusion proteins comprised within or expressed by the cell or cell line as described herein comprise a reporter protein.
[0186] In some embodiments, the cell or cell line as described herein is a biosensor. By « biosensor », it is meant that the cell or cell line as described herein may be used to assess and/or monitor a target of interest, a physiological mechanism or a biological process of interest.
[0187] Examples of targets of interest include analytes or metabolites. In some embodiments, the cell as described herein may thus be used to assess and/or monitor the presence in the cell of an analyte or metabolite. [0188] Examples of physiological mechanisms or biological processes of interest include apoptosis, cell cycle, viral infection, intracellular protein trafficking and transport.
[0189] In some embodiments, the cell or cell line as described herein is a biosensor of proximity. By « biosensor of proximity », it is meant that the cell or cell line as described herein may be used to assess and/or monitor the intracellular localization and/or transport of a biological molecule, in particular a protein, such as a cargo protein.
[0190] In some embodiments, at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a therapeutic effector. In some embodiments, at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a fusion protein comprising a protein of interest which is a therapeutic effector. By “therapeutic effector”, it is meant a biological molecule, in particular a protein, able to induce a therapeutic effect upon administration to a subject.
[0191] The therapeutic effector may be a protein selected from the group comprising or consisting of transcription factors, enzymes, receptors, immunoreceptors, immunoglobulins, and signaling proteins. The therapeutic effector may be a signaling protein such as, for example, a chimeric antigen receptor (CAR). The therapeutic effector may be a transcription factor.
[0192] In some embodiments, the activity and/or function of the therapeutic effector is controlled through chemically induced proximity, chemically induced localization, chemically induced transport, chemically induced dimerization, and/or chemically induced degradation.
[0193] In some embodiments, at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a safety switch. In some embodiments, at least one of the biological molecule comprised within or expressed by the cell or cell line as described herein is a fusion protein comprising a protein of interest which is a safety switch. By “safety switch”, it is meant a biological molecule, in particular a protein, able to control the activity and/or viability of a cell, for example by controlling gene expression, and/or by inducing cell death (apoptosis). [0194] The safety switch may be a protein selected from the group comprising or consisting of transcription factors and enzymes. The safety switch may be an enzyme such as, for example, a caspase. The safety switch may be a transcription factor.
[0195] In some embodiments, the activity and/or function of the safety switch is controlled through chemically induced proximity, chemically induced localization, chemically induced transport, chemically induced dimerization, and/or chemically induced degradation.
[0196] Another object of the present invention is a pharmaceutical composition comprising or consisting of any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein; and optionally at least one pharmaceutically acceptable excipient or pharmaceutically acceptable carrier.
[0197] In some embodiments, the pharmaceutical composition as described herein further comprises a molecular inducer of proximity as described herein.
[0198] Another obj ect of the present invention is a medicament comprising or consisting of any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein. [0199] In some embodiments, the medicament as described herein further comprises a molecular inducer of proximity a described herein.
[0200] Another object of the present invention is a kit-of-parts comprising or consisting of: a first part comprising any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, or a cell or cell line as described herein; and a second part comprising a molecular inducer of proximity a described herein.
[0201] Another object of the present invention is any one of: a biological molecule as described herein, a pair of biological molecules as described herein, a nucleic acid molecule as described herein, a pair of nucleic acid molecules as described herein, a vector as described herein, a pair of vectors as described herein, a cell or cell line as described herein, a kit comprising any one of the above, and a molecular inducer of proximity as described herein, a pharmaceutical composition as described herein, or a kit-of-parts as described herein, for use as a medicament.
[0202] In some embodiments, any one of: a cell or cell line as described herein, a pharmaceutical composition as described herein comprising said cell or cell line, a medicament as described herein comprising said cell or cell line, or a kit-of-parts as described herein comprising said cell or cell line, is for use as a cell therapy or as a cellular therapy.
[0203] In some embodiments, any one of: a cell or cell line as described herein, a pharmaceutical composition as described herein comprising said cell or cell line, a medicament as described herein comprising said cell or cell line, or a kit-of-parts as described herein comprising said cell or cell line, is for use for in vivo transplantation in a subject in need thereof.
[0204] Another object of the invention is a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0205] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
[0206] Thus, in some embodiments, the present invention relates to a method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0207] In some embodiments, the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: coupling or attaching a first biological molecule to the first polypeptide as described herein, or a truncated fragment thereof as described herein; coupling or attaching a second biological molecule to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0208] In some embodiments, the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: expressing in the sample a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; expressing in the sample a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0209] The two biological molecules may be two distinct biological molecules. The two biological molecules may be two biological molecules of different structure, such as a nucleic acid and a protein. The two biological molecules may be two distinct biological molecules of the same structure, such as two nucleic acids or two proteins. The two biological molecules may be two copies of the same biological molecule (such as two copies of the same protein), with one copy coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and the other copy coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0210] The biological molecules may be selected from the group comprising or consisting of amino acids, polypeptides, proteins, monosaccharides, polysaccharides, nucleotides, nucleic acids, lipids, fatty acids, glycolipids, sterols, vitamins, hormones, neurotransmitters, and metabolites. In some embodiments, the biological molecules are proteins or nucleic acids. In some embodiments, the biological molecules are proteins.
[0211] In some embodiments, the biological molecules coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein, are fusion proteins comprising said first or second polypeptide, or a truncated fragment thereof, and a protein of interest as described herein. The two fusion proteins may comprise the same protein of interest or different proteins of interest.
[0212] In some embodiments, the biological molecules are non-covalently coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the biological molecules are covalently coupled to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein. In some embodiments, the biological molecules are proteins, and said proteins are fused to the first or second polypeptide as described herein, or a truncated fragment thereof as described herein.
[0213] In some embodiments, the method is for inducing proximity in a sample between a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, said method comprising: adding a molecular inducer of proximity as described herein to the sample, wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0214] The sample may be a biological sample. The sample may be an organism. Organisms considered herein are for instance model organisms used in biomedical research such as, for example, bacterium, yeast, fruit fly, nematode, zebrafish, mouse, rat, guinea pig, rabbit, and dog. Other organisms considered herein are plant organisms, such as Arabidopsis thaliana. Tobacco (for example Nicotiana labaciim). wheat (for example Triticum aestivum, Triticum durum, Triticum compactum), rice (for example Oryza sativa). Accordingly, the organism is not a human organism. The sample may be a cell, a cell line, or a cell culture. Examples of cells include, bacteria cells, archaea cells, yeast cells, plant cells, animal cells. Example of animal cells include mammal cells, in particular human cells.
[0215] The cells may be primary cells. The cells may be immortalized cells. The cells in the sample may be alive. The cells in the sample may be fixed for microscopy and imaging. [0216] In some embodiments, the organism or the cells in the sample, or a sub set thereof, is/are genetically modified to express the first and second polypeptides as described herein, or truncated fragments thereof as described herein, attached to the biological molecules as described herein, preferably proteins of interest.
[0217] In some embodiments, the method is a method for inducing proximity between two biological molecules in a cell sample, said method comprising the following steps: expressing a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, in the cells of the sample; expressing a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, in the cells of the sample; adding a molecular inducer of proximity as described herein to the cell sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly in the cell sample, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules expressed in the cells of the sample.
[0218] By “in the cell sample” it is meant that the ternary assembly formed between the first polypeptide, the second polypeptide and the molecular inducer of proximity may be intracellular, extracellular or localized at the membrane of the cells. Indeed, the first biological molecule and/or the second biological molecule expressed in the cells of the sample may be secreted by the cells of the sample or may be localized at the membrane of the cells of the sample.
[0219] In some embodiments, the method is a method for inducing proximity between two biological molecules in an organism, such as a plant organism or a non-human model organism, said method comprising the following steps: expressing a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, in the organism; expressing a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, in the organism; administering a molecular inducer of proximity as described herein to the organism; wherein, upon administration of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules expressed in the organism.
[0220] One skilled in the art is familiar with methods allowing to genetically modify organisms or cells in a culture in order for them to express a biological molecule of interest. Examples of such methods include transfection, electroporation, injection and transgenesis of nucleic acid molecules, such as a nucleic acid molecules as described herein. Said methods also include transplantation, injection and co-culture of cells modified to express proteins of interest.
[0221] In some embodiments, the method is for inducing proximity between two biological molecules in a sample, said method comprising the following steps: fusing a first biological molecule, preferably a protein, to the first polypeptide as described herein, or a truncated fragment thereof as described herein; fusing a second biological molecule, preferably a protein, to the second polypeptide as described herein, or a truncated fragment thereof as described herein; expressing both biological molecules in the sample, adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample. [0222] In some embodiments, in the methods as described herein, the molecular inducer of proximity as described herein is added in the sample at a concentration of less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, in the methods as described herein, the molecular inducer of proximity as described herein is added in the sample at a concentration ranging from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
[0223] The molecular inducer of proximity may be non-fluorogenic. The molecular inducer of proximity may be fluorogenic.
[0224] The method may thus further comprise a step of detecting and/or visualizing a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The method may thus allow to detect and/or visualize the localization of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0225] The method may further comprise a step of monitoring over time a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The method may thus allow to monitor over time the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0226] One skilled in the art is familiar with the techniques available to detect fluorescence in a sample and is able to choose among those depending on the sample, the excitation and emission spectra of the fluorogenic compound(s) and the desired readout. Examples of such techniques include direct observation, fluorescence spectroscopy, flow cytometry, fluorescence microscopy (including laser scanning- and spinning-disk based confocal microscopy, multiphoton-microscopy, super-resolution microscopy), and fluorescence tomography.
[0227] The method may further comprise a step of assessing and/or quantifying a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The method may thus allow to assess and/or quantify the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0228] In some embodiments, the method comprises a step of detecting a variation of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The method may thus comprise a step of detecting an increase or a decrease of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0229] In some embodiments, the method further allows to measure the amount of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity relative to a reference value by measuring fluorescence intensity. In some embodiments, said reference value corresponds to the fluorescence intensity value in a control sample. In some embodiments, said reference value corresponds to the fluorescence intensity value(s) at a different location(s) and/or time point(s) in the same sample.
[0230] Another object of the invention is an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0231] In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly is less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, the concentration of molecular inducer of proximity required to reach half maximal assembly ranges from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
[0232] Thus, in some embodiments, the present invention relates to an assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; obtaining a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0233] In some embodiments, the assay comprises the following steps: coupling or attaching a first biological molecule to the first polypeptide as described herein, or a truncated fragment thereof as described herein; coupling or attaching a second biological molecule to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0234] In some embodiments, the assay comprises the following steps: expressing in the sample a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein; expressing in the sample a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein; adding a molecular inducer of proximity as described herein to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0235] In some embodiments, the assay relies on the induction of proximity in a sample between a first biological molecule coupled to the first polypeptide as described herein, or a truncated fragment thereof as described herein, and a second biological molecule coupled to the second polypeptide as described herein, or a truncated fragment thereof as described herein, and the assay comprises: adding a molecular inducer of proximity as described herein to the sample, wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample.
[0236] The assay as described herein may be used for controlling the localization, transport, stability, degradation and/or function of a biological molecule or of a pair of biological molecules as described herein. In particular, the assay as described herein may be used for controlling in a cell the localization, transport, stability, degradation and/or function of a protein or of a pair of proteins as described herein. As mentioned herein, the protein(s) may be fusion protein(s) comprising the first or second polypeptide as described herein, or a truncated fragment as described herein, and a protein of interest as described herein.
[0237] In some embodiments, the assay as described herein is for controlling the localization, transport and/or function of a biological molecule or of a pair of biological molecules.
[0238] In some embodiments, the assay as described herein is for controlling the localization of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. For example, as illustrated in the experimental section hereinafter, the assay may be used for controlling in a cell the localization of a cytosolic protein, in particular a cytosolic fusion protein, by inducing proximity between said protein and a protein, in particular a fusion protein, localized in a cellular compartment, such as for example an organelle.
[0239] In some embodiments, the assay as described herein is for controlling the transport of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. For example, as illustrated in the experimental section hereinafter, the assay may be used for controlling the nucleocytoplasmic trafficking of a protein in a cell by inducing proximity between a protein, in particular a fusion protein, comprising a nuclear localization signal (NLS) and a cytosolic protein, in particular a cytosolic fusion protein, such as a protein comprising a nuclear export signal (NES).
[0240] In some embodiments, the assay as described herein is for controlling the function of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins.
[0241] In some embodiments, the assay as described herein is for controlling the stability of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. In some embodiments, the assay as described herein is for inducing the degradation of a biological molecule, in particular a protein, or of a pair of biological molecules, in particular a pair of proteins. [0242] In some embodiments, the assay as described herein is for detecting in the sample the co-occurrence of two biological molecules, in particular of two proteins. For example, the assay may be used for detecting in a cell the co-occurrence of two biological molecules, in particular of two proteins, in a cellular compartment, such as an organelle.
[0243] In some embodiments, in the assays as described herein, the molecular inducer of proximity as described herein is added to the sample at a concentration of less than about 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.25 pM. In some embodiments, in the assays as described herein, the molecular inducer of proximity as described herein is added in the sample at a concentration ranging from about 0.05 to about 5 pM, preferably from about 0.1 to about 2.5 pM, more preferably from about 0.1 to about 1 pM.
[0244] The molecular inducer of proximity may be non-fluorogenic. The molecular inducer of proximity may be fluorogenic.
[0245] The assay may further comprise a step of detecting and/or visualizing a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The assay may thus allow to detect and/or visualize the localization of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0246] The assay may further comprise a step of monitoring over time a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The assay may thus allow to monitor over time the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0247] The assay may further comprise a step of assessing and/or quantifying a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The assay may thus allow to assess and/or quantify the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. [0248] In some embodiments, the assay comprises a step of detecting a variation of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity. The assay may thus comprise a step of detecting an increase or a decrease of the fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity.
[0249] In some embodiments, the assay further comprises measuring the amount of the ternary assembly formed between the first polypeptide, the second polypeptide and a fluorogenic molecular inducer of proximity relative to a reference value by measuring fluorescence intensity. In some embodiments, said reference value corresponds to the fluorescence intensity value in a control sample. In some embodiments, said reference value corresponds to the fluorescence intensity value(s) at a different location(s) and/or time point(s) in the same sample. TABLE OF SEQUENCES
Figure imgf000063_0001
Figure imgf000064_0001
BRIEF DESCRIPTION OF THE DRAWINGS
[0250] Figure l is a schematic representation of the working principle of the proximity inducing system as described herein. The system relies on the coupling (for example through genetic fusion) of two proteins to two distinct polypeptides, called FIREmate (i.e., polypeptide of SEQ ID NO: 1) and FIREtag (z.e., polypeptide of SEQ ID NO: 2). These two polypeptides can interact in a specific manner only in presence of a small synthetic molecule (z.e., a molecular inducer of proximity called the “match”) that stabilizes the interaction between the two polypeptides. Advantageously, the match may be a fluorogenic chromophore, such as HBR-2,5DM or HMBR, that becomes fluorescent (z.e., “bright match” also referred to as “FIRE fluorescence” or “FIRE signal”) specifically upon formation of the ternary assembly consisting of the two polypeptides and the match (so-called “FIRE” ternary assembly). The system described herein is thus able to induce proximity between any two proteins and, optionally, to visualize the recruitment process by fluorescence imaging.
[0251] Figures 2A and 2B are graphs illustrating the fluorogen-dependent interaction between the two polypeptides EIREmate and EIREtag. The graphs show the normalized average fluorescence detected from about 50,000 HEK293T cells co-expressing the FK506-binding protein (FKBP) fused to EIREtag and the FKBP-rapamycin-binding domain of mammalian target of rapamycin (FRB) fused to EIREmate treated without (triangles) or with (squares) 500 nM of rapamycin, and with 1, 5, 10, 25 or 50 pM of HBR-2,5DM (2A) or HMBR (2B). Data represent the mean values ± SD of three independent experiments.
[0252] Figures 3A-E illustrate the recruitment of cytoplasmic proteins to mitochondria with the proximity inducing system described herein. Figure 3A is a schematic representation of the experimental design wherein HeLa cells co-expressing mCherry-EIREtag and Tom20-ECFP-EIREmate were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy. Figures 3B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 3B) or HMBR (match2 - 3C). Data represents the mean values ± SD of 20 cells from n = 3 independent experiments (3B, D), or of 10 cells from n = 3 independent experiments (3C, E). Figures 3D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 3D) or HMBR (match2 - 3E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
[0253] Figures 4A-D illustrate the recruitment over time of cytoplasmic proteins to mitochondria with the proximity inducing system described herein. (4A, C) HeLa cells co-expressing mCherry-FIREtag and Tom20-ECFP-FIREmate were treated with 10 pM HBR-2,5DM and imaged by time-lapse confocal microscopy at 1 image per 5 seconds. (4B, D) U2OS cells co-expressing mCherry-EIREtag and Tom20-ECFP-EIREmate were treated with 10 pM HBR-2,5DM and imaged by time-lapse confocal microscopy at 1 image every 2 minutes. Figure 4A is a representative time-lapse of the HeLa cells from the addition of HBR-2,5DM at 0 sec (ex/em 488/508-570 nm). Experiment was repeated 3 times with similar results. Scale bars are 20 pm. Figure 4B is a representative time-lapse of the U2OS cells from the addition ofHBR-2,5DM at 0 min (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiment was repeated 3 times with similar results. Scale bars are 20 pm. Figure 4C is a graph showing the temporal evolution of the FIRE signal detected in HeLa cells upon addition of HBR-2,5DM at 0 sec. Data represents the mean values ± SD of 15 cells from n = 3 independent experiments. Figure 4D is a graph showing the temporal evolution of the FIRE signal detected in U2OS cells upon addition of HBR-2,5DM at 0 min. Data represents the mean values ± SD of 10 cells from n = 3 independent experiments.
[0254] Figures 5A-D demonstrate that the EIREtag positioning does not influence the proximity inducing system described herein. HeLa cells co-expressing either EIREtag-mCherry (5A,C) or mCherry-EIREtag-mCherry (5B, D) and Tom20-ECFP- EIREmate were treated with 10 pM HBR-2,5DM (matchl) and imaged by time-lapse confocal microscopy. Figures 5A-B are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm. Figures 5C-D are graphs showing the temporal evolution of the FIRE signal detected upon addition of HBR-2,5DM at 0 min. Data represents the mean values ± SD of 17 cells from n = 3 independent experiments (5 A, C) or of 13 cells from n = 3 independent experiments (5B, D)
[0255] Figures 6A-E illustrate the recruitment of cytoplasmic proteins to the Golgi apparatus with the proximity inducing system described herein. Figure 6A is a representation of the experimental design wherein HeLa cells co-expressing mCherry-FIREtag and FIREmate-ECFP-Giantin were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy. Figures 6B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 6B) or HMBR (match2 - 6C). Data represents the mean values ± SD of 10 cells from n = 3 independent experiments (6B, D), or of 12 cells from n = 3 independent experiments (6C, E). Figures 6D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 6D) or HMBR (match2 - 6E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
[0256] Figures 7A-E illustrate the recruitment of cytoplasmic proteins to the endoplasmic reticulum with the proximity inducing system described herein. Figure 7A is a representation of the experimental design wherein HeLa cells co-expressing mCherry -EIREtag and EIREmate-ECFP-Cb5 were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy. Figures 7B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 7C) or HMBR (match2 - 7D). Data represents the mean values ± SD of 17 cells from n = 3 independent experiments (7B, D), or of 18 cells from n = 3 independent experiments (7C, E). Figures 7D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 7D) or HMBR (match2 - 7E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
[0257] Figures 8A-E illustrate the recruitment of cytoplasmic proteins to the plasma membrane with the proximity inducing system described herein. Figure 8A is a representation of the experimental design wherein HEK293 cells co-expressing mCherry-FIREtag and Lynl l-ECFP-FIREmate were treated with 10 pM of match (either HBR-2,5DM or HMBR), and imaged by time-lapse confocal microscopy. Figures 8B-C are graphs showing the temporal evolution of the FIRE signal detected upon addition at 0 min of HBR-2,5DM (matchl - 8B) or HMBR (match2 - 8C). Data represents the mean values ± SD of 22 cells from n = 3 independent experiments (8B, D), or of 30 cells from n = 3 independent experiments (8C, E). Figures 8D-E are sets of representative confocal micrographs of cells before (0 min) and after (5 min) addition of HBR-2,5DM (matchl - 8D) or HMBR (match2 - 8E) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
[0258] Figures 9A-D demonstrate the reversibility of the proximity inducing system described herein and the possibility to induce non-fluorogenic recruitment. HeLa cells co-expressing mCherry-EIREtag and EIREmate-ECFP-Giantin were treated with 10 pM HBR-2,5DM (matchl), washed with HBR-2,5DM-free medium and then treated with 10 pM HBIR-3M. Cells were imaged by time-lapse spinning-disk confocal microscopy. Figures 9A-D are sets of representative confocal micrographs of untreated cells (9A), cells after addition of HBR-2,5DM (matchl) (9B), cells after washout of HBR-2,5DM (matchl) (9C), and cells after addition of HBIR-3M (9D). (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm). Scale bars are 10 pm.
[0259] Figure 10 is a graph illustrating the fluorogen-dependent interaction between caged EIREtag and EIREmate. The graph shows the normalized average fluorescence detected from about 50,000 HEK293T cells co-expressing the FK506-binding protein (FKBP) fused to caged EIREtag and the FKBP-rapamycin-binding domain of mammalian target of rapamycin (FRB) fused to EIREmate treated without or with 500 nM of rapamycin, and with 1, 5, 10, 25 or 50 pM of HBR-2,5DM. Data represent the mean values ± SD of 3 independent experiments.
[0260] Figures 11A-C illustrate the use of a caged proximity inducing system as described herein. Figure 11A is a representation of the experimental design wherein HeLa cells co-expressing mCherry-cagedFIREtag and FIREmate-ECFP-Giantin were treated with 10 pM of match (z.e., HBR-2,5DM), and imaged by time-lapse confocal microscopy. Figure 1 IB is a graph showing the temporal evolution of the FIRE signal detected upon addition of HBR-2,5DM at 0 min, either with EIREtag or with cagedEIREtag. Data represents the mean values ± SD of 7 cells from n = 3 independent experiments. Figure 11C is a set of representative confocal micrographs of cells before (0 min) and after (9 min) addition of HBR-2,5DM (match 1) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Experiments were repeated 3 times with similar results. Scale bars are 20 pm.
[0261] Figures 12A-B demonstrate that the proximity inducing system described herein is orthogonal to the FRB/FKBP/rapamycin CIP system. Figure 12A is a representation of the experimental design wherein HeLa cells co-expressing mCherry-FKBP-EIREtag, TOM20-EIREmate and FRB-ECFP-Giantin were treated with 10 pM of match (z.e., HBR-2,5DM) and 500 nM rapamycin, and imaged by time-lapse confocal microscopy. Figure 12B is a set of representative confocal micrographs of cells before and after addition of HBR-2,5DM (match 1) and rapamycin (rap) (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Scale bars are 20 pm.
[0262] Figures 13A-D illustrate the nuclear export of proteins mediated by the proximity inducing system described herein. Figure 13A is a representation of the experimental design wherein HeLa cell co-expressing NLS-mCherry-EIREmate and NES- ECFP-EIREtag were treated with 10 pM of match (i.e., HBR-2,5DM) and imaged by timelapse confocal microscopy. Figure 13B is a graph showing the temporal evolution of the ratio nucleus-to-cytoplasm mCherry fluorescence intensities of n = 5 cells from 2 independent experiments. Figure 13C is a graph showing the temporal evolution of the FIRE signal detected upon addition of HBR-2,5DM at 0 min of n = 5 cells from 2 independent experiments. Figure 13D is a set of representative confocal micrographs before (0 min) and after (4 min) treatment with HBR-2,5DM (mCherry: ex/em 561/606-675 nm; FIRE: ex/em 488/508-570 nm; ECFP: ex/em 445/455-499 nm). Scale bars are 20 pm.
EXAMPLES
[0263] The present invention is further illustrated by the following examples.
Example:
Materials and Methods
Molecular cloning
[0264] Synthetic oligonucleotides used for cloning were purchased from Integrated DNA technology. PCR reactions were performed with Q5 polymerase (New England Biolabs) in the buffer provided. PCR products were purified using QIAquick PCR purification kit (Qiagen). DNAse I, T4 ligase, Fusion polymerase, Taq ligase and Taq exonuclease were purchased from New England Biolabs and used with accompanying buffers and according to the manufacturer’s protocols. Isothermal assemblies (Gibson assembly) were performed using homemade mix prepared according to previously described protocols (Gibson et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods. 2009 May;6(5):343-5). Small-scale isolation of plasmid DNA was done using QIAprep miniprep kit (Qiagen) from 2 mL overnight bacterial culture supplemented with appropriate antibiotics. Large-scale isolation of plasmid DNA was done using the QIAprep maxiprep kit (Qiagen) from 150 mL of overnight bacterial culture supplemented with appropriate antibiotics. All plasmid sequences were confirmed by Sanger sequencing with appropriate sequencing primers (GATC Biotech).
Chromophore synthesis
[0265] The synthesis of HMBR and HBR-2,5DM were previously reported (Plamont etal., Small fluorescence-activating and absorption-shifting tag for tunable protein imaging in vivo. Proc Natl Acad Sci U S A. 2016 Jan 19; 113(3):497-502) and Li et al.. Dynamic multicolor protein labeling in living cells. Chem Sci. 2017 Aug l;8(8):5598-5605).
Cell culture
[0266] HeLa cells (ATCC CRM-CCL2) were cultured in Minimal Essential Media (MEM) supplemented with phenol red, Glutamax I, 1 mM of sodium pyruvate, 1% (vol/vol) of non-essential amino-acids and 10% (vol/vol) fetal calf serum (FCS), at 37 °C in a 5% CO2 atmosphere. HEK 293T cells (ATCC CRL-3216) were cultured in Dulbecco's Modified Eagle Medium (DMEM) supplemented with phenol red and 10% (vol/vol) FCS at 37 °C in a 5% CO2 atmosphere. U2OS cells (ATCC HTB-96) were cultured in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with phenol red and 10% (vol/vol) FCS and 1% (vol/vol) penicillin-streptomycin at 37°C in a 5% CO2 atmosphere. For imaging, cells were seeded in pDish IBIDI (Biovalley) coated with poly-L-lysine. Cells were transiently transfected using Genejuice (Merck) according to the manufacturer’s protocols for 24 hours prior to imaging. Cells were washed with DPBS (Dulbecco’s Phosphate-Buffered Saline), and treated with DMEM medium (without serum and phenol red) supplemented with the compounds (e.g., molecular inducer of proximity) at the indicated concentration.
Flow cytometry analysis
[0267] Flow cytometry in HEK 293T cells was performed on a MACSQuant® analyzer equipped with 405 nm, 488 nm and 635 nm lasers and seven channels. To prepare samples, cells were first grown in cell culture flasks, then transiently co-transfected 24 hours after seeding using Genejuice (Merck) according to the manufacturer’s protocol for 24 hours. After 24 hours, cells were centrifuged in PBS-BSA (Phosphate-Buffered Saline with 1 mg/mL bovine serum albumin) and resuspended in PBS-BSA supplemented with the appropriate amounts of compounds (e.g., molecular inducer of proximity). For each experiment, 20,000 cells positively expressing mTurquoise2 (ex 434 nm/ em 473 nm) and iRFP670 (ex 638 nm / em 660 ± 10 nm) were analyzed with the following parameters: excitation (or ex) 488nm, emission (or em) 525 ± 20 nm. mTurquoise2 and iRFP670 are transfection markers used to select cells expressing the constructs of interest. Data were analyzed using FlowJo vlO.7.1. Fluorescence microscopy
[0268] The confocal micrographs were acquired on a Zeiss LSM 980 Laser Scanning Microscope equipped with a plan apochromat 63x /1.4 NA oil immersion objective. ZEN software was used to collect the data. The images were analyzed using Icy (2.4.0.0) and Fiji (Image J). In short, to track fluorescence signal in a specific organelle, a ROI (region of interest) was determined by masking the signal in the ECFP (Enhanced Cyan Fluorescent Protein) channel with the plug-in HK-means with the following parameters: intensity class equals 100, minimum object size (pixels or px) equals 500, maximum object size (px) equals 1500-3000. The signal intensity of the ROI was tracked over time for each channel by using plug-in Active Contours. The background signal was subtracted. Data were processed using GraphPad Prism v9.3.0.
Results
Proximity inducing system
[0269] Bisection of a mutated photoactive yellow protein (PYP) gave two fragments: a first polypeptide of 114 amino acid residues having a sequence as set forth in SEQ ID NO: 1 (also referred to as FIREmate), and a second polypeptide of 11 amino acid residues having a sequence as set forth in SEQ ID NO: 2 (also referred to as FIREtag). As shown on Figure 1, the two fragments, z.e., the two polypeptides of SEQ ID NO: 1 (z.e., EIREmate) and SEQ ID NO: 2 (z.e., EIREtag), were surprisingly found to be able to complement and form a fluorescent ternary assembly (also referred to as “FIRE”) with a fluorogen such as HBR-2,5DM or HMBR (also referred to as “the match”). Of note, fluorogens such as HBR-2,5DM or HMBR only emit substantial fluorescence when in an environment constraining their conformation and excluding deexcitation of their excited state, such as the assembly formed with the two polypeptides of SEQ ID NO: 1 (z.e., EIREmate) and SEQ ID NO: 2 (z.e., EIREtag). Strikingly, the ternary assembly between the first polypeptide of SEQ ID NO: 1 (z.e., EIREmate), the second polypeptide of SEQ ID NO: 2 (z.e., EIREtag), and the fluorogen was found to occur upon addition of the fluorogen, regardless of any initial proximity between the two polypeptides of SEQ ID NO: 1 and SEQ ID NO: 2. With EIREmate and EIREtag, fluorogens such as HBR-2,5DM or HMBR thus act as molecular inducers of proximity. [0270] The formation of a fluorescent ternary assembly was assessed by flow cytometry in HEK293 cells expressing FIREmate and FIREtag fused to the C-termini of FKBP (FK506-binding protein) and FRB (FKBP-rapamycin-binding domain of mammalian target of rapamycin), respectively. FKBP and FRB are able to dimerize upon addition of rapamycin. Treatment of the HEK293 cells with rapamycin is thus expected to induce the dimerization of FKBP and FRB, and subsequently to induce proximity between EIREtag and EIREmate. As shown on Figure 2, treatment of the HEK293 cells with the fluorogens HBR-2,5DM (Figure 2A) or HMBR (Figure 2B) induced the formation of a ternary fluorescent assembly at all tested fluorogen concentrations (1, 5, 10, 25 or 50 pM as indicated). Of note, upon addition of the fluorogen, the formation of a ternary fluorescent assembly occurred regardless of the presence or absence of rapamycin, showing that initial proximity between EIREtag and EIREmate was not necessary to induce the formation of the ternary fluorescent assembly between EIREtag, EIREmate, and HBR-2,5DM or HMBR. Accordingly, even in the absence of rapamycin, EIREtag, EIREmate, and HBR-2,5DM or HMBR were able to induce the dimerization of FKBP and FRB, that is to say to induce proximity between FKBP and FRB.
Proximity does not occur in the absence of fluorogens
[0271] To test whether or not EIREmate and EIREtag self-complement in absence of fluorogens, the two fragments were spatially separated to analyze their mutual affinity. As shown on Figure 3A, EIREtag was fused to the C-terminus of the red fluorescent protein mCherry for diffuse cytosolic expression, and EIREmate was fused to the C-terminus of the N-terminal domain of the mitochondrial outer membrane protein TOM20 (TOM20 1-34), so that pFASTl-114 faced the cytosol. An enhanced cyan fluorescent protein (ECFP) was inserted between EIREmate and TOM20 1-34 to monitor the localization of the resulting fusion protein (blue fluorescence). Fluorescence imaging of HeLa cells co-expressing the two fusion proteins in absence of fluorogen showed a diffused cytosolic localization of the mCherry-EIREtag fusion protein (red fluorescence) and a mitochondria localization of the TOM20-ECFP-EIREmate fusion protein (blue fluorescence), suggesting that EIREmate and EIREtag do not spontaneously complement (Figures 3B-E). [0272] Addition of HBR-2,5DM triggered a very rapid translocation of the mCherry-FIREtag fusion protein from the cytosol to mitochondria, as observed by the cytosol -to-mitochondria shift of the red fluorescence (z.e., mCherry fluorescence), and led to the simultaneous appearance of a strong green fluorescent signal (z.e., FIRE signal corresponding to the HBR-2,5DM fluorescence)) at the mitochondria (Figures 3B, D). These observations suggest that fluorogen-induced complementation of FIREmate and EIREtag is able to mediate the interaction of mCherry and TOM20-ECFP in a sustainable manner, and to simultaneously image the formed complex. Similar results were observed with the fluorogen HMBR (Figures 3C, E).
[0273] These experiments thus demonstrate that fluorogens such as HBR-2,5DM or HMBR can induce the rapid and sustainable proximity/interaction of two proteins fused to EIREmate and EIREtag, respectively, and light up the recruitment process through formation of a sustainable ternary fluorescent assembly between EIREmate, EIREtag, and a fluorogen such as HBR-2,5DM or HMBR.
Induced proximity is extremely rapid and stable
[0274] Additional time-lapse confocal microscopy experiments were conducted in HeLa cells co-expressing the two fusion proteins (z.e., mCherry- EIREtag and TOM20-ECFP-EIREmate) and treated with HBR-2,5DM. As shown on Figures 4A, C, these experiments indicated that the formation of the ternary fluorescent assembly occurred with a half-time of about 20 seconds. This suggests that the process is extremely rapid and almost only limited by the cellular uptake and diffusion of the fluorogen. Furthermore, the time-lapse confocal microscopy experiments also indicated that the ternary fluorescent assembly was stable on a long period of time as observed on Figures 4B, D, with HBR-2,5DM fluorescence detected for more than 25 minutes.
Induced proximity does not depend on the positioning within a fusion protein
[0275] To examine the impact of the EIREtag positioning on the efficiency of formation of the fluorescent ternary assembly, the EIREtag was fused at the N-terminus of mCherry (EIREtag-mCherry), or inserted in between two mCherry (mCherry-EIREtag-mCherry). The TOM20-ECFP-EIREmate fusion protein as described above was co-expressed in HeLa cells with either the EIREtag-mCherry fusion protein or with the mCherry-EIREtag-mCherry fusion protein. Addition of HBR-2,5DM resulted in a similar rapid and stable translocation of both the FIREtag-mCherry fusion protein (Figures 5A, C) and the mCherry-FIREtag-mCherry fusion protein (Figures 5B, D) to mitochondria, demonstrating that the positioning of the EIREtag into a fusion protein has no impact on the efficiency of formation of the fluorescent ternary assembly.
Use of the proximity inducing system to control localization of a protein within a cell [0276] To further evaluate the scope of the proximity inducing system described herein and show its versatility, EIREmate was anchored at the membrane of various organelles or cellular structures so that EIREmate faced the cytosol. EIREmate was thus fused:
- to the N-terminus of the transmembrane domain (amino acid residues at positions 3131-3259) of Giantin for location at the cis-medial rims of the Golgi apparatus (Figure 6A),
- to the N-terminus of the transmembrane domain (amino acid residues at positions 100-134) of the cytochrome b5 (Cb5) for location at the endoplasmic reticulum membrane (Figure 7A), and
- to the C-terminus of the Lynl 1 inner membrane targeting sequence for location at the plasma membrane (Figure 8A).
For each construct, an ECFP was inserted between EIREmate and the other protein domain/sequence to monitor the localization of the resulting fusion protein (blue fluorescence). Each fusion protein was co-expressed in mammalian cells (HeLa cells or HEK293 cells as indicated) together with mCherry-EIREtag (red fluorescence). Addition of either HBR-2,5DM (Figures 6B&D, 7B&D, 8B&D) or HMBR (Figures 6C&E, 7C&E, 8C&E) resulted in rapid translocation of mCherry-EIREtag to the organelle where the EIREmate fusion protein was expressed, as observed by the cytosol-to-organelle shift of the red fluorescence and the simultaneous appearance of green fluorescence (ie., FIRE signal corresponding to the HBR-2,5DM or HMBR fluorescence) at the organelle localization, demonstrating efficient formation of a fluorescent ternary assembly between EIREtag, EIREmate, and either HBR-2,5DM or HMBR. The translocation of mCherry- EIREtag occurred with very similar kinetics regardless of the organelle or cellular structure (z.e., Golgi apparatus, endoplasmic reticulum membrane, or plasma membrane). Induced proximity is reversible
[0277] To test whether formation of the fluorescent ternary assembly could be reversed though removal of the fluorogen, the fusion proteins mCherry-FIREtag (red fluorescence) and FIREmate-ECFP-Giantin (blue fluorescence) were co-expressed in HeLa cells (Figure 9A). Formation of a fluorescent ternary assembly between EIREtag, EIREmate and HBR-2,5DM was induced by addition of 10 pM of HBR-2,5DM (Figure 9B). The washing of cells with HBR-2,5DM-free medium led to rapid dissociation of the fluorescent ternary assembly and simultaneous release of mCherry into the cytosol (Figure 9C), demonstrating the full reversibility of the fluorescence ternary assembly. The ability to reverse the fluorescence ternary assembly at will allowed to induce repeated catch and release cycles through sequential addition and removal of HBR-2,5DM. The ability to control protein proximity through addition and removal of a molecular inducer of proximity is unprecedented and opens great prospect to control the proximity of two proteins with high temporal resolution.
Proximity can be induced with a non-fluorogenic chromophore
[0278] Next, experiments were carried out to investigate whether other chromophores could act as molecular inducer of proximity and provide new properties. In particular, non-fluorogenic chromophores, such as HBIR-3M, were investigated. HBIR-3M is able to form very tight non-fluorescent ‘invisible’ assembly with EIREtag and EIREmate. The fusion proteins mCherry-EIREtag (red fluorescence) and EIREmate-ECFP-Giantin (blue fluorescence) were co-expressed in HeLa cells as described above. Through cycles of sequential addition and removal, HBIR-3M was shown to induce the proximity of the proteins fused to EIREmate (ie., ECFP-Giantin) and EIREtag (z.e., mCherry) as efficiently as HBR-2,5DM (Figure 9D).
Caged proximity inducing system
[0279] Although, as indicated above, no self-complementation between EIREtag and EIREmate could be observed in the absence of fluorogen (see Figure 3), a caged EIREtag was designed to prevent any interaction with EIREmate in the absence of fluorogen, even at high expression levels. To cage EIREtag, the N-terminal domain 2-114 of Halorhodospira halophila photoactive yellow protein (PYP) C69G (corresponding to SEQ ID NO: 21) was fused to FIREtag. The intramolecular interaction between FIREtag and the N-terminal domain 2-114 of H. halophila PYP C69G induces the folding of the cagedEIREtag protein, thus preventing any intermolecular interaction of the cagedEIREtag with EIREmate. The fusion proteins FKBP-cagedEIREtag and FRB-EIREmate were co-expressed in HEK293T cells. The cells were then treated with the fluorogens HBR-2,5DM at various concentrations (1, 5, 10, 25 or 50 pM as indicated), in the presence or absence of rapamycin. As shown on Figure 10, analysis by flow cytometry showed that cagedEIREtag could still form a fluorescent ternary assembly with EIREmate and HBR-2,5DM, even in absence of rapamycin, that is to say regardless of any initial proximity between cagedEIREtag and EIREmate. Of note, in accordance with the caging of EIREtag, higher concentrations of HBR-2,5DM were required to reach full ternary assembly in absence of rapamycin. These results suggest that the formation of a highly stable ternary assembly provides the driving force required to uncage EIREtag for efficient complementation with EIREmate via a mutually exclusive folding mechanism.
[0280] To further verify that cagedEIREtag and EIREmate do not interact together, the fusion proteins mCherry-cagedEIREtag and EIREmate-ECFP-Giantin were co-expressed in HeLa cells (Figure 11 A). The cells were then imaged in absence or in presence of HBR-2,5DM (Figure 11C). In absence of HBR-2,5DM, the two fusion proteins did not colocalize, confirming that cagedEIREtag and EIREmate exhibit no significant binding affinity. However, treatment with HBR-2,5DM resulted in efficient recruitment of mCherry-cagedEIREtag to the Golgi, demonstrating efficient complementation of cagedEIREtag and EIREmate. Of note, the need for unfolding cagedEIREtag for efficient complementation led however to slower kinetics, as compared to fluorescent ternary assembly between EIREtag, EIREmate, and HBR-2,5DM (Figure 11B).
The proximity inducing system can be used simultaneously with a CIP system
[0281] In order to control more than one protein-protein interaction, the combination of the “FIRE” proximity inducing system described herein with the FRB-FKBP-rapamycin CIP system was next investigated (Figure 12). The chimeric fusion protein mCherry-FKBP-EIREtag was expressed together with both the FRB-EGFP-Giantin and the TOM20-EIREmate fusion proteins in HeLa cells (Figure 12A). Simultaneous addition of HBR-2,5DM and rapamycin triggered the translocation of mCherry-FKBP-FIREtag to the Golgi apparatus and to the mitochondria (Figure 12B), demonstrating that the two systems are orthogonal and can be used for controlling two interactions simultaneously.
Use of the proximity inducing system to control nucleocytoplasmic trafficking of a protein [0282] To further demonstrate the potential of the proximity inducing system described herein, an assay was developed to control nucleocytoplasmic trafficking of proteins (Figure 13). Nucleocytoplasmic trafficking is tightly regulated through mechanisms involving nuclear localization signals (NLS) and nuclear export signal (NES). Control of nuclear protein export was first investigated using NLS-mCherry-FIREmate as an NLS-containing cargo molecule and NES-ECFP-FIREtag as the export NES-containing cargo partner (Figure 13A). In absence of HBR-2,5DM (Figure 13D - 0 min), NLS-mCherry-EIREmate was almost equally distributed between the nucleus and the cytosol. The export cargo molecule NES-ECFP-FIRFtag could freely enter the nuclear compartment by diffusion. As shown on Figure 13D (4 min), addition of HBR-2,5DM resulted in efficient cytoplasmic localization of the formed protein complex, as judged by a nuclear-to-cytoplasmic shift of the NLS-mCherry-FIRFmate fluorescence (Figure 13B) and the appearance of a strong cytoplasmic green fluorescence (Figure 13C). These results confirm that in protein assemblies containing both NLS and NES, the export activity prevails over the import activity (Busch et al. Quantification of nuclear protein transport using induced heterodimerization. Traffic. 2009 Sep;10(9): 1221-7).

Claims

CLAIMS A method for inducing proximity between two biological molecules in a sample, said method comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2; adding a molecular inducer of proximity to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample. An assay relying on the induction of proximity between two biological molecules in a sample, said assay comprising the following steps: obtaining a first biological molecule coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; obtaining a second biological molecule coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2; adding a molecular inducer of proximity to the sample; wherein, upon addition of the molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly in the sample being of less than about 5 pM, thereby inducing proximity between the first and the second biological molecules present in the sample. The method according to claim 1 or the assay according to claim 2, wherein the molecular inducer of proximity is fluorogenic and wherein the method or assay further comprises detecting a fluorescence resulting from the formation of the ternary assembly between the first polypeptide, the second polypeptide and the fluorogenic molecular inducer of proximity. The method according to claim 1 or 3 or the assay according to claim 2 or 3, wherein the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1, and the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2. The method according to any one of claims 1 or 3-4 or the assay according to any one of claims 2 to 4, wherein the molecular inducer of proximity is a compound of formula (I)
Figure imgf000081_0001
or a salt and/or solvate thereof; wherein
R1, R2, R5 and R6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
W represents OH, SH, NHR7 or NR7R8, wherein R7 and R8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl; X represents O, S or NH; and Z represents O, S or NH.
6. The assay according to any one of claims 2 to 5, wherein said assay is for controlling the localization, the transport and/or the function of a biological molecule.
7. The assay according to any one of claims 2 to 6, wherein said assay is for inducing the degradation of a biological molecule.
8. The assay according to any one of claims 2 to 5, wherein said assay is for detecting the co-occurrence of two biological molecules in the sample.
9. The method according to any one of claim 1 to 5 or the assay according to any one of claim 2 to 8, wherein the first and second biological molecules are proteins.
10. A proximity inducing system comprising two polypeptides and a molecular inducer of proximity, wherein: a first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; and a second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein the first polypeptide, the second polypeptide and the molecular inducer of proximity form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM. The proximity inducing system according to claim 10, wherein the molecular inducer of proximity is fluorogenic. The proximity inducing system according to claim 10 or 11, wherein the molecular inducer of proximity is a compound of formula (I)
Figure imgf000083_0001
or a salt and/or solvate thereof; wherein
R1, R2, R5 and R6 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl (e.g., alkoxy) or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R3 represents a non-binding doublet (z.e., a free pair of electrons), H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
R4 represents a single or a double bound, interrupted or terminated by one S, O or N heteroatom, wherein said heteroatom is optionally substituted by at least one group selected from H, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl and heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl; W represents OH, SH, NHR7 or NR7R8, wherein R7 and R8 each independently represents H, halo, hydroxyl, aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl, wherein said aryl, alkyl, cycloalkyl, heteroalkyl or heterocycloalkyl is optionally substituted by at least one group selected from halo, hydroxyl, oxo, nitro, amido, carboxy, amino, cyano, haloalkoxy and haloalkyl;
X represents O, S or NH; and
Z represents O, S or NH. The proximity inducing system according to any one of claims 10 to 12, wherein the first polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 1, and the second polypeptide consists of an amino acid sequence as set forth in SEQ ID NO: 2. A cell or cell line comprising two biological molecules, wherein: a first biological molecule is coupled to a first polypeptide, wherein said first polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 1 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 1, or a truncated fragment thereof comprising at least 89 consecutive amino acid residues from the C-terminal end of SEQ ID NO: 1 or of the sequence having at least about 70% identity with SEQ ID NO: 1; a second biological molecule is coupled to a second polypeptide, wherein said second polypeptide comprises an amino acid sequence as set forth in SEQ ID NO: 2 or an amino acid sequence having at least about 70% identity with SEQ ID NO: 2, or a truncated fragment thereof comprising at least 8 consecutive amino acid residues from the N-terminal end of SEQ ID NO: 2 or of the sequence having at least about 70% identity with SEQ ID NO: 2, wherein in presence of a molecular inducer of proximity, the first polypeptide, the second polypeptide and the molecular inducer of proximity are able to form a ternary assembly, with the concentration of molecular inducer of proximity required to reach half maximal assembly being of less than about 5 pM.
15. The cell or cell line according to claim 14, wherein the first and second biological molecule are proteins.
PCT/EP2023/074340 2022-09-05 2023-09-05 Proximity inducing system and uses thereof WO2024052353A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP22306308 2022-09-05
EP22306308.2 2022-09-05

Publications (1)

Publication Number Publication Date
WO2024052353A1 true WO2024052353A1 (en) 2024-03-14

Family

ID=83898036

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2023/074340 WO2024052353A1 (en) 2022-09-05 2023-09-05 Proximity inducing system and uses thereof

Country Status (1)

Country Link
WO (1) WO2024052353A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016001437A2 (en) * 2014-07-04 2016-01-07 Paris Sciences Et Lettres - Quartier Latin Fluorogen activating and shifting tag (fast)
EP3719007A1 (en) * 2019-04-05 2020-10-07 Paris Sciences et Lettres - Quartier Latin Split photoactive yellow protein complementation system and uses thereof
US20210396762A1 (en) * 2020-06-19 2021-12-23 Encodia, Inc. Methods for peptide analysis employing multi-component detection agent and related kits
WO2022038353A1 (en) * 2020-08-21 2022-02-24 Nuclera Nucleics Ltd Monitoring of in vitro protein synthesis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016001437A2 (en) * 2014-07-04 2016-01-07 Paris Sciences Et Lettres - Quartier Latin Fluorogen activating and shifting tag (fast)
EP3719007A1 (en) * 2019-04-05 2020-10-07 Paris Sciences et Lettres - Quartier Latin Split photoactive yellow protein complementation system and uses thereof
US20210396762A1 (en) * 2020-06-19 2021-12-23 Encodia, Inc. Methods for peptide analysis employing multi-component detection agent and related kits
WO2022038353A1 (en) * 2020-08-21 2022-02-24 Nuclera Nucleics Ltd Monitoring of in vitro protein synthesis

Non-Patent Citations (13)

* Cited by examiner, † Cited by third party
Title
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, no. 3, 1990, pages 403 - 410
BATZER ET AL., NUCLEIC ACID RES., vol. 19, 1991, pages 5081
BUSCH ET AL.: "Quantification of nuclear protein transport using induced heterodimerization", TRAFFIC, vol. 10, no. 9, September 2009 (2009-09-01), pages 1221 - 7, XP002594231
CARILLO ET AL., SIAM J. APPL. MATH., vol. 48, no. 5, 1988, pages 1073 - 1082
DEVEREUX ET AL.: "Nucl. Acid. Res.", vol. 12, 1984, UNIVERSITY OF WISCONSIN BIOTECHNOLOGY CENTER, pages: 387 - 395
DOUGLAS W. SMITH: "Biocomputing: Informatics and Genome Projects", 1993, ACADEMIC PRESS
GIBSON ET AL.: "Enzymatic assembly of DNA molecules up to several hundred kilobases", NAT METHODS., vol. 6, no. 5, May 2009 (2009-05-01), pages 343 - 5, XP055224105, DOI: 10.1038/nmeth.1318
GUNNAR VON HEINJE: "Sequence Analysis in Molecular Biology: Treasure Trove or Trivial Pursuit", 1987, ACADEMIC PRESS
LI ET AL.: "Dynamic multicolor protein labeling in living cells", CHEM SCI, vol. 8, no. 8, 1 August 2017 (2017-08-01), pages 5598 - 5605
OHTSUKA ET AL., J. BIOL. CHEM., vol. 260, 1985, pages 2605 - 2608
PLAMONT ET AL.: "Small fluorescence-activating and absorption-shifting tag for tunable protein imaging in vivo", PROC NATL ACAD SCI USA., vol. 113, no. 3, 19 January 2016 (2016-01-19), pages 497 - 502, XP002794322, DOI: 10.1073/pnas.1513094113
ROSSOLINI ET AL., MOL. CELL. PROBES, vol. 8, 1994, pages 91 - 98
ZENG GUIHUA ET AL: "Self-Reporting Chemically Induced Protein Proximity System Based on a Malachite Green Derivative and the L5** Fluorogen Activating Protein", vol. 29, no. 9, 19 September 2018 (2018-09-19), US, pages 3010 - 3015, XP093022350, ISSN: 1043-1802, Retrieved from the Internet <URL:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6233874/pdf/nihms-986260.pdf> DOI: 10.1021/acs.bioconjchem.8b00415 *

Similar Documents

Publication Publication Date Title
Lönn et al. Enhancing endosomal escape for intracellular delivery of macromolecular biologic therapeutics
Ma et al. Structure of the substrate-engaged SecA-SecY protein translocation machine
Milech et al. GFP-complementation assay to detect functional CPP and protein delivery into living cells
Peier et al. NanoClick: a high throughput, target-agnostic peptide cell permeability assay
Munch et al. Extension of helix 12 in Munc18-1 induces vesicle priming
Fuchs et al. Arginine grafting to endow cell permeability
Choi et al. NSF-mediated disassembly of on-and off-pathway SNARE complexes and inhibition by complexin
US20160146786A1 (en) Method of monitoring cellular trafficking of peptides
US20210087238A1 (en) Cell penetrating peptides and related compositions and methods
Allen et al. Hydrophobicity is a key determinant in the activity of arginine-rich cell penetrating peptides
Buyanova et al. Discovery of a cyclic cell-penetrating peptide with improved endosomal escape and cytosolic delivery efficiency
US20180246092A1 (en) Engineered red blood cell-based biosensors
Doh et al. MiniVIPER is a peptide tag for imaging and translocating proteins in cells
Xu et al. Kinesin-1 transports morphologically distinct intracellular virions during vaccinia infection
Sharma et al. Recombinant production, bioconjugation and membrane binding studies of Pn3a, a selective NaV1. 7 inhibitor
Rusch et al. Oligomeric states of the SecA and SecYEG core components of the bacterial Sec translocon
Jedlitzke et al. A light‐activatable photocaged variant of the ultra‐high affinity ALFA‐tag nanobody
Diaz et al. Elucidating the Impact of Payload Conjugation on the Cell-Penetrating Efficiency of the Endosomal Escape Peptide dfTAT: Implications for Future Designs for CPP-Based Delivery Systems
Suyama et al. Orthogonal Versatile Interacting Peptide Tags for Imaging Cellular Proteins
Shen et al. A general method for chemogenetic control of peptide function
WO2024052353A1 (en) Proximity inducing system and uses thereof
EP3622292B1 (en) Genetically encoded potassium ion indicators
US9400249B2 (en) Detection of biopolymer interactions, cancer cells, and pathogens using split-supercharged GFP
Mann et al. Enhancement of muramyl dipeptide‐dependent NOD2 activity by a self‐derived peptide
Shen et al. Rational Engineering of an Improved Genetically Encoded pH Sensor Based on Superecliptic pHluorin

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23767848

Country of ref document: EP

Kind code of ref document: A1