FUNGAL CHROMOSOME-END KNOCKOFF STRATEGY
This application claims priority from U.S. Provisional Application Ser. No. 62/346,728, filed Jun. 7, 2016, the entire disclosure of which is incorporated herein by this reference. This invention was made with government support under Agriculture and Food Research Initiative grant 2012-67013-19384 awarded by the U.S. Department of Agriculture. The government has certain rights in the invention. The presently-disclosed subject matter relates to compositions and methods for eliminating target genes in endophyte and other fungi strains. Tall fescue ( According to their complexity, ergot alkaloids can be classified in three groups: clavine alkaloids, lysergic acid and its simple amides, and the notoriously toxic ergopeptines. Their biosynthesis proceeds through clavines to lysergic acid, then to the ergopeptines, such as ergovaline produced by Intensive surveys of tall fescue in Europe and North Africa have identified some Moroccan ecotypes that lack ergot alkaloids, and certain non-toxic endophytes (NTE) have been cultured from those ecotypes and used to replace the CTE in order to produce novel cultivars. As expected, livestock performance on the novel cultivars is significantly better than on cultivars with their original CTE, and not significantly different than on tall fescue lacking endophyte. However, the NTE strains currently used for novel cultivars are derived from very different tall fescue ecotypes (summer dormant, Moroccan) than the northern (north European) ecotypes from which the summer active cultivars are derived, and in which the NTE are now being deployed. For that reason it is unsurprising that problems have been reported with the NTE strains in those cultivars. Some have exhibited less stability than the CTE strains or appear less effective against root-parasitic nematodes, a potentially important limitation to productivity and drought tolerance in the southeastern U.S. [6, 9, 12]. The reasons that Moroccan endophytes exhibit inconsistent anti-nematode activity are unknown, but conceivably relate to lower production of loline alkaloids, which are translocated to roots and can affect nematodes. Thus, although the search for existing NTE strains has been a commercial and agricultural success, they may not be the optimal choices for mixing and matching plant and symbiont strains for some U.S. pasturelands. Thus, there exists a need for NTE strains for novel cultivars in tall fescue ecotypes for U.S. pasturelands, and an approach for generating such NTE strains. The presently-disclosed subject matter meets some or all of the above-identified needs, as will become evident to those of ordinary skill in the art after a study of information provided in this document. This summary describes several embodiments of the presently-disclosed subject matter, and in many cases lists variations and permutations of these embodiments. This summary is merely exemplary of the numerous and varied embodiments. Mention of one or more representative features of a given embodiment is likewise exemplary. Such an embodiment can typically exist with or without the feature(s) mentioned; likewise, those features can be applied to other embodiments of the presently-disclosed subject matter, whether listed in this summary or not. To avoid excessive repetition, this summary does not list or suggest all possible combinations of such features. The presently-disclosed subject matter includes methods for eliminating target genes in endophyte and other fungi strains, and the strains produced by the disclosed methods. In particular, a chromosome knock-off strategy is employed for elimination of toxin genes in the endophyte and other fungi strains. The presently-disclosed subject matter, in one aspect, comprises an isolated nucleic acid molecule that includes a nucleic acid sequence of a selectable marker gene, a nucleic acid sequence of one or more telomere repeats, and a nucleic acid sequence including a variant or fragment of a wild type gene of an ergot alkaloid gene cluster. In some embodiments, the nucleic acid sequence of a selectable marker gene where the marker gene is hph, neomycin phosphotransferase II gene, or a gene for resistance to phleomycin or bialophos. In some embodiments, the nucleic acid sequence of a series of telomere repeats includes a sequence of a telomere repeat represented by one of SEQ ID NOs: 31-54 and the series includes about 3 to about 26 or more telomere repeats. In some embodiments, the recombination sequence includes a variant or fragment of a wild type gene of an ergot alkaloid gene cluster selected from EAS1 or EAS2 in Vectors for use in modifying target genes in an endophyte or fungus are also disclosed herein. In some embodiments, the endophyte or fungus is selected from an Methods for eliminating target gene(s) in a fungus are also disclosed herein. In particular, methods for eliminating some or substantially all of a target gene located in a subterminal region of a chromosome in a fungus are provided. In some embodiments, the fungus is selected from an In some embodiments of the presently-disclosed methods, a portion of the ergot alkaloid gene cluster is eliminated sufficient to affect the expression of toxins in the fungus. In some embodiments, the expression of toxins in the fungus is reduced by at least 50%. Endophyte and/or fungus strains produced by the methods disclosed herein are also provided. In some embodiments, the fungus strains are “knockoffs,” and include a recombinant nucleic acid sequence substantially lacking the target gene(s) and/or sequence, which in some embodiments, is an ergot alkaloid gene cluster. In some embodiments, the endophyte strain lacks a gene for dimethylallytryptophan synthase (dmaW) or lysergylpeptide synthetase (lpsB). In some embodiments, the fungus strain is from The novel features of the invention are set forth with particularity in the following description and in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention(s) are used, and the accompanying drawings of which: SEQ ID NO: 1 is the nucleic acid sequence encoding the e7480 contig 32884. SEQ ID NO: 2 is the nucleic acid sequence encoding the e7479 contig 40627. SEQ ID NO: 3 is the nucleic acid sequence from lpsA1 used in pKAE329. SEQ ID NO: 4 is the polynucleotide primer from the polylinkerDdeI and polylinkerSpeI. SEQ ID NO: 5 is the polynucleotide primer lpsA1SpeI(f). SEQ ID NO: 6 is the polynucleotide primer lpsA1MluI(r). SEQ ID NO: 7 is the polynucleotide primer from the polylinkerDdeI. SEQ ID NO: 8 is the polynucleotide primer from the polylinkerSpeI. SEQ ID NO: 9 is the polynucleotide primer dmaWe19copy2(+)-1d. SEQ ID NO: 10 is the polynucleotide primer dmaWe19copy2-(−)-5u. SEQ ID NO: 11 is the polynucleotide primer dmaw1(f) SEQ ID NO: 12 is the polynucleotide primer dmaWe19(−)-10 SEQ ID NO: 13 is the polynucleotide primer 144lpsBDraI(f2). SEQ ID NO: 14 is the polynucleotide primer 144lpsB(r). SEQ ID NO: 15 is the polynucleotide primer 215hphlpsB(f) SEQ ID NO: 16 is the polynucleotide primer 215lpsBhph(r) SEQ ID NO: 17 is the polynucleotide primer hph.3d. SEQ ID NO: 18 is the polynucleotide primer hph.3u SEQ ID NO: 19 is the polynucleotide primer RTq-E.c.easE(f) SEQ ID NO: 20 is the polynucleotide primer RTq-E.c.easE(r) SEQ ID NO: 21 is the polynucleotide primer RTq-E.c.easA(f) SEQ ID NO: 22 is the polynucleotide primer RTq-E.c.easA(r) SEQ ID NO: 23 is the polynucleotide primer RTq-E.c.easC(f). SEQ ID NO: 24 is the polynucleotide primer RTq-E.c.easC(r). SEQ ID NO: 25 is the polynucleotide primer RTq-E.c.easF(f). SEQ ID NO: 26 is the polynucleotide primer RTq-E.c.easF(r). SEQ ID NO: 27 is the polynucleotide primer RTq-E.c.easG(f). SEQ ID NO: 28 is the polynucleotide primer RTq-E.c.easG(r). SEQ II) NO: 29 is the polynucleotide primer oligoscreen(f). SEQ ID NO: 30 is the polynucleotide primer lpsAoligo(r) SEQ ID NO: 31 is the nucleic acid sequence CCCTAA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 32 is the nucleic acid sequence CCCTAATGTTCA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 33 is the nucleic acid sequence CCCTAAC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 34 is the nucleic acid sequence CTAACC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 35 is the nucleic acid sequence TAACCC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 36 is the nucleic acid sequence AACCCT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 37 is the nucleic acid sequence ACCCTA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 38 is the nucleic acid sequence TAAGGG which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 39 is the nucleic acid sequence AAGGGT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 40 is the nucleic acid sequence AGGGTA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 41 is the nucleic acid sequence GGGTAA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 42 is the nucleic acid sequence GGTAAG which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 43 is the nucleic acid sequence GTAAGG which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 44 is the nucleic acid sequence CCTAATGTTCAC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 45 is the nucleic acid sequence CTAATGTTCACC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 46 is the nucleic acid sequence TAATGTTCACCC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 47 is the nucleic acid sequence AATGTTCACCCT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 48 is the nucleic acid sequence ATGTTCACCCTA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 49 is the nucleic acid sequence TGTTCACCCTAA which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 50 is the nucleic acid sequence GTTCACCCTAAT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 51 is the nucleic acid sequence TTCACCCTAATG which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 52 is the nucleic acid sequence TCACCCTAATGT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 53 is the nucleic acid sequence CACCCTAATGTT which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 54 is the nucleic acid sequence ACCCTAATGTTC which is a telomere repeat that can be repeated about 3 to about 26 times. SEQ ID NO: 55 is a nucleic acid sequence canonical telomere repeat array of the scaffold of the e19 assembly (GenBank accession KC989609.1) downstream of lpsB1. The details of one or more embodiments of the presently-disclosed subject matter are set forth in this document. Modifications to embodiments described in this document, and other embodiments, will be evident to those of ordinary skill in the art after a study of the information provided in this document. The information provided in this document, and particularly the specific details of the described exemplary embodiments, is provided primarily for clearness of understanding and no unnecessary limitations are to be understood therefrom. In case of conflict, the specification of this document, including definitions, will control. The presently-disclosed subject matter meets some or all of the above-identified needs, as will become evident to those of ordinary skill in the art after a study of information provided in this document. To avoid excessive repetition, this Description does not list or suggest all possible combinations of such features. Presented herein is a novel approach to generate non-toxic endophyte strains, based on the tendency for toxin genes to be located near chromosome ends. This approach was also designed to abolish all exogenous genes including the selectable marker used in endophyte transformation. The approach was used to eliminate the telomere-associated EAS1 gene cluster from the genome of a Continental European The technique was tested on In some embodiments, an isolated nucleic acid molecule is provided including a nucleic acid sequence of a marker gene, a series telomere repeats and a recombination gene sequence comprising a variant or fragment of a wild type gene of an ergot alkaloid gene cluster or a sequence that has at least 50% identity to the sequence of any one of a wild type gene of an ergot alkaloid gene cluster. The variant or fragment of a wild type gene can contain at least one substitution and/or at least one deletion modification relative to the sequence. In some embodiments, an isolated nucleic acid molecule is provided with a nucleic acid sequence comprising a nucleic acid sequence of a selectable marker gene selected from hph (GenBank accession No. X03615.1), neomycin phosphotransferase II gene (GenBank accession No. ACH99098.1), or gene for resistance to phleomycin (GenBank accession No. X52869.1) and/or bialophos (GenBank accession No. X05822.1); a series of telomere repeats selected from3-26 which can be represented as [SEQ ID NO:31]3-26; and3-26 which can be represented as [SEQ ID NO:32]3-26 or a sequence of any one of SEQ ID NOs. 31-54 with about 3 to about 26 or more telomere repeats; and a recombination sequence comprising a variant or fragment of a wild type gene of an ergot alkaloid gene cluster selected from EAS1 [GenBank Accession Nos. KC989607, KC989608, and KC989609 for strain e19, and KC989569 for strain e4163] or EAS2 [GenBank Accession Nos. KC989610 and KC98961 I for strain e19, and KC989570 for strain e4163] in The With regard to the marker gene, in some embodiments, the marker gene is a fungal-active hygromycin phosphotransferase gene (hph) selectable marker. Other selectable markers, such as other antibiotic resistance markers, auxotrophic complementation markers and substrate utilization markers can be used. For example, antibiotic resistance markers include genes for resistance to geneticin (GenBank accession No. ACH99098.1), phleomycin (GenBank accession No. X52869.1) or bialophos (GenBank accession No. X05822.1). An auxotrophic complementation marker can be, for example, the gene for orotidine-5′-phosphate decarboxylase, which is required for uracil biosynthesis. A substrate utilization marker can be, for example, a gene for acetamidase, which can confer the ability to grow using acetamide as a sole carbon source or nitrogen reductase. The telomere repeat can depend on the fungus. In some embodiments, the telomere repeat comprises SEQ ID NO:31 CCCTAA, where the number of tandem repeats can range from 3-26. Different fungal species can have different telomere repeat units, such a SEQ ID NO:32 CCCTAATGTTCA in Regarding gene clusters that can be targeted utilizing the methods of the present invention, in some embodiments the aflatoxin clusters in A vector (or kit including a vector) including the nucleic acid molecules disclosed herein is also provided and includes the isolated nucleic acid sequence of a marker gene, a nucleic acid sequence of one or more telomere repeats and a recombination sequence. A method for eliminating a target gene(s) in an endophyte is disclosed herein. The method includes the steps of integrating a nucleic acid molecule of a marker gene, a series of telomere repeats and a recombination sequence into the endophyte by homologous recombination. In some embodiments, the nucleic acid sequence is provided in a plasmid that is linearized on the telomere-distal side of the recombination sequence, and introduced into fungal protoplasts or cells. The recombination sequence targets gene(s) located in the subterminal region of a chromosome, and the recombination replaces the target gene(s). The series of telomere repeats directs spontaneous loss of the nucleic acid sequence downstream of the recombinant telomere repeats, followed by stabilization of a new chromosome end including the recombinant telomere repeats. In some embodiments, the methods eliminate 70%, 75% or substantially all of the ergot alkaloid gene cluster. In some embodiments, a portion of the ergot alkaloid gene cluster is eliminated sufficient to affect the expression of one or more toxins in the endophyte or fungus. In some embodiments, the expression of one or more toxins in the fungus is reduced by at least 50%, 60% or 70%. The present methods to eliminate target genes utilize homologous recombination of the exogenous DNA with the genome. In one embodiment disclosed herein, lpsA1 in Endophyte and fungal strains can be produced by the methods disclosed herein, and include, in some embodiments, a recombinant nucleic acid sequence substantially lacking the ergot alkaloid gene cluster. In some embodiments, the gene dmaW or lpsB is absent from the resulting fungal strain. In some embodiments the fungal strain is any strain of Also disclosed herein is a synthetic combination of the endophyte strain as disclosed herein with a fescue grass or perennial ryegrass. In some embodiments, the fescue grass is a tall fescue grass. Regarding sequences to knock out or eliminate, to completely eliminate ergot alkaloid production, in one embodiment, the target gene to eliminate is dmaW. However, one of skill can appreciate that when certain other ergot alkaloid genes are targeted, some products will be expressed and others eliminated. Because of the functional duplicates in the genome of Sequences of the junctions of the telomere repeats, oligotag, and remnant of the recombination sequence (a fragment of the lpsA gene), also described as terminal contigs in two EAS1 knockoff derivatives of Similarly, the e7479-1 ΔEAS1 assembly included a 228-bp contig (SEQ ID NO:2) with telomere sequence at positions 1-106, the oligotag at positions 107-151, and partial lpsA1 sequence at positions 153-228. Thus, both strains had assembled contigs with the sequences expected for the truncated chromosome end. The genome sequences also demonstrated, by virtue of their absence, that other genes of the cluster were missing in those genomes. The PCR test for one such gene, dmaW2, in The presently-disclosed subject matter further includes a kit that comprises a vector and instructions for using the vector according to the methods disclosed herein. In addition to the vector, the subject kits will further include instructions for practicing the subject methods. These instructions may be present in the subject kits in a variety of forms, one or more of which may be present in the kit. One form in which these instructions may be present is as printed information on a suitable medium or substrate, e.g. a piece or pieces of paper on which the information is printed, in the packaging of the kit, in a package insert, etc. Yet another means would be a computer readable medium, e.g. diskette, CD, etc., on which the information has been recorded. Yet another means that may be present is a website address which may be used via the internet to access the information at a removed site. Any convenient means may be present in the kits. Methods The presently-disclosed subject matter further includes methods. A method of eliminating target gene(s) in an endophyte is provided. In some embodiments, the method includes integrating a nucleic acid molecule including a marker gene, one or more telomere repeats and a recombination sequence comprising a variant or fragment of a wild type gene of an ergot alkaloid gene cluster, a cyclopiazonic acid, an aflatoxin gene cluster, or an indole-diterpene gene cluster; or a recombination sequence with 90% identity to the nucleotide sequence of a wild type gene of said clusters with at least one substitution and/or at least one deletion modification relative to the gene(s) or a sequence with at least 50% identity to the amino acid of said gene, recombining to replace the target gene(s), wherein the nucleic acid sequence of one or more telomere repeats directs spontaneous loss of the nucleic acid sequence downstream of the recombinant telomere repeats followed by stabilization of a new chromosome end including the recombinant telomere repeats. A “function-conservative variant” also includes a polypeptide or enzyme which has at least 60% amino acid identity as determined by BLAST or FASTA algorithms, preferably at least 75%, most preferably at least 85%, and even more preferably at least 90%, and which has the same or substantially similar properties or functions as the native or parent protein or enzyme to which it is compared. As used herein, the term “homologous” in all its grammatical forms and spelling variations refers to the relationship between proteins that possess a “common evolutionary origin,” including proteins from superfamilies (e.g., the immunoglobulin superfamily) and homologous proteins from different species (e.g., myosin light chain, etc.) Reeck et al.. Such proteins (and their encoding genes) have sequence homology, as reflected by their sequence similarity, whether in terms of percent similarity or the presence of specific residues or motifs at conserved positions. Accordingly, the term “sequence similarity” in all its grammatical forms refers to the degree of identity or correspondence between nucleic acid or amino acid sequences of proteins that may or may not share a common evolutionary origin (see Reeck et al.). However, in common usage and in the instant application, the term “homologous,” when modified with an adverb such as “highly,” may refer to sequence similarity and may or may not relate to a common evolutionary origin. In a specific embodiment, two DNA sequences are “substantially homologous” or “substantially similar” when at least about 30%, 70%, 80% and most preferably at least about 90 or 95%) of the nucleotides match over the defined length of the DNA sequences, as determined by sequence comparison algorithms, such as BLAST, FASTA, DNA Strider, T-COFFEE, MUSCLE, MAFFT, SATE etc. Sequences that are substantially homologous can be identified by comparing the sequences using standard software available in sequence data banks, or in a Southern hybridization experiment under, for example, stringent conditions as defined for that particular system. Similarly, in a particular embodiment, two amino acid sequences are “substantially homologous” or “substantially similar” when greater than 80% of the amino acids are identical, or greater than about 90% are similar (functionally identical). Preferably, the similar or homologous sequences are identified by alignment using, for example, the GCG (Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison, Wis.) pileup program, or any of the programs described above (BLAST, FASTA, etc.). In some embodiments, the methods produce an endophyte strain comprising a nucleic acid sequence substantially lacking all of a gene cluster. As used herein, when substantially all of the gene cluster is eliminated, at least 80%, at least 90% or at least 95% of the gene cluster sequence is absent or removed. In some embodiments, the removal of a portion of a gene sequence sufficient to affect the function of the gene and/or gene cluster is defined as a loss in the relative expression of the gene. For example, in the removal or elimination of a portion or all of a gene and/or gene cluster, expression of the gene or some or all of the genes in the gene cluster is reduced, substantially eliminated, or eliminated (as compared to an endophyte strain in which the gene and/or gene cluster has not been reduced substantially eliminated, or eliminated). In some embodiments of the presently disclosed subject matter, for example, expression of genes in the ergot alkaloid gene cluster are reduced. In some embodiments, the relative expression of the genes can be measured before and after the removal of a portion of a gene sequence. In some embodiments, measurement of the relative expression of the genes can be measured after insertion of the genes contained in knockoff transformants of the endophyte strain. In some embodiments, a plant containing the transformed endophyte strain can be measured for relative expression of the genes. Additionally, absence of genes in the genome sequences can be verified by, for example, PCR. Substantially homologous nucleic acid molecules specifically hybridize typically at moderate stringency or at high stringency all along the length of the nucleic acid molecule of interest. The term “binding affinity” as used herein refers to a measure of the capacity of a probe to hybridize to a target polynucleotide with specificity. Thus, the probe comprises a polynucleotide sequence that is complementary, or essentially complementary, to at least a portion of the target polynucleotide sequence. Nucleic acid sequences which are “complementary” are those which are base-pairing according to the standard Watson-Crick complementarity rules. As used herein, the term “complementary sequences” means nucleic acid sequences which are substantially complementary, as can be assessed by the same nucleotide comparison set forth above, or as defined as being capable of hybridizing to the nucleic acid segment in question under relatively stringent conditions such as those described herein. A particular example of a contemplated complementary nucleic acid segment is an antisense oligonucleotide. The phrase “substantially hybridizes” refers to complementary hybridization between a probe nucleic acid molecule and a substantially identical target nucleic acid molecule as defined herein. Substantial hybridization is generally permitted by reducing the stringency of the hybridization conditions using art-recognized techniques. “Stringent hybridization conditions” and “stringent hybridization wash conditions” in the context of nucleic acid hybridization experiments are both sequence- and environment-dependent. Longer sequences hybridize specifically at higher temperatures. Generally, highly stringent hybridization and wash conditions are selected to be about 3° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe. Typically, under “stringent conditions” a probe hybridizes specifically to its target sequence, but to no other sequences. While the terms used herein are believed to be well understood by one of ordinary skill in the art, definitions are set forth to facilitate explanation of the presently-disclosed subject matter. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the presently-disclosed subject matter belongs. Although any methods, devices, and materials similar or equivalent to those described herein can be used in the practice or testing of the presently-disclosed subject matter, representative methods, devices, and materials are now described. Following long-standing patent law convention, the terms “a”, “an”, and “the” refer to “one or more” when used in this application, including the claims. Thus, for example, reference to “a cell” includes a plurality of such cells, and so forth. Unless otherwise indicated, all numbers expressing quantities of ingredients, properties such as reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term “about”. Accordingly, unless indicated to the contrary, the numerical parameters set forth in this specification and claims are approximations that can vary depending upon the desired properties sought to be obtained by the presently-disclosed subject matter. As used herein, the term “about,” when referring to a value or to an amount of mass, weight, time, volume, concentration or percentage is meant to encompass variations of in some embodiments ±20%, in some embodiments ±10%, in some embodiments ±5%, in some embodiments ±1%, in some embodiments ±0.5%, and in some embodiments ±0.1% from the specified amount, as such variations are appropriate to perform the disclosed method. As used herein, ranges can be expressed as from “about” one particular value, and/or to “about” another particular value. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed. The presently-disclosed subject matter is further illustrated by the following specific but non-limiting examples. The following examples may include compilations of data that are representative of data gathered at various times during the course of development and experimentation related to the present invention. Molecular genetic techniques to precisely eliminate genes in asexual filamentous fungi require the introduction of a marker gene into the target genome. A novel strategy was developed to eliminate from fungal genomes genes or clusters of genes located in subterminal regions of chromosomes, then eliminate the marker gene and vector backbone used in the transformation procedure. Because many toxin gene clusters are subterminal, this method is particularly suited to generating non-toxigenic fungal strains. The technique was tested on EAS1 cluster in Genome sequence of Identification of EAS gene clusters in Identification of putative MAST chromosome-end knockoff strains. Transformation plasmid pKAES329 was constructed with a large segment of lpsA1 sequence to target homologous integration, an hph selectable marker modified for expression in fungi to confer resistance to hygromycin B, and a telomere repeat array adjacent to the lpsA1 sequence to eventually separate hph and the rest of the vector backbone from the lpsA1 sequence. The lpsA1 gene fragment of SEQ ID NO:3 was used in chromosome-end knockoff vector pKAES329. In a screen of 192 hygromycin B-resistant transformants of Tests for spontaneous losses of vector sequences. For each of the putative knockoff strains, three single spores were randomly chosen and tested by the same PCR screen, confirming the apparent loss of dmaW1 and retention of dmaW2 ( To confirm loss of hph, the single-spore isolates were tested for sensitivity to hygromycin B. All isolates that tested negative for hph by PCR were sensitive to the antibiotic, whereas the two hph-positive single-spore isolates from e7481 ΔEAS1 retained the ability to survive on selective medium. Similarly, all spores derived from e7575 ΔdmaW2 ΔEAS1 failed to grow on medium with hygromycin B ( Genome sequencing of knockoff strains. Inspection of assembled genome sequences of e7479-1 ΔEAS1 and e7480-1 6EAS1 by BLASTn revealed the EAS2-cluster genes, but none of the EAS1-cluster genes, as expected for EAS1 chromosome-end knockoffs. The BLAST queries for the genomes are provided as Appendix A. Furthermore, there were no other sequences from the transformation vector except the expected lpsA1 remnant and the 45-bp noncoding oligotag sequence. Inspection of e7479-1 and e7480-1 genome sequence assemblies confirmed the presence and location of the oligotag as expected. The e7480-1 genome assembly included a 181-bp contig of SEQ ID NO:1. This has telomere repeats at positions 1-21, the oligotag at positions 19-63 (the first three bp overlapping with the telomere sequence), and lpsA1 sequence at positions 65-181. Similarly, the e7479-1 assembly included a 228-bp contig with telomere sequence at positions 1-109, the oligotag at positions 107-151, and partial lpsA1 sequence at positions 153-228. Alkaloid profiles in symbio. Ergovaline and ergine were undetected in the plants with serial numbers 6105 and 6106, which were symbiotic with the ΔEAS1 knockoff strains. However, ergotryptamine, an early pathway spur product and chanoclavine I were at higher concentrations than in the plants with the wild-type strain (Table 1). Plants with the lpsB-complemented strain, e7605 ΔEAS1 lpsB, accumulated chanoclavine I, ergine and ergovaline to concentrations similar to those of plants with the wild-type strain, but chanoclavine I and ergotryptamine were also observed in those plants. No ergot alkaloids were detected in plants symbiotic with e7575-1 ΔdmaW2 ΔEAS1. Plants symbiotic with the knockoff strains had profiles of loline alkaloid similar to those with the wild-type strain e19 (data not shown). Symbiotic stability of the knockoff strains. After seedling inoculation with the endophyte strains the tall fescue plants were grown in the greenhouse for ca. one year, then planted in the field where they vernalized over winter and then set seeds. In seed tests, strong PCR-positive results indicated that at least 96% had Discussion Many years of research have established that CTE Since sequences of the EAS2-cluster genes suggested that all could be functional except lpsB2, we expected that plants with the ΔEAS1 knockoff strains would accumulate lysergic acid as previously shown for an lpsA knockout strain of a perennial ryegrass endophyte [24, 25]. However, the ΔEAS1 strains produced no detectable lysergic acid or even the intermediate tetracyclic clavines. Instead, the ergot-alkaloid profiles were dominated by ergotryptamine and chanoclavine I, similar to the profile previously observed when the four early-pathway genes (dmaW, easF, easC and easE) from Developing and deploying the currently disclosed technology to generate non-transgenic, yet genetically altered strains, avoids risk either real or perceived associated with transgenic organisms. Existing methods for surgical genetic manipulation of asexual fungi require introduction of marker genes derived from other organisms. Such exogenous (“foreign”) genes pose public and regulatory concerns, especially since these endophytes are to be deployed in cultivars that are meant to persist in pastures for decades. Once planted, it may be difficult or impossible to eradicate the plant with its modified endophyte should concerns arise about a transgene in the endophyte genome. Therefore, it is important to address up front the regulatory and public concerns associated with genetically modified organisms. The nature of the presently disclosed knockoff strains was reviewed by the Animal and Plant Health Inspection Service (APHIS) and determined not to fall under their regulation. It is noted that the alkaloid profiles of plants with the knockoff strains are similar to those of some naturally occurring grass- Small animal and livestock feeding studies would determine the effects, if any, of the simple ergot alkaloids associated with the ΔEAS1 knockoff strains. Future plant performance studies in the field, during which the specific alterations in the genome, including the 45-bp oligotag, will facilitate monitoring strain persistence in plant lines and field plots, and possible movement in agroecosystems. Materials and Methods Biological materials. The wild type Because Molecular methods. Fungal DNA was isolated from fresh mycelium using ZR Fungal/Bacterial DNA MiniPrep kit (Zymo Research, Irvine, Calif., USA), or using Geno/Grinder 2000 (SPEX CertiPrep, Metuchen, N.J., USA) and DNeasy 96 Plant Kit (Qiagen, Valencia, Calif., USA). Plasmid DNA was isolated from bacterial cultures using the ZR Plasmid Miniprep-Classic kit (Zymo Research, Irvine, Calif., USA). The mRNA was isolated from plant material using RNeasy Plant Mini Kit (Qiagen). PCR screens were performed using AmpliTaq Gold, and AmpliTaq Gold PCR buffer provided by the manufacturer (Applied Biosystems, Foster City, Calif., USA). For vector construction, the PCR amplifications were performed with Phusion Hot Start High-Fidelity DNA Polymerase (Thermo Scientific, Ratastie, Vantaa, Finland) with HF buffer (with 1.5 mM MgCl2) provided by the manufacturer. The temperature conditions were 98° C. for 3 min, followed by 35 cycles of 98° C. for 10 s, 62° C. for 10 s, and 72° C. for 7 min, then a final 5 min incubation at 72° C. The oligonucleotides used in this study were from Integrated DNA Technologies (Coralville, Iowa, USA), and are listed in Table 3. Genome analysis. Genome sequencing and assembly were performed at the Advanced Genetic Technologies Center of the University of Kentucky, and all genome assemblies were done with Newbler 2.8 (Roche Diagnostics/454 Life Sciences Corporation). The genome of the wild-type The genome of e7479 ΔEAS1 was sequenced by a combination of Ion Torrent PGM (Life Technologies) and 454 pyrosequencing (Roche Diagnostics) to give 2,790,692 Ion Torrent reads totaling 454,053,270 nt, and 855,760 extended pyrosequencing reads totaling 628,200,560 nt. Assuming a 99.6 Mb genome size (from the scaffold assembly of e19), this was 10.9-fold coverage. Assembly with Newbler 2.8 gave 46,534 contigs totaling 87,313,881 bp, of which 32,285 large contigs 500 bp) totaled 83,439,153 with N50=3871 bp. The genome of e7480 ΔEAS1 was sequenced on the MiSeq platform (Illumina, San Diego, Calif., USA) to give 22,358,620 reads at 250 cycles, totaling 4,711,677,366 high-quality bases. This was an estimated 47-fold coverage. Assembly with CLC Genome Workbench 8.0 (CLC Bio LLC, Waltham, Mass., USA), with default parameters, gave 41,794 contigs totaling 70,103,076 bp, N50=9061 bp. Approximately equal base representations of A, T, G and C in the total assembly indicated that the AT-rich intergenic regions were underrepresented in this assembly. Plasmid constructs. A previously cloned telomere repeat array from To construct the lpsB-complementation plasmid pKAES362, pKAES215 was digested with SpeI, end-repaired using End-it DNA End Repair kit (Epicentre, Madison, Wis., USA) and then digested with XbaI. The digested vector was ligated, using the Fast-Link DNA ligation kit (Epicentre), to a fragment containing the lpsB gene and its native promoter (from Fungal transformation. Screening of the knockoff and complementation transformants. To identify putative ΔEAS1 knockoffs the fungal transformants were screened by PCR as follows. DNA was extracted with the DNeasy 96 Plant Kit (Qiagen, Valencia, Calif., USA) and screened by PCR with primers specific for dmaW1 [dmaW1(f) and dmaWe19(−)-10] and dmaW2 (dmaWe19copy2.1d and dmaWe19copy2.5u). All of the putative knockoffs were also screened for the presence or absence of hph by PCR with the primer pair hph.3d and hph.3u. For complementation of the ΔEAS1 knockoff strain the transformants were screened for integration of the lpsB-containing plasmid by PCR with the primer pair 215hphlpsB(f) and 215lpsBhph(r). The PCR reactions were carried out in 25 μl reaction mixtures with 5-10 ng DNA template, 200 μM each dNTP, 0.2 μM each primer, 2.5 units AmpliTaq Gold, and AmpliTaq Gold PCR buffer with MgCl2 (1.5 mM final conc.) provided by the manufacturer (Applied Biosystems, Foster City, Calif., USA), in a model 2720 Thermal Cycler (Applied Biosystems). The temperature regime was as follows: 9 min at 95° C., 35 cycles of 94° C. for 30 s, annealing temperature (61° C. for dmaW2, 57° C. for lpsB-hph, 59° C. for dmaW1 and hph) for 35 s, 72° C. for 2 min, and then a final 7 min incubation at 72° C. Antibiotic sensitivity tests. Mycelium of each putative ΔEAS1 knockoff strain was ground in 500 μl sterile water and aliquots were spread on PDA with and without hygromycin B (50 μg/ml) in wells of Falcon 6-well plates (Becton Dickinson and Co., Franklin Lakes, N.J.). The plates were incubated 4 wk at 21° C. Ergot and loline alkaloid analyses. Alkaloid profiles were determined from one-year-old plants symbiotic with the Seed transmission tests. The plants of elite breeding line KYFA0601 symbiotic with Throughout this document, various references are mentioned. All such references are incorporated herein by reference, including the references set forth in the following list: Non-toxigenic fungal strains, and methods of making and use thereof, are provided and have utility as endophytes in forage crops, and as strains that can outcompete toxigenic strains in forage and food crops. 1. An isolated nucleic acid molecule, comprising:
(a) a marker gene; (b) a series of telomere repeats, wherein each telomere repeat has the nucleic acid sequence of a telomere repeat selected from SEQ ID NOs: 31-54, wherein the series includes about 3 to about 26 telomere repeats; and (c) a recombination sequence comprising a variant or fragment of a wild type gene of a gene cluster selected from: an ergot alkaloid gene cluster EAS1, an ergot alkaloid gene cluster EAS2, an indole-diterpene gene cluster (IDT), and aflatoxin gene clusters. 2. The nucleic acid molecule of 3. The nucleic acid molecule of 4. The nucleic acid molecule of 5. A vector, comprising the isolated nucleic acid molecule of 6. The vector of 7. A kit, comprising the vector of 8. A method for eliminating some or substantially all of one or more target genes located in a subterminal region of a chromosome in a fungus selected from an integrating a nucleic acid molecule of 9. The method of 10. The method of 11. The method of 12. A fungus strain produced by the method of 13. The fungus strain of 14. The fungus strain of 15. The fungus strain of 16. The fungus strain of 17. The fungus strain of 18. The fungus strain of 19. A fungus strain comprising a nucleic acid sequence of any one of SEQ ID NOS: 1-2. 20. A synthetic combination of the fungus strain of RELATED APPLICATIONS
GOVERNMENT INTEREST
TECHNICAL FIELD
INTRODUCTION
SUMMARY
BRIEF DESCRIPTION OF THE DRAWINGS
BRIEF DESCRIPTION OF THE SEQUENCE LISTING
DESCRIPTION OF EXEMPLARY EMBODIMENTS
EXAMPLES
Example 1
Example 2
Example 3
Example 4
Ergot-alkaloid profiles in tall fescue pseudostems with endophyte strains.a Endophyte Ergovaline Ergine Ergotryptamine Chanoclavine I Endophyte strain genotype (nmol/g) (nmol/g) (nmol/g) (nmol/g) e19 WT 12.6 ± 4.1 2.7 ± 1.6 2.7 ± 2.1 3.0 ± 2.2 e7479-1 ΔEAS1 0.0 0.0 86.7 ± 74 5.6 ± 3.7 e7480-1 ΔEAS1 0.0 0.0 100.0 ± 41 5.3 ± 1.2 e7605 ΔEAS1 lpsB 12.1 ± 6.8 2.1 ± 1.3 112.7 ± 75 2.5 ± 1.3 aAlkaloid concentrations are estimated averages ± standard error of the mean for five replicates. Ergot alkaloid profile and estimated concentrations of first generation seeds. Endophyte Ergovaline Ergine Ergotryptamine Chanoclavine Plant seriesa genotype nmol/g nmol/g nmol/g nmol/g 6105 e7479-1 ΔEAS1 0.0 0.0 9.0 0.8 6105 e7479-1 ΔEAS1 0.0 0.0 8.7 0.8 6106 e7480-1 ΔEAS1 0.0 0.0 9.3 0.8 6106 e7480-1 ΔEAS1 0.0 0.0 6.4 0.7 6107 e19 WT 9.4 7.9 1.1 1.4 6107 e19 WT 10.3 7.9 1.5 1.8 aEach sample was a pool of seeds from five plants of the same series. Primers Used in the Study SEQ ID Primer name Sequence NO: lpsA1SpeI(f) GGGACTAGTTAAGAAGCGCTTACGCCGTTCC 5 lpsA1MluI(r) CACACGCGTAGCTGTCGTATGAAGGCACGAT 6 polylinkerDdeI /5′Phos/ 7 TAAGCTCGAGGCCATGATGGCCTTTAAAG TCTACGTACTCA polylinkerSpeI /5′Phos/ 8 CTAGTGAGTACGTAGACTTTAAAGGCCAT CATGGCCTCGAGC dmaWe19copy2 AGAAACAGACAGGGCTATTC 9 (+)-1d dmaWe19copy2- CTCGCCGGCATGCGTCAAAA 10 (−)-5u dmaw1(f) TTATTGGATGAAACCTTAGCTAGTTGG 11 dmaWe19(−)-10 CTCGCCGGCATGCGTCAAAT 12 144lpsBDraI(f2) CACTTTAAACCTAATGCACTACACTAAGACCCC 13 144lpsB(r) AATCTGGCCAACATGGTTCCCATG 14 215hphlpsB(f) GCTTGACAAACGCACCAAGTTATCG 15 215lpsBhph(r) TGTACACCACTTCAACGAGGCTTG 16 hph.3d CGAAGTTATCTCGACGGTATCG 17 hph.3u TCGGCGAGTACTTCTACACA 18 RTq-E.c.easE(f) TCCTTGCCACCAAGGCAGATTG 19 RTq-E.c.easE(r) ACATTGTCCACGGCAAGCCCTC 20 RTq-E.c.easA(f) CGTGCGGATAATGAAGGCGTCC 21 RTq-E.c.easA(r) CGATGAGAAATCCATTGGCACCG 22 RTq-E.c.easC(f) GGCATGGCAGTCAAGTTCTTCAC 23 RTq-E.c.easC(r) ACATTGGCTGTCCAAGTAGGGT 24 RTq-E.c.easF(f) CCCAGAACTTTCGTCATGTCCG 25 RTq-E.c.easF(r) ATCCCGTCCAGTGGCGGAAGTA 26 RTq-E.c.easG(f) TTGCCAAGACTCTCCATGAGAT 27 RTq-E.c.easG(r) ACCACGTCGGTCTTAATACAGCG 28 oligoscreen(f) GATGGCCTTTAAAGTCTACGTACTC 29 lpsAoligo(r) ATATCATGGCAACATTCAGCGCAC 30 REFERENCES
SEQUENCE LISTING: SEQ ID NO: 1: e7480 contig_32884 length = 181 TAACCCTAACCCTAACCCTAAGCTCGAGGCCATGATGGCCTTTAAAGTCT ACGTACTCACTAGTTAAGAAGCGCTTACGCCGTTCCACTTGTGCCTTTGA CTGGATGATGGATACAGATAGTAACTAACCGTGGACAGTATGATATTATT ATGCACGTGGATTCCAAACAACAATGTTACC SEQ ID NO: 2: e7479 contig40627 length = 228 ACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAAC CCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCCTAACCC TAACCCTAAGCTCGAGGCCATGATGGCCTTTAAAGTCTACGTACTCACTA GTTAAGAAGCGCTTACGCCGTTCCACTTGTGCCTTTGACTGGATGATGGA TACAGATAGTAACTAACCGTGGACAGTA SEQ ID NO: 3 lpsA1 gene fragment used in pKAE329 TAAGAAGCGCTTACGCCGTTCCACTTGTGCCTTTGACTGGATGATGGATA CAGATAGTAACTAACCGTGGACAGTATGATATTATTATGCACGTGGATTC CAAACAACAATGTTACCTCTCGTATTGGGACTATTTTCTATCGGATGAAC AGGCAAACCACCTTGCCGGTGCGCTGAATGTTGCCATGATATCCATTTTC GAGTCACCACACCGGCCCGTAGGACAGGTAGAGTTGTTCAGCTACCTCGA CCAGCAGAAACTGCTGCAGTGGAGCGGACAGGCACCCATTGCATCCGAGT CTTGTATTGGCGATCTAATCGGCCAGAATTGCCACTCGCGACCTGATTCT CTTGCTGTAGACTCCTGGGACGGTTCCTTTACCTACCAAGAGCTCGATCG CCTTTCATCCACTCTTGCCAACAAACTTCGTGCGGCGGGCGTGGGACCGG AAGTTTTTGTCACCGCGTGCTTTGATCGTTGTAAGTGGATGCCGGTTGCT ATGCTAGGCATTATCAAAGCCGGCGGCGCCATCTGTGCTCTTGACCCTTC ATACCCTCTCGGCCGCCTAACCGGGATGTGTCAATTGTTAAAATCGACCG TTGTATTGACTACGGCAAATAATGCCCAGCAGGCAGAGCAATTGGAACTT ACCACAATTATCCTCGGTAGTGATCTATGGACAGGCGATAGTTATGACGA ACAGGAACGACAAGCACCTTTGTCCAATGTATGCCCCAGCAATGCATTGT ATGCAGTCTTCACTTCAGGTTCCACTGGGAAGCCGAAAGGGGTCGTGGTT GAACATCGATCATTCTCTTCATGCGCTCTAGCATCACTGAAACCCCTGGA TATCCGGCCACACGACCGCGTACTGCATTTTTCATCATATGCGTTCGATA TCAGCATATTTGAAACCCTCGCAACCCTGACTGCAGGGGCTTCCGTAGCA ATCCCATCTGAAAAGGCCCGCAGAGAGGATCTGCCTGGCGCCATGAGAGA GCTACAGGCGACGTGGGCTTTCCTTACACCCACGGTAGCCCGCATGTATC AACCAGAGGACTTTCCTTCGCTCAGGACTTTGTGTTTAGGGGGCGAGGCC ATCCATACTTCAGATATCGGGCTGTGGGCGTCGAAGAATCTAATCACAGG ATATAATCCCGCGGAATCTTGCCCGCTCGGGATCTCCGGCCTAGCTGATC AGTCTGCGGCCAGTTTTCTGGGGTGGTCTTTTTCGTCCCAGGCCTCCTGG ATCGTGGACCCTCGAGACTACCAGAAGCTGGCCCCTATTGGCGCAGTAGG AGAACTTTTGATCGAAGGACCAACCGTAGCAAGGGGTTATATCGACGACC TAACATGCTCTCATCCTGACTCGCCATTCGTTCTCTCTCCTCCACAATGG TTGTCCCGTTTCCGTTCAAGCACTTCTCAAGATACACGTCTGTATCGAAC GGGTGATCTGGTACAGTATGGCAGTAATGGATCCGTGCACTTTATCGGCC GGAAAGATCTCCAGGTAAAGGTACACGGGCAGAGAGTGGAGCTGTCTGAG ATTGAATTCCAGCTCCACAAGACTTTACTTCCTTTGGATTGCAAGGTGGT CGTCGACGCGGTGACTTTCACAGGTCATACCTCTATAATCGCATTTATTA CCGCTGCAGAGCATTCAGACTTGGAAAACGAAGACGACGCCGACCGGTCA CCGGACGTGAAAGTAATCACCCAAGATTTCGAGATCCAAGCCGCCGATGC AGCAACCAAGTTACAGAGCATTCTTCCGAAACACATGGTTCCAACCATAT ATTTACCTGTGGGACATATTCCCATGTCTAGGAGCGGAAAGGTTGATCGA AAGAAATTAAGATCCCTGGCCATCTCGCTACCCCGAGAGACTCTATATTG CATCGGGCGGCAGCCAGGTCCGGGGGAAATAGTGGCAACCGATGTTGAGC GTCGTCTACAACTCCTCTTTGCTCACGTATTGGATCTTTCTCCCGAGAAA ATCAAGGCGGACAGCGATTTCTTCCGCCTAGGTGGAGATTCTATCTATGC GATGAAGCTTTTAGCGTTAGCTCCTCAACAGGGGCTACGTCATCTCACAT ACGAGGAAATTTTTCGCCATCCGAAATTGAGGGATCTGGCCGCAGCATCA AGTTCATTATCCAACATTTCTTCAGAAATTCCGGAAGACCTTGGACCGGC ACCCTTCAGTCTGGCGCGGGATGCAGACTCGCTCACGAAAATCGCAAGTG AGCAGTGTGGTGTCGCTGTAGGAGACATTGAGGATATTTATCCTTGTACC AGTCTACAGGAGAGTCTCATCGCATCTACTGCCCGTGACCAAGATGCGTA CGTAGGGGTGCAATCCTTTACTCTCAATGAAGATATTGATACAACCCGGC TAAAAATGGCTTGGAAAATGACCTCTGCCGGCCATCCGATTCTTCGCACG CGCATCATTCAGACGGACAGCGGTACACCTTACCAGGCCGTTATAAGAGG GCCGCTCTCCTGGTCGGAGGAAAGCAGCAGCGAGGACCTCCCTCCCCAGT TCAAGCCCTCTATTGGTCTGGGGACCCCATTAGTCCAGCTGTGCCTCACC AAAAGTCGGTTACTTGTTGCCATGCACCACGCACTGTACGACGGTTGGTC GTTGCCATTGCTGCTCGTGGAAGTTGATCAGGCATATCGCCAGCTCTTTG TGCGGCAGCTGCCGCCGTTCAACCGGTATGTGAAGCATGTAACGGAAACC GTGGACTCTGCGGCCTCATTTTGGAAAGCAGAGTTACAGGACGTGGATCC TGTACACTTCCCGCCGCTGCCTCATCTCAATTATAAGCCAGAGCCTCGCG CATTACTCACAAAGTTAATTACCGTTACTGCCCACACCAATGCACAACAC AACGTGACGGTTGCAACCGAGATACAACTAGCCTGGGCCCTCACCAGTCA CACTTATACTAACAGCCAAGACGTGGTTTTCGGAATCATCTCCTCGGGAC GCGGTGCGCCAGTCGCGGGCATTGAGAGGATGTTAGGCCCTACCTTTGCG AGTACGCCGCTCCGAGTGCCCATTGATCCGGCCCAGGAGGTGAGAGAAGC TCTGGAAGAGTTGCAATACCGGCTCGCGGAGCAAACGAAGTATGTGCAAA CTGGCTTGCAGCGAATTCGCCAACAGGGGCCAAATGCAGCGGCAGCTTGC AGCTTCCAAACAATGCTTGTCGTGGAGCCCAACCAACCATTCAAGACTCA GAGTGCTTGGTTCAGCCGACACGAGTTTTTGTCCGAGCTCACAAGATTTA GCAGCCATTCTCTGACACTCAGGTGCAAGTTGCTAGCCAGATCAGTCGAA GTCACTGCGATTTACGACCAGTTGGTTGTACCCGACGCTCAGATGCAACG TATCTTATCCCAATTCCAACACATTTTAACACAGATCCAGGGCATTGGAT CCCGAAACACCACTATTGGAGACATTAATAGGCTAAGCCCCGGAGACTGG AACGAGCTTCAGGCATGGAATTCTACATTGCCTCCAGTGCTGGAGCTGTG TGTGCACCAGATGATTCAAGCGAAAGCCCAAATGCAACCGGAGGCGTTGG CAATTCATTCGTGGGATGGTGCCCTGACGTACAAGGAGCTTGAGGACTAC GCCAAAGGACTTGCCCACCGCCTCCACGCCCTTGGTGTCAGGCCAAATAC CTTCGTTGCCATCTATCTGCAGAAGTCTTTGTGGGTAGTGGTGGCCCAGC TTGCTGTGCTCATGGCGGGTGCTGCTTTTACAACGTTGGAGACCTCTCAA CCCATCAACCGTCTACGCGACGTCTGTCGTACCGTCCAACCCACCGTGGT ATTGACATCTGACGAGTTGCGGTTATCTGGTGCAGACCTTGAAGTACCGG CTCCCCTTCTAGTGATCAATCTGCAGCTTCTCCTTCAAGAGTCCGGTAGC CACAGCCAGCCTTTCGAGAACCACACCATGACGGCCTCCGATGCCATGTA CAGCATTGCCACATCCGGCACCACTGGTAAACCCAAGGTGGTGGTCATCG AGCATCAAGCGTTTCTAGCCAACTCGAGACACCTGATTGATCGCTGGGGG TTCACTGCAGACTCTCGTGTCCTTCAATTTGCCGGATACAGCTTCGATGC AATGATTGTGGAGCATTTCATTACTCTTCTTGCGGGCGGCTGCATCTGCA TTCCTTCCTCATTTGACCGGGACAACCGTCTAGCGACGTGTATTGCCGAG ATGCGCGTCAACTGGGCGATGTTGACGTCTTCAGTTATCCCGCTGCTCAC CCCTGCTACTGTGCCGACCCTGCAAACGCTGGTACAGGCCGGCGAACCCA TGCATCAAGGCATAACTGACTGCTGGGCTTCTCATGTGCGATTGTTCAAC GCCTACGGCCCAACGGAGTGCAGCGTGATCAGTACCACCAGCAACGTCAT CAATCCAGATGCCAGGAACGCAAGGAATATTGGTTTCACGACAGGTGGCG TCTGCTGGATTGTCGACCCCGAGCATCCAGAGAGCCCACCCGTTCCCATC GGGGCTGAGGGTGAACTAATCATTGAGGGCGCTATTCTCGCTCGGGGCTA TCTAGGCGACCGTGTACGAACTGCTGCAGCTTTCACTCCCCGTCCTGGCT GGTTGGATGATTTCAGAGGCAGTAGCGGAGATAATCGAGTTTACCGGACT GGCGACATTGTCAGATACGATCCAGACGGATCCATCTCCTATGTGCGGCG CAAGGATTCCCAGGTCAAGCTCCGCGGCCAGCGAGTAGAGCTGCTGGATG TCGAGCACCATCTCCAGAACTGCTTTCCGGGCGCGCTTCAGGTTGTCGCT GATATTGTTACTGTGCCCAATACTCGGTCGAGCGCCCTGGTGGCCCTTGT ATTAGCCACTCCCACTTCCTCTTCGAGCGCGGCAATCGAATCGTGTCCAA TTGATGACCAAGCAATGACGGCCCATGGCCTCTTGTTACTCGCTAACAAC CCCCAGTTCCTCATTGACGCAAGCGCTGCCGAGCTCGCACTCCAGGACCG AGTTCCCTCGTATATGGTGCCTAGTTTGATCATACCTACCTCACACTTTC CACGGGACGTCAGCGGAAAGGTCAACCGTGGAGAGATAAGCCGGTCTCTC GCAGCTCTTTCTCGGCAAGAGTGGGATGGATATGTTTCCACGAACAGAGT CGCTCCGACTAGCGGTCTCGAACGTGAACTGCAGAAGATCTGGGCCCTCA TTCTGAATATCCCTCCTGATACCATTGGCGTCCATGACAGCTTCTTCCGA CTGGGGGGTGACTCGATCACCTGTATGCAGGTTGCTGCACAGTGTAGTAG AACAGGAATCCCGATTACTGTTAAGGATGTCTTCAAGCGACGGACAATTG AGGAGCTAGCGGCCGCAGCAGTGGTGGTACAGTGTCCTGAATCATCTACA ACAGAGCTTGTCAACACCGCAGAAGCCAAATTCTCCTTTTATGGCCCCGG ACAACTGGAGGAGTATATGATGCAGATCCAACCCCAGCTCGGAGAAGGCC AGATCGTAGAGGACATCTATCCGTGCTCTCCAATCCAGCGGGGGATTCTC ATGAGCCACGCCCGTAACTCCAGCAATTACGAAGAAGTTATTCAGTGGAA GGTCATCAGCAGAGCCCCGGTCAACGTCTATCGTCTGCGTGATGCCTGGG CCCAGGTGGTAGATCGGCACGCGGTTCTTCGTACTTTGTTCTTGCATGTT TGTGAGGAAAACTATTTGGATCAAGTAGTGCTGAGGAGCCATTCACCAAT GGTCCTGGTATACAACGAAGGGGAGGAGCCAGTTAATCCGGTATCAACTG GCTGCTCTCAGCCTATGCATCATCTCCGAGTCAAGCGATCGAGTACGGGC GAGATTACCGTTCGTTTGCATATCAACCACGCACTTGTCGATGGGACCTC CTTGTTCATCATCAGACGGGAGTTGGCCATGGCCTACGAAGGTCGTCTAG CCTCATCTCGTGCATCATCACCCTACCGGGACTACATTGCATACTTGCAA AACTGCCATGCACAAATACAGTCAAAGGAGTACTGGAAGTCGTACATGGA GGGCACAGCACCTTGTCTATTCCCTTCTCTGAAGAACGCGGGCGCACAAG ATTCACAACAGCCTTTTGAGGCTTTCAAGCTGCAACTGGGAGCAACTGCT GACCTGAATCAATTCTGCGAGAATCACCGATTGGCACTTACCAGTGTACT CCACGTGGTGTGGGCTATGGTGGTCCAACGCTACACGGCAATGGACGAGG TCTGCTTTGGCTATATGACTTCTGGTCGCCATGTGCCCGTGGCTGGCGTT CAAGATATCGTAGGCCCGTTGTTCAACATGCTGGTGGCGCGGGTGGGCTT GCCGCATGATGCCACGCTGCTCTCTGTCATGCAGAAGTATCATGACAACT TCCTGATCAGTCTCGACCATCAGCACCAGTCCTTAGCCGAAACACTGCAT TCCGTCGGGTCGGCTTCGGGGGAGTTATTCAACACGCTGGTTTCAATATT CAATGATCAGCGAGAGGGGGAGCCGGCCCATAAGTCGTCTGCCGTCACTC TGGTGGGTGATGACATACATAGCCGATCGGAGGTAGGTGCTGTCTTACTC GCCTTATCGTCTTGCCCCATATTGAGTTCATTGCTAACCAAACACTTCCA CCACAGTACGCCATCACATTAAACGTTCTCATGCTCGCGGACCAGGTTCA TATGCAGCTCTCTTATCACACATCATTGCTGAGTGATAATTATGCCAGGA TGATTGCTAAAACCTTCCGCCATGTCCTGGCCACAGTCCTAGGACAACCT CAGCTCCGTCTCAATGAGATCGAGATGCTGGATGAGGAACATAGAAGCGG CCTCTACGAGCGGAATCATGCGATCGTGCCTTCATACGACAGCT SEQ ID NO: 4 the 45-bp synthetic “oligotag” derived from polylinkerDdeI and polylinkerSpeI 5′- TAAGCTCGAGGCCATGATGGCCTTTAAAGTCTACGTACTCACTAG-3′ SEQ ID NO: 5 GGGACTAGTTAAGAAGCGCTTACGCCGTTCC SEQ ID NO: 6 CACACGCGTAGCTGTCGTATGAAGGCACGAT SEQ ID NO: 7 /5′ Phos/TAAGCTCGAGGCCATGATGGCCCTTTAAAGTCTACGTACTCA SEQ ID NO: 8 /5′ Phos/CTAGTGAGTACGTAGACTTTAAAGGCCATCATGGCCTCGAGC SEQ ID NO: 9 AGAAACAGACAGGGCTATTC SEQ ID NO: 10 CTCGCCGGCATGCGTCAAAA SEQ ID NO: 11 TTATTGGATGAAACCTTAGCTAGTTGG SEQ ID NO: 12 CTCGCCGGCATGCGTCAAAT SEQ ID NO: 13 CACTTTAAACCTAATGCACTACACTAAGACCCC SEQ ID NO: 14 AATCTGGCCAACATGGTTCCCATG SEQ ID NO: 15 GCTTGACAAACGCACCAAGTTATCG SEQ ID NO: 16 TGTACACCACTTCAACGAGGCTTG SEQ ID NO: 17 CGAAGTTATCTCGACGGTATCG SEQ ID NO: 18 TCGGCGAGTACTTCTACACA SEQ ID NO: 19 TCCTTGCCACCAAGGCAGATTG SEQ ID NO: 20 ACATTGTCCACGGCAAGCCCTC SEQ ID NO: 21 CGTGCGGATAATGAAGGCGTCC SEQ ID NO: 22 CGATGAGAAATCCATTGGCACCG SEQ ID NO: 23 GGCATGGCAGTCAAGTTCTTCAC SEQ ID NO: 24 ACATTGGCTGTCCAAGTAGGGT SEQ ID NO: 25 CCCAGAACTTTCGTCATGTCCG SEQ ID NO: 26 ATCCCGTCCAGTGGCGGAAGTA SEQ ID NO: 27 TTGCCAAGACTCTCCATGAGAT SEQ ID NO: 28 ACCACGTCGGTCTTAATACAGCG SEQ ID NO: 29 GATGGCCTTTAAAGTCTACGTACTC SEQ ID NO: 30 ATATCATGGCAACATTCAGCGCAC SEQ ID NO: 31 telomere repeat that can be repeated about 3 to about 26 times CCCTAA SEQ ID NO: 32 telomere repeat that can be repeated about 3 to about 26 times CCCTAATGTTCA SEQ ID NO: 33 telomere repeat that can be repeated about 3 to about 26 times CCCTAAC SEQ ID NO: 34 telomere repeat that can be repeated about 3 to about 26 times CTAACC SEQ ID NO: 35 telomere repeat that can be repeated about 3 to about 26 times TAACCC SEQ ID NO: 36 telomere repeat that can be repeated about 3 to about 26 times AACCCT SEQ ID NO: 37 telomere repeat that can be repeated about 3 to about 26 times ACCCTA SEQ ID NO: 38 telomere repeat that can be repeated about 3 to about 26 times TAAGGG SEQ ID NO: 39 telomere repeat that can be repeated about 3 to about 26 times AAGGGT SEQ ID NO: 40 telomere repeat that can be repeated about 3 to about 26 times AGGGTA SEQ ID NO: 41 telomere repeat that can be repeated about 3 to about 26 times GGGTAA SEQ ID NO: 42 telomere repeat that can be repeated about 3 to about 26 times GGTAAG SEQ ID NO: 43 telomere repeat that can be repeated about 3 to about 26 times GTAAGG SEQ ID NO: 44 telomere repeat that can be repeated about 3 to about 26 times CCTAATGTTCAC SEQ ID NO: 45 telomere repeat that can be repeated about 3 to about 26 times CTAATGTTCACC SEQ ID NO: 46 telomere repeat that can be repeated about 3 to about 26 times TAATGTTCACCC SEQ ID NO: 47 telomere repeat that can be repeated about 3 to about 26 times AATGTTCACCCT SEQ ID NO: 48 telomere repeat that can be repeated about 3 to about 26 times ATGTTCACCCTA SEQ ID NO: 49 telomere repeat that can be repeated about 3 to about 26 times TGTTCACCCTAA SEQ ID NO: 50 telomere repeat that can be repeated about 3 to about 26 times GTTCACCCTAAT SEQ ID NO: 51 telomere repeat that can be repeated about 3 to about 26 times TTCACCCTAATG SEQ ID NO: 52 telomere repeat that can be repeated about 3 to about 26 times TCACCCTAATGT SEQ ID NO: 53 telomere repeat that can be repeated about 3 to about 26 times CACCCTAATGTT SEQ ID NO: 54 telomere repeat that can be repeated about 3 to about 26 times ACCCTAATGTTC SEQ ID NO: 55 telomere repeat array of the scaffold of the e19 assembly downstream of lpsB1 CCCTAACCCTAACCCTAACCCTAACCCTAACCCTAT




