Patent Analysis of

Compositions and methods comprising sequences having meganuclease activity

Updated Time 12 June 2019

Patent Registration Data

Publication Number

US10150956

Application Number

US15/867385

Application Date

10 January 2018

Publication Date

11 December 2018

Current Assignee

PIONEER HI-BRED INTERNATIONAL, INC.; E.I. DU PONT DE NEMOURS AND COMPANY

Original Assignee (Applicant)

E. I. DUPONT DE NEMOURS AND COMPANY; PIONEER HI-BRED INTERNATIONAL, INC.

International Classification

C12N9/22, C12N9/16, C12N15/82, C12N15/81

Cooperative Classification

C12N9/22, C12N9/16, C12Y301/00, C12N15/8213, C12N15/81

Inventor

BERMUDEZ, ERICKA; CIGAN, ANDREW MARK; ENGLISH, JAMES J; FALCO, SAVERIO CARL; GAO, HUIRONG; LIU, LU; LIU, ZHAN-BIN; ONG, AZALEA S; SVITASHEV, SERGEI; YOUNG, JOSHUA K

Patent Images

This patent contains figures and images illustrating the invention and its embodiment.


Abstract

Compositions and methods comprising polynucleotides and polypeptides having meganuclease activity are provided. Further provided are nucleic acid constructs, yeast, plants, plant cells, explants, seeds and grain having the meganuclease sequences. Various methods of employing the meganuclease sequences are provided. Such methods include, for example, methods for producing a meganuclease with increased activity at a wide range of temperatures, methods for producing a yeast, plant, plant cell, explant or seed comprising a meganuclease with increased activity.


Claims

1. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, wherein said meganuclease polypeptide has at least 80% sequence identity to SEQ ID NO:283, comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, and further comprises:

a) at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 283 selected from the group consisting of positions 16, 19, 22, 50, 54, 56, 81, 103, 121, 132, 153, 244, 258, 281, 308, 316, 319, and combinations thereof; or, b) an amino acid sequence having at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or 17 of any of the amino acid modification of (a).

2. The isolated or recombinant polynucleotide of claim 1, wherein said at least one amino acid modification comprises:

a) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 283; b) a glycine (G) at a position corresponding to amino acid position 19 in SEQ ID NO: 283; c) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 283; d) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 283; e) an isoleucine (I) at a position corresponding to amino acid position 54 in SEQ ID NO: 283; f) an aspartic acid (D) at a position corresponding to amino acid position 56 in SEQ ID NO: 283; g) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 283; h) an asparagine (N) at a position corresponding to amino acid position 103 in SEQ ID NO: 283; i) a lysine (K) at a position corresponding to amino acid position 121 in SEQ ID NO: 283; j) an isoleucine (I) at a position corresponding to amino acid position 132 in SEQ ID NO: 283; k) an aspartic acid (D) at a position corresponding to amino acid position 153 in SEQ ID NO: 283; l) a glutamic acid (E) at a position corresponding to amino acid position 244 in SEQ ID NO: 283; m) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 283; n) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 283; o) a lysine (K) at a position corresponding to amino acid position 308 in SEQ ID NO: 283; p) a valine (V) at a position corresponding to amino acid position 316 in SEQ ID NO: 283; q) an isoleucine (I) at a position corresponding to amino acid position 319 in SEQ ID NO: 283; or r) any combination of a) to q).

3. The isolated or recombinant polynucleotide of claim 1, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide further comprises:

a) an isoleucine (I) or phenylalanine (F) at a position corresponding to amino acid position 16 in SEQ ID NO: 283; b) a serine (S) or a glycine (G) at a position corresponding to amino acid position 19 in SEQ ID NO: 283; c) a cysteine (C) or a serine (S) at a position corresponding to amino acid position 22 in SEQ ID NO: 283; d) an arginine (R) or lysine (K) or glutamine (Q) at a position corresponding to amino acid position 50 in SEQ ID NO: 283; e) an isoleucine (I) or a phenylalanine (F) at a position corresponding to amino acid position 54 in SEQ ID NO: 283; f) a leucine (L) or aspartic acid (D) at a position corresponding to amino acid position 56 in SEQ ID NO: 283; g) a lysine (K) or isoleucine (I) at a position corresponding to amino acid position 81 in SEQ ID NO: 283; h) a valine (V) or asparagine (N) at a position corresponding to amino acid position 103 in SEQ ID NO: 283; i) a glycine (G) or lysine (K) at a position corresponding to amino acid position 121 in SEQ ID NO: 283; j) a valine (V) or isoleucine (I) at a position corresponding to amino acid position 132 in SEQ ID NO: 283; k) an aspartic acid (D) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 283; l) a lysine (K) or a glutamic acid (E) at a position corresponding to amino acid position 244 in SEQ ID NO: 283; m) a serine (S) or a glycine (G) at a position corresponding to amino acid position 258 in SEQ ID NO: 283; n) a phenylalanine (F) or a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 283; o) a glycine (G) or a lysine (K) at a position corresponding to amino acid position 308 in SEQ ID NO: 283; p) an alanine (A) or a valine (V) at a position corresponding to amino acid position 316 in SEQ ID NO: 283; q) a valine (V) or an isoleucine (I) at a position corresponding to amino acid position 319 in SEQ ID NO: 1; or r) any combination of a) to q).

4. The isolated or recombinant polynucleotide of claim 1, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 316, 317, 318, 319, and 320.

5. The isolated or recombinant polynucleotide of claim 1, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein the polypeptide is capable of recognizing and cleaving a meganuclease recognition sequence of SEQ ID NO: 281.

6. The isolated or recombinant polynucleotide of claim 1, wherein said polypeptide has an increased meganuclease activity when compared to a control meganuclease that lacks said amino acid modification.

7. The isolated or recombinant polynucleotide of claim 6, wherein said control meganuclease is selected from the group of SEQ ID NO: 282 and SEQ ID NO: 283.

8. The isolated or recombinant polynucleotide of claim 6, wherein the increased meganuclease activity is evidenced by:

a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or, b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or, c) a higher in-vitro cutting when compared to the control meganuclease that lacks said amino acid modification; or, d) any combination of (a), (b) and (c).

9. The isolated or recombinant polynucleotide of claim 6, wherein the increased meganuclease activity is determined at 16° C., 24° C., 28° C., 30° C. or 37° C.

10. A recombinant DNA construct, comprising the isolated or recombinant polynucleotide of claim 1.

11. A cell comprising at least one polynucleotide of claim 1 or the recombinant DNA construct of claim 10, wherein said polynucleotide is heterologous to the cell.

12. The cell of claim 11, wherein said cell is a plant cell.

13. The cell of claim 11, wherein said plant cell is from a monocot.

14. The cell of claim 11, wherein said plant cell is from a dicot.

15. A plant comprising a plant cell of claim 12.

16. A transgenic seed produced by the plant of claim 15, wherein said transgenic seed comprises the heterologous polynucleotide of claim 1 or the recombinant construct of claim 10.

17. A method for producing a meganuclease having increased activity over a range of temperatures, the method comprising:

a) producing a variant meganuclease, wherein said variant meganuclease has at least 80% sequence identity to SEQ ID NO:283 and comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 283 selected from the group consisting of positions 16, 19, 22, 50, 54, 56, 81, 103, 121, 132, 153, 244, 258, 281, 308, 316, 319, and combinations thereof; b) screening said variant meganuclease for the ability to cleave a DNA target sequence over a range of temperatures between and including 16° C. to 37° C.; and c) selecting a variant meganuclease screened in step b) that is able to cleave a DNA target sequence over said temperature range.

18. The method of claim 17, wherein said range of temperatures comprises:

a) 16° C.; b) 18° C.; c) 20° C.; d) 24° C.; e) 28° C.; f) 30° C.; g) 37° C.; or, h) any combination of a), b), c), d), e), f), h), g) and g).

19. A method for producing a meganuclease having an increased meganuclease activity when compared to a control meganuclease, the method comprising:

a) producing a variant meganuclease, wherein said variant meganuclease has at least 80% sequence identity to SEQ ID NO:283 and comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 283 selected from the group consisting of positions 16, 19, 22, 50, 54, 56, 81, 103, 121, 132, 153, 244, 258, 281, 308, 316, 319, and combinations thereof; and, b) screening said variant for increased meganuclease activity when compared to a control meganuclease; and c) selecting a variant meganuclease screened in step b) that is able to cleave a DNA target sequence over said temperature range.

20. The method of claim 19, wherein the increased meganuclease activity is evidenced by:

a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or, b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or, c) a higher in-vitro cutting when compared to the control meganuclease that lacks said amino acid modification; or, d) any combination of (a), (b) and (c).

21. A composition comprising at least one polynucleotide of claim 1.
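
As a reading aid for claims 1 and 2, the recited positions and residues can be tabulated. The Python sketch below is illustrative only and is not part of the patent. The function names and the `residue_at` input are hypothetical, and the sketch assumes the candidate polypeptide has already been aligned to SEQ ID NO: 283 so that each residue is keyed by its corresponding position. It reports only which recited residues are present; it does not compute sequence identity or assess claim scope.

```python
# Positions recited in claim 1(a), numbered relative to SEQ ID NO: 283 (1-based).
CLAIM1_POSITIONS = (16, 19, 22, 50, 54, 56, 81, 103, 121, 132,
                    153, 244, 258, 281, 308, 316, 319)

# Residues recited in claim 2 for each of those positions (one-letter codes).
CLAIM2_RESIDUES = {
    16: {"I"}, 19: {"G"}, 22: {"C"}, 50: {"R", "K"}, 54: {"I"},
    56: {"D"}, 81: {"K"}, 103: {"N"}, 121: {"K"}, 132: {"I"},
    153: {"D"}, 244: {"E"}, 258: {"S"}, 281: {"Y"}, 308: {"K"},
    316: {"V"}, 319: {"I"},
}


def has_lysine_66(residue_at):
    """True if the residue at the position corresponding to 66 is lysine (K)."""
    return residue_at.get(66) == "K"


def claim2_hits(residue_at):
    """Return the claim 2 positions at which a recited residue is present.

    residue_at is a hypothetical input: a dict mapping a SEQ ID NO: 283
    position (1-based) to the one-letter residue found at the corresponding
    position of the candidate polypeptide.
    """
    return sorted(pos for pos, allowed in CLAIM2_RESIDUES.items()
                  if residue_at.get(pos) in allowed)


if __name__ == "__main__":
    # Made-up candidate: only a few positions are given, for illustration.
    candidate = {66: "K", 16: "I", 50: "K", 319: "V"}
    print(has_lysine_66(candidate), claim2_hits(candidate))
```

Keying residues by reference position keeps the table lookup separate from the alignment step, which would need a dedicated alignment tool in practice.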


Claim Tree

  • 1
    1. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, wherein
    • said meganuclease polypeptide has at least 80% sequence identity to SEQ ID NO:283, comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, and further comprises:
    • 2. The isolated or recombinant polynucleotide of claim 1, wherein
      • said at least one amino acid modification comprises:
    • 3. The isolated or recombinant polynucleotide of claim 1, wherein
      • said nucleotide sequence encodes a meganuclease polypeptide, wherein
    • 4. The isolated or recombinant polynucleotide of claim 1, wherein
      • said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of
    • 5. The isolated or recombinant polynucleotide of claim 1, wherein
      • said nucleotide sequence encodes a meganuclease polypeptide, wherein
    • 6. The isolated or recombinant polynucleotide of claim 1, wherein
      • said polypeptide has an increased meganuclease activity when compared to a control meganuclease that lacks said amino acid modification.
  • 10
    10. A recombinant DNA construct, comprising
    • the isolated or recombinant polynucleotide of claim 1.
  • 11
    11. A cell comprising
    • at least one polynucleotide of claim 1 or the recombinant DNA construct of claim 10, wherein said polynucleotide is heterologous to the cell.
    • 12. The cell of claim 11, wherein
      • said cell is a plant cell.
    • 13. The cell of claim 11, wherein
      • said plant cell is from a monocot.
    • 14. The cell of claim 11, wherein
      • said plant cell is from a dicot.
  • 15
    15. A plant comprising
    • a plant cell of claim 12.
    • 16. A transgenic seed produced by the plant of claim 15, wherein
      • said transgenic seed comprises
  • 17
    17. A method for producing a meganuclease having increased activity over a range of temperatures, the method comprising:
    • a) producing a variant meganuclease, wherein said variant meganuclease has at least 80% sequence identity to SEQ ID NO:283 and comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 283 selected from the group consisting of positions 16, 19, 22, 50, 54, 56, 81, 103, 121, 132, 153, 244, 258, 281, 308, 316, 319, and combinations thereof;
    • b) screening said variant meganuclease for the ability to cleave a DNA target sequence over a range of temperatures between and including 16° C. to 37° C.; and
    • c) selecting a variant meganuclease screened in step b) that is able to cleave a DNA target sequence over said temperature range.
    • 18. The method of claim 17, wherein
      • said range of temperatures comprises:
  • 19
    19. A method for producing a meganuclease having an increased meganuclease activity when compared to a control meganuclease, the method comprising:
    • a) producing a variant meganuclease, wherein said variant meganuclease has at least 80% sequence identity to SEQ ID NO:283 and comprises a lysine (K) at an amino acid position corresponding to position 66 of SEQ ID NO:283, by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 283 selected from the group consisting of positions 16, 19, 22, 50, 54, 56, 81, 103, 121, 132, 153, 244, 258, 281, 308, 316, 319, and combinations thereof; and,
    • b) screening said variant for increased meganuclease activity when compared to a control meganuclease; and
    • c) selecting a variant meganuclease screened in step b) that is able to cleave a DNA target sequence over said temperature range.
    • 20. The method of claim 19, wherein
      • the increased meganuclease activity is evidenced by:
  • 21
    21. A composition comprising
    • at least one polynucleotide of claim 1.

Description

REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY

The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII formatted sequence listing with a file named 20180102_BB2117USDIV2_SequenceListing_ST25.txt created on 2 Jan. 2018 and having a size of 950,424 bytes and is filed concurrently with the specification. The sequence listing contained in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety.

FIELD OF THE INVENTION

This invention is in the field of molecular biology. More specifically, this invention pertains to sequences having meganuclease activity.

BACKGROUND OF THE INVENTION

Recombinant DNA technology has made it possible to insert foreign DNA sequences into the genome of an organism, thus altering the organism's phenotype. The most commonly used plant transformation methods are Agrobacterium infection and biolistic particle bombardment, in which transgenes integrate into a plant genome in a random fashion and in an unpredictable copy number. Thus, efforts are undertaken to control transgene integration in plants.

Site-specific integration techniques, which employ site-specific recombination systems, as well as other types of recombination technologies, have been used to generate targeted insertions of genes of interest in a variety of organisms.

Other methods for inserting or modifying a DNA sequence involve homologous DNA recombination by introducing a transgenic DNA sequence flanked by sequences homologous to the genomic target. U.S. Pat. No. 5,527,695 describes transforming eukaryotic cells with DNA sequences that are targeted to a predetermined sequence of the eukaryote's DNA. Transformed cells are identified through use of a selectable marker included as a part of the introduced DNA sequences.

While both systems have provided useful techniques for targeted insertion of sequences of interest, there remains a need for nucleases that will facilitate precise modification of a plant or yeast genome. In addition, there remains a need for meganucleases with increased activity that can introduce a double-strand break at a wide range of temperatures.

BRIEF SUMMARY OF THE INVENTION

Compositions and methods comprising polynucleotides and polypeptides having meganuclease activity are provided. Further provided are compositions comprising polynucleotides encoding variant meganucleases comprising at least one amino acid modification, wherein the variant meganuclease has increased activity. Also provided are nucleic acid constructs, yeast, plants, plant cells, explants, seeds and grain having the meganuclease sequences.

Various methods of employing the meganuclease sequences are provided. Such methods include methods for increasing meganuclease activity in a cell, yeast cell, plant cell, plant, explant or seed. Further provided are methods and compositions that allow the various meganuclease polypeptides and variants and fragments thereof to be expressed in a yeast or plant cell at a wide range of temperatures. Such methods and compositions find use in producing yeast, plant cells, plants and explants with improved meganuclease activity.

Thus in a first embodiment, the invention concerns an isolated or recombinant polynucleotide comprising a nucleotide sequence encoding a meganuclease polypeptide, said polypeptide comprising: a) an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; or, b) an amino acid sequence having at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44 of any of the amino acid modification of (a).

In other embodiments, the invention concerns an isolated or recombinant polynucleotide of the present disclosure, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 1.
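
To make the percent-identity thresholds concrete, the Python sketch below computes identity for two sequences that have already been aligned. It is illustrative only: this excerpt does not give the patent's own definition of sequence identity, so the convention used here (identical residues divided by aligned columns, ignoring columns that are gaps in both sequences) is an assumption, and the function names and toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumed convention: identical residue columns divided by the number of
    aligned columns, skipping columns where both sequences have a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """True if percent identity is at least the threshold (default 80%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy five-residue sequences, not taken from the sequence listing.
    print(percent_identity("MKTAY", "MKTAF"))
    print(meets_identity_threshold("MKTAY", "MKTAF"))
```

Other conventions divide by the length of the shorter sequence or of the reference, and can give different values for the same alignment.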

In another embodiment, the invention concerns the isolated or recombinant polynucleotide of embodiment 1, and its corresponding polypeptide, wherein said at least one amino acid modification comprises: a) an aspartic acid (D) at a position corresponding to amino acid position 2 in SEQ ID NO: 1; b) a histidine (H) at a position corresponding to amino acid position 12 in SEQ ID NO: 1; c) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 1; d) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 1; e) a leucine (L) at a position corresponding to amino acid position 23 in SEQ ID NO: 1; f) an arginine (R) at a position corresponding to amino acid position 31 in SEQ ID NO: 1; g) an asparagine (N) at a position corresponding to amino acid position 36 in SEQ ID NO: 1; h) a leucine (L) at a position corresponding to amino acid position 43 in SEQ ID NO: 1; i) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 1; j) a leucine (L) at a position corresponding to amino acid position 56 in SEQ ID NO: 1; k) an isoleucine (I) at a position corresponding to amino acid position 58 in SEQ ID NO: 1; l) a histidine (H) or alanine (A) at a position corresponding to amino acid position 59 in SEQ ID NO: 1; m) a valine (V) at a position corresponding to amino acid position 62 in SEQ ID NO: 1; n) a lysine (K) at a position corresponding to amino acid position 71 in SEQ ID NO: 1; o) a threonine (T) at a position corresponding to amino acid position 72 in SEQ ID NO: 1; p) an alanine (A) at a position corresponding to amino acid position 73 in SEQ ID NO: 1; q) an arginine (R) at a position corresponding to amino acid position 80 in SEQ ID NO: 1; r) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 1; s) an arginine (R) at a position corresponding to amino acid position 82 in SEQ ID NO: 1; t) an aspartic acid (D) at a position corresponding to amino acid position 86 in SEQ ID NO: 1; u) an isoleucine (I) at a position corresponding to amino acid position 91 in SEQ ID NO: 1; v) an isoleucine (I) at a position corresponding to amino acid position 95 in SEQ ID NO: 1; w) an arginine (R) at a position corresponding to amino acid position 98 in SEQ ID NO: 1; x) a valine (V) at a position corresponding to amino acid position 103 in SEQ ID NO: 1; y) a serine (S) at a position corresponding to amino acid position 113 in SEQ ID NO: 1; z) a proline (P) at a position corresponding to amino acid position 114 in SEQ ID NO: 1; aa) an arginine (R) at a position corresponding to amino acid position 116 in SEQ ID NO: 1; bb) a glycine (G) at a position corresponding to amino acid position 117 in SEQ ID NO: 1; cc) a threonine (T) at a position corresponding to amino acid position 118 in SEQ ID NO: 1; dd) a glycine (G) at a position corresponding to amino acid position 121 in SEQ ID NO: 1; ee) an arginine (R) at a position corresponding to amino acid position 124 in SEQ ID NO: 1; ff) a cysteine (C) at a position corresponding to amino acid position 128 in SEQ ID NO: 1; gg) an alanine (A) at a position corresponding to amino acid position 129 in SEQ ID NO: 1; hh) an arginine (R) at a position corresponding to amino acid position 131 in SEQ ID NO: 1; ii) a serine (S) at a position corresponding to amino acid position 147 in SEQ ID NO: 1; jj) an alanine (A) at a position corresponding to amino acid position 151 in SEQ ID NO: 1; kk) a leucine (L) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 1; ll) a tryptophan (W) at a position corresponding to amino acid position 159 in SEQ ID NO: 1; mm) a glutamic acid (E) at a position corresponding to amino acid position 160 in SEQ ID NO: 1; nn) a valine (V) at a position corresponding to amino acid position 161 in SEQ ID NO: 1; oo) a tyrosine (Y) at a position corresponding to amino acid position 162 in SEQ ID NO: 1; pp) an arginine (R) at a position corresponding to amino acid position 163 in SEQ ID NO: 1; qq) a histidine (H) at a position corresponding to amino acid position 164 in SEQ ID NO: 1; rr) a leucine (L) at a position corresponding to amino acid position 165 in SEQ ID NO: 1; ss) an arginine (R) at a position corresponding to amino acid position 166 in SEQ ID NO: 1; tt) a histidine (H) at a position corresponding to amino acid position 167 in SEQ ID NO: 1; uu) a proline (P) at a position corresponding to amino acid position 168 in SEQ ID NO: 1; vv) an alanine (A) at a position corresponding to amino acid position 169 in SEQ ID NO: 1; ww) a proline (P) at a position corresponding to amino acid position 170 in SEQ ID NO: 1; xx) a histidine (H) at a position corresponding to amino acid position 171 in SEQ ID NO: 1; yy) a proline (P) at a position corresponding to amino acid position 172 in SEQ ID NO: 1; zz) an arginine (R) at a position corresponding to amino acid position 173 in SEQ ID NO: 1; aaa) a leucine (L) at a position corresponding to amino acid position 174 in SEQ ID NO: 1; bbb) a proline (P) at a position corresponding to amino acid position 175 in SEQ ID NO: 1; ccc) a glutamine (Q) at a position corresponding to amino acid position 176 in SEQ ID NO: 1; ddd) an alanine (A) at a position corresponding to amino acid position 177 in SEQ ID NO: 1; eee) an arginine (R) at a position corresponding to amino acid position 178 in SEQ ID NO: 1; fff) a valine (V) at a position corresponding to amino acid position 179 in SEQ ID NO: 1; ggg) a glutamine (Q) at a position corresponding to amino acid position 180 in SEQ ID NO: 1; hhh) a valine (V) at a position corresponding to amino acid position 182 in SEQ ID NO: 1; iii) a proline (P) at a position corresponding to amino acid position 183 in SEQ ID NO: 1; jjj) a lysine (K) at a position corresponding to amino acid position 184 in SEQ ID NO: 1; kkk) a threonine (T) or a histidine (H) at a position corresponding to amino acid position 185 in SEQ ID NO: 1; lll) a serine (S) at a position corresponding to amino acid position 186 in SEQ ID NO: 1; mmm) a glutamic acid (E) at a position corresponding to amino acid position 187 in SEQ ID NO: 1; nnn) a leucine (L) at a position corresponding to amino acid position 188 in SEQ ID NO: 1; ooo) a glutamic acid (E) at a position corresponding to amino acid position 189 in SEQ ID NO: 1; ppp) a glutamine (Q) at a position corresponding to amino acid position 190 in SEQ ID NO: 1; qqq) a leucine (L) at a position corresponding to amino acid position 191 in SEQ ID NO: 1; rrr) a proline (P) at a position corresponding to amino acid position 194 in SEQ ID NO: 1; sss) a lysine (K) at a position corresponding to amino acid position 195 in SEQ ID NO: 1; ttt) a serine (S) at a position corresponding to amino acid position 196 in SEQ ID NO: 1; uuu) a phenylalanine (F) at a position corresponding to amino acid position 197 in SEQ ID NO: 1; vvv) an isoleucine (I) at a position corresponding to amino acid position 200 in SEQ ID NO: 1; www) a valine (V) at a position corresponding to amino acid position 203 in SEQ ID NO: 1; xxx) a leucine (L) at a position corresponding to amino acid position 204 in SEQ ID NO: 1; yyy) a cysteine (C) at a position corresponding to amino acid position 209 in SEQ ID NO: 1; zzz) a leucine (L) at a position corresponding to amino acid position 222 in SEQ ID NO: 1; aaaa) an isoleucine (I) at a position corresponding to amino acid position 232 in SEQ ID NO: 1; bbbb) a serine (S) at a position corresponding to amino acid position 236 in SEQ ID NO: 1; cccc) a leucine (L) or an arginine (R) at a position corresponding to amino acid position 237 in SEQ ID NO: 1; dddd) a histidine (H) at a position corresponding to amino acid position 246 in SEQ ID NO: 1; eeee) an isoleucine (I) at a position corresponding to amino acid position 254 in SEQ ID NO: 1; ffff) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 1; gggg) an arginine (R) at a position corresponding to amino acid position 267 in SEQ ID NO: 1; hhhh) an isoleucine (I) at a position corresponding to amino acid position 278 in SEQ ID NO: 1; iiii) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 1; jjjj) a phenylalanine (F) at a position corresponding to amino acid position 282 in SEQ ID NO: 1; kkkk) a threonine (T) at a position corresponding to amino acid position 289 in SEQ ID NO: 1; llll) a glycine (G) at a position corresponding to amino acid position 308 in SEQ ID NO: 1; mmmm) an arginine (R) at a position corresponding to amino acid position 311 in SEQ ID NO: 1; nnnn) an alanine (A) at a position corresponding to amino acid position 312 in SEQ ID NO: 1; oooo) an alanine (A) at a position corresponding to amino acid position 316 in SEQ ID NO: 1; pppp) an arginine (R) at a position corresponding to amino acid position 318 in SEQ ID NO: 1; qqqq) an alanine (A) at a position corresponding to amino acid position 334 in SEQ ID NO: 1; rrrr) a phenylalanine (F) at a position corresponding to amino acid position 339 in SEQ ID NO: 1; ssss) a glycine (G) or a leucine (L) at a position corresponding to amino acid position 340 in SEQ ID NO: 1; tttt) a serine (S) at a position corresponding to amino acid position 342 in SEQ ID NO: 1; uuuu) an asparagine (N) at a position corresponding to amino acid position 345 in SEQ ID NO: 1; vvvv) an asparagine (N) at a position corresponding to amino acid position 346 in SEQ ID NO: 1; wwww) an asparagine (N) at a position corresponding to amino acid position 348 in SEQ ID NO: 1; or, xxxx) any combination of a) to wwww).

In another embodiment, the invention concerns the isolated or recombinant polynucleotide of embodiment 1, and its corresponding polypeptide, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide further comprises at least one amino acid modification described herein, such as those shown in FIG. 5A-FIG. 5E, FIG. 9A-FIG. 9N, FIG. 10A-FIG. 10D, FIG. 11, FIG. 12, FIG. 13, FIG. 14A-FIG. 14F and FIG. 15A-FIG. 15E, as well as any known I-CreI type modification and any combination thereof.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide, and its corresponding polypeptide, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 430, 431, 432 and 433.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide of the present disclosure, and its corresponding polypeptide, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein the polypeptide is capable of recognizing and cleaving a meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 85, SEQ ID NO: 269, SEQ ID NO: 281, SEQ ID NO: 331, SEQ ID NO: 358, SEQ ID NO: 390, SEQ ID NO: 423 or SEQ ID NO: 424.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide of the present disclosure, and its corresponding polypeptide, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide has an increased meganuclease activity when compared to a control meganuclease that lacks said amino acid modification. The control meganuclease can be selected from the group of SEQ ID NO: 1, SEQ ID NO: 86, SEQ ID NO: 250, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 329, SEQ ID NO: 356, SEQ ID NO: 389, SEQ ID NO: 429 or SEQ ID NO: 435 or any I-CreI type meganuclease. Increased meganuclease activity can be evidenced by any method for measuring meganuclease activity, including but not limited to a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or, b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or, c) a higher in-vitro cutting when compared to the control meganuclease that lacks said amino acid modification; or, d) any combination of those methods. Furthermore, increased activity can be measured at a wide range of temperatures, such as 16° C., 24° C., 28° C., 30° C. or 37° C., and at temperatures between 16° C. and 37° C.
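
The comparison of a variant against a control at several temperatures can be sketched as follows. This Python example is illustrative only and is not taken from the patent: the function names are hypothetical, the scores are made-up unitless numbers standing in for any one of the activity measures named above, and "increased" is assumed to mean simply a strictly higher score than the control at the same temperature.

```python
# Temperatures (in degrees C) named in the text above.
TEMPERATURES_C = (16, 24, 28, 30, 37)


def temperatures_with_increase(variant_scores, control_scores,
                               temperatures=TEMPERATURES_C):
    """Return the temperatures at which the variant scores above the control.

    Both inputs are hypothetical dicts mapping temperature (degrees C) to an
    activity score; temperatures missing from either dict are skipped.
    """
    return [t for t in temperatures
            if t in variant_scores and t in control_scores
            and variant_scores[t] > control_scores[t]]


def increased_at_all(variant_scores, control_scores,
                     temperatures=TEMPERATURES_C):
    """True only if the variant outscores the control at every temperature."""
    hits = temperatures_with_increase(variant_scores, control_scores,
                                      temperatures)
    return len(hits) == len(temperatures)


if __name__ == "__main__":
    # Made-up scores for illustration; not data from the patent.
    variant = {16: 0.6, 24: 0.8, 28: 0.9, 30: 0.9, 37: 0.7}
    control = {16: 0.2, 24: 0.5, 28: 0.9, 30: 0.8, 37: 0.7}
    print(temperatures_with_increase(variant, control))
    print(increased_at_all(variant, control))
```

A real comparison would also need replicates and a statistical criterion, which this sketch leaves out.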

In another embodiment, the invention concerns an isolated or recombinant polynucleotide, further comprising a nucleotide sequence encoding a N-terminal nuclear transit peptide and/or a nucleotide sequence encoding a C-terminal histidine tag.

In another embodiment, the invention concerns a recombinant DNA construct, comprising the isolated or recombinant polynucleotide of the present disclosure. The recombinant DNA construct can further comprise a promoter operably linked to said polynucleotide. The promoter can be heterologous with respect to the recombinant polynucleotide.

In another embodiment, the invention concerns a cell, plant cell, yeast cell, plant, yeast or seed comprising the recombinant construct of the present disclosure. The plant cell can be a monocot or a dicot plant cell. The monocot plant cell can be from maize, wheat, rice, barley, sugarcane, sorghum, or rye. The dicot cell can be from soybean, Brassica, sunflower, cotton, or alfalfa.

In another embodiment, the invention concerns plants comprising the recombinant construct of the present disclosure and seeds, plant extracts, or explants obtained from such plants.

In another embodiment, the invention concerns a method for producing a meganuclease having increased activity over a range of temperatures, the method comprising:

    • a) producing a variant meganuclease by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; and,
    • b) selecting said variant meganuclease from step a) and screening said variant meganuclease for the ability to cleave a DNA target sequence over a range of temperatures between and including 16° C. to 37° C.

In another embodiment, the invention concerns a method for producing a meganuclease having an increased meganuclease activity when compared to a control meganuclease, the method comprising:

    • a) producing a variant meganuclease by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; and,
    • b) selecting the variant meganuclease from step a) and screening said variant for increased meganuclease activity when compared to a control meganuclease.
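The screening step b) of the two methods above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `improved_over_range`, the variant names, and the scores are hypothetical and are not data from this disclosure, and requiring a higher score than the control at every tested temperature is one possible selection rule, not necessarily the one used by the inventors.

```python
def improved_over_range(scores, control, low=16.0, high=37.0):
    """Return variants scoring higher than the control at every shared
    temperature (in degrees C) within [low, high].

    scores maps a meganuclease name to {temperature: activity score}.
    The "higher at every temperature" rule is an assumption for illustration.
    """
    control_scores = scores[control]
    selected = []
    for name, by_temp in scores.items():
        if name == control:
            continue
        # Only compare temperatures in range that were measured for both.
        shared = [t for t in by_temp if low <= t <= high and t in control_scores]
        if shared and all(by_temp[t] > control_scores[t] for t in shared):
            selected.append(name)
    return sorted(selected)


# Hypothetical activity scores (for example, yeast assay scores).
example = {
    "parent": {16: 0, 28: 1, 37: 2},
    "variant_A": {16: 1, 28: 3, 37: 4},
    "variant_B": {16: 0, 28: 2, 37: 3},
}
print(improved_over_range(example, "parent"))
```

Here `variant_B` is not selected because it does not exceed the control at 16° C.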

In another embodiment, the invention concerns a method of introducing a double-strand break in the genome of a yeast or plant cell, said method comprising:

    • a) contacting at least one plant or yeast cell comprising in its genome a meganuclease recognition site with a variant meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402 and 403, wherein the variant meganuclease is capable of inducing a double-strand break in said recognition site; and,
    • b) selecting the yeast or plant cell from a) and screening said yeast or plant cell for any modification of said recognition sequence.

In another embodiment, the invention concerns a method of integrating a polynucleotide of interest into a recognition site in the genome of a plant or yeast cell, the method comprising:

    • a) contacting at least one plant or yeast cell comprising in its genome a meganuclease recognition site with:

      • (i) a variant meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402 and 403, wherein the variant meganuclease is capable of inducing a double-strand break in said recognition site; and,
      • (ii) a DNA fragment containing a polynucleotide of interest;
    • b) selecting at least one plant or yeast cell comprising integration of the polynucleotide of interest cassette at the recognition site.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide, and its corresponding polypeptide, encoding a meganuclease polypeptide, said polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 16, 22, 50, 56, 59, 71, 81, 103, 121, 153, 185, 209, 222, 246, 258, 281, 308, 316, 345, 346, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 2.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 86 selected from the group consisting of positions 2, 12, 16, 22, 23, 36, 43, 50, 56, 58, 59, 72, 73, 81, 86, 91, 95, 103, 113, 114, 120, 121, 124, 128, 129, 131, 151, 153, 200, 204, 209, 232, 236, 237, 246, 254, 258, 267, 281, 308, 311, 312, 316, 319, 334, 339, 340, 342, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 85.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 270 selected from the group consisting of positions 16, 22, 50, 71, 185, 246, 258, 316 and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 269.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 329 selected from the group consisting of positions 12, 32, 50, 56, 80, 105, 124, 129, 131, 153, 185, 311, 316, 318, 340, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 328.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 356 selected from the group consisting of positions 12, 24, 36, 50, 56, 62, 73, 80, 124, 129, 147, 182, 203, 237, 252, 311, 316, 318, 340, 348, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 355.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 389 selected from the group consisting of positions 12, 50, 56, 124, 129, 131, 153, 211, 237, 311, 316, 318, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 388.

In another embodiment, the invention concerns an isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 429 selected from the group consisting of positions 16, 22, 50, 71, 185, 246, 258, 316 and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 423.
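The position-based embodiments above refer to modifications at positions corresponding to positions of a parental sequence. A minimal Python sketch of that bookkeeping follows. It assumes the two sequences are already aligned to equal length with no gaps, uses 1-based positions, and works on short toy sequences that are hypothetical and are not any SEQ ID NO of this disclosure; the function names are likewise illustrative.

```python
def modified_positions(parent, variant):
    """Return 1-based positions where the variant differs from the parent.

    Assumes both sequences are pre-aligned, equal-length, and gap-free.
    """
    if len(parent) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [i for i, (p, v) in enumerate(zip(parent, variant), start=1) if p != v]


def has_listed_modification(parent, variant, listed_positions):
    """True if at least one modified position is in the listed set."""
    return any(pos in listed_positions for pos in modified_positions(parent, variant))


# Toy sequences for illustration only.
parent = "MNTKYNKEFL"
variant = "MSTKYNREFL"
print(modified_positions(parent, variant))
print(has_listed_modification(parent, variant, {2, 12, 16}))
```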

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTING

The invention can be more fully understood from the following detailed description and the accompanying drawings and Sequence Listing, which form a part of this application. The sequence descriptions and sequence listing attached hereto comply with the rules governing nucleotide and amino acid sequence disclosures in patent applications as set forth in 37 C.F.R. §§ 1.821-1.825. The sequence descriptions contain the three letter codes for amino acids as defined in 37 C.F.R. §§ 1.821-1.825, which are incorporated herein by reference.

FIG. 1A-FIG. 1B show an amino acid alignment of I-CreI meganuclease (I-CreI.pro, SEQ ID NO: 3) with related meganucleases (SEQ ID NOS: 4-13) from various species. The decoration shows amino acid residues sharing identity.

FIG. 2 shows a diagram representing the yeast screening system used to determine the meganuclease activity in yeast. Gene fragments corresponding to the first 1000 nucleotides of the yeast Ade2 coding sequence (Ade2 5′ fragment) and the last 1011 nucleotides of the yeast Ade2 coding sequence (Ade2 3′ fragment) were disrupted by a fragment including the yeast ura3 gene (Ura3) and meganuclease recognition sites for I-SceI.

FIG. 3 shows the numerical scale and corresponding white sectoring of yeast colonies used to quantify meganuclease activity. Since the sectoring phenotype is a qualitative measure of meganuclease activity, a 0-4 numerical scoring system was implemented. A score of 0 indicates that no white sectors (no meganuclease cutting) were observed; a score of 4 indicates completely white colonies (complete cutting of the recognition site); scores of 1-3 indicate intermediate white sectoring phenotypes (and intermediate degrees of recognition site cutting).
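The 0-4 scoring scale described for FIG. 3 can be written down as a small lookup. The following Python sketch is illustrative only; the function name `describe_yeast_score` and the wording of the returned strings are assumptions, while the meaning of each score follows the description above.

```python
def describe_yeast_score(score):
    """Map a 0-4 yeast colony sectoring score to its described meaning."""
    if not isinstance(score, int) or not 0 <= score <= 4:
        raise ValueError("score must be an integer from 0 to 4")
    if score == 0:
        return "no white sectors; no meganuclease cutting observed"
    if score == 4:
        return "completely white colony; complete cutting of the recognition site"
    # Scores 1-3 cover the intermediate phenotypes.
    return "intermediate white sectoring; intermediate recognition site cutting"


for s in range(5):
    print(s, describe_yeast_score(s))
```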

FIG. 4 shows the meganuclease expression plasmid pVER8134.

FIG. 5A-FIG. 5E show an amino acid alignment of the parental LIG3-4 (LIG3-4.pro, SEQ ID NO: 1) and LIG3-4 meganuclease variants (Table 1A, SEQ ID NOS: 14-38). The name of the meganuclease listed in FIG. 5A-FIG. 5E corresponds to the name in Table 1A but includes a “.pro” to indicate that this is a protein alignment.

FIG. 6A-FIG. 6C show the percent cleavage by the parental LIG3-4 and LIG3-4 variant meganucleases (B65=LIG3-4(B65); hit15=LIG3-4(15); hit7=LIG3-4(7)) of plasmid DNA substrate at 0, 25, 50 and 75 minutes averaged across three replicates of real-time PCR. FIG. 6A shows the % cleavage observed at 23° C. FIG. 6B shows the % cleavage observed at 28° C. FIG. 6C shows the % cleavage observed at 37° C.

FIG. 7A-FIG. 7C show the percent cleavage by the parental LIG3-4 and LIG3-4 variant meganucleases (B65=LIG3-4(B65); hit15=LIG3-4(15); hit7=LIG3-4(7)) of genomic DNA substrate at 50 minutes averaged across three replicates of real-time PCR. FIG. 7A shows the % cleavage observed at 23° C. FIG. 7B shows the % cleavage observed at 28° C. FIG. 7C shows the % cleavage observed at 37° C.
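The percent cleavage values in FIG. 6 and FIG. 7 are described as averages across three real-time PCR replicates. A minimal Python sketch of that averaging step is shown below. The replicate values are hypothetical placeholders, not measurements from this disclosure, and the function name `mean_cleavage` is an assumption.

```python
from statistics import mean


def mean_cleavage(replicates):
    """Average percent cleavage per time point.

    replicates maps a time point (minutes) to a list of percent cleavage
    values, one per real-time PCR replicate.
    """
    return {t: mean(values) for t, values in sorted(replicates.items())}


# Hypothetical replicate values at the 0, 25, 50 and 75 minute time points.
example = {
    0: [0.0, 0.0, 0.0],
    25: [10.0, 12.0, 14.0],
    50: [30.0, 33.0, 36.0],
    75: [50.0, 55.0, 60.0],
}
print(mean_cleavage(example))
```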

FIG. 8A shows a schematic outline of long fragment PCR reactions used to confirm UBI:moPAT:PinII cassette integration at the endogenous LIG3-4 recognition site. FIG. 8B shows the results of long fragment PCR on callus from four events where integration occurred at the recognition site. The left panel of FIG. 8B shows the long junction fragment PCR on the HR1 side using a genomic primer (HRR1) and a moPAT primer (mopatR2); the right panel of FIG. 8B shows the long junction fragment PCR on the HR2 side (mopatF2/HR2R2). Primer set mopatF2/HR2R2 amplified a 4 kb fragment, spanning from the moPAT gene through the UBI intron, UBI promoter, and the HR2 sequence to the adjacent genomic region. Primer set HRR1/mopatR2 amplified a 2.2 kb fragment, spanning from the moPAT gene through the HR1 to the adjacent genomic region. The sizes of the two long PCR products indicate a perfect integration of the donor gene cassette at the LIG3-4 recognition site. Insertion was obtained in T0 and T1 plants from one of the callus events.

FIG. 9A-FIG. 9N show an amino acid alignment of the parental MHP77 and MHP77 meganuclease variants. Amino acid modifications of the variant meganucleases, when compared to the parental meganuclease MHP77, are shown. A (-) indicates that the amino acid residues of the variant and parental meganuclease were identical.

FIG. 10A-FIG. 10D show an amino acid alignment of the parental MHP14 and MHP14 meganuclease variants. Amino acid modifications of the variant meganucleases, when compared to the parental meganuclease MHP14, are shown. A (-) indicates that the amino acid residues of the variant and parental meganuclease were identical.

FIG. 11 provides an amino acid alignment of the parental MHP107 and MHP107 meganuclease variants. Amino acid modifications of the variant meganucleases, when compared to the parental meganuclease, are shown. A (-) indicates that the amino acid residues of the variant and parental meganuclease were identical.

FIG. 12 provides an amino acid alignment of the parental ZM6.3 and ZM6.3 meganuclease variants. Amino acid modifications of the variant meganucleases, when compared to the parental meganuclease, are shown. A (-) indicates that the amino acid residues of the variant and parental meganuclease were identical.

FIG. 13 provides an amino acid alignment of the parental ZM6.22v2 and ZM6.22v2 meganuclease variants. Amino acid modifications of the variant meganucleases, when compared to the parental meganuclease, are shown. A (-) indicates that the amino acid residues of the variant and parental meganuclease were identical.

FIG. 14A-FIG. 14F show an amino acid alignment of the LIG3-4 meganuclease (SEQ ID NO: 1) and multiple meganucleases. Amino acid modifications different from SEQ ID NO: 1 are shown. A (-) indicates that the amino acid residue of the meganuclease is identical to the LIG3-4 meganuclease (SEQ ID NO: 1). Highlighted in gray are mutations which were correlated with increased meganuclease activity on the desired target site.
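The alignment convention used in FIG. 9 through FIG. 14, where a (-) marks a residue identical to the parental sequence and only modifications are printed, can be reproduced with a short Python sketch. The sequences below are toy examples and not any SEQ ID NO of this disclosure; the function name `mask_identical` is hypothetical, and the sketch assumes equal-length, gap-free sequences.

```python
def mask_identical(parent, variant, same="-"):
    """Render a variant row showing only residues that differ from the parent.

    Positions identical to the parent are replaced by the `same` marker.
    Assumes both sequences are pre-aligned, equal-length, and gap-free.
    """
    if len(parent) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    return "".join(same if p == v else v for p, v in zip(parent, variant))


# Toy sequences for illustration only.
parent = "MNTKYNKEFL"
variant = "MSTKYNREFL"
print(parent)
print(mask_identical(parent, variant))
```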

FIG. 15A-FIG. 15D show an amino acid alignment of some meganucleases comprising a linker polypeptide that links the two re-engineered I-CreI monomers into a single amino acid chain. FIG. 15E shows the percent identity of some variant (MHP14(10), MHP77(L9-01)) and parental (LIG3-4, MHP14, MHP77) meganucleases. Highlighted in gray are the novel linker sequences present in variants MHP14(10) and MHP77(L9-01).
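As a rough illustration of a percent identity comparison such as the one reported in FIG. 15E, the Python sketch below counts matching residues over two equal-length, pre-aligned sequences. This simple definition is an assumption for illustration: alignment programs differ in how they treat gaps and which length they use as the denominator, and the source does not state which method produced FIG. 15E. The sequences are toy examples and the function name is hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of aligned positions with identical residues.

    Assumes the sequences are pre-aligned, equal-length, and gap-free.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Toy sequences differing at 2 of 10 positions.
print(percent_identity("MNTKYNKEFL", "MSTKYNREFL"))
```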

FIG. 16 shows the structural motifs of the meganuclease.

SEQUENCES

SEQ ID NO: 1 is the amino acid sequence of the single chain LIG3-4 meganuclease fusion polypeptide.

SEQ ID NO: 2 is the nucleotide sequence of the LIG3-4 recognition sequence.

SEQ ID NO: 3 is the amino acid sequence of the I-CreI meganuclease monomer.

SEQ ID NO: 4 is the amino acid sequence of gi_18654305.

SEQ ID NO: 5 is the amino acid sequence of gi_108773071.

SEQ ID NO: 6 is the amino acid sequence of gi_108773352.

SEQ ID NO: 7 is the amino acid sequence of gi_108796958.

SEQ ID NO: 8 is the amino acid sequence of gi_12667512.

SEQ ID NO: 9 is the amino acid sequence of gi_18654311.

SEQ ID NO: 10 is the amino acid sequence of gi_150406493.

SEQ ID NO: 11 is the amino acid sequence of gi_110225678.

SEQ ID NO: 12 is the amino acid sequence of gi_11467050.

SEQ ID NO: 13 is the amino acid sequence of gi_18654162.

SEQ ID NO: 14 is the amino acid sequence of the LIG3-4(7) meganuclease.


TABLE 1A
Listing of SEQ ID NOS: (NT = nucleotide sequence; AA = amino acid sequence) for parental and variant meganucleases.

Name | AA SEQ ID NO: | DNA for expression in yeast SEQ ID NO:
LIG3-4 | 1 | 40
LIG3-4(B65) | 27 | 54
LIG3-4(B70) | 28 | 55
LIG3-4(B75) | 31 | 58
LIG3-4(B76) | 32 | 59
LIG3-4(B73) | 30 | 57
LIG3-4(B82) | 34 | 61
LIG3-4(B78) | 33 | 60
LIG3-4(B1) | 18 | 45
LIG3-4(15) | 15 | 42
LIG3-4(D8) | 38 | 65
LIG3-4(B15) | 19 | 46
LIG3-4(C1) | 35 | 62
LIG3-4(B71) | 29 | 56
LIG3-4(B39) | 24 | 51
LIG3-4(B16) | 20 | 47
LIG3-4(D7) | 37 | 64
LIG3-4(B38) | 23 | 50
LIG3-4(B40) | 25 | 52
LIG3-4(B36) | 22 | 49
LIG3-4(B24) | 21 | 48
LIG3-4(B55) | 26 | 53
LIG3-4(A4) | 16 | 43
LIG3-4(D5) | 36 | 63
LIG3-4(7) | 14 | 41
LIG3-4(A6) | 17 | 44
MHP77 | 86 | 168
MHP77(L72-01a) | 87 | 169
MHP77(L72-08a) | 88 | 170
MHP77(L72-09a) | 89 | 171
MHP77(L73-02a) | 90 | 172
MHP77(L73-05a) | 91 | 173
MHP77(L9-01) | 92 | 174
MHP77(L9-02) | 93 | 175
MHP77(L9-03) | 94 | 176
MHP77(L9-04) | 95 | 177
MHP77(L9-06) | 96 | 178
MHP77(L9-09) | 97 | 179
MHP77(L9-10) | 98 | 180
MHP77(L9-11) | 99 | 181
MHP77(L9-12) | 100 | 182
MHP77(L112-03a) | 101 | 183
MHP77(L113-01) | 102 | 184
MHP77(L13-01a) | 103 | 185
MHP77(L13-02) | 104 | 186
MHP77(L13-04) | 105 | 187
MHP77(L13-06) | 106 | 188
MHP77(L13-08a) | 107 | 189
MHP77(L13-10B1) | 108 | 190
MHP77(L13-11) | 109 | 191
MHP77(L13-12) | 110 | 192
MHP77(L15-02) | 111 | 193
MHP77(L15-03) | 112 | 194
MHP77(L15-05) | 113 | 195
MHP77(L15-06) | 114 | 196
MHP77(L15-08) | 115 | 197
MHP77(L15-10) | 116 | 198
MHP77(L15-11) | 117 | 199
MHP77(L15-12) | 118 | 200
MHP77(L15-13) | 119 | 201
MHP77(L15-15) | 120 | 202
MHP77(L15-16) | 121 | 203
MHP77(L15-18) | 122 | 204
MHP77(L15-20) | 123 | 205
MHP77(L15-21) | 124 | 206
MHP77(L15-23) | 125 | 207
MHP77(L15-24) | 126 | 208
MHP77(L15-28) | 127 | 209
MHP77(L15-29) | 128 | 210
MHP77(L15-33) | 129 | 211
MHP77(L15-34) | 130 | 212
MHP77(L15-35) | 131 | 213
MHP77(L15-36) | 132 | 214
MHP77(L15-39) | 133 | 215
MHP77(L15-40) | 134 | 216
MHP77(L15-41) | 135 | 217
MHP77(L15-42) | 136 | 218
MHP77(L15-43) | 137 | 219
MHP77(L15-45) | 138 | 220
MHP77(L15-46) | 139 | 221
MHP77(L15-27) | 140 | 222
MHP77(L15-30) | 141 | 223
MHP77(L15-31) | 142 | 224
MHP77(L15-47) | 143 | 225
MHP77(L16-01) | 144 | 226
MHP77(L16-02) | 145 | 227
MHP77(L16-03) | 146 | 228
MHP77(L16-04) | 147 | 229
MHP77(L16-05) | 148 | 230
MHP77(L16-06) | 149 | 231
MHP77(L16-07) | 150 | 232
MHP77(L16-08) | 151 | 233
MHP77(L16-09) | 152 | 234
MHP77(L16-11) | 153 | 235
MHP77(L16-12) | 154 | 236
MHP77(L16-14) | 155 | 237
MHP77(L16-15) | 156 | 238
MHP77(L16-16) | 157 | 239
MHP77(L16-17) | 158 | 240
MHP77(L16-18) | 159 | 241
MHP77(L16-19) | 160 | 242
MHP77(L16-21) | 161 | 243
MHP77(L16-23) | 162 | 244
MHP77(L16-24) | 163 | 245
MHP77(L17-12) | 164 | 246
MHP77(L18-01) | 165 | 247
MHP77(L18-12) | 166 | 248
MHP77(L17-01) | 167 | 249

SEQ ID NO: 39 is the plant optimized nucleotide sequence of LIG3-4 comprising a nuclear localization signal and an intron.

SEQ ID NO: 66 is the nucleotide sequence of MN031 primer.

SEQ ID NO: 67 is the nucleotide sequence of MN022 primer.

SEQ ID NO: 68 is the nucleotide sequence of plasmid pVER8134.

SEQ ID NO: 69 is the nucleotide sequence of a nuclear localization signal.

SEQ ID NO: 70 is the amino acid sequence of a nuclear localization signal.

SEQ ID NO: 71 is the amino acid sequence of 6× histidine tag.

SEQ ID NO: 72 is the nucleotide sequence of a nuclear localization signal in maize.

SEQ ID NO: 73 is the plant-optimized nucleotide sequence of the LIG3-4(7) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 74 is the plant-optimized nucleotide sequence of the LIG3-4(15) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 75 is the plant-optimized nucleotide sequence of the LIG3-4(B65) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 76 is the nucleotide sequence of plasmid PHP46961.

SEQ ID NO: 77 is the nucleotide sequence of LIG3-4(HR1).

SEQ ID NO: 78 is the nucleotide sequence of LIG3-4(HR2).

SEQ ID NO: 79 is the nucleotide sequence of LIG3-4 target site qPCR probe.

SEQ ID NO: 80 is the nucleotide sequence of Lig3-4_forward primer.

SEQ ID NO: 81 is the nucleotide sequence of Lig3-4_reverse primer.

SEQ ID NO: 82 is the nucleotide sequence of yeast ade2.

SEQ ID NO: 83 is the nucleotide coding sequence of ade2.

SEQ ID NO: 84 is the nucleotide sequence of plasmid pHD1327.

SEQ ID NO: 85 is the nucleotide sequence of the MHP77 recognition site.

SEQ ID NO: 86 is the amino acid sequence of the MHP77 meganuclease.

SEQ ID NO: 250 is the amino acid sequence of the MHP77.3 meganuclease.

SEQ ID NO: 251 is the amino acid sequence of the MHP77.3 (L9-02) meganuclease.

SEQ ID NO: 252 is the amino acid sequence of the MHP77.3 (L9-11) meganuclease.

SEQ ID NO: 253 is the amino acid sequence of the MHP77.3 (L9-12) meganuclease.

SEQ ID NO: 254 is the plant-optimized nucleotide sequence of MHP77 comprising a nuclear localization signal and lacking an intron.

SEQ ID NO: 255 is the plant-optimized nucleotide sequence of the MHP77.3 meganuclease comprising a nuclear localization signal and lacking an intron.

SEQ ID NO: 256 is the plant-optimized nucleotide sequence of MHP77(L9-02) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 257 is the plant-optimized nucleotide sequence of the MHP77 (L9-11) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 258 is the plant-optimized nucleotide sequence of the MHP77 (L9-12) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 259 is the plant-optimized nucleotide sequence of MHP77.3 (L9-02) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 260 is the plant-optimized nucleotide sequence of the MHP77.3 (L9-11) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 261 is the plant-optimized nucleotide sequence of the MHP77.3 (L9-12) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 262 is the amino acid sequence of the MHP77.3(15) meganuclease.

SEQ ID NO: 263 is the plant-optimized nucleotide sequence of MHP77.3(15) meganuclease comprising a nuclear localization signal and an intron.

SEQ ID NO: 264 is the nucleotide sequence of the MHP77HR1.

SEQ ID NO: 265 is the nucleotide sequence of the MHP77HR2.

SEQ ID NO: 266 is the nucleotide sequence of the MHP77 target site qPCR probe.

SEQ ID NO: 267 is the nucleotide sequence of the MHP77_forward primer.

SEQ ID NO: 268 is the nucleotide sequence of the MHP77_reverse primer.

SEQ ID NO: 269 is the nucleotide sequence of the MS26 recognition site.

SEQ ID NO: 270 is the amino acid sequence of the MS26+ meganuclease.

SEQ ID NO: 271 is the amino acid sequence of the MS26++ meganuclease.

SEQ ID NO: 272 is the amino acid sequence of the MS26+ (7) meganuclease.

SEQ ID NO: 273 is the amino acid sequence of the MS26+ (15) meganuclease.

SEQ ID NO: 274 is the amino acid sequence of the MS26+ (B65) meganuclease.

SEQ ID NO: 275 is the amino acid sequence of the MS26++ (15) meganuclease.

SEQ ID NO: 276 is the plant-optimized nucleotide sequence of MS26+ with no intron.

SEQ ID NO: 419 is the plant-optimized nucleotide sequence of MS26+ (7) with no intron.

SEQ ID NO: 277 is the plant-optimized nucleotide sequence of MS26+ (15) with no intron.

SEQ ID NO: 278 is the plant-optimized nucleotide sequence of MS26+ (B65) with no intron.

SEQ ID NO: 279 is the plant-optimized nucleotide sequence of MS26++ with no intron.

SEQ ID NO: 280 is the plant-optimized nucleotide sequence of MS26++ (15) with no intron.

SEQ ID NO: 281 is the nucleotide sequence of the MHP14 recognition site.


TABLE 1B
Listing of SEQ ID NOS: (NT = nucleotide sequence; AA = amino acid sequence) for parental and variant meganucleases.

Name | AA SEQ ID NO: | DNA for expression in yeast SEQ ID NO:
MHP14 | 282 | 299
MHP14+ | 283 |
MHP14(01) | 284 | 300
MHP14(02) | 285 | 301
MHP14(03) | 286 | 302
MHP14(04) | 287 | 303
MHP14(06) | 288 | 304
MHP14(07) | 289 | 305
MHP14(08) | 290 | 306
MHP14(09) | 291 | 307
MHP14(10) | 292 | 308
MHP14(12) | 293 | 309
MHP14(13) | 294 | 310
MHP14(14) | 295 | 311
MHP14(L14-03) | 296 | 312
MHP14(L14-04) | 297 | 313
MHP14(L14-07) | 298 | 314

SEQ ID NO: 315 is the amino acid sequence of the MHP14+ (04) meganuclease.

SEQ ID NO: 316 is the amino acid sequence of the MHP14+ (06) meganuclease.

SEQ ID NO: 317 is the amino acid sequence of the MHP14+ (08) meganuclease.

SEQ ID NO: 318 is the amino acid sequence of the MHP14+ (12) meganuclease.

SEQ ID NO: 319 is the amino acid sequence of the MHP14+ (14) meganuclease.

SEQ ID NO: 320 is the amino acid sequence of the MHP14+ (15) meganuclease.

SEQ ID NO: 321 is the plant-optimized nucleotide sequence of MHP14 with an intron.

SEQ ID NO: 322 is the plant-optimized nucleotide sequence of MHP14+ (04) with an intron.

SEQ ID NO: 323 is the plant-optimized nucleotide sequence of MHP14+ (06) with an intron.

SEQ ID NO: 324 is the plant-optimized nucleotide sequence of MHP14+ (08) with an intron.

SEQ ID NO: 325 is the plant-optimized nucleotide sequence of MHP14+ (12) with an intron.

SEQ ID NO: 326 is the plant-optimized nucleotide sequence of MHP14+ (14) with an intron.

SEQ ID NO: 327 is the plant-optimized nucleotide sequence of MHP14+ (15) with an intron.

SEQ ID NO: 328 is the nucleotide sequence of the MHP107 recognition site.


TABLE 1C
Listing of SEQ ID NOS: (NT = nucleotide sequence; AA = amino acid sequence) for parental and variant meganucleases.

Name | AA SEQ ID NO: | DNA for expression in yeast SEQ ID NO:
MHP107 | 329 | 342
MHP107(C1) | 330 | 343
MHP107(C2) | 331 | 344
MHP107(C3) | 332 | 345
MHP107(C4) | 333 | 346
MHP107(C5) | 334 | 347
MHP107(C6) | 335 | 348
MHP107(D2) | 336 | 349
MHP107(D3) | 337 | 350
MHP107(D4) | 338 | 351
MHP107(D5) | 339 | 352
MHP107(D1) | 340 | 353
MHP107(D6) | 341 | 354

SEQ ID NO: 355 is the nucleotide sequence of the ZM6.3 recognition site.


TABLE 1D
Listing of SEQ ID NOS: (NT = nucleotide sequence; AA = amino acid sequence) for parental and variant meganucleases.

Name | AA SEQ ID NO: | DNA for expression in yeast SEQ ID NO:
ZM6.3 | 356 | 372
ZM6.3(G1) | 357 | 373
ZM6.3(G2) | 358 | 374
ZM6.3(G3) | 359 | 375
ZM6.3(G4) | 360 | 376
ZM6.3(G5) | 361 | 377
ZM6.3(G6) | 362 | 378
ZM6.3(H1) | 363 | 379
ZM6.3(H2) | 364 | 380
ZM6.3(H3) | 365 | 381
ZM6.3(H5) | 366 | 382
ZM6.3(H6) | 367 | 383
ZM6.3(1) | 368 | 384
ZM6.3(3) | 369 | 385
ZM6.3(4) | 370 | 386
ZM6.3(5) | 371 | 387

SEQ ID NO: 388 is the nucleotide sequence of the ZM6.22v2 recognition site.


TABLE 1E
Listing of SEQ ID NOS: (NT = nucleotide sequence; AA = amino acid sequence) for parental and variant meganucleases.

Name | AA SEQ ID NO: | DNA for expression in yeast SEQ ID NO:
ZM6.22v2 | 389 | 404
ZM6.22v2(J2) | 390 | 405
ZM6.22v2(J3) | 391 | 406
ZM6.22v2(J4) | 392 | 407
ZM6.22v2(J5) | 393 | 408
ZM6.22v2(I2) | 394 | 409
ZM6.22v2(I3) | 395 | 410
ZM6.22v2(I4) | 396 | 411
ZM6.22v2(I5) | 397 | 412
ZM6.22v2(I6) | 398 | 413
ZM6.22v2(I7) | 399 | 414
ZM6.22v2(I8) | 400 | 415
ZM6.22v2(I9) | 401 | 416
ZM6.22v2(J7) | 402 | 417
ZM6.22v2(J8) | 403 | 418

SEQ ID NO: 419 is the nucleotide sequence of the MS26+ (7) variant meganuclease with no intron.

SEQ ID NO: 420 is the nucleotide sequence of the linker polypeptide of LIG3-4, MHP14, MHP77.

SEQ ID NO: 421 is the nucleotide sequence of the linker polypeptide of MHP14(10).

SEQ ID NO: 422 is the nucleotide sequence of the linker polypeptide of MHP77(L9-01).

SEQ ID NO: 423 is the nucleotide sequence of the TS21 recognition site in soybean genome.

SEQ ID NO: 424 is the nucleotide sequence of the TS14 recognition site in soybean genome.

SEQ ID NO: 425 is the plant-optimized nucleotide sequence of the TS21 meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 426 is the plant-optimized nucleotide sequence of the TS21(7) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 427 is the plant-optimized nucleotide sequence of the TS21(15) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 428 is the plant-optimized nucleotide sequence of the TS21(B65) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 429 is the amino acid sequence of the TS21 meganuclease.

SEQ ID NO: 430 is the amino acid sequence of the TS21(7) meganuclease.

SEQ ID NO: 431 is the amino acid sequence of the TS21(15) meganuclease.

SEQ ID NO: 432 is the amino acid sequence of the TS21(B65) meganuclease.

SEQ ID NO: 433 is the plant-optimized nucleotide sequence of TS14 meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 434 is the plant-optimized nucleotide sequence of TS14(15) meganuclease with a nuclear localization signal and an intron.

SEQ ID NO: 435 is the amino acid sequence of the TS14 meganuclease.

SEQ ID NO: 436 is the amino acid sequence of the TS14(15) meganuclease.

DETAILED DESCRIPTION OF THE INVENTION

The present inventions now will be described more fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the inventions are shown. Indeed, these inventions may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will satisfy applicable legal requirements. Like numbers refer to like elements throughout.

Many modifications and other embodiments of the inventions set forth herein will come to mind to one skilled in the art to which these inventions pertain having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is to be understood that the inventions are not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.

All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

As used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to “a plant” includes a plurality of such plants; reference to “a cell” includes one or more cells and equivalents thereof known to those skilled in the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, specific examples of appropriate materials and methods are described herein.

In the context of this disclosure, a number of terms and abbreviations are used. The following definitions are provided.

I. Overview

Compositions and methods comprising polynucleotides and polypeptides having meganuclease activity are provided. Also provided are compositions with increased meganuclease activity and methods of use. Further provided are nucleic acid constructs, yeasts, plants, plant cells, explants, seeds and grain having the meganuclease sequences. The methods and compositions employ endonucleases capable of inducing a double-strand break at a recognition sequence within a DNA fragment or within the genome of a yeast cell, plant, plant cell or seed.

II. Compositions

As used herein, an “isolated” polynucleotide or polypeptide, or biologically active portion thereof, is substantially or essentially free from components that normally accompany or interact with the polynucleotide or polypeptide as found in its naturally occurring environment. Thus, an isolated or purified polynucleotide or polypeptide is substantially free of other cellular material or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Optimally, an “isolated” polynucleotide is free of sequences (optimally protein encoding sequences) that naturally flank the polynucleotide (i.e., sequences located at the 5′ and 3′ ends of the polynucleotide) in the genomic DNA of the organism from which the polynucleotide is derived. For example, in various embodiments, the isolated polynucleotide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequences that naturally flank the polynucleotide in genomic DNA of the cell from which the polynucleotide is derived. A polypeptide that is substantially free of cellular material includes preparations of polypeptides having less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating protein. When the polypeptide of the invention or biologically active portion thereof is recombinantly produced, optimally culture medium represents less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of chemical precursors or non-protein-of-interest chemicals.

As used herein, a polynucleotide or polypeptide is “recombinant” when it is artificial or engineered, or derived from an artificial or engineered protein or nucleic acid. For example, a polynucleotide that is inserted into a vector or any other heterologous location, e.g., in a genome of a recombinant organism, such that it is not associated with nucleotide sequences that normally flank the polynucleotide as it is found in nature, is a recombinant polynucleotide. A polypeptide expressed in vitro or in vivo from a recombinant polynucleotide is an example of a recombinant polypeptide. Likewise, a polynucleotide sequence that does not appear in nature, for example, a variant of a naturally occurring gene, is recombinant.

A “subsequence” or “fragment” is any portion of an entire sequence.

The terms “target site”, “target sequence”, “genomic target site” and “genomic target sequence” are used interchangeably herein and refer to a polynucleotide sequence in the genome of a plant cell or yeast cell that comprises a recognition sequence for a double-strand break inducing agent.

As used herein, the term “recognition sequence” refers to a DNA sequence at which a double-strand break is induced in the plant cell genome by an endonuclease. The terms “recognition sequence”, “recognition site”, “recognition site for an endonuclease”, “meganuclease recognition sequence” and “meganuclease recognition site” are used interchangeably herein. The recognition site can be an endogenous site in the plant genome, or alternatively, the recognition site can be heterologous to the plant and thereby not be naturally occurring in the genome, or the recognition site can be found in a heterologous genomic location compared to where it occurs in nature. As used herein, the term “endogenous recognition site” refers to an endonuclease recognition site that is endogenous or native to the genome of a plant and is located at the endogenous or native position of that recognition site in the genome of the plant. The length of the recognition site can vary, and includes, for example, recognition sites that are at least 4, 6, 8, 10, 12, 14, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70 or more nucleotides in length. It is further possible that the recognition site could be palindromic, that is, the sequence on one strand reads the same in the opposite direction on the complementary strand. The nick/cleavage site could be within the recognition sequence or the nick/cleavage site could be outside of the recognition sequence. In another variation, the cleavage could occur at nucleotide positions immediately opposite each other to produce a blunt end cut or, in other cases, the incisions could be staggered to produce single-stranded overhangs, also called “sticky ends”, which can be either 5′ overhangs, or 3′ overhangs.
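The palindrome property described above (the sequence on one strand reads the same in the opposite direction on the complementary strand) can be checked by comparing a site with its reverse complement. The following is a minimal sketch in Python; the function names are hypothetical and are not part of the disclosure, and the example strings are generic test inputs rather than recognition sites of the invention.

```python
# Illustrative sketch only: function names are hypothetical, not from the disclosure.

# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(site: str) -> bool:
    """True when the site reads the same 5'->3' on both strands."""
    return site.upper() == reverse_complement(site)


if __name__ == "__main__":
    # GAATTC is the well-known EcoRI restriction site, used here as a generic test input.
    print(is_palindromic("GAATTC"))
    print(is_palindromic("GATTACA"))
```

A site for which `is_palindromic` returns False is simply non-palindromic; as the text notes, palindromy is only a possibility for a recognition site, not a requirement.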

In one embodiment, the recognition sequence of the endonuclease comprises the LIG3-4 (SEQ ID NO: 2), MHP77 (SEQ ID NO: 85), MS26 (SEQ ID NO: 269), MHP14 (SEQ ID NO: 281), MP107 (SEQ ID NO: 328), ZM6.3 (SEQ ID NO: 355) and/or ZM6.22V2 (SEQ ID NO: 388) recognition sites of maize and/or the TS21 (SEQ ID NO: 423) and/or the TS14 (SEQ ID NO: 424) recognition sites of soybean.

Active variants and fragments of the recognition sequence can comprise at least 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to the given recognition sequence, wherein the active variants retain biological activity and hence are capable of being recognized and cleaved by an endonuclease.

An “artificial target sequence” is a target sequence that has been introduced into the genome of a plant. Such an artificial target sequence can be identical in sequence to an endogenous or native target sequence in the genome of a plant but be located in a different position (i.e., a non-endogenous or non-native position) in the genome of a plant.

The terms “endogenous target sequence” and “native target sequence” are used interchangeably herein to refer to a target sequence that is endogenous or native to the genome of a plant and is at the endogenous or native position of that target sequence in the genome of the plant.

An “altered target sequence” refers to a target sequence that comprises at least one alteration when compared to a non-altered target sequence. Such “alterations” include, for example: (i) replacement of at least one nucleotide, (ii) a deletion of at least one nucleotide, (iii) an insertion of at least one nucleotide, or (iv) any combination of (i)-(iii).

The term “double-strand-break-inducing agent” as used herein refers to any nuclease which produces a double-strand break in the target sequence. Producing the double-strand break in a target sequence or other DNA can be referred to herein as “cutting” or “cleaving” the target sequence or other DNA.

An “endonuclease” refers to an enzyme that cleaves the phosphodiester bond within a polynucleotide chain.

Endonucleases include restriction endonucleases that cleave DNA at specific sites without damaging the bases. Restriction endonucleases include Type I, Type II, Type III, and Type IV endonucleases, which further include subtypes. In the Type I and Type III systems, both the methylase and restriction activities are contained in a single complex.

Type I and Type III restriction endonucleases recognize specific recognition sites, but typically cleave at a variable position from the recognition site, which can be hundreds of base pairs away from the recognition site. In Type II systems the restriction activity is independent of any methylase activity, and cleavage typically occurs at specific sites within or near to the recognition site. Most Type II enzymes cut palindromic sequences, however Type IIa enzymes recognize non-palindromic recognition sites and cleave outside of the recognition site, Type IIb enzymes cut sequences twice with both sites outside of the recognition site, and Type IIs enzymes recognize an asymmetric recognition site and cleave on one side and at a defined distance of about 1-20 nucleotides from the recognition site. Type IV restriction enzymes target methylated DNA. Restriction enzymes are further described and classified, for example in the REBASE database (webpage at rebase.neb.com; Roberts, et al., (2003) Nucleic Acids Res 31:418-420), Roberts, et al., (2003) Nucleic Acids Res 31:1805-1812, and Belfort, et al., (2002) in Mobile DNA II, pp. 761-783, Eds. Craigie, et al., (ASM Press, Washington, D.C.).

An “engineered endonuclease” refers to an endonuclease that is engineered (modified or derived) from its native form to specifically recognize and induce a double-strand break in the desired recognition site. Thus, an engineered endonuclease can be derived from a native, naturally-occurring endonuclease or it could be artificially created or synthesized. The modification of the endonuclease can be as little as one nucleotide. In some embodiments, the engineered endonuclease induces a double-strand break in a recognition site, wherein the recognition site was not a sequence that would have been recognized by a native (non-engineered or non-modified) endonuclease. Producing a double-strand break in a recognition site or other DNA can be referred to herein as “cutting” or “cleaving” the recognition site or other DNA.

A “meganuclease” refers to a homing endonuclease which, like a restriction endonuclease, binds and cuts at a specific recognition site; however, the recognition sites for meganucleases are typically longer, about 18 bp or more. In some embodiments of the invention, the meganuclease has been engineered (or modified) to cut a specific endogenous recognition sequence, wherein the endogenous target sequence prior to being cut by the engineered double-strand-break-inducing agent was not a sequence that would have been recognized by a native (non-engineered or non-modified) endonuclease.

A “meganuclease polypeptide” refers to a polypeptide having meganuclease activity and thus capable of producing a double-strand break in the recognition sequence.

Meganucleases have been classified into four families based on conserved sequence motifs: the LAGLIDADG, GIY-YIG, H-N-H, and His-Cys box families. These motifs participate in the coordination of metal ions and hydrolysis of phosphodiester bonds. Homing endonucleases (HEases) are notable for their long recognition sites, and for tolerating some sequence polymorphisms in their DNA substrates. The naming convention for meganucleases is similar to the convention for other restriction endonucleases. Meganucleases are also characterized by the prefix F-, I-, or PI- for enzymes encoded by free-standing open reading frames, introns, and inteins, respectively. For example, intron-, intein-, and freestanding gene encoded meganucleases from Saccharomyces cerevisiae are denoted I-SceI, PI-SceI, and F-SceII, respectively. Meganuclease domains, structure and function are known, see for example, Guhan and Muniyappa, (2003) Crit Rev Biochem Mol Biol 38:199-248; Lucas, et al., (2001) Nucleic Acids Res 29:960-969; Jurica and Stoddard, (1999) Cell Mol Life Sci 55:1304-1326; Stoddard, (2006) Q Rev Biophys 38:49-95; and Moure, et al., (2002) Nat Struct Biol 9:764. In some examples a naturally occurring variant, and/or engineered derivative meganuclease is used.
Methods for modifying the kinetics, cofactor interactions, expression, optimal conditions, and/or recognition site specificity, and screening for activity are known, see for example, Epinat, et al., (2003) Nucleic Acids Res 31:2952-2962; Chevalier, et al., (2002) Mol Cell 10:895-905; Gimble, et al., (2003) J Mol Biol 334:993-1008; Seligman, et al., (2002) Nucleic Acids Res 30:3870-3879; Sussman, et al., (2004) J Mol Biol 342:31-41; Rosen, et al., (2006) Nucleic Acids Res 34:4791-4800; Chames, et al., (2005) Nucleic Acids Res 33:e178; Smith, et al., (2006) Nucleic Acids Res 34:e149; Gruen, et al., (2002) Nucleic Acids Res 30:e29; Chen and Zhao, (2005) Nucleic Acids Res 33:e154; WO 2005/105989; WO 2003/078619; WO 2006/097854; WO 2006/097853; WO 2006/097784; and WO 2004/031346.
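The prefix convention stated above (F-, I-, and PI- for enzymes encoded by free-standing open reading frames, introns, and inteins, respectively) can be expressed as a simple lookup. This is an illustrative Python sketch; the function name and the dictionary are hypothetical helpers introduced here, and only the three prefix meanings come from the text.

```python
# Illustrative sketch only: helper names are hypothetical, not from the disclosure.

# Prefix meanings as stated in the text.
PREFIX_ORIGIN = {
    "F": "free-standing open reading frame",
    "I": "intron",
    "PI": "intein",
}


def encoding_origin(name: str) -> str:
    """Map a meganuclease name such as 'PI-SceI' to the element that encodes it."""
    prefix, separator, _rest = name.partition("-")
    if not separator or prefix not in PREFIX_ORIGIN:
        raise ValueError(f"unrecognized meganuclease prefix in {name!r}")
    return PREFIX_ORIGIN[prefix]


if __name__ == "__main__":
    # The three Saccharomyces cerevisiae examples given in the text.
    for enzyme in ("I-SceI", "PI-SceI", "F-SceII"):
        print(enzyme, "->", encoding_origin(enzyme))
```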

Any meganuclease can be used herein, including, but not limited to, I-SceI, I-SceII, I-SceIII, I-SceIV, I-SceV, I-SceVI, I-SceVIII, I-CeuI, I-CeuAIIP, I-CreI, I-CrepsbIP, I-CrepsbIIP, I-CrepsbIIIP, I-CrepsbIVP, I-TliI, I-PpoI, PI-PspI, F-SceI, F-SceII, F-SuvI, F-TevI, F-TevII, I-AmaI, I-AniI, I-ChuI, I-CmoeI, I-CpaI, I-CpaII, I-CsmI, I-CvuI, I-CvuAIP, I-DdiI, I-DdiII, I-DirI, I-DmoI, I-HmuI, I-HmuII, I-HsNIP, I-LlaI, I-MsoI, I-NaaI, I-NanI, I-NcIIP, I-NgrIP, I-NitI, I-NjaI, I-Nsp236IP, I-PakI, I-PboIP, I-PcuIP, I-PcuAI, I-PcuVI, I-PgrIP, I-PobIP, I-PorI, I-PorIIP, I-PbpIP, I-SpBetaIP, I-ScaI, I-SexIP, I-SneIP, I-SpomI, I-SpomCP, I-SpomIP, I-SpomIIP, I-SquIP, I-Ssp6803I, I-SthPhiJP, I-SthPhiST3P, I-SthPhiSTe3bP, I-TdeIP, I-TevI, I-TevII, I-TevIII, I-UarAP, I-UarHGPAIP, I-UarHGPA13P, I-VinIP, I-ZbiIP, PI-MtuI, PI-MtuHIP, PI-MtuHIIP, PI-PfuI, PI-PfuII, PI-PkoI, PI-PkoII, PI-Rma43812IP, PI-SpBetaIP, PI-SceI, PI-TfuI, PI-TfuII, PI-ThyI, PI-TliI, PI-TliII, or any active variants or fragments thereof. In a specific embodiment, the engineered endonuclease is derived from I-CreI having the sequence set forth in SEQ ID NO: 15, 21 or 26 or an active variant or fragment thereof.

TAL effector nucleases are a new class of sequence-specific nucleases that can be used to make double-strand breaks at specific target sequences in the genome of a plant or other organism. TAL effector nucleases are created by fusing a native or engineered transcription activator-like (TAL) effector, or functional part thereof, to the catalytic domain of an endonuclease, such as, for example, FokI. The unique, modular TAL effector DNA binding domain allows for the design of proteins with potentially any given DNA recognition specificity. Thus, the DNA binding domains of the TAL effector nucleases can be engineered to recognize specific DNA target sites and thus, used to make double-strand breaks at desired target sequences. See, WO 2010/079430; Morbitzer, et al., (2010) PNAS 10.1073/pnas.1013133107; Scholze and Boch, (2010) Virulence 1:428-432; Christian, et al., (2010) Genetics 186:757-761; Li, et al., (2010) Nuc. Acids Res. doi:10.1093/nar/gkq704; and Miller, et al., (2011) Nature Biotechnology 29:143-148; all of which are herein incorporated by reference.

The term “meganuclease activity” as used herein refers to the ability of a meganuclease to cut at a desired recognition sequence and thus retain double-strand-break-inducing activity.

Assays for meganuclease activity are known and generally measure the overall activity and specificity of the meganuclease on DNA substrates containing the recognition site. For example, the meganuclease activity can be measured using a yeast screening assay as described herein. Yeast cells with a functional Ade2 gene are white, whereas those lacking Ade2 function exhibit red pigmentation due to accumulation of a metabolite earlier in the adenine biosynthetic pathway, resulting in red colonies with white sectors as shown in FIGS. 2 and 3. The degree of white sectoring, sometimes extending to entire colonies, indicates the amount of meganuclease cutting activity. Since the sectoring phenotype is a qualitative measure of meganuclease activity, a 0-4 numerical scoring system was implemented. As shown in FIG. 3, a score of 0 indicates that no white sectors (no meganuclease cutting) were observed; a score of 4 indicates completely white colonies (complete cutting of the recognition site); scores of 1-3 indicate intermediate white sectoring phenotypes (and intermediate degrees of recognition site cutting). Meganuclease activity can also be measured in vitro as described herein. In short, time-course digestions can be carried out on plasmid DNA containing the meganuclease recognition site at 37° C., 28° C., and 23° C. and the % digestion of each sample or loss of meganuclease recognition sites (indicative of meganuclease activity) can be determined by real-time PCR. Furthermore, meganuclease activity can be measured in planta by determining the Target Site (TS) mutation rate. Target site mutation rate is defined as: (number of events with target site modification/total number of events)*100%.
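The target site mutation rate formula and the 0-4 sectoring scale described above can be sketched as follows. This is an illustrative Python sketch under stated assumptions: the function names are hypothetical, and the event counts in the usage example are made-up inputs, not results from the disclosure.

```python
# Illustrative sketch only: names and example counts are hypothetical.


def target_site_mutation_rate(modified_events: int, total_events: int) -> float:
    """(number of events with target site modification / total number of events) * 100%."""
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    if not 0 <= modified_events <= total_events:
        raise ValueError("modified_events must lie between 0 and total_events")
    return modified_events / total_events * 100.0


def describe_sectoring_score(score: int) -> str:
    """Interpret a yeast colony sectoring score on the 0-4 scale described in the text."""
    if score not in range(0, 5):
        raise ValueError("score must be an integer from 0 to 4")
    if score == 0:
        return "no white sectors (no meganuclease cutting)"
    if score == 4:
        return "completely white colonies (complete cutting)"
    return "intermediate white sectoring (intermediate cutting)"


if __name__ == "__main__":
    # Made-up counts for illustration: 12 modified events out of 48 total.
    print(target_site_mutation_rate(12, 48))
    print(describe_sectoring_score(2))
```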

An “increased” activity or “increased” meganuclease activity comprises any statistically significant increase in activity relative to the parental meganuclease polypeptide, as determined through any of the activity assays described herein.

The meganuclease can be provided via a polynucleotide encoding the endonuclease. Such a polynucleotide encoding an endonuclease can be modified to substitute codons having a higher frequency of usage in a plant, as compared to the naturally occurring polynucleotide sequence. For example, the polynucleotide encoding the meganuclease can be modified to substitute codons having a higher frequency of usage in a maize or soybean plant, as compared to the naturally occurring polynucleotide sequence.

Various methods and compositions are provided which employ polynucleotides and polypeptides having meganuclease activity.

In one embodiment, the invention concerns an isolated or recombinant polynucleotide comprising a nucleotide sequence encoding a meganuclease polypeptide, said polypeptide comprising: a) an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; or, b) an amino acid sequence having at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44 of any of the amino acid modifications of (a).

In another embodiment, the invention concerns an isolated or recombinant polynucleotide of the current disclosure, and its corresponding polypeptide, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide further comprises at least one amino acid modification described herein such as those shown in FIG. 5A-FIG. 5E, FIG. 9A-FIG. 9N, FIG. 10A-FIG. 10D, FIG. 11, FIG. 12, FIG. 13, FIG. 14A-FIG. 14F and FIG. 15A-FIG. 15E, as well as any known I-CreI type modification and any combination thereof.

Further provided are methods and compositions which employ polynucleotides and polypeptides having increased meganuclease activity when compared to an appropriate control. Such meganuclease polypeptides include those set forth in any one of SEQ ID NOS: 14-38 (LIG3-4 variants), SEQ ID NOS: 87-167 (MHP77 variants), SEQ ID NOS: 251, 252, 253, 262 (MHP77.3 variants), SEQ ID NOS: 272-275 (MS26+ variants), SEQ ID NOS: 284-298 (MHP14 variants), SEQ ID NOS: 315-320 (MHP14+ variants), SEQ ID NOS: 330-341 (MP107 variants), SEQ ID NOS: 357-371 (ZM6.3 variants), SEQ ID NOS: 390-403 (ZM6.22V2 variants) or SEQ ID NOS: 430-432 and biologically active variants thereof. Further provided are the polynucleotides encoding these various polypeptides and active variants thereof.

A “variant” protein is intended to mean a protein derived from the protein (referred to as parental protein) by deletion (i.e., truncation at the 5′ and/or 3′ end) and/or a deletion or addition of one or more amino acids at one or more internal sites in the parental protein and/or substitution of one or more amino acids at one or more sites in the parental protein. As used herein, a “parental” polynucleotide or polypeptide (protein) can result from human manipulation or from a native protein comprising a naturally occurring nucleotide sequence or amino acid sequence, respectively. Variant proteins encompassed are biologically active, that is, they continue to possess the desired biological activity of the parental protein, that is, have meganuclease activity. Such variants may result from, for example, genetic polymorphism or from human manipulation.

The term “variant meganuclease” refers to a variant protein with meganuclease activity. The variant meganuclease is derived from a parental meganuclease wherein the variant meganuclease comprises at least one amino acid modification when compared to the parental meganuclease polypeptide.

Variant meganuclease polypeptides of the invention include those set forth in any one of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 262, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 315, 316, 317, 318, 319, 320, 330, 331, 332, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 430, 431, 432 or 433 and biologically active variants and fragments thereof. Further provided are the polynucleotides encoding these various polypeptides and active variants and fragments thereof.

Any one of the amino acid modifications identified in Examples 3-23 can be transferred to a parental meganuclease to create a variant meganuclease. These meganucleases can be screened for increased activity by methods described herein.

One embodiment of the invention concerns the transfer of at least one amino acid modification selected from the group of Y12 to H, G19 to S or A, Q50 to K or R, F54 to I, D56 to L, V105 to A, E124 to R, V129 to A, I132 to V or T, D153 to M or L, V316 to A or I319 to V to a parental meganuclease in order to improve the activity of the parental meganuclease. Another embodiment concerns the transfer of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 amino acid modifications selected from the group of Y12 to H, G19 to S or A, Q50 to K or R, F54 to I, D56 to L, V105 to A, E124 to R, V129 to A, I132 to V or T, D153 to M or L, V316 to A or I319 to V to a parental meganuclease in order to improve the activity of the parental meganuclease.
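The transfer of substitutions written in the form “Y12 to H” onto a parental sequence can be sketched as a small string operation. The following Python sketch is illustrative only: the function name is hypothetical, positions are assumed to be 1-based on the parental sequence itself (the disclosure numbers positions by correspondence to a reference sequence, which would require an alignment step not shown here), and the 12-residue string in the usage example is a toy input, not a sequence from the disclosure.

```python
# Illustrative sketch only: names are hypothetical and positions are assumed to be
# 1-based on the parental string itself, with no alignment to a reference sequence.
import re

# Matches notation such as "Y12 to H": original residue, position, new residue.
SUBSTITUTION = re.compile(r"^([A-Z])(\d+) to ([A-Z])$")


def apply_substitutions(parental: str, substitutions: list[str]) -> str:
    """Return a variant sequence with each listed substitution applied."""
    residues = list(parental)
    for text in substitutions:
        match = SUBSTITUTION.match(text)
        if match is None:
            raise ValueError(f"cannot parse substitution {text!r}")
        original, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the parental sequence")
        # Guard against applying a substitution to the wrong residue.
        if residues[position - 1] != original:
            raise ValueError(
                f"expected {original} at position {position}, found {residues[position - 1]}"
            )
        residues[position - 1] = replacement
    return "".join(residues)


if __name__ == "__main__":
    toy_parental = "MNTKYNKEFLLY"  # toy 12-residue input with Y at position 12
    print(apply_substitutions(toy_parental, ["Y12 to H"]))
```

Checking the original residue before substituting is a design choice that catches numbering mismatches between the notation and the sequence actually supplied.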

Any one of the modifications described herein can be combined with other known modifications of I-CreI type meganucleases.

As used herein with respect to a recombinant polynucleotide encoding a recombinant protein, the term “modification” means any insertion, deletion or substitution of an amino acid residue in the recombinant protein sequence relative to a reference or control sequence.

“Variants” is intended to mean substantially similar sequences. For polynucleotides, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the meganuclease polypeptides of the invention. Naturally occurring variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction (PCR) and hybridization techniques as outlined below. Variant polynucleotides also include synthetically derived polynucleotides, such as those generated, for example, by using site-directed mutagenesis or gene synthesis but which still encode a meganuclease polypeptide.

Biologically active variants of meganucleases (i.e., variant meganucleases) are also provided. Biologically active variants of a meganuclease polypeptide (and the polynucleotide encoding the same) will have at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 95.5%, 95.7%, 95.9%, 96%, 96.3%, 96.5%, 96.9%, 97%, 97.3%, 97.5%, 97.9%, 98%, 98.3%, 98.5%, 98.9%, 99%, 99.3%, 99.5%, 99.6% or more sequence identity to the polypeptide of a control meganuclease, wherein the active variants retain the ability to cut at a desired recognition site. For example, any of the variant meganucleases described herein can be modified from a parental endonuclease sequence and designed to recognize and induce a double strand break at the same recognition site of the parental meganuclease. Thus, in some embodiments, the variant meganuclease contains at least one amino acid modification when compared to the parental meganuclease and has a specificity to induce a double-strand break at the same recognition sequence as the corresponding parental meganuclease recognition sequence.

A “control meganuclease” or “reference meganuclease” can be used interchangeably and refers to any meganuclease to which a variant meganuclease is compared. Control meganucleases can include, but are not limited to, parental or corresponding meganucleases or any wild-type I-CreI type meganucleases.

Numbering of an amino acid or nucleotide polymer, such as any one of the meganucleases of the invention, corresponds to numbering of a selected amino acid polymer or nucleic acid when the position of a given monomer component (amino acid residue, incorporated nucleotide, etc.) of the polymer corresponds to the same residue position in a selected reference polypeptide or polynucleotide.

Further provided are biologically active variants of a meganuclease polypeptide (and the polynucleotide encoding the same) that will have at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 95.5%, 95.7%, 95.9%, 96%, 96.3%, 96.5%, 96.9%, 97%, 97.3%, 97.5%, 97.9%, 98%, 98.3%, 98.5%, 98.9%, 99%, 99.3%, 99.5%, 99.6% or more sequence identity to the polypeptide of any one of SEQ ID NO: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 262, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 315, 316, 317, 318, 319, 320, 330, 331, 332, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 430, 431, 432 or 433 or with regard to any of the meganuclease polypeptides disclosed herein as determined by sequence alignment.

In one embodiment the variant meganuclease of the present invention comprises a linker polypeptide, wherein said linker polypeptide comprises: a) SEQ ID NO: 420; b) SEQ ID NO: 421; c) SEQ ID NO: 422; or, d) an amino acid sequence consisting of any possible amino acid at positions corresponding to positions 156 to 193 of SEQ ID NO: 1. It is also understood that these linker sequences can be substituted for any other linker sequence that links both I-Cre type monomers while still enabling the single polypeptide meganuclease to provide a double strand break at a target sequence.

As used herein, a “genomic region of interest” is a segment of a chromosome in the genome of a plant that is desirable for introducing a polynucleotide of interest or trait of interest. The genomic region of interest can include, for example, one or more polynucleotides of interest. Generally, a genomic region of interest of the present invention comprises a segment of chromosome that is 0-15 cM.

As used herein, a “polynucleotide of interest” within a genomic region of interest is any coding and/or non-coding portion of the genomic region of interest including, but not limited to, a transgene, a native gene, a mutated gene, and a genetic marker such as, for example, a single nucleotide polymorphism (SNP) marker and a simple sequence repeat (SSR) marker.

As used herein, “physically linked”, “in physical linkage”, and “genetically linked” are used to refer to any two or more genes, transgenes, native genes, mutated genes, alterations, target sites, markers, and the like that are part of the same DNA molecule or chromosome.

Sequence Comparisons

The following terms are used to describe the sequence relationships between two or more polynucleotides or polypeptides: (a) “reference sequence”, (b) “comparison window”, (c) “sequence identity”, and, (d) “percent sequence identity.”

(a) As used herein, “reference sequence” is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence or protein sequence.

(b) As used herein, “comparison window” makes reference to a contiguous and specified segment of a polypeptide sequence, wherein the polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two polypeptides. Generally, the comparison window is at least 5, 10, 15, or 20 contiguous amino acids in length, or it can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polypeptide sequence, a gap penalty is typically introduced and is subtracted from the number of matches.

Methods of alignment of sequences for comparison are well known in the art. Thus, the determination of percent sequence identity between any two sequences can be accomplished using a mathematical algorithm. Non-limiting examples of such mathematical algorithms are the algorithm of Myers and Miller (1988) CABIOS 4:11-17; the local alignment algorithm of Smith, et al., (1981) Adv. Appl. Math. 2:482; the global alignment algorithm of Needleman and Wunsch, (1970) J. Mol. Biol. 48:443-453; the search-for-local alignment method of Pearson and Lipman, (1988) Proc. Natl. Acad. Sci. 85:2444-2448; the algorithm of Karlin and Altschul, (1990) Proc. Natl. Acad. Sci. USA 87:2264, modified as in Karlin and Altschul, (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Computer implementations of these mathematical algorithms can be utilized for comparison of sequences to determine sequence identity. Such implementations include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, Calif.); the ALIGN program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the GCG Wisconsin Genetics Software Package, Version 10 (available from Accelrys Inc., 9685 Scranton Road, San Diego, Calif., USA). Alignments using these programs can be performed using the default parameters. The CLUSTAL program is well described by Higgins, et al., (1988) Gene 73:237-244; Higgins, et al., (1989) CABIOS 5:151-153; Corpet, et al., (1988) Nucleic Acids Res. 16:10881-10890; Huang, et al., (1992) CABIOS 8:155-165; and Pearson, et al., (1994) Meth. Mol. Biol. 24:307-331. The ALIGN program is based on the algorithm of Myers and Miller, (1988) supra. A PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used with the ALIGN program when comparing amino acid sequences. The BLAST programs of Altschul, et al., (1990) J. Mol. Biol. 215:403 are based on the algorithm of Karlin and Altschul, (1990) supra.
BLAST nucleotide searches can be performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences homologous to a nucleotide sequence encoding a protein of the invention. BLAST protein searches can be performed with the BLASTX program, score=50, wordlength=3, to obtain amino acid sequences homologous to a protein or polypeptide of the invention. BLASTP protein searches can be performed using default parameters. See, blast.ncbi.nlm.nih.gov/Blast.cgi.

Sequence alignments and percent similarity calculations may be determined using the Megalign program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.) or using the AlignX program of the Vector NTI bioinformatics computing suite (Invitrogen, Carlsbad, Calif.). Multiple alignment of the sequences is performed using the Clustal method of alignment (Higgins and Sharp, (1989) CABIOS 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments and calculation of percent identity of protein sequences using the Clustal method are KTUPLE=1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For nucleic acids these parameters are GAP PENALTY=10, GAP LENGTH PENALTY=10, KTUPLE=2, GAP PENALTY=5, WINDOW=4 and DIAGONALS SAVED=4. A “substantial portion” of an amino acid or nucleotide sequence comprises enough of the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to afford putative identification of that polypeptide or gene, either by manual evaluation of the sequence by one skilled in the art, or by computer-automated sequence comparison and identification using algorithms such as BLAST (Altschul, et al., (1990) J. Mol. Biol. 215:403-410) and Gapped Blast (Altschul, et al., (1997) Nucleic Acids Res. 25:3389-3402). BLASTN refers to a BLAST program that compares a nucleotide query sequence against a nucleotide sequence database.

“Gene” refers to a nucleic acid fragment that expresses a specific protein, including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. “Native gene” refers to a gene as found in nature with its own regulatory sequences. “Chimeric gene” or “recombinant expression construct”, which are used interchangeably, refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. “Endogenous gene” refers to a native gene in its natural location in the genome of an organism. A “foreign” gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes. A “transgene” is a gene that has been introduced into the genome by a transformation procedure.

“Coding sequence” refers to a DNA sequence which codes for a specific amino acid sequence. “Regulatory sequences” refer to nucleotide sequences located upstream (5′ non-coding sequences), within, or downstream (3′ non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.

“Codon degeneracy” refers to divergence in the genetic code permitting variation of the nucleotide sequence without affecting the amino acid sequence of an encoded polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid sequences set forth herein. The skilled artisan is well aware of the “codon-bias” exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. Therefore, when synthesizing a nucleic acid fragment for increased expression in a host cell, it is desirable to design the nucleic acid fragment such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell.
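As a hedged illustration of codon degeneracy and codon bias, the following Python sketch tallies codon usage in a set of host coding sequences and then back-translates a protein using the most frequently observed codon for each amino acid. The function names `preferred_codons` and `back_translate` and the example sequences are hypothetical. The standard genetic code is assumed, and a real design would draw on a much larger set of host genes.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def preferred_codons(host_coding_sequences):
    """Return {amino acid: most frequently used codon} for the given host CDSs."""
    counts = Counter()
    for cds in host_coding_sequences:
        cds = cds.upper()
        # Walk the sequence codon by codon, ignoring any trailing partial codon.
        for i in range(0, len(cds) - len(cds) % 3, 3):
            counts[cds[i:i + 3]] += 1
    best = {}
    for codon, aa in CODON_TABLE.items():
        if aa not in best or counts[codon] > counts[best[aa]]:
            best[aa] = codon
    return best


def back_translate(protein, best):
    """Encode a protein with the host-preferred codon for each residue."""
    return "".join(best[aa] for aa in protein.upper())


def translate(dna):
    """Translate a DNA coding sequence with the standard genetic code."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))
```

Because several codons specify the same amino acid, `translate(back_translate(p, best))` returns `p` regardless of which synonymous codons were chosen; only the nucleotide sequence changes.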

As used herein, “sequence identity” or “identity” in the context of two polynucleotides or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity). When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have “sequence similarity” or “similarity”. Means for making this adjustment are well known to those of skill in the art. Typically, this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percent sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif.).
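The partial-credit scoring of conservative substitutions described above can be sketched as follows. The residue groupings in `CONSERVATIVE_GROUPS` and the score of 0.5 are assumptions chosen purely for illustration; they are not the scoring scheme of PC/GENE or of any other program cited herein.

```python
# Illustrative (assumed) groups of chemically similar amino acids.
CONSERVATIVE_GROUPS = [
    set("ILVM"),  # aliphatic / hydrophobic
    set("FYW"),   # aromatic
    set("KRH"),   # positively charged
    set("DE"),    # negatively charged
    set("ST"),    # small hydroxyl
    set("NQ"),    # amide
]


def percent_similarity(aligned_a: str, aligned_b: str,
                       conservative_score: float = 0.5) -> float:
    """Score identical residues as 1, conservative substitutions as a value
    between 0 and 1, and everything else (including gaps, "-") as 0."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    total = 0.0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue                      # gap position scores zero
        if x == y:
            total += 1.0                  # identical residue
        elif any(x in g and y in g for g in CONSERVATIVE_GROUPS):
            total += conservative_score   # partial credit
    return 100.0 * total / len(aligned_a)
```

For the aligned pair `MKLV` and `MRLA`, two positions are identical, K/R falls in one assumed group, and V/A does not, so the adjusted value is 62.5 where strict identity would give 50.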

As used herein, “percent sequence identity” means the value determined by comparing two aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percent sequence identity.
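The calculation just described (matched positions divided by total positions in the comparison window, multiplied by 100) can be written as a short Python sketch. It assumes the two sequences are already aligned to equal length with "-" marking gaps; the function name `percent_identity` is illustrative.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent sequence identity over an aligned comparison window."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    # A position matches when both sequences carry the same base or residue;
    # a gap aligned against anything is not a match.
    matched = sum(
        1 for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matched / len(aligned_a)
```

For example, the aligned pair `ACGT-A` and `ACCTGA` has four matched positions in a window of six, giving roughly 66.7 percent identity.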

Polynucleotide Constructs

Provided herein are polynucleotides or nucleic acid molecules comprising the meganucleases or any active variants or fragments thereof. The terms “polynucleotide,” “polynucleotide sequence,” “nucleic acid sequence,” and “nucleic acid fragment” are used interchangeably herein. These terms encompass nucleotide sequences and the like. The use of the term “polynucleotide” is not intended to limit the present invention to polynucleotides comprising DNA. Those of ordinary skill in the art will recognize that polynucleotides can comprise ribonucleotides and combinations of ribonucleotides and deoxyribonucleotides. Such deoxyribonucleotides and ribonucleotides include both naturally occurring molecules and synthetic analogues. The polynucleotides of the invention also encompass all forms of sequences including, but not limited to, single-stranded forms, double-stranded forms, hairpins, stem-and-loop structures, and the like.

Further provided are recombinant polynucleotides comprising the various meganucleases. The terms “recombinant polynucleotide”, “recombinant nucleotide”, “recombinant DNA” and “recombinant DNA construct” are used interchangeably herein. A recombinant construct comprises an artificial or heterologous combination of nucleic acid sequences, e.g., regulatory and coding sequences that are not found together in nature. For example, a transfer cassette can comprise restriction sites and a heterologous polynucleotide of interest. In other embodiments, a recombinant construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. Such a construct may be used by itself or may be used in conjunction with a vector. If a vector is used, then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art. For example, a plasmid vector can be used. The skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments provided herein. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones, et al., (1985) EMBO J. 4:2411-2418; De Almeida, et al., (1989) Mol. Gen. Genetics 218:78-86), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others.

The meganuclease polynucleotides disclosed herein can be provided in expression cassettes for expression in the plant of interest. The cassette can include 5′ and 3′ regulatory sequences operably linked to a meganuclease polynucleotide or active variant or fragment thereof. “Operably linked” is intended to mean a functional linkage between two or more elements. For example, an operable linkage between a polynucleotide of interest and a regulatory sequence (i.e., a promoter) is a functional link that allows for expression of the polynucleotide of interest. Operably linked elements may be contiguous or non-contiguous. When used to refer to the joining of two protein coding regions, by operably linked is intended that the coding regions are in the same reading frame. The cassette may additionally contain at least one additional gene to be cotransformed into the organism. Alternatively, the additional gene(s) can be provided on multiple expression cassettes. Such an expression cassette is provided with a plurality of restriction sites and/or recombination sites for insertion of the meganuclease polynucleotide or active variant or fragment thereof to be under the transcriptional regulation of the regulatory regions. The expression cassette may additionally contain selectable marker genes.

The expression cassette can include in the 5′-3′ direction of transcription, a transcriptional and translational initiation region (i.e., a promoter), a meganuclease polynucleotide or active variant or fragment thereof, and a transcriptional and translational termination region (i.e., termination region) functional in plants. The regulatory regions (i.e., promoters, transcriptional regulatory regions, and translational termination regions) and/or the meganuclease polynucleotide or active variant or fragment thereof may be native/analogous to the host cell or to each other. Alternatively, the regulatory regions and/or the meganuclease polynucleotide or active variant or fragment thereof may be heterologous to the host cell or to each other.

As used herein, “heterologous” in reference to a sequence is a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous polynucleotide is from a species different from the species from which the polynucleotide was derived, or, if from the same/analogous species, one or both are substantially modified from their original form and/or genomic locus, or the promoter is not the native promoter for the operably linked polynucleotide.

While it may be optimal to express the sequences using heterologous promoters, the native promoter sequences may be used. Such constructs can change expression levels of the meganuclease polynucleotide in the plant or plant cell. Thus, the phenotype of the plant or plant cell can be altered.

The termination region may be native with the transcriptional initiation region, may be native with the operably linked meganuclease polynucleotide or active variant or fragment thereof, may be native with the plant host, or may be derived from another source (i.e., foreign or heterologous) to the promoter, the meganuclease polynucleotide or active fragment or variant thereof, the plant host, or any combination thereof. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau, et al., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon, et al., (1991) Genes Dev. 5:141-149; Mogen, et al., (1990) Plant Cell 2:1261-1272; Munroe, et al., (1990) Gene 91:151-158; Ballas, et al., (1989) Nucleic Acids Res. 17:7891-7903; and Joshi, et al., (1987) Nucleic Acids Res. 15:9627-9639.

Where appropriate, the polynucleotides may be optimized for increased expression in the transformed plant. That is, the polynucleotides can be synthesized using plant-preferred codons for improved expression. See, for example, Campbell and Gowri, (1990) Plant Physiol. 92:1-11 for a discussion of host-preferred codon usage. Methods are available in the art for synthesizing plant-preferred genes. See, for example, U.S. Pat. Nos. 5,380,831, and 5,436,391, and Murray, et al., (1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference.

Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other such well-characterized sequences that may be deleterious to gene expression. The G-C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
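Two of the checks mentioned above, G-C content and the presence of sequence motifs that may act as spurious signals, can be sketched in Python as follows. The motif `AATAAA` is used only as an illustrative example of a polyadenylation-like signal; the motifs actually screened, and the target G-C level, depend on the host and are not specified here. The function names are hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_motif(sequence: str, motif: str = "AATAAA") -> list:
    """Return every 0-based start position of `motif`, including overlaps."""
    seq, motif = sequence.upper(), motif.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)  # step by one to catch overlaps
    return positions
```

A candidate coding sequence could be passed through both functions, and synonymous codon changes made wherever a flagged motif occurs or the G-C fraction departs from the average calculated for known genes of the host.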

The expression cassettes may additionally contain 5′ leader sequences. Such leader sequences can act to enhance translation. Translation leaders are known in the art and include: picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5′ noncoding region) (Elroy-Stein, et al., (1989) Proc. Natl. Acad. Sci. USA 86:6126-6130); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Gallie, et al., (1995) Gene 165(2):233-238), MDMV leader (Maize Dwarf Mosaic Virus) (Virology 154:9-20), and human immunoglobulin heavy-chain binding protein (BiP) (Macejak, et al., (1991) Nature 353:90-94); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling, et al., (1987) Nature 325:622-625); tobacco mosaic virus leader (TMV) (Gallie, et al., (1989) in Molecular Biology of RNA, ed. Cech (Liss, New York), pp. 237-256); and maize chlorotic mottle virus leader (MCMV) (Lommel, et al., (1991) Virology 81:382-385). See also, Della-Cioppa, et al., (1987) Plant Physiol. 84:965-968.

In preparing the expression cassette, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, resubstitutions, e.g., transitions and transversions, may be involved.

A number of promoters can be used to express the various meganuclease sequences disclosed herein, including the native promoter of the polynucleotide sequence of interest. The promoters can be selected based on the desired outcome. Such promoters include, for example, constitutive, tissue-preferred, or other promoters for expression in plants.

Constitutive promoters include, for example, the core promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO 1999/43838 and U.S. Pat. No. 6,072,050; the core CaMV 35S promoter (Odell, et al., (1985) Nature 313:810-812); rice actin (McElroy, et al., (1990) Plant Cell 2:163-171); ubiquitin (Christensen, et al., (1989) Plant Mol. Biol. 12:619-632 and Christensen, et al., (1992) Plant Mol. Biol. 18:675-689); pEMU (Last, et al., (1991) Theor. Appl. Genet. 81:581-588); MAS (Velten, et al., (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. Pat. No. 5,659,026), and the like. Other constitutive promoters include, for example, U.S. Pat. Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142; and 6,177,611.

Tissue-preferred promoters can be utilized to target enhanced meganuclease expression within a particular plant tissue. Tissue-preferred promoters include those described in Yamamoto, et al., (1997) Plant J. 12(2):255-265; Kawamata, et al., (1997) Plant Cell Physiol. 38(7):792-803; Hansen, et al., (1997) Mol. Gen Genet. 254(3):337-343; Russell, et al., (1997) Transgenic Res. 6(2):157-168; Rinehart, et al., (1996) Plant Physiol. 112(3):1331-1341; Van Camp, et al., (1996) Plant Physiol. 112(2):525-535; Canevascini, et al., (1996) Plant Physiol. 112(2):513-524; Yamamoto, et al., (1994) Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl. Cell Differ. 20:181-196; Orozco, et al., (1993) Plant Mol Biol. 23(6):1129-1138; Matsuoka, et al., (1993) Proc Natl. Acad. Sci. USA 90(20):9586-9590; and Guevara-Garcia, et al., (1993) Plant J. 4(3):495-505. Such promoters can be modified, if necessary, for weak expression.

Leaf-preferred promoters are known in the art. See, for example, Yamamoto, et al., (1997) Plant J. 12(2):255-265; Kwon, et al., (1994) Plant Physiol. 105:357-367; Yamamoto, et al., (1994) Plant Cell Physiol. 35(5):773-778; Gotor, et al., (1993) Plant J. 3:509-518; Orozco, et al., (1993) Plant Mol. Biol. 23(6):1129-1138; and Matsuoka, et al., (1993) Proc. Natl. Acad. Sci. USA 90(20):9586-9590.

Synthetic promoters can be used to express meganuclease sequences or biologically active variants and fragments thereof.

The expression cassette can also comprise a selectable marker gene for the selection of transformed cells. Selectable marker genes are utilized for the selection of transformed cells or tissues. Marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring resistance to herbicidal compounds, such as glyphosate, glufosinate ammonium, bromoxynil, sulfonylureas, dicamba, and 2,4-dichlorophenoxyacetate (2,4-D). Additional selectable markers include phenotypic markers such as β-galactosidase and fluorescent proteins such as green fluorescent protein (GFP) (Su, et al., (2004) Biotechnol Bioeng 85:610-619 and Fetter, et al., (2004) Plant Cell 16:215-228), cyan fluorescent protein (CYP) (Bolte, et al., (2004) J. Cell Science 117:943-954 and Kato, et al., (2002) Plant Physiol 129:913-942), and yellow fluorescent protein (PhiYFP™ from Evrogen, see, Bolte, et al., (2004) J. Cell Science 117:943-954). For additional selectable markers, see generally, Yarranton (1992) Curr. Opin. Biotech. 3:506-511; Christopherson, et al., (1992) Proc. Natl. Acad. Sci. USA 89:6314-6318; Yao, et al., (1992) Cell 71:63-72; Reznikoff (1992) Mol. Microbiol. 6:2419-2422; Barkley, et al., (1980) in The Operon, pp. 177-220; Hu, et al., (1987) Cell 48:555-566; Brown, et al., (1987) Cell 49:603-612; Figge, et al., (1988) Cell 52:713-722; Deuschle, et al., (1989) Proc. Natl. Acad. Sci. USA 86:5400-5404; Fuerst, et al., (1989) Proc. Natl. Acad. Sci. USA 86:2549-2553; Deuschle, et al., (1990) Science 248:480-483; Gossen (1993) Ph.D. Thesis, University of Heidelberg; Reines, et al., (1993) Proc. Natl. Acad. Sci. USA 90:1917-1921; Labow, et al., (1990) Mol. Cell. Biol. 10:3343-3356; Zambretti, et al., (1992) Proc. Natl. Acad. Sci. USA 89:3952-3956; Baim, et al., (1991) Proc. Natl. Acad. Sci. USA 88:5072-5076; Wyborski, et al., (1991) Nucleic Acids Res. 19:4647-4653; Hillenand-Wissman (1989) Topics Mol. Struc. Biol. 10:143-162; Degenkolb, et al., (1991) Antimicrob. Agents Chemother. 35:1591-1595; Kleinschnidt, et al., (1988) Biochemistry 27:1094-1104; Bonin (1993) Ph.D. Thesis, University of Heidelberg; Gossen, et al., (1992) Proc. Natl. Acad. Sci. USA 89:5547-5551; Oliva, et al., (1992) Antimicrob. Agents Chemother. 36:913-919; Hlavka, et al., (1985) Handbook of Experimental Pharmacology, Vol. 78 (Springer-Verlag, Berlin); Gill, et al., (1988) Nature 334:721-724. Such disclosures are herein incorporated by reference. The above list of selectable marker genes is not meant to be limiting. Any selectable marker gene can be used in the present invention.

Method of Introducing

The meganuclease may be introduced by any means known in the art. For example, a cell, yeast or plant having the recognition site in its genome is provided. The meganuclease may be transiently expressed or the polypeptide itself can be directly provided to the cell. Alternatively, a nucleotide sequence capable of expressing the meganuclease may be stably integrated into the genome of the plant. In the presence of the corresponding recognition site and the meganuclease, a donor DNA can be inserted into the transformed plant's genome. Alternatively, the different components may be brought together by sexually crossing transformed plants. Thus, a sequence encoding a meganuclease and/or target site can be sexually crossed to one another to allow each component of the system to be present in a single plant. The meganuclease may be under the control of a constitutive or inducible promoter. Such promoters of interest are discussed in further detail elsewhere herein.

Various methods can be used to introduce a sequence of interest, such as any of the meganucleases of the invention, into a plant or plant part. “Introducing” is intended to mean presenting to the plant, plant cell or plant part the polynucleotide or polypeptide in such a manner that the sequence gains access to the interior of a cell of the plant. The methods of the invention do not depend on a particular method for introducing a sequence into a plant or plant part, only that the polynucleotide or polypeptide gains access to the interior of at least one cell of the plant. Methods for introducing polynucleotides or polypeptides into plants are known in the art including, but not limited to, stable transformation methods, transient transformation methods, and virus-mediated methods.

“Stable transformation” is intended to mean that the nucleotide construct introduced into a plant integrates into the genome of the plant and is capable of being inherited by the progeny thereof. “Transient transformation” is intended to mean that a polynucleotide is introduced into the plant and does not integrate into the genome of the plant or a polypeptide is introduced into a plant.

Transformation protocols as well as protocols for introducing polypeptides or polynucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing polypeptides and polynucleotides into plant cells include microinjection (Crossway, et al., (1986) Biotechniques 4:320-334), electroporation (Riggs, et al., (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606), Agrobacterium-mediated transformation (U.S. Pat. No. 5,563,055 and U.S. Pat. No. 5,981,840), direct gene transfer (Paszkowski, et al., (1984) EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, U.S. Pat. No. 4,945,050; U.S. Pat. No. 5,879,918; U.S. Pat. Nos. 5,886,244; and, 5,932,782; Tomes, et al., (1995) in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); McCabe, et al., (1988) Biotechnology 6:923-926); and Lec1 transformation (WO 2000/28058). Also, see, Weissinger, et al., (1988) Ann. Rev. Genet. 22:421-477; Sanford, et al., (1987) Particulate Science and Technology 5:27-37 (onion); Christou, et al., (1988) Plant Physiol. 87:671-674 (soybean); McCabe, et al., (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen, (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh, et al., (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta, et al., (1990) Biotechnology 8:736-740 (rice); Klein, et al., (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein, et al., (1988) Biotechnology 6:559-563 (maize); U.S. Pat. Nos. 5,240,855; 5,322,783; and, 5,324,646; Klein, et al., (1988) Plant Physiol. 91:440-444 (maize); Fromm, et al., (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren, et al., (1984) Nature (London) 311:763-764; U.S. Pat. No. 5,736,369 (cereals); Bytebier, et al., (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet, et al., (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman, et al., (Longman, New York), pp. 197-209 (pollen); Kaeppler, et al., (1990) Plant Cell Reports 9:415-418 and Kaeppler, et al., (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin, et al., (1992) Plant Cell 4:1495-1505 (electroporation); Li, et al., (1993) Plant Cell Reports 12:250-255 and Christou and Ford, (1995) Annals of Botany 75:407-413 (rice); Osjoda, et al., (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.

In specific embodiments, the meganuclease sequences or active variant or fragments thereof can be provided to a yeast cell or plant using a variety of transient transformation methods. Such transient transformation methods include, but are not limited to, the introduction of the meganuclease protein or active variants and fragments thereof directly into a yeast cell or plant. Such methods include, for example, microinjection or particle bombardment. See, for example, Crossway, et al., (1986) Mol Gen. Genet. 202:179-185; Nomura, et al., (1986) Plant Sci. 44:53-58; Hepler, et al., (1994) Proc. Natl. Acad. Sci. 91:2176-2180 and Hush, et al., (1994) The Journal of Cell Science 107:775-784, all of which are herein incorporated by reference.

In other embodiments, the polynucleotide of the invention may be introduced into yeast cells or plants by contacting plants with a virus or viral nucleic acids. Generally, such methods involve incorporating a nucleotide construct of the invention within a DNA or RNA molecule. It is recognized that a meganuclease sequence may be initially synthesized as part of a viral polyprotein, which later may be processed by proteolysis in vivo or in vitro to produce the desired recombinant protein. Further, it is recognized that promoters of the invention also encompass promoters utilized for transcription by viral RNA polymerases. Methods for introducing polynucleotides into plants and expressing a protein encoded therein, involving viral DNA or RNA molecules, are known in the art. See, for example, U.S. Pat. Nos. 5,889,191, 5,889,190, 5,866,785, 5,589,367, 5,316,931, and Porta, et al., (1996) Molecular Biotechnology 5:209-221; herein incorporated by reference.

Methods are known in the art for the targeted insertion of a polynucleotide at a specific location in the plant genome. In one embodiment, the insertion of the polynucleotide at a desired genomic location is achieved using a site-specific recombination system. See, for example, WO 1999/25821, WO 1999/25854, WO 1999/25840, WO 1999/25855, and WO 1999/25853, all of which are herein incorporated by reference. Briefly, the polynucleotide of the invention can be contained in a transfer cassette flanked by two non-recombinogenic recombination sites. The transfer cassette is introduced into a plant having stably incorporated into its genome a target site which is flanked by two non-recombinogenic recombination sites that correspond to the sites of the transfer cassette. An appropriate recombinase is provided and the transfer cassette is integrated at the target site. The polynucleotide of interest is thereby integrated at a specific chromosomal position in the plant genome. Other methods to target polynucleotides are set forth in WO 2009/114321 (herein incorporated by reference), which describes “custom” meganucleases produced to modify plant genomes, in particular the genome of maize. See, also, Gao, et al., (2010) Plant Journal 61:176-187.

The cells that have been transformed may be grown into plants in accordance with conventional ways. See, for example, McCormick, et al., (1986) Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting progeny having constitutive expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure expression of the desired phenotypic characteristic has been achieved. In this manner, the present invention provides transformed seed (also referred to as “transgenic seed”) having a polynucleotide of the invention, for example, an expression cassette of the invention, stably incorporated into their genome.

Methods of Detection

Methods for detecting a meganuclease polypeptide or an active variant or fragment thereof are provided. Such methods comprise analyzing plant tissues to detect such polypeptides or the polynucleotides encoding the same. The detection methods can directly assay for the presence of the meganuclease polypeptide or polynucleotide or the detection methods can indirectly assay for the sequences by assaying the phenotype of the cell, yeast, plant, plant cell or plant explant expressing the sequence.

In still other embodiments, the meganuclease polypeptide or active variant or fragment thereof can be detected in a plant tissue by detecting the presence of a polynucleotide encoding any of the various meganuclease polypeptides or active variants and fragments thereof. In one embodiment, the detection method comprises assaying plant tissue using PCR amplification.

As used herein, “primers” are isolated polynucleotides that are annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, then extended along the target DNA strand by a polymerase, e.g., a DNA polymerase. Primer pairs of the invention refer to their use for amplification of a target polynucleotide, e.g., by the polymerase chain reaction (PCR) or other conventional nucleic-acid amplification methods. “PCR” or “polymerase chain reaction” is a technique used for the amplification of specific DNA segments (see, U.S. Pat. Nos. 4,683,195 and 4,800,159; herein incorporated by reference).

Probes and primers are of sufficient nucleotide length to bind to the target DNA sequence and specifically detect and/or identify a polynucleotide encoding a meganuclease polypeptide or active variant or fragment thereof as described elsewhere herein. It is recognized that the hybridization conditions or reaction conditions can be determined by the operator to achieve this result. This length may be any length that is sufficient to be useful in a detection method of choice. Such probes and primers can hybridize specifically to a target sequence under high stringency hybridization conditions. Probes and primers according to embodiments of the present invention may have complete DNA sequence identity of contiguous nucleotides with the target sequence, although probes differing from the target DNA sequence and that retain the ability to specifically detect and/or identify a target DNA sequence may be designed by conventional methods. Accordingly, probes and primers can share about 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or greater sequence identity or complementarity to the target polynucleotide.

Methods for preparing and using probes and primers are described, for example, in Molecular Cloning: A Laboratory Manual, 2nd ed, vol. 1-3, ed. Sambrook, et al., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 1989 (hereinafter, “Sambrook, et al., 1989”); Current Protocols in Molecular Biology, ed. Ausubel, et al., Greene Publishing and Wiley-Interscience, New York, 1992 (with periodic updates) (hereinafter, “Ausubel, et al., 1992”); and Innis, et al., PCR Protocols: A Guide to Methods and Applications, Academic Press: San Diego, 1990. PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as the PCR primer analysis tool in Vector NTI version 10 (Invitrogen); PrimerSelect (DNASTAR Inc., Madison, Wis.); and Primer (Version 0.5, © 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.). Additionally, the sequence can be visually scanned and primers manually identified using guidelines known to one of skill in the art.
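The manual screening guidelines are not spelled out in this passage. As a purely illustrative sketch, two checks commonly applied to short candidate primers are GC content and the Wallace rule-of-thumb melting temperature (2 °C per A or T plus 4 °C per G or C). The function names and the choice of these two checks are assumptions made for illustration and are not taken from the programs named above.

```python
def gc_content(primer: str) -> float:
    """Percentage of G and C bases in a primer."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer must be non-empty")
    gc = sum(base in "GC" for base in primer)
    return 100.0 * gc / len(primer)


def wallace_tm(primer: str) -> int:
    """Rule-of-thumb melting temperature in degrees C for short oligonucleotides."""
    primer = primer.upper()
    at = sum(base in "AT" for base in primer)
    gc = sum(base in "GC" for base in primer)
    return 2 * at + 4 * gc
```

The Wallace rule is only a rough estimate for short oligonucleotides; dedicated primer design software uses more detailed thermodynamic models.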

Method of Identifying Meganuclease Variants.

Various methods can be employed to identify further meganuclease variants. The polynucleotides of the invention are optionally used as substrates for a variety of diversity generating procedures, e.g., mutation, recombination and recursive recombination reactions, in addition to their use in standard cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, i.e., to produce additional meganuclease polynucleotides and polypeptides with desired properties. A variety of diversity generating protocols can be used. The procedures can be used separately, and/or in combination to produce one or more variants of a polynucleotide or set of polynucleotides, as well as variants of encoded proteins. Individually and collectively, these procedures provide robust, widely applicable ways of generating diversified polynucleotides and sets of polynucleotides (including, e.g., polynucleotide libraries) useful, e.g., for the engineering or rapid evolution of polynucleotides, proteins, pathways, cells and/or organisms with new and/or improved characteristics. The process of altering the sequence can result in, for example, single nucleotide substitutions, multiple nucleotide substitutions, and insertion or deletion of regions of the nucleic acid sequence.

While distinctions and classifications are made in the course of the ensuing discussion for clarity, it will be appreciated that the techniques are often not mutually exclusive. Indeed, the various methods can be used singly or in combination, in parallel or in series, to access diverse sequence variants.

The terms “diversification” and “diversity,” as applied to a polynucleotide, refer to generation of a plurality of modified forms of a parental polynucleotide, or plurality of parental polynucleotides. In the case where the polynucleotide encodes a polypeptide, diversity in the nucleotide sequence of the polynucleotide can result in diversity in the corresponding encoded polypeptide, e.g., a diverse pool of polynucleotides encoding a plurality of polypeptide variants. In some embodiments of the invention, this sequence diversity is exploited by screening/selecting a library of diversified polynucleotides for variants with desirable functional attributes, e.g., a polynucleotide encoding a meganuclease with enhanced functional characteristics.

The result of any of the diversity generating procedures described herein can be the generation of one or more polynucleotides, which can be selected or screened for polynucleotides that encode proteins with or which confer desirable properties. Following diversification by one or more of the methods herein, or otherwise available to one of skill, any polynucleotides that are produced can be selected for a desired activity or property, e.g. altered Km, use of alternative cofactors, increased kcat, etc. This can include identifying any activity that can be detected, for example, in an automated or automatable format, by any of the assays in the art. For example, modified meganuclease polypeptides can be detected by assaying for a meganuclease activity. Assays to measure such activity are described elsewhere herein. A variety of related (or even unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the practitioner.

Descriptions of a variety of diversity generating procedures, including family shuffling and methods for generating modified nucleic acid sequences encoding multiple enzymatic domains, are found in the following publications and the references cited therein: Soong, et al., (2000) Nat Genet 25(4):436-439; Stemmer, et al., (1999) Tumor Targeting 4:1-4; Ness, et al., (1999) Nature Biotechnology 17:893-896; Chang, et al., (1999) Nature Biotechnology 17:793-797; Minshull and Stemmer, (1999) Current Opinion in Chemical Biology 3:284-290; Christians, et al., (1999) Nature Biotechnology 17:259-264; Crameri, et al., (1998) Nature 391:288-291; Crameri, et al., (1997) Nature Biotechnology 15:436-438; Zhang, et al., (1997) Proc. Natl. Acad. Sci. USA 94:4504-4509; Patten, et al., (1997) Current Opinion in Biotechnology 8:724-733; Crameri, et al., (1996) Nature Medicine 2:100-103; Crameri, et al., (1996) Nature Biotechnology 14:315-319; Gates, et al., (1996) Journal of Molecular Biology 255:373-386; Stemmer (1996) “Sexual PCR and Assembly PCR” In: The Encyclopedia of Molecular Biology. VCH Publishers, New York. pp. 447-457; Crameri and Stemmer, (1995) BioTechniques 18:194-195; Stemmer, et al., (1995) Gene: 164:49-53; Stemmer (1995) Science 270:1510; Stemmer (1995) Bio/Technology 13:549-553; Stemmer (1994) Nature 370:389-391; and Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751. See also WO 2008/073877 and US 2007/0204369, both of which are herein incorporated by reference in their entirety.

Mutational methods of generating diversity include, for example, site-directed mutagenesis (Ling, et al., (1997) Anal Biochem. 254(2):157-178; Dale, et al., (1996) Methods Mol. Biol. 57:369-374; Smith (1985) Ann. Rev. Genet. 19:423-462; Botstein and Shortle, (1985) Science 229:1193-1201; Carter (1986) Biochem. J. 237:1-7; and Kunkel (1987) Nucleic Acids & Molecular Biology (Eckstein and Lilley, eds., Springer Verlag, Berlin)); mutagenesis using uracil containing templates (Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel, et al., (1987) Methods in Enzymol. 154:367-382; and Bass, et al., (1988) Science 242:240-245); oligonucleotide-directed mutagenesis (Methods in Enzymol. 100:468-500 (1983); Methods in Enzymol. 154:329-350 (1987); Zoller and Smith, (1982) Nucleic Acids Res. 10:6487-6500; Zoller and Smith, (1983) Methods in Enzymol. 100:468-500; and Zoller and Smith, (1987) Methods in Enzymol. 154:329-350); phosphorothioate-modified DNA mutagenesis (Taylor, et al., (1985) Nucl. Acids Res. 13:8749-8764; Taylor, et al., (1985) Nucl. Acids Res. 13:8765-8787; Nakamaye and Eckstein, (1986) Nucl. Acids Res. 14:9679-9698; Sayers, et al., (1988) Nucl. Acids Res. 16:791-802; and Sayers, et al., (1988) Nucl. Acids Res. 16:803-814); and mutagenesis using gapped duplex DNA (Kramer, et al., (1984) Nucl. Acids Res. 12:9441-9456; Kramer and Fritz, (1987) Methods in Enzymol. 154:350-367; Kramer, et al., (1988) Nucl. Acids Res. 16:7207; and Fritz, et al., (1988) Nucl. Acids Res. 16:6987-6999).

Additional suitable methods include, but are not limited to, point mismatch repair (Kramer, et al., (1984) Cell 38:879-887), mutagenesis using repair-deficient host strains (Carter, et al., (1985) Nucl. Acids Res. 13:4431-4443; and Carter (1987) Methods in Enzymol. 154:382-403), deletion mutagenesis (Eghtedarzadeh and Henikoff, (1986) Nucl. Acids Res. 14:5115), restriction-selection and restriction-purification (Wells, et al., (1986) Phil. Trans. R. Soc. Lond. A 317:415-423), mutagenesis by total gene synthesis (Nambiar, et al., (1984) Science 223:1299-1301; Sakamar and Khorana, (1988) Nucl. Acids Res. 14:6361-6372; Wells, et al., (1985) Gene 34:315-323; and Grundström, et al., (1985) Nucl. Acids Res. 13:3305-3316), and double-strand break repair (Mandecki (1986) Proc. Natl. Acad. Sci. USA 83:7177-7181; and Arnold (1993) Current Opinion in Biotechnology 4:450-455). Additional details on many of the above methods can be found in Methods in Enzymology Volume 154, which also describes useful controls for trouble-shooting problems with various mutagenesis methods.

Additional details regarding various diversity generating methods can be found in the following US Patents, PCT Publications, and EPO Publications: U.S. Pat. No. 5,605,793, U.S. Pat. No. 5,811,238, U.S. Pat. No. 5,830,721, U.S. Pat. No. 5,834,252, U.S. Pat. No. 5,837,458, WO 1995/22625, WO 1996/33207, WO 1997/20078, WO 1997/35966, WO 1999/41402, WO 1999/41383, WO 1999/41369, WO 1999/41368, EP 752008, EP 0932670, WO 1999/23107, WO 1999/21979, WO 1998/31837, WO 1998/27230, WO 1998/13487, WO 2000/00632, WO 2000/09679, WO 1998/42832, WO 1999/29902, WO 1998/41653, WO 1998/41622, WO 1998/42727, WO 2000/18906, WO 2000/04190, WO 2000/42561, WO 2000/42559, WO 2000/42560, WO 2001/23401, and, PCT/US2001/06775. See, also, WO 2007/04303, herein incorporated by reference.

In brief, several different general classes of sequence modification methods, such as mutation, recombination, etc. are applicable to the present invention and set forth, e.g., in the references above. That is, alterations to the component nucleic acid sequences to produce modified gene fusion constructs can be performed by any number of the protocols described, either before cojoining of the sequences, or after the cojoining step. The following exemplify some of the different types of preferred formats for diversity generation in the context of the present invention, including, e.g., certain recombination based diversity generation formats.

Nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids. For example, sexual PCR mutagenesis can be used in which random (or pseudo random, or even non-random) fragmentation of the DNA molecule is followed by recombination, based on sequence similarity, between DNA molecules with different but related DNA sequences, in vitro, followed by fixation of the crossover by extension in a polymerase chain reaction. This process and many process variants are described in several of the references above, e.g., in Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751.

Similarly, nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells. Many such in vivo recombination formats are set forth in the references noted above. Such formats optionally provide direct recombination between nucleic acids of interest, or provide recombination between vectors, viruses, plasmids, etc., comprising the nucleic acids of interest, as well as other formats. Details regarding such procedures are found in the references noted above.

Whole genome recombination methods can also be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes corresponding to the pathways of the present invention). These methods have many applications, including those in which the identity of a target gene is not known. Details on such methods are found, e.g., in WO 1998/31837 and in PCT/US1999/15972. Thus, any of these processes and techniques for recombination, recursive recombination, and whole genome recombination, alone or in combination, can be used to generate the modified nucleic acid sequences and/or modified gene fusion constructs of the present invention.

Synthetic recombination methods can also be used, in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids. Oligonucleotides can be made by standard nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches. Details regarding such approaches are found in the references noted above, including, e.g., WO 2000/42561, WO 2001/23401, WO 2000/42560, and, WO 2000/42559.

In silico methods of recombination can be effected in which genetic algorithms are used in a computer to recombine sequence strings which correspond to homologous (or even non-homologous) nucleic acids. The resulting recombined sequence strings are optionally converted into nucleic acids by synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert with oligonucleotide synthesis/gene reassembly techniques. This approach can generate random, partially random or designed variants. Many details regarding in silico recombination, including the use of genetic algorithms, genetic operators and the like in computer systems, combined with generation of corresponding nucleic acids (and/or proteins), as well as combinations of designed nucleic acids and/or proteins (e.g., based on cross-over site selection) as well as designed, pseudo-random or random recombination methods are described in WO 2000/42560 and WO 2000/42559.
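As an illustration of recombining sequence strings in a computer, the following minimal sketch applies a simple crossover operator to two pre-aligned, equal-length parental strings. It is not the method of the cited publications; the function names, the equal-length assumption and the use of a seeded random number generator are assumptions made for this example. Crossover points can either be supplied (designed variants) or drawn at random (random variants).

```python
import random


def crossover(parent_a: str, parent_b: str, points) -> str:
    """Build a chimera that switches parental template at each crossover point."""
    if len(parent_a) != len(parent_b):
        raise ValueError("this sketch assumes pre-aligned, equal-length parents")
    parents = (parent_a, parent_b)
    segments = []
    current = 0   # index of the parent currently being copied
    previous = 0  # start of the segment being copied
    for point in sorted(points) + [len(parent_a)]:
        segments.append(parents[current][previous:point])
        current = 1 - current
        previous = point
    return "".join(segments)


def random_library(parent_a: str, parent_b: str, size: int,
                   n_crossovers: int = 1, seed: int = 0):
    """Generate `size` chimeras with randomly placed crossover points."""
    rng = random.Random(seed)
    library = []
    for _ in range(size):
        points = rng.sample(range(1, len(parent_a)), n_crossovers)
        library.append(crossover(parent_a, parent_b, points))
    return library
```

For example, `crossover("AAAAAAAA", "CCCCCCCC", [3])` copies the first three positions from the first parent and the remainder from the second.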

Many methods of accessing natural diversity, e.g., by hybridization of diverse nucleic acids or nucleic acid fragments to single-stranded templates, followed by polymerization and/or ligation to regenerate full-length sequences, optionally followed by degradation of the templates and recovery of the resulting modified nucleic acids can be similarly used. In one method employing a single-stranded template, the fragment population derived from the genomic library(ies) is annealed with partial, or, often approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly of complex chimeric genes from this population is then mediated by nuclease-based removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation. The parental polynucleotide strand can be removed by digestion (e.g., if RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in a manner conducive to such separation) and other available separation/purification methods. Alternatively, the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps. Additional details regarding this approach are found, e.g., in PCT/US2001/06775.

In another approach, single-stranded molecules are converted to double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated binding. After separation of unbound DNA, the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library enriched in sequences which hybridize to the probe. A library produced in this manner provides a desirable substrate for further diversification using any of the procedures described herein.

Any of the preceding general recombination formats can be practiced in a reiterative fashion (e.g., one or more cycles of mutation/recombination or other diversity generation methods, optionally followed by one or more selection methods) to generate a more diverse set of recombinant nucleic acids.

Mutagenesis employing polynucleotide chain termination methods has also been proposed (see, e.g., U.S. Pat. No. 5,965,408 and the references above), and can be applied to the present invention. In this approach, double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene. The single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent (e.g., ultraviolet, gamma or X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as single strand binding proteins, transcription activating factors, or histones; polycyclic aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated polymerization mediated by rapid thermocycling; and the like), resulting in the production of partial duplex molecules. The partial duplex molecules, e.g., containing partially extended chains, are then denatured and reannealed in subsequent rounds of replication or partial replication resulting in polynucleotides which share varying degrees of sequence similarity and which are diversified with respect to the starting population of DNA molecules. Optionally, the products, or partial pools of the products, can be amplified at one or more stages in the process. Polynucleotides produced by a chain termination method, such as described above, are suitable substrates for any other described recombination format.

Diversity also can be generated in nucleic acids or populations of nucleic acids using a recombinational procedure termed “incremental truncation for the creation of hybrid enzymes” (“ITCHY”) described in Ostermeier, et al., (1999) Nature Biotech 17:1205. This approach can be used to generate an initial library of variants which can optionally serve as a substrate for one or more in vitro or in vivo recombination methods. See, also, Ostermeier, et al., (1999) Proc. Natl. Acad. Sci. USA 96:3562-3567; Ostermeier, et al., (1999), Biological and Medicinal Chemistry 7:2139-2144.

Mutational methods which result in the alteration of individual nucleotides or groups of contiguous or non-contiguous nucleotides can be favorably employed to introduce nucleotide diversity into the nucleic acid sequences and/or gene fusion constructs of the present invention. Many mutagenesis methods are found in the above-cited references; additional details regarding mutagenesis methods, which can also be applied to the present invention, can be found in the following.

For example, error-prone PCR can be used to generate nucleic acid variants. Using this technique, PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Examples of such techniques are found in the references above and, e.g., in Leung, et al., (1989) Technique 1:11-15 and Caldwell, et al., (1992) PCR Methods Applic. 2:28-33. Similarly, assembly PCR can be used, in a process which involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions can occur in parallel in the same reaction mixture, with the products of one reaction priming the products of another reaction.
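The effect of low-fidelity copying can be illustrated in silico. The following is a minimal sketch, assuming a uniform per-base substitution probability and substitutions only (no insertions or deletions); the function names and the error-rate parameter are assumptions for illustration and do not model the chemistry of any particular error-prone PCR protocol.

```python
import random

BASES = "ACGT"


def error_prone_copy(template: str, error_rate: float, rng: random.Random) -> str:
    """Copy a template, substituting each base with probability `error_rate`."""
    copy = []
    for base in template:
        if rng.random() < error_rate:
            # Substitute with one of the three other bases, chosen uniformly.
            copy.append(rng.choice([b for b in BASES if b != base]))
        else:
            copy.append(base)
    return "".join(copy)


def mutant_library(template: str, error_rate: float, size: int, seed: int = 0):
    """Generate `size` independently mutated copies of a template."""
    rng = random.Random(seed)
    return [error_prone_copy(template, error_rate, rng) for _ in range(size)]
```

Because every position is sampled independently, point mutations are distributed along the entire length of each simulated product, as described above for the PCR product.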

Oligonucleotide directed mutagenesis can be used to introduce site-specific mutations in a nucleic acid sequence of interest. Examples of such techniques are found in the references above and, e.g., in Reidhaar-Olson, et al., (1988) Science 241:53-57. Similarly, cassette mutagenesis can be used in a process that replaces a small region of a double stranded DNA molecule with a synthetic oligonucleotide cassette that differs from the native sequence. The oligonucleotide can contain, e.g., completely and/or partially randomized native sequence(s).

Recursive ensemble mutagenesis is a process in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants, members of which differ in amino acid sequence. This method uses a feedback mechanism to monitor successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin and Youvan, (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815.

Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants. Small groups of residues in a sequence of interest are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins. Examples of such procedures are found in Delegrave and Youvan, (1993) Biotechnology Research 11:1548-1552.

In vivo mutagenesis can be used to generate random mutations in any cloned DNA of interest by propagating the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These “mutator” strains have a higher random mutation rate than that of a wild-type parent. Propagating the DNA in one of these strains will eventually generate random mutations within the DNA. Such procedures are described in the references noted above.

Other procedures for introducing diversity into a genome, e.g. a bacterial, fungal, animal or plant genome can be used in conjunction with the above described and/or referenced methods. For example, in addition to the methods above, techniques have been proposed which produce nucleic acid multimers suitable for transformation into a variety of species (see, e.g., U.S. Pat. No. 5,756,316 and the references above). Transformation of a suitable host with such multimers, consisting of genes that are divergent with respect to one another, (e.g., derived from natural diversity or through application of site directed mutagenesis, error prone PCR, passage through mutagenic bacterial strains, and the like), provides a source of nucleic acid diversity for DNA diversification, e.g., by an in vivo recombination process as indicated above.

Alternatively, a multiplicity of monomeric polynucleotides sharing regions of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell. Subsequent rounds of cell division can be used to generate libraries, members of which include a single, homogenous population, or pool of monomeric polynucleotides. Alternatively, the monomeric nucleic acid can be recovered by standard techniques, e.g., PCR and/or cloning, and recombined in any of the recombination formats, including recursive recombination formats, described above.

Methods for generating multispecies expression libraries have been described (in addition to the references noted above, see, e.g., U.S. Pat. No. 5,783,431 and U.S. Pat. No. 5,824,485) and their use to identify protein activities of interest has been proposed (in addition to the references noted above, see U.S. Pat. No. 5,958,672). Multispecies expression libraries include, in general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or genomic sequences are optionally randomly ligated to further enhance diversity. The vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species, eukaryotic cells. In some cases, the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for any of the methods herein described.

The above described procedures have been largely directed to increasing nucleic acid and/or encoded protein diversity. However, in many cases, not all of the diversity is useful, e.g., functional, and contributes merely to increasing the background of variants that must be screened or selected to identify the few favorable variants. In some applications, it is desirable to preselect or prescreen libraries (e.g., an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or other substrate nucleic acids prior to diversification, e.g., by recombination-based mutagenesis procedures, or to otherwise bias the substrates towards nucleic acids that encode functional products. For example, in the case of antibody engineering, it is possible to bias the diversity generating process toward antibodies with functional antigen binding sites by taking advantage of in vivo recombination events prior to manipulation by any of the described methods. For example, recombined CDRs derived from B cell cDNA libraries can be amplified and assembled into framework regions (e.g., Jirholt, et al., (1998) Gene 215:471) prior to diversifying according to any of the methods described herein.

Libraries can be biased towards nucleic acids which encode proteins with desirable enzyme activities. For example, after identifying a variant from a library which exhibits a specified activity, the variant can be mutagenized using any known method for introducing DNA alterations. A library comprising the mutagenized homologues is then screened for a desired activity, which can be the same as or different from the initially specified activity. An example of such a procedure is proposed in U.S. Pat. No. 5,939,250. Desired activities can be identified by any method known in the art. For example, WO 1999/10539 proposes that gene libraries can be screened by combining extracts from the gene library with components obtained from metabolically rich cells and identifying combinations which exhibit the desired activity. It has also been proposed (e.g., WO 1998/58085) that clones with desired activities can be identified by inserting bioactive substrates into samples of the library, and detecting bioactive fluorescence corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer.

Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe. For example, application WO 1999/10539 proposes that polynucleotides encoding a desired activity (e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from among genomic DNA sequences in the following manner. Single stranded DNA molecules from a population of genomic DNA are hybridized to a ligand-conjugated probe. The genomic DNA can be derived from either a cultivated or uncultivated microorganism, or from an environmental sample. Alternatively, the genomic DNA can be derived from a multicellular organism, or a tissue derived therefrom. Second strand synthesis can be conducted directly from the hybridization probe used in the capture, with or without prior release from the capture medium or by a wide variety of other strategies known in the art. Alternatively, the isolated single-stranded genomic DNA population can be fragmented without further cloning and used directly in, e.g., a recombination-based approach, that employs a single-stranded template, as described above.

“Non-Stochastic” methods of generating nucleic acids and polypeptides are found in WO 2000/46344. These methods, including proposed non-stochastic polynucleotide reassembly and site-saturation mutagenesis methods, can be applied to the present invention as well. Random or semi-random mutagenesis using doped or degenerate oligonucleotides is also described in, e.g., Arkin and Youvan, (1992) Biotechnology 10:297-300; Reidhaar-Olson, et al., (1991) Methods Enzymol. 208:564-586; Lim and Sauer, (1991) J. Mol. Biol. 219:359-376; Breyer and Sauer, (1989) J. Biol. Chem. 264:13355-13360; and U.S. Pat. Nos. 5,830,650 and 5,798,208, and EP Patent 0527809 B1.

It will readily be appreciated that any of the above described techniques suitable for enriching a library prior to diversification can also be used to screen the products, or libraries of products, produced by the diversity generating methods. Any of the above described methods can be practiced recursively or in combination to alter nucleic acids, e.g., meganuclease encoding polynucleotides.

The above references provide many mutational formats, including recombination, recursive recombination, recursive mutation and combinations of recombination with other forms of mutagenesis, as well as many modifications of these formats. Regardless of the diversity generation format that is used, the nucleic acids of the present invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids for use in the gene fusion constructs and modified gene fusion constructs of the present invention, including, e.g., sets of homologous nucleic acids, as well as corresponding polypeptides.

Many of the above-described methodologies for generating modified polynucleotides generate a large number of diverse variants of a parental sequence or sequences. In some embodiments, the modification technique (e.g., some form of shuffling) is used to generate a library of variants that is then screened for a modified polynucleotide or pool of modified polynucleotides encoding some desired functional attribute, e.g., increased meganuclease activity.

For convenience and high throughput it will often be desirable to screen/select for desired modified nucleic acids in a microorganism, e.g., a bacterium such as E. coli. On the other hand, screening in plant cells or plants can in some cases be preferable where the ultimate aim is to generate a modified nucleic acid for expression in a plant system.

In some preferred embodiments of the invention throughput is increased by screening pools of host cells expressing different modified nucleic acids, either alone or as part of a gene fusion construct. Any pools showing significant activity can be deconvoluted to identify single variants expressing the desirable activity.
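The pooling and deconvolution strategy can be sketched as a two-stage screen. The following is a minimal illustration, assuming a caller-supplied `assay` function that returns a numeric activity for a list of variants and a fixed activity threshold; both the function signature and the threshold are hypothetical and stand in for whatever meganuclease activity assay is actually used.

```python
def deconvolute(pools, assay, threshold):
    """Screen pools first, then re-test the members of active pools one by one.

    pools     -- list of lists of variant identifiers
    assay     -- callable taking a list of variants and returning an activity value
    threshold -- minimum activity counted as significant
    """
    hits = []
    for pool in pools:
        if assay(pool) < threshold:
            continue  # inactive pool: none of its members is assayed individually
        for variant in pool:
            if assay([variant]) >= threshold:
                hits.append(variant)
    return hits
```

In this sketch, members of inactive pools are never assayed individually, which is where the gain in throughput comes from.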

In high throughput assays, it is possible to screen up to several thousand different variants in a single day. For example, each well of a microtiter plate can be used to run a separate assay, or, if concentration or incubation time effects are to be observed, every 5-10 wells can test a single variant.
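The trade-off between replicate wells and variants per plate can be made concrete. The following is a minimal sketch, assuming a standard 96-well plate (8 rows by 12 columns) filled row by row; the plate format, well naming and function names are assumptions for illustration, since the text above specifies only a microtiter plate.

```python
from string import ascii_uppercase


def variants_per_plate(wells_per_variant: int, rows: int = 8, cols: int = 12) -> int:
    """Number of variants that fit on one plate at a given replicate count."""
    return (rows * cols) // wells_per_variant


def layout_plate(variants, wells_per_variant: int, rows: int = 8, cols: int = 12):
    """Assign each variant a run of consecutive wells, filling the plate row by row."""
    wells = [f"{ascii_uppercase[r]}{c + 1}" for r in range(rows) for c in range(cols)]
    if len(variants) * wells_per_variant > len(wells):
        raise ValueError("too many variants for one plate at this replicate count")
    layout = {}
    for i, variant in enumerate(variants):
        for j in range(wells_per_variant):
            layout[wells[i * wells_per_variant + j]] = variant
    return layout
```

Under these assumptions, one well per variant allows 96 variants per plate, whereas 5 to 10 wells per variant (e.g., for a concentration or incubation time series) allows 19 down to 9 variants per plate.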

In addition to fluidic approaches, it is possible, as mentioned above, simply to grow cells on media plates that select for the desired enzymatic or metabolic function. This approach offers a simple and high-throughput screening method.

A number of well known robotic systems have also been developed for solution phase chemistries useful in assay systems. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual synthetic operations performed by a scientist. Any of the above devices are suitable for application to the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein with reference to the integrated system will be apparent to persons skilled in the relevant art.

High throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, Mass.; Air Technical Industries, Mentor, Ohio; Beckman Instruments, Inc. Fullerton, Calif.; Precision Systems, Inc., Natick, Mass., etc.). These systems typically automate entire procedures including all sample and reagent pipetting, liquid dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate for the assay. These configurable systems provide high throughput and rapid start up as well as a high degree of flexibility and customization.

The manufacturers of such systems provide detailed protocols for the various high throughput devices. Thus, for example, Zymark Corp. provides technical bulletins describing screening systems for detecting the modulation of gene transcription, ligand binding, and the like. Microfluidic approaches to reagent manipulation have also been developed, e.g., by Caliper Technologies (Mountain View, Calif.).

Yeast and Plants

Yeast, plants, plant cells, plant parts and seeds, and grain having the meganuclease sequences disclosed herein are provided. In specific embodiments, the yeast, plants and/or plant parts have stably incorporated at least one heterologous meganuclease polypeptide disclosed herein or an active variant or fragment thereof. Thus, yeast, plants, plant cells, plant parts and seed are provided which comprise at least one heterologous meganuclease sequence of any one of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 262, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 315, 316, 317, 318, 319, 320, 330, 331, 332, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402 or 403 or any one of the other variants disclosed herein, such as those in Examples 3-23, or a biologically active fragment and/or variant of the meganuclease sequence. In specific embodiments, the meganuclease sequences are characterized as having meganuclease activity.

In specific embodiments, the heterologous polynucleotide in the plant or plant part is operably linked to a constitutive, tissue-preferred, or other promoter for expression in plants.

The yeast, plant cell, plant, plant part and seed can comprise any of the recognition sequences provided herein. For example, the recognition site can be selected from the group consisting of the LIG3-4 (SEQ ID NO: 2), MHP77 (SEQ ID NO: 85), MS26 (SEQ ID NO: 269), MHP14 (SEQ ID NO: 281), MP107 (SEQ ID NO: 328), ZM6.3 (SEQ ID NO: 355), ZM6.22V2 (SEQ ID NO: 388), TS21 (SEQ ID NO: 423) and/or TS14 (SEQ ID NO: 424) recognition sequences or an active variant thereof.

As used herein, the term plant includes plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells that are intact in plants or parts of plants such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruit, kernels, ears, cobs, husks, stalks, roots, root tips, anthers, and the like. Grain is intended to mean the mature seed produced by commercial growers for purposes other than growing or reproducing the species. Progeny, variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced polynucleotides.

A transformed plant or transformed plant cell provided herein is one in which genetic alteration, such as transformation, has been effected as to a gene of interest, or is a plant or plant cell which is descended from a plant or cell so altered and which comprises the alteration. A “transgene” is a gene that has been introduced into the genome by a transformation procedure. Accordingly, a “transgenic plant” is a plant that contains a transgene, whether the transgene was introduced into that particular plant by transformation or by breeding; thus, descendants of an originally-transformed plant are encompassed by the definition. A “control” or “control plant” or “control plant cell” provides a reference point for measuring changes in phenotype of the subject plant or plant cell. A control plant or plant cell may comprise, for example: (a) a wild-type plant or cell, i.e., of the same genotype as the starting material for the genetic alteration which resulted in the subject plant or cell; (b) a plant or plant cell of the same genotype as the starting material but which has been transformed with a null construct (i.e., with a construct which does not express the transgene, such as a construct comprising a marker gene); (c) a plant or plant cell which is a non-transformed segregant among progeny of a subject plant or plant cell; (d) a plant or plant cell genetically identical to the subject plant or plant cell but which is not exposed to conditions or stimuli that would induce expression of the transgene; or (e) the subject plant or plant cell itself, under conditions in which the construct is not expressed.

Plant cells that have been transformed to express a meganuclease provided herein can be grown into whole plants. The regeneration, development, and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art. See, for example, McCormick, et al., (1986) Plant Cell Reports 5:81-84; Weissbach and Weissbach, In: Methods for Plant Molecular Biology, (Eds.), Academic Press, Inc. San Diego, Calif., (1988). This regeneration and growth process typically includes the steps of selection of transformed cells, culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure expression of the desired phenotypic characteristic has been achieved. In this manner, the compositions presented herein provide transformed seed (also referred to as “transgenic seed”) having a polynucleotide provided herein, for example, a target site, stably incorporated into their genome.
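As an illustration of why self-pollination is used to obtain homozygous transgenic plants, the expected progeny classes can be enumerated as below. This is a minimal sketch under assumptions not stated in the text: a single transgene insertion at one locus, a regenerated plant that is hemizygous for it (written `T/-`), and simple Mendelian segregation. The function name `selfing_ratios` is hypothetical.

```python
from fractions import Fraction
from itertools import product


def selfing_ratios(parent=("T", "-")):
    """Expected genotype fractions among progeny of a self-pollinated plant.

    parent holds the two alleles at the (assumed single) insertion locus:
    "T" marks the transgene and "-" marks its absence.
    """
    counts = {}
    # Each progeny receives one allele from the pollen and one from the egg.
    for pollen, egg in product(parent, repeat=2):
        genotype = "".join(sorted((pollen, egg)))
        counts[genotype] = counts.get(genotype, 0) + 1
    total = sum(counts.values())
    return {g: Fraction(n, total) for g, n in counts.items()}


if __name__ == "__main__":
    for genotype, fraction in selfing_ratios().items():
        print(genotype, fraction)
```

Under these assumptions, one quarter of the selfed progeny are homozygous for the transgene, one half are hemizygous, and one quarter are non-transformed segregants of the kind that can serve as control plants.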

The meganuclease sequences and active variants and fragments thereof disclosed herein may be used for transformation of any plant species, including, but not limited to, monocots and dicots. Examples of plant species of interest include, but are not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, barley, vegetables, ornamentals, and conifers.

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea (Hydrangea macrophylla), hibiscus (Hibiscus rosa-sinensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum.

Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliottii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata); Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis), and Poplar and Eucalyptus. In specific embodiments, plants of the present invention are crop plants (for example, corn, alfalfa, sunflower, Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, etc.). In other embodiments, corn and soybean plants are optimal, and in yet other embodiments corn plants are optimal.

Other plants of interest include grain plants that provide seeds of interest, oil-seed plants, and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc. Oil-seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc.

Non-limiting examples of compositions and methods disclosed herein are as follows:

  • 1. An isolated or recombinant polynucleotide comprising a nucleotide sequence encoding a meganuclease polypeptide, said polypeptide comprising:

    • a) an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; or,
    • b) an amino acid sequence having at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44 of any of the amino acid modifications of (a).
  • 2. The isolated or recombinant polynucleotide of embodiment 1, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 1.
  • 3. The isolated or recombinant polynucleotide of embodiment 1, wherein said at least one amino acid modification comprises:

    • a) an aspartic acid (D) at a position corresponding to amino acid position 2 in SEQ ID NO: 1;
    • b) a histidine (H) at a position corresponding to amino acid position 12 in SEQ ID NO: 1;
    • c) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 1;
    • d) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 1;
    • e) a leucine (L) at a position corresponding to amino acid position 23 in SEQ ID NO: 1;
    • f) an arginine (R) at a position corresponding to amino acid position 31 in SEQ ID NO: 1;
    • g) an asparagine (N) at a position corresponding to amino acid position 36 in SEQ ID NO: 1;
    • h) a leucine (L) at a position corresponding to amino acid position 43 in SEQ ID NO: 1;
    • i) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 1;
    • j) a leucine (L) at a position corresponding to amino acid position 56 in SEQ ID NO: 1;
    • k) an isoleucine (I) at a position corresponding to amino acid position 58 in SEQ ID NO: 1;
    • l) a histidine (H) or alanine (A) at a position corresponding to amino acid position 59 in SEQ ID NO: 1;
    • m) a valine (V) at a position corresponding to amino acid position 62 in SEQ ID NO: 1;
    • n) a lysine (K) at a position corresponding to amino acid position 71 in SEQ ID NO: 1;
    • o) a threonine (T) at a position corresponding to amino acid position 72 in SEQ ID NO: 1;
    • p) an alanine (A) at a position corresponding to amino acid position 73 in SEQ ID NO: 1;
    • q) an arginine (R) at a position corresponding to amino acid position 80 in SEQ ID NO: 1;
    • r) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 1;
    • s) an arginine (R) at a position corresponding to amino acid position 82 in SEQ ID NO: 1;
    • t) an aspartic acid (D) at a position corresponding to amino acid position 86 in SEQ ID NO: 1;
    • u) an isoleucine (I) at a position corresponding to amino acid position 91 in SEQ ID NO: 1;
    • v) an isoleucine (I) at a position corresponding to amino acid position 95 in SEQ ID NO: 1;
    • w) an arginine (R) at a position corresponding to amino acid position 98 in SEQ ID NO: 1;
    • x) a valine (V) at a position corresponding to amino acid position 103 in SEQ ID NO: 1;
    • y) a serine (S) at a position corresponding to amino acid position 113 in SEQ ID NO: 1;
    • z) a proline (P) at a position corresponding to amino acid position 114 in SEQ ID NO: 1;
    • aa) an arginine (R) at a position corresponding to amino acid position 116 in SEQ ID NO: 1;
    • bb) a glycine (G) at a position corresponding to amino acid position 117 in SEQ ID NO: 1;
    • cc) a threonine (T) at a position corresponding to amino acid position 118 in SEQ ID NO: 1;
    • dd) a glycine (G) at a position corresponding to amino acid position 121 in SEQ ID NO: 1;
    • ee) an arginine (R) at a position corresponding to amino acid position 124 in SEQ ID NO: 1;
    • ff) a cysteine (C) at a position corresponding to amino acid position 128 in SEQ ID NO: 1;
    • gg) an alanine (A) at a position corresponding to amino acid position 129 in SEQ ID NO: 1;
    • hh) an arginine (R) at a position corresponding to amino acid position 131 in SEQ ID NO: 1;
    • ii) a serine (S) at a position corresponding to amino acid position 147 in SEQ ID NO: 1;
    • jj) an alanine (A) at a position corresponding to amino acid position 151 in SEQ ID NO: 1;
    • kk) a leucine (L) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 1;
    • ll) a tryptophan (W) at a position corresponding to amino acid position 159 in SEQ ID NO: 1;
    • mm) a glutamic acid (E) at a position corresponding to amino acid position 160 in SEQ ID NO: 1;
    • nn) a valine (V) at a position corresponding to amino acid position 161 in SEQ ID NO: 1;
    • oo) a tyrosine (Y) at a position corresponding to amino acid position 162 in SEQ ID NO: 1;
    • pp) an arginine (R) at a position corresponding to amino acid position 163 in SEQ ID NO: 1;
    • qq) a histidine (H) at a position corresponding to amino acid position 164 in SEQ ID NO: 1;
    • rr) a leucine (L) at a position corresponding to amino acid position 165 in SEQ ID NO: 1;
    • ss) an arginine (R) at a position corresponding to amino acid position 166 in SEQ ID NO: 1;
    • tt) a histidine (H) at a position corresponding to amino acid position 167 in SEQ ID NO: 1;
    • uu) a proline (P) at a position corresponding to amino acid position 168 in SEQ ID NO: 1;
    • vv) an alanine (A) at a position corresponding to amino acid position 169 in SEQ ID NO: 1;
    • ww) a proline (P) at a position corresponding to amino acid position 170 in SEQ ID NO: 1;
    • xx) a histidine (H) at a position corresponding to amino acid position 171 in SEQ ID NO: 1;
    • yy) a proline (P) at a position corresponding to amino acid position 172 in SEQ ID NO: 1;
    • zz) an arginine (R) at a position corresponding to amino acid position 173 in SEQ ID NO: 1;
    • aaa) a leucine (L) at a position corresponding to amino acid position 174 in SEQ ID NO: 1;
    • bbb) a proline (P) at a position corresponding to amino acid position 175 in SEQ ID NO: 1;
    • ccc) a glutamine (Q) at a position corresponding to amino acid position 176 in SEQ ID NO: 1;
    • ddd) an alanine (A) at a position corresponding to amino acid position 177 in SEQ ID NO: 1;
    • eee) an arginine (R) at a position corresponding to amino acid position 178 in SEQ ID NO: 1;
    • fff) a valine (V) at a position corresponding to amino acid position 179 in SEQ ID NO: 1;
    • ggg) a glutamine (Q) at a position corresponding to amino acid position 180 in SEQ ID NO: 1;
    • hhh) a valine (V) at a position corresponding to amino acid position 182 in SEQ ID NO: 1;
    • iii) a proline (P) at a position corresponding to amino acid position 183 in SEQ ID NO: 1;
    • jjj) a lysine (K) at a position corresponding to amino acid position 184 in SEQ ID NO: 1;
    • kkk) a threonine (T) or a histidine (H) at a position corresponding to amino acid position 185 in SEQ ID NO: 1;
    • lll) a serine (S) at a position corresponding to amino acid position 186 in SEQ ID NO: 1;
    • mmm) a glutamic acid (E) at a position corresponding to amino acid position 187 in SEQ ID NO: 1;
    • nnn) a leucine (L) at a position corresponding to amino acid position 188 in SEQ ID NO: 1;
    • ooo) a glutamic acid (E) at a position corresponding to amino acid position 189 in SEQ ID NO: 1;
    • ppp) a glutamine (Q) at a position corresponding to amino acid position 190 in SEQ ID NO: 1;
    • qqq) a leucine (L) at a position corresponding to amino acid position 191 in SEQ ID NO: 1;
    • rrr) a proline (P) at a position corresponding to amino acid position 194 in SEQ ID NO: 1;
    • sss) a lysine (K) at a position corresponding to amino acid position 195 in SEQ ID NO: 1;
    • ttt) a serine (S) at a position corresponding to amino acid position 196 in SEQ ID NO: 1;
    • uuu) a phenylalanine (F) at a position corresponding to amino acid position 197 in SEQ ID NO: 1;
    • vvv) an isoleucine (I) at a position corresponding to amino acid position 200 in SEQ ID NO: 1;
    • www) a valine (V) at a position corresponding to amino acid position 203 in SEQ ID NO: 1;
    • xxx) a leucine (L) at a position corresponding to amino acid position 204 in SEQ ID NO: 1;
    • yyy) a cysteine (C) at a position corresponding to amino acid position 209 in SEQ ID NO: 1;
    • zzz) a leucine (L) at a position corresponding to amino acid position 222 in SEQ ID NO: 1;
    • aaaa) an isoleucine (I) at a position corresponding to amino acid position 232 in SEQ ID NO: 1;
    • bbbb) a serine (S) at a position corresponding to amino acid position 236 in SEQ ID NO: 1;
    • cccc) a leucine (L) or an arginine (R) at a position corresponding to amino acid position 237 in SEQ ID NO: 1;
    • dddd) a histidine (H) at a position corresponding to amino acid position 246 in SEQ ID NO: 1;
    • eeee) an isoleucine (I) at a position corresponding to amino acid position 254 in SEQ ID NO: 1;
    • ffff) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 1;
    • gggg) an arginine (R) at a position corresponding to amino acid position 267 in SEQ ID NO: 1;
    • hhhh) an isoleucine (I) at a position corresponding to amino acid position 278 in SEQ ID NO: 1;
    • iiii) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 1;
    • jjjj) a phenylalanine (F) at a position corresponding to amino acid position 282 in SEQ ID NO: 1;
    • kkkk) a threonine (T) at a position corresponding to amino acid position 289 in SEQ ID NO: 1;
    • llll) a glycine (G) at a position corresponding to amino acid position 308 in SEQ ID NO: 1;
    • mmmm) an arginine (R) at a position corresponding to amino acid position 311 in SEQ ID NO: 1;
    • nnnn) an alanine (A) at a position corresponding to amino acid position 312 in SEQ ID NO: 1;
    • oooo) an alanine (A) at a position corresponding to amino acid position 316 in SEQ ID NO: 1;
    • pppp) an arginine (R) at a position corresponding to amino acid position 318 in SEQ ID NO: 1;
    • qqqq) an alanine (A) at a position corresponding to amino acid position 334 in SEQ ID NO: 1;
    • rrrr) a phenylalanine (F) at a position corresponding to amino acid position 339 in SEQ ID NO: 1;
    • ssss) a glycine (G) or a leucine (L) at a position corresponding to amino acid position 340 in SEQ ID NO: 1;
    • tttt) a serine (S) at a position corresponding to amino acid position 342 in SEQ ID NO: 1;
    • uuuu) an asparagine (N) at a position corresponding to amino acid position 345 in SEQ ID NO: 1;
    • vvvv) an asparagine (N) at a position corresponding to amino acid position 346 in SEQ ID NO: 1;
    • wwww) an asparagine (N) at a position corresponding to amino acid position 348 in SEQ ID NO: 1; or,
    • xxxx) any combination of a) to wwww).
  • 4. The isolated or recombinant polynucleotide of embodiment 1, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide further comprises:

    • a) an aspartic acid (D) at a position corresponding to amino acid position 2 in SEQ ID NO: 1;
    • b) a histidine (H) at a position corresponding to amino acid position 12 in SEQ ID NO: 1;
    • c) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 1;
    • d) a serine (S) or an alanine (A) at a position corresponding to amino acid position 19 in SEQ ID NO: 1;
    • e) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 1;
    • f) a leucine (L) at a position corresponding to amino acid position 23 in SEQ ID NO: 1;
    • g) a methionine (M) at a position corresponding to amino acid position 24 in SEQ ID NO: 1;
    • h) an arginine (R) or an alanine (A) at a position corresponding to amino acid position 28 in SEQ ID NO: 1;
    • i) an arginine (R), alanine (A), glutamine (Q), cysteine (C), glycine (G), serine (S), threonine (T), leucine (L), glutamic acid (E), or a proline (P) at a position corresponding to amino acid position 30 in SEQ ID NO: 1;
    • j) an arginine (R) at a position corresponding to amino acid position 31 in SEQ ID NO: 1;
    • k) an arginine (R), alanine (A), lysine (K) glutamine (Q), glycine (G) or a leucine (L) at a position corresponding to amino acid position 32 in SEQ ID NO: 1;
    • l) an asparagine (N) at a position corresponding to amino acid position 36 in SEQ ID NO: 1;
    • m) a leucine (L) at a position corresponding to amino acid position 43 in SEQ ID NO: 1;
    • n) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 1;
    • o) an isoleucine (I) or a leucine (L) at a position corresponding to amino acid position 54 in SEQ ID NO: 1;
    • p) a leucine (L) at a position corresponding to amino acid position 56 in SEQ ID NO: 1;
    • q) a glutamic acid (E) at a position corresponding to amino acid position 57 in SEQ ID NO: 1;
    • r) an isoleucine (I) at a position corresponding to amino acid position 58 in SEQ ID NO: 1;
    • s) a histidine (H) or alanine (A) at a position corresponding to amino acid position 59 in SEQ ID NO: 1;
    • t) a valine (V) at a position corresponding to amino acid position 62 in SEQ ID NO: 1;
    • u) a lysine (K) at a position corresponding to amino acid position 71 in SEQ ID NO: 1;
    • v) a threonine (T) at a position corresponding to amino acid position 72 in SEQ ID NO: 1;
    • w) an alanine (A) at a position corresponding to amino acid position 73 in SEQ ID NO: 1;
    • x) a glycine (G) at a position corresponding to amino acid position 79 in SEQ ID NO: 1;
    • y) an arginine (R) at a position corresponding to amino acid position 80 in SEQ ID NO: 1;
    • z) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 1;
    • aa) an arginine (R) at a position corresponding to amino acid position 82 in SEQ ID NO: 1;
    • bb) an aspartic acid (D) at a position corresponding to amino acid position 86 in SEQ ID NO: 1;
    • cc) a leucine (L) at a position corresponding to amino acid position 87 in SEQ ID NO: 1;
    • dd) an isoleucine (I) at a position corresponding to amino acid position 91 in SEQ ID NO: 1;
    • ee) an isoleucine (I) at a position corresponding to amino acid position 95 in SEQ ID NO: 1;
    • ff) an arginine (R) at a position corresponding to amino acid position 98 in SEQ ID NO: 1;
    • gg) a valine (V) at a position corresponding to amino acid position 103 in SEQ ID NO: 1;
    • hh) an alanine (A) at a position corresponding to amino acid position 105 in SEQ ID NO: 1;
    • ii) an arginine (R) at a position corresponding to amino acid position 111 in SEQ ID NO: 1;
    • jj) a serine (S) at a position corresponding to amino acid position 113 in SEQ ID NO: 1;
    • kk) a proline (P) at a position corresponding to amino acid position 114 in SEQ ID NO: 1;
    • ll) an arginine (R) at a position corresponding to amino acid position 116 in SEQ ID NO: 1;
    • mm) a glycine (G) at a position corresponding to amino acid position 117 in SEQ ID NO: 1;
    • nn) a threonine (T) at a position corresponding to amino acid position 118 in SEQ ID NO: 1;
    • oo) a glycine (G) at a position corresponding to amino acid position 121 in SEQ ID NO: 1;
    • pp) an arginine (R) at a position corresponding to amino acid position 124 in SEQ ID NO: 1;
    • qq) a cysteine (C) at a position corresponding to amino acid position 128 in SEQ ID NO: 1;
    • rr) an alanine (A) at a position corresponding to amino acid position 129 in SEQ ID NO: 1;
    • ss) an arginine (R) at a position corresponding to amino acid position 131 in SEQ ID NO: 1;
    • tt) a valine (V) at a position corresponding to amino acid position 132 in SEQ ID NO: 1;
    • uu) a serine (S) at a position corresponding to amino acid position 147 in SEQ ID NO: 1;
    • vv) an alanine (A) at a position corresponding to amino acid position 151 in SEQ ID NO: 1;
    • ww) a leucine (L) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 1;
    • xx) a tryptophan (W) at a position corresponding to amino acid position 159 in SEQ ID NO: 1;
    • yy) a glutamic acid (E) at a position corresponding to amino acid position 160 in SEQ ID NO: 1;
    • zz) a valine (V) at a position corresponding to amino acid position 161 in SEQ ID NO: 1;
    • aaa) a tyrosine (Y) at a position corresponding to amino acid position 162 in SEQ ID NO: 1;
    • bbb) an arginine (R) at a position corresponding to amino acid position 163 in SEQ ID NO: 1;
    • ccc) a histidine (H) at a position corresponding to amino acid position 164 in SEQ ID NO: 1;
    • ddd) a leucine (L) at a position corresponding to amino acid position 165 in SEQ ID NO: 1;
    • eee) an arginine (R) at a position corresponding to amino acid position 166 in SEQ ID NO: 1;
    • fff) a histidine (H) at a position corresponding to amino acid position 167 in SEQ ID NO: 1;
    • ggg) a proline (P) at a position corresponding to amino acid position 168 in SEQ ID NO: 1;
    • hhh) an alanine (A) at a position corresponding to amino acid position 169 in SEQ ID NO: 1;
    • iii) a proline (P) at a position corresponding to amino acid position 170 in SEQ ID NO: 1;
    • jjj) a histidine (H) at a position corresponding to amino acid position 171 in SEQ ID NO: 1;
    • kkk) a proline (P) at a position corresponding to amino acid position 172 in SEQ ID NO: 1;
    • lll) an arginine (R) at a position corresponding to amino acid position 173 in SEQ ID NO: 1;
    • mmm) a leucine (L) at a position corresponding to amino acid position 174 in SEQ ID NO: 1;
    • nnn) a proline (P) at a position corresponding to amino acid position 175 in SEQ ID NO: 1;
    • ooo) a glutamine (Q) at a position corresponding to amino acid position 176 in SEQ ID NO: 1;
    • ppp) an alanine (A) at a position corresponding to amino acid position 177 in SEQ ID NO: 1;
    • qqq) an arginine (R) at a position corresponding to amino acid position 178 in SEQ ID NO: 1;
    • rrr) a valine (V) at a position corresponding to amino acid position 179 in SEQ ID NO: 1;
    • sss) a glutamine (Q) at a position corresponding to amino acid position 180 in SEQ ID NO: 1;
    • ttt) a valine (V) at a position corresponding to amino acid position 182 in SEQ ID NO: 1;
    • uuu) a proline (P) at a position corresponding to amino acid position 183 in SEQ ID NO: 1;
    • vvv) a lysine (K) at a position corresponding to amino acid position 184 in SEQ ID NO: 1;
    • www) a threonine (T) or a histidine (H) at a position corresponding to amino acid position 185 in SEQ ID NO: 1;
    • xxx) a serine (S) at a position corresponding to amino acid position 186 in SEQ ID NO: 1;
    • yyy) a glutamic acid (E) at a position corresponding to amino acid position 187 in SEQ ID NO: 1;
    • zzz) a leucine (L) at a position corresponding to amino acid position 188 in SEQ ID NO: 1;
    • aaaa) a glutamic acid (E) at a position corresponding to amino acid position 189 in SEQ ID NO: 1;
    • bbbb) a glutamine (Q) at a position corresponding to amino acid position 190 in SEQ ID NO: 1;
    • cccc) a leucine (L) at a position corresponding to amino acid position 191 in SEQ ID NO: 1;
    • dddd) an amino acid deletion at a position corresponding to amino acid position 192 in SEQ ID NO: 1;
    • eeee) a proline (P) at a position corresponding to amino acid position 194 in SEQ ID NO: 1;
    • ffff) a lysine (K) at a position corresponding to amino acid position 195 in SEQ ID NO: 1;
    • gggg) a serine (S) at a position corresponding to amino acid position 196 in SEQ ID NO: 1;
    • hhhh) a phenylalanine (F) at a position corresponding to amino acid position 197 in SEQ ID NO: 1;
    • iiii) an isoleucine (I) at a position corresponding to amino acid position 200 in SEQ ID NO: 1;
    • jjjj) a valine (V) at a position corresponding to amino acid position 203 in SEQ ID NO: 1;
    • kkkk) a leucine (L) at a position corresponding to amino acid position 204 in SEQ ID NO: 1;
    • llll) an alanine (A) or a serine (S) at a position corresponding to amino acid position 206 in SEQ ID NO: 1;
    • mmmm) a cysteine (C) at a position corresponding to amino acid position 209 in SEQ ID NO: 1;
    • nnnn) a leucine (L) at a position corresponding to amino acid position 222 in SEQ ID NO: 1;
    • oooo) a methionine (M) at a position corresponding to amino acid position 211 in SEQ ID NO: 1;
    • pppp) an isoleucine (I) at a position corresponding to amino acid position 232 in SEQ ID NO: 1;
    • qqqq) a serine (S) at a position corresponding to amino acid position 236 in SEQ ID NO: 1;
    • rrrr) a leucine (L) or an arginine (R) at a position corresponding to amino acid position 237 in SEQ ID NO: 1;
    • ssss) an isoleucine (I) or a leucine (L) at a position corresponding to amino acid position 241 in SEQ ID NO: 1;
    • tttt) a glutamic acid (E) at a position corresponding to amino acid position 244 in SEQ ID NO: 1;
    • uuuu) a histidine (H) at a position corresponding to amino acid position 246 in SEQ ID NO: 1;
    • vvvv) an aspartic acid (D) or histidine (H) at a position corresponding to amino acid position 253 in SEQ ID NO: 1;
    • wwww) an isoleucine (I) at a position corresponding to amino acid position 254 in SEQ ID NO: 1;
    • xxxx) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 1;
    • yyyy) an arginine (R) at a position corresponding to amino acid position 267 in SEQ ID NO: 1;
    • zzzz) an isoleucine (I) at a position corresponding to amino acid position 278 in SEQ ID NO: 1;
    • aaaaa) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 1;
    • bbbbb) a phenylalanine (F) at a position corresponding to amino acid position 282 in SEQ ID NO: 1;
    • ccccc) a threonine (T) at a position corresponding to amino acid position 289 in SEQ ID NO: 1;
    • ddddd) an alanine (A) at a position corresponding to amino acid position 292 in SEQ ID NO: 1;
    • eeeee) a glycine (G) at a position corresponding to amino acid position 308 in SEQ ID NO: 1;
    • fffff) an arginine (R) at a position corresponding to amino acid position 311 in SEQ ID NO: 1;
    • ggggg) an alanine (A) at a position corresponding to amino acid position 312 in SEQ ID NO: 1;
    • hhhhh) an alanine (A) at a position corresponding to amino acid position 316 in SEQ ID NO: 1;
    • iiiii) an arginine (R) at a position corresponding to amino acid position 318 in SEQ ID NO: 1;
    • jjjjj) a valine (V) at a position corresponding to amino acid position 319 in SEQ ID NO: 1;
    • kkkkk) an alanine (A) at a position corresponding to amino acid position 334 in SEQ ID NO: 1;
    • lllll) a phenylalanine (F) at a position corresponding to amino acid position 339 in SEQ ID NO: 1;
    • mmmmm) a glycine (G) or a leucine (L) at a position corresponding to amino acid position 340 in SEQ ID NO: 1;
    • nnnnn) a serine (S) at a position corresponding to amino acid position 342 in SEQ ID NO: 1;
    • ooooo) an asparagine (N) at a position corresponding to amino acid position 345 in SEQ ID NO: 1;
    • ppppp) an asparagine (N) at a position corresponding to amino acid position 346 in SEQ ID NO: 1;
    • qqqqq) an asparagine (N) at a position corresponding to amino acid position 348 in SEQ ID NO: 1; or,
    • rrrrr) any combination of a) to qqqqq).
  • 5. The isolated or recombinant polynucleotide of embodiment 1, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 430, 431, 432 and 433.
  • 6. The isolated or recombinant polynucleotide of embodiment 1, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein the polypeptide is capable of recognizing and cleaving a meganuclease recognition sequence selected from the group consisting of SEQ ID NO: 2 (LIG3-4), SEQ ID NO: 85 (MHP77), SEQ ID NO: 269 (MS26), SEQ ID NO: 281 (MHP14), SEQ ID NO: 331 (MP107), SEQ ID NO: 358 (ZM6.3), SEQ ID NO: 390 (ZM6.22v2), SEQ ID NO: 423 or SEQ ID NO: 424.
  • 7. The isolated or recombinant polynucleotide of embodiment 1, wherein said nucleotide sequence encodes a meganuclease polypeptide, wherein said polypeptide has an increased meganuclease activity when compared to a control meganuclease that lacks said amino acid modification.
  • 8. The isolated or recombinant polynucleotide of embodiment 7, wherein said control meganuclease is selected from the group of SEQ ID NO: 1 (LIG3-4), SEQ ID NO: 86 (MHP77), SEQ ID NO: 250 (MHP77.3), SEQ ID NO: 270 (MS26+), SEQ ID NO: 271, SEQ ID NO: 282 (MHP14), SEQ ID NO: 283 (MHP14+), SEQ ID NO: 329 (MP107), SEQ ID NO: 356 (ZM6.3), SEQ ID NO: 389 (ZM6.22v2), SEQ ID NO: 429 or SEQ ID NO: 435.
  • 9. The isolated or recombinant polynucleotide of embodiment 7, wherein the increased meganuclease activity is evidenced by:

    • a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or,
    • b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or,
    • c) higher in-vitro cutting activity when compared to the control meganuclease that lacks said amino acid modification; or,
    • d) any combination of (a), (b) and (c).
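As a reading aid, the disjunctive test of items (a) to (d) above can be sketched in Python. This is only an illustration: the names `AssayResult` and `shows_increased_activity` and the numeric fields are hypothetical and do not come from the disclosure, and the sketch assumes each measurement is a single number for which a larger value means more activity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AssayResult:
    """Hypothetical container for one meganuclease; None means 'not measured'."""
    yeast_assay_score: Optional[float] = None
    target_site_mutation_rate: Optional[float] = None
    in_vitro_cutting: Optional[float] = None


def shows_increased_activity(variant: AssayResult, control: AssayResult) -> bool:
    """Return True if at least one of (a), (b) or (c) is higher for the variant.

    Item (d), "any combination", is covered because one satisfied
    criterion is already sufficient.
    """
    pairs = [
        (variant.yeast_assay_score, control.yeast_assay_score),  # (a)
        (variant.target_site_mutation_rate,
         control.target_site_mutation_rate),                     # (b)
        (variant.in_vitro_cutting, control.in_vitro_cutting),    # (c)
    ]
    # A criterion counts only when both values were actually measured.
    return any(v is not None and c is not None and v > c for v, c in pairs)
```

A comparison is skipped whenever either side is missing, so an assay run only on the variant never counts as evidence on its own.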
  • 10. The isolated or recombinant polynucleotide of embodiment 1, further comprising a nucleotide sequence encoding an N-terminal nuclear transit peptide.
  • 11. The isolated or recombinant polynucleotide of embodiment 1, further comprising a nucleotide sequence encoding a C-terminal histidine tag.
  • 12. The isolated or recombinant polynucleotide of embodiment 7, wherein the increased meganuclease activity is determined at 16° C., 24° C., 28° C., 30° C. or 37° C.
  • 13. A recombinant DNA construct, comprising the isolated or recombinant polynucleotide of embodiment 1.
  • 14. The recombinant DNA construct of embodiment 13, further comprising a promoter operably linked to said polynucleotide.
  • 15. The recombinant DNA construct of embodiment 14, wherein said promoter is heterologous with respect to said polynucleotide or said promoter is homologous with respect to said polynucleotide.
  • 16. A cell comprising at least one polynucleotide of embodiment 1 or the recombinant DNA construct of embodiment 13, wherein said polynucleotide is heterologous to the cell.
  • 17. The cell of embodiment 16, wherein said cell is a yeast cell.
  • 18. The cell of embodiment 16, wherein said cell is a plant cell.
  • 19. The cell of embodiment 16, wherein said polynucleotide or said recombinant DNA construct is stably incorporated into the genome of said plant cell.
  • 20. The cell of embodiment 16, wherein said polynucleotide or said recombinant DNA construct is stably incorporated into the chloroplast genome of said plant cell.
  • 21. The cell of embodiment 18, wherein said plant cell is from a monocot.
  • 22. The cell of embodiment 21, wherein said monocot is maize, wheat, rice, barley, sugarcane, sorghum, or rye.
  • 23. The cell of embodiment 18, wherein said plant cell is from a dicot.
  • 24. The cell of embodiment 23, wherein the dicot is soybean, Brassica, sunflower, cotton, or alfalfa.
  • 25. A plant comprising a plant cell of embodiment 18.
  • 26. A plant explant comprising a plant cell of embodiment 18.
  • 27. The plant, the explant or the plant cell of embodiment 26, wherein said plant, explant or plant cell exhibits an increased meganuclease activity when compared to a plant, explant or plant cell of the same species, strain or cultivar that does not comprise at least one polynucleotide of embodiment 1.
  • 28. A transgenic seed produced by the plant of embodiment 25.
  • 29. An isolated polypeptide having meganuclease activity, said polypeptide comprising:

    • a) an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; or,
    • b) an amino acid sequence having at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43 or 44 of any of the amino acid modifications of (a).
  • 30. The isolated polypeptide of embodiment 29, wherein said polypeptide has at least 80% sequence identity to SEQ ID NO: 1.
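One simple way to picture an "at least 80% sequence identity" threshold is as the share of reference residues that are matched by an identical residue in an alignment. The Python sketch below is a simplification under stated assumptions: the two sequences are already aligned to equal length with `-` as the gap character, and identity is counted over columns where the reference has a residue. The function name and the toy sequences are hypothetical; the definition of sequence identity and the alignment parameters given in the specification govern.

```python
def percent_identity(aligned_ref: str, aligned_query: str) -> float:
    """Percent of reference residues matched by an identical query residue.

    Assumes a precomputed pairwise alignment: equal-length strings,
    '-' marks a gap. Columns where the reference is a gap are ignored.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_columns = 0
    matches = 0
    for ref_res, query_res in zip(aligned_ref, aligned_query):
        if ref_res == "-":
            continue  # insertion in the query relative to the reference
        ref_columns += 1
        if ref_res == query_res:
            matches += 1
    if ref_columns == 0:
        raise ValueError("reference contains no residues")
    return 100.0 * matches / ref_columns


# Toy 10-residue example: 8 of 10 reference residues are identical.
meets_threshold = percent_identity("MKTAYMKTAY", "MKTAYMKTGG") >= 80.0
```

Other conventions exist (for example, dividing by the full alignment length), so the denominator used here is an assumption rather than the patent's method.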
  • 31. The isolated polypeptide of embodiment 29, wherein said at least one amino acid modification comprises:

    • a) an aspartic acid (D) at a position corresponding to amino acid position 2 in SEQ ID NO: 1;
    • b) a histidine (H) at a position corresponding to amino acid position 12 in SEQ ID NO: 1;
    • c) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 1;
    • d) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 1;
    • e) a leucine (L) at a position corresponding to amino acid position 23 in SEQ ID NO: 1;
    • f) an arginine (R) at a position corresponding to amino acid position 31 in SEQ ID NO: 1;
    • g) an asparagine (N) at a position corresponding to amino acid position 36 in SEQ ID NO: 1;
    • h) a leucine (L) at a position corresponding to amino acid position 43 in SEQ ID NO: 1;
    • i) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 1;
    • j) a leucine (L) at a position corresponding to amino acid position 56 in SEQ ID NO: 1;
    • k) an isoleucine (I) at a position corresponding to amino acid position 58 in SEQ ID NO: 1;
    • l) a histidine (H) or alanine (A) at a position corresponding to amino acid position 59 in SEQ ID NO: 1;
    • m) a valine (V) at a position corresponding to amino acid position 62 in SEQ ID NO: 1;
    • n) a lysine (K) at a position corresponding to amino acid position 71 in SEQ ID NO: 1;
    • o) a threonine (T) at a position corresponding to amino acid position 72 in SEQ ID NO: 1;
    • p) an alanine (A) at a position corresponding to amino acid position 73 in SEQ ID NO: 1;
    • q) an arginine (R) at a position corresponding to amino acid position 80 in SEQ ID NO: 1;
    • r) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 1;
    • s) an arginine (R) at a position corresponding to amino acid position 82 in SEQ ID NO: 1;
    • t) an aspartic acid (D) at a position corresponding to amino acid position 86 in SEQ ID NO: 1;
    • u) an isoleucine (I) at a position corresponding to amino acid position 91 in SEQ ID NO: 1;
    • v) an isoleucine (I) at a position corresponding to amino acid position 95 in SEQ ID NO: 1;
    • w) an arginine (R) at a position corresponding to amino acid position 98 in SEQ ID NO: 1;
    • x) a valine (V) at a position corresponding to amino acid position 103 in SEQ ID NO: 1;
    • y) a serine (S) at a position corresponding to amino acid position 113 in SEQ ID NO: 1;
    • z) a proline (P) at a position corresponding to amino acid position 114 in SEQ ID NO: 1;
    • aa) an arginine (R) at a position corresponding to amino acid position 116 in SEQ ID NO: 1;
    • bb) a glycine (G) at a position corresponding to amino acid position 117 in SEQ ID NO: 1;
    • cc) a threonine (T) at a position corresponding to amino acid position 118 in SEQ ID NO: 1;
    • dd) a glycine (G) at a position corresponding to amino acid position 121 in SEQ ID NO: 1;
    • ee) an arginine (R) at a position corresponding to amino acid position 124 in SEQ ID NO: 1;
    • ff) a cysteine (C) at a position corresponding to amino acid position 128 in SEQ ID NO: 1;
    • gg) an alanine (A) at a position corresponding to amino acid position 129 in SEQ ID NO: 1;
    • hh) an arginine (R) at a position corresponding to amino acid position 131 in SEQ ID NO: 1;
    • ii) a serine (S) at a position corresponding to amino acid position 147 in SEQ ID NO: 1;
    • jj) an alanine (A) at a position corresponding to amino acid position 151 in SEQ ID NO: 1;
    • kk) a leucine (L) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 1;
    • ll) a tryptophan (W) at a position corresponding to amino acid position 159 in SEQ ID NO: 1;
    • mm) a glutamic acid (E) at a position corresponding to amino acid position 160 in SEQ ID NO: 1;
    • nn) a valine (V) at a position corresponding to amino acid position 161 in SEQ ID NO: 1;
    • oo) a tyrosine (Y) at a position corresponding to amino acid position 162 in SEQ ID NO: 1;
    • pp) an arginine (R) at a position corresponding to amino acid position 163 in SEQ ID NO: 1;
    • qq) a histidine (H) at a position corresponding to amino acid position 164 in SEQ ID NO: 1;
    • rr) a leucine (L) at a position corresponding to amino acid position 165 in SEQ ID NO: 1;
    • ss) an arginine (R) at a position corresponding to amino acid position 166 in SEQ ID NO: 1;
    • tt) a histidine (H) at a position corresponding to amino acid position 167 in SEQ ID NO: 1;
    • uu) a proline (P) at a position corresponding to amino acid position 168 in SEQ ID NO: 1;
    • vv) an alanine (A) at a position corresponding to amino acid position 169 in SEQ ID NO: 1;
    • ww) a proline (P) at a position corresponding to amino acid position 170 in SEQ ID NO: 1;
    • xx) a histidine (H) at a position corresponding to amino acid position 171 in SEQ ID NO: 1;
    • yy) a proline (P) at a position corresponding to amino acid position 172 in SEQ ID NO: 1;
    • zz) an arginine (R) at a position corresponding to amino acid position 173 in SEQ ID NO: 1;
    • aaa) a leucine (L) at a position corresponding to amino acid position 174 in SEQ ID NO: 1;
    • bbb) a proline (P) at a position corresponding to amino acid position 175 in SEQ ID NO: 1;
    • ccc) a glutamine (Q) at a position corresponding to amino acid position 176 in SEQ ID NO: 1;
    • ddd) an alanine (A) at a position corresponding to amino acid position 177 in SEQ ID NO: 1;
    • eee) an arginine (R) at a position corresponding to amino acid position 178 in SEQ ID NO: 1;
    • fff) a valine (V) at a position corresponding to amino acid position 179 in SEQ ID NO: 1;
    • ggg) a glutamine (Q) at a position corresponding to amino acid position 180 in SEQ ID NO: 1;
    • hhh) a valine (V) at a position corresponding to amino acid position 182 in SEQ ID NO: 1;
    • iii) a proline (P) at a position corresponding to amino acid position 183 in SEQ ID NO: 1;
    • jjj) a lysine (K) at a position corresponding to amino acid position 184 in SEQ ID NO: 1;
    • kkk) a threonine (T) or a histidine (H) at a position corresponding to amino acid position 185 in SEQ ID NO: 1;
    • lll) a serine (S) at a position corresponding to amino acid position 186 in SEQ ID NO: 1;
    • mmm) a glutamic acid (E) at a position corresponding to amino acid position 187 in SEQ ID NO: 1;
    • nnn) a leucine (L) at a position corresponding to amino acid position 188 in SEQ ID NO: 1;
    • ooo) a glutamic acid (E) at a position corresponding to amino acid position 189 in SEQ ID NO: 1;
    • ppp) a glutamine (Q) at a position corresponding to amino acid position 190 in SEQ ID NO: 1;
    • qqq) a leucine (L) at a position corresponding to amino acid position 191 in SEQ ID NO: 1;
    • rrr) a proline (P) at a position corresponding to amino acid position 194 in SEQ ID NO: 1;
    • sss) a lysine (K) at a position corresponding to amino acid position 195 in SEQ ID NO: 1;
    • ttt) a serine (S) at a position corresponding to amino acid position 196 in SEQ ID NO: 1;
    • uuu) a phenylalanine (F) at a position corresponding to amino acid position 197 in SEQ ID NO: 1;
    • vvv) an isoleucine (I) at a position corresponding to amino acid position 200 in SEQ ID NO: 1;
    • www) a valine (V) at a position corresponding to amino acid position 203 in SEQ ID NO: 1;
    • xxx) a leucine (L) at a position corresponding to amino acid position 204 in SEQ ID NO: 1;
    • yyy) a cysteine (C) at a position corresponding to amino acid position 209 in SEQ ID NO: 1;
    • zzz) a leucine (L) at a position corresponding to amino acid position 222 in SEQ ID NO: 1;
    • aaaa) an isoleucine (I) at a position corresponding to amino acid position 232 in SEQ ID NO: 1;
    • bbbb) a serine (S) at a position corresponding to amino acid position 236 in SEQ ID NO: 1;
    • cccc) a leucine (L) or an arginine (R) at a position corresponding to amino acid position 237 in SEQ ID NO: 1;
    • dddd) a histidine (H) at a position corresponding to amino acid position 246 in SEQ ID NO: 1;
    • eeee) an isoleucine (I) at a position corresponding to amino acid position 254 in SEQ ID NO: 1;
    • ffff) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 1;
    • gggg) an arginine (R) at a position corresponding to amino acid position 267 in SEQ ID NO: 1;
    • hhhh) an isoleucine (I) at a position corresponding to amino acid position 278 in SEQ ID NO: 1;
    • iiii) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 1;
    • jjjj) a phenylalanine (F) at a position corresponding to amino acid position 282 in SEQ ID NO: 1;
    • kkkk) a threonine (T) at a position corresponding to amino acid position 289 in SEQ ID NO: 1;
    • llll) a glycine (G) at a position corresponding to amino acid position 308 in SEQ ID NO: 1;
    • mmmm) an arginine (R) at a position corresponding to amino acid position 311 in SEQ ID NO: 1;
    • nnnn) an alanine (A) at a position corresponding to amino acid position 312 in SEQ ID NO: 1;
    • oooo) an alanine (A) at a position corresponding to amino acid position 316 in SEQ ID NO: 1;
    • pppp) an arginine (R) at a position corresponding to amino acid position 318 in SEQ ID NO: 1;
    • qqqq) an alanine (A) at a position corresponding to amino acid position 334 in SEQ ID NO: 1;
    • rrrr) a phenylalanine (F) at a position corresponding to amino acid position 339 in SEQ ID NO: 1;
    • ssss) a glycine (G) or a leucine (L) at a position corresponding to amino acid position 340 in SEQ ID NO: 1;
    • tttt) a serine (S) at a position corresponding to amino acid position 342 in SEQ ID NO: 1;
    • uuuu) an asparagine (N) at a position corresponding to amino acid position 345 in SEQ ID NO: 1;
    • vvvv) an asparagine (N) at a position corresponding to amino acid position 346 in SEQ ID NO: 1;
    • wwww) an asparagine (N) at a position corresponding to amino acid position 348 in SEQ ID NO: 1; or,
    • xxxx) any combination of a) to wwww).
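The recurring phrase "a position corresponding to amino acid position N in SEQ ID NO: 1" implies carrying the reference numbering over to a variant through an alignment. A minimal Python sketch of that bookkeeping follows, assuming a precomputed pairwise alignment (equal-length strings, `-` for gaps). The `ALLOWED` table copies only four of the items listed above (a, b, i and kk); the function names and the toy sequences are hypothetical and are not SEQ ID NO: 1.

```python
# Residues permitted at selected reference positions (items a, b, i, kk above).
ALLOWED = {2: {"D"}, 12: {"H"}, 50: {"R", "K"}, 153: {"L", "M"}}


def residues_at_reference_positions(aligned_ref: str, aligned_query: str) -> dict:
    """Map 1-based reference positions to the query residue in the same column."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    mapping = {}
    position = 0
    for ref_res, query_res in zip(aligned_ref, aligned_query):
        if ref_res == "-":
            continue  # query insertion: no reference position to assign
        position += 1
        mapping[position] = query_res
    return mapping


def matching_modifications(aligned_ref: str, aligned_query: str,
                           allowed: dict = ALLOWED) -> list:
    """Return the sorted reference positions where the query carries a listed residue."""
    mapping = residues_at_reference_positions(aligned_ref, aligned_query)
    return sorted(pos for pos, residues in allowed.items()
                  if mapping.get(pos) in residues)


# Toy 12-residue reference and a variant with D at position 2 and H at position 12.
hits = matching_modifications("MSTAAGGAALLY", "MDTAAGGAALLH")
```

Because the mapping skips gap columns in the reference, an insertion in the variant shifts its own numbering but not the reference positions being checked.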
  • 32. The isolated polypeptide of embodiment 29, wherein said polypeptide further comprises:

    • a) an aspartic acid (D) at a position corresponding to amino acid position 2 in SEQ ID NO: 1;
    • b) a histidine (H) at a position corresponding to amino acid position 12 in SEQ ID NO: 1;
    • c) an isoleucine (I) at a position corresponding to amino acid position 16 in SEQ ID NO: 1;
    • d) a serine (S) or an alanine (A) at a position corresponding to amino acid position 19 in SEQ ID NO: 1;
    • e) a cysteine (C) at a position corresponding to amino acid position 22 in SEQ ID NO: 1;
    • f) a leucine (L) at a position corresponding to amino acid position 23 in SEQ ID NO: 1;
    • g) a methionine (M) at a position corresponding to amino acid position 24 in SEQ ID NO: 1;
    • h) an arginine (R) or an alanine (A) at a position corresponding to amino acid position 28 in SEQ ID NO: 1;
    • i) an arginine (R), alanine (A), glutamine (Q), cysteine (C), glycine (G), serine (S), threonine (T), leucine (L), glutamic acid (E), or a proline (P) at a position corresponding to amino acid position 30 in SEQ ID NO: 1;
    • j) an arginine (R) at a position corresponding to amino acid position 31 in SEQ ID NO: 1;
    • k) an arginine (R), alanine (A), lysine (K), glutamine (Q), glycine (G) or a leucine (L) at a position corresponding to amino acid position 32 in SEQ ID NO: 1;
    • l) an asparagine (N) at a position corresponding to amino acid position 36 in SEQ ID NO: 1;
    • m) a leucine (L) at a position corresponding to amino acid position 43 in SEQ ID NO: 1;
    • n) an arginine (R) or lysine (K) at a position corresponding to amino acid position 50 in SEQ ID NO: 1;
    • o) an isoleucine (I) or a leucine (L) at a position corresponding to amino acid position 54 in SEQ ID NO: 1;
    • p) a leucine (L) at a position corresponding to amino acid position 56 in SEQ ID NO: 1;
    • q) a glutamic acid (E) at a position corresponding to amino acid position 57 in SEQ ID NO: 1;
    • r) an isoleucine (I) at a position corresponding to amino acid position 58 in SEQ ID NO: 1;
    • s) a histidine (H) or alanine (A) at a position corresponding to amino acid position 59 in SEQ ID NO: 1;
    • t) a valine (V) at a position corresponding to amino acid position 62 in SEQ ID NO: 1;
    • u) a lysine (K) at a position corresponding to amino acid position 71 in SEQ ID NO: 1;
    • v) a threonine (T) at a position corresponding to amino acid position 72 in SEQ ID NO: 1;
    • w) an alanine (A) at a position corresponding to amino acid position 73 in SEQ ID NO: 1;
    • x) a glycine (G) at a position corresponding to amino acid position 79 in SEQ ID NO: 1;
    • y) an arginine (R) at a position corresponding to amino acid position 80 in SEQ ID NO: 1;
    • z) a lysine (K) at a position corresponding to amino acid position 81 in SEQ ID NO: 1;
    • aa) an arginine (R) at a position corresponding to amino acid position 82 in SEQ ID NO: 1;
    • bb) an aspartic acid (D) at a position corresponding to amino acid position 86 in SEQ ID NO: 1;
    • cc) a leucine (L) at a position corresponding to amino acid position 87 in SEQ ID NO: 1;
    • dd) an isoleucine (I) at a position corresponding to amino acid position 91 in SEQ ID NO: 1;
    • ee) an isoleucine (I) at a position corresponding to amino acid position 95 in SEQ ID NO: 1;
    • ff) an arginine (R) at a position corresponding to amino acid position 98 in SEQ ID NO: 1;
    • gg) a valine (V) at a position corresponding to amino acid position 103 in SEQ ID NO: 1;
    • hh) an alanine (A) at a position corresponding to amino acid position 105 in SEQ ID NO: 1;
    • ii) an arginine (R) at a position corresponding to amino acid position 111 in SEQ ID NO: 1;
    • jj) a serine (S) at a position corresponding to amino acid position 113 in SEQ ID NO: 1;
    • kk) a proline (P) at a position corresponding to amino acid position 114 in SEQ ID NO: 1;
    • ll) an arginine (R) at a position corresponding to amino acid position 116 in SEQ ID NO: 1;
    • mm) a glycine (G) at a position corresponding to amino acid position 117 in SEQ ID NO: 1;
    • nn) a threonine (T) at a position corresponding to amino acid position 118 in SEQ ID NO: 1;
    • oo) a glycine (G) at a position corresponding to amino acid position 121 in SEQ ID NO: 1;
    • pp) an arginine (R) at a position corresponding to amino acid position 124 in SEQ ID NO: 1;
    • qq) a cysteine (C) at a position corresponding to amino acid position 128 in SEQ ID NO: 1;
    • rr) an alanine (A) at a position corresponding to amino acid position 129 in SEQ ID NO: 1;
    • ss) an arginine (R) at a position corresponding to amino acid position 131 in SEQ ID NO: 1;
    • tt) a valine (V) at a position corresponding to amino acid position 132 in SEQ ID NO: 1;
    • uu) a serine (S) at a position corresponding to amino acid position 147 in SEQ ID NO: 1;
    • vv) an alanine (A) at a position corresponding to amino acid position 151 in SEQ ID NO: 1;
    • ww) a leucine (L) or a methionine (M) at a position corresponding to amino acid position 153 in SEQ ID NO: 1;
    • xx) a tryptophan (W) at a position corresponding to amino acid position 159 in SEQ ID NO: 1;
    • yy) a glutamic acid (E) at a position corresponding to amino acid position 160 in SEQ ID NO: 1;
    • zz) a valine (V) at a position corresponding to amino acid position 161 in SEQ ID NO: 1;
    • aaa) a tyrosine (Y) at a position corresponding to amino acid position 162 in SEQ ID NO: 1;
    • bbb) an arginine (R) at a position corresponding to amino acid position 163 in SEQ ID NO: 1;
    • ccc) a histidine (H) at a position corresponding to amino acid position 164 in SEQ ID NO: 1;
    • ddd) a leucine (L) at a position corresponding to amino acid position 165 in SEQ ID NO: 1;
    • eee) an arginine (R) at a position corresponding to amino acid position 166 in SEQ ID NO: 1;
    • fff) a histidine (H) at a position corresponding to amino acid position 167 in SEQ ID NO: 1;
    • ggg) a proline (P) at a position corresponding to amino acid position 168 in SEQ ID NO: 1;
    • hhh) an alanine (A) at a position corresponding to amino acid position 169 in SEQ ID NO: 1;
    • iii) a proline (P) at a position corresponding to amino acid position 170 in SEQ ID NO: 1;
    • jjj) a histidine (H) at a position corresponding to amino acid position 171 in SEQ ID NO: 1;
    • kkk) a proline (P) at a position corresponding to amino acid position 172 in SEQ ID NO: 1;
    • lll) an arginine (R) at a position corresponding to amino acid position 173 in SEQ ID NO: 1;
    • mmm) a leucine (L) at a position corresponding to amino acid position 174 in SEQ ID NO: 1;
    • nnn) a proline (P) at a position corresponding to amino acid position 175 in SEQ ID NO: 1;
    • ooo) a glutamine (Q) at a position corresponding to amino acid position 176 in SEQ ID NO: 1;
    • ppp) an alanine (A) at a position corresponding to amino acid position 177 in SEQ ID NO: 1;
    • qqq) an arginine (R) at a position corresponding to amino acid position 178 in SEQ ID NO: 1;
    • rrr) a valine (V) at a position corresponding to amino acid position 179 in SEQ ID NO: 1;
    • sss) a glutamine (Q) at a position corresponding to amino acid position 180 in SEQ ID NO: 1;
    • ttt) a valine (V) at a position corresponding to amino acid position 182 in SEQ ID NO: 1;
    • uuu) a proline (P) at a position corresponding to amino acid position 183 in SEQ ID NO: 1;
    • vvv) a lysine (K) at a position corresponding to amino acid position 184 in SEQ ID NO: 1;
    • www) a threonine (T) or a histidine (H) at a position corresponding to amino acid position 185 in SEQ ID NO: 1;
    • xxx) a serine (S) at a position corresponding to amino acid position 186 in SEQ ID NO: 1;
    • yyy) a glutamic acid (E) at a position corresponding to amino acid position 187 in SEQ ID NO: 1;
    • zzz) a leucine (L) at a position corresponding to amino acid position 188 in SEQ ID NO: 1;
    • aaaa) a glutamic acid (E) at a position corresponding to amino acid position 189 in SEQ ID NO: 1;
    • bbbb) a glutamine (Q) at a position corresponding to amino acid position 190 in SEQ ID NO: 1;
    • cccc) a leucine (L) at a position corresponding to amino acid position 191 in SEQ ID NO: 1;
    • dddd) an amino acid deletion at a position corresponding to amino acid position 192 in SEQ ID NO: 1;
    • eeee) a proline (P) at a position corresponding to amino acid position 194 in SEQ ID NO: 1;
    • ffff) a lysine (K) at a position corresponding to amino acid position 195 in SEQ ID NO: 1;
    • gggg) a serine (S) at a position corresponding to amino acid position 196 in SEQ ID NO: 1;
    • hhhh) a phenylalanine (F) at a position corresponding to amino acid position 197 in SEQ ID NO: 1;
    • iiii) an isoleucine (I) at a position corresponding to amino acid position 200 in SEQ ID NO: 1;
    • jjjj) a valine (V) at a position corresponding to amino acid position 203 in SEQ ID NO: 1;
    • kkkk) a leucine (L) at a position corresponding to amino acid position 204 in SEQ ID NO: 1;
    • llll) an alanine (A) or a serine (S) at a position corresponding to amino acid position 206 in SEQ ID NO: 1;
    • mmmm) a cysteine (C) at a position corresponding to amino acid position 209 in SEQ ID NO: 1;
    • nnnn) a leucine (L) at a position corresponding to amino acid position 222 in SEQ ID NO: 1;
    • oooo) a methionine (M) at a position corresponding to amino acid position 211 in SEQ ID NO: 1;
    • pppp) an isoleucine (I) at a position corresponding to amino acid position 232 in SEQ ID NO: 1;
    • qqqq) a serine (S) at a position corresponding to amino acid position 236 in SEQ ID NO: 1;
    • rrrr) a leucine (L) or an arginine (R) at a position corresponding to amino acid position 237 in SEQ ID NO: 1;
    • ssss) an isoleucine (I) or a leucine (L) at a position corresponding to amino acid position 241 in SEQ ID NO: 1;
    • tttt) a glutamic acid (E) at a position corresponding to amino acid position 244 in SEQ ID NO: 1;
    • uuuu) a histidine (H) at a position corresponding to amino acid position 246 in SEQ ID NO: 1;
    • vvvv) an aspartic acid (D) or histidine (H) at a position corresponding to amino acid position 253 in SEQ ID NO: 1;
    • wwww) an isoleucine (I) at a position corresponding to amino acid position 254 in SEQ ID NO: 1;
    • xxxx) a serine (S) at a position corresponding to amino acid position 258 in SEQ ID NO: 1;
    • yyyy) an arginine (R) at a position corresponding to amino acid position 267 in SEQ ID NO: 1;
    • zzzz) an isoleucine (I) at a position corresponding to amino acid position 278 in SEQ ID NO: 1;
    • aaaaa) a tyrosine (Y) at a position corresponding to amino acid position 281 in SEQ ID NO: 1;
    • bbbbb) a phenylalanine (F) at a position corresponding to amino acid position 282 in SEQ ID NO: 1;
    • ccccc) a threonine (T) at a position corresponding to amino acid position 289 in SEQ ID NO: 1;
    • ddddd) an alanine (A) at a position corresponding to amino acid position 292 in SEQ ID NO: 1;
    • eeeee) a glycine (G) at a position corresponding to amino acid position 308 in SEQ ID NO: 1;
    • fffff) an arginine (R) at a position corresponding to amino acid position 311 in SEQ ID NO: 1;
    • ggggg) an alanine (A) at a position corresponding to amino acid position 312 in SEQ ID NO: 1;
    • hhhhh) an alanine (A) at a position corresponding to amino acid position 316 in SEQ ID NO: 1;
    • iiiii) an arginine (R) at a position corresponding to amino acid position 318 in SEQ ID NO: 1;
    • jjjjj) a valine (V) at a position corresponding to amino acid position 319 in SEQ ID NO: 1;
    • kkkkk) an alanine (A) at a position corresponding to amino acid position 334 in SEQ ID NO: 1;
    • lllll) a phenylalanine (F) at a position corresponding to amino acid position 339 in SEQ ID NO: 1;
    • mmmmm) a glycine (G) or a leucine (L) at a position corresponding to amino acid position 340 in SEQ ID NO: 1;
    • nnnnn) a serine (S) at a position corresponding to amino acid position 342 in SEQ ID NO: 1;
    • ooooo) an asparagine (N) at a position corresponding to amino acid position 345 in SEQ ID NO: 1;
    • ppppp) an asparagine (N) at a position corresponding to amino acid position 346 in SEQ ID NO: 1;
    • qqqqq) an asparagine (N) at a position corresponding to amino acid position 348 in SEQ ID NO: 1; or,
    • rrrrr) any combination of a) to qqqqq).
  • 33. The isolated polypeptide of embodiment 29 selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 430, 431, 432 and 433.
  • 34. The isolated polypeptide of embodiment 29, wherein the polypeptide is capable of recognizing and cleaving a meganuclease recognition site selected from the group consisting of SEQ ID NO: 2 (LIG3-4), SEQ ID NO: 85 (MHP77), SEQ ID NO: 269 (MS26), SEQ ID NO: 281 (MHP14), SEQ ID NO: 331 (MP107), SEQ ID NO: 358 (ZM6.3), SEQ ID NO: 390 (ZM6.22v2), SEQ ID NO: 423 or SEQ ID NO: 424.
  • 35. The isolated polypeptide of embodiment 29, wherein said polypeptide has an increased meganuclease activity when compared to a control meganuclease that lacks said amino acid modification.
  • 36. The isolated polypeptide of embodiment 29, wherein said control meganuclease is selected from the group of SEQ ID NO: 1 (LIG3-4), SEQ ID NO: 86 (MHP77), SEQ ID NO: 250 (MHP77.3), SEQ ID NO: 270 (MS26+), SEQ ID NO: 271, SEQ ID NO: 282 (MHP14), SEQ ID NO: 283 (MHP14+), SEQ ID NO: 329 (MP107), SEQ ID NO: 356 (ZM6.3), SEQ ID NO: 389 (ZM6.22v2), SEQ ID NO: 429 or SEQ ID NO: 435.
  • 37. The isolated polypeptide of embodiment 29, wherein the increased meganuclease activity is evidenced by:

    • a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or,
    • b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or,
    • c) a higher in-vitro cutting when compared to the control meganuclease that lacks said amino acid modification; or,
    • d) any combination of (a), (b) and (c).
  • 38. A composition comprising at least one or more polypeptides of embodiment 29.
  • 39. A method for producing a meganuclease having increased activity over a range of temperatures, the method comprising:

    • a) producing a variant meganuclease by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; and,
    • b) selecting said variant meganuclease from step a) and screening said variant meganuclease for the ability to cleave a DNA target sequence over a range of temperatures between and including 16° C. to 37° C.
  • 40. The method of embodiment 39, wherein said range of temperatures comprises:

    • a) 16° C.;
    • b) 18° C.;
    • c) 20° C.;
    • d) 24° C.;
    • e) 28° C.;
    • f) 30° C.;
    • g) 37° C.; or,
    • h) any combination of a), b), c), d), e), f) and g).
  • 41. A method for producing a meganuclease having an increased meganuclease activity when compared to a control meganuclease, the method comprising:

    • a) producing a variant meganuclease by modifying at least one amino acid at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 2, 12, 16, 22, 23, 31, 36, 43, 50, 56, 58, 59, 62, 71, 72, 73, 80, 81, 82, 86, 91, 95, 98, 103, 113, 114, 116, 117, 118, 121, 124, 128, 129, 131, 147, 151, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 194, 195, 196, 197, 200, 203, 204, 209, 222, 232, 236, 237, 246, 254, 258, 267, 278, 281, 282, 289, 308, 311, 312, 316, 318, 319, 334, 339, 340, 342, 345, 346, 348 and combinations thereof; and,
    • b) selecting the variant meganuclease from step a) and screening said variant for increased meganuclease activity when compared to a control meganuclease.
  • 42. The method of embodiment 41, wherein the increased meganuclease activity is evidenced by:

    • a) a higher yeast assay score when compared to the control meganuclease that lacks said amino acid modification; or,
    • b) a higher target site mutation rate when compared to the control meganuclease that lacks said amino acid modification; or,
    • c) a higher in-vitro cutting when compared to the control meganuclease that lacks said amino acid modification; or,
    • d) any combination of (a), (b) and (c).
  • 43. The isolated or recombinant polynucleotide of embodiment 1, wherein said meganuclease polypeptide comprises a linker polypeptide, wherein said linker polypeptide comprises:

    • a) SEQ ID NO: 420;
    • b) SEQ ID NO: 421;
    • c) SEQ ID NO: 422; or,
    • d) an amino acid sequence consisting of any possible amino acid at positions corresponding to positions 156 to 193 of SEQ ID NO: 1.
  • 44. A composition comprising at least one or more polynucleotides of embodiment 1.
  • 45. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, said polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 1 selected from the group consisting of positions 16, 22, 50, 56, 59, 71, 81, 103, 121, 153, 185, 209, 222, 246, 258, 281, 308, 316, 345, 346, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 2.
  • 46. The isolated or recombinant polynucleotide of embodiment 45, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 1.
  • 47. The isolated or recombinant polynucleotide of embodiment 45, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 5A-FIG. 5E.
  • 48. The isolated or recombinant polynucleotide of embodiment 45, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37 and 38.
  • 49. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 86 selected from the group consisting of positions 2, 12, 16, 22, 23, 36, 43, 50, 56, 58, 59, 72, 73, 81, 86, 91, 95, 103, 113, 114, 120, 121, 124, 128, 129, 131, 151, 153, 200, 204, 209, 232, 236, 237, 246, 254, 258, 267, 281, 308, 311, 312, 316, 319, 334, 339, 340, 342, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 85.
  • 50. The isolated or recombinant polynucleotide of embodiment 49, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 86.
  • 51. The isolated or recombinant polynucleotide of embodiment 49, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 9A-FIG. 9N.
  • 52. The isolated or recombinant polynucleotide of embodiment 49, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252 and 253.
  • 53. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 270 selected from the group consisting of positions 16, 22, 50, 71, 185, 246, 258, 316 and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 269.
  • 54. The isolated or recombinant polynucleotide of embodiment 53, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 272, 273, 274 and 275.
  • 55. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 282 selected from the group consisting of positions 12, 16, 22, 31, 50, 56, 59, 62, 81, 98, 103, 105, 116, 118, 121, 153, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 258, 281, 308, 312, 316, 319, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 281.
  • 56. The isolated or recombinant polynucleotide of embodiment 55, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 282.
  • 57. The isolated or recombinant polynucleotide of embodiment 55, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 10A-FIG. 10D.
  • 58. The isolated or recombinant polynucleotide of embodiment 55, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297 and 298.
  • 59. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 329 selected from the group consisting of positions 12, 32, 50, 56, 80, 105, 124, 129, 131, 153, 185, 311, 316, 318, 340, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 328.
  • 60. The isolated or recombinant polynucleotide of embodiment 59, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 329.
  • 61. The isolated or recombinant polynucleotide of embodiment 59, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 11.
  • 62. The isolated or recombinant polynucleotide of embodiment 59, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340 and 341.
  • 63. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 356 selected from the group consisting of positions 12, 24, 36, 50, 56, 62, 73, 80, 124, 129, 147, 182, 203, 237, 252, 311, 316, 318, 340, 348, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 355.
  • 64. The isolated or recombinant polynucleotide of embodiment 63, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 356.
  • 65. The isolated or recombinant polynucleotide of embodiment 63, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 12.
  • 66. The isolated or recombinant polynucleotide of embodiment 63, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, and 371.
  • 67. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 389 selected from the group consisting of positions 12, 50, 56, 124, 129, 131, 153, 211, 237, 311, 316, and position 318, and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 388.
  • 68. The isolated or recombinant polynucleotide of embodiment 67, wherein said nucleotide sequence encodes a meganuclease polypeptide having at least 80% sequence identity to SEQ ID NO: 389.
  • 69. The isolated or recombinant polynucleotide of embodiment 67, wherein said at least one amino acid modification comprises any one of the amino acid modifications shown in FIG. 13.
  • 70. The isolated or recombinant polynucleotide of embodiment 67, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, and 403.
  • 72. A yeast, plant, plant cell, explant or seed comprising the meganuclease created by the method of embodiments 36-42.
  • 73. A method of introducing a double-strand break in the genome of a yeast or plant cell, said method comprising:

    • a) contacting at least one plant or yeast cell comprising in its genome a meganuclease recognition site with a variant meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402 and 403, wherein the variant meganuclease is capable of inducing a double-strand break in said recognition site; and,
    • b) selecting the yeast or plant cell from a) and screening said yeast or plant cell for any modification of said recognition sequence.
  • 74. A method of integrating a polynucleotide of interest into a recognition site in the genome of a plant or yeast cell, the method comprising:

    • a) contacting at least one plant or yeast cell comprising in its genome a meganuclease recognition site with:

      • (i) a variant meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 251, 252, 253, 272, 273, 274, 275, 272, 273, 274, 275, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402 and 403,
      • wherein the variant meganuclease is capable of inducing a double-strand break in said recognition site; and,
      • (ii) a DNA fragment containing a polynucleotide of interest;
    • b) selecting at least one plant or yeast cell comprising integration of the polynucleotide of interest cassette at the recognition site.
  • 75. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide, the polypeptide comprising an amino acid sequence having at least one amino acid modification at an amino acid position corresponding to a position of SEQ ID NO: 429 selected from the group consisting of positions 16, 22, 50, 71, 185, 246, 258, 316 and combinations thereof, and wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 423.
  • 76. The isolated or recombinant polynucleotide of embodiment 75, wherein said nucleotide sequence encodes a meganuclease polypeptide selected from the group consisting of SEQ ID NOS: 430, 431 and 432.
  • 77. An isolated or recombinant polynucleotide encoding a meganuclease polypeptide of SEQ ID NO: 436, wherein the polypeptide is capable of recognizing and cleaving a meganuclease target site comprising SEQ ID NO: 424.
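
Several of the embodiments above are defined by amino acid modifications at positions "corresponding to" positions of a reference sequence, sometimes together with a minimum percent identity. The following is a minimal sketch of that bookkeeping, assuming the variant and the reference have already been aligned to equal length with no gaps, so that a corresponding position is simply the same 1-based index. The sequences are short made-up strings, not any SEQ ID NO of this document, and the function names are hypothetical.

```python
from typing import FrozenSet, List

# Positions recited in embodiment 45 (1-based, relative to the reference).
EMBODIMENT_45_POSITIONS: FrozenSet[int] = frozenset(
    [16, 22, 50, 56, 59, 71, 81, 103, 121, 153,
     185, 209, 222, 246, 258, 281, 308, 316, 345, 346]
)


def modified_positions(reference: str, variant: str) -> List[int]:
    """Return the 1-based positions at which the variant differs from the reference.

    Assumption: both sequences are pre-aligned, gap-free and of equal length.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [i for i, (r, v) in enumerate(zip(reference, variant), start=1) if r != v]


def percent_identity(reference: str, variant: str) -> float:
    """Percent of positions that are identical (simple gap-free definition)."""
    differences = len(modified_positions(reference, variant))
    return 100.0 * (len(reference) - differences) / len(reference)


def listed_modifications(reference: str, variant: str, listed: FrozenSet[int]) -> List[int]:
    """Keep only the modified positions that appear in the listed position set."""
    return [p for p in modified_positions(reference, variant) if p in listed]


# Toy 30-residue example (hypothetical sequences): changes at positions 16 and 20.
toy_reference = "A" * 30
toy_variant = toy_reference[:15] + "K" + toy_reference[16:19] + "G" + toy_reference[20:]

print(modified_positions(toy_reference, toy_variant))
print(listed_modifications(toy_reference, toy_variant, EMBODIMENT_45_POSITIONS))
print(round(percent_identity(toy_reference, toy_variant), 1))
```

A real comparison would first need a sequence alignment to establish which positions correspond; the equal-length shortcut here only keeps the sketch self-contained.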

EXPERIMENTAL

Example 1

Transformation of Maize Immature Embryos

Transformation can be accomplished by various methods known to be effective in plants, including particle-mediated delivery, Agrobacterium-mediated transformation, PEG-mediated delivery, and electroporation.

a. Particle-Mediated Delivery

Transformation of maize immature embryos using particle delivery is performed as follows. Media recipes follow below.

The ears are husked and surface sterilized in 30% Clorox bleach plus 0.5% Micro detergent for 20 minutes, and rinsed two times with sterile water. The immature embryos are isolated and placed embryo axis side down (scutellum side up), 25 embryos per plate, on 560Y medium for 4 hours and then aligned within the 2.5-cm target zone in preparation for bombardment. Alternatively, isolated embryos are placed on 560L (Initiation medium) and placed in the dark at temperatures ranging from 26° C. to 37° C. for 8 to 24 hours prior to placing on 560Y for 4 hours at 26° C. prior to bombardment as described above.

Plasmids containing the double-strand-break-inducing agent and donor DNA are constructed using standard molecular biology techniques and co-bombarded with plasmids containing the developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2); US 2009/0328252 A1) and Wushel (US 2011/0167516).

The plasmids and DNA of interest are precipitated onto 0.6 μm (average diameter) gold pellets using a water-soluble cationic lipid Tfx™-50 (Cat# E1811, Promega, Madison, Wis., USA) as follows. DNA solution is prepared on ice using 1 μg of plasmid DNA and optionally other constructs for co-bombardment such as 50 ng (0.5 μl) of each plasmid containing the developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2); US 2009/0328252 A1) and Wushel. To the pre-mixed DNA, 20 μl of prepared gold particles (15 mg/ml) and 5 μl Tfx-50 are added in water and mixed carefully. Gold particles are pelleted in a microfuge at 10,000 rpm for 1 min and supernatant is removed. The resulting pellet is carefully rinsed with 100 ml of 100% EtOH without resuspending the pellet and the EtOH rinse is carefully removed. 105 μl of 100% EtOH is added and the particles are resuspended by brief sonication. Then, 10 μl is spotted onto the center of each macrocarrier and allowed to dry about 2 minutes before bombardment.
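
The quantities in the gold-particle preparation above imply a per-macrocarrier loading, which can be worked out as in the sketch below. It assumes, purely for illustration, that all of the gold and all of the plasmid DNA are recovered in the final 105 μl suspension; the real recovery is not stated in this document, so the DNA figure is an upper bound. Variable names are hypothetical.

```python
# Values taken from the preparation described above.
GOLD_STOCK_MG_PER_ML = 15.0   # prepared gold particles
GOLD_VOLUME_UL = 20.0         # volume of gold stock added to the DNA
PLASMID_DNA_UG = 1.0          # plasmid DNA in the mix
RESUSPENSION_UL = 105.0       # final volume of 100% EtOH
SPOT_UL = 10.0                # volume spotted per macrocarrier

# Mass of gold in the tube: 15 mg/ml x 0.020 ml.
gold_mg = GOLD_STOCK_MG_PER_ML * GOLD_VOLUME_UL / 1000.0

# Number of full 10 ul aliquots available from 105 ul.
shots_per_tube = int(RESUSPENSION_UL // SPOT_UL)

# Fraction of the tube delivered to each macrocarrier.
fraction_per_shot = SPOT_UL / RESUSPENSION_UL

# Assumption: complete recovery of gold and DNA in the final suspension.
gold_per_shot_ug = gold_mg * 1000.0 * fraction_per_shot
dna_per_shot_ng = PLASMID_DNA_UG * 1000.0 * fraction_per_shot

print(f"gold in tube: {gold_mg:.2f} mg")
print(f"shots per tube: {shots_per_tube}")
print(f"gold per shot: {gold_per_shot_ug:.1f} ug")
print(f"DNA per shot (upper bound): {dna_per_shot_ng:.1f} ng")
```

Under these assumptions each tube holds 0.3 mg of gold and yields ten 10 μl aliquots, each carrying at most about 95 ng of plasmid DNA.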

Alternatively, the plasmids and DNA of interest are precipitated onto 1.1 μm (average diameter) tungsten pellets using a calcium chloride (CaCl2) precipitation procedure by mixing 100 μl prepared tungsten particles in water, 10 μl (1 μg) DNA in Tris EDTA buffer (1 μg total DNA), 100 μl 2.5 M CaCl2, and 10 μl 0.1 M spermidine. Each reagent is added sequentially to the tungsten particle suspension, with mixing. The final mixture is sonicated briefly and allowed to incubate under constant vortexing for 10 minutes. After the precipitation period, the tubes are centrifuged briefly, liquid is removed, and the particles are washed with 500 ml 100% ethanol, followed by a 30 second centrifugation. Again, the liquid is removed, and 105 μl 100% ethanol is added to the final tungsten particle pellet. For particle gun bombardment, the tungsten/DNA particles are briefly sonicated. 10 μl of the tungsten/DNA particles is spotted onto the center of each macrocarrier, after which the spotted particles are allowed to dry about 2 minutes before bombardment.

The sample plates are bombarded at level #4 with a Biorad Helium Gun. All samples receive a single shot at 450 PSI, with a total of ten aliquots taken from each tube of prepared particles/DNA.

Following bombardment, the embryos are incubated on 560P (maintenance medium) for 12 to 48 hours at temperatures ranging from 26° C. to 37° C., and then placed at 26° C. After 5 to 7 days the embryos are transferred to 560R selection medium containing 3 mg/liter Bialaphos, and subcultured every 2 weeks at 26° C. After approximately 10 weeks of selection, selection-resistant callus clones are transferred to 288J medium to initiate plant regeneration. Following somatic embryo maturation (2-4 weeks), well-developed somatic embryos are transferred to medium for germination and transferred to a lighted culture room. Approximately 7-10 days later, developing plantlets are transferred to 272V hormone-free medium in tubes for 7-10 days until plantlets are well established. Plants are then transferred to inserts in flats (equivalent to a 2.5″ pot) containing potting soil and grown for 1 week in a growth chamber, subsequently grown an additional 1-2 weeks in the greenhouse, then transferred to Classic 600 pots (1.6 gallon) and grown to maturity. Plants are monitored and scored for transformation efficiency, and/or modification of regenerative capabilities.
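
The culture schedule above can be summed into an approximate elapsed time from bombardment to transfer into the final pots. The sketch below is only a tally of the durations stated in the paragraph, treating "approximately 10 weeks" as 70 days and assuming the stages run back to back with no gaps; the stage labels and function name are hypothetical.

```python
from typing import List, Tuple

# (stage, minimum days, maximum days), read from the paragraph above.
STAGES: List[Tuple[str, int, int]] = [
    ("recovery on 560P before selection", 5, 7),
    ("selection on 560R", 70, 70),            # "approximately 10 weeks"
    ("somatic embryo maturation on 288J", 14, 28),
    ("germination in lighted culture room", 7, 10),
    ("establishment on 272V in tubes", 7, 10),
    ("growth chamber in flats", 7, 7),
    ("greenhouse before final potting", 7, 14),
]


def cumulative_days(stages: List[Tuple[str, int, int]]) -> List[Tuple[str, int, int]]:
    """Return the running minimum and maximum day count at the end of each stage."""
    timeline = []
    earliest = latest = 0
    for name, min_days, max_days in stages:
        earliest += min_days
        latest += max_days
        timeline.append((name, earliest, latest))
    return timeline


timeline = cumulative_days(STAGES)
for name, earliest, latest in timeline:
    print(f"day {earliest:>3}-{latest:<3} end of: {name}")

total_min_days, total_max_days = timeline[-1][1], timeline[-1][2]
```

With these assumptions the plants reach the final pots roughly 117 to 146 days after bombardment.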

Initiation medium (560L) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/l thiamine HCl, 20.0 g/l sucrose, 1.0 mg/l 2,4-D, and 2.88 g/l L-proline (brought to volume with D-I H2O following adjustment to pH 5.8 with KOH); 2.0 g/l Gelrite (added after bringing to volume with D-I H2O); and 8.5 mg/l silver nitrate (added after sterilizing the medium and cooling to room temperature).

Maintenance medium (560P) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/l thiamine HCl, 30.0 g/l sucrose, 2.0 mg/l 2,4-D, and 0.69 g/l L-proline (brought to volume with D-I H2O following adjustment to pH 5.8 with KOH); 3.0 g/l Gelrite (added after bringing to volume with D-I H2O); and 0.85 mg/l silver nitrate (added after sterilizing the medium and cooling to room temperature).

Bombardment medium (560Y) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/l thiamine HCl, 120.0 g/l sucrose, 1.0 mg/l 2,4-D, and 2.88 g/l L-proline (brought to volume with D-I H2O following adjustment to pH 5.8 with KOH); 2.0 g/l Gelrite (added after bringing to volume with D-I H2O); and 8.5 mg/l silver nitrate (added after sterilizing the medium and cooling to room temperature).

Selection medium (560R) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/l thiamine HCl, 30.0 g/l sucrose, and 2.0 mg/l 2,4-D (brought to volume with D-I H2O following adjustment to pH 5.8 with KOH); 3.0 g/l Gelrite (added after bringing to volume with D-I H2O); and 0.85 mg/l silver nitrate and 3.0 mg/l bialaphos (both added after sterilizing the medium and cooling to room temperature).

Plant regeneration medium (288J) comprises 4.3 g/l MS salts (GIBCO 11117-074), 5.0 ml/l MS vitamins stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine HCL, 0.10 g/l pyridoxine HCL, and 0.40 g/l glycine brought to volume with polished D-I H2O) (Murashige and Skoog, (1962) Physiol. Plant. 15:473), 100 mg/l myo-inositol, 0.5 mg/l zeatin, 60 g/l sucrose, and 1.0 ml/l of 0.1 mM abscisic acid (brought to volume with polished D-I H2O after adjusting to pH 5.6); 3.0 g/l Gelrite (added after bringing to volume with D-I H2O); and 1.0 mg/l indoleacetic acid and 3.0 mg/l bialaphos (added after sterilizing the medium and cooling to 60° C.).

Hormone-free medium (272V) comprises 4.3 g/l MS salts (GIBCO 11117-074), 5.0 ml/l MS vitamins stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine HCL, 0.10 g/l pyridoxine HCL, and 0.40 g/l glycine brought to volume with polished D-I H2O), 0.1 g/l myo-inositol, and 40.0 g/l sucrose (brought to volume with polished D-I H2O after adjusting pH to 5.6); and 6 g/l bacto-agar (added after bringing to volume with polished D-I H2O), sterilized and cooled to 60° C.
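
The four N6-based media above (560L, 560P, 560Y and 560R) share most components, so it can help to hold them as data, scale them to a batch volume, and list where two recipes differ. The sketch below transcribes the per-liter amounts given above; the dictionary layout and function names are hypothetical, and the preparation order (pH adjustment, additions after sterilization) is not modeled.

```python
from typing import Dict, Optional, Tuple

Amount = Tuple[float, str]  # (amount per liter, unit)

# Components common to all four media, per liter.
_COMMON: Dict[str, Amount] = {
    "N6 basal salts": (4.0, "g"),
    "Eriksson's Vitamin Mix": (1.0, "ml"),
    "thiamine HCl": (0.5, "mg"),
}

MEDIA: Dict[str, Dict[str, Amount]] = {
    "560L": {**_COMMON, "sucrose": (20.0, "g"), "2,4-D": (1.0, "mg"),
             "L-proline": (2.88, "g"), "Gelrite": (2.0, "g"),
             "silver nitrate": (8.5, "mg")},
    "560P": {**_COMMON, "sucrose": (30.0, "g"), "2,4-D": (2.0, "mg"),
             "L-proline": (0.69, "g"), "Gelrite": (3.0, "g"),
             "silver nitrate": (0.85, "mg")},
    "560Y": {**_COMMON, "sucrose": (120.0, "g"), "2,4-D": (1.0, "mg"),
             "L-proline": (2.88, "g"), "Gelrite": (2.0, "g"),
             "silver nitrate": (8.5, "mg")},
    "560R": {**_COMMON, "sucrose": (30.0, "g"), "2,4-D": (2.0, "mg"),
             "Gelrite": (3.0, "g"), "silver nitrate": (0.85, "mg"),
             "bialaphos": (3.0, "mg")},
}


def scale_recipe(name: str, liters: float) -> Dict[str, Amount]:
    """Scale a per-liter recipe to the requested batch volume."""
    return {comp: (amt * liters, unit) for comp, (amt, unit) in MEDIA[name].items()}


def recipe_differences(a: str, b: str) -> Dict[str, Tuple[Optional[Amount], Optional[Amount]]]:
    """Return components whose per-liter amounts differ; None marks an absent component."""
    components = sorted(set(MEDIA[a]) | set(MEDIA[b]))
    return {c: (MEDIA[a].get(c), MEDIA[b].get(c))
            for c in components if MEDIA[a].get(c) != MEDIA[b].get(c)}


print(scale_recipe("560Y", 0.5)["sucrose"])
print(recipe_differences("560L", "560Y"))
print(recipe_differences("560P", "560R"))
```

As transcribed, 560L and 560Y differ only in sucrose (20 versus 120 g/l), while 560R differs from 560P by omitting L-proline and adding bialaphos.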

b. Agrobacterium-Mediated Transformation

Agrobacterium-mediated transformation was performed essentially as described in Djukanovic, et al., (2006) Plant Biotech J 4:345-357. Briefly, 10-12 day old immature embryos (0.8-2.5 mm in size) were dissected from sterilized kernels and placed into liquid medium (4.0 g/L N6 Basal Salts (Sigma C-1416), 1.0 ml/L Eriksson's Vitamin Mix (Sigma E-1511), 1.0 mg/L thiamine HCl, 1.5 mg/L 2,4-D, 0.690 g/L L-proline, 68.5 g/L sucrose, 36.0 g/L glucose, pH 5.2). After embryo collection, the medium was replaced with 1 ml Agrobacterium at a concentration of 0.35-0.45 OD550. Maize embryos were incubated with Agrobacterium for 5 min at room temperature, then the mixture was poured onto a media plate containing 4.0 g/L N6 Basal Salts (Sigma C-1416), 1.0 ml/L Eriksson's Vitamin Mix (Sigma E-1511), 1.0 mg/L thiamine HCl, 1.5 mg/L 2,4-D, 0.690 g/L L-proline, 30.0 g/L sucrose, 0.85 mg/L silver nitrate, 0.1 nM acetosyringone, and 3.0 g/L Gelrite, pH 5.8. Embryos were incubated axis down, in the dark for 3 days at 20° C., then incubated 4 days in the dark at 28° C., then transferred onto new media plates containing 4.0 g/L N6 Basal Salts (Sigma C-1416), 1.0 ml/L Eriksson's Vitamin Mix (Sigma E-1511), 1.0 mg/L thiamine HCl, 1.5 mg/L 2,4-D, 0.69 g/L L-proline, 30.0 g/L sucrose, 0.5 g/L MES buffer, 0.85 mg/L silver nitrate, 3.0 mg/L Bialaphos, 100 mg/L carbenicillin, and 6.0 g/L agar, pH 5.8. Embryos were subcultured every three weeks until transgenic events were identified. Somatic embryogenesis was induced by transferring a small amount of tissue onto regeneration medium (4.3 g/L MS salts (Gibco 11117), 5.0 ml/L MS Vitamins Stock Solution, 100 mg/L myo-inositol, 0.1 μM ABA, 1 mg/L IAA, 0.5 mg/L zeatin, 60.0 g/L sucrose, 1.5 mg/L Bialaphos, 100 mg/L carbenicillin, 3.0 g/L Gelrite, pH 5.6) and incubation in the dark for two weeks at 28° C.
All material with visible shoots and roots was transferred onto media containing 4.3 g/L MS salts (Gibco 11117), 5.0 ml/L MS Vitamins Stock Solution, 100 mg/L myo-inositol, 40.0 g/L sucrose, 1.5 g/L Gelrite, pH 5.6, and incubated under artificial light at 28° C. One week later, plantlets were moved into glass tubes containing the same medium and grown until they were sampled and/or transplanted into soil.

Example 2

Transient Expression of BBM Enhances Transformation

Parameters of the transformation protocol can be modified to ensure that the BBM activity is transient. One such method involves precipitating the BBM-containing plasmid in a manner that allows for transcription and expression, but precludes subsequent release of the DNA, for example, by using the chemical PEI.

In one example, the BBM plasmid is precipitated onto gold particles with PEI, while the transgenic expression cassette (UBI::moPAT˜GFPm::PinII; moPAT is the maize optimized PAT gene) to be integrated is precipitated onto gold particles using the standard calcium chloride method.

Briefly, gold particles were coated with PEI as follows. First, the gold particles were washed. Thirty-five mg of gold particles, 1.0 μm in average diameter (A.S.I. #162-0010), were weighed out in a microcentrifuge tube, and 1.2 ml absolute EtOH was added and vortexed for one minute. The tube was incubated for 15 minutes at room temperature and then centrifuged at high speed using a microfuge for 15 minutes at 4° C. The supernatant was discarded and a fresh 1.2 ml aliquot of ethanol (EtOH) was added, vortexed for one minute, centrifuged for one minute, and the supernatant again discarded (this was repeated twice). A fresh 1.2 ml aliquot of EtOH was added, and this suspension (gold particles in EtOH) was stored at −20° C. for weeks. To coat particles with polyethylenimine (PEI; Sigma #P3143), 250 μl of the washed gold particle/EtOH mix was centrifuged and the EtOH discarded. The particles were washed once in 100 μl ddH2O to remove residual ethanol, 250 μl of 0.25 mM PEI was added, followed by a pulse-sonication to suspend the particles and then the tube was plunged into a dry ice/EtOH bath to flash-freeze the suspension, which was then lyophilized overnight. At this point, dry, coated particles could be stored at −80° C. for at least 3 weeks. Before use, the particles were rinsed 3 times with 250 μl aliquots of 2.5 mM HEPES buffer, pH 7.1, with 1× pulse-sonication, and then a quick vortex before each centrifugation. The particles were then suspended in a final volume of 250 μl HEPES buffer. A 25 μl aliquot of the particles was added to fresh tubes before attaching DNA. To attach uncoated DNA, the particles were pulse-sonicated, then 1 μg of DNA (in 5 μl water) was added, followed by mixing by pipetting up and down a few times with a Pipetman and incubated for 10 minutes. The particles were spun briefly (i.e. 10 seconds), the supernatant removed, and 60 μl EtOH added. The particles with PEI-precipitated DNA-1 were washed twice in 60 μl of EtOH.
The particles were centrifuged, the supernatant discarded, and the particles were resuspended in 45 μl water. To attach the second DNA (DNA-2), precipitation using TFX-50 was used. The 45 μl of particles/DNA-1 suspension was briefly sonicated, and then 5 μl of 100 ng/μl of DNA-2 and 2.5 μl of TFX-50 were added. The solution was placed on a rotary shaker for 10 minutes, centrifuged at 10,000 g for 1 minute. The supernatant was removed, and the particles resuspended in 60 μl of EtOH. The solution was spotted onto macrocarriers and the gold particles onto which DNA-1 and DNA-2 had been sequentially attached were delivered into scutellar cells of 10 DAP Hi-II immature embryos using a standard protocol for the PDS-1000. For this experiment, the DNA-1 plasmid contained a UBI::RFP::pinII expression cassette, and DNA-2 contained a UBI::CFP::pinII expression cassette. Two days after bombardment, transient expression of both the CFP and RFP fluorescent markers was observed as numerous red & blue cells on the surface of the immature embryo. The embryos were then placed on non-selective culture medium and allowed to grow for 3 weeks before scoring for stable colonies. After this 3-week period, 10 multicellular, stably-expressing blue colonies were observed, in comparison to only one red colony. This demonstrated that PEI-precipitation could be used to effectively introduce DNA for transient expression while dramatically reducing integration of the PEI-introduced DNA and thus reducing the recovery of RFP-expressing transgenic events. In this manner, PEI-precipitation can be used to deliver transient expression of BBM and/or WUS2.
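
The quantities in the PEI/TFX-50 procedure above can be checked with a short calculation. The sketch below assumes, for illustration only, that the gold stays uniformly suspended when aliquots are drawn and that no gold is lost during washes; neither is stated in this document. Variable names are hypothetical.

```python
# Values taken from the procedure described above.
GOLD_WEIGHED_MG = 35.0        # gold weighed into the tube
ETOH_STOCK_UL = 1200.0        # final EtOH volume of the washed gold stock
STOCK_ALIQUOT_UL = 250.0      # gold/EtOH mix taken for PEI coating
PEI_VOLUME_UL = 250.0
PEI_CONC_MM = 0.25
HEPES_FINAL_UL = 250.0        # final suspension volume of coated particles
PARTICLE_ALIQUOT_UL = 25.0    # coated particles per DNA-attachment tube
DNA1_UG = 1.0                 # PEI-attached DNA (UBI::RFP::pinII)
DNA2_CONC_NG_PER_UL = 100.0   # TFX-50-attached DNA (UBI::CFP::pinII)
DNA2_VOLUME_UL = 5.0
STABLE_BLUE_COLONIES = 10     # CFP, delivered with TFX-50
STABLE_RED_COLONIES = 1       # RFP, delivered with PEI

# Assumption: uniform suspension, so gold scales with the volume drawn.
gold_coated_mg = GOLD_WEIGHED_MG * STOCK_ALIQUOT_UL / ETOH_STOCK_UL
gold_per_tube_mg = gold_coated_mg * PARTICLE_ALIQUOT_UL / HEPES_FINAL_UL

# PEI amount: mM x ul = nmol.
pei_nmol = PEI_CONC_MM * PEI_VOLUME_UL

# DNA-2 mass: ng/ul x ul.
dna2_ng = DNA2_CONC_NG_PER_UL * DNA2_VOLUME_UL

# Ratio of stable colonies recovered for the two attachment methods.
stable_ratio = STABLE_BLUE_COLONIES / STABLE_RED_COLONIES

print(f"gold coated with PEI: {gold_coated_mg:.2f} mg")
print(f"gold per DNA tube: {gold_per_tube_mg:.3f} mg")
print(f"PEI used: {pei_nmol:.1f} nmol")
print(f"DNA-1: {DNA1_UG * 1000:.0f} ng, DNA-2: {dna2_ng:.0f} ng")
print(f"stable colonies, TFX-50 vs PEI: {stable_ratio:.0f}:1")
```

Although twice as much DNA-1 (1 μg) as DNA-2 (500 ng) was loaded, the PEI-attached marker gave one stable colony against ten for the TFX-50-attached marker, which is the comparison the paragraph above relies on.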

For example, the particles are first coated with UBI::BBM::pinII using PEI, then coated with UBI::moPAT˜YFP using TFX-50, and then bombarded into scutellar cells on the surface of immature embryos. PEI-mediated precipitation results in a high frequency of transiently expressing cells on the surface of the immature embryo and extremely low frequencies of recovery of stable transformants (relative to the TFX-50 method). Thus, it is expected that the PEI-precipitated BBM cassette expresses transiently and stimulates a burst of embryogenic growth on the bombarded surface of the tissue (i.e., the scutellar surface), but this plasmid will not integrate. The PAT˜GFP plasmid released from the Ca++/gold particles is expected to integrate and express the selectable marker at a frequency that results in substantially improved recovery of transgenic events. As a control treatment, PEI-precipitated particles containing a UBI::GUS::pinII (instead of BBM) are mixed with the PAT˜GFP/Ca++ particles. Immature embryos from both treatments are moved onto culture medium containing 3 mg/l bialaphos. After 6-8 weeks, it is expected that GFP+, bialaphos-resistant calli will be observed in the PEI/BBM treatment at a much higher frequency relative to the control treatment (PEI/GUS).

As an alternative method, the BBM plasmid is precipitated onto gold particles with PEI, and then introduced into scutellar cells on the surface of immature embryos, and subsequent transient expression of the BBM gene elicits a rapid proliferation of embryogenic growth. During this period of induced growth, the explants are treated with Agrobacterium using standard methods for maize (see, Example 1), with T-DNA delivery into the cell introducing a transgenic expression cassette such as UBI::moPAT˜GFPm::pinII. After co-cultivation, explants are allowed to recover on normal culture medium, and then are moved onto culture medium containing 3 mg/l bialaphos. After 6-8 weeks, it is expected that GFP+, bialaphos-resistant calli will be observed in the PEI/BBM treatment at a much higher frequency relative to the control treatment (PEI/GUS).

It may be desirable to “kick start” callus growth by transiently expressing the BBM and/or WUS2 polynucleotide products. This can be done by delivering BBM and WUS2 5′-capped polyadenylated RNA, expression cassettes containing BBM and WUS2 DNA, or BBM and/or WUS2 proteins. All of these molecules can be delivered using a biolistics particle gun. For example, 5′-capped polyadenylated BBM and/or WUS2 RNA can easily be made in vitro using Ambion's mMessage mMachine kit. RNA is co-delivered along with DNA containing a polynucleotide of interest and a marker used for selection/screening such as Ubi::moPAT˜GFPm::PinII. It is expected that the cells receiving the RNA will immediately begin dividing more rapidly and a large portion of these will have integrated the agronomic gene. These events can further be validated as being transgenic clonal colonies because they will also express the PAT˜GFP fusion protein (and thus will display green fluorescence under appropriate illumination). Plants regenerated from these embryos can then be screened for the presence of the polynucleotide of interest.

Example 3

DNA Shuffling to Create Variants of LIG3-4 Meganuclease

A. LIG3-4 Meganuclease and LIG3-4 Recognition Sequence

An endogenous maize genomic target site comprising the LIG3-4 recognition sequence (SEQ ID NO: 2) was selected for design of a custom double-strand break inducing agent. The LIG3-4 recognition sequence is a 22 bp polynucleotide having the following sequence: ATATACCTCACACGTACGCGTA (SEQ ID NO: 2).

Wild type I-CreI meganuclease (SEQ ID NO: 3) was modified to produce the LIG3-4 meganuclease designed to recognize the LIG3-4 recognition sequence as described in US Patent Application Publication Number US 2009-0133152 A1. Wild-type I-CreI meganuclease is a homodimer. In order to recognize the LIG3-4 recognition sequence, different substitutions were made to each monomer and the coding sequences for each monomer were joined by a linker sequence to produce a single-chain fusion polypeptide (LIG3-4, SEQ ID NO: 1).

B. Creation of LIG3-4 Meganuclease Variants

Variants of the LIG3-4 meganuclease were created through gene shuffling methods. Gene shuffling is an iterative process consisting of discrete cycles termed “rounds”. Each round is a cycle of parent selection, library construction, gene evaluation and hit selection. The best hits from one round become the parental genes for the next round.
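The round structure described above (parent selection, library construction, gene evaluation, hit selection) can be sketched as a simple loop. This is only an illustrative toy, not the shuffling procedure itself: random single-residue substitution stands in for library construction, and `toy_score`, `TOY_TARGET`, `build_library` and `run_rounds` are hypothetical names invented for the sketch.

```python
import random
from typing import Callable, List

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
TOY_TARGET = "MKKLV"  # hypothetical "ideal" sequence, used only for toy scoring


def toy_score(seq: str) -> int:
    """Stand-in for gene evaluation: count residues matching the toy target."""
    return sum(a == b for a, b in zip(seq, TOY_TARGET))


def build_library(parents: List[str], rng: random.Random, per_parent: int = 20) -> List[str]:
    """Stand-in for library construction: random single-residue substitutions."""
    library = list(parents)  # parents are kept, so the best score cannot drop
    for parent in parents:
        for _ in range(per_parent):
            pos = rng.randrange(len(parent))
            library.append(parent[:pos] + rng.choice(ALPHABET) + parent[pos + 1:])
    return list(dict.fromkeys(library))  # de-duplicate, preserving order


def run_rounds(parents: List[str], score: Callable[[str], int],
               rng: random.Random, n_rounds: int = 3, n_hits: int = 2) -> List[str]:
    """Each round: build a library, evaluate it, keep the best hits as new parents."""
    for _ in range(n_rounds):
        library = build_library(parents, rng)
        ranked = sorted(library, key=score, reverse=True)  # gene evaluation
        parents = ranked[:n_hits]  # hit selection; hits become the next parents
    return parents


best = run_rounds(["AAAAA"], toy_score, random.Random(0))
print(best, [toy_score(s) for s in best])
```

Keeping the parents in each library is a design choice of the sketch, so that the top score never decreases from one round to the next.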

The first phase of LIG3-4 meganuclease optimization was designed to introduce amino acid substitutions as found in naturally occurring meganuclease proteins. Shuffled gene variant libraries were made based on the LIG3-4 protein template (SEQ ID NO: 1) using techniques including family shuffling, single-gene shuffling, back-crossing, semi-synthetic and synthetic shuffling (Zhang, et al., (1997) Proc Natl Acad Sci 94:4504-4509; Crameri, et al., (1998) Nature 391:288-291; Ness, et al., (2002) Nat Biotech 20:1251-1255). Libraries were based on phylogenetic sequence diversity, random mutagenesis, and structural features based on the crystal structure of I-CreI in Protein Data Bank (PDB). Phylogenetic diversity of several meganuclease proteins (SEQ ID NOS: 4-13), including I-CreI (SEQ ID NO: 3) is shown in FIG. 1A-FIG. 1B. Diversity is defined as the amino acids present within the set of proteins at any position where all proteins do not contain the identical amino acid.
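The definition of diversity given above can be illustrated with a short sketch that collects, for each alignment position, the set of residues present whenever the sequences are not all identical. The function name `position_diversity` and the toy alignment are hypothetical; the toy fragments are made up and are not the sequences of SEQ ID NOS: 3-13.

```python
from typing import Dict, List, Set


def position_diversity(aligned: List[str]) -> Dict[int, Set[str]]:
    """Map 1-based alignment positions to the residues observed there,
    keeping only positions where the sequences are not all identical."""
    if not aligned:
        return {}
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    diversity: Dict[int, Set[str]] = {}
    for i in range(length):
        residues = {seq[i] for seq in aligned}
        if len(residues) > 1:  # position is variable across the set
            diversity[i + 1] = residues
    return diversity


# Toy, made-up aligned fragments used only to exercise the function.
toy_alignment = ["MNTKY", "MNSKY", "MNTRY"]
print(position_diversity(toy_alignment))
```

For the toy alignment, positions 3 and 4 are the only variable positions, carrying residues T/S and K/R respectively.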

The shuffling process resulted in generation of LIG3-4 variants with recombinations of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 14-38). Corresponding DNA sequences for expression of these meganucleases in yeast are shown in SEQ ID NOS: 40-65.

Example 4

Yeast Screening System for Identification of Meganuclease Variants with Increased Activity

Yeast screening strains were generated as hosts for the identification of meganuclease variants. The yeast Ade2 gene (Genetika, (1987) 23(7):1141-1148) (SEQ ID NO: 82) was used as a visible marker as well as a selection in the scheme depicted in FIG. 2. Gene fragments corresponding to the first 1000 nucleotides of the Ade2 coding sequence (SEQ ID NO: 83) (Ade2 5′ fragment) and the last 1011 nucleotides of the Ade2 coding sequence (Ade2 3′ fragment) were disrupted by a fragment including the yeast ura3 gene and meganuclease recognition sites. Three versions of the construct depicted in FIG. 2 were used. Plasmid pHD1327 (SEQ ID NO: 84) included the ZM6.3, ZM6.22, MHP42, MHP107 and LIG3-4 recognition sites. pVER8145 included the LIG3-4 recognition site, and pVER8189 included the MHP14, MHP77 and LIG3-4 recognition sites. There are 305 nucleotides of sequence duplication between the Ade2 5′ fragment and the Ade2 3′ fragment. The resulting constructs were used to replace the Ade2 gene (chromosome 15 nucleotide position 566193-564480) of yeast strain BY4742. The resulting yeast screening strains VER8145, VER8189 and HD1327 can be characterized as BY4742 MATa his3delta1 leu2delta0 lys2delta0 ura3delta0 Gal2+. If meganuclease cutting occurs between the duplicated sequences, homologous recombination can occur, resulting in a functional Ade2 gene.

The generation of a functional Ade2 gene can be used as a selection: when yeast cells are grown on media lacking adenine, only those with a functional Ade2 gene are able to grow.

The generation of a functional Ade2 gene can also be used as a screen. Yeast cells with a functional Ade2 gene are white, whereas those lacking Ade2 function exhibit red pigmentation due to accumulation of a metabolite earlier in the adenine biosynthetic pathway resulting in red colonies with white sectors as shown in FIGS. 2 and 3. The degree of white sectoring, sometimes extending to entire colonies, indicates the amount of meganuclease cutting activity. Since the sectoring phenotype is a qualitative measure of meganuclease activity, a 0-4 numerical scoring system was implemented. As shown in FIG. 3, a score of 0 indicates that no white sectors (no meganuclease cutting) were observed; a score of 4 indicates completely white colonies (complete cutting of the recognition site); scores of 1-3 indicate intermediate white sectoring phenotypes (and intermediate degrees of recognition site cutting).

Example 5

Meganuclease Expression Plasmid

A meganuclease expression plasmid was constructed using the plasmid p415GAL1 (ATCC; Nucleic Acids Res. (1994) 22(25):5767-5768). The LIG3-4 coding sequence was PCR amplified using primers MN031 (SEQ ID NO: 66) and MN022 (SEQ ID NO: 67) and inserted in p415GAL1 as an XbaI-XhoI restriction fragment. The resulting construct (pVER8134; SEQ ID NO: 68) is shown in FIG. 4. The meganuclease expression plasmid contains a centromeric replication origin and a leu2 nutritional marker for growth in yeast as well as the F1 replication origin and an ampicillin antibiotic resistance gene for growth in E. coli. The meganuclease expression cassette consists of the galactose-inducible GAL1 promoter and the CYC1 terminator. The meganuclease coding sequence was preceded by a nuclear localization signal sequence (SEQ ID NO: 69) encoding a 9 amino acid amino-terminal peptide (MAPKKKRKV, SEQ ID NO: 70) and followed by a sequence encoding a carboxy-terminal 6× histidine tag (SEQ ID NO: 71) to aid protein purification.

Similar meganuclease expression plasmids were constructed by exchanging the LIG3-4 meganuclease (nucleotide positions 500-1549 of pVER8134, SEQ ID NO: 68) with a variant meganuclease.

Example 6

Transformation of the Yeast Screening Strain (YSS) and Screening for Meganuclease Activity in Yeast

Shuffled meganuclease libraries (comprising the variant meganucleases) were inserted in the expression vector pVER8134 as described in Example 5 and transformed into a yeast screening strain comprising the corresponding meganuclease recognition site (Example 3) by the following procedure.

A 3 mL culture of selective media (MP Biomedical) was inoculated with a single colony of the yeast screening strain and grown at 30° C. overnight. On the following day, a 50 ml YPD culture (MP Biomedical) was started with 2 ml of the overnight culture and grown at 30° C. overnight. On the following day, the cells were harvested by centrifugation at 4000 rpm. The cells were resuspended in 100 ml ice cold water and centrifuged again. The cells were then washed in 1.2 M sorbitol, followed by treatment with 2 ml of 10 mM Tris pH 8.0, 1 mM EDTA, 100 mM Lithium acetate, 10 mM DTT, 0.6M sorbitol for 30 minutes at 30° C. with shaking. The cells were recovered by centrifugation, washed in 40 ml 1.2M sorbitol and finally resuspended in 250 microliters of 1.2M sorbitol. 50 microliter aliquots were transferred to test tubes on ice. Up to 5 microliters of DNA (100-500 nanograms) were added. The suspension was transferred to a 0.2 cm electroporation cuvette, on ice. Electroporation was performed with a pulse charge at 1.5 kV, 200 ohms, 25 microF (pulse time of 5 milliseconds). 1 mL YPD media (MP Biomedical) was added and the cells were allowed to recover at 30° C. for 1-2 hr. The cells were centrifuged, resuspended in 100 uL 1M sorbitol and plated on selective media lacking leucine and containing 2% galactose. The resulting yeast colonies were incubated at various temperatures ranging from 22 to 37° C. for 7-10 days. I-CreI and meganucleases derived from it have maximal activity at or above 37° C. Screening was performed at a range of temperatures from 22 to 37° C. in order to observe increases in activity at lower temperatures, which are relevant to certain biological systems (e.g., plant cells, plant cell cultures, etc.). At that time the red/white sectoring phenotype, indicative of meganuclease activity, was observed. Colonies with increased white sectoring over the parental meganuclease (indicating colonies expressing a meganuclease with increased meganuclease activity), also referred to as “hits” and sometimes completely white, were isolated for further analysis.
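The electroporation settings given above can be cross-checked with a short calculation. The sketch below assumes an exponential-decay pulse, for which the time constant is the product of resistance and capacitance; under that assumption the listed 200 ohms and 25 microF are consistent with the stated pulse time of 5 milliseconds. The function names are hypothetical and the field strength is simply the voltage divided by the cuvette gap.

```python
def rc_time_constant_ms(resistance_ohm: float, capacitance_uF: float) -> float:
    """Time constant tau = R * C, returned in milliseconds.
    ohm * microfarad gives microseconds, so divide by 1000."""
    return resistance_ohm * capacitance_uF / 1000.0


def field_strength_kv_per_cm(voltage_kv: float, gap_cm: float) -> float:
    """Nominal field strength across the cuvette gap."""
    return voltage_kv / gap_cm


tau_ms = rc_time_constant_ms(200, 25)            # settings from the protocol
field = field_strength_kv_per_cm(1.5, 0.2)       # 1.5 kV across a 0.2 cm cuvette
print(f"time constant: {tau_ms} ms, field strength: about {field:.1f} kV/cm")
```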

These potential “hits” were grown in liquid media to increase the cell density. DNA was extracted and used to transform E. coli. Plasmid DNA was extracted from E. coli cultures. The plasmid DNA corresponding to the potential hits was again transformed into the yeast screening strain as described above.

If the increase in white sectoring phenotype in yeast cells comprising the variant meganuclease (when compared to yeast comprising the parental meganuclease) was repeated, the variant was declared a “confirmed hit”. Meganuclease coding sequences were determined for confirmed hits. Each confirmed hit represents a variant meganuclease and was assigned a meganuclease activity score at various temperatures based on the 0-4 scale described in Example 4.

Table 2 shows the activity of LIG3-4 and LIG3-4 variant meganucleases in Yeast Screening Strain VER8145 assayed at 22° C. and 30° C. with 2% galactose. A score of 0 indicates that no white sectors were observed (no cutting, indicating no meganuclease activity); a score of 4 indicates completely white colonies (complete cutting of the recognition site, indicating high meganuclease activity); scores of 1-3 indicate intermediate white sectoring phenotypes (intermediate degrees of recognition site cutting, indicating intermediate meganuclease activity).


TABLE 2
Activity of LIG3-4 and LIG3-4 variant Meganucleases in Yeast Screening Strain assayed at 22° C. and 30° C.

SEQ ID NO:   Meganuclease    Assay score 22° C.   Assay score 30° C.
1            LIG3-4          0                    2
27           LIG3-4(B65)     4                    4
28           LIG3-4(B70)     4                    4
31           LIG3-4(B75)     4                    4
32           LIG3-4(B76)     4                    4
30           LIG3-4(B73)     4                    4
34           LIG3-4(B82)     4                    4
33           LIG3-4(B78)     4                    4
18           LIG3-4(B1)      3.5                  4
15           LIG3-4(15)      3                    4
38           LIG3-4(D8)      2.5                  4
19           LIG3-4(B15)     2.5                  4
35           LIG3-4(C1)      2                    4
29           LIG3-4(B71)     2                    4
24           LIG3-4(B39)     1                    4
20           LIG3-4(B16)     0.5                  4
37           LIG3-4(D7)      0.5                  4
23           LIG3-4(B38)     0                    4
25           LIG3-4(B40)     0                    4
22           LIG3-4(B36)     0                    4
21           LIG3-4(B24)     0                    4
26           LIG3-4(B55)     0                    4
16           LIG3-4(A4)      0                    3.5
36           LIG3-4(D5)      1                    3
14           LIG3-4(7)       1                    3
17           LIG3-4(A6)      0                    3

Alignment of the LIG3-4 variants relative to the LIG3-4 parent (LIG3-4.pro) is shown in FIG. 5A-FIG. 5E.

The various assay conditions are indicative of meganuclease activity, allowing a precise ranking of the shuffled variants by activity. Large increases in meganuclease activity (high scores) were observed. Complete cutting of the recognition site was observed with some variants even at the low temperature of 22° C. This is significant because the optimal temperature for I-Cre type meganucleases is 37° C., whereas the optimal temperature for certain biological systems (e.g., plant cell cultures) is in the range of 22-25° C. Hence, these variant meganucleases that can cut at lower temperatures are better suited to function well in plant systems when compared to the parental I-Cre type meganuclease.
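The ranking of variants by sectoring score can be illustrated with a few rows taken from Table 2. This is a minimal sketch: the dictionary holds only a subset of the table, and ordering by the 22° C. score first and the 30° C. score second is an assumption made for the illustration, not a rule stated in the text.

```python
# (score at 22° C., score at 30° C.) for a subset of Table 2
scores = {
    "LIG3-4": (0, 2),
    "LIG3-4(B65)": (4, 4),
    "LIG3-4(B1)": (3.5, 4),
    "LIG3-4(15)": (3, 4),
    "LIG3-4(7)": (1, 3),
    "LIG3-4(A6)": (0, 3),
}

parent_22, parent_30 = scores["LIG3-4"]

# Variants scoring above the parent at either temperature
improved = [
    name for name, (s22, s30) in scores.items()
    if name != "LIG3-4" and (s22 > parent_22 or s30 > parent_30)
]

# Rank by low-temperature score first, then by 30° C. score
ranked = sorted(scores, key=lambda name: scores[name], reverse=True)

for name in ranked:
    print(name, scores[name])
```

With these rows, LIG3-4(B65) ranks first and the parental LIG3-4 ranks last, and every listed variant exceeds the parent at one or both temperatures.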

Tables 3A and 3B show the amino acid modifications of the LIG3-4 variants relative to the LIG3-4 parental meganuclease.


TABLE 3A
Amino acid modifications of LIG3-4 variants relative to LIG3-4.

Amino acid positions shown:                16, 19, 22, 50, 54, 56, 59, 71, 81, 103, 121, 132, 153
LIG3-4 (SEQ ID NO: 1) at these positions:  F, G, S, Q, F, D, V, G, I, N, K, I, D

SEQ ID NO   Meganuclease    Substituted residues (left to right)
27          LIG3-4(B65)     C, K, K
28          LIG3-4(B70)     C, K, K
31          LIG3-4(B75)     C, K, P
32          LIG3-4(B76)     C, K
30          LIG3-4(B73)     C, K
34          LIG3-4(B82)     C, K, K
33          LIG3-4(B78)     C, P
18          LIG3-4(B1)      C, P
15          LIG3-4(15)      S
38          LIG3-4(D8)      I, H, G
19          LIG3-4(B15)     C
35          LIG3-4(C1)      C, K
29          LIG3-4(B71)     C, K
24          LIG3-4(B39)     C, P
20          LIG3-4(B16)     C, K
37          LIG3-4(D7)      R, L, K, L
23          LIG3-4(B38)     C
25          LIG3-4(B40)     K
22          LIG3-4(B36)     K
21          LIG3-4(B24)     C
26          LIG3-4(B55)     C
16          LIG3-4(A4)      V
36          LIG3-4(D5)      K, V, M
14          LIG3-4(7)       I
17          LIG3-4(A6)      V


TABLE 3B
(continued from Table 3A)

Amino acid positions shown:                185, 209, 222, 244, 246, 258, 281, 308, 316, 319, 345, 346
LIG3-4 (SEQ ID NO: 1) at these positions:  A, S, F, K, V, G, F, K, V, I, K, K

SEQ ID NO   Meganuclease    Substituted residues (left to right)
27          LIG3-4(B65)     K
28          LIG3-4(B70)     P
31          LIG3-4(B75)     K
32          LIG3-4(B76)     K
30          LIG3-4(B73)     K
34          LIG3-4(B82)     C, K
33          LIG3-4(B78)     K, N, N
18          LIG3-4(B1)      K
15          LIG3-4(15)      (none)
38          LIG3-4(D8)      (none)
19          LIG3-4(B15)     K
35          LIG3-4(C1)      L, N, N
29          LIG3-4(B71)     P
24          LIG3-4(B39)     P, N, N
20          LIG3-4(B16)     (none)
37          LIG3-4(D7)      (none)
23          LIG3-4(B38)     P
25          LIG3-4(B40)     K
22          LIG3-4(B36)     K
21          LIG3-4(B24)     (none)
26          LIG3-4(B55)     C, K
16          LIG3-4(A4)      V
36          LIG3-4(D5)      H, G
14          LIG3-4(7)       G, E, A
17          LIG3-4(A6)      Y

Example 7

Meganuclease Protein Production in E. coli

In order to further confirm and quantify the activity of meganuclease variants, meganuclease protein was produced in E. coli and subjected to an in vitro cutting assay on plasmid or corn genomic DNA containing the meganuclease recognition site. Total DNA was extracted from yeast strains harboring the meganuclease variants. The meganuclease coding sequence was PCR amplified and inserted in the expression vector pQE80 (QIAgen). The resulting plasmid was transformed into E. coli strain BL21 (Stratagene) with growth on LB media containing 100 ppm carbenicillin. A suspension of cells was prepared from the solid media and used to inoculate a 50 ml culture of 2×YT media containing 100 ppm carbenicillin at an optical density of 0.2. The cultures were grown at 37° C. When the optical density reached 0.8, protein expression was induced by addition of IPTG. The temperature was adjusted to 20° C., and the culture was grown for an additional 2 hours. E. coli cells were harvested by centrifugation, resuspended in Buffer 1 (50 mM Tris pH8, 500 mM NaCl, 10 mM imidazole) and lysed by sonication. Cell debris was removed by centrifugation. The supernatant was transferred to a disposable column loaded with 0.5 ml Nickel-NTA Superflow resin (QIAgen). The column was washed with 4 ml Buffer 2 (50 mM Tris pH8, 500 mM NaCl, 60 mM imidazole). Purified meganuclease protein was eluted with 0.6 ml Buffer 4 (50 mM Tris pH8, 500 mM NaCl, 400 mM imidazole). The meganuclease protein was transferred to a Vivaspin500 concentrator. Buffer exchange and concentration with SAB buffer (25 mM Tris pH8, 100 mM NaCl, 10 mM MgCl2, 5 mM EDTA) containing 50% glycerol, 0.5 mM dithiothreitol was performed. A final volume of approximately 0.1 ml of purified meganuclease protein solution was recovered. Bovine serum albumin was added to a final concentration of 100 microgram per milliliter.

Example 8

In Vitro Assay for Meganuclease Activity

Meganuclease protein was isolated as described in Example 7. Protein concentration was determined visually on Nu-PAGE gels (Life Technologies) by calculating and then comparing band intensity with serially diluted samples of known concentration. DNA concentration was determined using a Hoechst dye fluorometric assay. Time-course digestions were carried out on plasmid DNA containing the meganuclease recognition site at 37° C., 28° C., and 23° C. with 25 nM of purified meganuclease protein and 0.25 nM of linearized plasmid substrate in digestion buffer (100 mM Tris-HCl (pH 7.9)/100 mM NaCl/10 mM MgCl2/1 mM DTT/5 mM EDTA) in a final volume of 80 μl. 20 μl time-points were taken at 0, 25, 50, and 75 minutes and stopped with an equal volume of stop buffer (100 mM Tris-HCl, pH 8.0/600 mM NaCl/2% SDS/100 mM EDTA/1 mg of proteinase K per ml), incubated at 50° C. for 30-45 minutes, and purified with a Qiagen PCR purification column per the manufacturer's instruction. To quantify the % digestion of each sample or loss of meganuclease recognition sites, real-time PCR was performed on 1 μl of purified plasmid DNA diluted 50-fold in water with a TaqMan assay spanning the meganuclease recognition site. The loss of meganuclease recognition sites was calculated via the ΔΔCt method relative to an internal control TaqMan assay. The 0 minute timepoint or mock control was used as the calibrator. Timed digestions were carried out on genomic DNA at 37° C., 28° C., and 23° C. with 6.07 μg of corn genomic DNA and 16 nM of purified meganuclease protein in a final volume of 80 μl. After 50 minutes, digestion reactions were stopped as described above and purified by phenol/chloroform extraction and ethanol precipitated in the presence of 0.2M NaCl. Precipitated genomic DNA was washed twice with 70% ethanol, dried, and resuspended in 34 μl of water. The percent digestion of each sample was quantified by real-time PCR as described above for plasmid substrate except 1 μl of undiluted genomic DNA was assayed by real-time PCR. Since the cleavage activity of the I-CreI endonuclease has been demonstrated to be sensitive to temperatures below 37° C. (Wang, et al., (1997) Nucleic Acids Res. 25:3767-3776), in vitro assays to assess cleavage activity of the I-CreI derived parental meganuclease and its variants were carried out at 37° C., 28° C., and 23° C.
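The ΔΔCt calculation referred to above can be sketched as follows. The text does not give the formula, so this sketch assumes the conventional relationship in which the fraction of intact recognition sites equals 2 raised to the power of minus ΔΔCt, with both TaqMan assays amplifying at 100% efficiency. The function names and the Ct values are hypothetical.

```python
def delta_delta_ct(ct_site_sample: float, ct_ref_sample: float,
                   ct_site_cal: float, ct_ref_cal: float) -> float:
    """ddCt = (site - reference) in the sample minus (site - reference)
    in the calibrator (the 0 minute timepoint or mock control)."""
    return (ct_site_sample - ct_ref_sample) - (ct_site_cal - ct_ref_cal)


def percent_digestion(ct_site_sample: float, ct_ref_sample: float,
                      ct_site_cal: float, ct_ref_cal: float) -> float:
    """Percent loss of recognition sites, assuming 2 ** -ddCt is the
    fraction of sites that remain intact (assumed 100% PCR efficiency)."""
    ddct = delta_delta_ct(ct_site_sample, ct_ref_sample, ct_site_cal, ct_ref_cal)
    remaining = 2.0 ** (-ddct)
    return (1.0 - remaining) * 100.0


# Hypothetical Ct values: the recognition-site assay is delayed by 2 cycles
# in the digested sample while the internal control assay is unchanged.
print(percent_digestion(22.0, 20.0, 20.0, 20.0))  # 75.0
```

A two-cycle delay in the assay spanning the recognition site corresponds to one quarter of the sites remaining, that is, 75% digestion under these assumptions.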

In-Vitro Meganuclease Activity of LIG3-4 and LIG3-4 Variants

On plasmid DNA containing the LIG3-4 recognition site, LIG3-4(B65) was the most active variant, sustaining cleavage activity at 23° C., while little if any cleavage was detected for LIG3-4 and only slight cleavage was detected for LIG3-4(7) and LIG3-4(15) (FIG. 6A). At 28° C., LIG3-4(15) (hit15) and LIG3-4(7) (hit7) achieved 66% and 50% cleavage, respectively, after 75 minutes while only slight cleavage was detected for LIG3-4 (FIG. 6B). At 37° C., hit7 demonstrated cleavage activity similar to LIG3-4 while hit15 and B65 cleaved the plasmid substrate more rapidly and to a greater extent (FIG. 6C). Based on plasmid DNA cleavage, all of the shuffled variants were more active than LIG3-4, with B65 being the most active variant, followed in activity by hit15 and then hit7. These data closely mimicked the observations in the yeast assay.

The activity ranking established on plasmid DNA was conserved when genomic DNA cleavage was monitored. At 23° C., B65 maintained significant activity, cleaving 69% of its genomic substrate (FIG. 7A). At 28° C., no cleavage activity was detected for LIG3-4 while B65, hit15 and hit7 obtained 94%, 33% and 24% cleavage, respectively (FIG. 7B). At 37° C., LIG3-4 exhibited 47% cleavage at the LIG3-4 genomic recognition site while B65, hit15 and hit7 achieved 99%, 92% and 76% cleavage, respectively (FIG. 7C). Again, the data from in vitro cutting of maize genomic DNA were consistent with observations in the yeast assay.

Example 9

Analysis of Meganuclease Activity of LIG3-4 Variants in Maize

LIG3-4 variants were created as described in Example 3 and introduced in maize by particle gun transformation and Agrobacterium-mediated transformation.

Three LIG3-4 variants, LIG3-4 (B65) (SEQ ID NO: 27), LIG3-4(15) (SEQ ID NO: 15) and LIG3-4(7) (SEQ ID NO: 14) showed an increased meganuclease activity in yeast (Example 3) and an increased activity in the in vitro assay (Example 8, FIGS. 6A-6C; FIGS. 7A-7C) and were further tested in-vivo for their activity in maize.

A. Vector Construction for Plant Expression Vectors of the Meganuclease Genes and Repair (Donor) DNAs for Transgene Integration by Homologous Recombination

Genes encoding the meganucleases were codon optimized for expression in maize using standard molecular biology techniques. The resulting plant-optimized nucleotide sequences were also supplemented with DNA sequences encoding a SV40 nuclear localization signal (SEQ ID NO: 72) and further modified by addition of the potato ST-LS1 intron to the coding sequence of the first monomer in order to eliminate its expression in E. coli and Agrobacterium. The resulting LIG3-4 variants, LIG3-4(7) (SEQ ID NO: 73), LIG3-4(15) (SEQ ID NO: 74) and LIG3-4 (B65) (SEQ ID NO: 75) were further tested for their activity in maize (in vivo).

Vectors comprising expression cassettes for the appropriate meganuclease were constructed using standard molecular biological techniques. For each of the meganucleases, a plant expression vector comprising a polynucleotide encoding one of the meganuclease genes was operably linked to a maize constitutive promoter.

To achieve site-specific DNA insertions, a repair DNA (donor DNA) containing the gene of interest has to be simultaneously present in the cell in addition to the recognition site and the meganuclease. A vector PHP46961 (SEQ ID NO: 76) containing a polynucleotide encoding the engineered meganuclease variant LIG3-4(15), and a donor DNA were constructed using standard molecular biology techniques. Similar vectors PHP46949 or PHP47257 were constructed containing the LIG3-4(B65) or LIG3-4(7) meganuclease, respectively. The donor DNA contained an herbicide resistance gene (MoPAT, encoding a phosphinothricin acetyltransferase), used as the selection marker for transformation, and was flanked by two homologous recombination fragments, LIG3-4HR1 (SEQ ID NO: 77) and LIG3-4HR2 (SEQ ID NO: 78), which were about 1 kb long genomic DNA sequences flanking the meganuclease recognition sites. A vector containing LIG3-4 (PHP43914, produced as described for PHP46961) was also included as control.

The LIG3-4 variants' expression cassettes were also co-integrated into LBA4404 for Agrobacteria delivery. Vector names were PHP47331 for LIG3-4(B65), PHP47332 for LIG3-4 (15) and PHP47517 for LIG3-4(7), respectively.

Maize immature embryos 9-12 DAP (days after pollination, approximately 1.5-2.0 mm in size) from a maize transformable line were used for gene transformation by bombardment (Example 1 and Example 2). The immature embryos were placed on 560Y medium for 4 hours at 26° C. or alternatively, immature embryos were incubated at temperatures ranging from 26° C. to 37° C. for 8 to 24 hours prior to placing on 560Y preceding bombardment. Developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2); US 2009/0328252 A1) and Wushel were included in the experiments through co-bombardment (Example 2). Maize immature embryos were transformed with the vectors PHP43914, PHP46949, PHP46961, and PHP47257.

Maize immature embryos 9-12 DAP (days after pollination, approximately 1.5-2.0 mm in size) from a maize transformable line were used for gene transformation by Agrobacterium. No developmental genes ODP2 or Wushel were included in the infection. Maize immature embryos were transformed with vectors PHP47331, PHP47332, and PHP47517.

B. Meganuclease Activity of LIG3-4 Variants in Maize

To examine whether the LIG3-4 variants showed increased meganuclease activity in maize when compared to LIG3-4, about 2000 maize immature embryos were bombarded with plasmid DNA comprising each variant or control. Following bombardment, embryos were incubated on 560P (maintenance medium) at 28° C., then selected on bialaphos herbicide. Successful delivery of LIG3-4 and the LIG3-4 variant donor vectors (PHP43914, PHP46949, PHP46961, and PHP47257) conferred bialaphos herbicide resistance, and was used to identify putative events by callus selection on herbicide containing media. Callus tissues and/or plants regenerated from stable transformants were screened for modification of the endogenous LIG3-4 recognition site.

Herbicide-resistant events were screened for modification at the meganuclease target site (comprising the recognition site) by measuring the target site copy-number using real-time PCR (qPCR). Two copies of the target site indicate that both alleles are wild type and that no modification occurred at the recognition site. If only one copy of the target site is detected by qPCR, this means that one allele of the target site has changed during repair of the double strand break generated by the LIG3-4 or its shuffle variants, while absence of the target site (null or 0) is the result of both alleles being modified. The copy number can also be in between 1 and 2 due to the chimeric nature of callus samples. The probe sequence for qPCR of LIG3-4 target site was ATACCTCACACGTACGCG (SEQ ID NO: 79), the LIG3-4_forward primer was GATTTACGCACCTGCTGGGA (SEQ ID NO: 80) and LIG3-4_reverse primer was CTGAGCTGTATTCCCGCGCA (SEQ ID NO:81). The amplicon was approximately 100 bp.
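The copy-number interpretation described above can be sketched as a small classifier. The tolerance of 0.2 around the integer copy numbers is a hypothetical choice made for illustration, as are the function and label names; the text does not state the cut-offs that were used. The sketch also checks, from the sequences as printed, that the qPCR probe (SEQ ID NO: 79) lies within the 22 bp LIG3-4 recognition sequence (SEQ ID NO: 2).

```python
RECOGNITION_SITE = "ATATACCTCACACGTACGCGTA"  # SEQ ID NO: 2
PROBE = "ATACCTCACACGTACGCG"                 # SEQ ID NO: 79


def classify_copy_number(copy_number: float, tol: float = 0.2) -> str:
    """Interpret a qPCR target-site copy number (tol is a hypothetical cut-off)."""
    if copy_number <= tol:
        return "both alleles modified"
    if abs(copy_number - 1.0) <= tol:
        return "one allele modified"
    if abs(copy_number - 2.0) <= tol:
        return "wild type"
    if 1.0 < copy_number < 2.0:
        return "chimeric"
    return "unclassified"


for value in (2.0, 1.0, 0.0, 1.5):
    print(value, classify_copy_number(value))

# The probe sequence is contained in the recognition sequence,
# starting at the third base of the site (0-based offset 2).
print(len(RECOGNITION_SITE), RECOGNITION_SITE.find(PROBE))
```

Because the probe overlaps the recognition sequence, a modification that disrupts the site would be expected to reduce the detected copy number.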

Transgenic events with a target site copy number of 0, 1, or between 1 and 2 were further analyzed for increased meganuclease activity. The meganuclease activity was determined by measuring the Target Site (TS) mutation rate. Target site mutation rate was defined as: (number of events with target site modification/total number events)*100%. The TS mutation rate for the LIG3-4 meganuclease was 6% (Table 4). The Event Recovery Rate (Table 4) is calculated using number of events recovered divided by total number of embryos bombarded, and may indicate if a meganuclease has some toxic effect or not. Table 4 shows the effect of different LIG3-4 variants after bombardment and 6-8 weeks antibiotic selection. The meganuclease variants LIG3-4(7) and LIG3-4(15) both yielded significantly higher mutation frequencies when compared to the parental LIG3-4 meganuclease, consistent with observations in the yeast assay and in vitro DNA cutting assays. LIG3-4(B65) also yielded higher mutation frequency than the parental LIG3-4, but not as high as the other LIG3-4 variants. This may be due to toxicity associated with this very active meganuclease as indicated by the event recovery rate of LIG3-4(B65).
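The two rates defined above can be written out as a short calculation. The event and embryo counts below are hypothetical and are used only to show the arithmetic; the underlying counts are not given in the text, and the function names are invented for the sketch.

```python
def ts_mutation_rate(events_with_modification: int, total_events: int) -> float:
    """(number of events with target site modification / total number of events) * 100%."""
    return 100.0 * events_with_modification / total_events


def event_recovery_rate(events_recovered: int, embryos_bombarded: int) -> float:
    """Number of events recovered divided by total embryos bombarded, as a percentage."""
    return 100.0 * events_recovered / embryos_bombarded


# Hypothetical counts, for illustration only.
embryos = 2000
events = 300
modified = 162

print(event_recovery_rate(events, embryos))  # 15.0
print(ts_mutation_rate(modified, events))    # 54.0
```

A low event recovery rate alongside a high mutation rate is the pattern the text associates with possible toxicity of a very active meganuclease.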


TABLE 4
Activity of LIG3-4 and LIG3-4 variant meganucleases as determined by target site mutation rate (TS mutation rate) in plant tissue originated through gene bombardment transformation.

Meganuclease    Event Recovery Rate   TS Mutation Rate   Insertion
LIG3-4          17%                   6%                 Yes
LIG3-4(7)       13%                   29%                Yes
LIG3-4(15)      15%                   54%                Yes
LIG3-4(B65)     3%                    21%                Yes


TABLE 5
Activity of LIG3-4 and LIG3-4 variant meganucleases as determined by target site mutation rate (TS mutation rate) in plant tissue originated through Agrobacterium transformation.

Meganuclease    Event Recovery Rate   Mutation Rate   Insertion
LIG3-4          ~20%                  1-3%            No
LIG3-4 (7)      19%                   15%             Yes
LIG3-4 (15)     11%                   34%             Yes
LIG3-4 (B65)    9%                    74%             Yes

Table 5 indicates that all three variant meganucleases (LIG3-4(7), LIG3-4(15) and LIG3-4(B65)) showed increased meganuclease activity (TS mutation rates of 15%, 34% and 74%, respectively) when compared to the control non-variant LIG3-4 (TS mutation rate 1 to 3%). The highest increase in meganuclease activity was observed when plant tissue was generated through Agrobacterium transformation (Table 5). These data are consistent with data obtained in the yeast and in vitro cutting assays with these variants.

When the meganuclease and gene delivery constructs were introduced via Agrobacterium-mediated transformation, there was a much smaller reduction in the recovery of transgenic events (higher event recovery rate in Table 5 when compared to Table 4). This may be due to the fact that less DNA is delivered to the nuclei of the plant cells by this method.

Maize calli were also screened for integration of the transgene cassette from the donor DNA (PHP43914, PHP46949, PHP46961, PHP47257; agro of PHP47331, PHP47332, and PHP47517) at the LIG3-4 recognition site through junction PCR and selected callus events were regenerated into T0 plants (FIG. 8A-FIG. 8B). When integration occurred, i.e., the donor sequence was integrated at the recognition site, Insertion is designated as “Yes”. When no integration occurred, Insertion is designated as “No” (Tables 4 and 5). Targeting of transgenes to the LIG3-4 locus was observed with each LIG3-4 variant delivered by particle bombardment (Table 4). When introduced via Agrobacterium-mediated transformation, each LIG3-4 variant enabled transgene integration at the target site, whereas the parental LIG3-4 did not (Insertion “Yes” for variants, “No” for LIG3-4; Table 5).

Example 10

Creation of MHP77 and MHP77.3 Variant Meganucleases

A. MHP77 & MHP77.3 Meganucleases and MHP 77 Recognition Site

An endogenous maize genomic target site comprising the MHP77 recognition site (SEQ ID NO: 85) was selected for design of a custom double-strand break inducing agent. The MHP77 recognition site is a 22 bp polynucleotide located on chromosome 1 and having the following sequence:


(SEQ ID NO: 85)
GGGCGGTATGTATGTCATACTA

Wild type I-CreI meganuclease (SEQ ID NO: 3) was modified to produce two engineered meganucleases, MHP77 (SEQ ID NO: 86) and MHP77.3 (SEQ ID NO: 250), designed to recognize the MHP77 recognition sequence. The design of custom made meganucleases has been described in US Patent Application Publication Number US 2007/0117128 A1.

B. MHP77 and MHP77.3 Variant Meganucleases

Variants of the MHP77 meganuclease were created through gene shuffling methods in a manner similar to how the LIG3-4 variants were created and described in Example 3. This involved the introduction of amino acid modifications as found in naturally occurring meganuclease proteins and previously identified in LIG3-4 variants as well as random mutations. The shuffling process resulted in generation of MHP77 variants with recombination of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 86-167).

Three variants of the MHP77.3 meganuclease were created by incorporating the same amino acid modifications (mutations) of MHP77(L9-02), MHP77(L9-11), or MHP77(L9-12), thus creating MHP77.3 (L9-02) (SEQ ID NO: 251), MHP77.3 (L9-11) (SEQ ID NO: 252), and MHP77.3(L9-12) (SEQ ID NO: 253). MHP77.3 (15) (SEQ ID NO: 262) contained the exact same nucleotide/amino acid modifications as described for LIG3-4 (15). The amino acid modifications were introduced into MHP77.3 through standard molecular biology techniques.

C. MHP77 Variant Meganucleases Activity in Yeast

A total of 79 MHP77 variants with increased activity were confirmed in the yeast system (as described in Example 6). Increased activity was observed across a range of temperatures: 24° C., 28° C., 30° C. and 37° C., as shown in Table 6. A score of 0 indicates that no white sectors were observed (no cutting, indicating no meganuclease activity); a score of 4 indicates completely white colonies (complete cutting of the recognition site, indicating high meganuclease activity); scores of 1-3 indicate intermediate white sectoring phenotypes (intermediate degrees of recognition site cutting, indicating intermediate meganuclease activity).


TABLE 6
Activity of MHP77 variant Meganucleases in Yeast Screening Strain
assayed at different temperatures.

Variant            24° C.   28° C.   30° C.   37° C.
MHP77              0        0        0        0
MHP77(L15-31)      X        4        4        4
MHP77(L16-11)      4        4        4        4
MHP77(L16-09)      2.5      4        4        4
MHP77(L16-04)      2        4        4        4
MHP77(L16-19)      2        4        4        4
MHP77(L16-17)      3        3.5      4        4
MHP77(L16-23)      1        3.5      4        4
MHP77(L15-34)      1        3.5      4        4
MHP77(L15-40)      0.5      3.5      4        4
MHP77(L15-39)      0.5      3.5      4        4
MHP77(L15-45)      0.5      3        4        4
MHP77(L15-29)      0.5      2.5      4        4
MHP77(L15-06)      0        2        4        4
MHP77(L16-08)      1        3        3.5      4
MHP77(L16-05)      1        3        3.5      4
MHP77(L16-02)      0.5      2.5      3.5      4
MHP77(L16-24)      0        2.5      3.5      4
MHP77(L16-21)      0        2.5      3.5      4
MHP77(L16-14)      0        2.5      3.5      4
MHP77(L16-18)      0.5      2        3.5      4
MHP77(L15-27)      0        2        3.5      4
MHP77(L9-02)       0        2        3        4
MHP77(L16-12)      0        2        3        4
MHP77(L16-01)      0        2        3        4
MHP77(L15-05)      0        2        3        4
MHP77(L15-24)      0        2        3        4
MHP77(L16-06)      0        1        3        4
MHP77(L16-15)      0        1        3        4
MHP77(L15-33)      0        1        3        4
MHP77(L16-03)      0        2        2.5      4
MHP77(L15-47)      0        0        2.5      4
MHP77(L15-46)      0        0        2.5      4
MHP77(L9-12)       0        1        2        4
MHP77(L16-16)      0        1        2        4
MHP77(L15-10)      0        1        2        4
MHP77(L9-03)       0        0.5      2        4
MHP77(L15-20)      0        0.5      2        4
MHP77(L15-28)      0        0        2        4
MHP77(L15-21)      0        0        2        4
MHP77(L15-13)      0        0        2        4
MHP77(L9-04)       0        0        1        4
MHP77(L15-18)      0        0        1        4
MHP77(L18-01)      X        0        0        4
MHP77(L17-12)      X        0        0        4
MHP77(L17-01)      X        0        0        4
MHP77(L15-03)      0        0.5      2        3.5
MHP77(L15-11)      0        0.5      1        3.5
MHP77(L18-12)      X        0        1        3.5
MHP77(L15-15)      0        0        1        3.5
MHP77(L15-12)      0        0        1        3.5
MHP77(L9-1)*       0        1        2        3
MHP77(L9-9)        0        0        1        3
MHP77(L9-11)       0        0        1        3
MHP77(L9-10)       0        0        1        3
MHP77(L15-02)      0        0        1        3
MHP77(L15-08)      0        0        0.5      3
MHP77(L16-07)      0        0        0        3
MHP77(L15-35)      0        0        0        2.5
MHP77(L13-12)      0        0        0        2.5
MHP77(L113-01)     0        0        0        2.5
MHP77(L9-06)       0        0.5      0.5      2
MHP77(L15-42)      0        0        0        2
MHP77(L15-41)      0        0        0        2
MHP77(L15-36)      0        0        0        2
MHP77(L15-30)      0        0        0        2
MHP77(L112-03a)    0        0        0        2
MHP77(L73-02a)     0        0        0        1.5
MHP77(L13-10B1)    0        0        0        1.5
MHP77(L72-08a)     X        0        0        1
MHP77(L72-09a)     0        0        0        1
MHP77(L72-01a)     0        0        0        1
MHP77(L13-08a)     0        0        0        1
MHP77(L13-06)      0        0        0        1
MHP77(L13-02)      0        0        0        1
MHP77(L13-01a)     0        0        0        1
MHP77(L73-05a)     0        0        0        0.5
MHP77(L15-43)      0        0        0        0.5
MHP77(L13-04)      0        0        0        0.5
MHP77(L13-11)      0        0        0        0.5
MHP77(L15-23)      0        0        0        0
MHP77(L15-16)      0        0        0        0

Large increases in meganuclease activity (high scores) were observed. Complete cutting of the recognition site was observed with some variants even at the low temperature of 24° C. (see MHP77(L16-11), Table 6).
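The scoring scheme used for the yeast assay can be illustrated with a minimal Python sketch. The function and variable names are hypothetical, and the treatment of the entry “X” as “not scored” is an assumption, since the text does not define that symbol. The three example rows are copied from Table 6.

```python
TEMPERATURES_C = (24, 28, 30, 37)  # assay temperatures listed for Table 6


def interpret_score(score):
    """Translate a yeast sectoring score (0-4) into the reading given in the text."""
    if isinstance(score, str) and score.strip().upper() == "X":
        # Assumption: "X" is not defined in the source; treated here as "not scored".
        return "not scored"
    value = float(score)
    if not 0 <= value <= 4:
        raise ValueError(f"score out of range 0-4: {score!r}")
    if value == 0:
        return "no cutting"
    if value == 4:
        return "complete cutting"
    return "intermediate cutting"


def summarize(scores):
    """Pair each assay temperature with the interpretation of its score."""
    return {temp: interpret_score(s) for temp, s in zip(TEMPERATURES_C, scores)}


# Rows copied from Table 6.
rows = {
    "MHP77": (0, 0, 0, 0),
    "MHP77(L16-11)": (4, 4, 4, 4),
    "MHP77(L15-31)": ("X", 4, 4, 4),
}

for name, scores in rows.items():
    print(name, summarize(scores))
```

Under these assumptions, the parental MHP77 reads as “no cutting” at every temperature, while MHP77(L16-11) reads as “complete cutting” at every temperature.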

FIG. 9A-FIG. 9N show the amino acid modifications of MHP77 variants relative to the MHP77 parental meganuclease. A (-) indicates that the amino acid is identical to the MHP77 reference sequence.

Example 11

Analysis of MHP77 and MHP77.3 Meganuclease Variants in Maize

Genes encoding the MHP77 and MHP77.3 engineered meganucleases (Example 10) were optimized for expression in plants. The engineered meganuclease expression cassettes contained the maize codon-optimized nucleotide sequences for better performance in maize cells. The meganuclease gene sequences were also supplemented with DNA sequences encoding an SV40 nuclear localization signal, resulting in the plant optimized sequences of SEQ ID NO: 254 for MHP77 and SEQ ID NO: 255 for MHP77.3. The maize ubiquitin promoter and the potato proteinase inhibitor II gene terminator sequences completed the endonuclease gene designs.

The plant optimized nucleotide sequences for the MHP77 and MHP77.3 variants were MHP77(L9-02) (SEQ ID NO: 256), MHP77(L9-11) (SEQ ID NO: 257), MHP77(L9-12) (SEQ ID NO: 258), MHP77.3 (L9-02) (SEQ ID NO: 259), MHP77.3 (L9-11) (SEQ ID NO: 260), MHP77.3(L9-12) (SEQ ID NO: 261), and MHP77.3 (15) (SEQ ID NO: 263).

A. Vector Construction for Plant Expression Vectors of the Meganuclease Genes and Repair (Donor) DNAs for Transgene Integration by Homologous Recombination

Vectors comprising expression cassettes for the appropriate meganuclease were constructed using standard molecular biological techniques. For each of the meganucleases, a plant expression vector was constructed comprising a polynucleotide encoding the meganuclease operably linked to a maize constitutive promoter.

To achieve site-specific DNA insertions, a repair DNA (donor DNA) containing the gene of interest has to be simultaneously present in the cell in addition to the recognition site and the meganuclease. Vectors similar to PHP46961 (SEQ ID NO: 76) described in Example 9, but containing a polynucleotide encoding the meganuclease variant MHP77(L9-11), MHP77(L9-12), MHP77(L9-02), MHP77.3(L9-11), MHP77.3(L9-12), MHP77.3(L9-02), or MHP77.3(15); and a donor DNA were constructed using standard molecular biology techniques. These vectors were referred to as PHP53132, PHP53134, PHP53136, PHP53133, PHP53135, PHP53137 and PHP50239. The donor DNA contained an herbicide resistance gene used as the selection marker for transformation. The herbicide resistance gene MoPAT encodes a phosphinothricin acetyltransferase, and was flanked by two homologous recombination fragments, MHP77HR1 (SEQ ID NO:264) and MHP77HR2 (SEQ ID NO: 265), which were about 1 kb long genomic DNA sequences flanking the meganuclease recognition sites.

Maize immature embryos 9-12 DAP (days after pollination, approximately 1.5-2.0 mm in size) from a maize transformable line were used for gene transformation by bombardment (Example 1 and Example 2). The immature embryos were placed on 560Y medium for 4 hours at 26° C. or alternatively, immature embryos were incubated at temperatures ranging from 26° C. to 37° C. for 8 to 24 hours prior to placing on 560Y preceding bombardment. Developmental genes ODP2 (AP2 domain transcription factor ODP2 (Ovule development protein 2); US 2009/0328252 A1) and Wushel were included in the experiments through co-bombardment (Example 2). Maize immature embryos were transformed with the vectors PHP53132, PHP53134, PHP53136, PHP53133, PHP53135, PHP53137, and PHP50239.

B. Meganuclease Activity of MHP77 and MHP77.3 Variants in Maize

To examine whether the MHP77 and MHP77.3 meganuclease variants had increased meganuclease activity when compared to MHP77 or MHP77.3, about 2000 maize immature embryos were bombarded with plasmid DNA of each variant and control. Following bombardment, embryos were incubated on 560P (maintenance medium) at 28° C., then selected on herbicide (bialaphos). Successful delivery of the MHP77 or MHP77.3 variant donor vectors (PHP45970, PHP50238, PHP53132, PHP53134, PHP53136, PHP53133, PHP53135, PHP53137, or PHP50239) conferred bialaphos herbicide resistance, which was used to identify putative events by callus selection on herbicide-containing media. Callus tissues and/or plants regenerated from stable transformants using standard culture and regeneration conditions were screened for modification of the endogenous MHP77 recognition site.

Herbicide-resistant events were screened for modification at the target site (comprising the MHP77 recognition site) by measuring target site copy-number using qPCR as described in Example 9. The probe sequence for qPCR of MHP77 target site was ACTAATTCAAGTGATGGACAAA (SEQ ID NO: 266), the MHP77_forward primer was TCCTTAGGGCGGTATGTATGTCA (SEQ ID NO: 267) and MHP77_reverse primer was CATCGGTCAAAAAACACATAAACTTT (SEQ ID NO: 268). The amplicon was approximately 100 bp.
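The relationship between the qPCR oligonucleotides and the recognition site can be checked with a short sketch. It assumes that the forward primer, probe and recognition site are all written on the same strand, which the overlaps found below suggest but the text does not state. The helper name `suffix_prefix_overlap` is hypothetical.

```python
def suffix_prefix_overlap(left, right):
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    left, right = left.upper(), right.upper()
    for k in range(min(len(left), len(right)), 0, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


MHP77_SITE = "GGGCGGTATGTATGTCATACTA"      # SEQ ID NO: 85
FORWARD_PRIMER = "TCCTTAGGGCGGTATGTATGTCA"  # SEQ ID NO: 267
PROBE = "ACTAATTCAAGTGATGGACAAA"           # SEQ ID NO: 266

# 3' end of the forward primer versus 5' end of the recognition site.
primer_overlap = suffix_prefix_overlap(FORWARD_PRIMER, MHP77_SITE)
# 3' end of the recognition site versus 5' end of the probe.
probe_overlap = suffix_prefix_overlap(MHP77_SITE, PROBE)

print(primer_overlap, probe_overlap)  # prints: 17 4
```

The last 17 bases of the forward primer match the first 17 bases of the recognition site, and the first 4 bases of the probe match its last 4 bases. This is consistent with an assay in which sequence changes at the recognition site reduce the measured target-site copy number.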

Target site mutation rate (TS mutation rate, Table 7) is an indirect measure of meganuclease activity. Table 7 shows the effect of the different shuffled variants of the MHP77 meganuclease after bombardment and 6-8 weeks of antibiotic selection. Table 7 indicates that all three shuffled variants of the MHP77 meganuclease are more active than the MHP77 meganuclease. Increased activity of the shuffled MHP77 meganucleases also resulted in a reduction of event recovery when compared to MHP77 (control).


TABLE 7
Activity of MHP77 and MHP77 variant meganucleases as
determined by target site mutation rate (TS mutation rate) in
plant tissue originated through gene bombardment transformation.

Meganuclease       Event Recovery Rate   TS Mutation Rate   Insertion
MHP77 (control)    21%                   1%                 no
MHP77L9-11         11%                   4%                 no
MHP77L9-12         9%                    17%                yes
MHP77L9-02         3%                    6%                 no


TABLE 8
Activity of MHP77.3 and MHP77.3 variant meganucleases as
determined by target site mutation rate (TS mutation rate) in
plant tissue originated through Agrobacterium transformation.

Meganuclease       Transformation rate   Mutation Rate   Insertion
MHP77.3            14%                   11%             no
MHP77.3(15)        13%                   22%             yes
MHP77.3L9-11       9%                    35%             yes
MHP77.3L9-12       3%                    19%             yes
MHP77.3L9-02       2%                    5%              no

Table 8 indicates that three of the four variants of the MHP77.3 meganuclease, MHP77.3(15), MHP77.3L9-11 and MHP77.3L9-12, are more active than the non-variant MHP77.3 meganuclease, whereas MHP77.3L9-02 showed a lower mutation rate (5% versus 11%). Increased activity of some but not all MHP77.3 variants resulted in a reduction of event recovery when compared to MHP77.3.

Maize calli were also screened for integration of the transgene cassette from the donor DNA vector (PHP45970, PHP50238, PHP53132, PHP53134, PHP53136, PHP53133, PHP53135, PHP53137, and PHP50239) at the MHP77 recognition site through junction PCR, and selected callus events were regenerated into T0 plants. When integration occurred, i.e., the donor sequence was integrated at the recognition site, Insertion (Tables 7 and 8) is designated as “yes”. When no integration occurred, Insertion is designated as “no”.

Example 12

Creation of MS26 Variant Meganucleases

A. MS26+& MS26++ Meganucleases and MS26 Recognition Site

An endogenous maize genomic target site comprising the MS26 recognition site (SEQ ID NO: 269) was selected for design of a custom double-strand break inducing agent. The MS26 recognition site is a 22 bp polynucleotide having the following sequence:

(SEQ ID NO: 269) gatggtgacgtacgtgccctac

Wild type I-CreI meganuclease (SEQ ID NO: 3) was modified to produce two engineered meganucleases, MS26+ (SEQ ID NO: 270) and MS26++ (SEQ ID NO: 271), designed to recognize the MS26 recognition sequence. The design of custom made meganucleases has been described in US Patent Application Publication Number US 2007/0117128 A1.

B. MS26 Variant Meganucleases

As described in Examples 6 and 9, LIG3-4 variants were introduced into yeast and maize and demonstrated significantly higher meganuclease activity when compared to the non-variant LIG3-4 meganuclease. These LIG3-4 variants were characterized by specific amino acid modifications when compared to the parental LIG3-4 (Table 2 and FIG. 5A-5E). To test whether these amino acid modifications (and respective nucleotide modifications) can also increase the activity of an MS26+ meganuclease, the exact same nucleotide/amino acid modifications as described for LIG3-4 (7), LIG3-4 (15), and LIG3-4 (B65) were introduced into MS26+ through standard molecular biology techniques, resulting in the following three MS26+ variants: MS26+ (7) (SEQ ID NO: 272), MS26+ (15) (SEQ ID NO: 273), and MS26+ (B65) (SEQ ID NO: 274).

Similarly, the MS26++ nucleotide/amino acid sequence was optimized to include the nucleotide/amino acid modifications of LIG3-4 (15), resulting in the MS26++ (15) meganuclease variant (SEQ ID NO: 275).

Example 13

Analysis of Meganuclease Activity of MS26+ and MS26++ Variants in Maize

Genes encoding the MS26+ and MS26++ engineered meganucleases were optimized for expression in plants. The engineered meganuclease expression cassettes contained the maize codon-optimized nucleotide sequences for better performance in maize cells. The meganuclease gene sequences were also supplemented with DNA sequences encoding an SV40 nuclear localization signal (SEQ ID NO: 34), resulting in the plant optimized sequences of SEQ ID NO: 276 for MS26+ and SEQ ID NO: 279 for MS26++. The maize ubiquitin promoter and the potato proteinase inhibitor II gene terminator sequences completed the endonuclease gene designs. Plant optimized sequences for MS26+ and MS26++ variant meganucleases are SEQ ID NOS: 419, 277-279 and SEQ ID NO: 280, respectively.

A. Vector Construction for Plant Expression Vectors of the Meganuclease Genes and Repair (Donor) DNAs for Transgene Integration by Homologous Recombination

Coding parts of the MS26+ variants were introduced into the test vector PHP51583 containing a slot for a meganuclease driven by the ubiquitin promoter, a fusion of two marker genes, MoPAT and DsRed, also under the control of the ubiquitin promoter, and a kanamycin resistance gene.

The resulting constructs were delivered into the scutellum cells of maize immature embryos via microprojectile bombardment as described in Example 1. Developmental genes (BBM and WUS) were also delivered by co-bombardment (Examples 1 and 2).

B. Meganuclease Activity of MS26+ Variants in Maize

Callus tissue of transgenic events was collected, and total genomic DNA was extracted and used as a template to amplify a DNA fragment of about 1 kb comprising the MS26 recognition site. Frequencies of mutations at the MS26 recognition site (target site mutation rate) were estimated by digesting the fragments with the BsiWI restriction nuclease, which cuts within the intact MS26 recognition site. The frequency of mutations was calculated based on the percentage of remaining (uncut) fragment, which indicates mutations at the target site. Events with at least 50% undigested fragment were taken to indicate that at least one allele had been cut in the first stages of development and were therefore scored as mutations. Unlike in the case of LIG3-4 (Example 9), no decrease in the frequency of event recovery was observed for the MS26+ variants when compared to the parental MS26+. All three MS26+ variants yielded higher mutation frequencies than the MS26+ meganuclease (Table 9). While MS26+ (B65) and MS26+ (7) demonstrated a moderate increase in meganuclease activity (3 and 4 fold increase, respectively), MS26+ (15) demonstrated approximately a 10 fold increase in activity (Table 9).
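The digestion-based scoring described above can be sketched as follows. BsiWI recognizes the sequence CGTACG, which is present in the MS26 recognition site (SEQ ID NO: 269). The 50% threshold comes from the text; the function names, the example deletion allele and the undigested fractions are hypothetical illustrations rather than data from the experiments.

```python
BSIWI_SITE = "CGTACG"  # BsiWI recognition sequence
MS26_SITE = "gatggtgacgtacgtgccctac".upper()  # SEQ ID NO: 269


def has_bsiwi_site(sequence):
    """True if the sequence still contains an intact BsiWI recognition sequence."""
    return BSIWI_SITE in sequence.upper()


def is_mutant_event(undigested_fraction, threshold=0.5):
    """Score an event as mutated when at least `threshold` of the amplicon resists digestion."""
    return undigested_fraction >= threshold


def ts_mutation_rate(undigested_fractions, threshold=0.5):
    """Fraction of events scored as mutated."""
    if not undigested_fractions:
        raise ValueError("no events supplied")
    calls = [is_mutant_event(f, threshold) for f in undigested_fractions]
    return sum(calls) / len(calls)


# Hypothetical allele with a 4 bp deletion inside the site, for illustration only.
hypothetical_deletion = "GATGGTGACGTGCCCTAC"

print(has_bsiwi_site(MS26_SITE))              # intact site is cut by BsiWI
print(has_bsiwi_site(hypothetical_deletion))  # deletion removes the BsiWI sequence
print(ts_mutation_rate([0.02, 0.55, 0.0, 1.0]))  # hypothetical events
```

An allele that has lost the CGTACG sequence is no longer cut, so the undigested fraction of the amplicon rises with the proportion of mutated alleles in the tissue.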


TABLE 9
Activity of MS26+ and MS26+ variant
meganucleases as determined by target site
mutation rate (TS mutation rate) in plant tissue.

Meganuclease    Number of events analyzed   TS Mutation Rate
MS26+           282                         2%
MS26+ (7)       191                         9%
MS26+ (15)      227                         25%
MS26+ (B65)     176                         7%
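The fold increases quoted for the MS26+ variants can be compared against ratios computed from the Table 9 percentages. The sketch below uses the values as printed in Table 9; the function name is hypothetical.

```python
# TS mutation rates (percent) copied from Table 9.
TABLE_9 = {
    "MS26+": 2,
    "MS26+ (7)": 9,
    "MS26+ (15)": 25,
    "MS26+ (B65)": 7,
}


def fold_change(rates, control):
    """Ratio of each variant's mutation rate to the control's mutation rate."""
    base = rates[control]
    if base == 0:
        raise ZeroDivisionError("control rate is zero; fold change is undefined")
    return {name: rate / base for name, rate in rates.items() if name != control}


for name, ratio in fold_change(TABLE_9, "MS26+").items():
    print(f"{name}: {ratio:.1f}-fold")
```

With the printed percentages the ratios come out as 4.5, 12.5 and 3.5 for MS26+ (7), MS26+ (15) and MS26+ (B65), respectively. The difference from the 4, 10 and 3 fold figures in the text may reflect rounding of the tabulated percentages, particularly the 2% control value.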

Introducing the same amino acid modifications (mutations) as LIG3-4 (15) into MS26++ resulted in a dramatic increase of meganuclease activity as measured by the mutation rate of MS26++ (15) (44%) when compared to MS26++ (7%) (Table 10). These data indicate that nearly half of all events analyzed carried mutations at the MS26 recognition site.


TABLE 10
Activity of MS26++ and MS26++ variant meganucleases as determined
by target site mutation rate (TS mutation rate) in plant tissue.

Meganuclease    Number of events analyzed   TS Mutation Rate
MS26++          189                         7%
MS26++ (15)     185                         44%

Example 14

Creation of MHP14 and MHP14+ Variant Meganucleases

A. MHP14 & MHP14+ Meganucleases and MHP14 Recognition Site

An endogenous maize genomic target site comprising the MHP14 recognition site (SEQ ID NO: 281) was selected for design of a custom double-strand break inducing agent. The MHP14 recognition site is a 22 bp polynucleotide having the following sequence:

(SEQ ID NO: 281) caaacagattcacgtcagattt

Wild type I-CreI meganuclease was modified to produce the engineered meganucleases MHP14 (SEQ ID NO: 282) and MHP14+ (SEQ ID NO: 283) designed to recognize the MHP14 recognition sequence. The design of custom made meganucleases has been described in US Patent Application Publication Number US 2007/0117128 A1.

B. MHP14 and MHP14+ Variant Meganucleases

Variants of the MHP14 meganuclease were created through gene shuffling methods in a manner similar to how the LIG3-4 variants were created and described in Example 3. This involved the introduction of amino acid modifications as found in naturally occurring meganuclease proteins and previously identified in LIG3-4 variants as well as random mutation. The shuffling process resulted in generation of MHP14 variants with recombination of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 284-298). Corresponding DNA sequences are SEQ ID NO: 300-314.

Mutations from five MHP14 variants, MHP14 (04), MHP14 (06), MHP14 (08), MHP14 (12) and MHP14 (14), were introduced into MHP14+, resulting in MHP14+(04) (SEQ ID NO: 315), MHP14+ (06) (SEQ ID NO: 316), MHP14+ (08) (SEQ ID NO: 317), MHP14+ (12) (SEQ ID NO: 318), MHP14+ (14) (SEQ ID NO: 319), respectively. One additional variant was generated by introduction of the G19S mutation from LIG3-4 (15) into MHP14+, resulting in MHP14+ (15) (SEQ ID NO: 320). These mutations were introduced into MHP14+ through standard molecular biology techniques.

Example 15

Analysis of Meganuclease Activity of MHP14 and MHP14+ Variants in Yeast and Maize

A total of 15 MHP14 variants with increased activity were confirmed in the yeast system (as described in Example 6). Increased activity was observed across a range of temperatures: 28° C., 34° C. and 37° C., as shown in Table 11.


TABLE 11
Activity of MHP14 variant meganucleases in yeast Screening Strain
assayed at different temperatures.
meganuclease
28° C.
34° C.
37° C.
MHP14
0
2
2
MHP14(L14-07)
0.5
4
MHP14(01)
0
3
3
MHP14(06)
0
3.5
3.5
MHP14(L14-04)
0
3
MHP14(08)
1
4
4
MHP14(07)
0.5
2.5
2.5
MHP14(03)
0.5
3
3
MHP14(04)
2
x
x
MHP14(02)
2
4
4
MHP14(13)
1
4
3.5
MHP14(L14-03)
0
3
MHP14(14)
1
4
4
MHP14(09)
2
4
4
MHP14(12)
1.5
4
4
MHP14(10)
1
4
4

Large increases in meganuclease activity (high scores) were observed.

FIG. 10A-FIG. 10D show the amino acid modifications of MHP14 variant meganucleases relative to the MHP14 parental meganuclease. A (-) indicates that the amino acid is identical to MHP14.

Results from activity screening of five MHP14+ variants are shown in Table 12.


TABLE 12
Activity of MHP14+ variant meganucleases in yeast Screening Strain
assayed at different temperatures.

Meganuclease    28° C.   37° C.
MHP14           0        2
MHP14+ (04)     2        X
MHP14+ (06)     0        4
MHP14+ (08)     1        4
MHP14+ (12)     2        4
MHP14+ (14)     1        4
MHP14+ (15)

All MHP14+ variants showed higher activity in the yeast assay screened at 37° C. when compared to the MHP14 meganuclease (Table 12). Variants MHP14+ (04), MHP14+ (08), MHP14+ (12) and MHP14+ (14) showed increased activity even when assayed at the lower temperature of 28° C. (Table 12).

Genes encoding the MHP14 and MHP14+ variant meganucleases were optimized for expression in plants. The engineered meganuclease expression cassettes contained the maize codon-optimized nucleotide sequences for better performance in maize cells. The meganuclease gene sequences were also supplemented with DNA sequences encoding an SV40 nuclear localization signal, resulting in the plant optimized sequences of SEQ ID NOS: 321-327. The maize ubiquitin promoter and the potato proteinase inhibitor II gene terminator sequences completed the endonuclease gene designs.

Testing and analysis of meganuclease activity of the MHP14+ variants in planta was performed as described for the MS26+ and MS26++ variants (Example 13), and results are shown in Table 13.


TABLE 13
Activity of MHP14 and MHP14+ variant meganucleases
in maize as determined by target site mutation rate (TS
mutation rate) in plant tissue.

Meganuclease    Number of events analyzed   TS Mutation Rate
MHP14           192                         13%
MHP14+ (04)     192                         38%
MHP14+ (06)     192                         7%
MHP14+ (08)     192                         25%
MHP14+ (12)     192                         47%
MHP14+ (14)     192                         39%
MHP14+ (15)     192                         20%

Two variants, MHP14+ (04) and MHP14+ (08), while demonstrating higher activity, also showed rather high levels of toxicity. MHP14+ (06) showed no difference in either toxicity or activity when compared to MHP14. Two variants, MHP14+ (12) and MHP14+ (14), demonstrated high levels of activity without increased toxicity. The MHP14+ (15) variant showed a moderate increase in activity and no increase in toxicity (Table 13).

Example 16

DNA Shuffling to Create Variants of MP107 Meganuclease

An endogenous maize genomic target site comprising the MP107 recognition sequence (SEQ ID NO: 328) was selected for design of a custom double-strand break inducing agent. The MP107 recognition site is a 22 bp polynucleotide having the following sequence:

ctagtatacgtgagagaccttg (SEQ ID NO: 328).

An engineered MP107 meganuclease (SEQ ID NO: 329) was produced as described in Example 3.

The first phase of MP107 meganuclease optimization was designed to introduce amino acid modifications into the MP107 meganuclease as described in Example 3. Libraries were based on introduction of mutations previously identified in LIG3-4, MHP14 and MHP77 variants with increased activity.

The shuffling process resulted in generation of variants with recombination of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 330-341). Corresponding nucleotide sequences are shown in SEQ ID NOS: 343-354.

A total of 6 MHP107 variants with increased activity were confirmed in the yeast system (as described in Example 6). Increased activity was observed across a range of temperatures: 28° C., 30° C. and 37° C., as shown in Table 14.


TABLE 14
Activity of MP107 variant Meganucleases in Yeast Screening Strain
assayed at different temperatures.
meganuclease
28° C.
30° C.
37° C.
MHP107
0
0
0
MHP107(D1)
0
0
0
MHP107(D5)
0.5
1.5
MHP107(D3)
0.5
2
MHP107(D2)
0
0
MHP107(C6)
0.5
1
3
MHP107(C4)
0
0
MHP107(D4)
0
2
MHP107(C5)
0
1
MHP107(C1)
2
3
4
MHP107(C2)
0
0
MHP107(D6)
0
0
0
MHP107(C3)
0
0

FIG. 11 shows the amino acid modifications of MP107 variants relative to the MP107 parental meganuclease. A (-) indicates that the amino acid is identical to MP107.

Example 17

DNA Shuffling to Create Variants of Zm6.3 Meganuclease

An endogenous maize genomic target site comprising the Zm6.3 recognition sequence (SEQ ID NO: 355) was selected for design of a custom double-strand break inducing agent. The Zm6.3 recognition site is a 22 bp polynucleotide having the following sequence: caggctctcgtaaatgcgcctg (SEQ ID NO: 355).

An engineered Zm6.3 meganuclease (SEQ ID NO: 356) was produced as described in Example 3.

The first phase of Zm6.3 meganuclease optimization was designed to introduce amino acid modifications into the Zm6.3 meganuclease as described in Example 3. Libraries were based on introduction of mutations previously identified in LIG3-4, MHP14 and MHP77 variants with increased activity.

The shuffling process resulted in generation of variants with recombination of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 357-371). Corresponding nucleotide sequences are shown in SEQ ID NOS: 373-387.

A total of 15 Zm6.3 variants with increased activity were confirmed in the yeast system (as described in Example 6). Increased activity was observed across a range of temperatures: 28° C., 30° C. and 37° C., as shown in Table 15.


TABLE 15
Activity of Zm6.3 variant Meganucleases in Yeast Screening Strain
assayed at different temperatures.

Meganuclease    28° C.   30° C.   37° C.
ZM6.3           0        0.5      2
ZM6.3(4)        1        2        4
ZM6.3(3)        0        0        2
ZM6.3(5)        0.5      1        4
ZM6.3(H2)       2        2        4
ZM6.3(H3)       2        2        4
ZM6.3(1)        1        1.5      4
ZM6.3(G4)       2        2.5      4
ZM6.3(G1)       4        4        4
ZM6.3(G5)       0        0        2
ZM6.3(G2)       2.5      4        4
ZM6.3(H1)       4        4        4
ZM6.3(G6)       4        4        4
ZM6.3(G3)       2        2.5      4
ZM6.3(H6)       4        4        4
ZM6.3(H5)       4        4        4

FIG. 12 shows the amino acid modifications of Zm6.3 variants relative to the Zm6.3 parental meganuclease. A (-) indicates that the amino acid is identical to Zm6.3.

Example 18

DNA Shuffling to Create Variants of Zm6.22v2 Meganuclease

An endogenous maize genomic target site comprising the Zm6.22v2 recognition sequence (SEQ ID NO: 388) was selected for design of a custom double-strand break inducing agent. The Zm6.22v2 recognition site is a 22 bp polynucleotide having the following sequence: attgctctctcacatactttta (SEQ ID NO: 388).
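Each maize recognition site in Examples 10 through 18 is described as a 22 bp polynucleotide. A short sketch can confirm that the sequences as printed have that length and contain only the four DNA bases. The dictionary and function names are hypothetical; the sequences and SEQ ID numbers are copied from the text.

```python
# Recognition sites as printed in Examples 10, 12, 14, 16, 17 and 18.
RECOGNITION_SITES = {
    "MHP77 (SEQ ID NO: 85)": "GGGCGGTATGTATGTCATACTA",
    "MS26 (SEQ ID NO: 269)": "gatggtgacgtacgtgccctac",
    "MHP14 (SEQ ID NO: 281)": "caaacagattcacgtcagattt",
    "MP107 (SEQ ID NO: 328)": "ctagtatacgtgagagaccttg",
    "Zm6.3 (SEQ ID NO: 355)": "caggctctcgtaaatgcgcctg",
    "Zm6.22v2 (SEQ ID NO: 388)": "attgctctctcacatactttta",
}

DNA_BASES = set("ACGT")


def check_site(sequence, expected_length=22):
    """True if the sequence has the expected length and only contains A, C, G, T."""
    sequence = sequence.upper()
    return len(sequence) == expected_length and set(sequence) <= DNA_BASES


for name, sequence in RECOGNITION_SITES.items():
    print(name, len(sequence), check_site(sequence))
```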

An engineered Zm6.22v2 meganuclease (SEQ ID NO: 389) was produced as described in Example 3.

The first phase of Zm6.22v2 meganuclease optimization was designed to introduce amino acid modifications into the Zm6.22v2 meganuclease as described in Example 3. Libraries were based on introduction of mutations previously identified in LIG3-4, MHP14 and MHP77 variants with increased activity.

The shuffling process resulted in generation of variants with recombination of amino acid modifications, unintended amino acid modifications due to mutagenic PCR, deletions, and insertions (SEQ ID NOS: 390-403). Corresponding nucleotide sequences are shown in SEQ ID NOS: 405-418.

A total of 13 ZM6.22v2 variants with increased activity were confirmed in the yeast system (as described in Example 6). Increased activity was observed across a range of temperatures: 28° C., 30° C. and 37° C., as shown in Table 16.


TABLE 16
Activity of ZM6.22v2 variant Meganucleases in Yeast Screening Strain
assayed at different temperatures.
meganuclease
28° C.
30° C.
37° C.
ZM6.22v2
0
0
1
ZM6.22v2(I2)
1
2
x
ZM6.22v2(J5)
0.5
4
ZM6.22v2(J8)
1
3
ZM6.22v2(J3)
0.5
3
ZM6.22v2(J4)
0.5
3.5
ZM6.22v2(J7)
0.5
3
ZM6.22v2(I6)
0.5
1
2
ZM6.22v2(I4)
0
0
3
ZM6.22v2(I3)
0
0
2
ZM6.22v2(I5)
0
0
0
ZM6.22v2(J2)
0
2
ZM6.22v2(I9)
0
2
ZM6.22v2(I7)
0
2
ZM6.22v2(I8)
0.5
2.5

FIG. 13 shows the amino acid modifications of Zm6.22v2 variants relative to the Zm6.22v2 parental meganuclease. A (-) indicates that the amino acid is identical to Zm6.22v2.

Example 19

Use of Different Amino Acid Linker Sequences to Create Meganucleases with Increased Activity

As discussed in Example 3, all variant meganucleases comprised a linker polypeptide that links the two re-engineered I-CreI monomers into a single amino acid chain.

The variant meganucleases MHP14(10) (SEQ ID NO: 292) and MHP77(L9-01) (SEQ ID NO: 92) were created as described in Examples 14 and 10, respectively. These variant meganucleases were also characterized by having a different linker sequence when compared to the linker sequence in their respective parent meganucleases (FIG. 15A-FIG. 15D). In MHP14(10), a frameshift occurred at the second codon of the linker, E160, and the reading frame was restored at S193, the last residue of the linker. In MHP77(L9-01), a frameshift occurred at the first codon of the linker, W159, and the reading frame was restored at L198. Thus, the first 4 amino acids of the second unit of the linked dimer were changed. These data indicate that variant meganucleases can be created with a diverse linker sequence while still obtaining increased meganuclease activity.

Alignment of the entire amino acid sequences (FIG. 15B) of LIG3-4 (SEQ ID NO: 1), MHP14 (SEQ ID NO: 282), MHP14(10) (SEQ ID NO: 292), MHP77 (SEQ ID NO: 86), and MHP77(L9-01) (SEQ ID NO: 92) revealed percent identities as low as 80.8%. Hence, variant meganucleases were created that had increased meganuclease activity while having only about 80% identity to the parental meganuclease.
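A percent identity figure of this kind is computed from aligned sequences. The sketch below shows one common convention, in which columns where both sequences carry a gap are skipped and a gap opposite a residue counts as a mismatch. The text does not state which convention or alignment program was used, so this choice is an assumption, and the short example strings are hypothetical rather than the aligned sequences of FIG. 15B.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: columns that are gaps in both sequences are ignored;
    a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Hypothetical ten-residue alignments, for illustration only.
print(percent_identity("MNTKYNKEFL", "MNTKHNKEFL"))  # one substitution
print(percent_identity("MNTK-NKEFL", "MNTKYNKEFL"))  # one gap
```

Other conventions, such as dividing by the length of the shorter sequence, give slightly different values, which is why the convention should be stated alongside any reported identity.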

Example 20

Identification of Amino Acid Modifications in Structural Motifs of Meganucleases

An analysis of the physical positions of amino acid modifications responsible for increased meganuclease activity was performed using a three dimensional structure model of the I-CreI meganuclease dimer (Chevalier, et al., (2001) Nat. Struct. Biol. 8:312). Amino acid modifications in alpha helix-1 positions 12, 16 and 19 were associated with increased activity observed with several meganuclease variants as shown in FIG. 16. Alpha helix-1 encompasses amino acids 8 through 19 on subunit number 1 and amino acids 195 through 206 on subunit number 2 in SEQ ID NO: 1. Additionally, amino acid modifications in alpha helix-5 positions 121, 124, 129, 131 and 132 were associated with increased activity observed in several meganuclease variants as shown in FIG. 16. Alpha helix-5 encompasses amino acids 120-135 on subunit number 1 and amino acids 307 through 322 on subunit number 2 in SEQ ID NO: 1. We predict that additional amino acid modifications in alpha helix-1 and alpha helix-5 have the potential to result in meganuclease variants with increased activity over the corresponding reference meganucleases.
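The helix ranges quoted above imply a constant offset of 187 between equivalent positions in subunit 1 and subunit 2 of the single-chain meganuclease (195 - 8 = 187 and 307 - 120 = 187). The sketch below uses that inferred offset to map positions and to report which helix a position falls in. The offset is derived from the ranges in the text and is not stated there explicitly, and the function names are hypothetical.

```python
# Offset inferred from the helix ranges in the text: 195 - 8 = 307 - 120 = 187.
SUBUNIT_OFFSET = 187

# Helix ranges on subunit 1, numbered as in SEQ ID NO: 1.
HELICES_SUBUNIT_1 = {
    "alpha helix-1": (8, 19),
    "alpha helix-5": (120, 135),
}


def to_subunit_2(position):
    """Map a subunit-1 position to the equivalent subunit-2 position."""
    return position + SUBUNIT_OFFSET


def locate(position):
    """Return (subunit, helix name) for a position, or None if outside both helices."""
    for name, (start, end) in HELICES_SUBUNIT_1.items():
        if start <= position <= end:
            return (1, name)
        if to_subunit_2(start) <= position <= to_subunit_2(end):
            return (2, name)
    return None


# Positions named in the text as associated with increased activity.
for position in (12, 16, 19, 121, 124, 129, 131, 132):
    print(position, to_subunit_2(position), locate(position))
```

The same offset relates positions 129 and 132 on subunit 1 to positions 316 and 319 on subunit 2.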

Example 21

Transfer of at Least One Amino Acid Modification to Other Meganuclease to Create Variant Meganuclease with Increased Activity

As described in Examples 3-19, any one of the amino acid modifications identified in Examples 3-19 can be transferred to a parental meganuclease to create a variant meganuclease with increased activity. FIG. 14A-FIG. 14F list a subset of variant I-CreI type meganucleases with increased activity. Any of these amino acid modifications can be combined to create a new variant with increased activity.

One embodiment of this invention is the transfer of at least one amino acid modification selected from the group of Y12 to H, G19 to S or A, Q50 to K or R, F54 to I, D56 to L, V105 to A, E124 to R, V129 to A, I132 to V or T, D153 to M or L, V316 to A, or I319 to V to a parental meganuclease in order to improve the activity of the parental meganuclease.
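Transferring a named substitution to a parental sequence can be sketched as a simple string operation that first checks the expected parental residue. The example uses residues 1-20 of SEQ ID NO: 1 (LIG3-4) as given in the sequence listing, in which position 12 is Tyr and position 19 is Gly. The function name and the compact notation (for example `G19S`) are illustrative assumptions.

```python
import re

# Residues 1-20 of SEQ ID NO: 1 (LIG3-4), from the sequence listing.
LIG34_N_TERMINUS = "MNTKYNKEFLLYLAGFVDGD"

_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_substitution(sequence, mutation):
    """Apply a substitution such as 'G19S' (1-based position) to a protein sequence."""
    match = _MUTATION.match(mutation)
    if match is None:
        raise ValueError(f"cannot parse mutation {mutation!r}")
    parental, position, replacement = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[position - 1] != parental:
        raise ValueError(
            f"expected {parental} at position {position}, found {sequence[position - 1]}"
        )
    return sequence[: position - 1] + replacement + sequence[position:]


variant = LIG34_N_TERMINUS
for mutation in ("Y12H", "G19S"):
    variant = apply_substitution(variant, mutation)

print(LIG34_N_TERMINUS)
print(variant)
```

Checking the parental residue before substituting guards against numbering mismatches when a modification is transferred between meganucleases whose sequences differ.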

Example 22

Saturated Mutagenesis to Create Variant Meganucleases with Increased Activity

Saturated mutagenesis can be performed at any of the amino acid modification positions described in Examples 3-21. Saturated mutagenesis will result in the production of a set of meganucleases in which one amino acid position is substituted with each of the possible amino acids. This set of meganucleases can then be analyzed for increased activity as described above, identifying further modifications at that amino acid position that result in increased meganuclease activity.
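The set of sequences produced by saturating one position can be enumerated in a few lines. The sketch assumes the 20 standard amino acids and excludes the parental residue, giving 19 variants per position. It uses residues 1-20 of SEQ ID NO: 1 from the sequence listing as a stand-in for a full-length meganuclease, and the function name is hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumption: the 20 standard amino acids

# Residues 1-20 of SEQ ID NO: 1 (LIG3-4), from the sequence listing.
LIG34_N_TERMINUS = "MNTKYNKEFLLYLAGFVDGD"


def saturate(sequence, position):
    """Return {mutation name: variant sequence} for every substitution at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    parental = sequence[position - 1]
    variants = {}
    for residue in AMINO_ACIDS:
        if residue == parental:
            continue  # skip the unchanged parental residue
        name = f"{parental}{position}{residue}"
        variants[name] = sequence[: position - 1] + residue + sequence[position:]
    return variants


library = saturate(LIG34_N_TERMINUS, 19)
print(len(library))
print(sorted(library))
```

The resulting library for position 19 includes the G19S and G19A substitutions named in Example 21 alongside the 17 other possible replacements.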

Example 23

Creation and Analyses of TS21 and TS14 Variant Meganucleases in Soybean

A. TS21 and TS14 Recognition Sites and Meganucleases

An endogenous soybean genomic target site comprising the TS21 recognition sequence (SEQ ID NO: 423) or the TS14 recognition sequence (SEQ ID NO: 424) was selected for design of a custom double-strand break inducing agent. The soybean genomic target sites and design of custom made TS21 and TS14 meganucleases have been described in U.S. patent application Ser. No. 13/427,138, filed on Mar. 22, 2012, which is incorporated by reference in its entirety.

B. TS21 and TS14 Variant Meganucleases

To test whether the LIG3-4 amino acid modifications (and respective nucleotide modifications) can also increase the activity of the soybean TS21 meganuclease and TS14 meganuclease, the exact same nucleotide/amino acid modifications as described for LIG3-4 (7), LIG3-4 (15), and LIG3-4 (B65) (Table 1A) were introduced into the TS21 meganuclease (SEQ ID NOS: 425 and 429) and the TS14 meganuclease (SEQ ID NOS: 433 and 435) through standard molecular biology techniques, resulting in the following three TS21 meganuclease variants and one TS14 meganuclease variant: TS21(7) (SEQ ID NOS: 426 and 430), TS21(15) (SEQ ID NOS: 427 and 431), TS21(B65) (SEQ ID NOS: 428 and 432), and TS14(15) (SEQ ID NOS: 434 and 436).

C. Analyses of Meganuclease Activity of TS21 and TS14 Variants in Soybean

Genes encoding the TS21 and TS14 variant meganucleases were optimized for expression in plants. The engineered meganuclease expression cassettes contained the plant codon-optimized nucleotide sequences for better performance in soybean. The plant expression vectors for these soy variants were made by the same methods as described in U.S. patent application Ser. No. 13/427,138. The soybean ubiquitin promoter and the potato proteinase inhibitor II gene terminator sequences were used for controlling meganuclease expression in soybean. The methods used for soybean transformation, qPCR and genomic PCR assays for the TS21 and TS14 target sites were as described in U.S. patent application Ser. No. 13/427,138. The qPCR assays specific to the TS21 and TS14 recognition sequences were used to identify sequence changes. All hygromycin resistant soybean transgenic events were analyzed by qPCR assays. Changes in the meganuclease target sequence caused by DNA cleavage and repair result in the copy number reduction of the meganuclease target site from two copies in wild type soybean genome to either one or zero copies in the transgenic events. From qPCR analyses of the TS21 and TS14 target sites, it was shown that the copy numbers of the target sites in most of the positive transgenic events were reduced by half, indicating one allele of the recognition sites in soybean genome was disrupted by meganuclease cutting/DNA repair mechanism. As shown in Table 17, introducing the same amino acid modifications (mutations) as LIG3-4 variants into the TS21 meganuclease resulted in a dramatic increase of TS21 target site mutation rates for the TS21(7) variant meganuclease (32.1%) and the TS21(15) variant meganuclease (17.2%), a moderate increase for the TS21(B65) variant meganuclease when compared to the parental TS21 meganuclease (8.7%). 
As shown in Table 18, introducing the LIG3-4(15) mutation into the TS14 meganuclease decreased the TS14 target site mutation rate from 16% for the parental TS14 meganuclease to 4% for the TS14(15) variant meganuclease.
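The copy-number reasoning above (two target-site copies in wild type, one or zero after cleavage and repair) can be illustrated with a short sketch. This is only an illustration: the cutoffs `ZERO_COPY_MAX` and `ONE_COPY_MAX`, the function names, and the example copy-number estimates are all hypothetical, since the source does not state how qPCR values were converted into copy-number calls.

```python
from collections import Counter

# Hypothetical cutoffs for calling target-site copy number from a qPCR
# estimate; the source does not state the thresholds that were used.
ZERO_COPY_MAX = 0.5
ONE_COPY_MAX = 1.5

def call_target_site(copy_estimate: float) -> str:
    """Classify one transgenic event by its estimated target-site copy number."""
    if copy_estimate < ZERO_COPY_MAX:
        return "both alleles changed"   # zero intact copies
    if copy_estimate < ONE_COPY_MAX:
        return "one allele changed"     # one intact copy
    return "unchanged"                  # two intact copies, as in wild type

def mutation_rate(copy_estimates: list[float]) -> float:
    """Fraction of events with at least one changed target-site allele."""
    calls = Counter(call_target_site(c) for c in copy_estimates)
    changed = calls["one allele changed"] + calls["both alleles changed"]
    return changed / len(copy_estimates)

# Made-up copy-number estimates for ten events (illustration only).
example_events = [2.1, 1.9, 1.0, 2.0, 0.1, 1.1, 2.2, 1.8, 2.0, 1.9]
print(f"TS mutation rate: {mutation_rate(example_events):.1%}")
```

Under these assumed cutoffs, an event counts toward the TS mutation rate as soon as one allele is changed, which matches how the rates in Tables 17 and 18 are described.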


TABLE 17
Activity of TS21 variant meganucleases as determined by target site qPCR hit rate (TS mutation rate) in soybean

Meganuclease    Number of events analyzed    TS Mutation Rate
TS21            184                          8.7%
TS21 (7)        187                          32.1%
TS21 (15)       192                          17.2%
TS21 (B65)      134                          12.7%


TABLE 18
Activity of TS14 variant meganuclease as determined by target site mutation rate (TS mutation rate) in soybean

Meganuclease    Number of events analyzed    TS Mutation Rate
TS14            183                          16%
TS14 (15)       192                          4%
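
The rates in Tables 17 and 18 can be compared with the parental meganucleases as simple ratios. The sketch below copies the percentages from the two tables; the `fold_change` helper and the dictionary layout are illustrative choices, not part of the described method.

```python
# Rates copied from Tables 17 and 18 (percent of events with a changed target site).
ts21_rates = {"TS21": 8.7, "TS21 (7)": 32.1, "TS21 (15)": 17.2, "TS21 (B65)": 12.7}
ts14_rates = {"TS14": 16.0, "TS14 (15)": 4.0}

def fold_change(rates: dict[str, float], parent: str) -> dict[str, float]:
    """Return each variant's rate divided by the parental rate."""
    base = rates[parent]
    return {name: rate / base for name, rate in rates.items() if name != parent}

for parent, table in (("TS21", ts21_rates), ("TS14", ts14_rates)):
    for name, fold in fold_change(table, parent).items():
        print(f"{name}: {fold:.2f}x relative to {parent}")
```

A ratio above 1 indicates a higher mutation rate than the parental meganuclease, and a ratio below 1 indicates a lower rate, as for TS14 (15).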


SEQUENCE LISTING
The patent contains a lengthy “Sequence Listing” section. A copy of the “Sequence Listing” is available in electronic form from the USPTO web site. An electronic copy of the “Sequence Listing” will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3).

<160> NUMBER OF SEQ ID NOS: 440

<140> CURRENT APPLICATION NUMBER: US/15/867,385

<210> SEQ ID NO 1

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4

<400> SEQUENCE: 1

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 2

<211> LENGTH: 22

<212> TYPE: DNA

<213> ORGANISM: zea mays

<400> SEQUENCE: 2

atatacctca cacgtacgcg ta 22

<210> SEQ ID NO 3

<211> LENGTH: 163

<212> TYPE: PRT

<213> ORGANISM: Chlamydomonas reinhardtii

<400> SEQUENCE: 3

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Tyr Lys Phe Lys His Gln Leu Ser Leu Ala Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Glu

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Trp Arg Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys

145 150 155 160

Ser Ser Pro

<210> SEQ ID NO 4

<211> LENGTH: 171

<212> TYPE: PRT

<213> ORGANISM: Carteria lunzensis

<400> SEQUENCE: 4

Met Asn Lys Phe Thr Pro Asp Gln Leu Leu Tyr Leu Ala Gly Leu Ile

1 5 10 15

Asp Gly Asp Gly Ser Ile Ile Ala Gln Leu Val Ser Arg Lys Asp Tyr

20 25 30

Thr Trp Glu Phe Gln Ile Arg Leu Thr Val Gln Val Thr Gln Leu Lys

35 40 45

Lys Arg Arg Trp Phe Leu Glu Glu Leu Gln Lys Glu Ile Gly Ala Gly

50 55 60

Ser Val Arg Asp Arg Asp Thr Val Ser Asp Tyr Ile Leu Thr Glu Thr

65 70 75 80

Ser Asn Val Tyr Lys Phe Leu Lys Asp Leu Gln Pro His Leu Arg Leu

85 90 95

Lys Gln Lys Gln Ala Asn Leu Val Leu Arg Ile Ile Glu Gln Leu Pro

100 105 110

Ser Ser Lys Ala Ser Lys Glu Ile Phe Leu Glu Leu Cys Asn Val Val

115 120 125

Asp His Val Ala Thr Leu Asn Asp Thr Lys Lys Arg Lys Tyr Thr Ala

130 135 140

Glu Ile Val Ala Ala Lys Leu Lys Glu Leu Lys Glu Cys Val Val Pro

145 150 155 160

Val Glu Thr Ser Glu Glu Thr Asn Ser Gly Ile

165 170

<210> SEQ ID NO 5

<211> LENGTH: 167

<212> TYPE: PRT

<213> ORGANISM: Scenedesmus obliquus

<400> SEQUENCE: 5

Met Thr Asn Asn Asn Met Gln Asn Lys Gly Met Lys Ile Ile Asp Lys

1 5 10 15

Asp Glu Leu Ile Tyr Leu Ala Gly Phe Ile Asp Gly Asp Gly Ser Leu

20 25 30

Ile Ala Gln Met Val Arg Arg His Asp Tyr Lys Phe Lys Tyr Gln Ile

35 40 45

Lys Cys Thr Val Gln Ile Thr Gln Leu Lys Lys Arg Arg His Phe Leu

50 55 60

Glu Lys Ile Gln Glu Ser Ile Gly Tyr Gly Ile Ile Arg Asp Arg Gly

65 70 75 80

Thr Ile Ser Asp Tyr Val Leu Val Glu Pro Lys Cys Val Tyr Trp Leu

85 90 95

Leu Lys Gln Leu Ser Pro Phe Leu Arg Leu Lys Lys Lys Gln Ala Asp

100 105 110

Leu Ile Ile Arg Ile Ile Glu Gln Leu Thr Ser Ser Lys Asn Ser Ala

115 120 125

Val Leu Phe Val Gln Leu Cys Arg Leu Thr Asp Gln Val Ala Leu Leu

130 135 140

Asn Asp Ser Lys Ser Arg Thr Ile Thr Ala Glu Val Val Glu Thr Thr

145 150 155 160

Leu Arg Glu Leu Gly Leu Ile

165

<210> SEQ ID NO 6

<211> LENGTH: 166

<212> TYPE: PRT

<213> ORGANISM: Haematococcus lacustris

<400> SEQUENCE: 6

Met Lys Asn Ile Asn Ser Thr Arg Phe Ser His Leu Thr Asn Glu Gln

1 5 10 15

Lys Ala Tyr Leu Ala Gly Phe Ile Asp Cys Asp Gly Ser Leu Met Ala

20 25 30

Gln Ile Val Arg Lys Pro Asp Tyr Ala Tyr Lys Phe Gln Ile Arg Val

35 40 45

Thr Ile Gln Leu Ser Gln Arg Thr Ser Arg Ile His Phe Leu Lys Glu

50 55 60

Ile Ala Ser Glu Val Gly Tyr Gly Tyr Val Val Ser Arg Asn Asn Met

65 70 75 80

Ser Asp Tyr Val Ile Thr Gln Ala Asn Ile Val Tyr Glu Leu Leu Ser

85 90 95

Leu Leu Leu Pro Tyr Leu Arg Met Lys Val Lys Gln Ala Asn Leu Ile

100 105 110

Leu Lys Ile Ile Gln Glu Leu Pro Ser Ala Lys Val Ser Lys Asp Lys

115 120 125

Phe Ile Glu Leu Cys Ile Leu Ala Asn Gln Val Ser Ile Leu Asn Thr

130 135 140

Pro Asn Lys Ile Leu Lys Asn Thr Trp Gln Val Val Lys Ala Glu Leu

145 150 155 160

Glu Ser Glu Asp Leu Gln

165

<210> SEQ ID NO 7

<211> LENGTH: 162

<212> TYPE: PRT

<213> ORGANISM: Mesostigma viride

<400> SEQUENCE: 7

Met Lys Leu Ile Asp Glu Lys Val Cys Ala Tyr Ile Ala Gly Phe Leu

1 5 10 15

Asp Gly Asp Gly Cys Leu Met Ala Gln Ile Ile Lys Gly Val Tyr Lys

20 25 30

Tyr Lys Phe Arg Ile Arg Leu Ser Ile Val Phe Tyr Gln His Ala Lys

35 40 45

Arg Lys Trp Phe Leu Leu Gln Leu Lys Lys Leu Phe Asp Asp Ile Gly

50 55 60

Tyr Val Arg Ile His Lys Thr Asn Asn Met Cys Asp Tyr Thr Ile Thr

65 70 75 80

Gly Ser Arg Ser Ile Glu His Ile Leu Leu Gln Leu Met Pro Tyr Ile

85 90 95

Gln Leu Lys Lys Thr Ser Ala Ala Leu Met Leu Gln Leu Ile Lys Lys

100 105 110

Glu Lys Leu Val Thr Thr Lys Ala Asp Phe Ile Glu Val Cys Gln Leu

115 120 125

Val Asp Lys Ile Ala Glu Gln Asn Phe Ser Lys Lys Arg Ile Ile Thr

130 135 140

Ser Lys Val Val Glu Ala Thr Leu Met Leu Pro Val Glu Thr Glu Lys

145 150 155 160

Ser Ser

<210> SEQ ID NO 8

<211> LENGTH: 165

<212> TYPE: PRT

<213> ORGANISM: Oltmannsiellopsis viridis

<400> SEQUENCE: 8

Met Val Ile Pro Arg Glu Asp Leu Leu Asn Ile Thr Met Asn Leu Thr

1 5 10 15

Glu Gln Glu Lys Ile Tyr Leu Gly Gly Phe Val Asp Gly Asp Gly Cys

20 25 30

Ile Asn Ala Gln Ile Val Arg Arg Lys Asp Tyr Leu Leu Lys Phe Gln

35 40 45

Ile Arg Val Thr Val Ser Phe Tyr Gln Lys Thr Lys Arg His Trp Phe

50 55 60

Ile Lys Trp Val His Lys Lys Leu Lys Tyr Gly Ser Ile Arg Lys Arg

65 70 75 80

Asn Asp Gly Met Ser Glu Tyr Asn Ile Val Gly Asn His Ala Val Lys

85 90 95

Lys Val Val Asn Glu Leu Gln Pro Tyr Ile Arg Ile Lys Gln Pro Gln

100 105 110

Val Arg Leu Val Leu Glu Ile Ile Glu Lys Leu Pro Glu Ala Lys Asp

115 120 125

Pro Leu Thr Phe Val Ala Leu Cys Glu Arg Val Asp Leu Phe Glu Arg

130 135 140

Leu Asn Asp Ser Lys Lys Arg Val Ile Thr Ser Glu Thr Val Arg Ser

145 150 155 160

Glu Leu Gly Ile Asp

165

<210> SEQ ID NO 9

<211> LENGTH: 168

<212> TYPE: PRT

<213> ORGANISM: Trichosarcina mucosa

<400> SEQUENCE: 9

Met Ser Lys Leu Lys Lys Leu Thr Pro Glu Glu Leu Cys Tyr Leu Ala

1 5 10 15

Gly Phe Leu Asp Gly Asp Gly Cys Ile Asn Ala Gln Ile Val Arg Arg

20 25 30

Ser Asp Tyr Lys Leu Lys Phe Gln Ile Arg Val Ser Ile Thr Phe Phe

35 40 45

Gln Lys Thr Asn Arg His Trp Phe Leu Ile Trp Leu Asp Lys Lys Leu

50 55 60

Asp Cys Gly Thr Leu Arg Lys Arg Pro Asp Lys Met Ser Glu Tyr Ala

65 70 75 80

Ile Ile Gly Ile Ala Ser Val Arg Asn Ile Leu Leu Leu Phe Lys Pro

85 90 95

Tyr Leu Lys Leu Lys Lys Arg Gln Ala Ile Leu Leu Leu Lys Val Ile

100 105 110

Glu Lys Met Pro Ser Thr Gln Asn Asp Pro Gln Ala Phe Phe Lys Leu

115 120 125

Cys Glu Gln Val Asp Gln Phe Ser Glu Phe Asn Asp Ser Lys Lys Arg

130 135 140

Lys Leu Thr Ser Gln Val Val Arg Ser Glu Ile Gly Thr Ala Ile His

145 150 155 160

Leu Phe Pro Val Glu Thr Glu Thr

165

<210> SEQ ID NO 10

<211> LENGTH: 170

<212> TYPE: PRT

<213> ORGANISM: Chlorokybus atmophyticus

<400> SEQUENCE: 10

Met Asn Val Ile Lys Glu Ser Glu Lys Ala Tyr Ile Ala Gly Phe Leu

1 5 10 15

Asp Gly Asp Gly Cys Ile Asn Ala Gln Leu Val Arg Arg Ser Gly Tyr

20 25 30

Arg Leu Leu Tyr Gln Ile Arg Val Ser Ile Thr Phe Phe Gln Ser Ser

35 40 45

Lys Arg His Trp Phe Leu Leu Gln Leu His Lys Arg Leu Gly Cys Gly

50 55 60

Ser Val Arg Lys Arg Asn Asp Gly Ile Ser Glu Tyr Cys Ile Val Gly

65 70 75 80

Ile Ala Ser Val Ser Ala Val Cys Glu Gln Leu Leu Pro Tyr Leu Arg

85 90 95

Leu Lys Arg Arg Gln Ala Val Leu Ile Cys Thr Ile Ile Lys Ser Leu

100 105 110

Ser Arg His Gln Ser Lys Glu Asp Phe Leu Ser Leu Cys Glu Leu Ala

115 120 125

Asp Gln Ile Ser Phe Leu Asn Asn Ser Lys Lys Arg Ser Asn Thr Ala

130 135 140

Cys Leu Val Arg Lys Glu Leu Asp Leu Leu Phe Pro Val Glu Thr Ser

145 150 155 160

Asp Thr Lys Val Gly Asp Thr Ser Ser Gly

165 170

<210> SEQ ID NO 11

<211> LENGTH: 164

<212> TYPE: PRT

<213> ORGANISM: Acanthamoeba castellanii

<400> SEQUENCE: 11

Met Gln Lys Asn Gln Phe Lys Lys Leu Asn Asn Glu Gln Leu Ala Tyr

1 5 10 15

Leu Ala Gly Phe Val Glu Ala Asp Gly Cys Phe Leu Val Gln Ile Ile

20 25 30

Pro Gly Leu Gln Tyr Arg Tyr Lys His Thr Ile Arg Ile Ser Ile Val

35 40 45

Phe Tyr Gln Lys Lys Asp Lys His Trp Tyr Phe Leu Gln Leu Lys Asn

50 55 60

Leu Ile Gly Leu Gly Ser Ile Arg Phe Arg Asn Asp Gly Met Leu Glu

65 70 75 80

Tyr Ser Ile Thr Gly Leu Ser Leu Val Asn Lys Phe Leu Glu Met Leu

85 90 95

Phe Pro Tyr Leu Ile Leu Lys Lys Asn Leu Ala Val Leu Ile Phe Arg

100 105 110

Ile Ile Lys Gly Leu Asn Asp Val Lys Asn Glu Ala Gly Phe Leu Glu

115 120 125

Val Cys Lys Leu Val Asp Glu Val Ala Asp His Thr Tyr Ser Lys Lys

130 135 140

Arg Lys Asn Thr Ser Leu Thr Val Lys Asn Ser Leu Leu Leu Pro Val

145 150 155 160

Glu Thr Glu Glu

<210> SEQ ID NO 12

<211> LENGTH: 167

<212> TYPE: PRT

<213> ORGANISM: Scherffelia dubia

<400> SEQUENCE: 12

Met Asn Lys Leu Thr Lys Leu Gln Glu Asn Glu Leu Ala Tyr Ile Ala

1 5 10 15

Gly Phe Ile Asp Gly Asp Gly Cys Ile Asn Ala Gln Ile Ile Arg Arg

20 25 30

Val Asp Tyr Lys Leu Lys Phe Gln Ile Arg Phe Ser Val Thr Ile Ile

35 40 45

Gln Lys Thr Ser Arg His Trp Val Val Leu Trp Phe Gln Lys Lys Leu

50 55 60

Gly Cys Gly Gln Ile Arg Lys Arg Asn Asp Gly Ile Ser Glu Tyr Ser

65 70 75 80

Leu Val Gly Lys Asn Asp Val Gln Asn Phe Leu Gln Leu Ile Lys Pro

85 90 95

Phe Leu Lys Val Lys Arg Arg Gln Ala Gln Leu Ile Leu Glu Ile Cys

100 105 110

Gln Gln Ile Ser Lys Glu Gln Asp Pro Gln Ser Phe Val Lys Leu Cys

115 120 125

Glu Arg Val Asp Ser Ile Ala Lys Leu Asn Asp Ser Lys Lys Arg Ile

130 135 140

Ile Thr Ser Asp Val Val Arg Ser Ile Leu Leu Asn Ser Val Gln Lys

145 150 155 160

Phe Pro Val Glu Thr Glu Thr

165

<210> SEQ ID NO 13

<211> LENGTH: 168

<212> TYPE: PRT

<213> ORGANISM: Pseudendoclonium akinetum

<400> SEQUENCE: 13

Met Ser Lys Leu Lys Lys Ile Ile Pro Glu Glu Leu Ser Tyr Leu Ala

1 5 10 15

Gly Phe Leu Asp Gly Asp Gly Cys Ile Asn Ala Gln Ile Val Arg Arg

20 25 30

Ser Asp Tyr Lys Leu Lys Phe Gln Ile Arg Val Ser Ile Thr Phe Phe

35 40 45

Gln Lys Thr Asn Arg His Trp Phe Leu Ile Trp Leu Asp Lys Lys Leu

50 55 60

Asp Cys Gly Thr Leu Arg Lys Arg Pro Asp Gly Met Ser Glu Tyr Ala

65 70 75 80

Ile Ile Gly Ile Ala Ser Val Arg Asn Leu Leu Ser Ile Leu Lys Pro

85 90 95

Tyr Leu Lys Leu Lys Lys Arg Gln Ala Ile Leu Leu Leu Lys Ile Ile

100 105 110

Glu Lys Met Pro His Ile Gln Asn Asp Pro Gln Leu Phe Leu Lys Leu

115 120 125

Cys Glu Gln Val Asp His Phe Ser Glu Phe Asn Asp Ser Lys Lys Arg

130 135 140

Lys Leu Thr Ser Gln Val Val Arg Ser Glu Ile Gly Glu Thr Leu Asn

145 150 155 160

Leu Phe Pro Val Glu Thr Glu Thr

165

<210> SEQ ID NO 14

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(7)

<400> SEQUENCE: 14

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Ile

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Gly Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350
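
The variant entries in this listing differ from SEQ ID NO: 1 at only a few positions, which is easier to see after converting the three-letter residue rows to one-letter code. The sketch below is a minimal illustration using only the first printed row (residues 1 to 16) of SEQ ID NO: 1 and SEQ ID NO: 14; the helper names `to_one_letter` and `substitutions` are hypothetical and not part of the listing.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def to_one_letter(residue_row: str) -> str:
    """Convert a space-separated row of three-letter residues to one-letter code."""
    return "".join(THREE_TO_ONE[r] for r in residue_row.split())

def substitutions(parent: str, variant: str, start: int = 1) -> list[str]:
    """List differences as parent residue, 1-based position, variant residue."""
    if len(parent) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        f"{p}{i}{v}"
        for i, (p, v) in enumerate(zip(parent, variant), start)
        if p != v
    ]

# Residues 1-16 as printed for SEQ ID NO: 1 (LIG3-4) and SEQ ID NO: 14 (LIG3-4(7)).
lig34_row = "Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe"
lig34_7_row = "Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Ile"

print(substitutions(to_one_letter(lig34_row), to_one_letter(lig34_7_row)))
```

Passing the full 350-residue sequences instead of a single row would list every substitution between a variant and SEQ ID NO: 1.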

<210> SEQ ID NO 15

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(15)

<400> SEQUENCE: 15

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 16

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(A4)

<400> SEQUENCE: 16

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Val Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 17

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(A6)

<400> SEQUENCE: 17

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 18

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B1)

<400> SEQUENCE: 18

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Pro Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 19

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B15)

<400> SEQUENCE: 19

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 20

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B16)

<400> SEQUENCE: 20

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 21

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B24)

<400> SEQUENCE: 21

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 22

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B36)

<400> SEQUENCE: 22

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 23

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B38)

<400> SEQUENCE: 23

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Pro Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 24

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B39)

<400> SEQUENCE: 24

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Pro Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Pro Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Asn Asn Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 25

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B40)

<400> SEQUENCE: 25

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 26

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B55)

<400> SEQUENCE: 26

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Cys Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 27

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B65)

<400> SEQUENCE: 27

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 28

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B70)

<400> SEQUENCE: 28

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Pro Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 29

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B71)

<400> SEQUENCE: 29

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Pro Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 30

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B73)

<400> SEQUENCE: 30

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 31

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B75)

<400> SEQUENCE: 31

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Pro Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 32

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B76)

<400> SEQUENCE: 32

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 33

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B78)

<400> SEQUENCE: 33

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Pro Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Asn Asn Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 34

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B82)

<400> SEQUENCE: 34

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Cys Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Lys Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 35

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(C1)

<400> SEQUENCE: 35

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Lys Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Leu Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Asn Asn Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 36

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D5)

<400> SEQUENCE: 36

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Val Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Met Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu His Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 37

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D7)

<400> SEQUENCE: 37

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 38

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D8)

<400> SEQUENCE: 38

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Lys Ala Gln Ile Lys Pro Asn Gln Ser

20 25 30

Cys Lys Phe Lys His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu His Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Tyr Asp Arg Gly Ser Val Ser Asp Tyr Glu Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Lys Pro Arg Gln Cys Tyr Lys Phe Lys His

210 215 220

Glu Leu Arg Leu Glu Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Arg Gly Ser Val Ser Asp Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 39

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4maize

<400> SEQUENCE: 39

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aggaattcct gctctacctg 600

gccggcttcg tggacggcga cggctccatc atcgcgtcca tcaagccgcg ccagtgctac 660

aagttcaagc acgagctccg cctggagttc accgtgaccc agaagacgca gaggcgctgg 720

ttcctcgaca agctggtcga cgagatcggg gtgggctacg tctacgaccg cgggtcggtg 780

tccgactacc gcctctccca gatcaagccc ctgcacaact tcctcaccca gctccagccg 840

ttcctcaagc tgaagcagaa gcaggcgaac ctcgtcctga agatcatcga gcagctcccc 900

tcggccaagg agtccccgga caagttcctg gaggtgtgca cgtgggtcga ccagatcgcg 960

gccctcaacg acagcaagac ccgcaagacg acctcggaga cggtgcgggc ggtcctggac 1020

tccctcagcg agaagaagaa gtcgtccccc 1050

<210> SEQ ID NO 40

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4yeast

<400> SEQUENCE: 40

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 41

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(7)

<400> SEQUENCE: 41

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcatcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aaggactcag agctggagca actaagtcca aggaattcct gctctacctg 600

gccggcttcg tggacggcga cggctccatc atcgcgtcca tcaagccgcg ccagtgctac 660

aagttcaagc acgagctccg cctggagttc accgtgaccc agaagacgca gaggcgctgg 720

ttcctcgacg agctggtcga cgagatcggg gtgggctacg tctacgaccg cgggtcggtg 780

tccgactacc gcctctccca gatcaagccc ctgcacaact tcctcaccca gctccagccg 840

ttcctcaagc tgaagcagaa gcaggcgaac ctcgtcctga agatcatcga gcagctcccc 900

tcggccaagg agtccccgga caagttcctg gaggtgtgca cgtgggccga ccagatcgcg 960

gccctcaacg acagcaagac ccgcaagacg acctcggaga cggtgcgggc ggtcctggac 1020

tccctcagcg agaagaagaa gtcgtccccc 1050

<210> SEQ ID NO 42

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(15)

<400> SEQUENCE: 42

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacagcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aggaattcct gctctacctg 600

gccggcttcg tggacggcga cggctccatc atcgcgtcca tcaagccgcg ccagtgctac 660

aagttcaagc acgagctccg cctggagttc accgtgaccc agaagacgca gaggcgctgg 720

ttcctcgaca agctggtcga cgagatcggg gtgggctacg tctacgaccg cgggtcggtg 780

tccgactacc gcctctccca gatcaagccc ctgcacaact tcctcaccca gctccagccg 840

ttcctcaagc tgaagcagaa gcaggcgaac ctcgtcctga agatcatcga gcagctcccc 900

tcggccaagg agtccccgga caagttcctg gaggtgtgca cgtgggtcga ccagatcgcg 960

gccctcaacg acagcaagac ccgcaagacg acctcggaga cggtgcgggc ggtcctggac 1020

tccctcagcg agaagaagaa gtcgtccccc 1050

<210> SEQ ID NO 43

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(A4)

<400> SEQUENCE: 43

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac caggtcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga ccaagtcgcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 44

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(A6)

<400> SEQUENCE: 44

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac caggtcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tatttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 45

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B1)

<400> SEQUENCE: 45

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc ccatcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 46

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B15)

<400> SEQUENCE: 46

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 47

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B16)

<400> SEQUENCE: 47

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 48

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B24)

<400> SEQUENCE: 48

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 49

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B36)

<400> SEQUENCE: 49

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 50

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B38)

<400> SEQUENCE: 50

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gccaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 51

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B39)

<400> SEQUENCE: 51

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc ccatcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gccaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaataataa aagcagtccg 1050

<210> SEQ ID NO 52

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B40)

<400> SEQUENCE: 52

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 53

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B55)

<400> SEQUENCE: 53

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggtgcata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 54

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B65)

<400> SEQUENCE: 54

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca gcgtcgttgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 55

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B70)

<400> SEQUENCE: 55

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gccaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 56

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B71)

<400> SEQUENCE: 56

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gccaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 57

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B73)

<400> SEQUENCE: 57

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 58

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B75)

<400> SEQUENCE: 58

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc ccatcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 59

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B76)

<400> SEQUENCE: 59

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 60

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B78)

<400> SEQUENCE: 60

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc ccatcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaataataa aagcagtccg 1050

<210> SEQ ID NO 61

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B82)

<400> SEQUENCE: 61

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgaaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggtgcata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gaaaagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 62

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(C1)

<400> SEQUENCE: 62

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctgtatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcaa aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaattgaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaataataa aagcagtccg 1050

<210> SEQ ID NO 63

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D5)

<400> SEQUENCE: 63

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

aaaaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcggtcc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctgatgt ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattacacga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aatcaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga tggattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 64

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D7)

<400> SEQUENCE: 64

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcgg aggcgctggt tcctcctgaa gctggtcgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

aaaaagccgc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aaattcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctgttgt ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 65

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(D8)

<400> SEQUENCE: 65

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca aggcgcagat caagccgaac cagtcctgca agttcaagca ccagctctcc 120

ctgaccttcc aggtgaccca gaagacgcag aggcgctgga tcctcgacaa gctgcacgac 180

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactacga gctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

ggcttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta taaaacccag gcaatgttat 660

aaatttaaac atgaattaag gttagaattt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatag gggcagcgtc 780

agtgattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 66

<211> LENGTH: 67

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MN031 primer

<400> SEQUENCE: 66

aaaaactata tctagacata tggccccgaa gaagaagcgc aaggtgatga acaccaagta 60

caacaag 67

<210> SEQ ID NO 67

<211> LENGTH: 40

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MN022 primer

<400> SEQUENCE: 67

aattacatga ctcgagttaa tggtgatggt gatggtgagg 40

<210> SEQ ID NO 68

<211> LENGTH: 7753

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: pVER8134 (with LIG34=500-1549)

<400> SEQUENCE: 68

tgaagtacgg attagaagcc gccgagcggg tgacagccct ccgaaggaag actctcctcc 60

gtgcgtcctc gtcttcaccg gtcgcgttcc tgaaacgcag atgtgcctcg cgccgcactg 120

ctccgaacaa taaagattct acaatactag cttttatggt tatgaagagg aaaaattggc 180

agtaacctgg ccccacaaac cttcaaatga acgaatcaaa ttaacaacca taggatgata 240

atgcgattag ttttttagcc ttatttctgg ggtaattaat cagcgaagcg atgatttttg 300

atctattaac agatatataa atgcaaaaac tgcataacca ctttaactaa tactttcaac 360

attttcggtt tgtattactt cttattcaaa tgtaataaaa gtatcaacaa aaaattgtta 420

atatacctct atactttaac gtcaaggaga aaaaaccccg gattctagac atatggcccc 480

gaagaagaag cgcaaggtga tgaacaccaa gtacaacaag gagttcctgc tctacctggc 540

cggcttcgtg gacggcgacg gctccatcaa ggcgcagatc aagccgaacc agtcctgcaa 600

gttcaagcac cagctctccc tgaccttcca ggtgacccag aagacgcaga ggcgctggtt 660

cctcgacaag ctggtcgacg agatcggggt gggctacgtc tacgaccgcg ggtcggtgtc 720

cgactacgag ctctcccaga tcaagcccct gcacaacttc ctcacccagc tccagccgtt 780

cctcaagctg aagcagaagc aggcgaacct cgtcctgaag atcatcgagc agctcccctc 840

ggccaaggag tccccggaca agttcctgga ggtgtgcacg tgggtcgacc agatcgcggc 900

cctcaacgac agcaagaccc gcaagacgac ctcggagacg gtgcgggcgg tcctggactc 960

cctcccagga tccgtgggag gtctatcgcc atctcaggca tccagcgccg catcctcggc 1020

ttcctcaagc ccgggttcag ggatctccga agcactcaga gctggagcaa ctaagtccaa 1080

ggaattcctg ctctacctgg ccggcttcgt ggacggcgac ggctccatca tcgcgtccat 1140

caagccgcgc cagtgctaca agttcaagca cgagctccgc ctggagttca ccgtgaccca 1200

gaagacgcag aggcgctggt tcctcgacaa gctggtcgac gagatcgggg tgggctacgt 1260

ctacgaccgc gggtcggtgt ccgactaccg cctctcccag atcaagcccc tgcacaactt 1320

cctcacccag ctccagccgt tcctcaagct gaagcagaag caggcgaacc tcgtcctgaa 1380

gatcatcgag cagctcccct cggccaagga gtccccggac aagttcctgg aggtgtgcac 1440

gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc cgcaagacga cctcggagac 1500

ggtgcgggcg gtcctggact ccctcagcga gaagaagaag tcgtctcctc accatcacca 1560

tcaccattaa ctcgagtcat gtaattagtt atgtcacgct tacattcacg ccctcccccc 1620

acatccgctc taaccgaaaa ggaaggagtt agacaacctg aagtctaggt ccctatttat 1680

ttttttatag ttatgttagt attaagaacg ttatttatat ttcaaatttt tctttttttt 1740

ctgtacagac gcgtgtacgc atgtaacatt atactgaaaa ccttgcttga gaaggttttg 1800

ggacgctcga aggctttaat ttgcggccgg tacccaattc gccctatagt gagtcgtatt 1860

acgcgcgctc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 1920

aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 1980

gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc gacgcgccct 2040

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 2100

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 2160

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 2220

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 2280

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 2340

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 2400

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 2460

ttaacaaaat attaacgttt acaatttcct gatgcggtat tttctcctta cgcatctgtg 2520

cggtatttca caccgcatat cgacggtcga ggagaacttc tagtatatcc acatacctaa 2580

tattattgcc ttattaaaaa tggaatccca acaattacat caaaatccac attctcttca 2640

aaatcaattg tcctgtactt ccttgttcat gtgtgttcaa aaacgttata tttataggat 2700

aattatactc tatttctcaa caagtaattg gttgtttggc cgagcggtct aaggcgcctg 2760

attcaagaaa tatcttgacc gcagttaact gtgggaatac tcaggtatcg taagatgcaa 2820

gagttcgaat ctcttagcaa ccattatttt tttcctcaac ataacgagaa cacacagggg 2880

cgctatcgca cagaatcaaa ttcgatgact ggaaattttt tgttaatttc agaggtcgcc 2940

tgacgcatat acctttttca actgaaaaat tgggagaaaa aggaaaggtg agaggccgga 3000

accggctttt catatagaat agagaagcgt tcatgactaa atgcttgcat cacaatactt 3060

gaagttgaca atattattta aggacctatt gttttttcca ataggtggtt agcaatcgtc 3120

ttactttcta acttttctta ccttttacat ttcagcaata tatatatata tttcaaggat 3180

ataccattct aatgtctgcc cctatgtctg cccctaagaa gatcgtcgtt ttgccaggtg 3240

accacgttgg tcaagaaatc acagccgaag ccattaaggt tcttaaagct atttctgatg 3300

ttcgttccaa tgtcaagttc gatttcgaaa atcatttaat tggtggtgct gctatcgatg 3360

ctacaggtgt cccacttcca gatgaggcgc tggaagcctc caagaaggtt gatgccgttt 3420

tgttaggtgc tgtgggtggt cctaaatggg gtaccggtag tgttagacct gaacaaggtt 3480

tactaaaaat ccgtaaagaa cttcaattgt acgccaactt aagaccatgt aactttgcat 3540

ccgactctct tttagactta tctccaatca agccacaatt tgctaaaggt actgacttcg 3600

ttgttgtcag agaattagtg ggaggtattt actttggtaa gagaaaggaa gacgatggtg 3660

atggtgtcgc ttgggatagt gaacaataca ccgttccaga agtgcaaaga atcacaagaa 3720

tggccgcttt catggcccta caacatgagc caccattgcc tatttggtcc ttggataaag 3780

ctaatgtttt ggcctcttca agattatgga gaaaaactgt ggaggaaacc atcaagaacg 3840

aattccctac attgaaggtt caacatcaat tgattgattc tgccgccatg atcctagtta 3900

agaacccaac ccacctaaat ggtattataa tcaccagcaa catgtttggt gatatcatct 3960

ccgatgaagc ctccgttatc ccaggttcct tgggtttgtt gccatctgcg tccttggcct 4020

ctttgccaga caagaacacc gcatttggtt tgtacgaacc atgccacggt tctgctccag 4080

atttgccaaa gaataaggtt gaccctatcg ccactatctt gtctgctgca atgatgttga 4140

aattgtcatt gaacttgcct gaagaaggta aggccattga agatgcagtt aaaaaggttt 4200

tggatgcagg tatcagaact ggtgatttag gtggttccaa cagtaccacc gaagtcggtg 4260

atgctgtcgc cgaagaagtt aagaaaatcc ttgcttaaaa agattctctt tttttatgat 4320

atttgtacat aaactttata aatgaaattc ataatagaaa cgacacgaaa ttacaaaatg 4380

gaatatgttc atagggtaga cgaaactata tacgcaatct acatacattt atcaagaagg 4440

agaaaaagga ggatagtaaa ggaatacagg taagcaaatt gatactaatg gctcaacgtg 4500

ataaggaaaa agaattgcac tttaacatta atattgacaa ggaggagggc accacacaaa 4560

aagttaggtg taacagaaaa tcatgaaact acgattccta atttgatatt ggaggatttt 4620

ctctaaaaaa aaaaaaatac aacaaataaa aaacactcaa tgacctgacc atttgatgga 4680

gtttaagtca ataccttctt gaaccatttc ccataatggt gaaagttccc tcaagaattt 4740

tactctgtca gaaacggcct tacgacgtag tcgatatggt gcactctcag tacaatctgc 4800

tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga 4860

cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 4920

atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata 4980

cgcctatttt tataggttaa tgtcatgata ataatggttt cttaggacgg atcgcttgcc 5040

tgtaacttac acgcgcctcg tatcttttaa tgatggaata atttgggaat ttactctgtg 5100

tttatttatt tttatgtttt gtatttggat tttagaaagt aaataaagaa ggtagaagag 5160

ttacggaatg aagaaaaaaa aataaacaaa ggtttaaaaa atttcaacaa aaagcgtact 5220

ttacatatat atttattaga caagaaaagc agattaaata gatatacatt cgattaacga 5280

taagtaaaat gtaaaatcac aggattttcg tgtgtggtct tctacacaga caagatgaaa 5340

caattcggca ttaatacctg agagcaggaa gagcaagata aaaggtagta tttgttggcg 5400

atccccctag agtcttttac atcttcggaa aacaaaaact attttttctt taatttcttt 5460

ttttactttc tatttttaat ttatatattt atattaaaaa atttaaatta taattatttt 5520

tatagcacgt gatgaaaagg acccaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5580

tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5640

ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5700

ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5760

gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5820

caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5880

ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5940

cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 6000

gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6060

taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6120

tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6180

agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6240

caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6300

ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6360

tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 6420

agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 6480

tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 6540

agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 6600

gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 6660

gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 6720

tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 6780

gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 6840

accaaatact gtccttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 6900

accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 6960

gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 7020

ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 7080

atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 7140

gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 7200

cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 7260

gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 7320

gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 7380

tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 7440

cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 7500

ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 7560

gggcagtgag cgcaacgcaa ttaatgtgag ttacctcact cattaggcac cccaggcttt 7620

acactttatg cttccggctc ctatgttgtg tggaattgtg agcggataac aatttcacac 7680

aggaaacagc tatgaccatg attacgccaa gcgcgcaatt aaccctcact aaagggaaca 7740

aaagctggag ctc 7753

<210> SEQ ID NO 69

<211> LENGTH: 27

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: nuclear localization signal yeast

<400> SEQUENCE: 69

atggccccga agaagaagcg caaggtg 27

<210> SEQ ID NO 70

<211> LENGTH: 9

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: nuclear localization signal

<400> SEQUENCE: 70

Met Ala Pro Lys Lys Lys Arg Lys Val

1 5
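
Reading aid (not part of the filed listing): the 27-nucleotide sequence of SEQ ID NO 69 appears to encode the 9-residue nuclear localization signal of SEQ ID NO 70. The minimal Python sketch below checks this by translating the DNA with the standard genetic code. The names `CODON_TABLE`, `translate`, and `NLS_YEAST_DNA` are illustrative and do not come from the patent, and the use of the standard codon table is an assumption.

```python
# Illustrative sketch: translate SEQ ID NO 69 and compare with SEQ ID NO 70.
# Assumes the standard genetic code; all names here are hypothetical helpers.

BASES = "tcag"
# Amino acids in the conventional TCAG codon order ('*' marks a stop codon).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a DNA string (reading frame 1) into one-letter amino acids."""
    dna = dna.replace(" ", "").lower()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# SEQ ID NO 69 as printed in the listing above.
NLS_YEAST_DNA = "atggccccga agaagaagcg caaggtg"

if __name__ == "__main__":
    # SEQ ID NO 70 reads Met Ala Pro Lys Lys Lys Arg Lys Val.
    print(translate(NLS_YEAST_DNA))
```

The same helper can be applied to the other coding sequences in the listing, bearing in mind that entries containing an intron (for example the maize constructs with ST-LS1 INTRON2) will not translate contiguously.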

<210> SEQ ID NO 71

<211> LENGTH: 6

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: 6X histidine tag

<400> SEQUENCE: 71

His His His His His His

1 5

<210> SEQ ID NO 72

<211> LENGTH: 30

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: nuclear localization signal maize

<400> SEQUENCE: 72

atggccccga agaagaagcg caaggtgatc 30

<210> SEQ ID NO 73

<211> LENGTH: 1269

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(7) maize (include NLS and ST-LS1 INTRON2)

<400> SEQUENCE: 73

atggcaccga agaagaagcg caaggtgatc atgaacacca agtacaacaa ggagttcctg 60

ctctacctgg ccggcatcgt ggacggcgac ggatccatca aggcgcagat caagccgaac 120

cagtcctgca agttcaagca ccagctctcc ctgaccttcc aggtgaccca gaagacgcag 180

aggcgctggt tcctcgacaa gctggtcgac gagatcgggg tgggctacgt ctacgaccgc 240

gggtcggtgt ccgactacga gctctcccag atcaagcccc tgcacaactt cctcacccag 300

ctgcagccgt tcctcaagct gaagcagaag caggcgaacc tcgtcctgaa gatcatcgag 360

cagctcccct cggccaagga gtccccggac aagttcctgg aggtaagttt ctgcttctac 420

ctttgatata tatataataa ttatcattaa ttagtagtaa tataatattt caaatatttt 480

tttcaaaata aaagaatgta gtatatagca attgcttttc tgtagtttat aagtgtgtat 540

attttaattt ataacttttc taatatatga ccaaaacatg gtgatgtgca ggtgtgcacg 600

tgggtcgacc agatcgcggc cctcaacgac agcaagaccc gcaagacgac ctcggagacg 660

gtgcgggcgg tcctggactc cctcccagga tccgtgggag gtctatcgcc atctcaggca 720

tccagcgccg catcctcggc ttcctcaagc ccgggttcag ggatctccga aggactcaga 780

gctggagcaa ctaagtccaa ggaattcctg ctctacctgg ccggcttcgt ggacggcgac 840

ggctccatca tcgcgtccat caagccgcgc cagtgctaca agttcaagca cgagctccgc 900

ctggagttca ccgtgaccca gaagacgcag aggcgctggt tcctcgacga gctggtcgac 960

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactaccg cctctcccag 1020

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct taagcagaag 1080

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 1140

aagttcctgg aggtgtgcac gtgggcggac cagatcgcgg ccctcaacga cagcaagacc 1200

cgcaagacga cctcggagac ggtgcgggcg gttctagact ccctcagcga gaagaagaag 1260

tcgtccccc 1269

<210> SEQ ID NO 74

<211> LENGTH: 1269

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(15) maize (include NLS and ST-LS1 INTRON2)

<400> SEQUENCE: 74

atggcaccga agaagaagcg caaggtgatc atgaacacca agtacaacaa ggagttcctg 60

ctctacctgg ccggcttcgt ggactccgac ggatccatca aggcgcagat caagccgaac 120

cagtcctgca agttcaagca ccagctctcc ctgaccttcc aggtgaccca gaagacgcag 180

aggcgctggt tcctcgacaa gctggtcgac gagatcgggg tgggctacgt ctacgaccgc 240

gggtcggtgt ccgactacga gctctcccag atcaagcccc tgcacaactt cctcacccag 300

ctgcagccgt tcctcaagct gaagcagaag caggcgaacc tcgtcctgaa gatcatcgag 360

cagctcccct cggccaagga gtccccggac aagttcctgg aggtaagttt ctgcttctac 420

ctttgatata tatataataa ttatcattaa ttagtagtaa tataatattt caaatatttt 480

tttcaaaata aaagaatgta gtatatagca attgcttttc tgtagtttat aagtgtgtat 540

attttaattt ataacttttc taatatatga ccaaaacatg gtgatgtgca ggtgtgcacg 600

tgggtcgacc agatcgcggc cctcaacgac agcaagaccc gcaagacgac ctcggagacg 660

gtgcgggcgg tcctggactc cctcccagga tccgtgggag gtctatcgcc atctcaggca 720

tccagcgccg catcctcggc ttcctcaagc ccgggttcag ggatctccga agcactcaga 780

gctggagcaa ctaagtccaa ggaattcctg ctctacctgg ccggcttcgt ggacggcgac 840

ggctccatca tcgcgtccat caagccgcgc cagtgctaca agttcaagca cgagctccgc 900

ctggagttca ccgtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 960

gagatcgggg tgggctacgt ctacgaccgc gggtcggtgt ccgactaccg cctctcccag 1020

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct taagcagaag 1080

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 1140

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 1200

cgcaagacga cctcggagac ggtgcgggcg gttctagact ccctcagcga gaagaagaag 1260

tcgtccccc 1269

<210> SEQ ID NO 75

<211> LENGTH: 1269

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(B65) maize (include NLS and ST-LS1 INTRON2)

<400> SEQUENCE: 75

atggcaccga agaagaagcg caaggtgatc atgaacacca agtacaacaa ggagttcctg 60

ctctacctgg ccggcttcgt ggacggcgac ggatgcatca aggcgcagat caagccgaac 120

cagtcctgca agttcaagca ccagctctcc ctgaccttcc aggtgaccca gaagacgaag 180

aggcgctggt tcctcgacaa gctggtcgac gagatcgggg tgggctacgt ctacgaccgc 240

aagtcggtgt ccgactacga gctctcccag atcaagcccc tgcacaactt cctcacccag 300

ctgcagccgt tcctcaagct gaagcagaag caggcgaacc tcgtcctgaa gatcatcgag 360

cagctcccct cggccaagga gtccccggac aagttcctgg aggtaagttt ctgcttctac 420

ctttgatata tatataataa ttatcattaa ttagtagtaa tataatattt caaatatttt 480

tttcaaaata aaagaatgta gtatatagca attgcttttc tgtagtttat aagtgtgtat 540

attttaattt ataacttttc taatatatga ccaaaacatg gtgatgtgca ggtgtgcacg 600

tgggtcgacc agatcgcggc cctcaacgac agcaagaccc gcaagacgac ctcggagacg 660

gtgcgggcgg tcctggactc cctcccagga tccgtgggag gtctatcgcc atctcaggca 720

tccagcgccg catcctcggc ttcctcaagc ccgggttcag ggatctccga agcactcaga 780

gctggagcaa ctaagtccaa ggaattcctg ctctacctgg ccggcttcgt ggacggcgac 840

ggctccatca tcgcgtccat caagccgcgc cagtgctaca agttcaagca cgagctccgc 900

ctggagttca ccgtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 960

gagatcgggg tgggctacgt ctacgaccgc aagtcggtgt ccgactaccg cctctcccag 1020

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct taagcagaag 1080

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 1140

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 1200

cgcaagacga cctcggagac ggtgcgggcg gttctagact ccctcagcga gaagaagaag 1260

tcgtccccc 1269

<210> SEQ ID NO 76

<211> LENGTH: 15979

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: plasmid PHP46961

<400> SEQUENCE: 76

gtttacccgc caatatatcc tgtcaaacac tgatagttta aactgaaggc gggaaacgac 60

aatctgatca tgagcggaga attaagggag tcacgttatg acccccgccg atgacgcggg 120

acaagccgtt ttacgtttgg aactgacaga accgcaacgt tgaaggagcc actcagcaag 180

ctggtacgat tgtaatacga ctcactatag ggcgaattga gcgctgttta aacgctcttc 240

aactggaaga gcggttacta ccggctggat ggcggggcct tgatcgtgca ccgccggcgt 300

ccggactaac taactagtcg agctagttac cctatgaggt gacatgaagc gctcacggtt 360

actatgacgg ttagcttcac gactgttggt ggcagtagcg tacgacttag ctatagttcc 420

ggacttaccc ttaagataac ttcgtatagc atacattata cgaagttatg ggcccaccgg 480

tggtacctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 540

tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 600

gaattgggtt acccggaccg aagcttgcat gcctgcagtg cagcgtgacc cggtcgtgcc 660

cctctctaga gataatgagc attgcatgtc taagttataa aaaattacca catatttttt 720

ttgtcacact tgtttgaagt gcagtttatc tatctttata catatattta aactttactc 780

tacgaataat ataatctata gtactacaat aatatcagtg ttttagagaa tcatataaat 840

gaacagttag acatggtcta aaggacaatt gagtattttg acaacaggac tctacagttt 900

tatcttttta gtgtgcatgt gttctccttt ttttttgcaa atagcttcac ctatataata 960

cttcatccat tttattagta catccattta gggtttaggg ttaatggttt ttatagacta 1020

atttttttag tacatctatt ttattctatt ttagcctcta aattaagaaa actaaaactc 1080

tattttagtt tttttattta ataatttaga tataaaatag aataaaataa agtgactaaa 1140

aattaaacaa atacccttta agaaattaaa aaaactaagg aaacattttt cttgtttcga 1200

gtagataatg ccagcctgtt aaacgccgtc gacgagtcta acggacacca accagcgaac 1260

cagcagcgtc gcgtcgggcc aagcgaagca gacggcacgg catctctgtc gctgcctctg 1320

gacccctctc gagagttccg ctccaccgtt ggacttgctc cgctgtcggc atccagaaat 1380

tgcgtggcgg agcggcagac gtgagccggc acggcaggcg gcctcctcct cctctcacgg 1440

caccggcagc tacgggggat tcctttccca ccgctccttc gctttccctt cctcgcccgc 1500

cgtaataaat agacaccccc tccacaccct ctttccccaa cctcgtgttg ttcggagcgc 1560

acacacacac aaccagatct cccccaaatc cacccgtcgg cacctccgct tcaaggtacg 1620

ccgctcgtcc tccccccccc ccctctctac cttctctaga tcggcgttcc ggtccatgca 1680

tggttagggc ccggtagttc tacttctgtt catgtttgtg ttagatccgt gtttgtgtta 1740

gatccgtgct gctagcgttc gtacacggat gcgacctgta cgtcagacac gttctgattg 1800

ctaacttgcc agtgtttctc tttggggaat cctgggatgg ctctagccgt tccgcagacg 1860

ggatcgattt catgattttt tttgtttcgt tgcatagggt ttggtttgcc cttttccttt 1920

atttcaatat atgccgtgca cttgtttgtc gggtcatctt ttcatgcttt tttttgtctt 1980

ggttgtgatg atgtggtctg gttgggcggt cgttctagat cggagtagaa ttctgtttca 2040

aactacctgg tggatttatt aattttggat ctgtatgtgt gtgccataca tattcatagt 2100

tacgaattga agatgatgga tggaaatatc gatctaggat aggtatacat gttgatgcgg 2160

gttttactga tgcatataca gagatgcttt ttgttcgctt ggttgtgatg atgtggtgtg 2220

gttgggcggt cgttcattcg ttctagatcg gagtagaata ctgtttcaaa ctacctggtg 2280

tatttattaa ttttggaact gtatgtgtgt gtcatacatc ttcatagtta cgagtttaag 2340

atggatggaa atatcgatct aggataggta tacatgttga tgtgggtttt actgatgcat 2400

atacatgatg gcatatgcag catctattca tatgctctaa ccttgagtac ctatctatta 2460

taataaacaa gtatgtttta taattatttt gatcttgata tacttggatg atggcatatg 2520

cagcagctat atgtggattt ttttagccct gccttcatac gctatttatt tgcttggtac 2580

tgtttctttt gtcgatgctc accctgttgt ttggtgttac ttctgcaggt cgactctaga 2640

ggatccatgg caccgaagaa gaagcgcaag gtgatcatga acaccaagta caacaaggag 2700

ttcctgctct acctggccgg cttcgtggac tccgacggat ccatcaaggc gcagatcaag 2760

ccgaaccagt cctgcaagtt caagcaccag ctctccctga ccttccaggt gacccagaag 2820

acgcagaggc gctggttcct cgacaagctg gtcgacgaga tcggggtggg ctacgtctac 2880

gaccgcgggt cggtgtccga ctacgagctc tcccagatca agcccctgca caacttcctc 2940

acccagctgc agccgttcct caagctgaag cagaagcagg cgaacctcgt cctgaagatc 3000

atcgagcagc tcccctcggc caaggagtcc ccggacaagt tcctggaggt aagtttctgc 3060

ttctaccttt gatatatata taataattat cattaattag tagtaatata atatttcaaa 3120

tatttttttc aaaataaaag aatgtagtat atagcaattg cttttctgta gtttataagt 3180

gtgtatattt taatttataa cttttctaat atatgaccaa aacatggtga tgtgcaggtg 3240

tgcacgtggg tcgaccagat cgcggccctc aacgacagca agacccgcaa gacgacctcg 3300

gagacggtgc gggcggtcct ggactccctc ccaggatccg tgggaggtct atcgccatct 3360

caggcatcca gcgccgcatc ctcggcttcc tcaagcccgg gttcagggat ctccgaagca 3420

ctcagagctg gagcaactaa gtccaaggaa ttcctgctct acctggccgg cttcgtggac 3480

ggcgacggct ccatcatcgc gtccatcaag ccgcgccagt gctacaagtt caagcacgag 3540

ctccgcctgg agttcaccgt gacccagaag acgcagaggc gctggttcct cgacaagctg 3600

gtcgacgaga tcggggtggg ctacgtctac gaccgcgggt cggtgtccga ctaccgcctc 3660

tcccagatca agcccctgca caacttcctc acccagctcc agccgttcct caagcttaag 3720

cagaagcagg cgaacctcgt cctgaagatc atcgagcagc tcccctcggc caaggagtcc 3780

ccggacaagt tcctggaggt gtgcacgtgg gtcgaccaga tcgcggccct caacgacagc 3840

aagacccgca agacgacctc ggagacggtg cgggcggttc tagactccct cagcgagaag 3900

aagaagtcgt ccccctgagg taccacatgg ttaacctaga cttgtccatc ttctggattg 3960

gccaacttaa ttaatgtatg aaataaaagg atgcacacat agtgacatgc taatcactat 4020

aatgtgggca tcaaagttgt gtgttatgtg taattactag ttatctgaat aaaagagaaa 4080

gagatcatcc atatttctta tcctaaatga atgtcacgtg tctttataat tctttgatga 4140

accagatgca tttcattaac caaatccata tacatataaa tattaatcat atataattaa 4200

tatcaattgg gttagcaaaa caaatctagt ctaggtgtgt tttgcgaatg cggccgcgtt 4260

ggacggcacg acgtaggggg ggaactcgtc gcggccgttg gcggcagccg ataggttcat 4320

catcttccca cgccccggcc ggcacgttga cacgatatct cgatcgatcg gtcggcccgg 4380

ccgcccgatc tgtgcaggtg caggtcgcgc gggcgggcgc ggctagctgg gagcctggga 4440

ggggaggagg ggccggaggg agcaggagca ggagtgcccg cgcgcgcaca cacgagcaca 4500

cggaaatgga tgcgtagggg acggagggga ggacggcggc aaggacagcg cttagcggag 4560

agctcggtgg agatcgatcg ctcagtcggt cgcaggtgga cgagcgacag acagagctaa 4620

tacggcggtg ttggccggcc gggccatggt gtgggtgatg gcgatgacac agatatatgc 4680

gcgggcgggc gccagctagt agccgggcag ctagcgcgcg cccttctcgg ccggccggat 4740

ctcttttgct gcggagaggg agagggcgag agcgagggcc gagagcgcga gtacttttgg 4800

tctagggttc catggaacga gtggtggtgg agtgagtttt gggctatatc taagagcgac 4860

gcccgcagct cagctagcaa caaaccgggc gctggcactg acagggccgg ccagtagaga 4920

gagagagaga gatctttaat tggagttggt gagtggtgat agcagccgca gctgctgctg 4980

ctgctgctgc tgcttctttg tggttggttt ggttcgccct ccatttttcc tcacccggga 5040

gtccgtatgc tatctgctat atgctaggct ggctgtgggt gtctatgtat gtatcctcct 5100

cctccgttga aacaacgtag cgtacgacac tgctgcatgt gtggccttga agatatgagt 5160

atctatgcat gatgcggatg ctgtacatgt gcattgcatc gctcttctct ctccctgcgg 5220

tggtgtgtac gagacggtgg gtacgtacgc taacgctagc agctgcctcc cgcgtgacgc 5280

caggggcagc ccggccggac gcggtgtttc gttccatgac catgaggcgt gatctcagtc 5340

atggcccaac tacgccaggg gcttgcattg cccgcgcgcg tagactttgc acctgcgcat 5400

gtatgtatgt acatcctgca gcgcacagca gactgagctg tattcccgcg caaatgagta 5460

gcagcgcacg tatatatacg cgtacgcgta caagggcgaa ttccgctagc ctgcagtgca 5520

gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta agttataaaa 5580

aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta tctttataca 5640

tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa tatcagtgtt 5700

ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga gtattttgac 5760

aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt ttttgcaaat 5820

agcttcacct atataatact tcatccattt tattagtaca tccatttagg gtttagggtt 5880

aatggttttt atagactaat ttttttagta catctatttt attctatttt agcctctaaa 5940

ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata taaaatagaa 6000

taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa aactaaggaa 6060

acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga cgagtctaac 6120

ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga cggcacggca 6180

tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg acttgctccg 6240

ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac ggcaggcggc 6300

ctcctcctcc tctcacggca ccggcagcta cgggggattc ctttcccacc gctccttcgc 6360

tttcccttcc tcgcccgccg taataaatag acaccccctc cacaccctct ttccccaacc 6420

tcgtgttgtt cggagcgcac acacacacaa ccagatctcc cccaaatcca cccgtcggca 6480

cctccgcttc aaggtacgcc gctcgtcctc cccccccccc ctctctacct tctctagatc 6540

ggcgttccgg tccatgcatg gttagggccc ggtagttcta cttctgttca tgtttgtgtt 6600

agatccgtgt ttgtgttaga tccgtgctgc tagcgttcgt acacggatgc gacctgtacg 6660

tcagacacgt tctgattgct aacttgccag tgtttctctt tggggaatcc tgggatggct 6720

ctagccgttc cgcagacggg atcgatttca tgattttttt tgtttcgttg catagggttt 6780

ggtttgccct tttcctttat ttcaatatat gccgtgcact tgtttgtcgg gtcatctttt 6840

catgcttttt tttgtcttgg ttgtgatgat gtggtctggt tgggcggtcg ttctagatcg 6900

gagtagaatt ctgtttcaaa ctacctggtg gatttattaa ttttggatct gtatgtgtgt 6960

gccatacata ttcatagtta cgaattgaag atgatggatg gaaatatcga tctaggatag 7020

gtatacatgt tgatgcgggt tttactgatg catatacaga gatgcttttt gttcgcttgg 7080

ttgtgatgat gtggtgtggt tgggcggtcg ttcattcgtt ctagatcgga gtagaatact 7140

gtttcaaact acctggtgta tttattaatt ttggaactgt atgtgtgtgt catacatctt 7200

catagttacg agtttaagat ggatggaaat atcgatctag gataggtata catgttgatg 7260

tgggttttac tgatgcatat acatgatggc atatgcagca tctattcata tgctctaacc 7320

ttgagtacct atctattata ataaacaagt atgttttata attattttga tcttgatata 7380

cttggatgat ggcatatgca gcagctatat gtggattttt ttagccctgc cttcatacgc 7440

tatttatttg cttggtactg tttcttttgt cgatgctcac cctgttgttt ggtgttactt 7500

ctgcaggtcg actctagagg atcaattcgc tagcgaagtt cctattccga agttcctatt 7560

ctctagaaag tataggaact tcagatccac cgggatccac acgacaccat gtcccccgag 7620

cgccgccccg tcgagatccg cccggccacc gccgccgaca tggccgccgt gtgcgacatc 7680

gtgaaccact acatcgagac ctccaccgtg aacttccgca ccgagccgca gaccccgcag 7740

gagtggatcg acgacctgga gcgcctccag gaccgctacc cgtggctcgt ggccgaggtg 7800

gagggcgtgg tggccggcat cgcctacgcc ggcccgtgga aggcccgcaa cgcctacgac 7860

tggaccgtgg agtccaccgt gtacgtgtcc caccgccacc agcgcctcgg cctcggctcc 7920

accctctaca cccacctcct caagagcatg gaggcccagg gcttcaagtc cgtggtggcc 7980

gtgatcggcc tcccgaacga cccgtccgtg cgcctccacg aggccctcgg ctacaccgcc 8040

cgcggcaccc tccgcgccgc cggctacaag cacggcggct ggcacgacgt cggcttctgg 8100

cagcgcgact tcgagctgcc ggccccgccg cgcccggtgc gcccggtgac gcagatctga 8160

gtcgaaacct agacttgtcc atcttctgga ttggccaact taattaatgt atgaaataaa 8220

aggatgcaca catagtgaca tgctaatcac tataatgtgg gcatcaaagt tgtgtgttat 8280

gtgtaattac tagttatctg aataaaagag aaagagatca tccatatttc ttatcctaaa 8340

tgaatgtcac gtgtctttat aattctttga tgaaccagat gcatttcatt aaccaaatcc 8400

atatacatat aaatattaat catatataat taatatcaat tgggttagca aaacaaatct 8460

agtctaggtg tgttttgcga atgcggccct agcgtatacg aagttcctat tccgaagttc 8520

ctattctcca gaaagtatag gaacttctgt acacctgagc tgattccgat gacttcgtag 8580

gttcctagct caagccgctc gtgtccaagc gtcacttacg attagctaat gattacggca 8640

tctaggaccg actagctaac taactagtac aattcgccct tgtgtgaggt atatatatcc 8700

tccgccgggg cacgtacggt acaattccca gcaggtgcgt aaatcgttac agtatattat 8760

ttccgcagcc gatcaaaaga agtttgcgcg tcgtcacggc actgacttct atttagggcg 8820

gccagagtag gctagcctgc tggaccctct gtgtcccgtc tatctcattc attcactcat 8880

cagctggtgc tctatttttc tctccctaat taagctggtg gaaatttcgt gcttttcgtt 8940

tgcaccgtgt gccattggat cggatctgat atatatgcgc gcggccgtcc gagaccttat 9000

tactcgtcac cttcttcaac ctaacccccc cccccccccc tttaatttgc tagccctaac 9060

tggcaccata tatcattttg cccacacata ataaacgact cctttgccaa ctgcaccagt 9120

cacttggcaa acgactaatt acactcggca aaaggttttg tcgtgtgcca cactcgtcaa 9180

agagctcttg gtgaaacaaa cgccggtaac gatctctttg ctgagcgcca acatactcgg 9240

catagaaggt acattaacgg cgggcatcat ggtgatggga aactaccttt ttgcctagtg 9300

tattgttttg ccgaggaggg tggtattcgg caaatcatat atttgccgag tgctcgctct 9360

cagcaaacgt gtgagcactc ggcaaagagc gtgtctccag ttgtgttgtc attatctatt 9420

ttttttaacc tacggcatgc caccaccaaa ggttttaatt gtcaggaact ttctgtattg 9480

tagttttaat tatataagtt gttctccatc aggacttcga ggtcatgttg tatgatcaat 9540

ggaagaatct caagacggac tcaaagagtg gactctcgtc atggattaaa ggtgtattta 9600

ctgcaaggaa aagtgcaggc aatattcagt actcgagaga atctacattt tactcttagc 9660

tcatcaccta tgtgggatag gtgaaggcgt gaagcactcc gagtcttctt ggctattcaa 9720

agtttccttt tcactttgct ttccttttgg tgtattatag cacacagttt tctatgggtc 9780

aagggcgaat tgtagaatta attcattccg attaatcgtg gcctcttgct cttcaggatg 9840

aagagctatg tttaaacgtg caagcgctac tagacaattc agtacattaa aaacgtccgc 9900

aatgtgttat taagttgtct aagcgtcaat ttgtttacac cacaatatat cctgccacca 9960

gccagccaac agctccccga ccggcagctc ggcacaaaat caccactcga tacaggcagc 10020

ccatcagtcc gggacggcgt cagcgggaga gccgttgtaa ggcggcagac tttgctcatg 10080

ttaccgatgc tattcggaag aacggcaact aagctgccgg gtttgaaaca cggatgatct 10140

cgcggagggt agcatgttga ttgtaacgat gacagagcgt tgctgcctgt gatcaaatat 10200

catctccctc gcagagatcc gaattatcag ccttcttatt catttctcgc ttaaccgtga 10260

caggctgtcg atcttgagaa ctatgccgac ataataggaa atcgctggat aaagccgctg 10320

aggaagctga gtggcgctat ttctttagaa gtgaacgttg acgatcgtcg accgtacccc 10380

gatgaattaa ttcggacgta cgttctgaac acagctggat acttacttgg gcgattgtca 10440

tacatgacat caacaatgta cccgtttgtg taaccgtctc ttggaggttc gtatgacact 10500

agtggttccc ctcagcttgc gactagatgt tgaggcctaa cattttatta gagagcaggc 10560

tagttgctta gatacatgat cttcaggccg ttatctgtca gggcaagcga aaattggcca 10620

tttatgacga ccaatgcccc gcagaagctc ccatctttgc cgccatagac gccgcgcccc 10680

ccttttgggg tgtagaacat ccttttgcca gatgtggaaa agaagttcgt tgtcccattg 10740

ttggcaatga cgtagtagcc ggcgaaagtg cgagacccat ttgcgctata tataagccta 10800

cgatttccgt tgcgactatt gtcgtaattg gatgaactat tatcgtagtt gctctcagag 10860

ttgtcgtaat ttgatggact attgtcgtaa ttgcttatgg agttgtcgta gttgcttgga 10920

gaaatgtcgt agttggatgg ggagtagtca tagggaagac gagcttcatc cactaaaaca 10980

attggcaggt cagcaagtgc ctgccccgat gccatcgcaa gtacgaggct tagaaccacc 11040

ttcaacagat cgcgcatagt cttccccagc tctctaacgc ttgagttaag ccgcgccgcg 11100

aagcggcgtc ggcttgaacg aattgttaga cattatttgc cgactacctt ggtgatctcg 11160

cctttcacgt agtgaacaaa ttcttccaac tgatctgcgc gcgaggccaa gcgatcttct 11220

tgtccaagat aagcctgcct agcttcaagt atgacgggct gatactgggc cggcaggcgc 11280

tccattgccc agtcggcagc gacatccttc ggcgcgattt tgccggttac tgcgctgtac 11340

caaatgcggg acaacgtaag cactacattt cgctcatcgc cagcccagtc gggcggcgag 11400

ttccatagcg ttaaggtttc atttagcgcc tcaaatagat cctgttcagg aaccggatca 11460

aagagttcct ccgccgctgg acctaccaag gcaacgctat gttctcttgc ttttgtcagc 11520

aagatagcca gatcaatgtc gatcgtggct ggctcgaaga tacctgcaag aatgtcattg 11580

cgctgccatt ctccaaattg cagttcgcgc ttagctggat aacgccacgg aatgatgtcg 11640

tcgtgcacaa caatggtgac ttctacagcg cggagaatct cgctctctcc aggggaagcc 11700

gaagtttcca aaaggtcgtt gatcaaagct cgccgcgttg tttcatcaag ccttacagtc 11760

accgtaacca gcaaatcaat atcactgtgt ggcttcaggc cgccatccac tgcggagccg 11820

tacaaatgta cggccagcaa cgtcggttcg agatggcgct cgatgacgcc aactacctct 11880

gatagttgag tcgatacttc ggcgatcacc gcttccctca tgatgtttaa ctcctgaatt 11940

aagccgcgcc gcgaagcggt gtcggcttga atgaattgtt aggcgtcatc ctgtgctccc 12000

gagaaccagt accagtacat cgctgtttcg ttcgagactt gaggtctagt tttatacgtg 12060

aacaggtcaa tgccgccgag agtaaagcca cattttgcgt acaaattgca ggcaggtaca 12120

ttgttcgttt gtgtctctaa tcgtatgcca aggagctgtc tgcttagtgc ccactttttc 12180

gcaaattcga tgagactgtg cgcgactcct ttgcctcggt gcgtgtgcga cacaacaatg 12240

tgttcgatag aggctagatc gttccatgtt gagttgagtt caatcttccc gacaagctct 12300

tggtcgatga atgcgccata gcaagcagag tcttcatcag agtcatcatc cgagatgtaa 12360

tccttccggt aggggctcac acttctggta gatagttcaa agccttggtc ggataggtgc 12420

acatcgaaca cttcacgaac aatgaaatgg ttctcagcat ccaatgtttc cgccacctgc 12480

tcagggatca ccgaaatctt catatgacgc ctaacgcctg gcacagcgga tcgcaaacct 12540

ggcgcggctt ttggcacaaa aggcgtgaca ggtttgcgaa tccgttgctg ccacttgtta 12600

acccttttgc cagatttggt aactataatt tatgttagag gcgaagtctt gggtaaaaac 12660

tggcctaaaa ttgctgggga tttcaggaaa gtaaacatca ccttccggct cgatgtctat 12720

tgtagatata tgtagtgtat ctacttgatc gggggatctg ctgcctcgcg cgtttcggtg 12780

atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 12840

cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg 12900

gcgcagccat gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc 12960

atcagagcag attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt 13020

aaggagaaaa taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 13080

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 13140

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 13200

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 13260

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 13320

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 13380

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 13440

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 13500

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 13560

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 13620

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 13680

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 13740

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 13800

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 13860

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 13920

ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 13980

tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 14040

atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 14100

tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 14160

aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 14220

catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 14280

gcgcaacgtt gttgccattg ctgcaggggg gggggggggg ggggacttcc attgttcatt 14340

ccacggacaa aaacagagaa aggaaacgac agaggccaaa aagcctcgct ttcagcacct 14400

gtcgtttcct ttcttttcag agggtatttt aaataaaaac attaagttat gacgaagaag 14460

aacggaaacg ccttaaaccg gaaaattttc ataaatagcg aaaacccgcg aggtcgccgc 14520

cccgtaacct gtcggatcac cggaaaggac ccgtaaagtg ataatgatta tcatctacat 14580

atcacaacgt gcgtggaggc catcaaacca cgtcaaataa tcaattatga cgcaggtatc 14640

gtattaattg atctgcatca acttaacgta aaaacaactt cagacaatac aaatcagcga 14700

cactgaatac ggggcaacct catgtccccc cccccccccc ccctgcaggc atcgtggtgt 14760

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 14820

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 14880

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 14940

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 15000

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaacacgg gataataccg 15060

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 15120

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 15180

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 15240

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 15300

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 15360

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 15420

acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc 15480

cctttcgtct tcaagaattg gtcgacgatc ttgctgcgtt cggatatttt cgtggagttc 15540

ccgccacaga cccggattga aggcgagatc cagcaactcg cgccagatca tcctgtgacg 15600

gaactttggc gcgtgatgac tggccaggac gtcggccgaa agagcgacaa gcagatcacg 15660

cttttcgaca gcgtcggatt tgcgatcgag gatttttcgg cgctgcgcta cgtccgcgac 15720

cgcgttgagg gatcaagcca cagcagccca ctcgaccttc tagccgaccc agacgagcca 15780

agggatcttt ttggaatgct gctccgtcgt caggctttcc gacgtttggg tggttgaaca 15840

gaagtcatta tcgtacggaa tgccaagcac tcccgagggg aaccctgtgg ttggcatgca 15900

catacaaatg gacgaacgga taaacctttt cacgcccttt taaatatccg ttattctaat 15960

aaacgctctt ttctcttag 15979

<210> SEQ ID NO 77

<211> LENGTH: 1097

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(HR1)

<400> SEQUENCE: 77

gtgtgaggta tatatatcct ccgccggggc acgtacggta caattcccag caggtgcgta 60

aatcgttaca gtatattatt tccgcagccg atcaaaagaa gtttgcgcgt cgtcacggca 120

ctgacttcta tttagggcgg ccagagtagg ctagcctgct ggaccctctg tgtcccgtct 180

atctcattca ttcactcatc agctggtgct ctatttttct ctccctaatt aagctggtgg 240

aaatttcgtg cttttcgttt gcaccgtgtg ccattggatc ggatctgata tatatgcgcg 300

cggccgtccg agaccttatt actcgtcacc ttcttcaacc taaccccccc ccccccccct 360

ttaatttgct agccctaact ggcaccatat atcattttgc ccacacataa taaacgactc 420

ctttgccaac tgcaccagtc acttggcaaa cgactaatta cactcggcaa aaggttttgt 480

cgtgtgccac actcgtcaaa gagctcttgg tgaaacaaac gccggtaacg atctctttgc 540

tgagcgccaa catactcggc atagaaggta cattaacggc gggcatcatg gtgatgggaa 600

actacctttt tgcctagtgt attgttttgc cgaggagggt ggtattcggc aaatcatata 660

tttgccgagt gctcgctctc agcaaacgtg tgagcactcg gcaaagagcg tgtctccagt 720

tgtgttgtca ttatctattt tttttaacct acggcatgcc accaccaaag gttttaattg 780

tcaggaactt tctgtattgt agttttaatt atataagttg ttctccatca ggacttcgag 840

gtcatgttgt atgatcaatg gaagaatctc aagacggact caaagagtgg actctcgtca 900

tggattaaag gtgtatttac tgcaaggaaa agtgcaggca atattcagta ctcgagagaa 960

tctacatttt actcttagct catcacctat gtgggatagg tgaaggcgtg aagcactccg 1020

agtcttcttg gctattcaaa gtttcctttt cactttgctt tccttttggt gtattatagc 1080

acacagtttt ctatggg 1097

<210> SEQ ID NO 78

<211> LENGTH: 1242

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4(HR2)

<400> SEQUENCE: 78

gcggccgcgt tggacggcac gacgtagggg gggaactcgt cgcggccgtt ggcggcagcc 60

gataggttca tcatcttccc acgccccggc cggcacgttg acacgatatc tcgatcgatc 120

ggtcggcccg gccgcccgat ctgtgcaggt gcaggtcgcg cgggcgggcg cggctagctg 180

ggagcctggg aggggaggag gggccggagg gagcaggagc aggagtgccc gcgcgcgcac 240

acacgagcac acggaaatgg atgcgtaggg gacggagggg aggacggcgg caaggacagc 300

gcttagcgga gagctcggtg gagatcgatc gctcagtcgg tcgcaggtgg acgagcgaca 360

gacagagcta atacggcggt gttggccggc cgggccatgg tgtgggtgat ggcgatgaca 420

cagatatatg cgcgggcggg cgccagctag tagccgggca gctagcgcgc gcccttctcg 480

gccggccgga tctcttttgc tgcggagagg gagagggcga gagcgagggc cgagagcgcg 540

agtacttttg gtctagggtt ccatggaacg agtggtggtg gagtgagttt tgggctatat 600

ctaagagcga cgcccgcagc tcagctagca acaaaccggg cgctggcact gacagggccg 660

gccagtagag agagagagag agatctttaa ttggagttgg tgagtggtga tagcagccgc 720

agctgctgct gctgctgctg ctgcttcttt gtggttggtt tggttcgccc tccatttttc 780

ctcacccggg agtccgtatg ctatctgcta tatgctaggc tggctgtggg tgtctatgta 840

tgtatcctcc tcctccgttg aaacaacgta gcgtacgaca ctgctgcatg tgtggccttg 900

aagatatgag tatctatgca tgatgcggat gctgtacatg tgcattgcat cgctcttctc 960

tctccctgcg gtggtgtgta cgagacggtg ggtacgtacg ctaacgctag cagctgcctc 1020

ccgcgtgacg ccaggggcag cccggccgga cgcggtgttt cgttccatga ccatgaggcg 1080

tgatctcagt catggcccaa ctacgccagg ggcttgcatt gcccgcgcgc gtagactttg 1140

cacctgcgca tgtatgtatg tacatcctgc agcgcacagc agactgagct gtattcccgc 1200

gcaaatgagt agcagcgcac gtatatatac gcgtacgcgt ac 1242

<210> SEQ ID NO 79

<211> LENGTH: 18

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: LIG3-4 target site qPCR probe

<400> SEQUENCE: 79

atacctcaca cgtacgcg 18

<210> SEQ ID NO 80

<211> LENGTH: 20

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: Lig3-4_forward primer

<400> SEQUENCE: 80

gatttacgca cctgctggga 20

<210> SEQ ID NO 81

<211> LENGTH: 20

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: Lig3-4_reverse primer

<400> SEQUENCE: 81

ctgagctgta ttcccgcgca 20

<210> SEQ ID NO 82

<211> LENGTH: 3716

<212> TYPE: DNA

<213> ORGANISM: Saccharomyces cerevisiae

<400> SEQUENCE: 82

ctactagaaa gaagggcgtc aaaaatctgc tatgctatta cgaaattact ctataggtta 60

gaatgtcatc ccataggtat ggccctttta aagaagtatc tccacaatca attgcgagaa 120

gcctatctag aatcaaaacg acactttatt tccaaaaagg gagattcgac caacacttcc 180

tctaccattg catcatcatc tttcgctggc gcatctgttc ctctatcttc aaacgaatca 240

ggaatgctaa acggcttgaa gcaaattaac gaacaacaag aatctacatt agaaaccact 300

caaaaggaag actagtaacg ccgtatcgtg attaacgtat tacataagtt acaggattca 360

tgcttatggg ttagctattt cgcccaatgt gtccatctga cattactatt ttgcatttta 420

atttaattag aacttgacta gcgcactacc agtatatcat ctcatttccg taaataccaa 480

atgtattata tattgaaagc ttttgaccag gttattataa aagaaacttc atgctcgaaa 540

aagatcattt cgaaaagttg cctagtttca tgaaatttta aagcagttta tataaatttt 600

accttttgat gcggaattga ctttttcttg aataatacat aacttttctt aaaagaatca 660

aagacagata aaatttaaga gatattaaat attagtgaga agccgagaat tttgtaacac 720

caacataaca ctgacatctt taacaacttt taattatgat acatttctta cgtcatgatt 780

gattattaca gctatgctga caaatgactc ttgttgcatg gctacgaacc gggtaatact 840

aagtgattga ctcttgctga ccttttatta agaactaaat ggacaatatt atggagcatt 900

tcatgtataa attggtgcgt aaaatcgttg gatctctctt ctaagtacat cctactataa 960

caatcaagaa aaacaagaaa atcggacaaa acaatcaagt atggattcta gaacagttgg 1020

tatattagga gggggacaat tgggacgtat gattgttgag gcagcaaaca ggctcaacat 1080

taagacggta atactagatg ctgaaaattc tcctgccaaa caaataagca actccaatga 1140

ccacgttaat ggctcctttt ccaatcctct tgatatcgaa aaactagctg aaaaatgtga 1200

tgtgctaacg attgagattg agcatgttga tgttcctaca ctaaagaatc ttcaagtaaa 1260

acatcccaaa ttaaaaattt acccttctcc agaaacaatc agattgatac aagacaaata 1320

tattcaaaaa gagcatttaa tcaaaaatgg tatagcagtt acccaaagtg ttcctgtgga 1380

acaagccagt gagacgtccc tattgaatgt tggaagagat ttgggttttc cattcgtctt 1440

gaagtcgagg actttggcat acgatggaag aggtaacttc gttgtaaaga ataaggaaat 1500

gattccggaa gctttggaag tactgaagga tcgtcctttg tacgccgaaa aatgggcacc 1560

atttactaaa gaattagcag tcatgattgt gagatctgtt aacggtttag tgttttctta 1620

cccaattgta gagactatcc acaaggacaa tatttgtgac ttatgttatg cgcctgctag 1680

agttccggac tccgttcaac ttaaggcgaa gttgttggca gaaaatgcaa tcaaatcttt 1740

tcccggttgt ggtatatttg gtgtggaaat gttctattta gaaacagggg aattgcttat 1800

taacgaaatt gccccaaggc ctcacaactc tggacattat accattgatg cttgcgtcac 1860

ttctcaattt gaagctcatt tgagatcaat attggatttg ccaatgccaa agaatttcac 1920

atctttctcc accattacaa cgaacgccat tatgctaaat gttcttggag acaaacatac 1980

aaaagataaa gagctagaaa cttgcgaaag agcattggcg actccaggtt cctcagtgta 2040

cttatatgga aaagagtcta gacctaacag aaaagtaggt cacataaata ttattgcctc 2100

cagtatggcg gaatgtgaac aaaggctgaa ctacattaca ggtagaactg atattccaat 2160

caaaatctct gtcgctcaaa agttggactt ggaagcaatg gtcaaaccat tggttggaat 2220

catcatggga tcagactctg acttgccggt aatgtctgcc gcatgtgcgg ttttaaaaga 2280

ttttggcgtt ccatttgaag tgacaatagt ctctgctcat agaactccac ataggatgtc 2340

agcatatgct atttccgcaa gcaagcgtgg aattaaaaca attatcgctg gagctggtgg 2400

ggctgctcac ttgccaggta tggtggctgc aatgacacca cttcctgtca tcggtgtgcc 2460

cgtaaaaggt tcttgtctag atggagtaga ttctttacat tcaattgtgc aaatgcctag 2520

aggtgttcca gtagctaccg tcgctattaa taatagtacg aacgctgcgc tgttggctgt 2580

cagactgctt ggcgcttatg attcaagtta tacaacgaaa atggaacagt ttttattaaa 2640

gcaagaagaa gaagttcttg tcaaagcaca aaagttagaa actgtcggtt acgaagctta 2700

tctagaaaac aagtaatata taagtttatt gatatacttg tacagcaaat aattataaaa 2760

tgatatacct attttttagg ctttgttatg attacatcaa atgtggactt catacataga 2820

aatcaacgct tacaggtgtc cttttttaag aatttcatac ataagatcac ttattataca 2880

tacatacata tccagtaaca agaagcaagg aataattacc tgcttaagtc tgcgattaaa 2940

aaaataacgt ttcgatacag ttcatataag gcggctcaat gcagaaccga ggatagcgct 3000

acgtcaggat atctttgtag ttcccaaata taaatgcgac aatatagttt ctttctttca 3060

tatcaataat atccttttct ccactgaaat cacgaatcaa acctggagca aaaactaaag 3120

ccaaattata aagcgtcatt cgattccagt gactgtaccg tgtaaccttt tctatatgtt 3180

cactcagtac tcttaacacc ctataatgtt cccttggaag gtcttccaat atgtttttta 3240

aagcgctctt gctcgacata taagtgtccg aattctttgc ttctaaggac aactttcctc 3300

caacaaatgg caagttttcc atcatttttt tagatttaac taacctcatc aacggctcgt 3360

atatttgaaa ggtaaagata gggttgggga gctttcttaa gtatcgcttc aacacaccag 3420

taacaacgtt gagatcttgt tccgttaaaa tatttggcgt ttcggtattt tgttgtactt 3480

tccatgcaga aaattgcttt tctatttctt ctatgactag ctgggaacct gattttctat 3540

aaatgccctc cgatctcata ttttcttcgt ctgattcaat aaaatctatg cagacagata 3600

gtatcatcgg tatttcattg ttttcataat tgcacctagc aacgagactt gaaccataca 3660

aattgcttcc atccaaatat tcttctccat ctttactttg tcccatattt gcatcg 3716

<210> SEQ ID NO 83

<211> LENGTH: 1713

<212> TYPE: DNA

<213> ORGANISM: Saccharomyces cerevisiae

<400> SEQUENCE: 83

atggattcta gaacagttgg tatattagga gggggacaat tgggacgtat gattgttgag 60

gcagcaaaca ggctcaacat taagacggta atactagatg ctgaaaattc tcctgccaaa 120

caaataagca actccaatga ccacgttaat ggctcctttt ccaatcctct tgatatcgaa 180

aaactagctg aaaaatgtga tgtgctaacg attgagattg agcatgttga tgttcctaca 240

ctaaagaatc ttcaagtaaa acatcccaaa ttaaaaattt acccttctcc agaaacaatc 300

agattgatac aagacaaata tattcaaaaa gagcatttaa tcaaaaatgg tatagcagtt 360

acccaaagtg ttcctgtgga acaagccagt gagacgtccc tattgaatgt tggaagagat 420

ttgggttttc cattcgtctt gaagtcgagg actttggcat acgatggaag aggtaacttc 480

gttgtaaaga ataaggaaat gattccggaa gctttggaag tactgaagga tcgtcctttg 540

tacgccgaaa aatgggcacc atttactaaa gaattagcag tcatgattgt gagatctgtt 600

aacggtttag tgttttctta cccaattgta gagactatcc acaaggacaa tatttgtgac 660

ttatgttatg cgcctgctag agttccggac tccgttcaac ttaaggcgaa gttgttggca 720

gaaaatgcaa tcaaatcttt tcccggttgt ggtatatttg gtgtggaaat gttctattta 780

gaaacagggg aattgcttat taacgaaatt gccccaaggc ctcacaactc tggacattat 840

accattgatg cttgcgtcac ttctcaattt gaagctcatt tgagatcaat attggatttg 900

ccaatgccaa agaatttcac atctttctcc accattacaa cgaacgccat tatgctaaat 960

gttcttggag acaaacatac aaaagataaa gagctagaaa cttgcgaaag agcattggcg 1020

actccaggtt cctcagtgta cttatatgga aaagagtcta gacctaacag aaaagtaggt 1080

cacataaata ttattgcctc cagtatggcg gaatgtgaac aaaggctgaa ctacattaca 1140

ggtagaactg atattccaat caaaatctct gtcgctcaaa agttggactt ggaagcaatg 1200

gtcaaaccat tggttggaat catcatggga tcagactctg acttgccggt aatgtctgcc 1260

gcatgtgcgg ttttaaaaga ttttggcgtt ccatttgaag tgacaatagt ctctgctcat 1320

agaactccac ataggatgtc agcatatgct atttccgcaa gcaagcgtgg aattaaaaca 1380

attatcgctg gagctggtgg ggctgctcac ttgccaggta tggtggctgc aatgacacca 1440

cttcctgtca tcggtgtgcc cgtaaaaggt tcttgtctag atggagtaga ttctttacat 1500

tcaattgtgc aaatgcctag aggtgttcca gtagctaccg tcgctattaa taatagtacg 1560

aacgctgcgc tgttggctgt cagactgctt ggcgcttatg attcaagtta tacaacgaaa 1620

atggaacagt ttttattaaa gcaagaagaa gaagttcttg tcaaagcaca aaagttagaa 1680

actgtcggtt acgaagctta tctagaaaac aag 1713

<210> SEQ ID NO 84

<211> LENGTH: 3099

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: pHD1327 ade-de2 with ZM6.22-MHP42-MHP107-ZM6.3-LIG3-4

<400> SEQUENCE: 84

atggattcta gaacagttgg tatattagga gggggacaat tgggacgtat gattgttgag 60

gcagcaaaca ggctcaacat taagacggta atactagatg ctgaaaattc tcctgccaaa 120

caaataagca actccaatga ccacgttaat ggctcctttt ccaatcctct tgatatcgaa 180

aaactagctg aaaaatgtga tgtgctaacg attgagattg agcatgttga tgttcctaca 240

ctaaagaatc ttcaagtaaa acatcccaaa ttaaaaattt acccttctcc agaaacaatc 300

agattgatac aagacaaata tattcaaaaa gagcatttaa tcaaaaatgg tatagcagtt 360

acccaaagtg ttcctgtgga acaagccagt gagacgtccc tattgaatgt tggaagagat 420

ttgggttttc cattcgtctt gaagtcgagg actttggcat acgatggaag aggtaacttc 480

gttgtaaaga ataaggaaat gattccggaa gctttggaag tactgaagga tcgtcctttg 540

tacgccgaaa aatgggcacc atttactaaa gaattagcag tcatgattgt gagatctgtt 600

aacggtttag tgttttctta cccaattgta gagactatcc acaaggacaa tatttgtgac 660

ttatgttatg cgcctgctag agttccggac tccgttcaac ttaaggcgaa gttgttggca 720

gaaaatgcaa tcaaatcttt tcccggttgt ggtatatttg gtgtggaaat gttctattta 780

gaaacagggg aattgcttat taacgaaatt gccccaaggc ctcacaactc tggacattat 840

accattgatg cttgcgtcac ttctcaattt gaagctcatt tgagatcaat attggatttg 900

ccaatgccaa agaatttcac atctttctcc accattacaa cgaacgccat tatgctaaat 960

gttcttggag acaaacatac aaaagataaa gagctagaaa ggcgcgccat tgctctctca 1020

catactttta gtttctcacg cgtgagagcc ttctagtata cgtgagagac cttgcaggct 1080

ctcgtaaatg cgcctgctcg agatatacct cacacgtacg cgtagtgttg aagaaacatg 1140

aaattgccca gtattcttaa cccaactgca cagaacaaaa acctgcagga aacgaagata 1200

aatcatgtcg aaagctacat ataaggaacg tgctgctact catcctagtc ctgttgctgc 1260

caagctattt aatatcatgc acgaaaagca aacaaacttg tgtgcttcat tggatgttcg 1320

taccaccaag gaattactgg agttagttga agcattaggt cccaaaattt gtttactaaa 1380

aacacatgtg gatatcttga ctgatttttc catggagggc acagttaagc cgctaaaggc 1440

attatccgcc aagtacaatt ttttactctt cgaagacaga aaatttgctg acattggtaa 1500

tacagtcaaa ttgcagtact ctgcgggtgt atacagaata gcagaatggg cagacattac 1560

gaatgcacac ggtgtggtgg gcccaggtat tgttagcggt ttgaagcagg cggcagaaga 1620

agtaacaaag gaacctagag gccttttgat gttagcagaa ttgtcatgca agggctccct 1680

atctactgga gaatatacta agggtactgt tgacattgcg aagagcgaca aagattttgt 1740

tatcggcttt attgctcaaa gagacatggg tggaagagat gaaggttacg attggttgat 1800

tatgacaccc ggtgtgggtt tagatgacaa gggagacgca ttgggtcaac agtatagaac 1860

cgtggatgat gtggtctcta caggatctga cattattatt gttggaagag gactatttgc 1920

aaagggaagg gatgctaagg tagagggtga acgttacaga aaagcaggct gggaagcata 1980

tttgagaaga tgcggccagc aaaactaaaa aactgtatta taagtaaatg catgtatact 2040

aaactcacaa attagagctt caatttaatt atatcagtta ttacccttaa ggcgaagttg 2100

ttggcagaaa atgcaatcaa atcttttccc ggttgtggta tatttggtgt ggaaatgttc 2160

tatttagaaa caggggaatt gcttattaac gaaattgccc caaggcctca caactctgga 2220

cattatacca ttgatgcttg cgtcacttct caatttgaag ctcatttgag atcaatattg 2280

gatttgccaa tgccaaagaa tttcacatct ttctccacca ttacaacgaa cgccattatg 2340

ctaaatgttc ttggagacaa acatacaaaa gataaagagc tagaaacttg cgaaagagca 2400

ttggcgactc caggttcctc agtgtactta tatggaaaag agtctagacc taacagaaaa 2460

gtaggtcaca taaatattat tgcctccagt atggcggaat gtgaacaaag gctgaactac 2520

attacaggta gaactgatat tccaatcaaa atctctgtcg ctcaaaagtt ggacttggaa 2580

gcaatggtca aaccattggt tggaatcatc atgggatcag actctgactt gccggtaatg 2640

tctgccgcat gtgcggtttt aaaagatttt ggcgttccat ttgaagtgac aatagtctct 2700

gctcatagaa ctccacatag gatgtcagca tatgctattt ccgcaagcaa gcgtggaatt 2760

aaaacaatta tcgctggagc tggtggggct gctcacttgc caggtatggt ggctgcaatg 2820

acaccacttc ctgtcatcgg tgtgcccgta aaaggttctt gtctagatgg agtagattct 2880

ttacattcaa ttgtgcaaat gcctagaggt gttccagtag ctaccgtcgc tattaataat 2940

agtacgaacg ctgcgctgtt ggctgtcaga ctgcttggcg cttatgattc aagttataca 3000

acgaaaatgg aacagttttt attaaagcaa gaagaagaag ttcttgtcaa agcacaaaag 3060

ttagaaactg tcggttacga agcttatcta gaaaacaag 3099

<210> SEQ ID NO 85

<211> LENGTH: 22

<212> TYPE: DNA

<213> ORGANISM: Zea mays

<400> SEQUENCE: 85

gggcggtatg tatgtcatac ta 22

<210> SEQ ID NO 86

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77

<400> SEQUENCE: 86

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 87

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-01a)

<400> SEQUENCE: 87

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Ile Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 88

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-08a)

<400> SEQUENCE: 88

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 89

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-09a)

<400> SEQUENCE: 89

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Ile Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 90

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L73-02a)

<400> SEQUENCE: 90

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Leu Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Ile Gln Pro Phe Ile Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Ile Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 91

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L73-05a)

<400> SEQUENCE: 91

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Ile Gln Pro Phe Phe Lys Leu Lys Gln Lys Gln

275 280 285

Thr Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 92

<211> LENGTH: 349

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-01)

<400> SEQUENCE: 92

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Ile

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu His Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Trp Glu

145 150 155 160

Val Tyr Arg His Leu Arg His Pro Ala Pro His Pro Arg Leu Pro Gln

165 170 175

Ala Arg Val Gln Gly Ser Pro Lys His Ser Glu Leu Glu Gln Leu Ser

180 185 190

Pro Lys Ser Phe Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser

195 200 205

Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His Arg

210 215 220

Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp Phe

225 230 235 240

Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp Gln

245 250 255

Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His Asn

260 265 270

Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln Ala

275 280 285

Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser

290 295 300

Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Val Ala Ala

305 310 315 320

Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala

325 330 335

Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345

<210> SEQ ID NO 93

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-02)

<400> SEQUENCE: 93

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Val Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 94

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-03)

<400> SEQUENCE: 94

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Val Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Val Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 95

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-04)

<400> SEQUENCE: 95

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Cys Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Val Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 96

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-06)

<400> SEQUENCE: 96

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ser Asp Gly Cys Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp Val Asp Gln Val Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 97

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-09)

<400> SEQUENCE: 97

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Ile

1 5 10 15

Val Asp Ser Asp Gly Cys Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu His Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Val Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 98

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-10)

<400> SEQUENCE: 98

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Ile

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Ile Leu Leu Lys Leu His Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Val Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 99

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-11)

<400> SEQUENCE: 99

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Lys Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 100

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-12)

<400> SEQUENCE: 100

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Gly Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 101

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L112-03a)

<400> SEQUENCE: 101

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Ile Ala Gly Phe Val Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Ser Leu Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 102

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L113-01)

<400> SEQUENCE: 102

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Leu Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 103

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-01a)

<400> SEQUENCE: 103

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 104

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-02)

<400> SEQUENCE: 104

Met Asp Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Leu Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 105

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-04)

<400> SEQUENCE: 105

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 106

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-06)

<400> SEQUENCE: 106

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Gly Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Arg Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Ala Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Ala Val Arg

325 330 335

Ala Val Leu Asp Ser Ser Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 107

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-08a)

<400> SEQUENCE: 107

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asp Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Ser Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 108

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-10B1)

<400> SEQUENCE: 108

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 109

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-11)

<400> SEQUENCE: 109

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Pro Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 110

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L13-12)

<400> SEQUENCE: 110

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Gly Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 111

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-02)

<400> SEQUENCE: 111

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Gly Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 112

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-03)

<400> SEQUENCE: 112

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 113

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-05)

<400> SEQUENCE: 113

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 114

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-06)

<400> SEQUENCE: 114

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Ala Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 115

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-08)

<400> SEQUENCE: 115

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 116

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-10)

<400> SEQUENCE: 116

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 117

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-11)

<400> SEQUENCE: 117

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 118

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-12)

<400> SEQUENCE: 118

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 119

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-13)

<400> SEQUENCE: 119

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Ala Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 120

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-15)

<400> SEQUENCE: 120

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Ser Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 121

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-16)

<400> SEQUENCE: 121

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 122

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-18)

<400> SEQUENCE: 122

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asp Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 123

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-20)

<400> SEQUENCE: 123

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 124

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-21)

<400> SEQUENCE: 124

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Ser Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 125

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-23)

<400> SEQUENCE: 125

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Cys

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 126

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-24)

<400> SEQUENCE: 126

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Gly Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 127

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-28)

<400> SEQUENCE: 127

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Leu Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Asp Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 128

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-29)

<400> SEQUENCE: 128

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Gly Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 129

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-33)

<400> SEQUENCE: 129

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 130

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-34)

<400> SEQUENCE: 130

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Ser Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 131

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-35)

<400> SEQUENCE: 131

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asp Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 132

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-36)

<400> SEQUENCE: 132

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Arg Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Gly Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 133

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-39)

<400> SEQUENCE: 133

Met Asp Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 134

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-40)

<400> SEQUENCE: 134

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 135

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-41)

<400> SEQUENCE: 135

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 136

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-42)

<400> SEQUENCE: 136

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 137

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-43)

<400> SEQUENCE: 137

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Ser Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 138

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-45)

<400> SEQUENCE: 138

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 139

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-46)

<400> SEQUENCE: 139

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asp Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 140

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-27)

<400> SEQUENCE: 140

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 141

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-30)

<400> SEQUENCE: 141

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Leu Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 142

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-31)

<400> SEQUENCE: 142

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 143

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L15-47)

<400> SEQUENCE: 143

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Glu Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 144

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-01)

<400> SEQUENCE: 144

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 145

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-02)

<400> SEQUENCE: 145

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 146

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-03)

<400> SEQUENCE: 146

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ser Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 147

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-04)

<400> SEQUENCE: 147

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 148

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-05)

<400> SEQUENCE: 148

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 149

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-06)

<400> SEQUENCE: 149

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Ala Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 150

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-07)

<400> SEQUENCE: 150

Met Asp Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 151

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-08)

<400> SEQUENCE: 151

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu His Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asp Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 152

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-09)

<400> SEQUENCE: 152

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 153

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-11)

<400> SEQUENCE: 153

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 154

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-12)

<400> SEQUENCE: 154

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 155

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-14)

<400> SEQUENCE: 155

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Ala Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 156

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-15)

<400> SEQUENCE: 156

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Val Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Leu Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 157

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-16)

<400> SEQUENCE: 157

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Arg Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 158

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-17)

<400> SEQUENCE: 158

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Leu Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Val Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 159

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-18)

<400> SEQUENCE: 159

Met Asp Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Arg Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 160

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-19)

<400> SEQUENCE: 160

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Arg Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Lys Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Ala Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Pro Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Leu Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 161

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-21)

<400> SEQUENCE: 161

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Met Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Glu Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Tyr Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Phe Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 162

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-23)

<400> SEQUENCE: 162

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Arg

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Leu Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 163

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L16-24)

<400> SEQUENCE: 163

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Ile Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Arg Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Arg Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Ala Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 164

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L17-12)

<400> SEQUENCE: 164

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Leu Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 165

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L18-01)

<400> SEQUENCE: 165

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Ser Leu Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 166

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L18-12)

<400> SEQUENCE: 166

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Thr Ala Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Ala Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Ile Gln Pro Phe Phe Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 167

<211> LENGTH: 350

<212> TYPE: PRT

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L17-01)

<400> SEQUENCE: 167

Met Asn Thr Lys Tyr Asn Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe

1 5 10 15

Val Asp Ala Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Glu Gln Cys

20 25 30

Tyr Lys Phe Lys His Arg Leu Met Leu Thr Phe Thr Val Thr Gln Lys

35 40 45

Thr Gln Arg Arg Trp Phe Leu Asp Lys Ile Val Asp Glu Ile Gly Val

50 55 60

Gly Tyr Val Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Gln

65 70 75 80

Ile Lys Pro Leu His Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys

85 90 95

Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu

100 105 110

Pro Ser Ala Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp

115 120 125

Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr

130 135 140

Ser Glu Thr Val Arg Ala Val Leu Asp Ser Leu Pro Gly Ser Val Gly

145 150 155 160

Gly Leu Ser Pro Ser Gln Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser

165 170 175

Ser Pro Gly Ser Gly Ile Ser Glu Ala Leu Arg Ala Gly Ala Thr Lys

180 185 190

Ser Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly

195 200 205

Ser Ile Ile Ala Ser Ile Arg Pro Glu Gln Ser Arg Lys Phe Lys His

210 215 220

Arg Leu Glu Leu Arg Phe Thr Val Thr Gln Lys Thr Gln Arg Arg Trp

225 230 235 240

Phe Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Ile Tyr Asp

245 250 255

Gln Gly Ser Val Ser His Tyr Arg Leu Ser Gln Ile Lys Pro Leu His

260 265 270

Asn Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln

275 280 285

Ala Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu

290 295 300

Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala

305 310 315 320

Ala Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg

325 330 335

Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro

340 345 350

<210> SEQ ID NO 168

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77 yeast

<400> SEQUENCE: 168

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 169

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-01a)

<400> SEQUENCE: 169

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctacagccgt tcttaaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatatcta 600

gcggggtttg tcgatgctga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acgattacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 170

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-08a)

<400> SEQUENCE: 170

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacgctgat 60

ggcagcatta ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagttaccca gaagacgcag agacgctggt tcctcgacaa gatagtcgac 180

gagatcgggg tgggctacgt tcgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 171

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L72-09a)

<400> SEQUENCE: 171

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacgctgat 60

ggcagcatta ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcag aggcgctggt tcctcgacaa gatagtcgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag atacagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga ccaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 172

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L73-02a)

<400> SEQUENCE: 172

atgaacacca agtacaacaa ggagttcctg ctctatctag ccggcttcgt ggacgctgat 60

ggcagcttaa ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcag aggcgctggt tcctcgacaa gatagtcgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag atacagccgt tcattaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aactagtgga tgaaataggc gtcgggtata tttatgatca gggcagcgtc 780

agtcattata ggttgagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tttttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga tcaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 173

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L73-05a)

<400> SEQUENCE: 173

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctccatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcag aggcgctggt tcctcgacaa gctggtcgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

atcaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

aagttcctgg aggtgtgcac gtgggtcgac cagatcgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctggact ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgctga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tgtatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca aatacaaccc 840

ttctttaaat taaagcaaaa acaaacaaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtgga ccaaatagcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 174

<211> LENGTH: 1047

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-01)

<400> SEQUENCE: 174

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcatcgt ggacagcgac 60

ggctctatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgaga aggcgctggt tcctcgataa gctgcatgac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

aaaaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcgaacc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

ggtttcctgg aggtgtgcac gtgggtcgac caggttgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctgctgt ccctcccagg atcctgggag 480

gtctatcgcc atctcaggca tccagcgccg catcctcggc ttcctcaagc ccgggttcag 540

ggatctccga agcactcaga gctggagcaa ctaagtccaa agagtttttt atatttagcg 600

gggtttgtcg atggggatgg gagtataata gccagtatac gccccgaaca atcgcgtaaa 660

tttaaacatc gcttagaatt acgttttacg gtcacgcaaa aaacccaacg taggtggttt 720

ttagatgaat tagtggatga aataggcgtc gggtatgtat atgatcaggg cagcgtcagt 780

cattataggt taagtcaaat aaaaccgtta cataattttt taacgcaatt acaaccctac 840

ttaaaattaa aacaaaaaca agccaattta gtgttaaaaa taatagaaca attaccgagc 900

gcgaaagaaa gtcccgataa atttttagaa gtctgtacct gggcagatca agttgccgcg 960

ttaaatgatt caaaaacgag gaaaaccacg agcgaaaccg tcagagccgt gttagatagt 1020

ttatcagaaa agaagaaaag cagtccg 1047

<210> SEQ ID NO 175

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-02)

<400> SEQUENCE: 175

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacggcgac 60

ggctctatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgaga aggcgctgga tcctcgataa gctggtagac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

aaaaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcggtcc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac 360

ggtttcctgg aggtgtgcac gtgggtcgac caggttgcgg ccctcaacga cagcaagacc 420

cgcaagacga cctcggagac ggtgcgggcg gtcctgctgt ccctcccagg atccgtggga 480

ggtctatcgc catctcaggc atccagcgcc gcatcctcgg cttcctcaag cccgggttca 540

gggatctccg aagcactcag agctggagca actaagtcca aagagttttt attatattta 600

gcggggtttg tcgatgggga tgggagtata atagccagta tacgccccga acaatcgcgt 660

aaatttaaac atcgcttaga attacgtttt acggtcacgc aaaaaaccca acgtaggtgg 720

tttttagata aattagtgga tgaaataggc gtcgggtatg tatatgatca gggcagcgtc 780

agtcattata ggttaagtca aataaaaccg ttacataatt ttttaacgca attacaaccc 840

tacttaaaat taaaacaaaa acaagccaat ttagtgttaa aaataataga acaattaccg 900

agcgcgaaag aaagtcccga taaattttta gaagtctgta cctgggtaga tcaaattgcc 960

gcgttaaatg attcaaaaac gaggaaaacc acgagcgaaa ccgtcagagc cgtgttagat 1020

agtttatcag aaaagaagaa aagcagtccg 1050

<210> SEQ ID NO 176

<211> LENGTH: 1050

<212> TYPE: DNA

<213> ORGANISM: artificial sequence

<220> FEATURE:

<223> OTHER INFORMATION: MHP77(L9-03)

<400> SEQUENCE: 176

atgaacacca agtacaacaa ggagttcctg ctctacctgg ccggcttcgt ggacagcgac 60

ggctctatca ttgcgcagat caagccggaa cagtgctaca agttcaagca ccggctcatg 120

ctgaccttca cagtgaccca gaagacgcaa aggcgctgga tcctcttgaa gctggtagac 180

gagatcgggg tgggctacgt ccgtgaccgc gggtcggtgt ccgactacat cctctcccag 240

aaaaagcccc tgcacaactt cctcacccag ctccagccgt tcctcaagct gaagcagaag 300

caggcggtcc tcgtcctgaa gatcatcgag cagctcccct cggccaagga gtccccggac