Great research starts with great data.

Learn More
More >
Patent Analysis of

Subfamily E simian adenoviruses A1321, A1325, A1295, A1309, A1316 and A1322 and uses thereof

Updated Time 12 June 2019

Patent Registration Data

Publication Number

US10149873

Application Number

US15/172938

Application Date

03 June 2016

Publication Date

11 December 2018

Current Assignee

THE TRUSTEES OF THE UNIVERSITY OF PENNSYLVANIA

Original Assignee (Applicant)

THE TRUSTEES OF THE UNIVERSITY OF PENNSYLVANIA

International Classification

C12N15/861,C12N15/33,A61K35/761,C07K14/005,C07K14/075

Cooperative Classification

A61K35/761,C07K14/005,C12N7/00,C12N15/86,A61K48/00

Inventor

ROY, SOUMITRA,WILSON, JAMES M.

Abstract

Recombinant vectors comprise simian adenovirus A1321 (SAdV-A1321), SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and/or SAdV-A1322 sequences and a heterologous gene under the control of regulatory sequences. A cell line which expresses simian adenovirus SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and/or SAdV-A1322 gene(s) is also disclosed. Methods of using the vectors and cell lines are provided.

Read more

Claims

1. An adenovirus having a capsid comprising a hexon protein, a penton protein, and a fiber protein, wherein said hexon protein is the hexon protein of SAdV-A1321 with the amino acids 1 to 942 of SEQ ID NO: 10, and wherein said penton protein is the penton protein of SAdV-A1321 with the amino acids 1 to 540 of SEQ ID NO: 5; said capsid encapsidating a heterologous nucleic acid comprising a gene operably linked to expression control sequences which direct transcription, translation, and/or expression thereof in a host cell.

2. The adenovirus according to claim 1, further comprising a 5′ and a 3′ adenovirus cis-element necessary for replication and encapsidation.

3. The adenovirus according to claim 1, wherein said adenovirus lacks all or a part of the E1 gene.

4. The adenovirus according to claim 3, wherein said adenovirus is replication-defective.

5. The adenovirus according to claim 1, wherein said fiber protein is the fiber protein of SAdV-A1321 with the amino acids 1 to 445 of SEQ ID NO: 19.

6. The adenovirus according to claim 1, wherein said capsid is a hybrid capsid.

7. The adenovirus according to claim 6, wherein said hybrid capsid comprises at least one capsid protein from an adenovirus selected from SAdV-A1316, SAdV-1322, and SAdV-A1325.

8. A composition comprising the adenovirus according to claim 1 in a pharmaceutically acceptable carrier.

9. A composition comprising an adenovirus according to claim 1, said adenovirus further comprising at least one simian adenovirus protein selected from the group consisting of:

E1a, SEQ ID NO: 27; E1b, small T/19K, SEQ ID NO: 2; E1b, large T/55K, SEQ ID NO: 21; IX, SEQ ID NO: 3; IIIa, SEQ ID NO: 4; VII, SEQ ID NO: 6; V, SEQ ID NO: 7; pX, SEQ ID NO: 8; VI, SEQ ID NO: 9; Endoprotease, SEQ ID NO: 11; 100 kD, SEQ ID NO: 12; 22 kD, SEQ ID NO: 22; VIII, SEQ ID NO: 13; CR1-alpha SEQ ID NO: 14; gp19K, SEQ ID NO: 23; CR1-beta, SEQ ID NO: 15; CR1-gamma, SEQ ID NO: 16; CR1-delta, SEQ ID NO: 24; RID-alpha, SEQ ID NO: 17; RID-beta, SEQ ID NO: 18; E3/14.7K, SEQ ID NO: 25; and Fiber, SEQ ID NO: 19.

10. An adenovirus having a capsid comprising a hexon protein, a penton protein, and a fiber protein, wherein said hexon protein is the hexon protein of SAdV-A1321 with the amino acids 1 to 942 of SEQ ID NO: 10, and wherein said fiber protein is the fiber protein of SAdV-A1321 with the amino acids 1 to 445 of SEQ ID NO: 19; said capsid encapsidating a heterologous nucleic acid comprising a gene operably linked to expression control sequences which direct transcription, translation, and/or expression thereof in a host cell.

11. The adenovirus according to claim 10, further comprising a 5′ and a 3′ adenovirus cis-element necessary for replication and encapsidation.

12. The adenovirus according to claim 10, wherein said adenovirus lacks all or a part of the E1 gene.

13. The adenovirus according to claim 12, wherein said adenovirus is replication defective.

14. The adenovirus according to claim 10, wherein said capsid is a hybrid capsid.

15. The adenovirus according to claim 14, wherein said hybrid capsid comprises at least one capsid protein from an adenovirus selected from SAdV-A1316, SAdV-1322, and SAdV-A1325.

16. A composition comprising the adenovirus according to claim 10 in a pharmaceutically acceptable carrier.

17. A composition comprising an adenovirus according to claim 10, said adenovirus further comprising at least one simian adenovirus protein selected from the group consisting of:

E1a, SEQ ID NO: 27; E1b, small T/19K, SEQ ID NO: 2; E1b, large T/55K, SEQ ID NO: 21; IX, SEQ ID NO: 3; IIIa, SEQ ID NO: 4; Penton, SEQ ID NO: 5; VII, SEQ ID NO: 6; V, SEQ ID NO: 7; pX, SEQ ID NO: 8; VI, SEQ ID NO: 9; Endoprotease, SEQ ID NO: 11; 100 kD, SEQ ID NO: 12; 22 kD, SEQ ID NO: 22; VIII, SEQ ID NO: 13; CR1-alpha SEQ ID NO: 14; gp19K, SEQ ID NO: 23; CR1-beta, SEQ ID NO: 15; CR1-gamma, SEQ ID NO: 16; CR1-delta, SEQ ID NO: 24; RID-alpha, SEQ ID NO: 17; RID-beta, SEQ ID NO: 18; and E3/14.7K, SEQ ID NO: 25.

Read more

Claim Tree

  • 1
    1. An adenovirus having
    • a capsid comprising a hexon protein, a penton protein, and a fiber protein, wherein said hexon protein is the hexon protein of SAdV-A1321 with the amino acids 1 to 942 of SEQ ID NO: 10, and wherein said penton protein is the penton protein of SAdV-A1321 with the amino acids 1 to 540 of SEQ ID NO: 5
    • said capsid encapsidating a heterologous nucleic acid comprising a gene operably linked to expression control sequences which direct transcription, translation, and/or expression thereof in a host cell.
    • 2. The adenovirus according to claim 1, further comprising
      • a 5′ and a 3′ adenovirus cis-element necessary for replication and encapsidation.
    • 3. The adenovirus according to claim 1, wherein
      • said adenovirus lacks all or a part of the E1 gene.
    • 5. The adenovirus according to claim 1, wherein
      • said fiber protein is the fiber protein of SAdV-A1321 with the amino acids 1 to 445 of SEQ ID NO: 19.
    • 6. The adenovirus according to claim 1, wherein
      • said capsid is a hybrid capsid.
  • 8
    8. A composition comprising
    • the adenovirus according to claim 1 in a pharmaceutically acceptable carrier.
  • 9
    9. A composition comprising
    • an adenovirus according to claim 1, said adenovirus further comprising at least one simian adenovirus protein selected from the group consisting of: E1a, SEQ ID NO: 27
    • E1b, small T/19K, SEQ ID NO: 2
    • E1b, large T/55K, SEQ ID NO: 21
    • IX, SEQ ID NO: 3
    • IIIa, SEQ ID NO: 4
    • VII, SEQ ID NO: 6
    • V, SEQ ID NO: 7
    • pX, SEQ ID NO: 8
    • VI, SEQ ID NO: 9
    • Endoprotease, SEQ ID NO: 11
    • 100 kD, SEQ ID NO: 12
    • 22 kD, SEQ ID NO: 22
    • VIII, SEQ ID NO: 13
    • CR1-alpha SEQ ID NO: 14
    • gp19K, SEQ ID NO: 23
    • CR1-beta, SEQ ID NO: 15
    • CR1-gamma, SEQ ID NO: 16
    • CR1-delta, SEQ ID NO: 24
    • RID-alpha, SEQ ID NO: 17
    • RID-beta, SEQ ID NO: 18
    • E3/14.7K, SEQ ID NO: 25
    • and Fiber, SEQ ID NO: 19.
  • 10
    10. An adenovirus having
    • a capsid comprising a hexon protein, a penton protein, and a fiber protein, wherein said hexon protein is the hexon protein of SAdV-A1321 with the amino acids 1 to 942 of SEQ ID NO: 10, and wherein said fiber protein is the fiber protein of SAdV-A1321 with the amino acids 1 to 445 of SEQ ID NO: 19
    • said capsid encapsidating a heterologous nucleic acid comprising a gene operably linked to expression control sequences which direct transcription, translation, and/or expression thereof in a host cell.
    • 11. The adenovirus according to claim 10, further comprising
      • a 5′ and a 3′ adenovirus cis-element necessary for replication and encapsidation.
    • 12. The adenovirus according to claim 10, wherein
      • said adenovirus lacks all or a part of the E1 gene.
    • 14. The adenovirus according to claim 10, wherein
      • said capsid is a hybrid capsid.
  • 16
    16. A composition comprising
    • the adenovirus according to claim 10 in a pharmaceutically acceptable carrier.
  • 17
    17. A composition comprising
    • an adenovirus according to claim 10, said adenovirus further comprising at least one simian adenovirus protein selected from the group consisting of: E1a, SEQ ID NO: 27
    • E1b, small T/19K, SEQ ID NO: 2
    • E1b, large T/55K, SEQ ID NO: 21
    • IX, SEQ ID NO: 3
    • IIIa, SEQ ID NO: 4
    • Penton, SEQ ID NO: 5
    • VII, SEQ ID NO: 6
    • V, SEQ ID NO: 7
    • pX, SEQ ID NO: 8
    • VI, SEQ ID NO: 9
    • Endoprotease, SEQ ID NO: 11
    • 100 kD, SEQ ID NO: 12
    • 22 kD, SEQ ID NO: 22
    • VIII, SEQ ID NO: 13
    • CR1-alpha SEQ ID NO: 14
    • gp19K, SEQ ID NO: 23
    • CR1-beta, SEQ ID NO: 15
    • CR1-gamma, SEQ ID NO: 16
    • CR1-delta, SEQ ID NO: 24
    • RID-alpha, SEQ ID NO: 17
    • RID-beta, SEQ ID NO: 18
    • and E3/14.7K, SEQ ID NO: 25.
See all independent claims <>

Description

INCORPORATION-BY-REFERENCE OF MATERIAL SUBMITTED IN ELECTRONIC FORM

Applicant hereby incorporates by reference the Sequence Listing material filed in electronic form herewith. This file is labeled “X5708D2USA_ST25.txt”.

BACKGROUND OF THE INVENTION

Adenovirus is a double-stranded DNA virus with a genome size of about 36 kilobases (kb), which has been widely used for gene transfer applications due to its ability to achieve highly efficient gene transfer in a variety of target tissues and large transgene capacity. Conventionally, E1 genes of adenovirus are deleted and replaced with a transgene cassette consisting of the promoter of choice, cDNA sequence of the gene of interest and a poly A signal, resulting in a replication defective recombinant virus.

Adenoviruses have a characteristic morphology with an icosahedral capsid consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV), along with a number of other minor proteins, VI, VIII, IX, IIIa and IVa2 [W. C. Russell, J. Gen Virol., 81:2573-3704 (November 2000)]. The virus genome is a linear, double-stranded DNA with a terminal protein attached covalently to the 5′ terminus, which have inverted terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic protein VII and a small peptide pX (formerly termed mu). Another protein, V, is packaged with this DNA-protein complex and provides a structural link to the capsid via protein VI. The virus also contains a virus-encoded protease, which is necessary for processing of some of the structural proteins to produce mature infectious virus.

A classification scheme has been developed for the Mastadenovirus family, which includes human, simian, bovine, equine, porcine, ovine, canine and opossum adenoviruses. This classification scheme was developed based on the differing abilities of the adenovirus sequences in the family to agglutinate red blood cells. The result was six subgroups, now referred to as subgroups A, B, C, D, E and F. See, T. Shenk et al., Adenoviridae: The Viruses and their Replication”, Ch. 67, in FIELD'S VIROLOGY, 6th Ed., edited by B. N Fields et al, (Lippincott Raven Publishers, Philadelphia, 1996), p. 111-2112.

Recombinant adenoviruses have been described for delivery of heterologous molecules to host cells. See, U.S. Pat. No. 6,083,716, which describes the genome of two chimpanzee adenoviruses. Simian adenoviruses, C5, C6 and C7, have been described in U.S. Pat. No. 7,247,472 as being useful as vaccine vectors. Other chimpanzee adenoviruses are described in WO 2005/1071093 as being useful for making adenovirus vaccine carriers.

What is needed in the art are vectors which effectively deliver molecules to a target and minimize the effect of pre-existing immunity to selected adenovirus serotypes in the population.

SUMMARY OF THE INVENTION

Isolated nucleic acid sequences and amino acid sequences of six novel subfamily E simian adenoviruses, and vectors containing these sequences, are provided herein. Also provided are a number of methods for using the vectors and cells of the invention. These adenoviruses include SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322.

The methods described herein involve delivering one or more selected heterologous gene(s) to a mammalian patient by administering a vector of the invention. Use of the compositions described herein for vaccination permits presentation of a selected antigen for the elicitation of protective immune responses. The vectors based on these simian adenoviruses may also be used for producing heterologous gene products in vitro. Such gene products are themselves useful for a variety of purposes such as are described herein.

These and other embodiments and advantages of the invention are described in more detail below.

DETAILED DESCRIPTION OF THE INVENTION

Novel nucleic acid and amino acid sequences from simian adenovirus A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322, all of which were isolated from chimpanzee feces, are provided.

Also provided are novel adenovirus vectors and packaging cell lines to produce vectors based on these sequences for use in the in vitro production of recombinant proteins or fragments or other reagents. Further provided are compositions for use in delivering a heterologous molecule for therapeutic or vaccine purposes. Such therapeutic or vaccine compositions contain the adenoviral vectors carrying an inserted heterologous molecule. In addition, the novel SAdV sequences are useful in providing the essential helper functions required for production of recombinant adeno-associated viral (AAV) vectors. Thus, helper constructs, methods and cell lines which use these sequences in such production methods, are provided.

The term “substantial homology” or “substantial similarity,” when referring to a nucleic acid or fragment thereof, indicates that, when optimally aligned with appropriate nucleotide insertions or deletions with another nucleic acid (or its complementary strand), there is nucleotide sequence identity in at least about 95 to 99%, including about 96%, about 97%, about 98%, and about 99% of the aligned sequences.

The term “substantial homology” or “substantial similarity,” when referring to amino acids or fragments thereof, indicates that, when optimally aligned with appropriate amino acid insertions or deletions with another amino acid (or its complementary strand), there is amino acid sequence identity in at least about 95 to 99% %, including about 96%, about 97%, about 98%, and about 99%, of the aligned sequences. Preferably, the homology is over full-length sequence, or a protein thereof, or a fragment thereof which is at least 8 amino acids, or more desirably, at least 15 amino acids in length. Examples of suitable fragments are described herein.

The term “percent sequence identity” or “identical” in the context of nucleic acid sequences refers to the residues in the two sequences that are the same when aligned for maximum correspondence. Where gaps are required to align one sequence with another, the degree of scoring is calculated with respect to the longer sequence without penalty for gaps. Sequences that preserve the functionality of the polynucleotide or a polypeptide encoded thereby are more closely identical. The length of sequence identity comparison may be over the full-length of the genome (e.g., about 36 kbp), the full-length of an open reading frame of a gene, protein, subunit, or enzyme [see, e.g., the tables providing the adenoviral coding regions], or a fragment of at least about 500 to 5000 nucleotides, is desired. However, identity among smaller fragments, e.g. of at least about nine nucleotides, usually at least about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least about 36 or more nucleotides, may also be desired. Similarly, “percent sequence identity” may be readily determined for amino acid sequences, over the full-length of a protein, or a fragment thereof. Suitably, a fragment is at least about 8 amino acids in length, and may be up to about 700 amino acids. Examples of suitable fragments are described herein.

Identity is readily determined using such algorithms and computer programs as are defined herein at default settings. Preferably, such identity is over the full length of the protein, enzyme, subunit, or over a fragment of at least about 8 amino acids in length. However, identity may be based upon shorter regions, where suited to the use to which the identical gene product is being put.

As described herein, alignments are performed using any of a variety of publicly or commercially available Multiple Sequence Alignment Programs, such as “Clustal W”, accessible through Web Servers on the internet [Thompson et al, 1994, Nucleic Acids Res, 22, 4673-4680]. Alternatively, Vector NTI® utilities [InVitrogen] are also used. There are also a number of algorithms known in the art that can be used to measure nucleotide sequence identity, including those contained in the programs described above. As another example, polynucleotide sequences can be compared using Fasta, a program in GCG Version 6.1. Fasta provides alignments and percent sequence identity of the regions of the best overlap between the query and search sequences. For instance, percent sequence identity between nucleic acid sequences can be determined using Fasta with its default parameters (a word size of 6 and the NOPAM factor for the scoring matrix) as provided in GCG Version 6.1, herein incorporated by reference. Similarly programs are available for performing amino acid alignments. Generally, these programs are used at default settings, although one of skill in the art can alter these settings as needed. Alternatively, one of skill in the art can utilize another algorithm or computer program that provides at least the level of identity or alignment as that provided by the referenced algorithms and programs.

“Recombinant”, as applied to a polynucleotide, means that the polynucleotide is the product of various combinations of cloning, restriction or ligation steps, and other procedures that result in a construct that is distinct from a polynucleotide found in nature. A recombinant virus is a viral particle comprising a recombinant polynucleotide. The terms respectively include replicates of the original polynucleotide construct and progeny of the original virus construct.

Typically, “heterologous” means derived from a genotypically distinct entity from that of the rest of the entity to which it is being compared. A heterologous nucleic acid sequence refers to any nucleic acid sequence that is not isolated from, derived from, or based upon a naturally occurring nucleic acid sequence of the adenoviral vector. “Naturally occurring” means a sequence found in nature and not synthetically prepared or modified. A sequence is “derived” from a source when it is isolated from a source but modified (e.g., by deletion, substitution (mutation), insertion, or other modification) so as not to disrupt the normal function of the source gene. A sequence is “based upon” a source when the sequence is substantially similar to the source.

For example, a polynucleotide introduced by genetic engineering techniques into a plasmid or vector derived from a different species (and often a different genus, subfamily or family) is a heterologous polynucleotide. A promoter removed from its native coding sequence and operatively linked to a coding sequence with which it is not naturally found linked is a heterologous promoter. A specific recombination site that has been cloned into a genome of a virus or viral vector, wherein the genome of the virus does not naturally contain it, is a heterologous recombination site. A heterologous nucleic acid sequence also includes a sequence naturally found in an adenoviral genome, but located at a non-native position within the adenoviral vector. When a polynucleotide with an encoding sequence for a recombinase is used to genetically alter a cell that does not normally express the recombinase, both the polynucleotide and the recombinase are heterologous to the cell.

A heterologous vaccine refers to the situation where one virus or viral vector is introduced in order to induce immunity against a pathogenic virus of another species. In this case, the term “heterologous” refers an inoculating antigen and challenge antigen derived from viruses having different species, genus, subfamily, or family specificity.

As used throughout this specification and the claims, the term “comprise” and its variants including, “comprises”, “comprising”, among other variants, is inclusive of other components, elements, integers, steps and the like. The term “consists of” or “consisting of” are exclusive of other components, elements, integers, steps and the like.

I. The Simian Adenovirus Sequences

The invention provides nucleic acid sequences and amino acid sequences of simian adenovirus A1321 (SAdV-A1321), SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322, which are each isolated from the other material with which they are associated in nature.

A. Nucleic Acid Sequences

The SAdV-A1321 nucleic acid sequences provided herein include nucleotides 1 to 36546 of SEQ ID NO:1. The SAdV-A1325 nucleic acid sequences herein include nucleotides 1 to 36542 of SEQ ID NO:28. The SAdV-A1295 nucleic acid sequences provided herein include nucleotides 1 to 36643 of SEQ ID NO:57. The SAdV-A1309 nucleic acid sequences provided herein include nucleotides 1 to 36528 of SEQ ID NO:86. The SAdV-A1316 nucleic acid sequences provided herein include nucleotides 1 to 36667 of SEQ ID NO:114. The SAdV-A1322 nucleic acid sequences provided herein include nucleotides 1 to 36770 of SEQ ID NO:139. See, Sequence Listing, which is incorporated by reference herein.

In one embodiment, the nucleic acid sequences of the invention further encompass the strands which are complementary to the sequences of SEQ ID NO: 1, 28, 57, 86, 114, or 139, respectively, as well as the RNA and cDNA sequences corresponding to the sequences and their complementary strands. In another embodiment, the nucleic acid sequences further encompass sequences which are greater than 98.5% identical, and preferably, greater than about 99% identical, to the Sequence Listing. Also included in one embodiment, are natural variants and engineered modifications of the sequence provided in SEQ ID NO: 1, 28, 57, 86, 114, or 139 and their complementary strands. Such modifications include, for example, labels that are known in the art, methylation, and substitution of one or more of the naturally occurring nucleotides with a degenerate nucleotide.


TABLE 1
NUCLEIC ACID REGIONS
SAdV-
SAdV-
SAdV-
SAdV-
SAdV-
SAdV-
A1321 ORF
1325 ORF
A1295 ORF
A1309 ORF
A1316 ORF
A1322 ORF
SEQ ID
SEQ ID
SEQ ID
SEQ ID
SEQ ID
SEQ ID
Regions
NO: 1
NO: 28
NO: 57
NO: 86
NO: 114
NO: 139
ITR
 1 . . . 129
 1 . . . 129
 1 . . . 127
 1 . . . 129
 1 . . . 128
E1a
13S
(576 . . . 1140,
(882 . . . 1146,
(577 . . . 1141,
(576 . . . 1143,
(577 . . . 1144,
(576 . . . 1140,
12S
1234 . . . 1439)
1234 . . . 1439)
1235 . . . 1440)
1228 . . . 1433)
1229 . . . 1434)
1234 . . . 1439)
9S
E1b
Small
1601 . . . 2155
1604 . . . 2173
1602 . . . 2153
1599 . . . 2177
1605 . . . 2186
T/19K
Large
1906 . . . 3396
1909 . . . 3414
1907 . . . 3394
1904 . . . 3418
1905 . . . 3416
1910 . . . 3427
T/55K
IX
3484 . . . 3909
3452 . . . 3922
3481 . . . 3906
3456 . . . 3926
3504 . . . 3929
3512 . . . 3940
E2b
pTP
Complement
Complement
Complement
Complement
Complement
Complement
(8457 . . . 10385,
(8453 . . . 10378,
(8448 . . . 10376,
(8462 . . . 10393,
(8480 . . . 10408,
(8479 . . . 10422,
13829 . . . 13837)
13818 . . . 13826)
13809 . . . 13817)
13849 . . . 13857)
13841 . . . 13849)
13885 . . . 13893)
Poly-
Complement
Complement
Complement
Complement
Complement
Complement
merase
(5077 . . . 8655,
(5091 . . . 8651,
(5074 . . . 8646,
(5094 . . . 8660,
(5094 . . . 8678,
(5102 . . . 8680,
13829 . . . 13837)
13818 . . . 13826)
13809 . . . 13817)
13849 . . . 13857)
13841 . . . 13849)
13885 . . . 13893)
IVa2
Complement
Complement
Complement
Complement
Complement
Complement
(3974 . . . 5304,
(3988 . . . 5318,
(3971 . . . 5301,
(3991 . . . 5321,
(3994 . . . 5324,
(3999 . . . 5332,
5583 . . . 5595)
5597 . . . 5609)
5580 . . . 5592)
5600 . . . 5612)
5603 . . . 5615)
5611 . . . 5623)
L1
52/55D
10809 . . . 11984
10806 . . . 11984
10850 . . . 12022
10841 . . . 12019
10873 . . . 12054
IIIa
12025 . . . 13800
12011 . . . 13789
12011 . . . 13780
12049 . . . 13815
12046 . . . 13812
12078 . . . 13859
L2
Penton
13885 . . . 15504
13866 . . . 15491
13857 . . . 15458
13897 . . . 15516
13897 . . . 15846
13938 . . . 15566
VII
15511 . . . 16089
15498 . . . 16079
15466 . . . 16044
15523 . . . 16104
15490 . . . 16068
15570 . . . 16151
V
16134 . . . 17147
16127 . . . 17167
16092 . . . 17117
16152 . . . 17195
16113 . . . 17144
16193 . . . 17230
pX
17174 . . . 17404
17193 . . . 17423
17141 . . . 17371
17223 . . . 17453
17168 . . . 17398
17251 . . . 17484
L3
VI
17439 . . . 18200
17495 . . . 18220
17407 . . . 18180
17525 . . . 18253
17433 . . . 18206
17516 . . . 18289
Hexon
18309 . . . 21134
18260 . . . 21136
18290 . . . 21109
18348 . . . 21164
18312 . . . 21125
18396 . . . 21218
Endo-
21161 . . . 21781
21161 . . . 21781
21130 . . . 21756
21189 . . . 21809
21152 . . . 21784
21240 . . . 21869
protease
E2a
DBP
Complement
Complement
Complement
Complement
Complement
Complement
(21862 . . . 23391)
(21867 . . . 23399)
(21838 . . . 23373)
(21898 . . . 23433)
(21864 . . . 23399)
(21948 . . . 23486)
L4
100 kD
23420 . . . 25828
23425 . . . 25812
23399 . . . 25819
23456 . . . 25843
23428 . . . 25830
23509 . . . 25902
22 kD
25545 . . . 26105
25532 . . . 26092
25533 . . . 26096
25560 . . . 26129
25625 . . . 26167
VIII
26453 . . . 27133
26430 . . . 27110
26453 . . . 27133
26469 . . . 27149
26452 . . . 27132
26510 . . . 27190
E3
12.5K
27114 . . . 27431
27137 . . . 27545
27153 . . . 27470
27194 . . . 27511
CR1-
27411 . . . 28046
27388 . . . 28014
27411 . . . 28043
27427 . . . 28050
27410 . . . 28042
27468 . . . 28088
alpha
gp19K
28031 . . . 28558
27999 . . . 28526
28028 . . . 28555
28035 . . . 28565
28073 . . . 28597
CR1-
28591 . . . 29193
28559 . . . 29176
28589 . . . 29317
28598 . . . 29215
28532 . . . 29323
28637 . . . 29356
beta
CR1-
29209 . . . 29823
29192 . . . 29800
29333 . . . 29953
29231 . . . 29845
29339 . . . 29962
29372 . . . 29992
gamma
CR1-
29819 . . . 30736
29818 . . . 30681
29976 . . . 30848
29863 . . . 30741
29985 . . . 30881
29854 . . . 30891
delta
RID-
30748 . . . 31020
30692 . . . 30964
30860 . . . 31132
30893 . . . 31165
30902 . . . 31174
alpha
RID-
31029 . . . 31457
30973 . . . 31404
31135 . . . 31572
31033 . . . 31464
31174 . . . 31602
31183 . . . 31614
beta
14.7K
31453 . . . 31854
31400 . . . 31804
31568 . . . 31969
31460 . . . 31864
31598 . . . 31999
31610 . . . 32014
L5
Fiber
32117 . . . 33451
32101 . . . 33429
32227 . . . 33552
32137 . . . 33411
32253 . . . 33578
32270 . . . 33595
E4
Orf 6/7
Complement
Complement
Complement
Complement
Complement
Complement
(33550 . . . 33801,
(33543 . . . 33794,
(33646 . . . 33897,
(33523 . . . 33774,
(33671 . . . 33922,
(33722 . . . 33973,
34524 . . . 34697)
34526 . . . 34696)
34620 . . . 34793)
34506 . . . 34676)
34645 . . . 34818)
34699 . . . 34884)
Orf 6
Complement
Complement
Complement
Complement
Complement
Complement
(33801 . . . 34697)
(33794 . . . 34696)
(33897 . . . 34793)
(33774 . . . 34676)
(33922 . . . 34818)
(33970 . . . 34884)
Orf 4
Complement
Complement
Complement
Complement
Complement
Complement
(34790 . . . 35131)
(34605 . . . 34967)
(34669 . . . 35064)
(34585 . . . 34947)
(34911 . . . 35252)
(34775 . . . 35140)
Orf 3
Complement
Complement
Complement
Complement
Complement
Complement
(34980 . . . 35530)
(34980 . . . 35330)
(35076 . . . 35426)
(34960 . . . 35310)
(35101 . . . 35451)
(35149 . . . 35502)
Orf 2
Complement
Complement
Complement
Complement
Complement
Complement
(35330 . . . 35716)
(35330 . . . 35716)
(35426 . . . 35812)
(35310 . . . 35696)
(35451 . . . 35837)
(35499 . . . 35888)
Orf1
Complement
Complement
Complement
Complement
Complement
Complement
(35769 . . . 36140)
(35760 . . . 36131)
(35865 . . . 36236)
(35739 . . . 36110)
(35890 . . . 36261)
(35939 . . . 36316)
ITR
Complement
Complement
Complement
Complement
Complement
Complement
(36418 . . . 36546)
(36413 . . . 36542)
(36517 . . . 36643)
(36400 . . . 36528)
(36540 . . . 36667)
(36653 . . . 36674)

In one embodiment, fragments of the sequences of SEQ ID NO: 1, 28, 57, 86, 114, or 139 and their complementary strands, cDNA and RNA complementary thereto are provided, along with fragments that have substantial homology thereto. Suitable fragments are at least 15 nucleotides in length, and encompass functional fragments, i.e., fragments which are of biological interest. For example, a functional fragment can express a desired adenoviral product or may be useful in production of recombinant viral vectors. Such fragments include the gene sequences and fragments listed in the tables herein. The tables provide the transcript regions and open reading frames in the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322 sequences. For certain genes, the transcripts and open reading frames (ORFs) are located on the strand complementary to that presented in SEQ ID NO: 1, 28, 57, 86, 114, or 139. See, e.g., E2a, E2b, and E4. The calculated molecular weights of the encoded proteins are also shown. Note that the E1a open reading frame, E2b open reading frame, and E4 open reading frame contain internal splice sites. These splice sites are noted in the table above.

The SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 adenoviral nucleic acid sequences are useful as therapeutic agents and in construction of a variety of vector systems and host cells. As used herein, a vector includes any suitable nucleic acid molecule including, naked DNA, a plasmid, a virus, a cosmid, or an episome. These sequences and products may be used alone or in combination with other adenoviral sequences or fragments, or in combination with elements from other adenoviral or non-adenoviral sequences. The SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences are also useful as antisense delivery vectors, gene therapy vectors, or vaccine vectors. Thus, further provided are nucleic acid molecules, gene delivery vectors, and host cells which contain the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences.

For example, the invention encompasses a non-naturally occurring nucleic acid molecule containing simian Ad ITR sequences of the invention. “Non-naturally occurring” refers to sequences or genetic elements that cannot be found in nature and have been synthesized, rearranged, or modified through recombinant, genetic engineering, or other techniques, along with progeny from vectors and host cells containing same. In another example, the invention provides a nucleic acid molecule containing simian Ad sequences of the invention encoding a desired Ad gene product. Still other nucleic acid molecule constructed using the sequences of the invention will be readily apparent to one of skill in the art, in view of the information provided herein.

In one embodiment, the simian Ad gene regions identified herein may be used in a variety of vectors for delivery of a heterologous molecule to a cell. For example, vectors are generated for expression of an adenoviral capsid protein (or fragment thereof) for purposes of generating a viral vector in a packaging host cell. Such vectors may be designed for expression in trans. Alternatively, such vectors are designed to provide cells which stably contain sequences which express desired adenoviral functions, e.g., one or more of E1a, E1b, the terminal repeat sequences, E2a, E2b, E4, E4ORF6 region.

In addition, the adenoviral gene sequences and fragments thereof are useful for providing the helper functions necessary for production of helper-dependent viruses (e.g., adenoviral vectors deleted of essential functions, or adeno-associated viruses (AAV)). For such production methods, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences can be utilized in such a method in a manner similar to those described for the human Ad. However, due to the differences in sequences between the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences and those of human Ad, the use of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences greatly minimize or eliminate the possibility of homologous recombination with helper functions in a host cell carrying human Ad E1 functions, e.g., 293 cells, which may produce infectious adenoviral contaminants during rAAV production.

Methods of producing rAAV using adenoviral helper functions have been described at length in the literature with human adenoviral serotypes. See, e.g., U.S. Pat. No. 6,258,595 and the references cited therein. See, also, U.S. Pat. No. 5,871,982; WO 99/14354; WO 99/15685; WO 99/47691. These methods may also be used in production of non-human serotype AAV, including non-human primate AAV serotypes. The SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences which provide the necessary helper functions (e.g., E1a, E1b, E2a, E2b, DNA polymerase and/or E4 ORF6) can be particularly useful in providing the necessary adenoviral function while minimizing or eliminating the possibility of recombination with any other adenoviruses present in the rAAV-packaging cell which are typically of human origin. Thus, selected genes or open reading frames of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences may be utilized in these rAAV production methods.

Alternatively, recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors may be utilized in these methods. Such recombinant adenoviral simian vectors may include, e.g., a hybrid chimp Ad/AAV in which chimp Ad sequences flank a rAAV expression cassette composed of, e.g., AAV 3′ and/or 5′ ITRs and a transgene under the control of regulatory sequences which control its expression. One of skill in the art will recognize that still other simian adenoviral vectors and/or SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene sequences will be useful for production of rAAV and other viruses dependent upon adenoviral helper.

In still another embodiment, nucleic acid molecules are designed for delivery and expression of selected adenoviral gene products in a host cell to achieve a desired physiologic effect. For example, a nucleic acid molecule containing sequences encoding an SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 E1a protein may be delivered to a subject for use as a cancer therapeutic. Optionally, such a molecule is formulated in a lipid-based carrier and preferentially targets cancer cells. Such a formulation may be combined with other cancer therapeutics (e.g., cisplatin, taxol, or the like). Still other uses for the adenoviral sequences provided herein will be readily apparent to one of skill in the art.

In addition, one of skill in the art will readily understand that the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences can be readily adapted for use for a variety of viral and non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and immunogenic molecules. For example, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 simian Ad sequences can be utilized in a variety of rAd and non-rAd vector systems. Such vectors systems may include, e.g., plasmids, lentiviruses, retroviruses, poxviruses, vaccinia viruses, and adeno-associated viral systems, among others. Selection of these vector systems is not a limitation of the present invention.

The invention further provides molecules useful for production of the simian and simian-derived proteins of the invention. Such molecules which carry polynucleotides including the simian Ad DNA sequences of the invention can be in the form of naked DNA, a plasmid, a virus or any other genetic element.

B. SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 Adenoviral Proteins

Gene products of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 adenovirus, such as proteins, enzymes, and fragments thereof, which are encoded by the adenoviral nucleic acids described herein are provided. Further encompassed are SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 proteins, enzymes, and fragments thereof, having the amino acid sequences encoded by these nucleic acid sequences which are generated by other methods. Such proteins include those encoded by the open reading frames identified in the table above, the proteins identified in the Tables below with reference to SEQ ID NO, which are provided in the Sequence Listing, and sequences that have substantial homology thereto. Fragments of the proteins and polypeptides identified herein, along with fragments having substantial homology thereto, are also provided.


TABLE 2
PROTEIN SEQUENCES
SAdV-
SAdV-
SAdV-
SAdV-
SAdV-
SAdV-
A1321
A1325
A1295
A1309
A1316
A1322
SEQ ID
SEQ ID
SEQ ID
SEQ ID
SEQ ID
SEQ ID
Regions
NO:
NO:
NO:
NO:
NO:
NO:
E1a
13S
27
56
85
113
138
167
12S
9S
E1b
Small
2
29
58
87
140
T/19K
Large
21
51
80
108
115
161
T/55K
IX
3
30
59
88
116
141
L1
52/55D
31
60
89
117
142
IIIa
4
32
61
90
118
143
L2
Penton
5
33
62
91
119
144
VII
6
34
63
92
120
145
V
7
35
64
93
121
146
pX
8
36
65
94
122
147
L3
VI
9
37
66
95
123
148
Hexon
10
38
67
96
124
149
Endo-
11
39
68
97
125
150
protease
L4
100 kD
12
40
69
98
126
151
22 kD
22
52
81
109
162
VIII
13
41
70
99
127
152
E3
12.5k
42
71
100
153
CR1-
14
53
82
110
128
163
alpha
gp19K
23
43
72
101
154
CR1-
15
44
73
102
129
155
beta
CR1-
16
45
74
103
130
156
gamma
CR1-
24
46
75
104
131
164
delta
RID-
17
47
76
132
157
alpha
RID-
18
48
77
105
133
158
beta
14.7K
25
54
83
111
136
165
L5
Fiber
19
49
78
106
134
159

Thus, in one aspect, unique simian adenoviral proteins which are substantially pure, i.e., are free of other viral and proteinaceous proteins are provided. Preferably, these proteins are at least 10% homogeneous, more preferably 60% homogeneous, and most preferably 95% homogeneous.

In one embodiment, unique simian-derived capsid proteins are provided. As used herein, a simian-derived capsid protein includes any adenoviral capsid protein that contains a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid protein or a fragment thereof, as defined above, including, without limitation, chimeric capsid proteins, fusion proteins, artificial capsid proteins, synthetic capsid proteins, and recombinant capsid proteins, without limitation to means of generating these proteins. A capsid as described herein may be entirely of one of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322, may contain capsid proteins of more than one of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322, or may contain a capsid protein of another adenovirus.

Suitably, these simian-derived capsid proteins contain one or more SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 regions or fragments thereof (e.g., a hexon, penton, fiber, or fragment thereof) in combination with capsid regions or fragments thereof of different adenoviral serotypes, or modified simian capsid proteins or fragments, as described herein. A “modification of a capsid protein associated with altered tropism” as used herein includes an altered capsid protein, i.e., a penton, hexon or fiber protein region, or fragment thereof, such as the knob domain of the fiber region, or a polynucleotide encoding same, such that specificity is altered. The simian-derived capsid may be constructed with one or more of the simian Ad of the invention or another Ad serotype which may be of human or non-human origin. Such Ad may be obtained from a variety of sources including the ATCC, commercial and academic sources, or the sequences of the Ad may be obtained from GenBank or other suitable sources. The amino acid sequences of the penton proteins of SAdV-A1321 [SEQ ID NO: 5], SAdV-A1325 [SEQ ID NO: 33], SAdV-A1295 [SEQ ID NO: 62], SAdV-A1309 [SEQ ID NO: 91], SAdV-A1316 [SEQ ID NO: 119], or SAdV-A1322 [SEQ ID NO: 144], are provided. Suitably, this penton protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the penton having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO: 5, 33, 62, 91, 119, or 144. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein may be modified for a variety of purposes known to those of skill in the art.

Also provided is the amino acid sequence of the hexon proteins of SAdV-A1321 [SEQ ID NO: 10], SAdV-A1325 [SEQ ID NO: 38], SAdV-A1295 [SEQ ID NO: 67], SAdV-A1309 [SEQ ID NO: 96], SAdV-A1316 [SEQ ID NO: 124], or SAdV-A1322 [SEQ ID NO: 149]. Suitably, this hexon protein, or unique fragments thereof, may be utilized for a variety of purposes. Examples of suitable fragments include the hexon having N-terminal and/or C-terminal truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering provided above and in SEQ ID NO: 10, 38, 67, 96, 124, or 149. Other suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For example, one suitable fragment the loop region (domain) of the hexon protein, designated DE1 and FG1, or a hypervariable region thereof. Such fragments include the regions spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments, such as those spanning about residue 138 to residue 163; about 170 to about 176; about 195 to about 203; about 233 to about 246; about 253 to about 374; about 287 to about 297; and about 404 to about 430 of the simian hexon proteins, with reference to SEQ ID NO: 10, 38, 67, 96, 124, or 149. Other suitable fragments may be readily identified by one of skill in the art. Further, the hexon protein may be modified for a variety of purposes known to those of skill in the art. Because the hexon protein is the determinant for serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses having artificial serotypes. Other artificial capsid proteins can also be constructed using the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments thereof.

In one embodiment, an adenovirus having an altered hexon protein utilizing the sequence of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 hexon protein may be generated. One suitable method for altering hexon proteins is described in U.S. Pat. No. 5,922,315, which is incorporated by reference. In this method, at least one loop region of the adenovirus hexon is changed with at least one loop region of another adenovirus serotype. Thus, at least one loop region of such an altered adenovirus hexon protein is a simian Ad hexon loop region of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322. In one embodiment, a loop region of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 hexon protein is replaced by a loop region from another adenovirus serotype. In another embodiment, the loop region of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 hexon is used to replace a loop region from another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from among human and non-human serotypes, as described herein. The selection of a suitable serotype is not a limitation of the present invention. Still other uses for the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 hexon protein sequences will be readily apparent to those of skill in the art.

The amino acid sequence of the fiber protein of SAdV-A1321 [SEQ ID NO: 19], SAdV-A1325 [SEQ ID NO: 49], SAdV-A1295 [SEQ ID NO: 78], SAdV-A1309 [SEQ ID NO: 106], SAdV-A1316 [SEQ ID NO: 134], or SAdV-A1322 [SEQ ID NO: 159] are provided. Suitably, this fiber protein, or unique fragments thereof, may be utilized for a variety of purposes. One suitable fragment is the fiber knob, located within SEQ ID NO: 19, 49, 78, 106, 134, or 159. Examples of other suitable fragments include the fiber having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering provided in SEQ ID NO: 19, 49, 78, 106, 134, or 159. Still other suitable fragments include internal fragments. Further, the fiber protein may be modified using a variety of techniques known to those of skill in the art.

Unique fragments of the proteins of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 are at least 8 amino acids in length. However, fragments of other desired lengths can be readily utilized. In addition, modifications as may be introduced to enhance yield and/or expression of a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene product, e.g., construction of a fusion molecule in which all or a fragment of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene product is fused (either directly or via a linker) with a fusion partner to enhance are provided herein. Other suitable modifications include, without limitation, truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre- or pro-protein ordinarily cleaved and to provide the mature protein or enzyme and/or mutation of a coding region to provide a secretable gene product. Still other modifications will be readily apparent to one of skill in the art. Further encompassed are proteins having at least about 98%, about 99%, about 99.5%, or about 99.9 identity to SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 proteins provided herein.

As described herein, vectors of the invention containing the adenoviral capsid proteins of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 are particularly well suited for use in applications in which the neutralizing antibodies diminish the effectiveness of other Ad serotype based vectors, as well as other viral vectors. The rAd vectors are particularly advantageous in readministration for repeat gene therapy or for boosting immune response (vaccine titers).

Under certain circumstances, it may be desirable to use one or more of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene products (e.g., a capsid protein or a fragment thereof) to generate an antibody. The term “an antibody,” as used herein, refers to an immunoglobulin molecule which is able to specifically bind to an epitope. The antibodies may exist in a variety of forms including, for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies. Such antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE.

Such antibodies may be generated using any of a number of methods know in the art. Suitable antibodies may be generated by well-known conventional techniques, e.g., Kohler and Milstein and the many known modifications thereof. Similarly desirable high titer antibodies are generated by applying known recombinant techniques to the monoclonal or polyclonal antibodies developed to these antigens [see, e.g., PCT Patent Application No. PCT/GB85/00392; British Patent Application Publication No. GB2188638A; Amit et al., 1986 Science, 233:747-753; Queen et al., 1989 Proc. Nat'l. Acad. Sci. USA, 86:10029-10033; PCT Patent Application No. PCT/WO9007861; and Riechmann et al., Nature, 332:323-327 (1988); Huse et al, 1988a Science, 246:1275-1281]. Alternatively, antibodies can be produced by manipulating the complementarity determining regions of animal or human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin, “Humanization of Monoclonal Antibodies”, Chapter 4, The Handbook of Experimental Pharmacology, Vol. 113, The Pharmacology of Monoclonal Antibodies, Springer-Verlag (June 1994); Harlow et al., 1999, Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.; Harlow et al., 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, N.Y.; Houston et al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Bird et al., 1988, Science 242:423-437. Further provided by the present invention are anti-idiotype antibodies (Ab2) and anti-anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al., “Modulation of anti-tumor immunity by anti-idiotypic antibodies.” In Idiotypic Network and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol., Washington D.C.: pp. 203-229]. These anti-idiotype and anti-anti-idiotype antibodies are produced using techniques well known to those of skill in the art. These antibodies may be used for a variety of purposes, including diagnostic and clinical methods and kits.

Under certain circumstances, it may be desirable to introduce a detectable label or a tag onto a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene product, antibody or other construct of the invention. As used herein, a detectable label is a molecule which is capable, alone or upon interaction with another molecule, of providing a detectable signal. Most desirably, the label is detectable visually, e.g. by fluorescence, for ready use in immunohistochemical analyses or immunofluorescent microscopy. For example, suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin (APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PC5), and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available, and their uses known to the art. Other useful labels include a colloidal gold label. Still other useful labels include radioactive compounds or elements. Additionally, labels include a variety of enzyme systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase (which uses glucose as a substrate) releases peroxide as a product which in the presence of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an oxidized TMB that is seen as a blue color. Other examples include horseradish peroxidase (HRP), alkaline phosphatase (AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is detected as increased absorbance at 340 nm wavelength.

Other label systems that are utilized in the methods described herein are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, Indiana] in which a dye is embedded are used in place of enzymes to form conjugates with the target sequences to provide a visual signal indicative of the presence of the resulting complex in applicable assays.

Methods for coupling or associating the label with a desired molecule are similarly conventional and known to those of skill in the art. Known methods of label attachment are described [see, for example, Handbook of Fluorescent probes and Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, Oreg., 1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products, Pierce Chemical Company, Rockford, Ill., 1994/1995]. Thus, selection of the label and coupling methods do not limit this invention.

The sequences, proteins, and fragments of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 may be produced by any suitable means, including recombinant production, chemical synthesis, or other synthetic means. Suitable production techniques are well known to those of skill in the art. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, N.Y.). Alternatively, peptides can also be synthesized by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc., 85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62). These and other suitable production methods are within the knowledge of those of skill in the art and are not a limitation of the present invention.

In addition, one of skill in the art will readily understand that the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 sequences can be readily adapted for use for a variety of viral and non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and immunogenic molecules. For example, in one embodiment, the simian Ad capsid proteins and other simian adenovirus proteins described herein are used for non-viral, protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and immunogenic molecules. In one such embodiment, a protein of the invention is linked, directly or indirectly, to a molecule for targeting to cells with a receptor for adenoviruses. Preferably, a capsid protein such as a hexon, penton, fiber or a fragment thereof having a ligand for a cell surface receptor is selected for such targeting. Suitable molecules for delivery are selected from among the therapeutic molecules described herein and their gene products. A variety of linkers including, lipids, polyLys, and the like may be utilized as linkers. For example, the simian penton protein may be readily utilized for such a purpose by production of a fusion protein using the simian penton sequences in a manner analogous to that described in Medina-Kauwe L K, et al, Gene Ther. 2001 May; 8(10):795-803 and Medina-Kauwe L K, et al, Gene Ther. 2001 December; 8(23): 1753-1761. Alternatively, the amino acid sequences of simian Ad protein IX may be utilized for targeting vectors to a cell surface receptor, as described in US Patent Appln 20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or polylysine-containing sequence, and the like. Still other simian Ad proteins, including, e.g., the hexon protein and/or the fiber protein, may be used for used for these and similar purposes.

Still other SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 adenoviral proteins may be used as alone, or in combination with other adenoviral protein, for a variety of purposes which will be readily apparent to one of skill in the art. In addition, still other uses for the SAdV adenoviral proteins will be readily apparent to one of skill in the art.

II. Recombinant Adenoviral Vectors

The compositions described herein include vectors that deliver a heterologous molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector may include any genetic element including, without limitation, naked DNA, a phage, transposon, cosmid, episome, plasmid, or a virus. Such vectors contain simian adenovirus DNA of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 and a minigene. By “minigene” or “expression cassette” is meant the combination of a selected heterologous gene and the other regulatory elements necessary to drive translation, transcription and/or expression of the gene product in a host cell.

Typically, a SAdV-A1321-, SAdV-A1325-, SAdV-A1295-, SAdV-A1309-, SAdV-A1316-, or SAdV-A1322-derived adenoviral vector is designed such that the minigene is located in a nucleic acid molecule which contains other adenoviral sequences in the region native to a selected adenoviral gene. The minigene may be inserted into an existing gene region to disrupt the function of that region, if desired. Alternatively, the minigene may be inserted into the site of a partially or fully deleted adenoviral gene. For example, the minigene may be located in the site of such as the site of a functional E1 deletion or functional E3 deletion, among others that may be selected. The term “functionally deleted” or “functional deletion” means that a sufficient amount of the gene region is removed or otherwise damaged, e.g., by mutation or modification, so that the gene region is no longer capable of producing functional products of gene expression. If desired, the entire gene region may be removed. Other suitable sites for gene disruption or deletion are discussed elsewhere in the application.

For example, for a production vector useful for generation of a recombinant virus, the vector may contain the minigene and either the 5′ end of the adenoviral genome or the 3′ end of the adenoviral genome, or both the 5′ and 3′ ends of the adenoviral genome. The 5′ end of the adenoviral genome contains the 5′ cis-elements necessary for packaging and replication; i.e., the 5′ inverted terminal repeat (ITR) sequences (which function as origins of replication) and the native 5′ packaging enhancer domains (that contain sequences necessary for packaging linear Ad genomes and enhancer elements for the E1 promoter). The 3′ end of the adenoviral genome includes the 3′ cis-elements (including the ITRs) necessary for packaging and encapsidation. Suitably, a recombinant adenovirus contains both 5′ and 3′ adenoviral cis-elements and the minigene is located between the 5′ and 3′ adenoviral sequences. A SAdV-A1321-, SAdV-A1325-, SAdV-A1295-, SAdV-A1309-, SAdV-A1316-, or SAdV-A1322-based adenoviral vector may also contain additional adenoviral sequences.

Suitably, these SAdV-A1321-, SAdV-A1325-, SAdV-A1295-, SAdV-A1309-, SAdV-A1316-, or SAdV-A1322-based adenoviral vectors contain one or more adenoviral elements derived from the adenoviral genome of the invention. In one embodiment, the vectors contain adenoviral ITRs from SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 and additional adenoviral sequences from the same adenoviral serotype. In another embodiment, the vectors contain adenoviral sequences that are derived from a different adenoviral serotype than that which provides the ITRs.

As defined herein, a pseudotyped adenovirus refers to an adenovirus in which the capsid protein of the adenovirus is from a different adenovirus than the adenovirus which provides the ITRs.

Further, chimeric or hybrid adenoviruses may be constructed using the adenoviruses described herein using techniques known to those of skill in the art. See, e.g., U.S. Pat. No. 7,291,498.

The selection of the adenoviral source of the ITRs and the source of any other adenoviral sequences present in vector is not a limitation of the present embodiment. A variety of adenovirus strains are available from the American Type Culture Collection, Manassas, Va., or available by request from a variety of commercial and institutional sources. Further, the sequences of many such strains are available from a variety of databases including, e.g., PubMed and GenBank. Homologous adenovirus vectors prepared from other simian or from human adenoviruses are described in the published literature [see, for example, U.S. Pat. No. 5,240,846]. The DNA sequences of a number of adenovirus types are available from GenBank, including type Ad5 [GenBank Accession No. M73370]. The adenovirus sequences may be obtained from any known adenovirus serotype, such as serotypes 2, 3, 4, 7, 12 and 40, and further including any of the presently identified human types. Similarly adenoviruses known to infect non-human animals (e.g., simians) may also be employed in the vector constructs of this invention. See, e.g., U.S. Pat. No. 6,083,716.

The viral sequences, helper viruses (if needed), and recombinant viral particles, and other vector components and sequences employed in the construction of the vectors described herein are obtained as described above. The DNA sequences of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 simian adenovirus of the invention are employed to construct vectors and cell lines useful in the preparation of such vectors.

Modifications of the nucleic acid sequences forming the vectors of this invention, including sequence deletions, insertions, and other mutations may be generated using standard molecular biological techniques and are within the scope of this embodiment.

A. The “Minigene”

The methods employed for the selection of the transgene, the cloning and construction of the “minigene” and its insertion into the viral vector are within the skill in the art given the teachings provided herein.

1. The Transgene

The transgene is a nucleic acid sequence, heterologous to the vector sequences flanking the transgene, which encodes a polypeptide, protein, or other product, of interest. The nucleic acid coding sequence is operatively linked to regulatory components in a manner which permits transgene transcription, translation, and/or expression in a host cell.

The composition of the transgene sequence will depend upon the use to which the resulting vector will be put. For example, one type of transgene sequence includes a reporter sequence, which upon expression produces a detectable signal. Such reporter sequences include, without limitation, DNA sequences encoding β-lactamase, β-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins including, for example, CD2, CD4, CD8, the influenza hemagglutinin protein, and others well known in the art, to which high affinity antibodies directed thereto exist or can be produced by conventional means, and fusion proteins comprising a membrane bound protein appropriately fused to an antigen tag domain from, among others, hemagglutinin or Myc. These coding sequences, when associated with regulatory elements which drive their expression, provide signals detectable by conventional means, including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic assays, fluorescent activating cell sorting assays and immunological assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ gene, the presence of the vector carrying the signal is detected by assays for beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying the signal may be measured visually by color or light production in a luminometer.

In one embodiment, the transgene is a non-marker sequence encoding a product which is useful in biology and medicine, such as proteins, peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence in the treated animal

The transgene may be used for treatment, e.g., of genetic deficiencies, as a cancer therapeutic or vaccine, for induction of an immune response, and/or for prophylactic vaccine purposes. As used herein, induction of an immune response refers to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral immune response to the molecule. The invention further includes using multiple transgenes, e.g., to correct or ameliorate a condition caused by a multi-subunit protein. In certain situations, a different transgene may be used to encode each subunit of a protein, or to encode different peptides or proteins. This is desirable when the size of the DNA encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order for the cell to produce the multi-subunit protein, a cell is infected with the recombinant virus containing each of the different subunits. Alternatively, different subunits of a protein may be encoded by the same transgene. In this case, a single transgene includes the DNA encoding each of the subunits, with the DNA for each subunit separated by an internal ribozyme entry site (IRES). This is desirable when the size of the DNA encoding each of the subunits is small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five kilobases. As an alternative to an IRES, the DNA may be separated by sequences encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M. L. Donnelly, et al, J. Gen. Viral., 78(Pt 1):13-21 (January 1997); Furler, S., et al, Gene Ther., 8(11):864-873 (June 2001); Klump H., et al., Gene Ther., 8(10):811-817 (May 2001). This 2A peptide is significantly smaller than an IRES, making it well suited for use when space is a limiting factor. However, the selected transgene may encode any biologically active product or other product, e.g., a product desirable for study.

Suitable transgenes may be readily selected by one of skill in the art. The selection of the transgene is not considered to be a limitation of this embodiment.

2. Regulatory Elements

In addition to the major elements identified above for the minigene, the vector also includes conventional control elements necessary which are operably linked to the transgene in a manner that permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus produced by the invention. As used herein, “operably linked” sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest.

Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals including rabbit beta-globin polyA; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (e.g., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. Among other sequences, chimeric introns may be used.

A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and may be utilized. Examples of constitutive promoters include, without limitation, the TBG promoter, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter [Invitrogen].

Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. For example, inducible promoters include the zinc-inducible sheep metallothionine (MT) promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter. Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 378:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or p65 using castradiol, diphenol murislerone, the RU486-inducible system [Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 (1997)]. The effectiveness of some inducible promoters increases over time. In such cases one can enhance the effectiveness of such systems by inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES. Alternatively, one can wait at least 3 days before screening for the desired function. One can enhance expression of desired proteins by known means to enhance the effectiveness of this system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element (WPRE).

In another embodiment, the native promoter for the transgene will be used. The native promoter may be preferred when it is desired that expression of the transgene should mimic the native expression. The native promoter may be used when expression of the transgene must be regulated temporally or developmentally, or in a tissue-specific manner, or in response to specific transcriptional stimuli. In a further embodiment, other native expression control elements, such as enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used to mimic the native expression.

Another embodiment of the transgene includes a transgene operably linked to a tissue-specific promoter. For instance, if expression in skeletal muscle is desired, a promoter active in muscle should be used. These include the promoters from genes encoding skeletal β-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, as well as synthetic muscle promoters with activities higher than naturally occurring promoters (see Li et al., Nat. Biotech., 17:241-245 (1999)). Examples of promoters that are tissue-specific are known for liver (albumin, Miyatake et al., J. Vivol., 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin (Stein et al., Mol. Biol. Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al. J. Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal et al., J. Immunol., 161:1063-8 (1998); immunoglobulin heavy chain; T cell receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et al., Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene (Piccioli et al., Neuron, 15:373-84 (1995)), among others.

Optionally, vectors carrying transgenes encoding therapeutically useful or immunogenic products may also include selectable markers or reporter genes may include sequences encoding geneticin, hygromicin or purimycin resistance, among others. Such selectable reporters or marker genes (preferably located outside the viral genome to be packaged into a viral particle) can be used to signal the presence of the plasmids in bacterial cells, such as ampicillin resistance. Other components of the vector may include an origin of replication. Selection of these and other promoters and vector elements are conventional and many such sequences are available [see, e.g., Sambrook et al, and references cited therein].

These vectors are generated using the techniques and sequences provided herein, in conjunction with techniques known to those of skill in the art. Such techniques include conventional cloning techniques of cDNA such as those described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence.

III. Production of the Viral Vector

In one embodiment, the simian adenoviral plasmids (or other vectors) are used to produce adenoviral vectors. In one embodiment, the adenoviral vectors are adenoviral particles which are replication-defective. In one embodiment, the adenoviral particles are rendered replication-defective by deletions in the E1a and/or E1b genes. Alternatively, the adenoviruses are rendered replication-defective by another means, optionally while retaining the E1a and/or E1b genes. Similarly, in some embodiments, reduction of an immune response to the vector may be accomplished by deletions in the E2b and/or DNA polymerase genes. The adenoviral vectors can also contain other mutations to the adenoviral genome, e.g., temperature-sensitive mutations or deletions in other genes. In other embodiments, it is desirable to retain an intact E1a and/or E1b region in the adenoviral vectors. Such an intact E1 region may be located in its native location in the adenoviral genome or placed in the site of a deletion in the native adenoviral genome (e.g., in the E3 region).

In the construction of useful simian adenovirus vectors for delivery of a gene to the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can be employed in the vectors. For example, all or a portion of the adenovirus delayed early gene E3 may be eliminated from the simian adenovirus sequence which forms a part of the recombinant virus. The function of simian E3 is believed to be irrelevant to the function and production of the recombinant virus particle. Simian adenovirus vectors may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and more desirably because of the redundancy in the function of this region, the entire E4 region. Still another vector of this invention contains a deletion in the delayed early gene E2a. Deletions may also be made in any of the late genes L1 through L5 of the simian adenovirus genome. Similarly, deletions in the intermediate genes IX and IVa2 may be useful for some purposes. Other deletions may be made in the other structural or non-structural adenovirus genes. The above discussed deletions may be used individually, i.e., an adenovirus sequence for use as described herein may contain deletions in only a single region. Alternatively, deletions of entire genes or portions thereof effective to destroy their biological activity may be used in any combination. For example, in one exemplary vector, the adenovirus sequence may have deletions of the E1 genes and the E4 gene, or of the E1, E2a and E3 genes, or of the E1 and E3 genes, or of E1, E2a and E4 genes, with or without deletion of E3, and so on. As discussed above, such deletions may be used in combination with other mutations, such as temperature-sensitive mutations, to achieve a desired result.

An adenoviral vector lacking any essential adenoviral sequences (e.g., E1a, E1b, E2a, E2b, E4 ORF6, L1, L2, L3, L4 and L5) may be cultured in the presence of the missing adenoviral gene products which are required for viral infectivity and propagation of an adenoviral particle. These helper functions may be provided by culturing the adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or virus) or a packaging host cell. See, for example, the techniques described for preparation of a “minimal” human Ad vector in International Patent Application WO96/13597, published May 9, 1996, and incorporated herein by reference.

1. Helper Viruses

Thus, depending upon the simian adenovirus gene content of the viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus fragment may be necessary to provide sufficient simian adenovirus gene sequences necessary to produce an infective recombinant viral particle containing the minigene. Useful helper viruses contain selected adenovirus gene sequences not present in the adenovirus vector construct and/or not expressed by the packaging cell line in which the vector is transfected. In one embodiment, the helper virus is replication-defective and contains a variety of adenovirus genes in addition to the sequences described above. Such a helper virus is desirably used in combination with an E1-expressing cell line.

Helper viruses may also be formed into poly-cation conjugates as described in Wu et al, J. Biol. Chem., 374:16985-16987 (1989); K. J. Fisher and J. M. Wilson, Biochem. J., 299:49 (Apr. 1, 1994). Helper virus may optionally contain a second reporter minigene. A number of such reporter genes are known to the art. The presence of a reporter gene on the helper virus which is different from the transgene on the adenovirus vector allows both the Ad vector and the helper virus to be independently monitored. This second reporter is used to enable separation between the resulting recombinant virus and the helper virus upon purification.

2. Complementation Cell Lines

To generate recombinant simian adenoviruses (Ad) deleted in any of the genes described above, the function of the deleted gene region, if essential to the replication and infectivity of the virus, must be supplied to the recombinant virus by a helper virus or cell line, i.e., a complementation or packaging cell line. In many circumstances, a cell line expressing the human E1 can be used to transcomplement the chimp Ad vector. This is particularly advantageous because, due to the diversity between the chimp Ad sequences of the invention and the human AdE1 sequences found in currently available packaging cells, the use of the current human E1-containing cells prevents the generation of replication-competent adenoviruses during the replication and production process. However, in certain circumstances, it will be desirable to utilize a cell line which expresses the E1 gene products that can be utilized for production of an E1-deleted simian adenovirus. Such cell lines have been described. See, e.g., U.S. Pat. No. 6,083,716.

If desired, one may utilize the sequences provided herein to generate a packaging cell or cell line that expresses, at a minimum, the adenovirus E1 gene from SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 under the transcriptional control of a promoter for expression in a selected parent cell line. Inducible or constitutive promoters may be employed for this purpose. Examples of such promoters are described in detail elsewhere in this specification. A parent cell is selected for the generation of a novel cell line expressing any desired SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 gene. Without limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL 185], HEK 293, KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among others. These cell lines are all available from the American Type Culture Collection, 10801 University Boulevard, Manassas, Va. 20110-2209. Other suitable parent cell lines may be obtained from other sources.

Such E1-expressing cell lines are useful in the generation of recombinant simian adenovirus E1 deleted vectors. Additionally, or alternatively, cell lines that express one or more simian adenoviral gene products, e.g., E1a, E1b, E2a, and/or E4 ORF6, can be constructed using essentially the same procedures are used in the generation of recombinant simian viral vectors. Such cell lines can be utilized to transcomplement adenovirus vectors deleted in the essential genes that encode those products, or to provide helper functions necessary for packaging of a helper-dependent virus (e.g., adeno-associated virus). The preparation of a host cell involves techniques such as assembly of selected DNA sequences. This assembly may be accomplished utilizing conventional techniques. Such techniques include cDNA and genomic cloning, which are well known and are described in Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and any other suitable methods which provide the desired nucleotide sequence.

In still another alternative, the essential adenoviral gene products are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a suitable host cell can be selected from any biological organism, including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and mammalian cells. Particularly desirable host cells are selected from among any mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, 10T1/2, HEK 293 cells or PERC6 (both of which express functional adenoviral E1) [Fallaux, F J et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L cells, HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals including human, monkey, mouse, rat, rabbit, and hamster. The selection of the mammalian species providing the cells is not a limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.

3. Assembly of Viral Particle and Transfection of a Cell Line

Generally, when delivering the vector comprising the minigene by transfection, the vector is delivered in an amount from about 5 μg to about 100 μg DNA, and preferably about 10 to about 50 μg DNA to about 1×104 cells to about 1×1013 cells, and preferably about 105 cells. However, the relative amounts of vector DNA to host cells may be adjusted, taking into consideration such factors as the selected vector, the delivery method and the host cells selected.

The vector may be any vector known in the art or disclosed above, including naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc. Introduction into the host cell of the vector may be achieved by any means known in the art or as disclosed above, including transfection, and infection. One or more of the adenoviral genes may be stably integrated into the genome of the host cell, stably expressed as episomes, or expressed transiently. The gene products may all be expressed transiently, on an episome or stably integrated, or some of the gene products may be expressed stably while others are expressed transiently. Furthermore, the promoters for each of the adenoviral genes may be selected independently from a constitutive promoter, an inducible promoter or a native adenoviral promoter. The promoters may be regulated by a specific physiological state of the organism or cell (i.e., by the differentiation state or in replicating or quiescent cells) or by exogenously-added factors, for example.

Introduction of the molecules (as plasmids or viruses) into the host cell may also be accomplished using techniques known to the skilled artisan and as discussed throughout the specification. In preferred embodiment, standard transfection techniques are used, e.g., CaPO4 transfection or electroporation.

Assembly of the selected DNA sequences of the adenovirus (as well as the transgene and other vector elements into various intermediate plasmids, and the use of the plasmids and vectors to produce a recombinant viral particle are all achieved using conventional techniques. Such techniques include conventional cloning techniques of cDNA such as those described in texts [Sambrook et al, cited above], use of overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and any suitable method which provides the desired nucleotide sequence. Standard transfection and co-transfection techniques are employed, e.g., CaPO4 precipitation techniques. Other conventional methods employed include homologous recombination of the viral genomes, plaquing of viruses in agar overlay, methods of measuring signal generation, and the like.

For example, following the construction and assembly of the desired minigene-containing viral vector, the vector is transfected in vitro in the presence of a helper virus into the packaging cell line. Homologous recombination occurs between the helper and the vector sequences, which permits the adenovirus-transgene sequences in the vector to be replicated and packaged into virion capsids, resulting in the recombinant viral vector particles. The current method for producing such virus particles is transfection-based. However, the invention is not limited to such methods.

The resulting recombinant simian adenoviruses are useful in transferring a selected transgene to a selected cell. In in vivo experiments with the recombinant virus grown in the packaging cell lines, the E1-deleted recombinant simian adenoviral vectors of the invention demonstrate utility in transferring a transgene to a non-simian, preferably a human, cell.

IV. Use of the Recombinant Adenovirus Vectors

The recombinant simian adenovirus A1321 (SAdV-A1321)-, SAdV-A1325-, SAdV-A1295-, SAdV-A1309-, SAdV-A1316-, or SAdV-A1322-based vectors are useful for gene transfer to a human or non-simian veterinary patient in vitro, ex vivo, and in vivo.

The recombinant adenovirus vectors described herein can be used as expression vectors for the production of the products encoded by the heterologous genes in vitro. For example, the recombinant adenoviruses containing a gene inserted into the location of an E1 deletion may be transfected into an E1-expressing cell line as described above. Alternatively, replication-competent adenoviruses may be used in another selected cell line. The transfected cells are then cultured in the conventional manner, allowing the recombinant adenovirus to express the gene product from the promoter. The gene product may then be recovered from the culture medium by known conventional methods of protein isolation and recovery from culture.

A SAdV-A1321-, SAdV-A1325-, SAdV-A1295-, SAdV-A1309-, SAdV-A1316-, or SAdV-A1322-derived recombinant simian adenoviral vector provides an efficient gene transfer vehicle that can deliver a selected transgene to a selected host cell in vivo or ex vivo even where the organism has neutralizing antibodies to one or more AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex vivo; the infected cells are cultured using conventional methodologies; and the transduced cells are re-infused into the patient. These compositions are particularly well suited to gene delivery for therapeutic purposes and for immunization, including inducing protective immunity.

More commonly, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 recombinant adenoviral vectors will be utilized for delivery of therapeutic or immunogenic molecules, as described below. It will be readily understood for both applications, that the recombinant adenoviral vectors of the invention are particularly well suited for use in regimens involving repeat delivery of recombinant adenoviral vectors. Such regimens typically involve delivery of a series of viral vectors in which the viral capsids are alternated. The viral capsids may be changed for each subsequent administration, or after a pre-selected number of administrations of a particular serotype capsid (e.g., one, two, three, four or more). Thus, a regimen may involve delivery of a rAd with a first simian capsid, delivery with a rAd with a second simian capsid, and delivery with a third simian capsid. A variety of other regimens which use the Ad capsids of the invention alone, in combination with one another, or in combination with other adenoviruses (which are preferably immunologically non-crossreactive) will be apparent to those of skill in the art. Optionally, such a regimen may involve administration of rAd with capsids of other non-human primate adenoviruses, human adenoviruses, or artificial sequences such as are described herein. Each phase of the regimen may involve administration of a series of injections (or other delivery routes) with a single Ad capsid followed by a series with another capsid from a different Ad source. Alternatively, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors may be utilized in regimens involving other non-adenoviral-mediated delivery systems, including other viral systems, non-viral delivery systems, protein, peptides, and other biologically active molecules.

The following sections will focus on exemplary molecules which may be delivered via the adenoviral vectors of the invention.

A. Ad-Mediated Delivery of Therapeutic Molecules

In one embodiment, the above-described recombinant vectors are administered to humans according to published methods for gene therapy. A simian adenoviral vector bearing the selected transgene may be administered to a patient, preferably suspended in a biologically compatible solution or pharmaceutically acceptable delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions known to be pharmaceutically acceptable carriers and well known to those of skill in the art may be employed for this purpose.

The simian adenoviral vectors are administered in sufficient amounts to transduce the target cells and to provide sufficient levels of gene transfer and expression to provide a therapeutic benefit without undue adverse or with medically acceptable physiological effects, which can be determined by those skilled in the medical arts. Conventional and pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to the retina and other intraocular delivery methods, direct delivery to the liver, inhalation, intranasal, intravenous, intramuscular, intratracheal, subcutaneous, intradermal, rectal, oral and other parenteral routes of administration. Routes of administration may be combined, if desired, or adjusted depending upon the transgene or the condition. The route of administration primarily will depend on the nature of the condition being treated.

Dosages of the viral vector will depend primarily on factors such as the condition being treated, the age, weight and health of the patient, and may thus vary among patients. For example, a therapeutically effective adult human or veterinary dosage of the viral vector is generally in the range of from about 100 μL to about 100 mL of a carrier containing concentrations of from about 1×106 to about 1×1015 particles, about 1×1011 to 1×1013 particles, or about 1×109 to 1×1012 particles virus. Dosages will range depending upon the size of the animal and the route of administration. For example, a suitable human or veterinary dosage (for about an 80 kg animal) for intramuscular injection is in the range of about 1×109 to about 5×1012 particles per mL, for a single site. Optionally, multiple sites of administration may be delivered. In another example, a suitable human or veterinary dosage may be in the range of about 1×1011 to about 1×1015 particles for an oral formulation. One of skill in the art may adjust these doses, depending on the route of administration and the therapeutic or vaccinal application for which the recombinant vector is employed. The levels of expression of the transgene, or for an immunogen, the level of circulating antibody, can be monitored to determine the frequency of dosage administration. Yet other methods for determining the timing of frequency of administration will be readily apparent to one of skill in the art.

An optional method step involves the co-administration to the patient, either concurrently with, or before or after administration of the viral vector, of a suitable amount of a short acting immune modulator. The selected immune modulator is defined herein as an agent capable of inhibiting the formation of neutralizing antibodies directed against the recombinant vector of this invention or capable of inhibiting cytolytic T lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with the interactions between the T helper subsets (TH1 or TH2) and B cells to inhibit neutralizing antibody formation. Alternatively, the immune modulator may inhibit the interaction between TH1 cells and CTLs to reduce the occurrence of CTL elimination of the vector. A variety of useful immune modulators and dosages for use of same are disclosed, for example, in Yang et al., J. Vivol., 70(9) (September 1996); International Patent Application No. WO96/12406, published May 2, 1996; and International Patent Application No. PCT/US96/03035, all incorporated herein by reference.

1. Therapeutic Transgenes

Useful therapeutic products encoded by the transgene include hormones and growth and differentiation factors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor (TGF), platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of the transforming growth factor superfamily, including TGF, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase.

Other useful transgene products include proteins that regulate the immune system including, without limitation, cytokines and lymphokines such as thrombopoietin (TPO), interleukins (IL) IL-1 through IL-25 (including, e.g., IL-2, IL-4, IL-12 and IL-18), monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system are also useful in the invention. These include, without limitation, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors, chimeric T cell receptors, single chain T cell receptors, class I and class II MHC molecules, as well as engineered immunoglobulins and MHC molecules. Useful gene products also include complement regulatory proteins such as complement regulatory proteins, membrane cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59.

Still other useful gene products include any one of the receptors for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune system proteins. The invention encompasses receptors for cholesterol regulation, including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also encompasses gene products such as members of the steroid hormone receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other nuclear receptors. In addition, useful gene products include transcription factors such as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix proteins.

Other useful gene products include, carbamoyl synthetase I, ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucose-6-phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta-synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence.

Other useful gene products include non-naturally occurring polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid sequence containing insertions, deletions or amino acid substitutions. For example, single-chain engineered immunoglobulins could be useful in certain immunocompromised patients. Other types of non-naturally occurring gene sequences include antisense molecules and catalytic nucleic acids, such as ribozymes, which could be used to reduce overexpression of a target.

Reduction and/or modulation of expression of a gene are particularly desirable for treatment of hyperproliferative conditions characterized by hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those polypeptides which are produced exclusively or at higher levels in hyperproliferative cells as compared to normal cells. Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene products as target antigens, target polypeptides for anti-cancer treatments and protective regimens include variable regions of antibodies made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used as target antigens for autoimmune disease. Other tumor-associated polypeptides can be used as target polypeptides such as polypeptides which are found at higher levels in tumor cells including the polypeptide recognized by monoclonal antibody 17-1A and folate binding polypeptides.

Other suitable therapeutic polypeptides and proteins include those which may be useful for treating individuals suffering from autoimmune diseases and disorders by conferring a broad based protective immune response against targets that are associated with autoimmunity including cell receptors and cells which produce self-directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis (RA), multiple sclerosis (MS), Sjögren's syndrome, sarcoidosis, insulin dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the inflammatory cascade associated with autoimmune diseases.

The simian adenoviral vectors of the invention are particularly well suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of transgenes is desired, e.g., in regimens involving redelivery of the same transgene or in combination regimens involving delivery of other transgenes. Such regimens may involve administration of a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 simian adenoviral vector, followed by re-administration with a vector from the same serotype adenovirus. Particularly desirable regimens involve administration of a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 simian adenoviral vector, in which the source of the adenoviral capsid sequences of the vector delivered in the first administration differs from the source of adenoviral capsid sequences of the viral vector utilized in one or more of the subsequent administrations. For example, a therapeutic regimen involves administration of a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vector and repeat administration with one or more adenoviral vectors of the same or different serotypes. In another example, a therapeutic regimen involves administration of an adenoviral vector followed by repeat administration with a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vector which has a capsid which differs from the source of the capsid in the first delivered adenoviral vector, and optionally further administration with another vector which is the same or, preferably, differs from the source of the adenoviral capsid of the vector in the prior administration steps. These regimens are not limited to delivery of adenoviral vectors constructed using the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 simian sequences. Rather, these regimens can readily utilize other adenoviral sequences, including, without limitation, other simian adenoviral sequences, (e.g., Pan9 or C68, C1, etc), other non-human primate adenoviral sequences, or human adenoviral sequences, in combination with the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors. Examples of such simian, other non-human primate and human adenoviral serotypes are discussed elsewhere in this document. Further, these therapeutic regimens may involve either simultaneous or sequential delivery of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 adenoviral vectors in combination with non-adenoviral vectors, non-viral vectors, and/or a variety of other therapeutically useful compounds or molecules. The invention is not limited to these therapeutic regimens, a variety of which will be readily apparent to one of skill in the art.

B. Ad-Mediated Delivery of Immunogenic Transgenes

The recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors may also be employed as immunogenic compositions. As used herein, an immunogenic composition is a composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) response is mounted to a transgene product delivered by the immunogenic composition following delivery to a mammal, and preferably a primate. A recombinant simian Ad can contain in any of its adenovirus sequence deletions a gene encoding a desired immunogen. The simian adenovirus is likely to be better suited for use as a live recombinant virus vaccine in different animal species compared to an adenovirus of human origin, but is not limited to such a use. The recombinant adenoviruses can be used as prophylactic or therapeutic vaccines against any pathogen for which the antigen(s) crucial for induction of an immune response and able to limit the spread of the pathogen has been identified and for which the cDNA is available.

Such vaccinal (or other immunogenic) compositions are formulated in a suitable delivery vehicle, as described above. Generally, doses for the immunogenic compositions are in the range defined above for therapeutic compositions. The levels of immunity of the selected gene can be monitored to determine the need, if any, for boosters. Following an assessment of antibody titers in the serum, optional booster immunizations may be desired.

Optionally, a vaccinal composition of the invention may be formulated to contain other components, including, e.g., adjuvants, stabilizers, pH adjusters, preservatives and the like. Such components are well known to those of skill in the vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum, monophosphoryl lipid A, and any biologically active factor, such as cytokine, an interleukin, a chemokine, a ligands, and optimally combinations thereof. Certain of these biologically active factors can be expressed in vivo, e.g., via a plasmid or viral vector. For example, such an adjuvant can be administered with a priming DNA vaccine encoding an antigen to enhance the antigen-specific immune response compared with the immune response generated upon priming with a DNA vaccine encoding the antigen only.

The recombinant adenoviruses are administered in a “an immunogenic amount”, that is, an amount of recombinant adenovirus that is effective in a route of administration to transfect the desired cells and provide sufficient levels of expression of the selected gene to induce an immune response. Where protective immunity is provided, the recombinant adenoviruses are considered to be vaccine compositions useful in preventing infection and/or recurrent disease.

Alternatively, or in addition, the vectors of the invention may contain a transgene encoding a peptide, polypeptide or protein which induces an immune response to a selected immunogen. The recombinant SAdV vectors described herein are expected to be highly efficacious at inducing cytolytic T cells and antibodies to the inserted heterologous antigenic protein expressed by the vector.

For example, immunogens may be selected from a variety of viral families. Example of viral families against which an immune response would be desirable include, the picornavirus family, which includes the genera rhinoviruses, which are responsible for about 50% of cases of the common cold; the genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are responsible for foot and mouth diseases, primarily in non-human animals. Within the picornavirus family of viruses, target antigens include the VP1, VP2, VP3, VP4, and VPG. Another viral family includes the calcivirus family, which encompasses the Norwalk group of viruses, which are an important causative agent of epidemic gastroenteritis. Still another viral family desirable for use in targeting antigens for inducing immune responses in humans and non-human animals is the togavirus family, which includes the genera alphavirus, which include Sindbis viruses, RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other target antigens may be generated from the Hepatitis C or the coronavirus family, which includes a number of non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus (dog), and human respiratory coronaviruses, which may cause the common cold and/or non-A, B or C hepatitis. Within the coronavirus family, target antigens include the E1 (also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N (nucleocapsid). Still other antigens may be targeted against the rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies).

Within the rhabdovirus family, suitable antigens may be derived from the G protein or the N protein. The family filoviridae, which includes hemorrhagic fever viruses such as Marburg and Ebola virus, may be a suitable source of antigens. The paramyxovirus family includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes measles and canine distemper, and pneumovirus, which includes respiratory syncytial virus. The influenza virus is classified within the family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the N1 protein). The bunyavirus family includes the genera bunyavirus (California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The arenavirus family provides a source of antigens against LCM and Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine encephalosis, blue tongue).

The retrovirus family includes the sub-family oncorivirinal which encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious anemia virus, and spumavirinal). Among the lentiviruses, many suitable antigens have been described and can readily be selected. Examples of suitable HIV and SIV antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins, as well as various fragments thereof. For example, suitable fragments of the Env protein may include any of its subunits such as the gp120, gp160, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the that protein may be selected. [See, U.S. Pat. No. 5,891,994 and U.S. Pat. No. 6,193,981.] See, also, the HIV and SIV proteins described in D. H. Barouch et al, J. Virol., 75(5):2462-2467 (March 2001), and R. R. Amara, et al, Science, 292:69-74 (6 Apr. 2001). In another example, the HIV and/or SIV immunogenic proteins or peptides may be used to form fusion proteins or other immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef fusion proteins and immunization regimens described in WO 01/54719, published Aug. 2, 2001, and WO 99/16884, published Apr. 8, 1999. The invention is not limited to the HIV and/or SIV immunogenic proteins or peptides described herein. In addition, a variety of modifications to these proteins has been described or could readily be made by one of skill in the art. See, e.g., the modified gag protein that is described in U.S. Pat. No. 5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or in combination. Such combinations may include expression from a single vector or from multiple vectors. Optionally, another combination may involve delivery of one or more expressed immunogens with delivery of one or more of the immunogens in protein form. Such combinations are discussed in more detail below.

The papovavirus family includes the sub-family polyomaviruses (BKU and JCU viruses) and the sub-family papillomavirus (associated with cancers or malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which may be suitable source of antigens is the Hepatitis delta virus. Still other viral sources may include avian infectious bursal disease virus and porcine respiratory and reproductive syndrome virus. The alphavirus family includes equine arteritis virus and various Encephalitis viruses.

Immunogens which are useful to immunize a human or non-human animal against other pathogens include, e.g., bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens include pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci. Pathogenic gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H. ducreyi (which causes chancroid); brucella; Franisella tularensis (which causes tularemia); yersinia (pasteurella); streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria (diphtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis. Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism; other clostridia; tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis. Other infections caused by higher pathogen bacteria and pathogenic fungi include actinomycosis; nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis; sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox. Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. Pathogenic eukaryotes encompass pathogenic protozoa and helminthes and infections produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and cestode (tapeworm) infections.

Many of these organisms and/or toxins produced thereby have been identified by the Centers for Disease Control [(CDC), Department of Heath and Human Services, USA], as agents which have potential for use in biological attacks. For example, some of these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum and its toxin (botulism), Yersinia pestis (plague), variola major (smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers [filoviruses (e.g., Ebola, Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of which are currently classified as Category A agents; Coxiella burnetti (Q fever); Brucella species (brucellosis), Burkholderia mallei (glanders), Burkholderia pseudomallei (meloidosis), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and its toxin (epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), Chlamydia psittaci (psittacosis), water safety threats (e.g., Vibrio cholerae, Crytosporidium parvum), Typhus fever (Richettsia powazekii), and viral encephalitis (alphaviruses, e.g., Venezuelan equine encephalitis; eastern equine encephalitis; western equine encephalitis); all of which are currently classified as Category B agents; and Nipan virus and hantaviruses, which are currently classified as Category C agents. In addition, other organisms, which are so classified or differently classified, may be identified and/or used for such a purpose in the future. It will be readily understood that the viral vectors and other constructs described herein are useful to deliver antigens from these organisms, viruses, their toxins or other by-products, which will prevent and/or treat infection or other adverse reactions with these biological agents.

Administration of the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors to deliver immunogens against the variable region of the T cells are anticipated to elicit an immune response including CTLs to eliminate those T cells. In RA, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-3, V-14, V-17 and Vα-17. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in RA. In MS, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-7 and Vα-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in MS. In scleroderma, several specific variable regions of TCRs which are involved in the disease have been characterized. These TCRs include V-6, V-8, V-14 and Vα-16, Vα-3C, Vα-7, Vα-14, Vα-15, Vα-16, Vα-28 and Vα-12. Thus, delivery of a recombinant simian adenovirus that encodes at least one of these polypeptides will elicit an immune response that will target T cells involved in scleroderma.

C. Ad-Mediated Delivery Methods

The therapeutic levels, or levels of immunity, of the selected gene can be monitored to determine the need, if any, for boosters. Following an assessment of CD8+ T cell response, or optionally, antibody titers, in the serum, optional booster immunizations may be desired. Optionally, the recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors may be delivered in a single administration or in various combination regimens, e.g., in combination with a regimen or course of treatment involving other active ingredients or in a prime-boost regimen. A variety of such regimens has been described in the art and may be readily selected.

For example, prime-boost regimens may involve the administration of a DNA (e.g., plasmid) based vector to prime the immune system to second, booster, administration with a traditional antigen, such as a protein or a recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO 00/11140, published Mar. 2, 2000, incorporated by reference. Alternatively, an immunization regimen may involve the administration of a recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vector to boost the immune response to a vector (either viral or DNA-based) carrying an antigen, or a protein. In still another alternative, an immunization regimen involves administration of a protein followed by booster with a vector encoding the antigen.

In one embodiment, a method of priming and boosting an immune response to a selected antigen by delivering a plasmid DNA vector carrying said antigen, followed by boosting with a recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vector is described. In one embodiment, the prime-boost regimen involves the expression of multiproteins from the prime and/or the boost vehicle. See, e.g., R. R. Amara, Science, 292:69-74 (6 Apr. 2001) which describes a multiprotein regimen for expression of protein subunits useful for generating an immune response against HIV and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env is delivered in a recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 adenovirus construct. Still other regimens are described in WO 99/16884 and WO 01/54719.

However, the prime-boost regimens are not limited to immunization for HIV or to delivery of these antigens. For example, priming may involve delivering with a first SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vector followed by boosting with a second Ad vector, or with a composition containing the antigen itself in protein form. In one example, the prime-boost regimen can provide a protective immune response to the virus, bacteria or other organism from which the antigen is derived. In another embodiment, the prime-boost regimen provides a therapeutic effect that can be measured using convention assays for detection of the presence of the condition for which therapy is being administered.

The priming composition may be administered at various sites in the body in a dose dependent manner, which depends on the antigen to which the desired immune response is being targeted. The amount or situs of injection(s) or to pharmaceutical carrier is not a limitation. Rather, the regimen may involve a priming and/or boosting step, each of which may include a single dose or dosage that is administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals may receive one or two doses containing between about 10 μg to about 50 μg of plasmid in carrier. A desirable amount of a DNA composition ranges between about 1 μg to about 10,000 μg of the DNA vector. Dosages may vary from about 1 μg to 1000 μg DNA per kg of subject body weight. The amount or site of delivery is desirably selected based upon the identity and condition of the mammal.

The dosage unit of the vector suitable for delivery of the antigen to the mammal is described herein. The vector is prepared for administration by being suspended or dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic saline; isotonic salts solution or other formulations that will be apparent to those skilled in such administration. The appropriate carrier will be evident to those skilled in the art and will depend in large part upon the route of administration. The compositions described herein may be administered to a mammal according to the routes described above, in a sustained release formulation using a biodegradable biocompatible polymer, or by on-site delivery using micelles, gels and liposomes. Optionally, the priming step also includes administering with the priming composition, a suitable amount of an adjuvant, such as are defined herein.

Preferably, a boosting composition is administered about 2 to about 27 weeks after administering the priming composition to the mammalian subject. The administration of the boosting composition is accomplished using an effective amount of a boosting composition containing or capable of delivering the same antigen as administered by the priming DNA vaccine. The boosting composition may be composed of a recombinant viral vector derived from the same viral source (e.g., adenoviral sequences of the invention) or from another source. Alternatively, the “boosting composition” can be a composition containing the same antigen as encoded in the priming DNA vaccine, but in the form of a protein or peptide, which composition induces an immune response in the host. In another embodiment, the boosting composition contains a DNA sequence encoding the antigen under the control of a regulatory sequence directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or viral vectors. The primary requirements of the boosting composition are that the antigen of the composition is the same antigen, or a crossreactive antigen, as that encoded by the priming composition.

In another embodiment, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors are also well suited for use in a variety of other immunization and therapeutic regimens. Such regimens may involve delivery of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors simultaneously or sequentially with Ad vectors of different serotype capsids, regimens in which SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors are delivered simultaneously or sequentially with non-Ad vectors, regimens in which the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors are delivered simultaneously or sequentially with proteins, peptides, and/or other biologically useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one of skill in the art.

In still another embodiment, the invention provides the use of capsid of these viruses (optionally an intact or recombinant viral particle or an empty capsid is used) to induce an immunomodulatory effect response, or to enhance or adjuvant a cytotoxic T cell response to another active agent by delivering an adenovirus SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid to a subject. The SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid can be delivered alone or in a combination regimen with an active agent to enhance the immune response thereto. Advantageously, the desired effect can be accomplished without infecting the host with a subgroup E adenovirus. In another aspect, a method of inducing interferon alpha production in a subject in need thereof comprising delivering the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid to a subject is provided. In still another aspect, a method for producing one or more cytokines (e.g., IFN-α)/chemokines in culture is provided. This method involves incubating a culture containing dendritic cells and the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid described herein under conditions suitable to produce cytokines/chemokines, including, alpha interferon, among others.

The cytokines so produced are useful in a variety of applications. For example, in the case of IFNα, the production described herein is particularly desirable, as it is believed that it will provide advantages over commercially available recombinantly produced IFNα, which contain only one or two subtypes of IFNα produced in bacteria. In contrast, the method is anticipated to produce multiple subtypes of natural human IFNα, which is expected to result in a broader spectrum of action. It is believed that each subtype employs a specific biological activity. Further, it is anticipated that the natural interferon produced by the method provided herein will be immunologically indistinguishable from the patient's naturally produced interferon, thereby reducing the risk of the drug being rejected by the subject's immune system, usually caused by the formation of neutralizing antibodies against recombinantly produced interferons.

Other cytokines produced by the subgroup E adenoviruses include, interleukin (IL)-6, IL-8, IP-10, macrophage inflammatory protein-1 alpha (MIP-1α), RANTES, and tumor necrosis factor alpha. Methods of purifying these cytokines/chemokines from culture and therapeutic or adjuvant uses of these cytokines/chemokines have been described in the literature. Further, commercially available columns or kits may used for purification of the cytokines/chemokines prepared according to the invention. The cytokines/chemokines produced using the invention may be formulated for use in a variety of indications.

For example, cytokines described herein include, interferon alpha (IFNα), tumor necrosis factor alpha (TNFα), IP-10 (Interferon gamma Inducible Protein), interleukin-6 (IL-6), and IL-8. IFNα, has been described as being useful in treatment of influenza, hepatitis (including, e.g., hepatitis B and C), and a variety of neoplasms, e.g., kidney (renal cell carcinoma), melanoma, malignant tumor, multiple myeloma, carcinoid tumor, lymphoma and leukemia (e.g., chronic myelogenous leukemia and hairy cell leukemia). A mixture of IFNα subtypes produced as described herein can be purified using known techniques. See, e.g., WO 2006/085092, which describes the use of monoclonal antibodies and column purification. Other techniques have been described in the literature. IFNα produced as described herein can be purified using known methods. See, e.g., U.S. Pat. No. 4,680,260, U.S. Pat. No. 4,732,683, and G. Allen, Biochem J., 207:397-408 (1982). TNFα has been described as being useful in treatment in autoimmune disorders including, e.g., psoriasis and rheumatoid arthritis. IP-10, Interferon gamma Inducible Protein, can be used as a potent inhibitor of angiogenesis and to have a potent thymus-dependent anti-tumor effect.

A method for producing IFNα by incubating a culture containing dendritic cells and a SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid under conditions suitable to produce cytokines is provided. In one embodiment, blood is drawn from healthy donors (preferably human) and peripheral blood leukocytes (PBL) or peripheral blood mononuclear cells (PBMC) are prepared using known techniques. In one embodiment, PBL are used as the cytokine-producing cells according to the method of the invention. In another embodiment, PBMC are used as the cytokine-producing cells. In another embodiment, plasmacytoid dendritic cells are isolated from the PBL or PBMC using known techniques, e.g., using the commercially available kit “human plasmacytoid dendritic cell isolation kit” by Miltenyi Biotec GmbH (Germany) The selected cells are cultured in suspension with an appropriate media and the adenovirus subgroup E capsid protein. Appropriate media can be readily determined by one of skill in the art. However, in one embodiment, the media is a RPMI-1640 medium. Alternatively, other media may be readily selected. The cells may be cultured in a suitable vessel, e.g., a microtiter well, a flask, or a larger vessel. In one embodiment, the concentration of the cells is about 1 million cells/mL culture media. However, other suitable cell concentrations may be readily determined by one of skill in the art. The invention does not require the use of interferons as primers. However, if desired, the media may include a suitable cytokine, IL-3, in order to stimulate cell growth. One suitable concentration is about 20 ng/mL. However, other concentrations may be used. In one embodiment, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid protein is introduced into the culture containing the cells. The adenovirus capsid protein can be delivered to the culture in any of the forms described herein (e.g., a viral particle, including an empty capsid particle, a viral vector having an SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid, and the like). Typically the capsid protein will be suspended in a suitable carrier, e.g., culture media, saline, or the like. Suitably, the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid is added to the culture in an amount of about 100 to 100,000 adenovirus subgroup E particles per cell. The mixture is then incubated, e.g., in the range of about 28° C. to about 40° C., in the range from about 35° C. to about 37° C., or about 37° C. Typically, approximately 12 to 96 hours, or about 48 hours later, cells are spun down and the supernatant is collected. Suitably, this is performed under conditions which avoid cell lysis, thereby reducing or eliminating the presence of cellular debris in the supernatant. Centrifugation permits separation of the cytokines from the cells, thereby providing a crudely isolated cytokine. Sizing columns, and other known columns and methods are available for further purification of cytokines from adenoviruses and adenoviral capsids, and the like. These cytokines, so purified, are available for formulation and use in a variety of applications.

In one embodiment, an empty SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 particle (i.e., an adenoviral capsid having no DNA packaged therein which expresses any adenoviral or transgene product) may be delivered to the cells. In another embodiment, a non-infectious wild-type SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 particle or a recombinant adenoviral vector packaged in an SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 capsid (particle) may be used. Suitable techniques for inactivating such viral particles are known in the art and may include without limitation, e.g., UV irradiation (which effectively cross-links genomic DNA preventing expression).

The following examples describe the cloning of SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 and the construction of exemplary recombinant SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 vectors. These examples are illustrative only, and do not limit the scope of the present invention.

Example 1—Isolation of Simian Adenoviruses

Stool samples were obtained from the chimpanzee colony at the University of Louisiana New Iberia Research Center, 4401 W. Admiral Doyle Drive, New Iberia, La., USA. Filtered supernatants from the stool suspensions were inoculated into cultures of the human cell line A549. After about 1 to 2 weeks in culture, visual cytopathic effect (CPE) was obvious in cell cultures with several of the inocula. The viruses that were isolated by this technique were amplified to a large-scale preparation using A549 cells using the standard adenovirus purification method of cesium chloride gradient banding. DNA from the purified adenoviruses was isolated and completely sequenced by Qiagen Genomics services, Hilden, Germany. Analysis of the complete genomic sequence showed that the isolated virus had a novel sequence that had not been previously reported.

Based on the phylogenetic analysis of the viral DNA sequences, the adenoviruses designated simian adenovirus A1321 (SAdV-A1321), SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322, were determined to be in the same subgroup as subgroup (species) E. Average yields for viral amplification were as follows: A1321 (1.44×1013), SAdV-A1325 (2.73×1013), SAdV-A1295 (1.2×1013), SAdV-A1309 (2.57×1013), SAdV-A1316 (5.58×1013), and SAdV-A1322 (4.71×1013).

Example 2—Vector Construction

An E1 deleted vector using SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 (subgroup E) may be prepared generally as described.

A linker containing SmaI, ClaI, XbaI, SpeI, EcoRV sites flanked by SwaI is cloned into pBR322 cut with EcoRI and NdeI Viral DNA is digested with XbaI and the 6 kb fragments (left and right ends) are gel purified and ligated into pSR5 digested with SmaI and XbaI. 12 minipreps are diagnosed with SmaI and assessed for expected fragment sizes. Minipreps are sequenced to check the integrity of the viral DNA end. The sequence obtained is used to correct the left end Qiagen sequence and deduce the correct right ITR sequence as well.

The plasmid is digested with SnaBI+NdeI and the NdeI site is filled in with Klenow. The EcoRV fragment from pBleuSK I-PI is ligated in. Alternatively the plasmid is digested by SnaBI and NdeI and a double stranded oligonucleotide containing recognition sites for CeuI and PI-SceI is ligated in place of deleted E1 coding regions. Minipreps are diagnosed using PstI. The resulting plasmid is digested with XbaI+EcoRV. The right end (XbaI digest) fragment from the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 is ligated in. Minipreps are diagnosed using ApaLI. The resulting plasmid is then digested with XbaI+EcoRV. The fragment from the SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, or SAdV-A1322 DNA is ligated in and minipreps are diagnosed using MfeI. 293 cells are then transfected using calcium phosphate or lipofectamine methods according to manufacturer's protocol.

Example 3—Assessment of Cross-Neutralizing Antibodies

A. Wild-type SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322 are assessed for cross-neutralizing activity as compared to human Adenovirus 5 (subspecies C) and chimpanzee adenovirus 7 (SAdV-24), and human pooled IgG using an infection inhibition neutralizing antibody assay monitored by direct immunofluorescence. The human pooled IgG [Hu Pooled IgG] is purchased commercially and is approved for administration in immunocompromised patients, as it contains antibodies against a number of antigens to which the general human population is exposed. The presence or absence of neutralizing antibodies to the simian adenoviruses for the human pooled IgG is a reflection of the prevalence of antibodies to these adenoviruses in the general population.

The assay is performed as follows. Serum samples obtained from rabbits previously injected with HAdV-5 or SAdV-24 are heat inactivated at 56° C. for 35 min Wild type adenovirus (108 particles/well) are diluted in serum-free Dulbecco's modified Eagle's medium (DMEM) and incubated with 2-fold serial dilutions of heat-inactivated serum samples in DMEM for 1 h at 37° C. Subsequently, the serum-adenovirus mixture is added to slides in wells with 105 monolayer A549 cells. After 1 hr, the cells in each well are supplemented with 100 μl of 20% fetal bovine serum (FBS)-DMEM and cultured for 22 h at 37° C. in 5% CO2. Next, cells are rinsed twice with PBS and stained with DAPI and a goat, FITC labeled, broadly cross reactive antibody (Virostat) raised against HAdV-5 following fixation in paraformaldehyde (4%, 30 min) and permeabilization in 0.2% Triton (4° C., 20 min) The level of infection is determined by counting the number of FITC positive cells under microscopy. The NAB titer is reported as the highest serum dilution that inhibits adenovirus infection by 50% or more, compared with the naive serum control. Where a titer value of < 1/20 is shown, the neutralizing antibody concentration is under the limit of detection, i.e., 1/20.

B. Wild-type SAdV-A1321, SAdV-A1325, SAdV-A1295, SAdV-A1309, SAdV-A1316, and SAdV-A1322 were assessed for cross-neutralizing activity as compared to human Adenovirus 5 (HAdV-5; subspecies C). The results are shown in Table 3 below. Less than approximately 15% of the population of human samples (n=20) had a neutralizing antibody titer (NAb titer) greater than 200 for the identified adenoviruses, relative to approximately 40% for HAdV-5.


TABLE 3
Human samples
Wild type
IVIG Nab titer
(n = 20) Nab titer
adenovirus
(10 mlg/ml)
Median
Mean
A1295
20
20
40
A1309
80
60
113
A1316
40
80
91
A1321
20
40
40
A1322
40
40
69
A1325
80
80
88
HAdV-5
640
640
1589

Example 4—Vector Construction

A. SAdV-A1321

An E1 deleted SAdV-A1321 vector is prepared by digesting the pSR5 plasmid (SEQ ID NO: 322) with SmaI+XbaI, and the wild-type SAdV-A1321 sequence (SEQ ID NO: 1) with Xba I to produce an ˜6020 bp fragment for incorporation into the plasmid. The resulting plasmid (pS215-1321) is digested with BsiWI+NdeI or with SnaBI+NdeI, and ICeuPISceI meganuclease cassette(s) (SEQ ID NO: 326) cloned therein via SnaIB (BsiWI)+NdeI sites. The resulting plasmid (pS216-1321) is then digested with EcoRV+XbaI, and the ˜30526 bp fragment of wild-type SAdV-A1321 sequence (SEQ ID NO: 1) digested with XbaI is cloned therein resulting in the pS217-A1321 plasmid.

A suitable transgene expression cassette is then introduced into the pS217-A1321 plasmid. The transgene may be, e.g., a reporter such as eGFP, an influenza A nucleoprotein, or HIV-gag (e.g., from pSh-HIV-short-gag (SEQ ID NO: 319)) via the I-CeuI and PI-SceI sites of the meganuclease cassette. Additional transgenes described herein and known in the art may be used consistent with this example and the skill in the art and are contemplated hereby.

A proposed E1 deleted SAdV-A1321 vector containing an HIV-gag transgene is identified in SEQ ID NO: 168.

B. SAdV-A1325

An E1 deleted SAdV-A1325 vector is prepared by digesting the pSR5 plasmid (SEQ ID NO: 322) with SmaI+XbaI, and the wild-type SAdV-A1325 sequence (SEQ ID NO: 28) with Xba I to produce an ˜5728 bp fragment for incorporation into the plasmid. The resulting plasmid (pS226-1325) is digested with BsiWI+NdeI or with SnaBI+NdeI, and ICeuPISceI meganuclease cassette(s) (SEQ ID NO: 326) cloned therein via SnaIB (BsiWI)+NdeI sites. The resulting plasmid (pS227-1325) is then digested with EcoRV+XbaI, and the ˜30507 bp fragment of wild-type SAdV-A1325 sequence (SEQ ID NO: 28) digested with XbaI is cloned therein resulting in the pS228-A1325 plasmid.

A suitable transgene expression cassette is then introduced into the pS228-A1325 plasmid. The transgene may be, e.g., a reporter such as eGFP, an influenza A nucleoprotein, or HIV-gag (e.g., from pSh-HIV-short-gag (SEQ ID NO: 319)) via the I-CeuI and PI-SceI sites of the meganuclease cassette. Additional transgenes described herein and known in the art may be used consistent with this example and the skill in the art and are contemplated hereby.

A proposed E1 deleted SAdV-A1325 vector containing an HIV-gag transgene is identified in SEQ ID NO: 193.

C. SAdV-A1295

An E1 deleted SAdV-A1295 vector is prepared by digesting the pSR5 plasmid (SEQ ID NO: 322) with SmaI+XbaI, and the wild-type SAdV-A1295 sequence (SEQ ID NO: 57) with Xba I to produce an ˜6017 bp fragment for incorporation into the plasmid. The resulting plasmid (pS200-1295) is digested with BsiWI+NdeI and the ends are filled in with Klenow and treated with CIP. The pBleuSK I-PI plasmid (SEQ ID NO: 324) is digested with EcoRV and the EcoRV fragment from pBleuSK I-PI (harboring sites for I-CeuI and PI-SceI) was ligated in. The resulting plasmid (pS201-1295) is then digested with EcoRV+XbaI, and the ˜6521 bp fragment of wild-type SAdV-A1295 sequence (SEQ ID NO: 57) digested with XbaI is cloned therein resulting in the pS202_A1295 plasmid. The pS202-A1295 plasmid is then digested with XbaI and the ˜24105 bp fragment of wild-type SAdV-A1295 sequence (SEQ ID NO: 57) digested with XbaI is cloned therein resulting in the pS203-A1295 plasmid.

A suitable transgene expression cassette is then introduced into the pS203-A1295 plasmid. The transgene may be, e.g., a reporter such as eGFP, an influenza A nucleoprotein, or HIV-gag (e.g., from pSh-HIV-short-gag (SEQ ID NO: 319)) via the I-CeuI and PI-SceI sites of the pBleuSK I-PI plasmid fragment. Additional transgenes described herein and known in the art may be used consistent with this example and the skill in the art and are contemplated hereby.

A proposed E1 deleted SAdV-A1295 vector containing an HIV-gag transgene is identified in SEQ ID NO: 220.

D. SAdV-A1309

An E1 deleted SAdV-A1309 vector is prepared by digesting the pSR5 plasmid (SEQ ID NO: 322) with SmaI+XbaI, and the wild-type SAdV-A1309 sequence (SEQ ID NO: 86) with Xba I to produce an ˜6037 bp fragment for incorporation into the plasmid. The resulting plasmid (pS205-1309) is digested with NsiWI+NedI or with SnaBI+NdeI and ICeuPISceI meganuclease cassette(s) (SEQ ID NO: 326) cloned therein via SnaIB (BsiWI)+NdeI sites. The resulting plasmid (pS206-1309) is then digested with Fse+EcoRV, and the ˜1924 bp Fse-end fragment of wild-type SAdV-A1309 sequence (SEQ ID NO: 86) is cloned therein resulting in the pS207-A1309 plasmid. pS207-A1309 is digested with Fse and the ˜17,731 bp Fse-Fse fragment of wild-type SAdV-A1309 sequence (SEQ ID NO: 86) is cloned therein resulting in the pS208-A1309 plasmid. pS208 is digested with Spe and the ˜24,718 bp Spe-Spe fragment of wild-type SAdV-A1309 sequence (SEQ ID NO: 86) is cloned therein resulting in the pS209-A1309 plasmid.

A suitable transgene expression cassette is then introduced into the pS209-A1309. The transgene may be, e.g., a reporter such as eGFP, an influenza A nucleoprotein, or HIV-gag (e.g., from pSh-HIV-short-gag (SEQ ID NO: 319)) via the I-CeuI and PI-SceI sites of the meganuclease cassette. Additional transgenes described herein and known in the art may be used consistent with this example and the skill in the art and are contemplated hereby.

A proposed E1 deleted SAdV-A1309 vector containing an HIV-gag transgene is identified in SEQ ID NO: 246.

E. SAdV-A1316

An E1 deleted SAdV-A1316 vector is prepared by digesting the pSR7 plasmid (SEQ ID NO: 323) with SnaBI+NheI, and the wild-type SAdV-A1316 sequence (SEQ ID NO: 114) with NheI to produce an ˜3032 bp fragment for incorporation into the plasmid. The resulting plasmid (0210-A1316) is digested with NsiWI+NedI or with SnaBI+NdeI and ICeuPISceI meganuclease cassette(s) (SEQ ID NO: 326) cloned therein via SnaIB (BsiWI)+NdeI sites. The resulting plasmid (pS211-A1316) was digested with NheI+EcoRV and the ˜771 bp Nhe digested fragment of the wild-type SAdV-A1316 sequence (SEQ ID NO: 114) cloned in. Similarly, the resulting plasmid pS212-A1316 is digested with NheI and the ˜32845 bp Nhe digested fragment of the wild-type SAdV-A1316 sequence (SEQ ID NO: 114) cloned in (resulting in pS213-A1316).

A suitable transgene expression cassette is then introduced into the pS213-A1306 plasmid. The transgene may be, e.g., a reporter such as eGFP, an influenza A nucleoprotein, or HIV-gag (e.g., from pSh-HIV-short-gag (SEQ ID NO: 319)) via the I-CeuI and PI-SceI sites of the meganuclease cassette. Additional transgenes described herein and known in the art may be used consistent with this example and the skill in the art and are contemplated hereby.

A proposed E1 deleted SAdV-A1316 vector containing an HIV-gag transgene is identified in SEQ ID NO: 272.

F. SAdV-A1322

A proposed E1 deleted SAdV-A1322 vector containing an HIV-gag transgene is identified in SEQ ID NO: 295, and may be prepared according as indicated above by one of ordinary skill in the art. An [L1 IIIa region is contained at nt 12273-13776, which is not codable within the Sequence Listing.]

Example 5—T-Cell Induction

The protocols contained in Roy, et al. [“Partial protection against H5N1 influenza in mice with a single dose of a chimpanzee adenovirus vector expressing nucleoprotein”, Vaccine 25:6845-6851 (Aug. 6, 2007)], which is herein incorporated by reference, may be utilized to assess T cell induction by the resulting recombinant adenovirus virus.

Example 6—Cytokine Induction

Characterization of cytokine responses to adenoviral vectors described here is performed according to the methods of Lin, et al., J Vivol. 2007 November; 81(21): 11840-11849 (Vaccines Based on Novel Adeno-Associated Virus Vectors Elicit Aberrant CD8+ T-Cell Responses in Mice), and Lin, et al., Hum. Gene Ther. 2008 July; 19(7): 663-669 (Impact of Preexisting Vector Immunity on the Efficacy of Adeno-Associated Virus-Based HIV-1 Gag Vaccines), including Enzyme-linked immunosorbent assay, Interferon-γ enzyme-linked immunospot assay, and Intracellular cytokine staining (ICCS).

Characterization is expected to reflect an advantageous cytokine profile following vector administration.

All documents recited above, the Sequence Listing, and the entirety of International Patent Application No. PCT/US2011/061632 and U.S. Provisional Patent Application Nos. 61/416,467, 61/416,481, 61/416,491, 61/416,499, 61/416,509, and 61/416,515 (all filed Nov. 23, 2010) are incorporated herein by reference. Numerous modifications and variations are included in the scope of the above-identified specification and are expected to be obvious to one of skill in the art. Such modifications and alterations to the compositions and processes, such as selections of different minigenes or selection or dosage of the vectors or immune modulators are believed to be within the scope of the claims appended hereto.


SEQUENCE LISTING
The patent contains a lengthy “Sequence Listing” section. A copy of the “Sequence Listing” is available in electronic form from the USPTO web site (). An electronic copy of the “Sequence Listing” will also be available from the USPTO upon request and payment of the fee set forth in 37 CFR 1.19(b)(3).

<160> NUMBER OF SEQ ID NOS: 326

<140> CURRENT APPLICATION NUMBER: US/15/172,938

<210> SEQ ID NO 1

<211> LENGTH: 36546

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1321

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (1)..(129)

<223> OTHER INFORMATION: ITR

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (1601)..(2155)

<223> OTHER INFORMATION: E1b\19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (3484)..(3909)

<223> OTHER INFORMATION: pIX

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (3974)..(5595)

<223> OTHER INFORMATION: IVa2 complement (3974..5304,5583..5595)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (5077)..(13837)

<223> OTHER INFORMATION: pol complement (5077..8655,13829..13837)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (8457)..(13837)

<223> OTHER INFORMATION: pTP complement (8457..10385,13829..13837)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (12025)..(13800)

<223> OTHER INFORMATION: pIIIa

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (13885)..(15504)

<223> OTHER INFORMATION: penton

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (15511)..(16089)

<223> OTHER INFORMATION: pVII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (16134)..(17147)

<223> OTHER INFORMATION: V

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17174)..(17404)

<223> OTHER INFORMATION: pX

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17439)..(18200)

<223> OTHER INFORMATION: pVI

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (18309)..(21134)

<223> OTHER INFORMATION: hexon

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (21161)..(21781)

<223> OTHER INFORMATION: protease

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (21862)..(23391)

<223> OTHER INFORMATION: DBP complement (21862..23391)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23420)..(25828)

<223> OTHER INFORMATION: 100K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (26453)..(27133)

<223> OTHER INFORMATION: pVIII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27411)..(28046)

<223> OTHER INFORMATION: E3\CR1-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28591)..(29193)

<223> OTHER INFORMATION: E3\CR1-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29209)..(29823)

<223> OTHER INFORMATION: E3\CR1-gamma

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (30748)..(31020)

<223> OTHER INFORMATION: E3\RID-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (31029)..(31457)

<223> OTHER INFORMATION: E3\RID-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (32117)..(33451)

<223> OTHER INFORMATION: fiber

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33550)..(34697)

<223> OTHER INFORMATION: E4\orf6/7 complement (33550..33801,34524..34697)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33801)..(34697)

<223> OTHER INFORMATION: E4\orf6 complement (33801..34697)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34790)..(35131)

<223> OTHER INFORMATION: E4\orf4 complement (34790..35131)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34980)..(35330)

<223> OTHER INFORMATION: E4\orf3 complement (34980..35330)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35330)..(35716)

<223> OTHER INFORMATION: E4\orf2 complement (35330..35716)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35769)..(36140)

<223> OTHER INFORMATION: E4\orf1 complement (35769..36140)

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (36418)..(36546)

<223> OTHER INFORMATION: ITR complement (36418..36546)

<400> SEQUENCE: 1

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60

atttggggat ggggcgttgt gattggacga gggatgagcg accgttaggg gcggggcgag 120

tgacgttttg atgacgtggc cgcgaggagg agccagtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaatttcg cgtttatgac agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtgtttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480

tttaaacctg cgctctccag tcaagaggcc actcttgagt gccagcgaga agagttttct 540

cctccgcgcc gctagtcaga tctacacttt gaaagatgag gcacctgaga aacctgcccg 600

aggagaaaat cctcatcgct tccgggaacg agattctgga actggtggta aatgccatga 660

tgggcgacga ccctccggag ccccccaccc catttgaggc accttcgcta cacgatttgt 720

atgatctgga ggtggatgtg cccgaggacg accccaacga ggaggcggta aatgatttat 780

ttagcgatgc cgcgctgcta gctgccgagg aggcttcgag ctctagctca gacagcgact 840

cctcactgca tacccctaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900

aagagatgga cttgcgctgc tatgaggaat gcttgccccc gagcgatgat gaggacgagc 960

aggcgatcca gaacgcagcg agtcagggaa tgcaagccgc cagagagagc tttgcgctgg 1020

actgcccgcc tctgcccgga cacggctgta agtcttgtga atttcatcgc atgaatactg 1080

gagataaagc tgtgttgtgt gcactttgct atatgagagc ttacaaccat tgtgtttaca 1140

gtaagtgtga ttaagttgaa ctttagaggg aggcagagag cagggtgact gggcgatgac 1200

tggtttattt atgtatatat gttctttata taggtcccgt ctctgacgca gatgatgaga 1260

cccccactac agagtccact tcgtcacccc cagaaattgg cacatctcca cctgagaata 1320

ttgttagacc agttcctgtt agagccactg ggaggagagc agctgtggaa agtttggatg 1380

acttgctaca gggtggggat gaacctttgg acttgtgtac ccggaaacgc cccaggcact 1440

aagtgccaca catgtgtgtt tacttgaggt gatgtcagta tttatagggt gtggagtgca 1500

ataaaaaatg tgttgacttt aagtgcgtgg tttatgactc aggggtgggg actgtgggta 1560

tataagcagg tgcagacctg tgtggttagc tcagagcggc atg gag att tgg acg 1615

Met Glu Ile Trp Thr

1 5

gtc ttg gaa gat ttt cac aag act aga cag ctg cta gag aac gcc tcg 1663

Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu Leu Glu Asn Ala Ser

10 15 20

aac gga gtc tct tac ctg tgg aga ttc tgc ttc ggt ggc gac cta gct 1711

Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Asp Leu Ala

25 30 35

agg cta gtc tac agg gcc aaa cag gat tat agt gaa caa ttt gag gtt 1759

Arg Leu Val Tyr Arg Ala Lys Gln Asp Tyr Ser Glu Gln Phe Glu Val

40 45 50

att ttg aga gag tgt cct ggt ctt ttt gac gct ctt aac ttg ggc cat 1807

Ile Leu Arg Glu Cys Pro Gly Leu Phe Asp Ala Leu Asn Leu Gly His

55 60 65

cag tct cac ttt aac cag agg att tcg aga gcc ctt gac ttt act act 1855

Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala Leu Asp Phe Thr Thr

70 75 80 85

cct ggc aga acc act gca gca gta gcc ttt ttt gct ttt att ctt gac 1903

Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Ile Leu Asp

90 95 100

aaa tgg agt caa gaa acc cat ttc agc agg gat tac cag ctg gat ttc 1951

Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Gln Leu Asp Phe

105 110 115

tta gca gta gct ttg tgg aga aca tgg aag tgc cag cgc ctg aat gca 1999

Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys Gln Arg Leu Asn Ala

120 125 130

atc tcc ggc tac ttg ccg gta cag ccg cta gac act ctg agg atc ctg 2047

Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp Thr Leu Arg Ile Leu

135 140 145

aat ctc cag gag agt ccc agg gca cgc caa cgt cgc cag cag cag cag 2095

Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg Arg Gln Gln Gln Gln

150 155 160 165

cag gag gat caa gaa gag aac ccg aga gcc ggc ctg gac cct ccg gcg 2143

Gln Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Ala

170 175 180

gag gag gag gag tagctgacct gtttcctgaa ctgcgccggg tgctgactag 2195

Glu Glu Glu Glu

185

gtcttcgagt ggtcgggaga gggggattaa gcgggagagg catgatgaga ctaatcatag 2255

aaatgaactg actgtcagtt tgatgagtcg caagcgccca gaaacagtgt ggtggcatga 2315

ggtgcagtcg actggcacag atgaggtgtc agtgatgcat gagagatttt ccctagaaca 2375

agtcaagact tgttggttgg agcctgagga tgattgggag gtagccatca ggaattatgc 2435

caagctggct ctcaggccag acaagaagta caagattact aagctgataa atatcagaaa 2495

tgcctgctac atttcaggga atggggctga agtagagatc tgtctccagg aaagggtggc 2555

tttcagatgc tgcatgatga atatgtaccc gggagtggtg ggcatggatg gggtcacctt 2615

tatgaacatg aggttcaggg gagatgggta taatggcacg gtctttatgg ccaataccaa 2675

gctgacagtt catggctgct ccttctttgg ctttaataac acctgcatcg aggcctgggg 2735

tcaggttggt gtgaggggct gcagtttttc agccaactgg atggggatcg tgggcaggac 2795

caagagcgtg ctgtccgtga agaaatgctt gtttgagagg tgccacctgg gggtgatgag 2855

cgagggtgaa gccagaatcc gccactgcgc ctctaccgag acgggctgct ttgtgctgtg 2915

caagggcaat gctaagatca agcataatat gatctgtgga gcctcggacg agcgcggata 2975

ccagatgctg acctgcgccg gtgggaacag ccatatgctg gccaccgtgc atgtggcttc 3035

ccattcccgc aagccctggc ccgagttcga gcacaatgtc atgaccaggt gcaatatgca 3095

tctggggtcc cgccgaggca tgttcatgcc ctaccagtgc aacctgaatt atgtgaaggt 3155

gctgctggag cccgatgcca tgtccagagt gagcctgacg ggggtgtttg acatgaatgt 3215

ggaggtgtgg aagattctga gatatgatga atccaagacc aggtgccgag cctgcgagtg 3275

cggagggaag catgccaggt tccagcccgt gtgtgtggat gtgacggagg acctgcgacc 3335

cgatcatttg gtgttgtcct gcaccgggac ggagttcggt tccagcgggg aagaatctga 3395

ctagagtgag tagtgttctg gggcggtgga gggcatgcat gagggccaga atgactgaaa 3455

tctgtgcttt tctgtgtgtt gcagcagc atg agc gga agc ggc tcc ttt gag 3507

Met Ser Gly Ser Gly Ser Phe Glu

190

gga ggg gta ttc agc cct tat ctg acg ggg cgt ctc cct tcc tgg gcg 3555

Gly Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu Pro Ser Trp Ala

195 200 205

gga gtg cgt cag aat gtg atg gga tct acg gtg gac ggc cgg ccc gtg 3603

Gly Val Arg Gln Asn Val Met Gly Ser Thr Val Asp Gly Arg Pro Val

210 215 220 225

cag ccc gcg aac tct tca acc ctg acc tat gca acc ctg agc tct tcg 3651

Gln Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr Leu Ser Ser Ser

230 235 240

tcg gtg gac gca gct gcc gcc gca gct gct gca tct gcc gcc agc gcc 3699

Ser Val Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ser Ala

245 250 255

gtg cgc gga atg gcc atg ggc gcc ggc tat tac ggc act ctg gtg gcc 3747

Val Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr Gly Thr Leu Val Ala

260 265 270

aac tcg agt tcc acc aat aat ccc gcc agc ctg aac gag gag aag ctg 3795

Asn Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn Glu Glu Lys Leu

275 280 285

ctg ctg ctg atg gcc cag ctc gag gcc ttg acc cag cgc ctg ggc gag 3843

Leu Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln Arg Leu Gly Glu

290 295 300 305

ctg acc cag cag gtg gct cag ctg cag gag cag acg cgg gcc gcg gtt 3891

Leu Thr Gln Gln Val Ala Gln Leu Gln Glu Gln Thr Arg Ala Ala Val

310 315 320

gcc acg gtg aaa tcc aaa taaaaaatga atcaataaat aaacggagac 3939

Ala Thr Val Lys Ser Lys

325

ggttgttgat tttaacacag agtctgaatc tttatttgat ttttcgcgcg cggtaggccc 3999

tggaccaccg gtctcgatca ttgagcaccc ggtggatctt ttccaggacc cggtagaggt 4059

gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4119

gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4179

catggtgttg cacaatatct ttgaggagga gactgatggc cacgggcagc cctttagtgt 4239

aggtgtttac aaatctgttg agctgggagg gatgcatgcg gggggagatg aggtgcatct 4299

tggcctggat cttgagattg gcgatgttac cgcccagatc ccgcctgggg ttcatgttgt 4359

gcaggaccac cagcacggtg tatccggtgc acttggggaa tttatcatgc aacttggaag 4419

ggaaggcgtg aaagaatttg gcgacgccct tgtgcccgcc caggttttcc atgcattcat 4479

ccatgatgat ggcgatgggc ccgtgggcgg cggcctgggc aaagacgttt cgggggtcgg 4539

acacatcata gttgtggtcc tgggtgagat catcataggc cattttaatg aatttggggc 4599

ggagggtgcc ggactggggg acaaaggtac cctcgatccc gggggcgtag ttcccctcac 4659

agatctgcat ctcccaggct ttgagctcgg agggggggat catgtccacc tgcggggcta 4719

taaagaacac ggtttccggg gcgggggaga tgagctgggc cgaaagcaag ttccggagca 4779

gctgggactt gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt 4839

agttgaggga gagacagctg ccgtcctccc ggaggagggg ggccacctcg ttcatcatct 4899

cgcgcacatg catgttctcg cgcaccagtt ccgccaggag gcgctctccc cccagggata 4959

ggagctcctg gagcgaggcg aagtttttca gcggcttgag tccgtcggcc atgggcattt 5019

tggagagggt ctgttgcaag agttccaagc ggtcccagag ctcggtgatg tgctctacgg 5079

catctcgatc cagcagacct cctcgtttcg cgggttggga cgactgcggg agtagggcac 5139

cagacgatgg gcgtccagcg cagccagggt ccggtccttc cagggccgca gcgtccgcgt 5199

cagggtggtc tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg 5259

cttcaggctc atccggctgg ttgaaaaccg ctcccgatcg gcgccctgcg cgtcggccag 5319

gtagcaattg accatgagtt cgtagttgag cgcctcggcc gcgtggcctt tggcgcggag 5379

cttacctttg gatgtctgcc cacaggcggg acagaggagg gacttgaggg cgtagagctt 5439

gggggcaagg aagacggact cgggggcgta ggcatccgcg ccgcagtggg cgcagacggt 5499

ctcgcactcc acgagccagg tgaggtcggg ctggtcgggg tcaaaaacca gtttcccgcc 5559

gttctttttg atgcgtttct tacctttggt ctccatgagc tcgtgtcccc gctgggtgac 5619

aaagaggctg tccgtgtccc cgtagaccga ctttatgggc cggtcctcga gcggtgtgcc 5679

gcggtcctcc tcgtagagga accccgccca ctccgaaacg aaagcccggg tccaggccag 5739

cacgaaggag gccacgtggg acgggtagcg gtcgttgtcc accagcgggt ccaccttctc 5799

cagggtatgc aaacacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt 5859

gtaggccacg tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc 5919

gtcctcactg tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct 5979

ctcgaaggcg ggcatgacct cggcactcag gttgtcagtt tctagaaacg aggaggattt 6039

gatattgacg gtgccggcgg agatgccttt caagagtccc tcgtccatct ggtcagaaaa 6099

gacgatcttt ttattgtcga gtttggtggc gaaggagccg tagagggcgt tggaaaggag 6159

cttggcgatg gagcgcatgg tctggttctt ttccttgtcg gcgcgctcct tggccgcgat 6219

gttgagttgc acgtactcgc gcgccacgca cttccattcg gggaagacgg tggtcagctc 6279

gtcgggcacg attctgacct gccagccccg attatgcagg gtgatgaggt ccacactggt 6339

ggccacctcg ccgcgcaggg gttcattagt ccagcagagg cgcccgccct tgcgcgagca 6399

gaaggggggc agggggtcga gcatgacctc gtcggggggg tcggcatcga tggtgaagat 6459

gccgggcagg aggtcgggat caaagtagct gatagaagtg gccagatcgt ccagggaagc 6519

ttgccattcg cgcacggcca gcgcgcgctc gtagggactg aggggcatgc cccagggcat 6579

ggggtgggtg agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc 6639

gaggatgccg atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc 6699

atacagctcg tgcgagggtg cgaggagccc cgggcccagg ttggtgcggc tgggcttttc 6759

ggcgcggtag acgatctggc ggaaaatggc atgcgagttg gaggagatgg tgggcctttg 6819

gaagatgttg aagtgggcgt ggggcagtcc gaccgagtcg cggatgaagt gggcgtagga 6879

gtcttgcagc ttggcgacga gctcggcggt gacgaggacg tccagagcgc agtagtcgag 6939

ggtctcctgg atgatgtcat acttgagctg acccttttgt ttccacagct cgcggttgag 6999

aaggaactct tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatcggcacg 7059

gtaagagcct agcatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac 7119

ggggagggcg taggcctggg cggctttgcg cagggaggtg tgcgtgaggg cgaaagtgtc 7179

cctgaccatg actttgagga actggtgctt gaagtcgata tcgtcgcagc ccccctgctc 7239

ccagagttgg aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc 7299

gttgaagagg atcttgcccg cgcggggcat aaagttgcga gtgatgcgga aaggttgggg 7359

cacctcggcc cggttgttga tgacctgggc ggcgagcacg atctcgtcga agccgttgat 7419

gttgtggccc acgatgtaga gttccacgaa ccgtgggcgg cccttgacgt ggggcagctt 7479

cttgagctct tcgtaggtga gctcgtcggg gtcgctgagc ccgtgctgct cgagcgccca 7539

gtcggcgaga tgggggttgg cgcggaggaa agaagtccag agatccacgg ccagggcggt 7599

ttgcaggcgg tcccggtact gacggaactg ctgcccgacg gccatttttt cgggggtgac 7659

gcagtagaag gtgcgggggt ccccgtgcca gcgatcccat ttgagctgga gggcgagatc 7719

gagggcgagc tcgacgagcc ggtcgtcccc ggagagtttc atgaccagca tgaaggggac 7779

gagctgcttg ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag 7839

cctttcggtg cgaggatgcg agccgatggg gaagaactgg atctcctgcc accaattgga 7899

ggaatggctg ttgatgtgat ggaagtagaa atgccgacgg cgcgccgaac actcgtgctt 7959

gtgtttatac aagcggccac agtgctcgca acgctgcacg ggatgcacgt gctgcacgag 8019

ctgtacctga gttcctttga cgaggaattt cagtgggaag tggagtcgtg gcgcctgcat 8079

ctcgtgctgt actacgtcgt ggtggtcggc ctggccctct tctgcctcga tggtggtcat 8139

gctgacgagc ccgcgcggga ggcaggtcca gacctcggcg cgagcgggtc ggagagcgag 8199

gacgagggcg cgcagcccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt 8259

gggcagcggc ggcgcgcggt tgacttgcag gagtttttcc agggcgcgcg ggaggtccag 8319

atggtacttg atctccaccg cgccgttggt ggcgacgtcg atggcttgca gggtcccgtg 8379

cccctgggga gtgaccaccg tcccccgttt cttcttgggc ggctggggcg acgggggcgg 8439

tgtttccatc tcggttagaa gcggcggcga ggacgcgcgc cgggcggcag aggcggctcg 8499

gggcccggag gcaggggcgg caggggcacg tcggcgccgc gcgcgggtag gttctggtac 8559

tgcgcccgga gaagactggc gtgagcgacg acgcgacggt tgacgtcctg gatctgacgc 8619

ctctgggtga aggccacggg acccgtgagt ttgaacctga aagagagttc gacagaatca 8679

atctcggtat cgttgacggc ggcctgccgc aggatctctt gcacgtcgcc cgagttgtcc 8739

tggtaggcga tctcggtcat gaactgctcg atctcctcct cctgaaggtc tccgcggccg 8799

gcgcgctcca cggtggccgc gaggtcgttg gagatgcggc ccatgagctg cgagaaggcg 8859

ttcatgcccg cctcgttcca gacgcggctg tagaccacga cgccctcggg atcgcgggcg 8919

cgcatgacca cctgggcgag gttgagctcc acgtggcgcg tgaagaccgc gtagttgcag 8979

aggcgctggt agaggtagtt gagcgtggtg gcgatgtgct cggtgacgaa gaaatacatg 9039

atccagcggc ggagcggcat ctcgctgacg tcgcccagcg cctccaagcg ttccatggcc 9099

tcgtaaaagt ccacggcgaa gttgaaaaac tgggagttac gcgccgagac ggtcaactcc 9159

tcctccagaa gacggatgag ctcggcgatg gtggcgcgca cctcgcgctc gaaggccccc 9219

gggagttcct cctcttccat ctcttcttcc tcctccacta acatctcttc tacttcctcc 9279

tcaggcggca gtggtggcgg gggagggggc ctgcgtcgcc ggcggcgcac gggcagacgg 9339

tcgatgaagc gctcgatggt ctcgccgcgc cggcgtcgca tggtctcggt gacggcgcgc 9399

ccgtcctcgc ggggccgcag cgtgaagacg ccgccgcgca tctccaggtg gccggggggg 9459

tccccgttgg gcagggagag ggcgctgacg atgcatctta tcaattgccc cgtagggact 9519

ccgcgcaagg acctgagcgt ctcgagatcc acgggatctg aaaaccgttg aacgaaggct 9579

tcgagccagt cgcagtcgca aggtaggctg agcacggttt cttctgccgg gtcatgttgg 9639

ggagcggggc gggcgatgct gctggtgatg aagttgaaat aggcggttct gagacggcgg 9699

atggtggcga ggagcaccag gtctttgggc ccggcttgct ggatgcgcag acggtcggcc 9759

atgccccagg cgtggtcctg acacctggcc aggtccttgt agtagtcctg catgagccgc 9819

tccacgggca cctcctcctc gcccgcgcgg ccgtgcatgc gcgtgagccc gaagccgcgc 9879

tggggctgga cgagcgccag gtcggcgacg acgcgctcgg cgaggatggc ctgctggatc 9939

tgggtgaggg tggtctggaa gtcgtcaaag tcgacgaagc ggtggtaggc tccggtgttg 9999

atggtgtagg agcagttggc catgacggac cagttgacgg tctggtggcc cggacgcacg 10059

agctcgtggt acttgaggcg cgagtaggcg cgcgtgtcga agatgtagtc gttgcaggtg 10119

cgcaccaggt actggtagcc gatgaggaag tgcggcggcg gctggcggta gagcggccat 10179

cgctcggtgg cgggggcgcc gggcgcgagg tcctcgagca tggtgcggtg gtagccgtag 10239

atgtacctgg acatccaggt gatgccggcg gcggtggtgg aggcgcgcgg gaactcgcgg 10299

acgcggttcc agatgttgcg cagcggcagg aagtagttca tggtgggcac ggtctggccc 10359

gtgaggcgcg cgcagtcgtg gatgctctat acgggcaaaa acgaaagcgg tcagcggctc 10419

gactccgtgg cctggaggct aagcgaacgg gttgggctgc gcgtgtaccc cggttcgaat 10479

ctcgaatcag gctggagccg cagctaacgt ggtactggca ctcccgtctc gacccaagcc 10539

tgcaccaacc ctccaggata cggaggcggg tcgttttgca actttttgga ggccggaaat 10599

gagactagta agcgcggaaa gcggccgacc gcgatggctc gctgccgtag tctggagaag 10659

aatcgccagg gttgcgttgc ggtgtgcccc ggttcgaggc cggccggatt ccgcggctaa 10719

cgagggcgtg gctgccccgt cgtttccaag acccctagcc agccgacttc tccagttacg 10779

gagcgagccc ctcttttgtt ttgtttgttt ttgccagatg catcccgtac tgcggcagat 10839

gcgcccccac caccctccac cgcaacaaca gccccctcca cagccggcgc ttctgccccc 10899

gccccagcag cagcagcaac ttccagccac gaccgccgcg gccgccgtga gcgggactgg 10959

acagacttct cagtatgacc agctggcctt ggaagagggc gaggggctgg cgcgcctggg 11019

ggcgtcgtcg ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta 11079

cgtgcccaag cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc 11139

ggcccggttc cacgcggggc gggagctgcg gcgcggcctg gacagaaaga gggtgctgag 11199

ggacgaggat ttcgaggcgg acgagctgac ggggatcagc cccgcgcgtg cgcacgtggc 11259

cgcggccaac ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa 11319

atccttcaac aaccacgtgc gcaccctgat cgcgcgcgag gaggtgaccc tgggcctgat 11379

gcacctgtgg gacctgctgg aggccatcgt gcagaacccc accagcaagc cgctgacggc 11439

gcagctgttc ctggtggtgc aacatagtcg ggacaacgag gcgttcaggg aggcgctgct 11499

gaatatcacc gagcccgagg gccgctggct cctggacctg gtgaacattc tgcagagcat 11559

cgtggtgcag gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt 11619

gctgagtctg ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga 11679

caaggaggtg aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag 11739

cgacgatctg ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagcaggcg 11799

gcgcgagctg agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg 11859

gaccgagggg gagagctact ttgacatggg cgcggacctg cactggcagc ccagccgccg 11919

ggccttggag gcggcggcag gaccctacgt agaagaggtg gacgatgagg tggacgagga 11979

gggcgagtac ctggaagact gatggcgcga ccgtattttt gctag atg caa caa cag 12036

Met Gln Gln Gln

330

cca cct cct gat ccc gcg atg cgg gcg gcg ctg cag agc cag ccg tcc 12084

Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln Ser Gln Pro Ser

335 340 345

ggc att aac tcc tcg gac gat tgg acc cag gcc atg caa cgc atc atg 12132

Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met Gln Arg Ile Met

350 355 360

gcg ctg acg acc cgc aac ccc gaa gcc ttt aga cag cag ccc cag gcc 12180

Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln Gln Pro Gln Ala

365 370 375

aac cgg ctc tcg gcc atc ctg gag gcc gtg gtg ccc tcg cgc tcc aac 12228

Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro Ser Arg Ser Asn

380 385 390 395

ccc acg cac gag aag gtc ctg gcc atc gtg aac gcg ctg gtg gag aac 12276

Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala Leu Val Glu Asn

400 405 410

aag gcc atc cgc ggc gac gag gcc ggc ctg gtg tac aac gcg ctg ctg 12324

Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr Asn Ala Leu Leu

415 420 425

gag cgc gtg gcc cgc tac aac agc acc aac gtg cag acc aac ctg gac 12372

Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln Thr Asn Leu Asp

430 435 440

cgc atg gtg acc gac gtg cgc gag gcc gtg gcc cag cgc gag cgg ttc 12420

Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln Arg Glu Arg Phe

445 450 455

cac cgc gag tcc aac ctg gga tcc ctg gtg gcg ctg aac gcc ttc ctc 12468

His Arg Glu Ser Asn Leu Gly Ser Leu Val Ala Leu Asn Ala Phe Leu

460 465 470 475

agc acc cag ccc gcc aac gtg ccc cgg ggc cag gag gac tac acc aac 12516

Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn

480 485 490

ttc atc agc gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc gag 12564

Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu

495 500 505

gtg tac cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc cag ggc 12612

Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly

510 515 520

ttg cag acc gtg aac ctg agc cag gcg ttc aag aac ttg cag ggc ctc 12660

Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu

525 530 535

tgg ggc gtg cag gcc ccg gtc ggg gac cgc gcg acg gtg tcg agc ctg 12708

Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr Val Ser Ser Leu

540 545 550 555

ctg acg ccg aac tcg cgc ctg ctg ttg ctg ctg gtg gct ccc ttc acg 12756

Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val Ala Pro Phe Thr

560 565 570

gac agc ggc agc atc aac cgc aac tcg tac ctg ggc tac ctg att aac 12804

Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly Tyr Leu Ile Asn

575 580 585

ctg tac cgc gag gcc atc ggc cag gcg cac gtg gac gag cag acc tac 12852

Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp Glu Gln Thr Tyr

590 595 600

cag gag atc acc cac gtg agc cgc gcc ctg ggc cag gac gac ccg gga 12900

Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln Asp Asp Pro Gly

605 610 615

aat ctg gaa gcc acc ctg aac ttt ttg ctg acc aat cgg tcg cag aag 12948

Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn Arg Ser Gln Lys

620 625 630 635

atc ccg ccc cag tac gcg ctc agc gcc gag gag gag cgc atc ctg cga 12996

Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu Arg Ile Leu Arg

640 645 650

tac gtg cag cag agc gtg ggc ctg ttc ctg atg cag gag ggg gcc acc 13044

Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln Glu Gly Ala Thr

655 660 665

ccc agc gcc gcg ctc gac atg acc gcg cgc aac atg gag ccc agc atg 13092

Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met Glu Pro Ser Met

670 675 680

tac gcc agc aac cgc ccg ttc atc aat aaa ctg atg gac tac ttg cat 13140

Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met Asp Tyr Leu His

685 690 695

cgg gca gcc gcc atg aac tct gac tat ttc acc aac gcc atc ctg aat 13188

Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn Ala Ile Leu Asn

700 705 710 715

ccc cac tgg ctt ccg ccg ccg ggg ttc tac acg ggc gag tac gac atg 13236

Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly Glu Tyr Asp Met

720 725 730

ccc gac ccc aat gac ggg ttc ctg tgg gac gat gtg gac agc agc gtg 13284

Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val Asp Ser Ser Val

735 740 745

ttc tcc ccc cga ccg ggt gct aac gag cgc ccc ttg tgg aag aag gaa 13332

Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu Trp Lys Lys Glu

750 755 760

ggc agc gac cga cgc ccg tcc tcg gcg ctg tcc ggc cgc gag ggt gct 13380

Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly Arg Glu Gly Ala

765 770 775

gcc gcg gcg gtg ccc gag gcc gcc agt cct ttc ccg agc ttg ccc ttc 13428

Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro Phe

780 785 790 795

tcg ctg aac agt atc cgc agc agc gag ctg ggc agg atc acg cgc ccg 13476

Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr Arg Pro

800 805 810

cgc ttg ctg ggc gag gag gag tac ttg aat gac tcc ctg ctg aga ccc 13524

Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu Leu Arg Pro

815 820 825

gag cgg gag aag aac ttc ccc aat aac ggg ata gag agc ctg gtg gac 13572

Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu Ser Leu Val Asp

830 835 840

aag atg agc cga tgg aag acg tat gcg cag gag cac agg gac gat ccc 13620

Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His Arg Asp Asp Pro

845 850 855

cgg gcg tcg cag ggg gcc acg agc cgg ggc agc gcc gcc cgt aaa cgc 13668

Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala Ala Arg Lys Arg

860 865 870 875

cgg tgg cac gac agg cag cgg gga ctg atg tgg gac gat gag gat tcc 13716

Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp Asp Glu Asp Ser

880 885 890

gcc gac gac agc agc gtg ttg gac ttg ggt ggg agt ggt ggt aac ccg 13764

Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser Gly Gly Asn Pro

895 900 905

ttc gct cac ctg cgc ccc cgc atc ggg cgc atg atg taagaaaccg 13810

Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met

910 915

aaaataaatg atactcacca aggccatagc gaccagcgtg cgttcgtttc ttctctgttg 13870

ttgttgtatc tagt atg atg agg cgt gcg tac ccg gag ggt cct cct ccc 13920

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro

920 925 930

tcg tac gag agc gtg atg cag cag gcg atg gcg gcg gcg atg cag ccc 13968

Ser Tyr Glu Ser Val Met Gln Gln Ala Met Ala Ala Ala Met Gln Pro

935 940 945

ccg ctg gag gct cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag 14016

Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu

950 955 960

ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc 14064

Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr

965 970 975

acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg ctg 14112

Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu

980 985 990 995

aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag 14157

Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln

1000 1005 1010

aac aat gac ttc acc ccc acg gag gcc agc acc cag acc atc aac 14202

Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn

1015 1020 1025

ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg 14247

Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met

1030 1035 1040

cac acc aac atg ccc aac gtg aac gag ttc atg tac agc aac aag 14292

His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys

1045 1050 1055

ttc aag gcg cgg gtg atg gtc tcc cgc aag acc ccc aac ggg gtc 14337

Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val

1060 1065 1070

aaa gta gat gac acg tat gat ggt agt cag gat atc ttg gag tat 14382

Lys Val Asp Asp Thr Tyr Asp Gly Ser Gln Asp Ile Leu Glu Tyr

1075 1080 1085

aag tgg gtg gag ttt gag ctg ccc gaa ggc aac ttc tcg gtg acc 14427

Lys Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr

1090 1095 1100

atg acc atc gac ctg atg aac aac gcc atc atc gac aat tac ttg 14472

Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu

1105 1110 1115

gcg gtg ggg cgg cag aac ggg gtc ctg gag agc gat atc ggc gtg 14517

Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val

1120 1125 1130

aag ttc gac act agg aac ttc agg ctg ggc tgg gac ccc gtg acc 14562

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr

1135 1140 1145

gag ctg gtc atg cca ggg gtg tac acc aac gag gcc ttc cac ccc 14607

Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro

1150 1155 1160

gat att gtc ttg ctg ccc ggc tgc ggg gtg gac ttc acc gag agc 14652

Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser

1165 1170 1175

cgc ctc agc aac ctg ctg ggc att cgc aag agg cag cct ttc cag 14697

Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln

1180 1185 1190

gaa ggt ttc cag atc atg tac gat gat ctg gag ggg ggc aac atc 14742

Glu Gly Phe Gln Ile Met Tyr Asp Asp Leu Glu Gly Gly Asn Ile

1195 1200 1205

ccc gcg ctc ctg gat gtc gag gcc tat gag gaa agc aag gaa aaa 14787

Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Glu Ser Lys Glu Lys

1210 1215 1220

gca gaa gcc gag gcg act gca gcc gtg gct act gcc gcg gcc aat 14832

Ala Glu Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ala Ala Asn

1225 1230 1235

gca gat gcc aat gtg act aga ggc gat aca ttc gcc act cag gcg 14877

Ala Asp Ala Asn Val Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala

1240 1245 1250

gag gaa gca gcc gcc cta gcg gtc gcc gat gat agt gaa agt aag 14922

Glu Glu Ala Ala Ala Leu Ala Val Ala Asp Asp Ser Glu Ser Lys

1255 1260 1265

ata gtt atc aaa cca gta aaa gtg gat agc aag aac aga agc tac 14967

Ile Val Ile Lys Pro Val Lys Val Asp Ser Lys Asn Arg Ser Tyr

1270 1275 1280

aac gtg ctg ccg gac gag gta aac acc gcc tac cgc agt tgg tac 15012

Asn Val Leu Pro Asp Glu Val Asn Thr Ala Tyr Arg Ser Trp Tyr

1285 1290 1295

ctg gcc tac aac tat ggc gac ccc gag aag ggc gtg cgc tcc tgg 15057

Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp

1300 1305 1310

acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc 15102

Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val

1315 1320 1325

tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15147

Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg

1330 1335 1340

tcc acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc 15192

Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu

1345 1350 1355

ctg ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac 15237

Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr

1360 1365 1370

tcg cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac 15282

Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn

1375 1380 1385

cgc ttc ccc gag aac cag atc ctc gtc cgc ccg ccc gcg ccc acc 15327

Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr

1390 1395 1400

att acc acc gtc agt gaa aac gtt cct gct ctc aca gat cac ggg 15372

Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly

1405 1410 1415

acc ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc gtg acc 15417

Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr

1420 1425 1430

gtt act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc 15462

Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala

1435 1440 1445

ctg ggc ata gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc taaaaa 15510

Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe

1450 1455

atg tcc att ctc atc tcg ccc agt aat aac acc ggt tgg ggc ctg 15555

Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu

1460 1465 1470

cgc gcg ccc agc aag atg tac gga ggc gct cgc caa cgc tcc acg 15600

Arg Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr

1475 1480 1485

caa cac ccc gtg cgc gtg cgc ggg cac ttc cgc gct ccc tgg ggc 15645

Gln His Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly

1490 1495 1500

gcc ctc aag ggc cgc gtg cgg tcg cgc acc acc gtc gac gac gtg 15690

Ala Leu Lys Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val

1505 1510 1515

atc gac cag gtg gtg gcc gac gcg cgc aac tac acc ccc gcc gcc 15735

Ile Asp Gln Val Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala

1520 1525 1530

gcg ccc gtc tcc acc gtg gac gcc gtc atc gac agc gtg gtg gcc 15780

Ala Pro Val Ser Thr Val Asp Ala Val Ile Asp Ser Val Val Ala

1535 1540 1545

gac gcg cgc cgg tac gcc cgc gcc aag agc cgg cgg cgg cgc atc 15825

Asp Ala Arg Arg Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile

1550 1555 1560

gcc cgg cgg cac cgg agc acc ccc gcc atg cgc gcg gcg cga gcc 15870

Ala Arg Arg His Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ala

1565 1570 1575

ttg ctg cgc agg gcc agg cgc acg gga cgc agg gcc atg ctc agg 15915

Leu Leu Arg Arg Ala Arg Arg Thr Gly Arg Arg Ala Met Leu Arg

1580 1585 1590

gcg gcc aga cgc gcg gct tca ggc gcc agc gcc ggc agg acc cgg 15960

Ala Ala Arg Arg Ala Ala Ser Gly Ala Ser Ala Gly Arg Thr Arg

1595 1600 1605

aga cgc gcg gcc acg gcg gcg gca gcg gcc atc gcc agc atg tcc 16005

Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala Ile Ala Ser Met Ser

1610 1615 1620

cgc ccg cgg cga ggg aac gtg tac tgg gtg cgc gac gcc gcc acc 16050

Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg Asp Ala Ala Thr

1625 1630 1635

ggt gtg cgc gtg ccc gtg cgc acc cgc ccc cct cgc act tgaagatgtt 16099

Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg Thr

1640 1645 1650

gacttcgcga tgttgatgtg tcccagcggc gagg atg tcc aag cgc aaa ttc 16151

Met Ser Lys Arg Lys Phe

1655

aag gaa gag atg ctc cag gtc atc gcg cct gag atc tac ggc ccc 16196

Lys Glu Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro

1660 1665 1670

gcg gcg gtg gtg aag gag gaa aga aag ccc cgc aaa atc aag cgg 16241

Ala Ala Val Val Lys Glu Glu Arg Lys Pro Arg Lys Ile Lys Arg

1675 1680 1685

gtc aaa aag gac aaa aag gaa gaa gat gac gat ctg gtg gag ttt 16286

Val Lys Lys Asp Lys Lys Glu Glu Asp Asp Asp Leu Val Glu Phe

1690 1695 1700

gtg cgc gag ttc gcc ccc cgg cgg cgc gtg cag tgg cgc ggg cgg 16331

Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg

1705 1710 1715

aag gtg caa ccg gtg ctg aga ccc ggc acc acc gtg gtt ttc aca 16376

Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr

1720 1725 1730

ccc ggc gag cgc tcc ggc acc gct tcc aag cgc tcc tac gac gag 16421

Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu

1735 1740 1745

gtg tac ggg gat gat gat atc ctg gag cag gcg gcc gag cgc ctg 16466

Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu

1750 1755 1760

ggc gag ttt gct tac ggc aag cgc agc cgc ccc gcg gcg ccc cca 16511

Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Ala Pro Pro

1765 1770 1775

gaa gcg gtg tcc atc ccg ctg gac cac ggc aac ccc acg ccg agc 16556

Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser

1780 1785 1790

ctc aag ccc gtg acc ctg cag cag gtg ctg ccg agc gcg gcg ccg 16601

Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro

1795 1800 1805

cgc cgg ggc ttc aag cgc gag ggc gag gat ctg tac ccc acc atg 16646

Arg Arg Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met

1810 1815 1820

cag ctg atg gtg ccc aag cgc cag aag ctg gaa gac gtg ctg gag 16691

Gln Leu Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu

1825 1830 1835

acc atg aag gtg gac ccc gag gtg cag ccc gag gtc aag gtg cgg 16736

Thr Met Lys Val Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg

1840 1845 1850

ccc atc aag cag gtg gcc ccg ggc ctg ggc gtg cag acc gtg gac 16781

Pro Ile Lys Gln Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp

1855 1860 1865

atc aag atc ccc acg gag ccc atg gaa acg cag acc gag ccc gtg 16826

Ile Lys Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Val

1870 1875 1880

aag ccc agc acc agc acc atg gag gtg cag acg gat ccc tgg atg 16871

Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met

1885 1890 1895

ccg gcg gcc gct cct aga ccc cgg cgc aag tac ggc gcg gcc agc 16916

Pro Ala Ala Ala Pro Arg Pro Arg Arg Lys Tyr Gly Ala Ala Ser

1900 1905 1910

ctg ctg atg ccc aac tac gcg ctg cat cct tcc atc atc ccc acg 16961

Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr

1915 1920 1925

ccg ggc tac cgc ggc acg cgc ttc tac cgc ggg cag acc acc cgc 17006

Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly Gln Thr Thr Arg

1930 1935 1940

cgc cgc cgc aag acc act cgc cgc cgc cgt cgc cgc aca gcc gct 17051

Arg Arg Arg Lys Thr Thr Arg Arg Arg Arg Arg Arg Thr Ala Ala

1945 1950 1955

gca act tcc gcc gcc gcc ctg gtg cgg aga gtg tac cgc cgc ggc 17096

Ala Thr Ser Ala Ala Ala Leu Val Arg Arg Val Tyr Arg Arg Gly

1960 1965 1970

cgc gcg cct ctg acc ctg ccg cgc gcg cgc tac cac ccg agc atc 17141

Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile

1975 1980 1985

gcc att taaacacttt cgcgcctgct ttgcag atg gcc ctc act tgc cgc 17191

Ala Ile Met Ala Leu Thr Cys Arg

1990 1995

att cgc gtt ccc att gcg ggc tac cga gga aga aaa ccg cgc cgt 17236

Ile Arg Val Pro Ile Ala Gly Tyr Arg Gly Arg Lys Pro Arg Arg

2000 2005 2010

aga agg ctg gcg ggg agc ggg atg cgt cgc cac ccc cac cgg cgg 17281

Arg Arg Leu Ala Gly Ser Gly Met Arg Arg His Pro His Arg Arg

2015 2020 2025

cgg cgc gcc atc agc aag cgg ttg ggg gga ggc ttc ctg ccc gcg 17326

Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe Leu Pro Ala

2030 2035 2040

ctg atc ccc atc atc gcc gcg gcg atc ggg gcg atc ccc ggc atc 17371

Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro Gly Ile

2045 2050 2055

gct tcc gtg gcg gta cag gcc tct cag cgc cac tgagacactt 17414

Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

2060 2065

ggaacatctt gtaataaaaa acca atg gac tct gac gct cct ggt cct gtg 17465

Met Asp Ser Asp Ala Pro Gly Pro Val

2070 2075

atg tgt ttt cgt aga cag atg gaa gac atc aat ttt tcg tcc ctg 17510

Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser Ser Leu

2080 2085 2090

gct ccg cga cac ggc acg cgg ccg ttc atg ggc acc tgg agc gac 17555

Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp

2095 2100 2105

atc ggc aac agc cat ctg aac ggg ggc gcc ttc aat tgg agc agt 17600

Ile Gly Asn Ser His Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser

2110 2115 2120

ctc tgg agc ggg ctt aag aat ttc ggg tcc acg ctt aaa acc tat 17645

Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr

2125 2130 2135

ggc cac aag gcg tgg aac agc aac aca ggg cag gcg ctg agg gaa 17690

Gly His Lys Ala Trp Asn Ser Asn Thr Gly Gln Ala Leu Arg Glu

2140 2145 2150

aag ctg aaa gag cag aac ttc cag cag aag gtg gtc gat ggc ctc 17735

Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu

2155 2160 2165

gcc tcg ggc atc aac ggg gtg gtg gac ctg gcc aac cag gcc gtg 17780

Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val

2170 2175 2180

cag cgg cag atc aac agc cga ctg gag ccg gtg ccg ccc gcc ggc 17825

Gln Arg Gln Ile Asn Ser Arg Leu Glu Pro Val Pro Pro Ala Gly

2185 2190 2195

tcc gtg gag atg ccg cag gtg gag gag gag ctg cct ccc ctg gac 17870

Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp

2200 2205 2210

aag cgg ggc gag aag cga ccc cgc ccc gac gcg gag gag acg ctg 17915

Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu

2215 2220 2225

ctg acg cac acg gac gag ccg ccc ccg tac gag gag gcg gtg aaa 17960

Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys

2230 2235 2240

ctg ggc ctg ccc acc acg cgg ccc atc gcg ccc ctg gcc acc ggg 18005

Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly

2245 2250 2255

gtg ctg aaa ccc gag tct aag ccc gcg acc ctg gac ttg cct cct 18050

Val Leu Lys Pro Glu Ser Lys Pro Ala Thr Leu Asp Leu Pro Pro

2260 2265 2270

ccc ccg aca tcc cgc ccc tcc aca gtg gct aag ccc ctg ccg ccg 18095

Pro Pro Thr Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro

2275 2280 2285

gtg gcc cgc gcg cga ccc ggg agc cgc ccg cag gcg aac tgg cag 18140

Val Ala Arg Ala Arg Pro Gly Ser Arg Pro Gln Ala Asn Trp Gln

2290 2295 2300

agc act ctg aac agc atc gtg ggt ctg gga gtg cag agt gtg aag 18185

Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys

2305 2310 2315

cgc cgc cgc tgc tat taaacaacat accgtagcgc ttaacttgct tgtctgtgtg 18240

Arg Arg Arg Cys Tyr

2320

tgtatgtatt atgtcgccgc cgctgtcgcc agaaggagga gtgaagaggc gcgtcgccga 18300

gttgcaag atg gcc acc cca tcg atg ctg ccc cag tgg gcg tac atg 18347

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met

2325 2330

cac atc gcc gga cag gac gct tcg gag tac ctg agt ccg ggt ctg 18392

His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu

2335 2340 2345

gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg ggg aac 18437

Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn

2350 2355 2360

aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc acc 18482

Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr Thr

2365 2370 2375

gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac cgc 18527

Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg

2380 2385 2390

gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc gtg 18572

Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val

2395 2400 2405

ggc gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac atc 18617

Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile

2410 2415 2420

cgc ggc gtg ctg gac cgg ggc cct agc ttc aaa ccc tac tcc ggc 18662

Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly

2425 2430 2435

acc gcc tac aac agc ctg gcc ccc aag gga gct ccc aat tcc agc 18707

Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser

2440 2445 2450

cag tgg gag caa gca aaa aca ggc aat ggg gga act atg gaa aca 18752

Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr

2455 2460 2465

cac aca tat ggt gtg gcc cca atg ggc gga gag aat att aca aaa 18797

His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn Ile Thr Lys

2470 2475 2480

gat ggt ctt caa att gga aca gat gct aca gcg gat cag aat aaa 18842

Asp Gly Leu Gln Ile Gly Thr Asp Ala Thr Ala Asp Gln Asn Lys

2485 2490 2495

cca att tat gct gac aaa aca ttc cag cca gaa ccc caa gta gga 18887

Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly

2500 2505 2510

gaa gaa aat tgg caa gaa act gaa agc ttt tat ggt ggt agg gct 18932

Glu Glu Asn Trp Gln Glu Thr Glu Ser Phe Tyr Gly Gly Arg Ala

2515 2520 2525

ctt aaa aaa gat aca aac atg aaa cct tgc tac ggc tcc tat gct 18977

Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala

2530 2535 2540

aga ccc act aat gaa aag gga gga caa gct aaa ctt aaa gtt gga 19022

Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys Val Gly

2545 2550 2555

gat gat gga gtt cca acc aaa gaa ttt gat ata gac ctg gct ttc 19067

Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe

2560 2565 2570

ttt gat act ccc ggt ggt act gtg aat ggc caa gat gag tat aaa 19112

Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys

2575 2580 2585

gca gac att gtc atg tat acc gaa aac gcg tat ctg gaa act cca 19157

Ala Asp Ile Val Met Tyr Thr Glu Asn Ala Tyr Leu Glu Thr Pro

2590 2595 2600

gac aca cat gtg gta tac aaa cca ggc aag gaa gat gca agt tct 19202

Asp Thr His Val Val Tyr Lys Pro Gly Lys Glu Asp Ala Ser Ser

2605 2610 2615

gaa att aac ctg gtt cag cag tcc atg ccc aac agg ccc aac tac 19247

Glu Ile Asn Leu Val Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr

2620 2625 2630

att ggg ttc aga gac aac ttt att ggt ctc atg tac tac aac agc 19292

Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser

2635 2640 2645

act ggc aac atg ggg gtg ctg gct ggt cag gcc tca cag ttg aat 19337

Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn

2650 2655 2660

gct gtg gtt gac ttg caa gac aga aac act gaa ctg tcc tac cag 19382

Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln

2665 2670 2675

ctc ttg ctt gac tct ctg ggt gac aga acc agg tat ttc agt atg 19427

Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met

2680 2685 2690

tgg aat cag gcg gtg gat agc tat gac cct gat gtg cgc atc att 19472

Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile

2695 2700 2705

gaa aac cat ggt gtg gag gat gaa ttg cca aac tat tgc ttc ccc 19517

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro

2710 2715 2720

ttg gat ggg tcc ggc act aac gcc gct tac caa ggt gta aaa gta 19562

Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln Gly Val Lys Val

2725 2730 2735

aaa aat ggt gaa gat ggt gat ctt gag agc gag tgg gaa aat gat 19607

Lys Asn Gly Glu Asp Gly Asp Leu Glu Ser Glu Trp Glu Asn Asp

2740 2745 2750

gat act gtg gca gct cga aat caa tta tgt aaa ggc aac att ttt 19652

Asp Thr Val Ala Ala Arg Asn Gln Leu Cys Lys Gly Asn Ile Phe

2755 2760 2765

gcc atg gaa atc aac ctc caa gcc aac ctg tgg agg agt ttc ctc 19697

Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu

2770 2775 2780

tac tcg aac gtg gcc ctg tac ctg ccc gat tct tac aag tac acg 19742

Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr

2785 2790 2795

ccg gcc aac atc act ctg ccc acc aac acc aac acc tac gat tac 19787

Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr

2800 2805 2810

atg aat ggg aga gtg gtg cct ccc tcg ctg gtg gac gcc tac atc 19832

Met Asn Gly Arg Val Val Pro Pro Ser Leu Val Asp Ala Tyr Ile

2815 2820 2825

aac atc ggg gcg cgc tgg tcg ctg gac ccc atg gac aac gtg aac 19877

Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn

2830 2835 2840

ccc ttc aac cac cac cgc aac gcg ggc ctg cgc tac cgc tcc atg 19922

Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met

2845 2850 2855

ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac atc cag gtg ccc 19967

Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro

2860 2865 2870

cag aag ttc ttc gcc atc aag agc ctc ctg ctc ctg ccc ggg tcc 20012

Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser

2875 2880 2885

tac acc tac gag tgg aac ttc cgc aag gac gtc aac atg atc ctg 20057

Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu

2890 2895 2900

cag agc tcc ctc ggc aac gac ctg cgc acg gac ggg gcc tcc atc 20102

Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile

2905 2910 2915

cag ttc acc agc atc aac ctc tac gcc acc ttc ttc ccc atg gcg 20147

Gln Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala

2920 2925 2930

cac aac acc gcc tcc acg ctc gag gcc atg ctg cgc aac gac acc 20192

His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr

2935 2940 2945

aac gac cag tcc ttc aac gac tac ctc tcg gcg gcc aac atg ctc 20237

Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu

2950 2955 2960

tac ccc atc ccg gct aac gcc acc aac gtg ccc atc tcc atc ccc 20282

Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro

2965 2970 2975

tcg cgc aac tgg gcc gcc ttc cgc ggc tgg tcc ttc acg cgc ctc 20327

Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu

2980 2985 2990

aag acc cgc gag acg ccc tcg ctg ggc tcc ggg ttc gac ccc tac 20372

Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr

2995 3000 3005

ttc gtc tac tcg ggc tcc atc ccc tac ctc gac ggc acc ttc tac 20417

Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr

3010 3015 3020

ctc aac cac acc ttc aag aag gtc tcc atc acc ttc gac tcc tcc 20462

Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser

3025 3030 3035

gtc agc tgg ccc ggc aac gac cgg ctc ctg acg ccc aac gag ttc 20507

Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe

3040 3045 3050

gaa atc aag cgc acc gtc gac ggc gag ggg tac aac gtg gcc cag 20552

Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln

3055 3060 3065

tgc aac atg acc aag gac tgg ttc ctg gtc cag atg ctg gcc cac 20597

Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His

3070 3075 3080

tac aac atc ggc tac cag ggc ttc cac gtg ccc gag ggc tac aag 20642

Tyr Asn Ile Gly Tyr Gln Gly Phe His Val Pro Glu Gly Tyr Lys

3085 3090 3095

gac cgc atg tac tcc ttc ttc cgc aac ttc cag ccc atg agc cgc 20687

Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg

3100 3105 3110

cag gtg gtc gac gag gtc aac tac aag gac tac cag gcc gtc acc 20732

Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr

3115 3120 3125

ctg gcc tac cag cac aac aac tcg ggc ttc gtc ggc tac ctc gcg 20777

Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala

3130 3135 3140

ccc acc atg cgc cag ggc cag ccc tac ccc gcc aac tac ccc tac 20822

Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr

3145 3150 3155

ccg ctc att ggc aag agc gcc gtc acc agc gtc acc cag aaa aag 20867

Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys

3160 3165 3170

ttc ctc tgc gac cgc gtc atg tgg cgc atc ccc ttc tcc agc aac 20912

Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn

3175 3180 3185

ttc atg tcc atg ggc gcg ctc acc gac ctc ggc cag aac atg ctc 20957

Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu

3190 3195 3200

tat gcc aac tcc gcc cac gcg cta gac atg aat ttc gaa gtc gac 21002

Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp

3205 3210 3215

ccc atg gat gag tcc acc ctt ctc tat gtt gtc ttc gaa gtc ttc 21047

Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe

3220 3225 3230

gac gtc gtc aga gtg cac cag ccc cac cgc ggc gtc atc gag gcc 21092

Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala

3235 3240 3245

gtc tac ctg cgc acc ccc ttc tcg gcc ggt aac gcc acc acc 21134

Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr

3250 3255 3260

taagctcttg cttcttgcaa gcagct atg gcc tgc gcg ggc tcc ggc gag 21184

Met Ala Cys Ala Gly Ser Gly Glu

3265 3270

cag gag ctc agg gcc atc atc cgc gac ctg ggc tgc ggg ccc tac 21229

Gln Glu Leu Arg Ala Ile Ile Arg Asp Leu Gly Cys Gly Pro Tyr

3275 3280 3285

ttc ctg ggc acc ttc gat aag cgc ttc ccg gga ttc atg gcc ccg 21274

Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala Pro

3290 3295 3300

cac aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc gag acc 21319

His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu Thr

3305 3310 3315

ggg ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc tcg aac 21364

Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg Ser Asn

3320 3325 3330

acc tgc tac ctc ttc gac ccc ttc ggg ttc tcg gac gag cgc ctc 21409

Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg Leu

3335 3340 3345

aag cag atc tac cag ttc gag tac gag ggc ctg ctg cgc cgc agc 21454

Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg Ser

3350 3355 3360

gcc ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag tcc acc 21499

Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser Thr

3365 3370 3375

cag acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc tgc 21544

Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys

3380 3385 3390

tgc atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc atg 21589

Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met

3395 3400 3405

gac aag aac ccc acc atg aac ttg ctg acg ggg gtg ccc aac ggc 21634

Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly

3410 3415 3420

atg ctc cag tcg ccc cag gtg gag ccc acc ctg cgc cgc aac caa 21679

Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln

3425 3430 3435

gag gcg ctc tac cgc ttc ctc aac tcc cac tcc gcc tac ttt cgc 21724

Glu Ala Leu Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe Arg

3440 3445 3450

tcc cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac cgc atg 21769

Ser His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met

3455 3460 3465

aat caa gac atg taaaccgtgt gtgtattttt atctgtgtct ttaataaaca 21821

Asn Gln Asp Met

3470

gcgcttttat gtgacatgca tctctgagat gatttattta gaaatcgaaa gggttctgcc 21881

gggtctcggc atggcccgcg ggcagggaca cgttgcggaa ctggtacttg gccagccact 21941

tgaactcggg gatcagcagt ttcggcagcg gggtgtcggg gaaggagtcg gtccacagct 22001

tccgcgtcag ttgcagggcg cccagcaggt cgggcgcgga gatcttgaaa tcgcagttgg 22061

gacccgcgtt ctgcgcgcgg gagttgcggt acacggggtt gcagcactgg aacaccatca 22121

gggccgggtg tttcacgctc gccagcaccg tcgcgtcggt gatactctcc acgtcaaggt 22181

cctcggcgtt ggccatcccg aagggggtca tcttgcaggt ctgccggccc atggtgggca 22241

cgcacccggg cttgtggttg caatcgcagt gcagggggat cagcatcatc tgggcctggt 22301

cggcgttcat ccccgggtac atggccttca tgaacgcctc cagctgtctg aaggcctgct 22361

gggccttggc tccttcggtg aaaaagaccc cgcaggactt gctagagaac tggttggtag 22421

cgcagccggc gtcgtgcacg cagcagcgcg cgtcgttgtt ggccagctgc accacgctgc 22481

gcccccagcg gttctgggtg atcttggccc ggtcggggtt ctccttcagc gcgcgctgcc 22541

cgttctcgct cgccacatcc atctcgacca tgtgctcctt ctggatcatg gtgctgccgt 22601

gcaggcaccg cagcttgccc tcggcctcgg tgcacccatg cagccacagc gcgcacccgg 22661

tgcactccca gttcttgtgg gcgatctggg aatgcgagtg cacgaagccc tgcaggaagc 22721

ggcccatcat ggtcgtcagg gtcttgttgc tggtgaaggt cagcgggatg ccgcggtgct 22781

cctcgttgat gtacaggtgg cagatgcggc ggtacacctc gccctgctcg ggcatcagct 22841

ggaagttgga tttcagctcg ctttccacgc ggtagcggtc catcagcaga gccatgattt 22901

ccatgccctt ctcccaggcc gagacgatgg gcaggctcat ggggttcttc accatcatct 22961

tggagctagc cgccgcagcc agggggtcgc tctcgtccag ggtctcaaag ctccgcttgc 23021

cgtccttctc ggtgatccgc acgggggggt agctgaagcc cacggccgcc agctcctcct 23081

cggcctgtct ttcgtcctcg ctgtcctggc tgacgtcctg caggaccaca tgcttggtct 23141

tgcggggctt cttcttgggc ggcagcggcg gcggctgttc ctgaggcgag ggggagcgcg 23201

agtgctcgct caccaccaca atttcttctt cttggtccga ggccacgcgg cggtaggtat 23261

gtctcttcag gggcagaggc ggaggcgacg ggctctcctc gcggcccggc gggtggctgg 23321

cagagcccct tccgcgttcg ggggtgcgct cccggcggcg ctctgactga cttcctccgc 23381

ggccggccat tgtgttctcc tagggaacaa caacaagc atg gag act cag cca 23434

Met Glu Thr Gln Pro

3475

tcg tcg cca acc tcg cca tct gcc ccc acc acc gcc gac gag aag 23479

Ser Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr Ala Asp Glu Lys

3480 3485 3490

cag cag cag cag aat gaa agc tta acc gcc ccg ccg ccc agc ccc 23524

Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro

3495 3500 3505

gcc tcc gac gcg gcc gcg gtc cca gac atg caa gag atg gag gaa 23569

Ala Ser Asp Ala Ala Ala Val Pro Asp Met Gln Glu Met Glu Glu

3510 3515 3520

tcc atc gag att gac ctg ggc tat gtg acg ccc gcg gag cac gag 23614

Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu

3525 3530 3535

gag gag ctg gca gtg cgc ttt tca tcg agc cag gaa gaa aaa gaa 23659

Glu Glu Leu Ala Val Arg Phe Ser Ser Ser Gln Glu Glu Lys Glu

3540 3545 3550

cag cag cca gaa cag gaa gca gag agc gag cag agt cag gct ggg 23704

Gln Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Ser Gln Ala Gly

3555 3560 3565

ctc gag cat ggc gac tac ctc cac ctg agc ggg ggg gag gac gcg 23749

Leu Glu His Gly Asp Tyr Leu His Leu Ser Gly Gly Glu Asp Ala

3570 3575 3580

ctc atc aag cat ctg gcc cgg cag gcc ctc atc gtc aag gac gcg 23794

Leu Ile Lys His Leu Ala Arg Gln Ala Leu Ile Val Lys Asp Ala

3585 3590 3595

ctg ctc gac cgc atc gag gtg ccc ctc agc gtg gag gag ctc agc 23839

Leu Leu Asp Arg Ile Glu Val Pro Leu Ser Val Glu Glu Leu Ser

3600 3605 3610

cgc gcc tac gag ctg aac ctc ttc tcg ccg cgc gtg ccc ccc aag 23884

Arg Ala Tyr Glu Leu Asn Leu Phe Ser Pro Arg Val Pro Pro Lys

3615 3620 3625

cgc cag ccc aac ggc acc tgc gag ccc aac ccg cgc ctc aac ttc 23929

Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe

3630 3635 3640

tac ccg gtc ttc gcg gtg ccc gag gcc ctg gcc acc tac cac atc 23974

Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr Tyr His Ile

3645 3650 3655

ttt ttc aag aac caa aag atc ccc gtc tcc tgc cgc gcc aac cgc 24019

Phe Phe Lys Asn Gln Lys Ile Pro Val Ser Cys Arg Ala Asn Arg

3660 3665 3670

acc cgc gcc gac gcc ctc ttc aac ctg ggc ccc ggc gcc cgc cta 24064

Thr Arg Ala Asp Ala Leu Phe Asn Leu Gly Pro Gly Ala Arg Leu

3675 3680 3685

cct gat atc gcc tcc ttg gaa gag gtt ccc aag atc ttc gag ggt 24109

Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe Glu Gly

3690 3695 3700

ctg ggc agc gac gag act cgg gcc gcg aac gct ctg caa gga gaa 24154

Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly Glu

3705 3710 3715

gga gga gag cat gag cac cac agc gcc ctg gtc gag ttg gaa ggc 24199

Gly Gly Glu His Glu His His Ser Ala Leu Val Glu Leu Glu Gly

3720 3725 3730

gac aac gcc cgc ttg gcg gtg ctc aag cgc acg gtc gag ctg acc 24244

Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr

3735 3740 3745

cat ttc gcc tac ccg gcc ctg aac ctg ccc ccc aaa gtc atg agc 24289

His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser

3750 3755 3760

gcg gtc atg gac cag gtg ctc atc aag cgc gcg tcg ccc atc tcc 24334

Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Ile Ser

3765 3770 3775

gag gac gag ggc atg caa gac tcc gag gag ggc aag ccc gtg gtc 24379

Glu Asp Glu Gly Met Gln Asp Ser Glu Glu Gly Lys Pro Val Val

3780 3785 3790

agc gac gag cag ctg gcc cgg tgg ctg ggt cct aat gct acc cct 24424

Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Pro Asn Ala Thr Pro

3795 3800 3805

cag agt ttg gaa gag cgg cgc aag ctc atg atg gcc gtg gtc ctg 24469

Gln Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu

3810 3815 3820

gtg acc gtg gag ctg gag tgc ttg cgc cgc ttc ttc gcc gac gcg 24514

Val Thr Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala

3825 3830 3835

gag acc ctg cgc aag gtc gag gag aac ctg cac tac ctc ttc agg 24559

Glu Thr Leu Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg

3840 3845 3850

cac ggg ttc gtg cgc cag gcc tgc aag atc tcc aac gtg gag ctg 24604

His Gly Phe Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu

3855 3860 3865

acc aac ctg gtc tcc tac atg ggc atc ttg cac gag aac cgc ctg 24649

Thr Asn Leu Val Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu

3870 3875 3880

ggg cag aac gtg ctg cac acc acc ctg cgc ggg gag gcc cgc cgc 24694

Gly Gln Asn Val Leu His Thr Thr Leu Arg Gly Glu Ala Arg Arg

3885 3890 3895

gac tac atc cgc gac tgc gtc tac ctc tac ctc tgc cac acc tgg 24739

Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu Cys His Thr Trp

3900 3905 3910

cag acg ggc atg ggc gtg tgg cag cag tgt ctg gag gag cag aac 24784

Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln Asn

3915 3920 3925

ctg aaa gag ctc tgc aag ctc ctg cag aag aac ctc aag ggt ctg 24829

Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Gly Leu

3930 3935 3940

tgg acc ggg ttc aac gag cga acc acc gcc tcg gac ctg gcc gac 24874

Trp Thr Gly Phe Asn Glu Arg Thr Thr Ala Ser Asp Leu Ala Asp

3945 3950 3955

ctc atc ttc ccc gag cgc ctc agg ctg acg ctg cgc aac ggc ctg 24919

Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu

3960 3965 3970

ccc gac ttt atg agc caa agc atg ttg caa aac ttt cgc tct ttc 24964

Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe

3975 3980 3985

atc ctc gaa cgc tcc gga atc ctg ccc gcc acc tgc tcc gcg ctg 25009

Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu

3990 3995 4000

ccc tcg gac ttc gtg ccg ctg acc ttc cgc gag tgc ccc ccg ccg 25054

Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro

4005 4010 4015

ctg tgg agc cac tgc tac ctg ctg cgc ctg gcc aac tac ctg gcc 25099

Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala

4020 4025 4030

tac cac tcg gac gtg atc gag gac gtc agc ggc gag ggc ctg ctc 25144

Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu

4035 4040 4045

gag tgc cac tgc cgc tgc aac ctc tgc acg ccg cac cgc tcc ctg 25189

Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu

4050 4055 4060

gct tgc aac ccc cag ctg ctg agc gag acc cag atc atc ggc acc 25234

Ala Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr

4065 4070 4075

ttc gag ttg caa ggg ccc agc gat gag ggt tca gcc aag ggg agt 25279

Phe Glu Leu Gln Gly Pro Ser Asp Glu Gly Ser Ala Lys Gly Ser

4080 4085 4090

ctg aaa ctc acc ccg ggg ctg tgg acc tcg gcc tac ttg cgc aag 25324

Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys

4095 4100 4105

ttc gtg ccc gag gac tac cat ccc ttc gag atc agg ttc tac gag 25369

Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu

4110 4115 4120

gac caa tcc cag ccg ccc aag gcc gag ctg tcg gcc tgc gtc atc 25414

Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala Cys Val Ile

4125 4130 4135

acc cag ggg gcc atc ctg gcc caa ttg caa gcc atc cag aaa tcc 25459

Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser

4140 4145 4150

cgc caa gaa ttc ttg ctg aaa aag ggc cgc ggg gtc tac ctc gac 25504

Arg Gln Glu Phe Leu Leu Lys Lys Gly Arg Gly Val Tyr Leu Asp

4155 4160 4165

ccc cag acc ggt gag gag ctc aac ccc ggc ttc ccc cag gat gcc 25549

Pro Gln Thr Gly Glu Glu Leu Asn Pro Gly Phe Pro Gln Asp Ala

4170 4175 4180

ccg agg aaa caa gaa gct gaa agt gga gct gcc gcc cgt gga gga 25594

Pro Arg Lys Gln Glu Ala Glu Ser Gly Ala Ala Ala Arg Gly Gly

4185 4190 4195

ttt gga gga aga ctg gga gaa cag cag tca ggc aga gga gga gga 25639

Phe Gly Gly Arg Leu Gly Glu Gln Gln Ser Gly Arg Gly Gly Gly

4200 4205 4210

gat gga gga aga ctg gga cag cac tca ggc aga gga gga cag cct 25684

Asp Gly Gly Arg Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro

4215 4220 4225

gca aga cag tct gga gga aga cga gga gga ggc aga gga gga ggt 25729

Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Arg Gly Gly Gly

4230 4235 4240

gga aga agc agc cgc cgc cag acc gtc gtc ctc ggc gga gaa agc 25774

Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Glu Ser

4245 4250 4255

aag cag cac gga tac cat ctc cgc tcc ggg tcg ggg tcc cgc tcg 25819

Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Ser

4260 4265 4270

acc cca cag tagatgggac gagaccgggc gattcccgaa ccccaccacc 25868

Thr Pro Gln

cagaccggta agaaggagcg gcagggatac aagtcctggc gggggcacaa aaacgccatc 25928

gtctcctgct tgcaggcctg cgggggcaac atctccttca cccggcgcta cctgctcttc 25988

caccgcgggg tgaacttccc ccgcaacatc ttgcattact accgtcacct ccacagcccc 26048

tactacttcc aagaagaggc aacagcagca gaaaaagacc agaaaaccag cagcagctag 26108

aaaatccaca gcggcaggtg gactgaggat cgcagcgaac gagccggcgc agacccggga 26168

gctgaggaac cggatctttc ccaccctcta tgccatcttc cagcagagtc gggggcagga 26228

gcaggaactg aaagtcaaga accgttctct gcgctcgctc acccgcagtt gtctgtatca 26288

caagagcgaa gaccaacttc agcgcactct cgaggacgcc gaggctctct tcaacaagta 26348

ctgcgcgctc actcttaaag agtagcccgc gcccgcccag ccgcagaaaa aggcgggaat 26408

tacgtcacct gtgcccttcg ccctagccgc ctccacccat catc atg agc aaa gag 26464

Met Ser Lys Glu

4275

att ccc acg cct tac atg tgg agc tac cag ccc cag atg ggc ctg 26509

Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln Met Gly Leu

4280 4285 4290

gcc gcc ggc gcc gcc cag gac tac tcc acc cgc atg aat tgg ctc 26554

Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn Trp Leu

4295 4300 4305

agc gcc ggg ccc gcg atg atc tca cgg gtg aat gac atc cgc gcc 26599

Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg Ala

4310 4315 4320

cac cga aac cag ata ctc cta gaa cag tca gcg ctc acc gcc acg 26644

His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr

4325 4330 4335

ccc cgc aat cac ctc aat ccg cgt aat tgg ccc gcc gcc ctg gtg 26689

Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val

4340 4345 4350

tac cag gaa att ccc cag ccc acg acc gta cta ctt ccg cga gac 26734

Tyr Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp

4355 4360 4365

gcc cag gcc gaa gtc cag ctg act aac tca ggt gtc cag ctg gcg 26779

Ala Gln Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala

4370 4375 4380

ggc ggc gcc acc ctg tgt cgt cac cgc ccc gct cag ggt ata aag 26824

Gly Gly Ala Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys

4385 4390 4395

cgg ctg gtg atc cgg ggc aga ggc aca cag ctc aac gac gag gtg 26869

Arg Leu Val Ile Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val

4400 4405 4410

gtg agc tct tcg ctg ggt ctg cga cct gac gga gtc ttc caa atc 26914

Val Ser Ser Ser Leu Gly Leu Arg Pro Asp Gly Val Phe Gln Ile

4415 4420 4425

gcc gga tcg ggg aga tct tcc ttc acg cct cgt cag gcg gtc ctg 26959

Ala Gly Ser Gly Arg Ser Ser Phe Thr Pro Arg Gln Ala Val Leu

4430 4435 4440

acc ttg gag agt tcg tcc tcg cag ccc cgc tcg ggc ggc atc ggc 27004

Thr Leu Glu Ser Ser Ser Ser Gln Pro Arg Ser Gly Gly Ile Gly

4445 4450 4455

act ctc cag ttc gtg gag gag ttc act ccc tcg gtc tac ttc aac 27049

Thr Leu Gln Phe Val Glu Glu Phe Thr Pro Ser Val Tyr Phe Asn

4460 4465 4470

ccc ttc tcc ggc tcc ccc ggc cac tac ccg gac gag ttc atc ccg 27094

Pro Phe Ser Gly Ser Pro Gly His Tyr Pro Asp Glu Phe Ile Pro

4475 4480 4485

aac ttt gac gcc atc agc gag tca gtg gac ggc tac gat tgaatgtccc 27143

Asn Phe Asp Ala Ile Ser Glu Ser Val Asp Gly Tyr Asp

4490 4495 4500

atggtggcgc ggctgaccta gctcggcttc gacacctgga ccactgccgc cgctttcgct 27203

gcttcgctcg ggacctcgcc gagttcacct actttgagct gcccgaggag catcctcagg 27263

gcccggccca cggagtgcgg atcgtcgtcg aagggggcct agactcccac ctgcttcgga 27323

tcttcagcca gcgcccgatc ctggtcgagc gccaacaggg caacaccctc ctgaccctct 27383

actgcatctg cgaccacccc ggcctgc atg aaa gtc ttt gtt gtc tgc tgt 27434

Met Lys Val Phe Val Val Cys Cys

4505

gta ctg agt ata ata aaa gct gag atc agc gac tac tcc gga ctc 27479

Val Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr Ser Gly Leu

4510 4515 4520

aac tgt ggt gtt tct gca tcc atc aac cgg tct ctg acc ttc acc 27524

Asn Cys Gly Val Ser Ala Ser Ile Asn Arg Ser Leu Thr Phe Thr

4525 4530 4535

ggg aac gag acc gag ctc cag ctc cag tgt aag ccc cac aag aag 27569

Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys

4540 4545 4550

tac ctc acc tgg ctg tac cag ggc tcc ccg atc gcc gtt gtt aac 27614

Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile Ala Val Val Asn

4555 4560 4565

cac tgc gac gac gac gga gtc ctg ctg aac ggc ccc gcc aac ctt 27659

His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro Ala Asn Leu

4570 4575 4580

act ttt tcc acc cgc aga agc aag cta ctg ctc ttc aga ccc ttc 27704

Thr Phe Ser Thr Arg Arg Ser Lys Leu Leu Leu Phe Arg Pro Phe

4585 4590 4595

ctc ccc ggc acc tat cag tgc atc tcg gga ccc tgc cat cac acc 27749

Leu Pro Gly Thr Tyr Gln Cys Ile Ser Gly Pro Cys His His Thr

4600 4605 4610

ttc cac ctg atc ccg aat acc acc acc tct tcc cca gcg ccg ctc 27794

Phe His Leu Ile Pro Asn Thr Thr Thr Ser Ser Pro Ala Pro Leu

4615 4620 4625

ccc act aac aac caa act aac cac cac caa cgc cac cgt cga gac 27839

Pro Thr Asn Asn Gln Thr Asn His His Gln Arg His Arg Arg Asp

4630 4635 4640

ctc gtt gaa tct aat acc acc cac acc gga ggt gag ctc cga ggt 27884

Leu Val Glu Ser Asn Thr Thr His Thr Gly Gly Glu Leu Arg Gly

4645 4650 4655

cct gaa tcc tct ggg att tat tac ggc ccc tgg gag gtg gtg ggg 27929

Pro Glu Ser Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly

4660 4665 4670

tta ata gct tta ggc tta gta gcg ggt ggg ctt ttg gct ctc tgc 27974

Leu Ile Ala Leu Gly Leu Val Ala Gly Gly Leu Leu Ala Leu Cys

4675 4680 4685

tac cta tac ctc cct tgc ttt tcc tac tta gtg gtg ctt tgt tgc 28019

Tyr Leu Tyr Leu Pro Cys Phe Ser Tyr Leu Val Val Leu Cys Cys

4690 4695 4700

tgg ttt aag aaa tgg gga aga tca ccc tagtgtgcgg tgtgctggtg 28066

Trp Phe Lys Lys Trp Gly Arg Ser Pro

4705 4710

acggtggtgc tttcgattct gggaggggga agcgcggctg tagtgacgga gaagaaggcc 28126

gatccctgct tgactttcaa ccccgataac tgccggctga gttttcagcc agatggcaat 28186

cggtgtgcgg tgttgatcaa gtgcggatgg gaatgcgaga gcgtggtgat ccagtataaa 28246

aacaaaacct ggaacaatac tctcgcgtcc acatggcagc ccggggaccc cgagtggtac 28306

accgtctctg tccctggtgc tgacggctcc ctccgcacgg tgaacaacac tttcattttt 28366

gagcacatgt gcgaaaccgc catgttcatg agcaagcagt acggtatgtg gcccccacgg 28426

aaagagaata tcgtggtctt ctccatcgct tacagcgcgt gcacggtgct aatcaccgcg 28486

atcgtgtgcc tgagcattca catgctcatc gctattcgcc ccagaaataa tgccgagaaa 28546

gagaaacagc cataacacac ttttttcaca cactttttca gacc atg gcc tct gtt 28602

Met Ala Ser Val

4715

act ttt ttg ata att gcc agt ctc att gct gtt ata agt aat gag 28647

Thr Phe Leu Ile Ile Ala Ser Leu Ile Ala Val Ile Ser Asn Glu

4720 4725 4730

aaa ctc act gtt tac att ggt act aat cac aca cta gaa gga att 28692

Lys Leu Thr Val Tyr Ile Gly Thr Asn His Thr Leu Glu Gly Ile

4735 4740 4745

cca aaa tcc tca tgg tat tgc tat ttt gat caa aat cca gag cta 28737

Pro Lys Ser Ser Trp Tyr Cys Tyr Phe Asp Gln Asn Pro Glu Leu

4750 4755 4760

act gta gaa tta tgt ggt aac agt gga aaa aat aaa agc att cat 28782

Thr Val Glu Leu Cys Gly Asn Ser Gly Lys Asn Lys Ser Ile His

4765 4770 4775

tta att aac ttt ggg tgc gga aaa gat ttg ata cta att aat atc 28827

Leu Ile Asn Phe Gly Cys Gly Lys Asp Leu Ile Leu Ile Asn Ile

4780 4785 4790

act aga gag tat gca ggt atg tac tac tac gcc aca gaa ggc att 28872

Thr Arg Glu Tyr Ala Gly Met Tyr Tyr Tyr Ala Thr Glu Gly Ile

4795 4800 4805

tca aac atg caa ttt tat gaa gtt act gtg tct gaa cct acc acg 28917

Ser Asn Met Gln Phe Tyr Glu Val Thr Val Ser Glu Pro Thr Thr

4810 4815 4820

cct aaa atg acc aca acc aca aag act aca gtt gtc agc act ata 28962

Pro Lys Met Thr Thr Thr Thr Lys Thr Thr Val Val Ser Thr Ile

4825 4830 4835

cat ctc agt acc aat ggc att ttt gct atg ctt cag atg gcc aat 29007

His Leu Ser Thr Asn Gly Ile Phe Ala Met Leu Gln Met Ala Asn

4840 4845 4850

aac agc acc agc atg cca cac aca cct aca cta cct agt gag gaa 29052

Asn Ser Thr Ser Met Pro His Thr Pro Thr Leu Pro Ser Glu Glu

4855 4860 4865

ata cct aaa tcc atg gtc gcc atc ata gct gct gta gtg gtg tgt 29097

Ile Pro Lys Ser Met Val Ala Ile Ile Ala Ala Val Val Val Cys

4870 4875 4880

atg ctg att att gtt ttg tgc atg gcc tac tac gct tgc tac tac 29142

Met Leu Ile Ile Val Leu Cys Met Ala Tyr Tyr Ala Cys Tyr Tyr

4885 4890 4895

aga agg cac aga cta agc gac aag ctg gac caa tta ctg agt gtt 29187

Arg Arg His Arg Leu Ser Asp Lys Leu Asp Gln Leu Leu Ser Val

4900 4905 4910

gaa ttt taatttttta gaacc atg aag atc cta gcc ctt ttt gtt tta 29235

Glu Phe Met Lys Ile Leu Ala Leu Phe Val Leu

4915 4920

tct ata att aca tct gct cta tgc aat tct gac aat gag gat gtt 29280

Ser Ile Ile Thr Ser Ala Leu Cys Asn Ser Asp Asn Glu Asp Val

4925 4930 4935

act gtc att gtc gga tcc aat tac aca cta aaa ggc cca acc gag 29325

Thr Val Ile Val Gly Ser Asn Tyr Thr Leu Lys Gly Pro Thr Glu

4940 4945 4950

ggt atg ctt tcg tgg tat tgc tat ttt gga act gat act aag caa 29370

Gly Met Leu Ser Trp Tyr Cys Tyr Phe Gly Thr Asp Thr Lys Gln

4955 4960 4965

aca gag ctc tgt aat gct atg aaa ggt cag atg cct act tcc aga 29415

Thr Glu Leu Cys Asn Ala Met Lys Gly Gln Met Pro Thr Ser Arg

4970 4975 4980

att aaa cat aaa tgt aat ggt act gac cta ata cta ctc aat atc 29460

Ile Lys His Lys Cys Asn Gly Thr Asp Leu Ile Leu Leu Asn Ile

4985 4990 4995

aca aaa gca tat gct ggc agt tat tcc tgc cct gcg gcc aac act 29505

Thr Lys Ala Tyr Ala Gly Ser Tyr Ser Cys Pro Ala Ala Asn Thr

5000 5005 5010

gag gat atg att ttt tac aaa gta act gtt gtt gat ccc acc act 29550

Glu Asp Met Ile Phe Tyr Lys Val Thr Val Val Asp Pro Thr Thr

5015 5020 5025

cca cca cca ccc acc aca act act aaa gct acc aca cac aca cag 29595

Pro Pro Pro Pro Thr Thr Thr Thr Lys Ala Thr Thr His Thr Gln

5030 5035 5040

caa acc aca cca gat aag ggg ttt gcc tta aaa gtt cag caa gaa 29640

Gln Thr Thr Pro Asp Lys Gly Phe Ala Leu Lys Val Gln Gln Glu

5045 5050 5055

ggc ggg gct tct ttt gga gct aag acc cct act cct gat cct cac 29685

Gly Gly Ala Ser Phe Gly Ala Lys Thr Pro Thr Pro Asp Pro His

5060 5065 5070

tgt cca ggg cca tta gtc agc agc att gtc ggt gtg ctt tgt ggt 29730

Cys Pro Gly Pro Leu Val Ser Ser Ile Val Gly Val Leu Cys Gly

5075 5080 5085

tta gca att gtg atc atc tgc ttg ttc att ttt gct tgc tgc tac 29775

Leu Ala Ile Val Ile Ile Cys Leu Phe Ile Phe Ala Cys Cys Tyr

5090 5095 5100

aga agg ctt cac cga caa aaa tca gac cca ctg ctg aac ctc tat 29820

Arg Arg Leu His Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr

5105 5110 5115

gtt taattttaat ttcagaacca tgaaggcact tatcacttta gtttttcttt 29873

Val

ctttaattgg cactgttttt agtgttagtt ttttgaaaaa aatcaatgtc actgagggag 29933

ataatgtgac attggttggt gtagaaggtg ctgaaaatac cacctggaca aaatatcaca 29993

tggatgggtg gaaagaaatt tgcacctgga atgtcactac ttatacatgt gagggagtta 30053

accttaccat tgtcaatgtt agcgaaattc aaaagggttg gattaaaggg cagagtgtta 30113

gtgtcaccag tgagggatat tttacccaac atacttttat ctatgacatt attgtcatac 30173

cactgcccac acctagccca cctagcacta ctcagaccac acaaacaacc tacactacac 30233

agagctccac aactaccttg caaaccactc agacaaccac ttacactaca tccactcagc 30293

ccaccaccac tacagcagag gcgagtagct caccaaccat caaagtggca tttttgttac 30353

tgccaccatc tagcagtcct gctagtacca atgagcagac tactgaattt ttgtccacta 30413

ttcagagcag caccacagct acctcgagtg ccttctctag caccgccaat ctcacctcgc 30473

tttcctctac gccaatcagt aatgctacta cctccccagc tcctcttccc actcctctga 30533

agcaatcaga gtctagcacg cagctgcaga tcaccctgct cattgtgatc ggggtggtca 30593

tcctggcagt gctgctctac tttatcttct gccgccgcat ccccaacgcg aagccggcct 30653

acaagcccat tgttatcggc acgccggagc cgcttcaggt ggagggaggt ctaaggaatc 30713

ttctcttctc ttttacagta tggtgatttg aact atg att cct aga cat ttc 30765

Met Ile Pro Arg His Phe

5120

att atc act tct cta atc tgt gtg ctc caa gtc tgt gcc acc ctc 30810

Ile Ile Thr Ser Leu Ile Cys Val Leu Gln Val Cys Ala Thr Leu

5125 5130 5135

gct ctc gtg gct aac gcg agt cca gac tgc att gga gcg ttc gcc 30855

Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Ala Phe Ala

5140 5145 5150

tcc tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc 30900

Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser

5155 5160 5165

ata gtc tgc ctg ctt atc acc ttc ttc cag ttc gtt gac tgg gtc 30945

Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Val Asp Trp Val

5170 5175 5180

ttt gtg cgc atc gcc tac ctg cgc cac cat ccc cag tac cgc gac 30990

Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp

5185 5190 5195

cag aga gtg gcg caa ctg ttg aga ctc atc tgataagc atg cgg gct 31037

Gln Arg Val Ala Gln Leu Leu Arg Leu Ile Met Arg Ala

5200 5205 5210

ctg cta ctt ctc gcg ctt gtg cta gct ccc ctc gcc gcc ccc tta 31082

Leu Leu Leu Leu Ala Leu Val Leu Ala Pro Leu Ala Ala Pro Leu

5215 5220 5225

acc ctc aaa tcc ccc acc cag tcc cct gaa gag gtt cga aaa tgt 31127

Thr Leu Lys Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys

5230 5235 5240

aaa ttc caa gaa ccc tgg aaa ttc ctt tca tgc tac aaa ctc aaa 31172

Lys Phe Gln Glu Pro Trp Lys Phe Leu Ser Cys Tyr Lys Leu Lys

5245 5250 5255

tca gaa atg cac ccc agc tgg atc atg atc att gga atc gtg aac 31217

Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn

5260 5265 5270

atc ctt gcc tgt acc ctc atc tcc ttt gtg att tac ccc tgc ttt 31262

Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe

5275 5280 5285

aac ttt ggg tgg aac gca ccc gag gcg ctc tgg ctc ccg cct gat 31307

Asn Phe Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro Pro Asp

5290 5295 5300

ccc gac aca cca cca cag cag cag caa aat cag gca cat gca cca 31352

Pro Asp Thr Pro Pro Gln Gln Gln Gln Asn Gln Ala His Ala Pro

5305 5310 5315

cca cag cct agg cca caa tac atg ccc atc tta gac tat gag gcc 31397

Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala

5320 5325 5330

gag cca cag cga gcc atg ctt cct gct att agt tac ttc aat cta 31442

Glu Pro Gln Arg Ala Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu

5335 5340 5345

acc ggc gga gat gac tgaccccatg gccaacaaca ccgtcaacga cctcctggac 31497

Thr Gly Gly Asp Asp

5350

atggacggcc gcgcctcgga gcagcgactc gcccaactcc gcatccgcca gcagcaggag 31557

agagccgtca aggagctgca ggatgcggtg gccatccacc agtgcaagaa aggcatcttc 31617

tgcctggtga agcaggccaa gatcaccttc gaggtgactt ccaccgacca tcgcctctcc 31677

tacgagctcc tgcagcagcg ccagaagttc acctgcctgg tcggagtcaa ccccatcgtc 31737

atcacccagc agtctggcga taccaagggg tgcatccact gctcctgcga ctcccccgag 31797

tgcgttcaca ccttgatcaa gaccctctgc ggcctccgcg acctcctccc catgaactaa 31857

tcaactaacc cctacccctt taccctccag taaaaataaa gattaaaaag agatgatgat 31917

gatttgaatt gatcaataaa gaatcactta cttgaaatct gaaaccaggt ctctgtccat 31977

gttttctgtc agcagcactt cactcccctc ttcccagctc tggtactgca ggccccggcg 32037

ggctgcaaac ttcctccaca ctctgaaggg gatgtcaaat tcctcctgtc cctcaatctt 32097

catttttatc ttctatcag atg tcc aaa aag cgc gcg cgg gtg gat gat 32146

Met Ser Lys Lys Arg Ala Arg Val Asp Asp

5355 5360

ggc ttc gac ccc gtg tac ccc tac gat gca gac aac gca ccg act 32191

Gly Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr

5365 5370 5375

gtg ccc ttc atc aac cct ccc ttc gtc tct tca gat gga ttc caa 32236

Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln

5380 5385 5390

gaa aag ccc ctg ggg gtg ttg tcc ctg cgt ctg gcc gac ccc gtc 32281

Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val

5395 5400 5405

acc acc aag aac ggg gct gtc acc ctc aag ctg ggg gag ggg gtg 32326

Thr Thr Lys Asn Gly Ala Val Thr Leu Lys Leu Gly Glu Gly Val

5410 5415 5420

gac ctc gac gac tcg gga aaa ctc atc tcc aaa aat gcc acc aag 32371

Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser Lys Asn Ala Thr Lys

5425 5430 5435

gcc act gcc cct ctc agt att tcc aac aac acc att tcc ctt aac 32416

Ala Thr Ala Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser Leu Asn

5440 5445 5450

atg gat acc cct ttt tac aac aac aac gga acg cta ggc atg aag 32461

Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Thr Leu Gly Met Lys

5455 5460 5465

gta act gca cca cta aag atc tta gac aca aat cta cta aaa aca 32506

Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asn Leu Leu Lys Thr

5470 5475 5480

ctt gtt gtt gct tat ggg cag gga tta gga aca agc acc act ggt 32551

Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Ser Thr Thr Gly

5485 5490 5495

gct ctt gtt gct caa cta gca tac cca ctt gtt ttt aat acc gat 32596

Ala Leu Val Ala Gln Leu Ala Tyr Pro Leu Val Phe Asn Thr Asp

5500 5505 5510

aac aaa att gcc ctt aat tta ggc aat gga cca tta aaa gtg gat 32641

Asn Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp

5515 5520 5525

gca aat aaa ctg aac att aat tgc aaa aga ggt ctc tat gtc acc 32686

Ala Asn Lys Leu Asn Ile Asn Cys Lys Arg Gly Leu Tyr Val Thr

5530 5535 5540

ccc aca aaa gat gca ctg gag att aat atc agt tgg gca aat gct 32731

Pro Thr Lys Asp Ala Leu Glu Ile Asn Ile Ser Trp Ala Asn Ala

5545 5550 5555

atg aca ttt ata gga aat gcc att ggt gta aat att gac aca aaa 32776

Met Thr Phe Ile Gly Asn Ala Ile Gly Val Asn Ile Asp Thr Lys

5560 5565 5570

aaa ggc cta cag tac ggc act tca agc act gaa aca gat gtt aaa 32821

Lys Gly Leu Gln Tyr Gly Thr Ser Ser Thr Glu Thr Asp Val Lys

5575 5580 5585

aat gct ttt cca ctc caa gta aaa ctt gga gct ggt ctt aca ttt 32866

Asn Ala Phe Pro Leu Gln Val Lys Leu Gly Ala Gly Leu Thr Phe

5590 5595 5600

gac agc aca ggt gcc att gtt gct tgg aac aaa gat gat gat aaa 32911

Asp Ser Thr Gly Ala Ile Val Ala Trp Asn Lys Asp Asp Asp Lys

5605 5610 5615

ctt aca cta tgg acc atg gct gac cca tct cca aac tgt cac ata 32956

Leu Thr Leu Trp Thr Met Ala Asp Pro Ser Pro Asn Cys His Ile

5620 5625 5630

tat tct gaa aag gat gct aag ctt aca ctc tgc tta aca aag tgt 33001

Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys

5635 5640 5645

ggc agt cag ata ctg ggc act gtt tcc cta ctg gct gtc gac aat 33046

Gly Ser Gln Ile Leu Gly Thr Val Ser Leu Leu Ala Val Asp Asn

5650 5655 5660

gga agc tta aac cca ata aca gga aaa gta acc aca gct ctt gtt 33091

Gly Ser Leu Asn Pro Ile Thr Gly Lys Val Thr Thr Ala Leu Val

5665 5670 5675

tcg ctg aaa ttt gat gcc aat gga gtt ttg caa gac agt tcg aca 33136

Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln Asp Ser Ser Thr

5680 5685 5690

ctt gac aaa gac tat tgg aat ttc aga caa ggt gat gct aca cct 33181

Leu Asp Lys Asp Tyr Trp Asn Phe Arg Gln Gly Asp Ala Thr Pro

5695 5700 5705

gct gaa ccc tat acc aat gct ata ggt ttt atg ccc aat cta aaa 33226

Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Lys

5710 5715 5720

gca tac cca aaa aac aca aat gga gct gca aaa agt cat att gtt 33271

Ala Tyr Pro Lys Asn Thr Asn Gly Ala Ala Lys Ser His Ile Val

5725 5730 5735

gga aaa gta tac ctt cat ggt gat gta agc aag ccg ctt aat ctg 33316

Gly Lys Val Tyr Leu His Gly Asp Val Ser Lys Pro Leu Asn Leu

5740 5745 5750

ata att act ttt aat gaa acc agt gat gaa tcc tgt act tat tgt 33361

Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys

5755 5760 5765

att aac ttt cag tgg cag tgg gga act gac caa tat aaa act gaa 33406

Ile Asn Phe Gln Trp Gln Trp Gly Thr Asp Gln Tyr Lys Thr Glu

5770 5775 5780

aca ctt gca gtc agt tca ttc aca ttc tcc tac att gcc aag gaa 33451

Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu

5785 5790 5795

taaaccctgc atgacaaccc attcccacca ctgtatctac aatggaaaac tctgaagcag 33511

aaaaataaag ttcaagtgtt ttattgattc aacagttttc acaggattcg agtagttatt 33571

ttccctcctc cctcccaact catggaatac accaccctct ccccacgcac agccttaaac 33631

atctgaatgc cattggtaat ggacatggtt ttggtctcca cattccacac agtttcagag 33691

cgagccagtc tcgggtcggt cagggagatg aaaccctccg ggcactcctg catctgcacc 33751

tcaaagttca gtagctgagg gctgtcctcg gtggtcggga tcacagttat ctggaagaag 33811

agcgatgaga gtcataatcc gcgaacggga tcgggcggtt gtggcgcatc aggccccgca 33871

gcagtcgctg tctgcgccgc tccgtcaagc tgctgctcaa ggggtccggg tccagggact 33931

ccctgcgcat gatgccgatg gccctaagca tcagtcgcct ggtgcggcgg gcgcagcagc 33991

ggatgcggat ctcactcagg tcggagcagt atgtgcagca cagcaccacc aagttgttca 34051

acagtccata gttcaacaca ctccagccaa aactcatctg tggaactatg ctgcccacat 34111

gtccatcgta ccagatcctg atgtaaatca ggtggcgccc cctccagaac acactgccca 34171

tgtacatgat ctccttgggc atatgcaggt tcaccacctc ccggtaccac atcacccgct 34231

ggttaaacat gcagccctgg ataatcctgc ggaaccagat ggccagcacc gccccgcccg 34291

ccatgcagcg cagggacccc gggtcctggc aatggcagtg gatgatccac cgctcgcggc 34351

cgtggatcag ctgggagctg aacaagtcta tgttggcaca gcacaggcac acgctcatgc 34411

atgtcttcag cactctcagt tcctcggggg tcaagaccat gtcccagggc acggggaact 34471

cttgcaggac agtgaacccg gcagaacagg gcaaccctcg cacacaactt acattgtgca 34531

tggacagggt atcgcaatca ggcagcaccg gatgatcctc caccagagaa gcacgggtct 34591

cggtctcctc acagcgaggt aaggggaccg gcggttggta cggatgatgg cgggatgacg 34651

ctaatcgtgt tctggatcgt gtcatgatgg agctgtttcc tgacattttc gtacttcacg 34711

aagcagaatc tggtccgggc actgcacacc gctcgtcggc gacggtctcg gcgcttcgag 34771

cgctcggtgt tgaagttata gaacagccac tccctcagag cgtgcagtat ctcctgagcc 34831

tcttgggtga tgaaaatccc atccgccctg atggctctga tcacatcgac cacggtggaa 34891

tgggccagac ccagccagat gatgcaattt tgttgggttt cggtgacggc gggggaggga 34951

agaacaggaa gaaccatgat taactttatt ccaaacggtc tcggagcact tcaaaatgca 35011

ggtcccggag gtggcacctc tcgcccccac tgtgttggtg gaaaataaca gccaggtcaa 35071

aggtgacacg gttctcgaga tgttccacgg tggcttccag caaagcctcc acgcgcacat 35131

ccagaaacaa gaggacagcg aaagcgggag cgttttctaa ttcctcaatc atcatattac 35191

actcctgcac catccccaga taattttcat ttttccagcc ttgaatgatt cgtattagtt 35251

cctgaggtaa atccaagcca gccatgataa aaagctcgcg cagagcgccc tccaccggca 35311

ttcttaagca caccctcata attccaagag attctgctcc tggttcacct gcagcagatt 35371

aacaatggga atatcaaaat ctctgccgcg atccctaagc tcctccctca acaataactg 35431

tatgtaatct ttcatatcat ctccgaaatt tttagccata ggaccgccag gaataagaga 35491

agggcaagcc acattacaga taaagcgaag tcctccccag tgagcattgc caaatgtaag 35551

attgaaataa gcatgctggc tagaccctgt gatatcttcc agataactgg acagaaaatc 35611

aggcaagcaa tttttaagaa aatcaacaaa agaaaagtcg tctaggtgca cgtttagagc 35671

ctcaggaaca acgatggaat aagtgcaagg agtgcgttcc agcatggtta gtgttttttt 35731

ggcgatctgt agaacaaaaa ataaacatgc aatattaaac catgctagcc tggcgaacag 35791

gtggataaat cactctttcc agcaccaggc aagctacggg gtctccggca cgaccctcgt 35851

agaaactgtc accgtgattg aaaagcatca ccgaaagagt ctcccggtgg ccggcatgga 35911

tgattcgaga agaagcatac actccgggaa cattggcgtc cgtgagtgaa aaaaagcgac 35971

ccaggaagcc ccgaggcact acaatgctca atctcagttc cagcagggcg accccatgcg 36031

gatgaagcac aaaattggca ggtgcgtaaa aaatgtaatt actcccctcc tgcacaggca 36091

gcaaagcccc cgctccctcc agaaacacat acaaagcctc agcgtccata gcttaccgag 36151

cacggcaggc gcaagagtca gagaaaaggc tgagctctaa cctgactgcc cgctcctgag 36211

ctcaatatat agccctaacc tacactgacg taaaggccaa agtctaaaaa tacccgccaa 36271

aatgacacac acgcccagca cacgcccaga aaccggtgac acactcaaaa aaatacgtgc 36331

gcttcctcaa acgcccaaac cagcgtcatt tccgggttcc cacgctacgt caccgctcag 36391

cgactttcaa attccgtcga ccgttaaaaa cgtcactcgc cccgccccta acggtcgctc 36451

atccctcgtc caatcacaac gccccatccc caaattcaaa cagctcattt gcatattaac 36511

gcgcaccaaa agtttgaggt atattattga tgatg 36546

<210> SEQ ID NO 2

<211> LENGTH: 185

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 2

Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu

1 5 10 15

Leu Glu Asn Ala Ser Asn Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe

20 25 30

Gly Gly Asp Leu Ala Arg Leu Val Tyr Arg Ala Lys Gln Asp Tyr Ser

35 40 45

Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Pro Gly Leu Phe Asp Ala

50 55 60

Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala

65 70 75 80

Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe

85 90 95

Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp

100 105 110

Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys

115 120 125

Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp

130 135 140

Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg

145 150 155 160

Arg Gln Gln Gln Gln Gln Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly

165 170 175

Leu Asp Pro Pro Ala Glu Glu Glu Glu

180 185

<210> SEQ ID NO 3

<211> LENGTH: 142

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 3

Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu

1 5 10 15

Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly

20 25 30

Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu

35 40 45

Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala

50 55 60

Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala

65 70 75 80

Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro

85 90 95

Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu

100 105 110

Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu

115 120 125

Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys

130 135 140

<210> SEQ ID NO 4

<211> LENGTH: 592

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 4

Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln

1 5 10 15

Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met

20 25 30

Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln

35 40 45

Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro

50 55 60

Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala

65 70 75 80

Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr

85 90 95

Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln

100 105 110

Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln

115 120 125

Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Leu Val Ala Leu

130 135 140

Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu

145 150 155 160

Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val

165 170 175

Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr

180 185 190

Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn

195 200 205

Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr

210 215 220

Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val

225 230 235 240

Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly

245 250 255

Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp

260 265 270

Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln

275 280 285

Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn

290 295 300

Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu

305 310 315 320

Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln

325 330 335

Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met

340 345 350

Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met

355 360 365

Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn

370 375 380

Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly

385 390 395 400

Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val

405 410 415

Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu

420 425 430

Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly

435 440 445

Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro

450 455 460

Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg

465 470 475 480

Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser

485 490 495

Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu

500 505 510

Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His

515 520 525

Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala

530 535 540

Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp

545 550 555 560

Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser

565 570 575

Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met

580 585 590

<210> SEQ ID NO 5

<211> LENGTH: 540

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 5

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser

1 5 10 15

Val Met Gln Gln Ala Met Ala Ala Ala Met Gln Pro Pro Leu Glu Ala

20 25 30

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser

35 40 45

Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr

50 55 60

Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn

65 70 75 80

Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr

85 90 95

Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg

100 105 110

Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val

115 120 125

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser

130 135 140

Arg Lys Thr Pro Asn Gly Val Lys Val Asp Asp Thr Tyr Asp Gly Ser

145 150 155 160

Gln Asp Ile Leu Glu Tyr Lys Trp Val Glu Phe Glu Leu Pro Glu Gly

165 170 175

Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile

180 185 190

Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp

195 200 205

Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro

210 215 220

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His

225 230 235 240

Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser

245 250 255

Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu

260 265 270

Gly Phe Gln Ile Met Tyr Asp Asp Leu Glu Gly Gly Asn Ile Pro Ala

275 280 285

Leu Leu Asp Val Glu Ala Tyr Glu Glu Ser Lys Glu Lys Ala Glu Ala

290 295 300

Glu Ala Thr Ala Ala Val Ala Thr Ala Ala Ala Asn Ala Asp Ala Asn

305 310 315 320

Val Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu Glu Ala Ala Ala

325 330 335

Leu Ala Val Ala Asp Asp Ser Glu Ser Lys Ile Val Ile Lys Pro Val

340 345 350

Lys Val Asp Ser Lys Asn Arg Ser Tyr Asn Val Leu Pro Asp Glu Val

355 360 365

Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro

370 375 380

Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr

385 390 395 400

Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp

405 410 415

Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val

420 425 430

Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln

435 440 445

Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val

450 455 460

Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro

465 470 475 480

Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly

485 490 495

Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val

500 505 510

Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly

515 520 525

Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe

530 535 540

<210> SEQ ID NO 6

<211> LENGTH: 193

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 6

Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg

1 5 10 15

Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His

20 25 30

Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys

35 40 45

Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val

50 55 60

Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr

65 70 75 80

Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala

85 90 95

Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr

100 105 110

Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr

115 120 125

Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala

130 135 140

Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala

145 150 155 160

Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg

165 170 175

Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg

180 185 190

Thr

<210> SEQ ID NO 7

<211> LENGTH: 338

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 7

Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro

1 5 10 15

Glu Ile Tyr Gly Pro Ala Ala Val Val Lys Glu Glu Arg Lys Pro Arg

20 25 30

Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Asp Asp Leu

35 40 45

Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg

50 55 60

Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe

65 70 75 80

Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu

85 90 95

Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu Gly

100 105 110

Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Ala Pro Pro Glu Ala

115 120 125

Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro

130 135 140

Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly Phe

145 150 155 160

Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met Val Pro

165 170 175

Lys Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val Asp Pro

180 185 190

Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val Ala Pro

195 200 205

Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu Pro Met

210 215 220

Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met Glu Val

225 230 235 240

Gln Thr Asp Pro Trp Met Pro Ala Ala Ala Pro Arg Pro Arg Arg Lys

245 250 255

Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser

260 265 270

Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly Gln

275 280 285

Thr Thr Arg Arg Arg Arg Lys Thr Thr Arg Arg Arg Arg Arg Arg Thr

290 295 300

Ala Ala Ala Thr Ser Ala Ala Ala Leu Val Arg Arg Val Tyr Arg Arg

305 310 315 320

Gly Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His Pro Ser Ile

325 330 335

Ala Ile

<210> SEQ ID NO 8

<211> LENGTH: 77

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 8

Met Ala Leu Thr Cys Arg Ile Arg Val Pro Ile Ala Gly Tyr Arg Gly

1 5 10 15

Arg Lys Pro Arg Arg Arg Arg Leu Ala Gly Ser Gly Met Arg Arg His

20 25 30

Pro His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe

35 40 45

Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro

50 55 60

Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

65 70 75

<210> SEQ ID NO 9

<211> LENGTH: 254

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 9

Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met

1 5 10 15

Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro

20 25 30

Phe Met Gly Thr Trp Ser Asp Ile Gly Asn Ser His Leu Asn Gly Gly

35 40 45

Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser

50 55 60

Thr Leu Lys Thr Tyr Gly His Lys Ala Trp Asn Ser Asn Thr Gly Gln

65 70 75 80

Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val

85 90 95

Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln

100 105 110

Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Glu Pro Val Pro Pro Ala

115 120 125

Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp

130 135 140

Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu

145 150 155 160

Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly

165 170 175

Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys

180 185 190

Pro Glu Ser Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Pro Thr Ser

195 200 205

Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Arg Ala Arg

210 215 220

Pro Gly Ser Arg Pro Gln Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile

225 230 235 240

Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg Cys Tyr

245 250

<210> SEQ ID NO 10

<211> LENGTH: 942

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 10

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala

1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala

20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro

35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu

50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr

65 70 75 80

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met

85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser

100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly

115 120 125

Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly

130 135 140

Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn

145 150 155 160

Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Ala Thr Ala Asp Gln

165 170 175

Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val

180 185 190

Gly Glu Glu Asn Trp Gln Glu Thr Glu Ser Phe Tyr Gly Gly Arg Ala

195 200 205

Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg

210 215 220

Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys Val Gly Asp Asp

225 230 235 240

Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe Phe Asp Thr

245 250 255

Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys Ala Asp Ile Val

260 265 270

Met Tyr Thr Glu Asn Ala Tyr Leu Glu Thr Pro Asp Thr His Val Val

275 280 285

Tyr Lys Pro Gly Lys Glu Asp Ala Ser Ser Glu Ile Asn Leu Val Gln

290 295 300

Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe

305 310 315 320

Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala

325 330 335

Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn

340 345 350

Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr

355 360 365

Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp

370 375 380

Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr

385 390 395 400

Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln Gly Val

405 410 415

Lys Val Lys Asn Gly Glu Asp Gly Asp Leu Glu Ser Glu Trp Glu Asn

420 425 430

Asp Asp Thr Val Ala Ala Arg Asn Gln Leu Cys Lys Gly Asn Ile Phe

435 440 445

Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr

450 455 460

Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala

465 470 475 480

Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly

485 490 495

Arg Val Val Pro Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala

500 505 510

Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His

515 520 525

Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg

530 535 540

Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys

545 550 555 560

Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg

565 570 575

Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg

580 585 590

Thr Asp Gly Ala Ser Ile Gln Phe Thr Ser Ile Asn Leu Tyr Ala Thr

595 600 605

Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu

610 615 620

Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala

625 630 635 640

Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser

645 650 655

Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg

660 665 670

Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr

675 680 685

Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu

690 695 700

Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser

705 710 715 720

Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys

725 730 735

Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr

740 745 750

Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr

755 760 765

Gln Gly Phe His Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe

770 775 780

Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn

785 790 795 800

Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser

805 810 815

Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr

820 825 830

Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser

835 840 845

Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro

850 855 860

Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln

865 870 875 880

Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu

885 890 895

Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val

900 905 910

Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala

915 920 925

Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr

930 935 940

<210> SEQ ID NO 11

<211> LENGTH: 207

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 11

Met Ala Cys Ala Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Ile Arg

1 5 10 15

Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg Phe

20 25 30

Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr

35 40 45

Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn

50 55 60

Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp

65 70 75 80

Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg

85 90 95

Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser

100 105 110

Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys

115 120 125

Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met Asp

130 135 140

Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met Leu

145 150 155 160

Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala Leu

165 170 175

Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe Arg Ser His Arg Ala

180 185 190

Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp Met

195 200 205

<210> SEQ ID NO 12

<211> LENGTH: 803

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 12

Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr

1 5 10 15

Ala Asp Glu Lys Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro

20 25 30

Pro Ser Pro Ala Ser Asp Ala Ala Ala Val Pro Asp Met Gln Glu Met

35 40 45

Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His

50 55 60

Glu Glu Glu Leu Ala Val Arg Phe Ser Ser Ser Gln Glu Glu Lys Glu

65 70 75 80

Gln Gln Pro Glu Gln Glu Ala Glu Ser Glu Gln Ser Gln Ala Gly Leu

85 90 95

Glu His Gly Asp Tyr Leu His Leu Ser Gly Gly Glu Asp Ala Leu Ile

100 105 110

Lys His Leu Ala Arg Gln Ala Leu Ile Val Lys Asp Ala Leu Leu Asp

115 120 125

Arg Ile Glu Val Pro Leu Ser Val Glu Glu Leu Ser Arg Ala Tyr Glu

130 135 140

Leu Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly

145 150 155 160

Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val

165 170 175

Pro Glu Ala Leu Ala Thr Tyr His Ile Phe Phe Lys Asn Gln Lys Ile

180 185 190

Pro Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Phe Asn

195 200 205

Leu Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val

210 215 220

Pro Lys Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn

225 230 235 240

Ala Leu Gln Gly Glu Gly Gly Glu His Glu His His Ser Ala Leu Val

245 250 255

Glu Leu Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val

260 265 270

Glu Leu Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val

275 280 285

Met Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Ile

290 295 300

Ser Glu Asp Glu Gly Met Gln Asp Ser Glu Glu Gly Lys Pro Val Val

305 310 315 320

Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Pro Asn Ala Thr Pro Gln

325 330 335

Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr

340 345 350

Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu

355 360 365

Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val

370 375 380

Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser

385 390 395 400

Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His

405 410 415

Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val

420 425 430

Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln

435 440 445

Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln

450 455 460

Lys Asn Leu Lys Gly Leu Trp Thr Gly Phe Asn Glu Arg Thr Thr Ala

465 470 475 480

Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu

485 490 495

Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe

500 505 510

Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser

515 520 525

Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro

530 535 540

Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala

545 550 555 560

Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu

565 570 575

Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys

580 585 590

Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu

595 600 605

Gln Gly Pro Ser Asp Glu Gly Ser Ala Lys Gly Ser Leu Lys Leu Thr

610 615 620

Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp

625 630 635 640

Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro

645 650 655

Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala

660 665 670

Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys

675 680 685

Gly Arg Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro

690 695 700

Gly Phe Pro Gln Asp Ala Pro Arg Lys Gln Glu Ala Glu Ser Gly Ala

705 710 715 720

Ala Ala Arg Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Gln Ser Gly

725 730 735

Arg Gly Gly Gly Asp Gly Gly Arg Leu Gly Gln His Ser Gly Arg Gly

740 745 750

Gly Gln Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly Gly Arg Gly

755 760 765

Gly Gly Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu Gly Gly Glu

770 775 780

Ser Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser Gly Ser Arg Ser

785 790 795 800

Thr Pro Gln

<210> SEQ ID NO 13

<211> LENGTH: 227

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 13

Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln

1 5 10 15

Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn

20 25 30

Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg

35 40 45

Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr

50 55 60

Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr

65 70 75 80

Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln

85 90 95

Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala

100 105 110

Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile

115 120 125

Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu

130 135 140

Gly Leu Arg Pro Asp Gly Val Phe Gln Ile Ala Gly Ser Gly Arg Ser

145 150 155 160

Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser

165 170 175

Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe

180 185 190

Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr

195 200 205

Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp

210 215 220

Gly Tyr Asp

225

<210> SEQ ID NO 14

<211> LENGTH: 212

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 14

Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu

1 5 10 15

Ile Ser Asp Tyr Ser Gly Leu Asn Cys Gly Val Ser Ala Ser Ile Asn

20 25 30

Arg Ser Leu Thr Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys

35 40 45

Lys Pro His Lys Lys Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile

50 55 60

Ala Val Val Asn His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro

65 70 75 80

Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Leu Leu Phe Arg

85 90 95

Pro Phe Leu Pro Gly Thr Tyr Gln Cys Ile Ser Gly Pro Cys His His

100 105 110

Thr Phe His Leu Ile Pro Asn Thr Thr Thr Ser Ser Pro Ala Pro Leu

115 120 125

Pro Thr Asn Asn Gln Thr Asn His His Gln Arg His Arg Arg Asp Leu

130 135 140

Val Glu Ser Asn Thr Thr His Thr Gly Gly Glu Leu Arg Gly Pro Glu

145 150 155 160

Ser Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala

165 170 175

Leu Gly Leu Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu

180 185 190

Pro Cys Phe Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp

195 200 205

Gly Arg Ser Pro

210

<210> SEQ ID NO 15

<211> LENGTH: 201

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 15

Met Ala Ser Val Thr Phe Leu Ile Ile Ala Ser Leu Ile Ala Val Ile

1 5 10 15

Ser Asn Glu Lys Leu Thr Val Tyr Ile Gly Thr Asn His Thr Leu Glu

20 25 30

Gly Ile Pro Lys Ser Ser Trp Tyr Cys Tyr Phe Asp Gln Asn Pro Glu

35 40 45

Leu Thr Val Glu Leu Cys Gly Asn Ser Gly Lys Asn Lys Ser Ile His

50 55 60

Leu Ile Asn Phe Gly Cys Gly Lys Asp Leu Ile Leu Ile Asn Ile Thr

65 70 75 80

Arg Glu Tyr Ala Gly Met Tyr Tyr Tyr Ala Thr Glu Gly Ile Ser Asn

85 90 95

Met Gln Phe Tyr Glu Val Thr Val Ser Glu Pro Thr Thr Pro Lys Met

100 105 110

Thr Thr Thr Thr Lys Thr Thr Val Val Ser Thr Ile His Leu Ser Thr

115 120 125

Asn Gly Ile Phe Ala Met Leu Gln Met Ala Asn Asn Ser Thr Ser Met

130 135 140

Pro His Thr Pro Thr Leu Pro Ser Glu Glu Ile Pro Lys Ser Met Val

145 150 155 160

Ala Ile Ile Ala Ala Val Val Val Cys Met Leu Ile Ile Val Leu Cys

165 170 175

Met Ala Tyr Tyr Ala Cys Tyr Tyr Arg Arg His Arg Leu Ser Asp Lys

180 185 190

Leu Asp Gln Leu Leu Ser Val Glu Phe

195 200

<210> SEQ ID NO 16

<211> LENGTH: 205

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 16

Met Lys Ile Leu Ala Leu Phe Val Leu Ser Ile Ile Thr Ser Ala Leu

1 5 10 15

Cys Asn Ser Asp Asn Glu Asp Val Thr Val Ile Val Gly Ser Asn Tyr

20 25 30

Thr Leu Lys Gly Pro Thr Glu Gly Met Leu Ser Trp Tyr Cys Tyr Phe

35 40 45

Gly Thr Asp Thr Lys Gln Thr Glu Leu Cys Asn Ala Met Lys Gly Gln

50 55 60

Met Pro Thr Ser Arg Ile Lys His Lys Cys Asn Gly Thr Asp Leu Ile

65 70 75 80

Leu Leu Asn Ile Thr Lys Ala Tyr Ala Gly Ser Tyr Ser Cys Pro Ala

85 90 95

Ala Asn Thr Glu Asp Met Ile Phe Tyr Lys Val Thr Val Val Asp Pro

100 105 110

Thr Thr Pro Pro Pro Pro Thr Thr Thr Thr Lys Ala Thr Thr His Thr

115 120 125

Gln Gln Thr Thr Pro Asp Lys Gly Phe Ala Leu Lys Val Gln Gln Glu

130 135 140

Gly Gly Ala Ser Phe Gly Ala Lys Thr Pro Thr Pro Asp Pro His Cys

145 150 155 160

Pro Gly Pro Leu Val Ser Ser Ile Val Gly Val Leu Cys Gly Leu Ala

165 170 175

Ile Val Ile Ile Cys Leu Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu

180 185 190

His Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val

195 200 205

<210> SEQ ID NO 17

<211> LENGTH: 91

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 17

Met Ile Pro Arg His Phe Ile Ile Thr Ser Leu Ile Cys Val Leu Gln

1 5 10 15

Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile

20 25 30

Gly Ala Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys

35 40 45

Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Val Asp

50 55 60

Trp Val Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg

65 70 75 80

Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Ile

85 90

<210> SEQ ID NO 18

<211> LENGTH: 143

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 18

Met Arg Ala Leu Leu Leu Leu Ala Leu Val Leu Ala Pro Leu Ala Ala

1 5 10 15

Pro Leu Thr Leu Lys Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys

20 25 30

Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Ser Cys Tyr Lys Leu Lys

35 40 45

Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile

50 55 60

Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asn Phe

65 70 75 80

Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro Pro Asp Pro Asp Thr

85 90 95

Pro Pro Gln Gln Gln Gln Asn Gln Ala His Ala Pro Pro Gln Pro Arg

100 105 110

Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg Ala

115 120 125

Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp

130 135 140

<210> SEQ ID NO 19

<211> LENGTH: 445

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 19

Met Ser Lys Lys Arg Ala Arg Val Asp Asp Gly Phe Asp Pro Val Tyr

1 5 10 15

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro

20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser

35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu

50 55 60

Lys Leu Gly Glu Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser

65 70 75 80

Lys Asn Ala Thr Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Asn Thr

85 90 95

Ile Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Thr Leu

100 105 110

Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asn Leu Leu

115 120 125

Lys Thr Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Ser Thr Thr

130 135 140

Gly Ala Leu Val Ala Gln Leu Ala Tyr Pro Leu Val Phe Asn Thr Asp

145 150 155 160

Asn Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala

165 170 175

Asn Lys Leu Asn Ile Asn Cys Lys Arg Gly Leu Tyr Val Thr Pro Thr

180 185 190

Lys Asp Ala Leu Glu Ile Asn Ile Ser Trp Ala Asn Ala Met Thr Phe

195 200 205

Ile Gly Asn Ala Ile Gly Val Asn Ile Asp Thr Lys Lys Gly Leu Gln

210 215 220

Tyr Gly Thr Ser Ser Thr Glu Thr Asp Val Lys Asn Ala Phe Pro Leu

225 230 235 240

Gln Val Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile

245 250 255

Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Met Ala

260 265 270

Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Glu Lys Asp Ala Lys Leu

275 280 285

Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser

290 295 300

Leu Leu Ala Val Asp Asn Gly Ser Leu Asn Pro Ile Thr Gly Lys Val

305 310 315 320

Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln

325 330 335

Asp Ser Ser Thr Leu Asp Lys Asp Tyr Trp Asn Phe Arg Gln Gly Asp

340 345 350

Ala Thr Pro Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn

355 360 365

Leu Lys Ala Tyr Pro Lys Asn Thr Asn Gly Ala Ala Lys Ser His Ile

370 375 380

Val Gly Lys Val Tyr Leu His Gly Asp Val Ser Lys Pro Leu Asn Leu

385 390 395 400

Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile

405 410 415

Asn Phe Gln Trp Gln Trp Gly Thr Asp Gln Tyr Lys Thr Glu Thr Leu

420 425 430

Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu

435 440 445

<210> SEQ ID NO 20

<211> LENGTH: 29960

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1321

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (6)..(1496)

<223> OTHER INFORMATION: E1b\55K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23645)..(24205)

<223> OTHER INFORMATION: 22K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (26131)..(26658)

<223> OTHER INFORMATION: E3\gp19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27919)..(28836)

<223> OTHER INFORMATION: E3\CR1-delta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29553)..(29954)

<223> OTHER INFORMATION: E3\14.7K

<400> SEQUENCE: 20

gacaa atg gag tca aga aac cca ttt cag cag gga tta cca gct gga ttt 50

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe

1 5 10 15

ctt agc agt agc ttt gtg gag aac atg gaa gtg cca gcg cct gaa tgc 98

Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys

20 25 30

aat ctc cgg cta ctt gcc ggt aca gcc gct aga cac tct gag gat cct 146

Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro

35 40 45

gaa tct cca gga gag tcc cag ggc acg cca acg tcg cca gca gca gca 194

Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala

50 55 60

gca gga gga tca aga aga gaa ccc gag agc cgg cct gga ccc tcc ggc 242

Ala Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly

65 70 75

gga gga gga gga gta gct gac ctg ttt cct gaa ctg cgc cgg gtg ctg 290

Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu

80 85 90 95

act agg tct tcg agt ggt cgg gag agg ggg att aag cgg gag agg cat 338

Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His

100 105 110

gat gag act aat cat aga aat gaa ctg act gtc agt ttg atg agt cgc 386

Asp Glu Thr Asn His Arg Asn Glu Leu Thr Val Ser Leu Met Ser Arg

115 120 125

aag cgc cca gaa aca gtg tgg tgg cat gag gtg cag tcg act ggc aca 434

Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr

130 135 140

gat gag gtg tca gtg atg cat gag aga ttt tcc cta gaa caa gtc aag 482

Asp Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gln Val Lys

145 150 155

act tgt tgg ttg gag cct gag gat gat tgg gag gta gcc atc agg aat 530

Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn

160 165 170 175

tat gcc aag ctg gct ctc agg cca gac aag aag tac aag att act aag 578

Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys

180 185 190

ctg ata aat atc aga aat gcc tgc tac att tca ggg aat ggg gct gaa 626

Leu Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu

195 200 205

gta gag atc tgt ctc cag gaa agg gtg gct ttc aga tgc tgc atg atg 674

Val Glu Ile Cys Leu Gln Glu Arg Val Ala Phe Arg Cys Cys Met Met

210 215 220

aat atg tac ccg gga gtg gtg ggc atg gat ggg gtc acc ttt atg aac 722

Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn

225 230 235

atg agg ttc agg gga gat ggg tat aat ggc acg gtc ttt atg gcc aat 770

Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn

240 245 250 255

acc aag ctg aca gtt cat ggc tgc tcc ttc ttt ggc ttt aat aac acc 818

Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr

260 265 270

tgc atc gag gcc tgg ggt cag gtt ggt gtg agg ggc tgc agt ttt tca 866

Cys Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser

275 280 285

gcc aac tgg atg ggg atc gtg ggc agg acc aag agc gtg ctg tcc gtg 914

Ala Asn Trp Met Gly Ile Val Gly Arg Thr Lys Ser Val Leu Ser Val

290 295 300

aag aaa tgc ttg ttt gag agg tgc cac ctg ggg gtg atg agc gag ggt 962

Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly

305 310 315

gaa gcc aga atc cgc cac tgc gcc tct acc gag acg ggc tgc ttt gtg 1010

Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val

320 325 330 335

ctg tgc aag ggc aat gct aag atc aag cat aat atg atc tgt gga gcc 1058

Leu Cys Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala

340 345 350

tcg gac gag cgc gga tac cag atg ctg acc tgc gcc ggt ggg aac agc 1106

Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn Ser

355 360 365

cat atg ctg gcc acc gtg cat gtg gct tcc cat tcc cgc aag ccc tgg 1154

His Met Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro Trp

370 375 380

ccc gag ttc gag cac aat gtc atg acc agg tgc aat atg cat ctg ggg 1202

Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly

385 390 395

tcc cgc cga ggc atg ttc atg ccc tac cag tgc aac ctg aat tat gtg 1250

Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val

400 405 410 415

aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg acg ggg 1298

Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly

420 425 430

gtg ttt gac atg aat gtg gag gtg tgg aag att ctg aga tat gat gaa 1346

Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu

435 440 445

tcc aag acc agg tgc cga gcc tgc gag tgc gga ggg aag cat gcc agg 1394

Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg

450 455 460

ttc cag ccc gtg tgt gtg gat gtg acg gag gac ctg cga ccc gat cat 1442

Phe Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His

465 470 475

ttg gtg ttg tcc tgc acc ggg acg gag ttc ggt tcc agc ggg gaa gaa 1490

Leu Val Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu

480 485 490 495

tct gac tagagtgagt agtgttctgg ggcggtggag ggcatgcatg agggccagaa 1546

Ser Asp

tgactgaaat ctgtgctttt ctgtgtgttg cagcagcatg agcggaagcg gctcctttga 1606

gggaggggta ttcagccctt atctgacggg gcgtctccct tcctgggcgg gagtgcgtca 1666

gaatgtgatg ggatctacgg tggacggccg gcccgtgcag cccgcgaact cttcaaccct 1726

gacctatgca accctgagct cttcgtcggt ggacgcagct gccgccgcag ctgctgcatc 1786

tgccgccagc gccgtgcgcg gaatggccat gggcgccggc tattacggca ctctggtggc 1846

caactcgagt tccaccaata atcccgccag cctgaacgag gagaagctgc tgctgctgat 1906

ggcccagctc gaggccttga cccagcgcct gggcgagctg acccagcagg tggctcagct 1966

gcaggagcag acgcgggccg cggttgccac ggtgaaatcc aaataaaaaa tgaatcaata 2026

aataaacgga gacggttgtt gattttaaca cagagtctga atctttattt gatttttcgc 2086

gcgcggtagg ccctggacca ccggtctcga tcattgagca cccggtggat cttttccagg 2146

acccggtaga ggtgggcttg gatgttgagg tacatgggca tgagcccgtc ccgggggtgg 2206

aggtagctcc attgcagggc ctcgtgctcg ggggtggtgt tgtaaatcac ccagtcatag 2266

caggggcgca gggcatggtg ttgcacaata tctttgagga ggagactgat ggccacgggc 2326

agccctttag tgtaggtgtt tacaaatctg ttgagctggg agggatgcat gcggggggag 2386

atgaggtgca tcttggcctg gatcttgaga ttggcgatgt taccgcccag atcccgcctg 2446

gggttcatgt tgtgcaggac caccagcacg gtgtatccgg tgcacttggg gaatttatca 2506

tgcaacttgg aagggaaggc gtgaaagaat ttggcgacgc ccttgtgccc gcccaggttt 2566

tccatgcatt catccatgat gatggcgatg ggcccgtggg cggcggcctg ggcaaagacg 2626

tttcgggggt cggacacatc atagttgtgg tcctgggtga gatcatcata ggccatttta 2686

atgaatttgg ggcggagggt gccggactgg gggacaaagg taccctcgat cccgggggcg 2746

tagttcccct cacagatctg catctcccag gctttgagct cggagggggg gatcatgtcc 2806

acctgcgggg ctataaagaa cacggtttcc ggggcggggg agatgagctg ggccgaaagc 2866

aagttccgga gcagctggga cttgccgcag ccggtggggc cgtagatgac cccgatgacc 2926

ggctgcaggt ggtagttgag ggagagacag ctgccgtcct cccggaggag gggggccacc 2986

tcgttcatca tctcgcgcac atgcatgttc tcgcgcacca gttccgccag gaggcgctct 3046

ccccccaggg ataggagctc ctggagcgag gcgaagtttt tcagcggctt gagtccgtcg 3106

gccatgggca ttttggagag ggtctgttgc aagagttcca agcggtccca gagctcggtg 3166

atgtgctcta cggcatctcg atccagcaga cctcctcgtt tcgcgggttg ggacgactgc 3226

gggagtaggg caccagacga tgggcgtcca gcgcagccag ggtccggtcc ttccagggcc 3286

gcagcgtccg cgtcagggtg gtctccgtca cggtgaaggg gtgcgcgccg ggctgggcgc 3346

ttgcgagggt gcgcttcagg ctcatccggc tggttgaaaa ccgctcccga tcggcgccct 3406

gcgcgtcggc caggtagcaa ttgaccatga gttcgtagtt gagcgcctcg gccgcgtggc 3466

ctttggcgcg gagcttacct ttggatgtct gcccacaggc gggacagagg agggacttga 3526

gggcgtagag cttgggggca aggaagacgg actcgggggc gtaggcatcc gcgccgcagt 3586

gggcgcagac ggtctcgcac tccacgagcc aggtgaggtc gggctggtcg gggtcaaaaa 3646

ccagtttccc gccgttcttt ttgatgcgtt tcttaccttt ggtctccatg agctcgtgtc 3706

cccgctgggt gacaaagagg ctgtccgtgt ccccgtagac cgactttatg ggccggtcct 3766

cgagcggtgt gccgcggtcc tcctcgtaga ggaaccccgc ccactccgaa acgaaagccc 3826

gggtccaggc cagcacgaag gaggccacgt gggacgggta gcggtcgttg tccaccagcg 3886

ggtccacctt ctccagggta tgcaaacaca tgtccccctc gtccacatcc aggaaggtga 3946

ttggcttgta agtgtaggcc acgtgaccgg gggtcccggc cgggggggta taaaaggggg 4006

cgggcccctg ctcgtcctca ctgtcttccg gatcgctgtc caggagcgcc agctgttggg 4066

gtaggtattc cctctcgaag gcgggcatga cctcggcact caggttgtca gtttctagaa 4126

acgaggagga tttgatattg acggtgccgg cggagatgcc tttcaagagt ccctcgtcca 4186

tctggtcaga aaagacgatc tttttattgt cgagtttggt ggcgaaggag ccgtagaggg 4246

cgttggaaag gagcttggcg atggagcgca tggtctggtt cttttccttg tcggcgcgct 4306

ccttggccgc gatgttgagt tgcacgtact cgcgcgccac gcacttccat tcggggaaga 4366

cggtggtcag ctcgtcgggc acgattctga cctgccagcc ccgattatgc agggtgatga 4426

ggtccacact ggtggccacc tcgccgcgca ggggttcatt agtccagcag aggcgcccgc 4486

ccttgcgcga gcagaagggg ggcagggggt cgagcatgac ctcgtcgggg gggtcggcat 4546

cgatggtgaa gatgccgggc aggaggtcgg gatcaaagta gctgatagaa gtggccagat 4606

cgtccaggga agcttgccat tcgcgcacgg ccagcgcgcg ctcgtaggga ctgaggggca 4666

tgccccaggg catggggtgg gtgagcgcgg aggcgtacat gccgcagatg tcgtagacgt 4726

agaggggctc ctcgaggatg ccgatgtagg tggggtagca gcgccccccg cggatgctgg 4786

cgcgcacgta gtcatacagc tcgtgcgagg gtgcgaggag ccccgggccc aggttggtgc 4846

ggctgggctt ttcggcgcgg tagacgatct ggcggaaaat ggcatgcgag ttggaggaga 4906

tggtgggcct ttggaagatg ttgaagtggg cgtggggcag tccgaccgag tcgcggatga 4966

agtgggcgta ggagtcttgc agcttggcga cgagctcggc ggtgacgagg acgtccagag 5026

cgcagtagtc gagggtctcc tggatgatgt catacttgag ctgacccttt tgtttccaca 5086

gctcgcggtt gagaaggaac tcttcgcggt ccttccagta ctcttcgagg gggaacccgt 5146

cctgatcggc acggtaagag cctagcatgt agaactggtt gacggccttg taggcgcagc 5206

agcccttctc cacggggagg gcgtaggcct gggcggcttt gcgcagggag gtgtgcgtga 5266

gggcgaaagt gtccctgacc atgactttga ggaactggtg cttgaagtcg atatcgtcgc 5326

agcccccctg ctcccagagt tggaagtccg tgcgcttctt gtaggcgggg ttgggcaaag 5386

cgaaagtaac atcgttgaag aggatcttgc ccgcgcgggg cataaagttg cgagtgatgc 5446

ggaaaggttg gggcacctcg gcccggttgt tgatgacctg ggcggcgagc acgatctcgt 5506

cgaagccgtt gatgttgtgg cccacgatgt agagttccac gaaccgtggg cggcccttga 5566

cgtggggcag cttcttgagc tcttcgtagg tgagctcgtc ggggtcgctg agcccgtgct 5626

gctcgagcgc ccagtcggcg agatgggggt tggcgcggag gaaagaagtc cagagatcca 5686

cggccagggc ggtttgcagg cggtcccggt actgacggaa ctgctgcccg acggccattt 5746

tttcgggggt gacgcagtag aaggtgcggg ggtccccgtg ccagcgatcc catttgagct 5806

ggagggcgag atcgagggcg agctcgacga gccggtcgtc cccggagagt ttcatgacca 5866

gcatgaaggg gacgagctgc ttgccgaagg accccatcca ggtgtaggtt tccacatcgt 5926

aggtgaggaa gagcctttcg gtgcgaggat gcgagccgat ggggaagaac tggatctcct 5986

gccaccaatt ggaggaatgg ctgttgatgt gatggaagta gaaatgccga cggcgcgccg 6046

aacactcgtg cttgtgttta tacaagcggc cacagtgctc gcaacgctgc acgggatgca 6106

cgtgctgcac gagctgtacc tgagttcctt tgacgaggaa tttcagtggg aagtggagtc 6166

gtggcgcctg catctcgtgc tgtactacgt cgtggtggtc ggcctggccc tcttctgcct 6226

cgatggtggt catgctgacg agcccgcgcg ggaggcaggt ccagacctcg gcgcgagcgg 6286

gtcggagagc gaggacgagg gcgcgcagcc cggagctgtc cagggtcctg agacgctgcg 6346

gagtcaggtc agtgggcagc ggcggcgcgc ggttgacttg caggagtttt tccagggcgc 6406

gcgggaggtc cagatggtac ttgatctcca ccgcgccgtt ggtggcgacg tcgatggctt 6466

gcagggtccc gtgcccctgg ggagtgacca ccgtcccccg tttcttcttg ggcggctggg 6526

gcgacggggg cggtgtttcc atctcggtta gaagcggcgg cgaggacgcg cgccgggcgg 6586

cagaggcggc tcggggcccg gaggcagggg cggcaggggc acgtcggcgc cgcgcgcggg 6646

taggttctgg tactgcgccc ggagaagact ggcgtgagcg acgacgcgac ggttgacgtc 6706

ctggatctga cgcctctggg tgaaggccac gggacccgtg agtttgaacc tgaaagagag 6766

ttcgacagaa tcaatctcgg tatcgttgac ggcggcctgc cgcaggatct cttgcacgtc 6826

gcccgagttg tcctggtagg cgatctcggt catgaactgc tcgatctcct cctcctgaag 6886

gtctccgcgg ccggcgcgct ccacggtggc cgcgaggtcg ttggagatgc ggcccatgag 6946

ctgcgagaag gcgttcatgc ccgcctcgtt ccagacgcgg ctgtagacca cgacgccctc 7006

gggatcgcgg gcgcgcatga ccacctgggc gaggttgagc tccacgtggc gcgtgaagac 7066

cgcgtagttg cagaggcgct ggtagaggta gttgagcgtg gtggcgatgt gctcggtgac 7126

gaagaaatac atgatccagc ggcggagcgg catctcgctg acgtcgccca gcgcctccaa 7186

gcgttccatg gcctcgtaaa agtccacggc gaagttgaaa aactgggagt tacgcgccga 7246

gacggtcaac tcctcctcca gaagacggat gagctcggcg atggtggcgc gcacctcgcg 7306

ctcgaaggcc cccgggagtt cctcctcttc catctcttct tcctcctcca ctaacatctc 7366

ttctacttcc tcctcaggcg gcagtggtgg cgggggaggg ggcctgcgtc gccggcggcg 7426

cacgggcaga cggtcgatga agcgctcgat ggtctcgccg cgccggcgtc gcatggtctc 7486

ggtgacggcg cgcccgtcct cgcggggccg cagcgtgaag acgccgccgc gcatctccag 7546

gtggccgggg gggtccccgt tgggcaggga gagggcgctg acgatgcatc ttatcaattg 7606

ccccgtaggg actccgcgca aggacctgag cgtctcgaga tccacgggat ctgaaaaccg 7666

ttgaacgaag gcttcgagcc agtcgcagtc gcaaggtagg ctgagcacgg tttcttctgc 7726

cgggtcatgt tggggagcgg ggcgggcgat gctgctggtg atgaagttga aataggcggt 7786

tctgagacgg cggatggtgg cgaggagcac caggtctttg ggcccggctt gctggatgcg 7846

cagacggtcg gccatgcccc aggcgtggtc ctgacacctg gccaggtcct tgtagtagtc 7906

ctgcatgagc cgctccacgg gcacctcctc ctcgcccgcg cggccgtgca tgcgcgtgag 7966

cccgaagccg cgctggggct ggacgagcgc caggtcggcg acgacgcgct cggcgaggat 8026

ggcctgctgg atctgggtga gggtggtctg gaagtcgtca aagtcgacga agcggtggta 8086

ggctccggtg ttgatggtgt aggagcagtt ggccatgacg gaccagttga cggtctggtg 8146

gcccggacgc acgagctcgt ggtacttgag gcgcgagtag gcgcgcgtgt cgaagatgta 8206

gtcgttgcag gtgcgcacca ggtactggta gccgatgagg aagtgcggcg gcggctggcg 8266

gtagagcggc catcgctcgg tggcgggggc gccgggcgcg aggtcctcga gcatggtgcg 8326

gtggtagccg tagatgtacc tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg 8386

cgggaactcg cggacgcggt tccagatgtt gcgcagcggc aggaagtagt tcatggtggg 8446

cacggtctgg cccgtgaggc gcgcgcagtc gtggatgctc tatacgggca aaaacgaaag 8506

cggtcagcgg ctcgactccg tggcctggag gctaagcgaa cgggttgggc tgcgcgtgta 8566

ccccggttcg aatctcgaat caggctggag ccgcagctaa cgtggtactg gcactcccgt 8626

ctcgacccaa gcctgcacca accctccagg atacggaggc gggtcgtttt gcaacttttt 8686

ggaggccgga aatgagacta gtaagcgcgg aaagcggccg accgcgatgg ctcgctgccg 8746

tagtctggag aagaatcgcc agggttgcgt tgcggtgtgc cccggttcga ggccggccgg 8806

attccgcggc taacgagggc gtggctgccc cgtcgtttcc aagaccccta gccagccgac 8866

ttctccagtt acggagcgag cccctctttt gttttgtttg tttttgccag atgcatcccg 8926

tactgcggca gatgcgcccc caccaccctc caccgcaaca acagccccct ccacagccgg 8986

cgcttctgcc cccgccccag cagcagcagc aacttccagc cacgaccgcc gcggccgccg 9046

tgagcgggac tggacagact tctcagtatg accagctggc cttggaagag ggcgaggggc 9106

tggcgcgcct gggggcgtcg tcgccggagc ggcacccgcg cgtgcagatg aaaagggacg 9166

ctcgcgaggc ctacgtgccc aagcagaacc tgttcagaga caggagcggc gaggagcccg 9226

aggagatgcg cgcggcccgg ttccacgcgg ggcgggagct gcggcgcggc ctggacagaa 9286

agagggtgct gagggacgag gatttcgagg cggacgagct gacggggatc agccccgcgc 9346

gtgcgcacgt ggccgcggcc aacctggtca cggcgtacga gcagaccgtg aaggaggaga 9406

gcaacttcca aaaatccttc aacaaccacg tgcgcaccct gatcgcgcgc gaggaggtga 9466

ccctgggcct gatgcacctg tgggacctgc tggaggccat cgtgcagaac cccaccagca 9526

agccgctgac ggcgcagctg ttcctggtgg tgcaacatag tcgggacaac gaggcgttca 9586

gggaggcgct gctgaatatc accgagcccg agggccgctg gctcctggac ctggtgaaca 9646

ttctgcagag catcgtggtg caggagcgcg ggctgccgct gtccgagaag ctggcggcca 9706

tcaacttctc ggtgctgagt ctgggcaagt actacgctag gaagatctac aagaccccgt 9766

acgtgcccat agacaaggag gtgaagatcg acgggtttta catgcgcatg accctgaaag 9826

tgctgaccct gagcgacgat ctgggggtgt accgcaacga caggatgcac cgcgcggtga 9886

gcgccagcag gcggcgcgag ctgagcgacc aggagctgat gcacagcctg cagcgggccc 9946

tgaccggggc cgggaccgag ggggagagct actttgacat gggcgcggac ctgcactggc 10006

agcccagccg ccgggccttg gaggcggcgg caggacccta cgtagaagag gtggacgatg 10066

aggtggacga ggagggcgag tacctggaag actgatggcg cgaccgtatt tttgctagat 10126

gcaacaacag ccacctcctg atcccgcgat gcgggcggcg ctgcagagcc agccgtccgg 10186

cattaactcc tcggacgatt ggacccaggc catgcaacgc atcatggcgc tgacgacccg 10246

caaccccgaa gcctttagac agcagcccca ggccaaccgg ctctcggcca tcctggaggc 10306

cgtggtgccc tcgcgctcca accccacgca cgagaaggtc ctggccatcg tgaacgcgct 10366

ggtggagaac aaggccatcc gcggcgacga ggccggcctg gtgtacaacg cgctgctgga 10426

gcgcgtggcc cgctacaaca gcaccaacgt gcagaccaac ctggaccgca tggtgaccga 10486

cgtgcgcgag gccgtggccc agcgcgagcg gttccaccgc gagtccaacc tgggatccct 10546

ggtggcgctg aacgccttcc tcagcaccca gcccgccaac gtgccccggg gccaggagga 10606

ctacaccaac ttcatcagcg ccctgcgcct gatggtgacc gaggtgcccc agagcgaggt 10666

gtaccagtcc gggccggact acttcttcca gaccagtcgc cagggcttgc agaccgtgaa 10726

cctgagccag gcgttcaaga acttgcaggg cctctggggc gtgcaggccc cggtcgggga 10786

ccgcgcgacg gtgtcgagcc tgctgacgcc gaactcgcgc ctgctgttgc tgctggtggc 10846

tcccttcacg gacagcggca gcatcaaccg caactcgtac ctgggctacc tgattaacct 10906

gtaccgcgag gccatcggcc aggcgcacgt ggacgagcag acctaccagg agatcaccca 10966

cgtgagccgc gccctgggcc aggacgaccc gggaaatctg gaagccaccc tgaacttttt 11026

gctgaccaat cggtcgcaga agatcccgcc ccagtacgcg ctcagcgccg aggaggagcg 11086

catcctgcga tacgtgcagc agagcgtggg cctgttcctg atgcaggagg gggccacccc 11146

cagcgccgcg ctcgacatga ccgcgcgcaa catggagccc agcatgtacg ccagcaaccg 11206

cccgttcatc aataaactga tggactactt gcatcgggca gccgccatga actctgacta 11266

tttcaccaac gccatcctga atccccactg gcttccgccg ccggggttct acacgggcga 11326

gtacgacatg cccgacccca atgacgggtt cctgtgggac gatgtggaca gcagcgtgtt 11386

ctccccccga ccgggtgcta acgagcgccc cttgtggaag aaggaaggca gcgaccgacg 11446

cccgtcctcg gcgctgtccg gccgcgaggg tgctgccgcg gcggtgcccg aggccgccag 11506

tcctttcccg agcttgccct tctcgctgaa cagtatccgc agcagcgagc tgggcaggat 11566

cacgcgcccg cgcttgctgg gcgaggagga gtacttgaat gactccctgc tgagacccga 11626

gcgggagaag aacttcccca ataacgggat agagagcctg gtggacaaga tgagccgatg 11686

gaagacgtat gcgcaggagc acagggacga tccccgggcg tcgcaggggg ccacgagccg 11746

gggcagcgcc gcccgtaaac gccggtggca cgacaggcag cggggactga tgtgggacga 11806

tgaggattcc gccgacgaca gcagcgtgtt ggacttgggt gggagtggtg gtaacccgtt 11866

cgctcacctg cgcccccgca tcgggcgcat gatgtaagaa accgaaaata aatgatactc 11926

accaaggcca tagcgaccag cgtgcgttcg tttcttctct gttgttgttg tatctagtat 11986

gatgaggcgt gcgtacccgg agggtcctcc tccctcgtac gagagcgtga tgcagcaggc 12046

gatggcggcg gcgatgcagc ccccgctgga ggctccttac gtgcccccgc ggtacctggc 12106

gcctacggag gggcggaaca gcattcgtta ctcggagctg gcacccttgt acgataccac 12166

ccggttgtac ctggtggaca acaagtcggc ggacatcgcc tcgctgaact accagaacga 12226

ccacagcaac ttcctgacca ccgtggtgca gaacaatgac ttcaccccca cggaggccag 12286

cacccagacc atcaactttg acgagcgctc gcggtggggc ggccagctga aaaccatcat 12346

gcacaccaac atgcccaacg tgaacgagtt catgtacagc aacaagttca aggcgcgggt 12406

gatggtctcc cgcaagaccc ccaacggggt caaagtagat gacacgtatg atggtagtca 12466

ggatatcttg gagtataagt gggtggagtt tgagctgccc gaaggcaact tctcggtgac 12526

catgaccatc gacctgatga acaacgccat catcgacaat tacttggcgg tggggcggca 12586

gaacggggtc ctggagagcg atatcggcgt gaagttcgac actaggaact tcaggctggg 12646

ctgggacccc gtgaccgagc tggtcatgcc aggggtgtac accaacgagg ccttccaccc 12706

cgatattgtc ttgctgcccg gctgcggggt ggacttcacc gagagccgcc tcagcaacct 12766

gctgggcatt cgcaagaggc agcctttcca ggaaggtttc cagatcatgt acgatgatct 12826

ggaggggggc aacatccccg cgctcctgga tgtcgaggcc tatgaggaaa gcaaggaaaa 12886

agcagaagcc gaggcgactg cagccgtggc tactgccgcg gccaatgcag atgccaatgt 12946

gactagaggc gatacattcg ccactcaggc ggaggaagca gccgccctag cggtcgccga 13006

tgatagtgaa agtaagatag ttatcaaacc agtaaaagtg gatagcaaga acagaagcta 13066

caacgtgctg ccggacgagg taaacaccgc ctaccgcagt tggtacctgg cctacaacta 13126

tggcgacccc gagaagggcg tgcgctcctg gacgctgctc accacctcgg acgtcacctg 13186

cggcgtggag caagtctact ggtcgctgcc cgacatgatg caagacccgg tcaccttccg 13246

ctccacgcgt caagttagca actacccggt ggtgggcgcc gagctcctgc ccgtctactc 13306

caagagcttc ttcaacgagc aggccgtcta ctcgcagcag ctgcgcgcct tcacctcgct 13366

cacgcacgtc ttcaaccgct tccccgagaa ccagatcctc gtccgcccgc ccgcgcccac 13426

cattaccacc gtcagtgaaa acgttcctgc tctcacagat cacgggaccc tgccgctgcg 13486

cagcagtatc cggggagtcc agcgcgtgac cgttactgac gccagacgcc gcacctgccc 13546

ctacgtctac aaggccctgg gcatagtcgc gccgcgcgtc ctctcgagcc gcaccttcta 13606

aaaaatgtcc attctcatct cgcccagtaa taacaccggt tggggcctgc gcgcgcccag 13666

caagatgtac ggaggcgctc gccaacgctc cacgcaacac cccgtgcgcg tgcgcgggca 13726

cttccgcgct ccctggggcg ccctcaaggg ccgcgtgcgg tcgcgcacca ccgtcgacga 13786

cgtgatcgac caggtggtgg ccgacgcgcg caactacacc cccgccgccg cgcccgtctc 13846

caccgtggac gccgtcatcg acagcgtggt ggccgacgcg cgccggtacg cccgcgccaa 13906

gagccggcgg cggcgcatcg cccggcggca ccggagcacc cccgccatgc gcgcggcgcg 13966

agccttgctg cgcagggcca ggcgcacggg acgcagggcc atgctcaggg cggccagacg 14026

cgcggcttca ggcgccagcg ccggcaggac ccggagacgc gcggccacgg cggcggcagc 14086

ggccatcgcc agcatgtccc gcccgcggcg agggaacgtg tactgggtgc gcgacgccgc 14146

caccggtgtg cgcgtgcccg tgcgcacccg cccccctcgc acttgaagat gttgacttcg 14206

cgatgttgat gtgtcccagc ggcgaggatg tccaagcgca aattcaagga agagatgctc 14266

caggtcatcg cgcctgagat ctacggcccc gcggcggtgg tgaaggagga aagaaagccc 14326

cgcaaaatca agcgggtcaa aaaggacaaa aaggaagaag atgacgatct ggtggagttt 14386

gtgcgcgagt tcgccccccg gcggcgcgtg cagtggcgcg ggcggaaggt gcaaccggtg 14446

ctgagacccg gcaccaccgt ggttttcaca cccggcgagc gctccggcac cgcttccaag 14506

cgctcctacg acgaggtgta cggggatgat gatatcctgg agcaggcggc cgagcgcctg 14566

ggcgagtttg cttacggcaa gcgcagccgc cccgcggcgc ccccagaagc ggtgtccatc 14626

ccgctggacc acggcaaccc cacgccgagc ctcaagcccg tgaccctgca gcaggtgctg 14686

ccgagcgcgg cgccgcgccg gggcttcaag cgcgagggcg aggatctgta ccccaccatg 14746

cagctgatgg tgcccaagcg ccagaagctg gaagacgtgc tggagaccat gaaggtggac 14806

cccgaggtgc agcccgaggt caaggtgcgg cccatcaagc aggtggcccc gggcctgggc 14866

gtgcagaccg tggacatcaa gatccccacg gagcccatgg aaacgcagac cgagcccgtg 14926

aagcccagca ccagcaccat ggaggtgcag acggatccct ggatgccggc ggccgctcct 14986

agaccccggc gcaagtacgg cgcggccagc ctgctgatgc ccaactacgc gctgcatcct 15046

tccatcatcc ccacgccggg ctaccgcggc acgcgcttct accgcgggca gaccacccgc 15106

cgccgccgca agaccactcg ccgccgccgt cgccgcacag ccgctgcaac ttccgccgcc 15166

gccctggtgc ggagagtgta ccgccgcggc cgcgcgcctc tgaccctgcc gcgcgcgcgc 15226

taccacccga gcatcgccat ttaaacactt tcgcgcctgc tttgcagatg gccctcactt 15286

gccgcattcg cgttcccatt gcgggctacc gaggaagaaa accgcgccgt agaaggctgg 15346

cggggagcgg gatgcgtcgc cacccccacc ggcggcggcg cgccatcagc aagcggttgg 15406

ggggaggctt cctgcccgcg ctgatcccca tcatcgccgc ggcgatcggg gcgatccccg 15466

gcatcgcttc cgtggcggta caggcctctc agcgccactg agacacttgg aacatcttgt 15526

aataaaaaac caatggactc tgacgctcct ggtcctgtga tgtgttttcg tagacagatg 15586

gaagacatca atttttcgtc cctggctccg cgacacggca cgcggccgtt catgggcacc 15646

tggagcgaca tcggcaacag ccatctgaac gggggcgcct tcaattggag cagtctctgg 15706

agcgggctta agaatttcgg gtccacgctt aaaacctatg gccacaaggc gtggaacagc 15766

aacacagggc aggcgctgag ggaaaagctg aaagagcaga acttccagca gaaggtggtc 15826

gatggcctcg cctcgggcat caacggggtg gtggacctgg ccaaccaggc cgtgcagcgg 15886

cagatcaaca gccgactgga gccggtgccg cccgccggct ccgtggagat gccgcaggtg 15946

gaggaggagc tgcctcccct ggacaagcgg ggcgagaagc gaccccgccc cgacgcggag 16006

gagacgctgc tgacgcacac ggacgagccg cccccgtacg aggaggcggt gaaactgggc 16066

ctgcccacca cgcggcccat cgcgcccctg gccaccgggg tgctgaaacc cgagtctaag 16126

cccgcgaccc tggacttgcc tcctcccccg acatcccgcc cctccacagt ggctaagccc 16186

ctgccgccgg tggcccgcgc gcgacccggg agccgcccgc aggcgaactg gcagagcact 16246

ctgaacagca tcgtgggtct gggagtgcag agtgtgaagc gccgccgctg ctattaaaca 16306

acataccgta gcgcttaact tgcttgtctg tgtgtgtatg tattatgtcg ccgccgctgt 16366

cgccagaagg aggagtgaag aggcgcgtcg ccgagttgca agatggccac cccatcgatg 16426

ctgccccagt gggcgtacat gcacatcgcc ggacaggacg cttcggagta cctgagtccg 16486

ggtctggtgc agttcgcccg cgccacagac acctacttca gtctggggaa caagtttagg 16546

aaccccacgg tggcgcccac gcacgatgtg accaccgacc gcagccagcg gctgacgctg 16606

cgcttcgtgc ccgtggaccg cgaggacaac acctactcgt acaaagtgcg ctacacgctg 16666

gccgtgggcg acaaccgcgt gctggacatg gccagcacct actttgacat ccgcggcgtg 16726

ctggaccggg gccctagctt caaaccctac tccggcaccg cctacaacag cctggccccc 16786

aagggagctc ccaattccag ccagtgggag caagcaaaaa caggcaatgg gggaactatg 16846

gaaacacaca catatggtgt ggccccaatg ggcggagaga atattacaaa agatggtctt 16906

caaattggaa cagatgctac agcggatcag aataaaccaa tttatgctga caaaacattc 16966

cagccagaac cccaagtagg agaagaaaat tggcaagaaa ctgaaagctt ttatggtggt 17026

agggctctta aaaaagatac aaacatgaaa ccttgctacg gctcctatgc tagacccact 17086

aatgaaaagg gaggacaagc taaacttaaa gttggagatg atggagttcc aaccaaagaa 17146

tttgatatag acctggcttt ctttgatact cccggtggta ctgtgaatgg ccaagatgag 17206

tataaagcag acattgtcat gtataccgaa aacgcgtatc tggaaactcc agacacacat 17266

gtggtataca aaccaggcaa ggaagatgca agttctgaaa ttaacctggt tcagcagtcc 17326

atgcccaaca ggcccaacta cattgggttc agagacaact ttattggtct catgtactac 17386

aacagcactg gcaacatggg ggtgctggct ggtcaggcct cacagttgaa tgctgtggtt 17446

gacttgcaag acagaaacac tgaactgtcc taccagctct tgcttgactc tctgggtgac 17506

agaaccaggt atttcagtat gtggaatcag gcggtggata gctatgaccc tgatgtgcgc 17566

atcattgaaa accatggtgt ggaggatgaa ttgccaaact attgcttccc cttggatggg 17626

tccggcacta acgccgctta ccaaggtgta aaagtaaaaa atggtgaaga tggtgatctt 17686

gagagcgagt gggaaaatga tgatactgtg gcagctcgaa atcaattatg taaaggcaac 17746

atttttgcca tggaaatcaa cctccaagcc aacctgtgga ggagtttcct ctactcgaac 17806

gtggccctgt acctgcccga ttcttacaag tacacgccgg ccaacatcac tctgcccacc 17866

aacaccaaca cctacgatta catgaatggg agagtggtgc ctccctcgct ggtggacgcc 17926

tacatcaaca tcggggcgcg ctggtcgctg gaccccatgg acaacgtgaa ccccttcaac 17986

caccaccgca acgcgggcct gcgctaccgc tccatgctcc tgggcaacgg gcgctacgtg 18046

cccttccaca tccaggtgcc ccagaagttc ttcgccatca agagcctcct gctcctgccc 18106

gggtcctaca cctacgagtg gaacttccgc aaggacgtca acatgatcct gcagagctcc 18166

ctcggcaacg acctgcgcac ggacggggcc tccatccagt tcaccagcat caacctctac 18226

gccaccttct tccccatggc gcacaacacc gcctccacgc tcgaggccat gctgcgcaac 18286

gacaccaacg accagtcctt caacgactac ctctcggcgg ccaacatgct ctaccccatc 18346

ccggctaacg ccaccaacgt gcccatctcc atcccctcgc gcaactgggc cgccttccgc 18406

ggctggtcct tcacgcgcct caagacccgc gagacgccct cgctgggctc cgggttcgac 18466

ccctacttcg tctactcggg ctccatcccc tacctcgacg gcaccttcta cctcaaccac 18526

accttcaaga aggtctccat caccttcgac tcctccgtca gctggcccgg caacgaccgg 18586

ctcctgacgc ccaacgagtt cgaaatcaag cgcaccgtcg acggcgaggg gtacaacgtg 18646

gcccagtgca acatgaccaa ggactggttc ctggtccaga tgctggccca ctacaacatc 18706

ggctaccagg gcttccacgt gcccgagggc tacaaggacc gcatgtactc cttcttccgc 18766

aacttccagc ccatgagccg ccaggtggtc gacgaggtca actacaagga ctaccaggcc 18826

gtcaccctgg cctaccagca caacaactcg ggcttcgtcg gctacctcgc gcccaccatg 18886

cgccagggcc agccctaccc cgccaactac ccctacccgc tcattggcaa gagcgccgtc 18946

accagcgtca cccagaaaaa gttcctctgc gaccgcgtca tgtggcgcat ccccttctcc 19006

agcaacttca tgtccatggg cgcgctcacc gacctcggcc agaacatgct ctatgccaac 19066

tccgcccacg cgctagacat gaatttcgaa gtcgacccca tggatgagtc cacccttctc 19126

tatgttgtct tcgaagtctt cgacgtcgtc agagtgcacc agccccaccg cggcgtcatc 19186

gaggccgtct acctgcgcac ccccttctcg gccggtaacg ccaccaccta agctcttgct 19246

tcttgcaagc agctatggcc tgcgcgggct ccggcgagca ggagctcagg gccatcatcc 19306

gcgacctggg ctgcgggccc tacttcctgg gcaccttcga taagcgcttc ccgggattca 19366

tggccccgca caagctggcc tgcgccatcg tcaacacggc cggccgcgag accgggggcg 19426

agcactggct ggccttcgcc tggaacccgc gctcgaacac ctgctacctc ttcgacccct 19486

tcgggttctc ggacgagcgc ctcaagcaga tctaccagtt cgagtacgag ggcctgctgc 19546

gccgcagcgc cctggccacc gaggaccgct gcgtcaccct ggaaaagtcc acccagaccg 19606

tgcagggtcc gcgctcggcc gcctgcgggc tcttctgctg catgttcctg cacgccttcg 19666

tgcactggcc cgaccgcccc atggacaaga accccaccat gaacttgctg acgggggtgc 19726

ccaacggcat gctccagtcg ccccaggtgg agcccaccct gcgccgcaac caagaggcgc 19786

tctaccgctt cctcaactcc cactccgcct actttcgctc ccaccgcgcg cgcatcgaga 19846

aggccaccgc cttcgaccgc atgaatcaag acatgtaaac cgtgtgtgta tttttatctg 19906

tgtctttaat aaacagcgct tttatgtgac atgcatctct gagatgattt atttagaaat 19966

cgaaagggtt ctgccgggtc tcggcatggc ccgcgggcag ggacacgttg cggaactggt 20026

acttggccag ccacttgaac tcggggatca gcagtttcgg cagcggggtg tcggggaagg 20086

agtcggtcca cagcttccgc gtcagttgca gggcgcccag caggtcgggc gcggagatct 20146

tgaaatcgca gttgggaccc gcgttctgcg cgcgggagtt gcggtacacg gggttgcagc 20206

actggaacac catcagggcc gggtgtttca cgctcgccag caccgtcgcg tcggtgatac 20266

tctccacgtc aaggtcctcg gcgttggcca tcccgaaggg ggtcatcttg caggtctgcc 20326

ggcccatggt gggcacgcac ccgggcttgt ggttgcaatc gcagtgcagg gggatcagca 20386

tcatctgggc ctggtcggcg ttcatccccg ggtacatggc cttcatgaac gcctccagct 20446

gtctgaaggc ctgctgggcc ttggctcctt cggtgaaaaa gaccccgcag gacttgctag 20506

agaactggtt ggtagcgcag ccggcgtcgt gcacgcagca gcgcgcgtcg ttgttggcca 20566

gctgcaccac gctgcgcccc cagcggttct gggtgatctt ggcccggtcg gggttctcct 20626

tcagcgcgcg ctgcccgttc tcgctcgcca catccatctc gaccatgtgc tccttctgga 20686

tcatggtgct gccgtgcagg caccgcagct tgccctcggc ctcggtgcac ccatgcagcc 20746

acagcgcgca cccggtgcac tcccagttct tgtgggcgat ctgggaatgc gagtgcacga 20806

agccctgcag gaagcggccc atcatggtcg tcagggtctt gttgctggtg aaggtcagcg 20866

ggatgccgcg gtgctcctcg ttgatgtaca ggtggcagat gcggcggtac acctcgccct 20926

gctcgggcat cagctggaag ttggatttca gctcgctttc cacgcggtag cggtccatca 20986

gcagagccat gatttccatg cccttctccc aggccgagac gatgggcagg ctcatggggt 21046

tcttcaccat catcttggag ctagccgccg cagccagggg gtcgctctcg tccagggtct 21106

caaagctccg cttgccgtcc ttctcggtga tccgcacggg ggggtagctg aagcccacgg 21166

ccgccagctc ctcctcggcc tgtctttcgt cctcgctgtc ctggctgacg tcctgcagga 21226

ccacatgctt ggtcttgcgg ggcttcttct tgggcggcag cggcggcggc tgttcctgag 21286

gcgaggggga gcgcgagtgc tcgctcacca ccacaatttc ttcttcttgg tccgaggcca 21346

cgcggcggta ggtatgtctc ttcaggggca gaggcggagg cgacgggctc tcctcgcggc 21406

ccggcgggtg gctggcagag ccccttccgc gttcgggggt gcgctcccgg cggcgctctg 21466

actgacttcc tccgcggccg gccattgtgt tctcctaggg aacaacaaca agcatggaga 21526

ctcagccatc gtcgccaacc tcgccatctg cccccaccac cgccgacgag aagcagcagc 21586

agcagaatga aagcttaacc gccccgccgc ccagccccgc ctccgacgcg gccgcggtcc 21646

cagacatgca agagatggag gaatccatcg agattgacct gggctatgtg acgcccgcgg 21706

agcacgagga ggagctggca gtgcgctttt catcgagcca ggaagaaaaa gaacagcagc 21766

cagaacagga agcagagagc gagcagagtc aggctgggct cgagcatggc gactacctcc 21826

acctgagcgg gggggaggac gcgctcatca agcatctggc ccggcaggcc ctcatcgtca 21886

aggacgcgct gctcgaccgc atcgaggtgc ccctcagcgt ggaggagctc agccgcgcct 21946

acgagctgaa cctcttctcg ccgcgcgtgc cccccaagcg ccagcccaac ggcacctgcg 22006

agcccaaccc gcgcctcaac ttctacccgg tcttcgcggt gcccgaggcc ctggccacct 22066

accacatctt tttcaagaac caaaagatcc ccgtctcctg ccgcgccaac cgcacccgcg 22126

ccgacgccct cttcaacctg ggccccggcg cccgcctacc tgatatcgcc tccttggaag 22186

aggttcccaa gatcttcgag ggtctgggca gcgacgagac tcgggccgcg aacgctctgc 22246

aaggagaagg aggagagcat gagcaccaca gcgccctggt cgagttggaa ggcgacaacg 22306

cccgcttggc ggtgctcaag cgcacggtcg agctgaccca tttcgcctac ccggccctga 22366

acctgccccc caaagtcatg agcgcggtca tggaccaggt gctcatcaag cgcgcgtcgc 22426

ccatctccga ggacgagggc atgcaagact ccgaggaggg caagcccgtg gtcagcgacg 22486

agcagctggc ccggtggctg ggtcctaatg ctacccctca gagtttggaa gagcggcgca 22546

agctcatgat ggccgtggtc ctggtgaccg tggagctgga gtgcttgcgc cgcttcttcg 22606

ccgacgcgga gaccctgcgc aaggtcgagg agaacctgca ctacctcttc aggcacgggt 22666

tcgtgcgcca ggcctgcaag atctccaacg tggagctgac caacctggtc tcctacatgg 22726

gcatcttgca cgagaaccgc ctggggcaga acgtgctgca caccaccctg cgcggggagg 22786

cccgccgcga ctacatccgc gactgcgtct acctctacct ctgccacacc tggcagacgg 22846

gcatgggcgt gtggcagcag tgtctggagg agcagaacct gaaagagctc tgcaagctcc 22906

tgcagaagaa cctcaagggt ctgtggaccg ggttcaacga gcgaaccacc gcctcggacc 22966

tggccgacct catcttcccc gagcgcctca ggctgacgct gcgcaacggc ctgcccgact 23026

ttatgagcca aagcatgttg caaaactttc gctctttcat cctcgaacgc tccggaatcc 23086

tgcccgccac ctgctccgcg ctgccctcgg acttcgtgcc gctgaccttc cgcgagtgcc 23146

ccccgccgct gtggagccac tgctacctgc tgcgcctggc caactacctg gcctaccact 23206

cggacgtgat cgaggacgtc agcggcgagg gcctgctcga gtgccactgc cgctgcaacc 23266

tctgcacgcc gcaccgctcc ctggcttgca acccccagct gctgagcgag acccagatca 23326

tcggcacctt cgagttgcaa gggcccagcg atgagggttc agccaagggg agtctgaaac 23386

tcaccccggg gctgtggacc tcggcctact tgcgcaagtt cgtgcccgag gactaccatc 23446

ccttcgagat caggttctac gaggaccaat cccagccgcc caaggccgag ctgtcggcct 23506

gcgtcatcac ccagggggcc atcctggccc aattgcaagc catccagaaa tcccgccaag 23566

aattcttgct gaaaaagggc cgcggggtct acctcgaccc ccagaccggt gaggagctca 23626

accccggctt cccccagg atg ccc cga gga aac aag aag ctg aaa gtg gag 23677

Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu

500 505

ctg ccg ccc gtg gag gat ttg gag gaa gac tgg gag aac agc agt cag 23725

Leu Pro Pro Val Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln

510 515 520

gca gag gag gag gag atg gag gaa gac tgg gac agc act cag gca gag 23773

Ala Glu Glu Glu Glu Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu

525 530 535 540

gag gac agc ctg caa gac agt ctg gag gaa gac gag gag gag gca gag 23821

Glu Asp Ser Leu Gln Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu

545 550 555

gag gag gtg gaa gaa gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag 23869

Glu Glu Val Glu Glu Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu

560 565 570

aaa gca agc agc acg gat acc atc tcc gct ccg ggt cgg ggt ccc gct 23917

Lys Ala Ser Ser Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala

575 580 585

cga ccc cac agt aga tgg gac gag acc ggg cga ttc ccg aac ccc acc 23965

Arg Pro His Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr

590 595 600

acc cag acc ggt aag aag gag cgg cag gga tac aag tcc tgg cgg ggg 24013

Thr Gln Thr Gly Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly

605 610 615 620

cac aaa aac gcc atc gtc tcc tgc ttg cag gcc tgc ggg ggc aac atc 24061

His Lys Asn Ala Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile

625 630 635

tcc ttc acc cgg cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc 24109

Ser Phe Thr Arg Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro

640 645 650

cgc aac atc ttg cat tac tac cgt cac ctc cac agc ccc tac tac ttc 24157

Arg Asn Ile Leu His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Phe

655 660 665

caa gaa gag gca aca gca gca gaa aaa gac cag aaa acc agc agc agc 24205

Gln Glu Glu Ala Thr Ala Ala Glu Lys Asp Gln Lys Thr Ser Ser Ser

670 675 680

tagaaaatcc acagcggcag gtggactgag gatcgcagcg aacgagccgg cgcagacccg 24265

ggagctgagg aaccggatct ttcccaccct ctatgccatc ttccagcaga gtcgggggca 24325

ggagcaggaa ctgaaagtca agaaccgttc tctgcgctcg ctcacccgca gttgtctgta 24385

tcacaagagc gaagaccaac ttcagcgcac tctcgaggac gccgaggctc tcttcaacaa 24445

gtactgcgcg ctcactctta aagagtagcc cgcgcccgcc cagccgcaga aaaaggcggg 24505

aattacgtca cctgtgccct tcgccctagc cgcctccacc catcatcatg agcaaagaga 24565

ttcccacgcc ttacatgtgg agctaccagc cccagatggg cctggccgcc ggcgccgccc 24625

aggactactc cacccgcatg aattggctca gcgccgggcc cgcgatgatc tcacgggtga 24685

atgacatccg cgcccaccga aaccagatac tcctagaaca gtcagcgctc accgccacgc 24745

cccgcaatca cctcaatccg cgtaattggc ccgccgccct ggtgtaccag gaaattcccc 24805

agcccacgac cgtactactt ccgcgagacg cccaggccga agtccagctg actaactcag 24865

gtgtccagct ggcgggcggc gccaccctgt gtcgtcaccg ccccgctcag ggtataaagc 24925

ggctggtgat ccggggcaga ggcacacagc tcaacgacga ggtggtgagc tcttcgctgg 24985

gtctgcgacc tgacggagtc ttccaaatcg ccggatcggg gagatcttcc ttcacgcctc 25045

gtcaggcggt cctgaccttg gagagttcgt cctcgcagcc ccgctcgggc ggcatcggca 25105

ctctccagtt cgtggaggag ttcactccct cggtctactt caaccccttc tccggctccc 25165

ccggccacta cccggacgag ttcatcccga actttgacgc catcagcgag tcagtggacg 25225

gctacgattg aatgtcccat ggtggcgcgg ctgacctagc tcggcttcga cacctggacc 25285

actgccgccg ctttcgctgc ttcgctcggg acctcgccga gttcacctac tttgagctgc 25345

ccgaggagca tcctcagggc ccggcccacg gagtgcggat cgtcgtcgaa gggggcctag 25405

actcccacct gcttcggatc ttcagccagc gcccgatcct ggtcgagcgc caacagggca 25465

acaccctcct gaccctctac tgcatctgcg accaccccgg cctgcatgaa agtctttgtt 25525

gtctgctgtg tactgagtat aataaaagct gagatcagcg actactccgg actcaactgt 25585

ggtgtttctg catccatcaa ccggtctctg accttcaccg ggaacgagac cgagctccag 25645

ctccagtgta agccccacaa gaagtacctc acctggctgt accagggctc cccgatcgcc 25705

gttgttaacc actgcgacga cgacggagtc ctgctgaacg gccccgccaa ccttactttt 25765

tccacccgca gaagcaagct actgctcttc agacccttcc tccccggcac ctatcagtgc 25825

atctcgggac cctgccatca caccttccac ctgatcccga ataccaccac ctcttcccca 25885

gcgccgctcc ccactaacaa ccaaactaac caccaccaac gccaccgtcg agacctcgtt 25945

gaatctaata ccacccacac cggaggtgag ctccgaggtc ctgaatcctc tgggatttat 26005

tacggcccct gggaggtggt ggggttaata gctttaggct tagtagcggg tgggcttttg 26065

gctctctgct acctatacct cccttgcttt tcctacttag tggtgctttg ttgctggttt 26125

aagaa atg ggg aag atc acc cta gtg tgc ggt gtg ctg gtg acg gtg gtg 26175

Met Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val Val

685 690 695

ctt tcg att ctg gga ggg gga agc gcg gct gta gtg acg gag aag aag 26223

Leu Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu Lys Lys

700 705 710 715

gcc gat ccc tgc ttg act ttc aac ccc gat aac tgc cgg ctg agt ttt 26271

Ala Asp Pro Cys Leu Thr Phe Asn Pro Asp Asn Cys Arg Leu Ser Phe

720 725 730

cag cca gat ggc aat cgg tgt gcg gtg ttg atc aag tgc gga tgg gaa 26319

Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu

735 740 745

tgc gag agc gtg gtg atc cag tat aaa aac aaa acc tgg aac aat act 26367

Cys Glu Ser Val Val Ile Gln Tyr Lys Asn Lys Thr Trp Asn Asn Thr

750 755 760

ctc gcg tcc aca tgg cag ccc ggg gac ccc gag tgg tac acc gtc tct 26415

Leu Ala Ser Thr Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser

765 770 775

gtc cct ggt gct gac ggc tcc ctc cgc acg gtg aac aac act ttc att 26463

Val Pro Gly Ala Asp Gly Ser Leu Arg Thr Val Asn Asn Thr Phe Ile

780 785 790 795

ttt gag cac atg tgc gaa acc gcc atg ttc atg agc aag cag tac ggt 26511

Phe Glu His Met Cys Glu Thr Ala Met Phe Met Ser Lys Gln Tyr Gly

800 805 810

atg tgg ccc cca cgg aaa gag aat atc gtg gtc ttc tcc atc gct tac 26559

Met Trp Pro Pro Arg Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr

815 820 825

agc gcg tgc acg gtg cta atc acc gcg atc gtg tgc ctg agc att cac 26607

Ser Ala Cys Thr Val Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His

830 835 840

atg ctc atc gct att cgc ccc aga aat aat gcc gag aaa gag aaa cag 26655

Met Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln

845 850 855

cca taacacactt ttttcacaca ctttttcaga ccatggcctc tgttactttt 26708

Pro

860

ttgataattg ccagtctcat tgctgttata agtaatgaga aactcactgt ttacattggt 26768

actaatcaca cactagaagg aattccaaaa tcctcatggt attgctattt tgatcaaaat 26828

ccagagctaa ctgtagaatt atgtggtaac agtggaaaaa ataaaagcat tcatttaatt 26888

aactttgggt gcggaaaaga tttgatacta attaatatca ctagagagta tgcaggtatg 26948

tactactacg ccacagaagg catttcaaac atgcaatttt atgaagttac tgtgtctgaa 27008

cctaccacgc ctaaaatgac cacaaccaca aagactacag ttgtcagcac tatacatctc 27068

agtaccaatg gcatttttgc tatgcttcag atggccaata acagcaccag catgccacac 27128

acacctacac tacctagtga ggaaatacct aaatccatgg tcgccatcat agctgctgta 27188

gtggtgtgta tgctgattat tgttttgtgc atggcctact acgcttgcta ctacagaagg 27248

cacagactaa gcgacaagct ggaccaatta ctgagtgttg aattttaatt ttttagaacc 27308

atgaagatcc tagccctttt tgttttatct ataattacat ctgctctatg caattctgac 27368

aatgaggatg ttactgtcat tgtcggatcc aattacacac taaaaggccc aaccgagggt 27428

atgctttcgt ggtattgcta ttttggaact gatactaagc aaacagagct ctgtaatgct 27488

atgaaaggtc agatgcctac ttccagaatt aaacataaat gtaatggtac tgacctaata 27548

ctactcaata tcacaaaagc atatgctggc agttattcct gccctgcggc caacactgag 27608

gatatgattt tttacaaagt aactgttgtt gatcccacca ctccaccacc acccaccaca 27668

actactaaag ctaccacaca cacacagcaa accacaccag ataaggggtt tgccttaaaa 27728

gttcagcaag aaggcggggc ttcttttgga gctaagaccc ctactcctga tcctcactgt 27788

ccagggccat tagtcagcag cattgtcggt gtgctttgtg gtttagcaat tgtgatcatc 27848

tgcttgttca tttttgcttg ctgctacaga aggcttcacc gacaaaaatc agacccactg 27908

ctgaacctct atg ttt aat ttt aat ttc aga acc atg aag gca ctt atc 27957

Met Phe Asn Phe Asn Phe Arg Thr Met Lys Ala Leu Ile

865 870

act tta gtt ttt ctt tct tta att ggc act gtt ttt agt gtt agt ttt 28005

Thr Leu Val Phe Leu Ser Leu Ile Gly Thr Val Phe Ser Val Ser Phe

875 880 885

ttg aaa aaa atc aat gtc act gag gga gat aat gtg aca ttg gtt ggt 28053

Leu Lys Lys Ile Asn Val Thr Glu Gly Asp Asn Val Thr Leu Val Gly

890 895 900 905

gta gaa ggt gct gaa aat acc acc tgg aca aaa tat cac atg gat ggg 28101

Val Glu Gly Ala Glu Asn Thr Thr Trp Thr Lys Tyr His Met Asp Gly

910 915 920

tgg aaa gaa att tgc acc tgg aat gtc act act tat aca tgt gag gga 28149

Trp Lys Glu Ile Cys Thr Trp Asn Val Thr Thr Tyr Thr Cys Glu Gly

925 930 935

gtt aac ctt acc att gtc aat gtt agc gaa att caa aag ggt tgg att 28197

Val Asn Leu Thr Ile Val Asn Val Ser Glu Ile Gln Lys Gly Trp Ile

940 945 950

aaa ggg cag agt gtt agt gtc acc agt gag gga tat ttt acc caa cat 28245

Lys Gly Gln Ser Val Ser Val Thr Ser Glu Gly Tyr Phe Thr Gln His

955 960 965

act ttt atc tat gac att att gtc ata cca ctg ccc aca cct agc cca 28293

Thr Phe Ile Tyr Asp Ile Ile Val Ile Pro Leu Pro Thr Pro Ser Pro

970 975 980 985

cct agc act act cag acc aca caa aca acc tac act aca cag agc tcc 28341

Pro Ser Thr Thr Gln Thr Thr Gln Thr Thr Tyr Thr Thr Gln Ser Ser

990 995 1000

aca act acc ttg caa acc act cag aca acc act tac act aca tcc 28386

Thr Thr Thr Leu Gln Thr Thr Gln Thr Thr Thr Tyr Thr Thr Ser

1005 1010 1015

act cag ccc acc acc act aca gca gag gcg agt agc tca cca acc 28431

Thr Gln Pro Thr Thr Thr Thr Ala Glu Ala Ser Ser Ser Pro Thr

1020 1025 1030

atc aaa gtg gca ttt ttg tta ctg cca cca tct agc agt cct gct 28476

Ile Lys Val Ala Phe Leu Leu Leu Pro Pro Ser Ser Ser Pro Ala

1035 1040 1045

agt acc aat gag cag act act gaa ttt ttg tcc act att cag agc 28521

Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser Thr Ile Gln Ser

1050 1055 1060

agc acc aca gct acc tcg agt gcc ttc tct agc acc gcc aat ctc 28566

Ser Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr Ala Asn Leu

1065 1070 1075

acc tcg ctt tcc tct acg cca atc agt aat gct act acc tcc cca 28611

Thr Ser Leu Ser Ser Thr Pro Ile Ser Asn Ala Thr Thr Ser Pro

1080 1085 1090

gct cct ctt ccc act cct ctg aag caa tca gag tct agc acg cag 28656

Ala Pro Leu Pro Thr Pro Leu Lys Gln Ser Glu Ser Ser Thr Gln

1095 1100 1105

ctg cag atc acc ctg ctc att gtg atc ggg gtg gtc atc ctg gca 28701

Leu Gln Ile Thr Leu Leu Ile Val Ile Gly Val Val Ile Leu Ala

1110 1115 1120

gtg ctg ctc tac ttt atc ttc tgc cgc cgc atc ccc aac gcg aag 28746

Val Leu Leu Tyr Phe Ile Phe Cys Arg Arg Ile Pro Asn Ala Lys

1125 1130 1135

ccg gcc tac aag ccc att gtt atc ggc acg ccg gag ccg ctt cag 28791

Pro Ala Tyr Lys Pro Ile Val Ile Gly Thr Pro Glu Pro Leu Gln

1140 1145 1150

gtg gag gga ggt cta agg aat ctt ctc ttc tct ttt aca gta tgg 28836

Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp

1155 1160 1165

tgatttgaac tatgattcct agacatttca ttatcacttc tctaatctgt gtgctccaag 28896

tctgtgccac cctcgctctc gtggctaacg cgagtccaga ctgcattgga gcgttcgcct 28956

cctacgtgct ctttgccttc atcacctgca tctgctgctg tagcatagtc tgcctgctta 29016

tcaccttctt ccagttcgtt gactgggtct ttgtgcgcat cgcctacctg cgccaccatc 29076

cccagtaccg cgaccagaga gtggcgcaac tgttgagact catctgataa gcatgcgggc 29136

tctgctactt ctcgcgcttg tgctagctcc cctcgccgcc cccttaaccc tcaaatcccc 29196

cacccagtcc cctgaagagg ttcgaaaatg taaattccaa gaaccctgga aattcctttc 29256

atgctacaaa ctcaaatcag aaatgcaccc cagctggatc atgatcattg gaatcgtgaa 29316

catccttgcc tgtaccctca tctcctttgt gatttacccc tgctttaact ttgggtggaa 29376

cgcacccgag gcgctctggc tcccgcctga tcccgacaca ccaccacagc agcagcaaaa 29436

tcaggcacat gcaccaccac agcctaggcc acaatacatg cccatcttag actatgaggc 29496

cgagccacag cgagccatgc ttcctgctat tagttacttc aatctaaccg gcggag atg 29555

Met

act gac ccc atg gcc aac aac acc gtc aac gac ctc ctg gac atg 29600

Thr Asp Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met

1170 1175 1180

gac ggc cgc gcc tcg gag cag cga ctc gcc caa ctc cgc atc cgc 29645

Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg

1185 1190 1195

cag cag cag gag aga gcc gtc aag gag ctg cag gat gcg gtg gcc 29690

Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala

1200 1205 1210

atc cac cag tgc aag aaa ggc atc ttc tgc ctg gtg aag cag gcc 29735

Ile His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala

1215 1220 1225

aag atc acc ttc gag gtg act tcc acc gac cat cgc ctc tcc tac 29780

Lys Ile Thr Phe Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr

1230 1235 1240

gag ctc ctg cag cag cgc cag aag ttc acc tgc ctg gtc gga gtc 29825

Glu Leu Leu Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val

1245 1250 1255

aac ccc atc gtc atc acc cag cag tct ggc gat acc aag ggg tgc 29870

Asn Pro Ile Val Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys

1260 1265 1270

atc cac tgc tcc tgc gac tcc ccc gag tgc gtt cac acc ttg atc 29915

Ile His Cys Ser Cys Asp Ser Pro Glu Cys Val His Thr Leu Ile

1275 1280 1285

aag acc ctc tgc ggc ctc cgc gac ctc ctc ccc atg aac taatca 29960

Lys Thr Leu Cys Gly Leu Arg Asp Leu Leu Pro Met Asn

1290 1295 1300

<210> SEQ ID NO 21

<211> LENGTH: 497

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 21

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu

1 5 10 15

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn

20 25 30

Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu

35 40 45

Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala

50 55 60

Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly

65 70 75 80

Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr

85 90 95

Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His Asp

100 105 110

Glu Thr Asn His Arg Asn Glu Leu Thr Val Ser Leu Met Ser Arg Lys

115 120 125

Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr Asp

130 135 140

Glu Val Ser Val Met His Glu Arg Phe Ser Leu Glu Gln Val Lys Thr

145 150 155 160

Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn Tyr

165 170 175

Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys Leu

180 185 190

Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu Val

195 200 205

Glu Ile Cys Leu Gln Glu Arg Val Ala Phe Arg Cys Cys Met Met Asn

210 215 220

Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn Met

225 230 235 240

Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn Thr

245 250 255

Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr Cys

260 265 270

Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser Ala

275 280 285

Asn Trp Met Gly Ile Val Gly Arg Thr Lys Ser Val Leu Ser Val Lys

290 295 300

Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly Glu

305 310 315 320

Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val Leu

325 330 335

Cys Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala Ser

340 345 350

Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Gly Gly Asn Ser His

355 360 365

Met Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro Trp Pro

370 375 380

Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly Ser

385 390 395 400

Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val Lys

405 410 415

Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly Val

420 425 430

Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser

435 440 445

Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe

450 455 460

Gln Pro Val Cys Val Asp Val Thr Glu Asp Leu Arg Pro Asp His Leu

465 470 475 480

Val Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser

485 490 495

Asp

<210> SEQ ID NO 22

<211> LENGTH: 187

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 22

Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu

1 5 10 15

Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Glu Glu

20 25 30

Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln

35 40 45

Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu

50 55 60

Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr

65 70 75 80

Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser Arg

85 90 95

Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys

100 105 110

Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile

115 120 125

Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg

130 135 140

Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His

145 150 155 160

Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Phe Gln Glu Glu Ala Thr

165 170 175

Ala Ala Glu Lys Asp Gln Lys Thr Ser Ser Ser

180 185

<210> SEQ ID NO 23

<211> LENGTH: 176

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 23

Met Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val Val Leu

1 5 10 15

Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu Lys Lys Ala

20 25 30

Asp Pro Cys Leu Thr Phe Asn Pro Asp Asn Cys Arg Leu Ser Phe Gln

35 40 45

Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys

50 55 60

Glu Ser Val Val Ile Gln Tyr Lys Asn Lys Thr Trp Asn Asn Thr Leu

65 70 75 80

Ala Ser Thr Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val

85 90 95

Pro Gly Ala Asp Gly Ser Leu Arg Thr Val Asn Asn Thr Phe Ile Phe

100 105 110

Glu His Met Cys Glu Thr Ala Met Phe Met Ser Lys Gln Tyr Gly Met

115 120 125

Trp Pro Pro Arg Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser

130 135 140

Ala Cys Thr Val Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met

145 150 155 160

Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro

165 170 175

<210> SEQ ID NO 24

<211> LENGTH: 306

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 24

Met Phe Asn Phe Asn Phe Arg Thr Met Lys Ala Leu Ile Thr Leu Val

1 5 10 15

Phe Leu Ser Leu Ile Gly Thr Val Phe Ser Val Ser Phe Leu Lys Lys

20 25 30

Ile Asn Val Thr Glu Gly Asp Asn Val Thr Leu Val Gly Val Glu Gly

35 40 45

Ala Glu Asn Thr Thr Trp Thr Lys Tyr His Met Asp Gly Trp Lys Glu

50 55 60

Ile Cys Thr Trp Asn Val Thr Thr Tyr Thr Cys Glu Gly Val Asn Leu

65 70 75 80

Thr Ile Val Asn Val Ser Glu Ile Gln Lys Gly Trp Ile Lys Gly Gln

85 90 95

Ser Val Ser Val Thr Ser Glu Gly Tyr Phe Thr Gln His Thr Phe Ile

100 105 110

Tyr Asp Ile Ile Val Ile Pro Leu Pro Thr Pro Ser Pro Pro Ser Thr

115 120 125

Thr Gln Thr Thr Gln Thr Thr Tyr Thr Thr Gln Ser Ser Thr Thr Thr

130 135 140

Leu Gln Thr Thr Gln Thr Thr Thr Tyr Thr Thr Ser Thr Gln Pro Thr

145 150 155 160

Thr Thr Thr Ala Glu Ala Ser Ser Ser Pro Thr Ile Lys Val Ala Phe

165 170 175

Leu Leu Leu Pro Pro Ser Ser Ser Pro Ala Ser Thr Asn Glu Gln Thr

180 185 190

Thr Glu Phe Leu Ser Thr Ile Gln Ser Ser Thr Thr Ala Thr Ser Ser

195 200 205

Ala Phe Ser Ser Thr Ala Asn Leu Thr Ser Leu Ser Ser Thr Pro Ile

210 215 220

Ser Asn Ala Thr Thr Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln

225 230 235 240

Ser Glu Ser Ser Thr Gln Leu Gln Ile Thr Leu Leu Ile Val Ile Gly

245 250 255

Val Val Ile Leu Ala Val Leu Leu Tyr Phe Ile Phe Cys Arg Arg Ile

260 265 270

Pro Asn Ala Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Thr Pro Glu

275 280 285

Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr

290 295 300

Val Trp

305

<210> SEQ ID NO 25

<211> LENGTH: 134

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 25

Met Thr Asp Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met

1 5 10 15

Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln

20 25 30

Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His

35 40 45

Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Thr

50 55 60

Phe Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln

65 70 75 80

Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile

85 90 95

Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp

100 105 110

Ser Pro Glu Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu Arg

115 120 125

Asp Leu Leu Pro Met Asn

130

<210> SEQ ID NO 26

<211> LENGTH: 870

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1321

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (6)..(570)

<223> OTHER INFORMATION: E1a

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (664)..(869)

<223> OTHER INFORMATION: E1a

<400> SEQUENCE: 26

gaaag atg agg cac ctg aga aac ctg ccc gag gag aaa atc ctc atc gct 50

Met Arg His Leu Arg Asn Leu Pro Glu Glu Lys Ile Leu Ile Ala

1 5 10 15

tcc ggg aac gag att ctg gaa ctg gtg gta aat gcc atg atg ggc gac 98

Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp

20 25 30

gac cct ccg gag ccc ccc acc cca ttt gag gca cct tcg cta cac gat 146

Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu His Asp

35 40 45

ttg tat gat ctg gag gtg gat gtg ccc gag gac gac ccc aac gag gag 194

Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu

50 55 60

gcg gta aat gat tta ttt agc gat gcc gcg ctg cta gct gcc gag gag 242

Ala Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu

65 70 75

gct tcg agc tct agc tca gac agc gac tcc tca ctg cat acc cct aga 290

Ala Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg

80 85 90 95

ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag atg 338

Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met

100 105 110

gac ttg cgc tgc tat gag gaa tgc ttg ccc ccg agc gat gat gag gac 386

Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp

115 120 125

gag cag gcg atc cag aac gca gcg agt cag gga atg caa gcc gcc aga 434

Glu Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Met Gln Ala Ala Arg

130 135 140

gag agc ttt gcg ctg gac tgc ccg cct ctg ccc gga cac ggc tgt aag 482

Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys

145 150 155

tct tgt gaa ttt cat cgc atg aat act gga gat aaa gct gtg ttg tgt 530

Ser Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys

160 165 170 175

gca ctt tgc tat atg aga gct tac aac cat tgt gtt tac a gtaagtgtga 580

Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr

180 185

ttaagttgaa ctttagaggg aggcagagag cagggtgact gggcgatgac tggtttattt 640

atgtatatat gttctttata tag gt ccc gtc tct gac gca gat gat gag acc 692

Ser Pro Val Ser Asp Ala Asp Asp Glu Thr

190 195

ccc act aca gag tcc act tcg tca ccc cca gaa att ggc aca tct cca 740

Pro Thr Thr Glu Ser Thr Ser Ser Pro Pro Glu Ile Gly Thr Ser Pro

200 205 210

cct gag aat att gtt aga cca gtt cct gtt aga gcc act ggg agg aga 788

Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr Gly Arg Arg

215 220 225 230

gca gct gtg gaa agt ttg gat gac ttg cta cag ggt ggg gat gaa cct 836

Ala Ala Val Glu Ser Leu Asp Asp Leu Leu Gln Gly Gly Asp Glu Pro

235 240 245

ttg gac ttg tgt acc cgg aaa cgc ccc agg cac t 870

Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His

250 255

<210> SEQ ID NO 27

<211> LENGTH: 257

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 27

Met Arg His Leu Arg Asn Leu Pro Glu Glu Lys Ile Leu Ile Ala Ser

1 5 10 15

Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp

20 25 30

Pro Pro Glu Pro Pro Thr Pro Phe Glu Ala Pro Ser Leu His Asp Leu

35 40 45

Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala

50 55 60

Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala

65 70 75 80

Ser Ser Ser Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro

85 90 95

Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp

100 105 110

Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu

115 120 125

Gln Ala Ile Gln Asn Ala Ala Ser Gln Gly Met Gln Ala Ala Arg Glu

130 135 140

Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser

145 150 155 160

Cys Glu Phe His Arg Met Asn Thr Gly Asp Lys Ala Val Leu Cys Ala

165 170 175

Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser

180 185 190

Asp Ala Asp Asp Glu Thr Pro Thr Thr Glu Ser Thr Ser Ser Pro Pro

195 200 205

Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val

210 215 220

Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asp Asp Leu Leu

225 230 235 240

Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg

245 250 255

His

<210> SEQ ID NO 28

<211> LENGTH: 36542

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1325

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (1)..(129)

<223> OTHER INFORMATION: ITR

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (1604)..(2173)

<223> OTHER INFORMATION: E1b\19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (3452)..(3922)

<223> OTHER INFORMATION: pIX

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (3988)..(5609)

<223> OTHER INFORMATION: IVa2 complement (3988..5318,5597..5609)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (5091)..(13826)

<223> OTHER INFORMATION: pol complement (5091..8651,13818..13826)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (8453)..(13826)

<223> OTHER INFORMATION: pTP complement (8453..10378,13818..13826)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (10809)..(11984)

<223> OTHER INFORMATION: 52K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (12011)..(13789)

<223> OTHER INFORMATION: pIIIa

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (13866)..(15491)

<223> OTHER INFORMATION: penton

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (15498)..(16079)

<223> OTHER INFORMATION: pVII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (16127)..(17167)

<223> OTHER INFORMATION: V

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17193)..(17423)

<223> OTHER INFORMATION: pX

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17495)..(18220)

<223> OTHER INFORMATION: pVI

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (18260)..(21136)

<223> OTHER INFORMATION: hexon

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (21161)..(21781)

<223> OTHER INFORMATION: protease

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (21867)..(23399)

<223> OTHER INFORMATION: DBP complement (21867..23399)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23425)..(25812)

<223> OTHER INFORMATION: 100K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (26430)..(27110)

<223> OTHER INFORMATION: pVIII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27114)..(27431)

<223> OTHER INFORMATION: E3\12.5K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27999)..(28526)

<223> OTHER INFORMATION: E3\gp19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28559)..(29176)

<223> OTHER INFORMATION: E3\CR1-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29192)..(29800)

<223> OTHER INFORMATION: E3\CR1-gamma

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29818)..(30681)

<223> OTHER INFORMATION: E3\CR1-delta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (30692)..(30964)

<223> OTHER INFORMATION: E3\RID-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (30973)..(31404)

<223> OTHER INFORMATION: E3\RID-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (32101)..(33429)

<223> OTHER INFORMATION: fiber

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33543)..(34696)

<223> OTHER INFORMATION: E4\orf6/7 complement (33543..33794,34526..34696)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33794)..(34696)

<223> OTHER INFORMATION: E4\orf6 complement (33794..34696)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34605)..(34967)

<223> OTHER INFORMATION: E4\orf4 complement (34605..34967)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34980)..(35330)

<223> OTHER INFORMATION: E4\orf3 complement (34980..35330)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35330)..(35716)

<223> OTHER INFORMATION: E4\orf2 complement (35330..35716)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35760)..(36131)

<223> OTHER INFORMATION: E4\orf1 complement (35760..36131)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (36413)..(36542)

<223> OTHER INFORMATION: ITR

<400> SEQUENCE: 28

catcatcaat aatatacctc aaactttttg tgcgcgttaa tatgcaaatg aggcgtttga 60

atttggggat gcggggctgt gattggccga gaggagggcg accgttaggg gcggggcgag 120

tgacgttttg atgacgcgac cgtgaggagg agccagtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga aacctgcccg 600

aggagaaaat cctcatcgct tccgggaacg agattctgga actggtggta aatgccatga 660

tgggcgacga ccctccggag ccccccaccc catttgaggc gccttcgctg tacgatttgt 720

atgatctgga ggtggatgtg cccgagaacg accccaacga ggaggcggtg aatgatttgt 780

ttagcgatgc cgcgctgctg gctgccgagc aggctaatac ggactctggc tcagacagcg 840

attcctctct ccataccccg agacccggca gaggtgagaa aaagatcccc gagcttaaag 900

gggaagagct cgacctgcgc tgctatgagg aatgcttgcc tccgagcgat gatgaggagg 960

acgaggaggc gattcgagct gcagcgagcg agggagtgaa agctgcgggc gagagcttta 1020

gcctggactg tcctactctg cccggacacg gctgtaagtc ttgtgaattt catcgcatga 1080

atactggaga taagaatgtg atgtgtgccc tgtgctatat gagagcttac aaccattgtg 1140

tttacagtaa gtgtgattaa ctttagctgg gaaggcagag ggtgactggg tgctgactgg 1200

tttatttatg tatatgtttt tttttatgtg taggtcccgt ctctgacgta gatgagaccc 1260

ccacttcaga gtgtatttca tcacccccag aaattggcga ggaaccgccc gaagatatca 1320

ttcatagacc agttgcagtg agagtcaccg ggcggagagc agctgtggag agtttgaatg 1380

acttgctaca gggtggggat gaacctttgg acttgtgtac ccggaaacgc cccaggcact 1440

aagtgccaca catgtgtgtt tacttaaggt gatgtcagta tttatagggt gtggagtgca 1500

ataaaatccg tgttgacttt aagtgcgtgg tttatgactc agggtgggga ctgtgggtat 1560

ataagcaggt gcagacctgt gtggtcagtt cagagcagga ctc atg gag atc tgg 1615

Met Glu Ile Trp

1

aca gtc ttg gaa gac ttt cac cag act aga cag ctg cta gag aac tca 1663

Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu Leu Glu Asn Ser

5 10 15 20

tcg gag gaa gtc tct tac ctg tgg aga ttc tgc ttc ggt ggg cct cta 1711

Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Pro Leu

25 30 35

gct aag cta gtc tat agg gcc aaa cag gat tat agg gaa caa ttt gag 1759

Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg Glu Gln Phe Glu

40 45 50

gat att ttg aga gag tgt cct ggt att ttt gac tct ctc aac ttg ggc 1807

Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser Leu Asn Leu Gly

55 60 65

cat cag tct cac ttt aac cag agt att ctg aga gcc ctt gac ttt tct 1855

His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala Leu Asp Phe Ser

70 75 80

act cct ggc aga act acc gct gcg gta gcc ttt ttt gcc ttt atc ctt 1903

Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Ile Leu

85 90 95 100

gac aaa tgg agt caa gaa acc cat ttc agc agg gat tac cgt ctg gac 1951

Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Arg Leu Asp

105 110 115

tgc tta gca gta gct ttg tgg aga aca tgg agg tgc cag cgc ctg aat 1999

Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys Gln Arg Leu Asn

120 125 130

gca atc tcc ggc tac ttg cca gta cag ccg gta gac acg ctg agg atc 2047

Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp Thr Leu Arg Ile

135 140 145

ctg agt ctc cag tca ccc cag gaa cac caa cgc cgc cag cag ccg cag 2095

Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg Gln Gln Pro Gln

150 155 160

cag cag caa gag gag gag gag gac cga gaa gag aac ccg aga gcc ggt 2143

Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu Glu Asn Pro Arg Ala Gly

165 170 175 180

ctg gac cct ccg gtg gcg gag gag gag gag tagctgactt gtttcccgag 2193

Leu Asp Pro Pro Val Ala Glu Glu Glu Glu

185 190

ctgtgccggg tgctgactag gtcttccagt ggacgggaga gggggattaa gcgggagagg 2253

catgaggaga ctagtcacag aactgaactg actgtcagtt tgatgagccg caggcgccca 2313

gaatcggtgt ggtggcatga ggttcagtcg cagggggtag atgaggtctc ggtgatgcat 2373

gagaaatatt ccctagaaca agtcaagact tgttggttgg agcccgagga tgattgggag 2433

gtagccatca ggaattatgc caagctggct ctgaggccag acaagaagta caagattacc 2493

aaactgatta atatcagaaa ttcctgctac atttcgggga atggggccga ggtggagatc 2553

agtacccagg agagggtggc cttcagatgt tgtatgatga atatgtaccc gggggtggtg 2613

ggcatggagg gagtcacctt tatgaacgcg aggttcaggg gtgatgggta taatggggtg 2673

gtctttatgg ccaacaccaa gctgacagtg cacggttgct ccttctttgg ctttaataac 2733

atgtgcatcg aggcctgggg cagtgtttca gtgaggggat gcagtttttc agccaactgg 2793

atgggggtcg tgggcagaac caagagcaag gtgtcagtga agaaatgcct gttcgagagg 2853

tgccacctgg gggtgatgag cgagggcgaa gccaaagtca aacactgcgc ctctaccgag 2913

acgggctgct ttgtgctgat caagggcaat gccaaagtca agcataatat gatctgtggg 2973

gcctcggatg agcgcggcta ccagatgctg acctgcgccg gtgggaacag ccatatgctg 3033

gccaccgtgc atgtggcctc gcacccccgc aagacatggc ccgagttcga gcacaacgtc 3093

atgacccgct gcaatgtgca cctgggctcc cgccgaggca tgttcatgcc ataccagtgc 3153

aacatgcaat ttgtgaaggt gctgctggag cccgatgcca tgtccagagt gagcctgacg 3213

ggggtgtttg acatgaatgt ggagctgtgg aaaattctga gatatgatga atccaagacc 3273

aggtgccggg cctgcgaatg cggaggcaag cacgccaggc ttcagcccgt gtgtgtggag 3333

gtgacggagg acctgcgacc cgatcatttg gtgttgtcct gcaacgggac ggagttcggc 3393

tccagcgggg aagaatctga ctagagtgag tagtgtttgg gggtgggtgg gagcctgc 3451

atg atg ggc aga atg act aaa atc tgt gtt ttt ctg cgc agc agc atg 3499

Met Met Gly Arg Met Thr Lys Ile Cys Val Phe Leu Arg Ser Ser Met

195 200 205

agc gga agc gcc tcc ttt gag gga ggg gta ttc agc cct tat ctg acg 3547

Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr

210 215 220

ggg cgt ctc ccc tcc tgg gct gga gtg cgt cag aat gtg atg gga tcc 3595

Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser

225 230 235

acg gtg gac ggc cgg ccc gtg cag ccc gcg aac tct tca acc ctg acc 3643

Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr

240 245 250

tac gcg acc ctg agc tcc tcg tcc gtg gac gca gct gcc gcc gca gct 3691

Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala

255 260 265 270

gct gct tcc gcc gcc agc gcc gtg cgc gga atg gcc ctg ggt gcc ggc 3739

Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly

275 280 285

tac tac agc tct ctg gtg gcc aac tcg agt tcc gcc aat aat ccc gcc 3787

Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Ala Asn Asn Pro Ala

290 295 300

agc ctg aac gag gag aag ctg ctg ctg ctg atg gcc cag ctc gag gcc 3835

Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala

305 310 315

ctg acc cag cgc ctg ggc gag ctg acc cag cag gtg gct cag ctg cag 3883

Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln

320 325 330

gcg gag acg cgg gcc gcg gtt gcc acg gtg aaa acc aaa taaaaaatga 3932

Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys

335 340 345

atcaataaat aaacggagac ggttgttgat tttaacacag agtcttgaat ctttatttga 3992

tttttcgcgc gcggtaggcc ctggaccacc ggtctcgatc attgagcacc cggtggatct 4052

tttccaggac ccggtagagg tgggcttgga tgttgaggta catgggcatg agcccgtccc 4112

gggggtggag gtagctccat tgcagggcct cgtgctcggg ggtggtgttg taaatcaccc 4172

agtcatagca ggggcgcagg gcgtggtgct gcacgatgtc cttgaggagg agactgatgg 4232

ccacgggcag ccccttggtg taggtgttga cgaacctgtt gagctgggag ggatgcatgc 4292

ggggggagat gagatgcatc ttggcctgga tcttgagatt ggcgatgttc ccgcccagat 4352

cccgccgggg gttcatgttg tgcaggacca ccagcacggt gtatccggtg cacttgggga 4412

atttgtcatg caacttggaa gggaaggcgt gaaagaattt ggagacgccc ttgtgaccgc 4472

ccaggttttc catgcactca tccatgatga tggcgatggg cccgtgggcg gcggcctggg 4532

caaagacgtt tcgggggtcg gacacatcgt agttgtggtc ctgggtgagc tcgtcatagg 4592

ccattttaat gaatttgggg cggagggtgc ccgactgggg gacaaaggtg ccctcgatcc 4652

cgggggcgta gttgccctcg cagatctgca tctcccaggc cttgagctcg gaggggggga 4712

tcatgtccac ctgcggggcg atgaaaaaaa cggtttccgg ggcgggggag atgagctgcg 4772

ccgaaagcag gttccggagc agctgggact tgccgcagcc ggtggggccg tagatgaccc 4832

cgatgaccgg ctgcaggtgg tagttgaggg agagacagct gccgtcctcg cggaggaggg 4892

gggccacctc gttcatcatc tcgcgcacat gcatgttctc gcgcacgagt tccgccagga 4952

ggcgctcgcc ccccagcgag aggagctctt gcagcgaggc gaagtttttc agcggcttga 5012

gtccgtcggc catgggcatt ttggagaggg tctgttgcaa gagttccaga cggtcccaga 5072

gctcggtgat gtgctctagg gcatctcgat ccagcagacc tcctcgtttc gcgggttggg 5132

gcgactgcgg gagtagggca ccaggcgatg ggcgtccagc gaggccaggg ttcggtcctt 5192

ccagggccgc agggtccgcg tcagcgtggt ctccgtcacg gtgaaggggt gcgcgccggg 5252

ctgggcgctt gcgagggtgc gcttcaggct catccggctg gtcgagaacc gctcccggtc 5312

ggcgccctgc gcgtcggcca ggtagcaatt gagcatgagt tcgtagttga gcgcctcggc 5372

cgcgtggccc ttggcgcgga gcttaccttt ggaagtgtgt ccgcagacgg gacagaggag 5432

ggacttgagg gcgtagagct tgggggcgag gaagacggac tcgggggcgt aggcgtccgc 5492

gccgcagctg gcgcagacgg tctcgcactc cacgagccag gtgaggtcgg ggcggtcggg 5552

gtcaaaaacg aggtttcctc cgtgcttttt gatgcgtttc ttacctctgg tctccatgag 5612

ctcgtgtccc cgctgggtga caaagaggct gtccgtgtcc ccgtagaccg actttatggg 5672

ccggtcctcg agcggggtgc cgcggtcctc gtcgtagagg aaccccgccc actccgagac 5732

gaaggcccgg gtccaggcca gcacgaagga ggccacgtgg gaggggtagc ggtcgttgtc 5792

caccagcggg tccaccttct ccagggtatg caagcacatg tccccctcgt ccacatccag 5852

gaaggtgatt ggcttgtaag tgtaggccac gtgaccgggg gtcccggccg ggggggtata 5912

aaagggggcg ggcccctgct cgtcctcact gtcttccgga tcgctgtcca ggagcgccag 5972

ctgttggggt aggtattccc tctcgaaggc gggcatgacc tcggcactca ggttgtcagt 6032

ttctagaaac gaggaggatt tgatattgac ggtgccgttg gagacgcctt tcatgagccc 6092

ctcgtccatc tggtcagaaa agacgatctt tttgttgtcg agcttggtgg cgaaggagcc 6152

gtagagggcg ttggagagca gcttggcgat ggagcgcatg gtctggttct tttccttgtc 6212

ggcgcgctcc ttggcggcga tgttgagctg cacgtactcg cgcgccacgc acttccattc 6272

ggggaagacg gtggtgagct cgtcgggcac gattctgacc cgccagccgc ggttgtgcag 6332

ggtgatgagg tccacgctgg tggccacctc gccgcgcagg ggctcgttgg tccagcagag 6392

gcgcccgccc ttgcgcgagc agaagggggg cagcgggtcc agcatgagct cgtcgggggg 6452

gtcggcgtcc acggtgaaga tgccgggcag gagctcgggg tcgaagtagc tgatgcaggt 6512

gcccagatcg tccagcgccg cttgccagtc gcgcacggcc agcgcgcgct cgtaggggct 6572

gaggggcgtg ccccagggca tggggtgcgt gagcgcggag gcatacatgc cgcagatgtc 6632

gtagacgtag aggggctcct ctaggacgcc gatgtaggtg gggtagcagc gccccccgcg 6692

gatgctggcg cgcacgtagt cgtacagctc gtgcgagggc gcgaggaggc ccgtgccgag 6752

gttggagcgc tgcggctttt cggcgcggta gacgatctgg cggaagatgg cgtgggagtt 6812

ggaggagatg gtgggcctct ggaagatgtt gaagtgggcg tggggcaggc cgaccgagtc 6872

cctgatgaag tgggcgtagg agtcctgcag cttggcgacg agctcggcgg tgacgaggac 6932

gtccagggcg cagtagtcga gggtctcttg gatgatgtca tacttgagct ggcccttctg 6992

cttccacagc tcgcggttga gaaggaactc ttcgcggtcc ttccagtact cttcgagggg 7052

gaacccgtcc tgatcggcac ggtaagagcc caccatgtag aactggttga cggccttgta 7112

ggcgcagcag cccttctcca cggggagggc gtaggcttgc gcggccttgc gcagggaggt 7172

gtgggtgagg gcgaaggtgt cgcgcaccat gaccttgagg aactggtgct tgaagtcgag 7232

gtcgtcgcag ccgccctgct cccagagttg gaagtccgtg cgcttcttgt aggcggggtt 7292

gggcaaagcg aaagtaacat cgttgaagag gatcttgccc gcgcggggca tgaagttgcg 7352

agtgatgcgg aaaggctggg gcacctcggc ccggttgttg atgacctggg cggcgaggac 7412

gatctcgtcg aagccgttga tgttgtgccc gacgatgtag agttccacga atcgcgggcg 7472

gcccttgacg tggggcagct tcttgagctc gtcgtaggtg agctcggcgg ggtcgctgag 7532

cccgtgctgc tcgagggccc agtcggcgac gtgggggttg gcgctgagga aggaagtcca 7592

gagatccacg gccagggcgg tctgcaagcg gtcccggtac tgacggaact gctgacccac 7652

ggccattttt tcgggggtga cgcagtagaa ggtgcggggg tcgccgtgcc agcggtccca 7712

cttgagctgg agggcgaggt cgtgggcgag ctcgacgagc ggcgggtccc cggagagttt 7772

catgaccagc atgaagggga cgagctgctt gccgaaggac cccatccagg tgtaggtttc 7832

cacatcgtag gtgaggaaga gcctttcggt gcgaggatgc gagccgatgg ggaagaactg 7892

gatctcctgc caccagttgg aggaatggct gttgatgtga tggaagtaga aatgccgacg 7952

gcgcgccgag cactcgtgct tgtgtttata caagcgtccg cagtgctcgc aacgctgcac 8012

gggatgcacg tgctgcacga gctgtacctg ggttcctttg acgaggaatt tcagtgggca 8072

gtggagcgct ggcggctgca tctcgtgctg tactacgtcc tggccatcgg cgtggccatc 8132

gtctgcctcg atggtggtca tgctgacgag cccgcgcggg aggcaggtcc agacctcggc 8192

tcggacgggt cggagagcga ggacgagggc gcgcaggccg gagctgtcca gggtcctgag 8252

acgctgcgga gtcaggtcag tgggcagcgg cggcgcgcgg ttgacttgca ggagcttttc 8312

cagggcgcgc gggaggtcca gatggtactt gatctccacg gcgccgttgg tggcgacgtc 8372

cacggcttgc agggtcccgt gcccctgggg cgccaccacc gtgccccgtt tcttcttggg 8432

cgctgcttcc atgtcggtca gaagcggcgg cgaggacgcg cgccgggcgg caggggcggc 8492

tcggggcccg gaggcagggg cggcaggggc acgtcggcgc cgcgcgcggg caggttctgg 8552

tactgcgccc ggagaagact ggcgtgagcg acgacgcggc ggttgacgtc ctggatctga 8612

cgcctctggg tgaaggccac gggacccgtg agtttgaacc tgaaagagag ttcgacagaa 8672

tcaatttcgg tatcgttgac ggcggcctgc cgcaggatct cttgcacgtc gcccgagttg 8732

tcctggtagg cgatctcggt catgaactgc tcgatctcct cctcctgaag gtctccgcgg 8792

ccggcgcgct cgacggtggc cgcgaggtcg ttggagatgc ggcccatgag ctgcgagaag 8852

gcgttcatgc cggcctcgtt ccagacgcgg ctgtagacca cggctccgtc ggggtcgcgc 8912

gcgcgcatga ccacctgggc gaggttgagc tcgacgtggc gcgtgaagac cgcgtagttg 8972

cagaggcgct ggtagaggta gttaagcgtg gtggcgatgt gctcggtgac gaagaagtac 9032

atgatccagc ggcggagcgg catctcgctg acgtcgccca gcgcctccaa gcgttccatg 9092

gcctcgtaaa agtccacggc gaagttgaaa aactgggagt tgcgcgccga gacggtcaac 9152

tcctcctcca gaagacggat gagctcggcg atggtggcgc gcacctcgca ctcgaaggcc 9212

cccgggagtt cctccacttc ctcttcttcc atctcctcca ctaacatctc ttctacttcc 9272

tcctcaggcg gtggtggcgg gggagggggc ctgcgtcgcc ggcggcgcac gggcagacgg 9332

tcgatgaaac gctcgatggt ctcgccgcgc cggcgtcgca tggtctcggt gacggcgcgc 9392

ccgtcctcgc ggggccgcag cgtgaagacg ccgccgcgca tctccaggtg gccggggggg 9452

tccccgttgg gcagggagag ggcgctgacg atgcatctta tcaattgccc cgtagggact 9512

ccgcgcaagg acctgagcgt ctcgagatcc acgggatctg aaaaccgttg aacgaaggct 9572

tcgagccagt cgcagtcgca aggtaggctg agcacggttt cttctggcgg gtcatgttgg 9632

ggagcggggc gggcgatgct gctggtgatg aagttgaaat aggcggttct gagacggcgg 9692

atggtggcga ggagcaccag gtctttgggc ccggcttgct ggatgcgcag acggtcggcc 9752

atgccccagg cgtggtcctg acacctggcc aggtccttgt agtagtcctg catgagccgc 9812

tccacgggca cctcctcctc gcccgcgcgg ccgtgcatgc gcgtgagccc gaagccgcgc 9872

tggggctgga cgagcgccag gtcggcgacg acgcgctcgg cgaggatggc ctgctgtatc 9932

tgggtgaggg tggtctggaa gtcgtcaaag tcgacgaagc ggtggtaggc tccggtgttg 9992

atggtgtagg agcagttggc catgacggac cagttgacgg tctggtgacc gggacgcacg 10052

agctcgtggt acttgaggcg cgagtaggcg cgcgtgtcga agatgtagtc gttgcaggtg 10112

cgcaccaggt actgatagcc gatgaggaag tgcggcggcg gctggcggta gagcggccat 10172

cgctcggtgg cgggggcgcc gggcgcgagg tcctcgagca tggtgcggtg gtagccgtag 10232

atgtacctgg acatccaggt gatgccggcg gcggtggtgg aggcgcgcgg gaactcgcgg 10292

acgcggttcc agatgttgcg cagcggcagg aagtagttca tggtgggcac ggtctggccc 10352

gtgaggcgcg cgcagtcgtg gatgctctat acgggcaaaa acgaaagcgg tcagcggctc 10412

gactccgtgg cctggaggct aagcgaacgg gttgggctgc gcgtgtaccc cggttcgaat 10472

ctcgaatcag gctggagccg cagctaacgt ggtactggca ctcccgtctc gacccaagcc 10532

tgcaccaacc ctccaggata cggaggcggg tcgttttgca acttttttcg gaggcctgaa 10592

aactagtaag cgcggaaagc ggccgaccgc gatggctcgc tgccgtagtc tggagaagaa 10652

tcgccagggt tgcgttgcgg tgtgccccgg ttcgaggccg gccggattcc gcggctaacg 10712

agggcgtggc tgccccgtcg tttccaagac cccctagcca gccgacttct ccagttacgg 10772

agcgagcccc tcttttgttt tgtttgtttt tgccag atg cat ccc gta ctg cgg 10826

Met His Pro Val Leu Arg

350

cag atg cgc ccc cac cac cct cca ccg caa caa cag ccc cct cca cag 10874

Gln Met Arg Pro His His Pro Pro Pro Gln Gln Gln Pro Pro Pro Gln

355 360 365

ccg gcg ctt ctg ccc ccg ccc cag cag caa ctt cca gcc acg acc gcc 10922

Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln Leu Pro Ala Thr Thr Ala

370 375 380 385

gcg gcc gcc gtg agc ggg gct ggc cag act tct cag tat gac cac ctg 10970

Ala Ala Ala Val Ser Gly Ala Gly Gln Thr Ser Gln Tyr Asp His Leu

390 395 400

gcc ttg gaa gag ggc gag ggg ctg gcg cgc ctg ggg gcg tcg tcg ccg 11018

Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro

405 410 415

gag cgg cac ccg cgc gtg cag atg aaa agg gac gct cgc gag gcc tac 11066

Glu Arg His Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr

420 425 430

gtg ccc aag cag aac ctg ttc aga gac agg agc ggc gag gag ccc gag 11114

Val Pro Lys Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu

435 440 445

gag atg cgc gcg gcc cgg ttc cac gcg ggg cgg gag ctg cgc cgc ggc 11162

Glu Met Arg Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly

450 455 460 465

ctg gac cga aag agg gtg ctg agg gac gag gat ttc gag gcg gac gag 11210

Leu Asp Arg Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu

470 475 480

ctg acg ggg atc agc ccc gcg cgc gcg cac gtg gcc gcg gcc aac ctg 11258

Leu Thr Gly Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu

485 490 495

gtc acg gcg tac gag cag acc gtg aag gag gag agc aac ttc caa aaa 11306

Val Thr Ala Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys

500 505 510

tcc ttc aac aac cac gtg cgc acc ctg atc gcg cgc gag gag gtg acc 11354

Ser Phe Asn Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr

515 520 525

ctg ggc ctg atg cac ctg tgg gac ctg ctg gag gcc atc gtg cag aac 11402

Leu Gly Leu Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn

530 535 540 545

ccc acc agc aag ccg ctg acg gcg cag ctg ttc ctg gtg gtg caa cat 11450

Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His

550 555 560

agt cgg gac aac gag gcg ttc agg gag gcg ctg ctg aat atc acc gag 11498

Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu

565 570 575

ccc gag ggc cgc tgg ctc ctg gac ctg gtg aac att ctg cag agc atc 11546

Pro Glu Gly Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile

580 585 590

gtg gtg cag gag cgc ggg ctg ccg ctg tcc gag aag ctg gcg gcc att 11594

Val Val Gln Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile

595 600 605

aac ttc tcg gtg ctg agt ctg ggc aag tac tac gct agg aag atc tac 11642

Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr

610 615 620 625

aag acc ccg tac gtg ccc ata gac aag gag gtg aag atc gac ggg ttt 11690

Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe

630 635 640

tac atg cgc atg acc ctg aaa gtg ctg acc ctg agc gac gat ctg ggg 11738

Tyr Met Arg Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly

645 650 655

gtg tac cgc aac gac agg atg cac cgc gcg gtg agc gcc agc agg cgg 11786

Val Tyr Arg Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg Arg

660 665 670

cgc gag ctg agc gac cag gag ctg atg cac agc ctg cag cgg gcc ctg 11834

Arg Glu Leu Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu

675 680 685

acc ggg gcc ggg acc gag ggg gag agc tac ttt gac atg ggc gcg gac 11882

Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp

690 695 700 705

ctg cac tgg cag ccc agc cgc cgg gcc ttg gag gcg gcg gca gga cct 11930

Leu His Trp Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala Ala Gly Pro

710 715 720

tat gta gaa gag gtg gac gat gag gtg gac gag gag ggc gag tac ctg 11978

Tyr Val Glu Glu Val Asp Asp Glu Val Asp Glu Glu Gly Glu Tyr Leu

725 730 735

gaa gac tgatggcgcg accgtatttt tgctag atg caa caa caa cag cca cct 12031

Glu Asp Met Gln Gln Gln Gln Pro Pro

740 745

cct gat ccc gcg atg cgg gcg gcg ctg cag agc cag ccg tcc ggc att 12079

Pro Asp Pro Ala Met Arg Ala Ala Leu Gln Ser Gln Pro Ser Gly Ile

750 755 760

aac tcc tcg gac gat tgg acc cag gcc atg caa cgc atc atg gcg ctg 12127

Asn Ser Ser Asp Asp Trp Thr Gln Ala Met Gln Arg Ile Met Ala Leu

765 770 775

acg acc cgc aac ccc gaa gcc ttt aga cag cag ccc cag gcc aac cgg 12175

Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln Gln Pro Gln Ala Asn Arg

780 785 790

ctc tcg gcc atc ctg gag gcc gtg gtg ccc tcg cgc tcc aac ccc acg 12223

Leu Ser Ala Ile Leu Glu Ala Val Val Pro Ser Arg Ser Asn Pro Thr

795 800 805 810

cac gag aag gtc ctg gcc atc gtg aac gcg ctg gtg gag aac aag gcc 12271

His Glu Lys Val Leu Ala Ile Val Asn Ala Leu Val Glu Asn Lys Ala

815 820 825

atc cgc ggc gac gag gcc ggc ctg gtg tac aac gcg ctg ctg gag cgc 12319

Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg

830 835 840

gtg gcc cgc tac aac agc acc aac gtt cag acc aac ctg gac cgc atg 12367

Val Ala Arg Tyr Asn Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met

845 850 855

gtg acc gac gtg cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc 12415

Val Thr Asp Val Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg

860 865 870

gag tcc aac ctg gga tcc atg gtg gcg ctg aat gcc ttc ctc agc acc 12463

Glu Ser Asn Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser Thr

875 880 885 890

cag ccc gcc aac gtg ccc cgg ggc cag gag gac tac acc aac ttc atc 12511

Gln Pro Ala Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn Phe Ile

895 900 905

agt gcc ctg cgc ctg atg gtg acc gag gtg ccc cag agc gag gtg tac 12559

Ser Ala Leu Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu Val Tyr

910 915 920

cag tcc ggg ccg gac tac ttc ttc cag acc agt cgc cag ggc ttg cag 12607

Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu Gln

925 930 935

acc gtg aac ctg agc cag gct ttc aag aac ttg cag gga ttg tgg ggc 12655

Thr Val Asn Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu Trp Gly

940 945 950

gtg cag gcc ccg gtc ggg gac cgc gcg acg gtg tcg agc ctg ctg acg 12703

Val Gln Ala Pro Val Gly Asp Arg Ala Thr Val Ser Ser Leu Leu Thr

955 960 965 970

ccg aac tcg cgc ctg ctg ctg ctg ctg gtg gcc ccc ttc acg gac agc 12751

Pro Asn Ser Arg Leu Leu Leu Leu Leu Val Ala Pro Phe Thr Asp Ser

975 980 985

ggc agc atc aac cgc aac tcg tac ctg ggc tac ctg att aac ctg tac 12799

Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly Tyr Leu Ile Asn Leu Tyr

990 995 1000

cgc gag gcc atc ggc cag gcg cac gtg gac gag cag acc tac cag 12844

Arg Glu Ala Ile Gly Gln Ala His Val Asp Glu Gln Thr Tyr Gln

1005 1010 1015

gag atc acc cac gtg agc cgc gcc ctg ggc cag gac gac ccg ggc 12889

Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln Asp Asp Pro Gly

1020 1025 1030

aac ctg gaa gcc acc ctg aac ttt ttg ctg acc aac cgg tcg cag 12934

Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn Arg Ser Gln

1035 1040 1045

aag atc ccg ccc cag tac gcg ctc agc gcc gag gag gag cgc att 12979

Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu Arg Ile

1050 1055 1060

ctg cga tac gtg cag cag agc gtg ggc ctg ttc ctg atg cag gag 13024

Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln Glu

1065 1070 1075

ggg gcc acc ccc agc gcc gcg ctc gac atg acc gcg cgc aac atg 13069

Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met

1080 1085 1090

gag ccc agc atg tac gcc agc aac cgc ccg ttc atc aat aaa ctg 13114

Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu

1095 1100 1105

atg gac tac ttg cat cgg gcg gcc gcc atg aac tct gac tat ttc 13159

Met Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe

1110 1115 1120

acc aac gcc atc ctg aat ccc cac tgg ctc ccg cca ccg ggg ttc 13204

Thr Asn Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe

1125 1130 1135

tac acg ggc gag tac gac atg ccc gac ccc aat gac ggg ttc ctg 13249

Tyr Thr Gly Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu

1140 1145 1150

tgg gac gat gtg gac agc agc gtg ttc tcc ccc cga ccg ggt gct 13294

Trp Asp Asp Val Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala

1155 1160 1165

aac gag cgc ccc ttg tgg aag aag gaa ggc agc gac cga cgc ccg 13339

Asn Glu Arg Pro Leu Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro

1170 1175 1180

tcc tcg gcg ctg tcc ggc cgc gag ggt gct gcc gcg gcg gtg ccc 13384

Ser Ser Ala Leu Ser Gly Arg Glu Gly Ala Ala Ala Ala Val Pro

1185 1190 1195

gaa gcc gcc agt cct ttc ccg agc ttg ccc ttc tcg cta aac agc 13429

Glu Ala Ala Ser Pro Phe Pro Ser Leu Pro Phe Ser Leu Asn Ser

1200 1205 1210

att cgc agc agc gag ctg ggc agg atc acg cgc ccg cgc ttg ctg 13474

Ile Arg Ser Ser Glu Leu Gly Arg Ile Thr Arg Pro Arg Leu Leu

1215 1220 1225

ggc gag gag gag tac ttg aat gac tcg ctg ttg aga ccc gag cgg 13519

Gly Glu Glu Glu Tyr Leu Asn Asp Ser Leu Leu Arg Pro Glu Arg

1230 1235 1240

gag aag aac ttc ccc aat aac gga ata gag agc ctg gtg gac aag 13564

Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu Ser Leu Val Asp Lys

1245 1250 1255

atg agc cgc tgg aaa acg tat gcg cag gag cac agg gac gat ccc 13609

Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His Arg Asp Asp Pro

1260 1265 1270

cgg gcg tcg cag ggg gcc acg agc cgg ggc agc gcg gcc cgt aaa 13654

Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala Ala Arg Lys

1275 1280 1285

cgc cgg tgg cac gac agg cag cgg gga ctg atg tgg gac gat gag 13699

Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp Asp Glu

1290 1295 1300

gat tcc gcc gac gac agc agc gtg ttg gac ttg ggt ggg agt ggt 13744

Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser Gly

1305 1310 1315

ggt aac ccg ttc gct cac ctg cgc ccc cgc atc ggg cgc atg atg 13789

Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met

1320 1325 1330

taagaaaccg aaaataaatg atactcacca aggccatggc gaccagcgtg cgttcgtttc 13849

ttctctgttg tttagt atg atg agg cgt gcg tac ccg gag ggt cct cct 13898

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro

1335 1340

ccc tcg tac gag agc gtg atg cag cag gcg atg gcg gcg gcg atg 13943

Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Met Ala Ala Ala Met

1345 1350 1355

cag ccc ccg ctg gag gct cct tac gtg ccc ccg cgg tac ctg gcg 13988

Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala

1360 1365 1370

cct acg gag ggg cgg aac agc att cgt tac tcg gag ctg gca ccc 14033

Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro

1375 1380 1385

ttg tac gat acc acc cgg ttg tac ctg gtg gac aac aag tcg gcg 14078

Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala

1390 1395 1400

gac atc gcc tcg ctg aac tac cag aac gac cac agc aac ttc ctg 14123

Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu

1405 1410 1415

acc acc gtg gtg cag aac aat gac ttc acc ccc acg gag gcc agc 14168

Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser

1420 1425 1430

acc cag acc atc aac ttt gac gag cgc tcg cgg tgg ggc ggc cag 14213

Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln

1435 1440 1445

ctg aaa acc atc atg cac acc aac atg ccc aac gtg aac gag ttc 14258

Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe

1450 1455 1460

atg tac agc aac aag ttc aag gcg cgg gtg atg gtc tcc cgc aag 14303

Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys

1465 1470 1475

acc ccc aac ggg gtc aaa gta gat gat gac tat gat ggt agt cag 14348

Thr Pro Asn Gly Val Lys Val Asp Asp Asp Tyr Asp Gly Ser Gln

1480 1485 1490

gat atc ttg gag tat gag tgg gtg gag ttt gag ctg ccc gaa ggc 14393

Asp Ile Leu Glu Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly

1495 1500 1505

aac ttc tcg gtg acc atg acc atc gac ctg atg aac aac gcc att 14438

Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile

1510 1515 1520

atc gac aat tac ttg gcg gtg ggg cgg cag aac ggg gtg ctg gag 14483

Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu

1525 1530 1535

agc gac atc ggc gtg aag ttc gac acg cgc aac ttc agg ctc ggt 14528

Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly

1540 1545 1550

tgg gac ccc gtg acc gag ctg gtc atg ccg ggc gtg tac acc aac 14573

Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn

1555 1560 1565

gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc gtg 14618

Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val

1570 1575 1580

gac ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc aag 14663

Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys

1585 1590 1595

cgg cag ccc ttc cag gag ggt ttc cag atc atg tac gag gac ctg 14708

Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu

1600 1605 1610

gag ggg ggc aac atc ccc gcg ctg ctg gac gtc gaa gcc tac gag 14753

Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu

1615 1620 1625

aaa agc aag gag gag gcc gcc gca gcg gcg acc gcg gcc gtg gct 14798

Lys Ser Lys Glu Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala

1630 1635 1640

act gct gcg acc acc gat gca gat gca gct act act acc agg ggc 14843

Thr Ala Ala Thr Thr Asp Ala Asp Ala Ala Thr Thr Thr Arg Gly

1645 1650 1655

gat aca ttc gcc acc cag gcg gag gaa gca gcc gcc cta gcg gcg 14888

Asp Thr Phe Ala Thr Gln Ala Glu Glu Ala Ala Ala Leu Ala Ala

1660 1665 1670

acc gat gat agt gaa agt aag ata gtc atc aag ccg gtg gag aag 14933

Thr Asp Asp Ser Glu Ser Lys Ile Val Ile Lys Pro Val Glu Lys

1675 1680 1685

gac agc aag gac agg agc tac aac gtt cta tcg gat gga aag aac 14978

Asp Ser Lys Asp Arg Ser Tyr Asn Val Leu Ser Asp Gly Lys Asn

1690 1695 1700

acc gcc tac cgc agc tgg tac ctg gcc tac aac tac ggc gac cct 15023

Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro

1705 1710 1715

gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc tcg gac gtc 15068

Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val

1720 1725 1730

acc tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc gac atg atg 15113

Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met

1735 1740 1745

caa gac ccg gtc acc ttc cgc tcc acg cgt caa gtt agc aac tac 15158

Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr

1750 1755 1760

ccg gtg gtg ggc gcc gag ctc ctg ccc gtc tac tcc aag agc ttc 15203

Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe

1765 1770 1775

ttc aac gag cag gcc gtc tac tcg cag cag ctg cgc gcc ttc acc 15248

Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr

1780 1785 1790

tcg ctc acg cac gtc ttc aac cgc ttc ccc gag aac cag atc ctc 15293

Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu

1795 1800 1805

gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt gaa aac gtt 15338

Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val

1810 1815 1820

cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc agc agt atc 15383

Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile

1825 1830 1835

cgg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc acc 15428

Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr

1840 1845 1850

tgc ccc tac gtc tac aag gcc ctg ggc ata gtc gcg ccg cgc gtc 15473

Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val

1855 1860 1865

ctc tcg agc cgc acc ttc taaaaa atg tcc att ctc atc tcg ccc agt 15521

Leu Ser Ser Arg Thr Phe Met Ser Ile Leu Ile Ser Pro Ser

1870 1875 1880

aat aac acc ggt tgg ggc ctg cgc gcg ccc agc aag atg tac gga 15566

Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser Lys Met Tyr Gly

1885 1890 1895

ggc gct cgc caa cgc tcc acg caa cac ccc gtg cgc gtg cgc ggg 15611

Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val Arg Val Arg Gly

1900 1905 1910

cac ttc cgc gct ccc tgg ggc gcc ctc aag ggc cgc gtg cgc tcg 15656

His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly Arg Val Arg Ser

1915 1920 1925

cgc acc acc gtc gac gac gtg atc gac cag gtg gtg gcc gac gcg 15701

Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val Val Ala Asp Ala

1930 1935 1940

cgc aac tac acg ccc gcc gcc gcg ccc gtc tcc acc gtg gac gcc 15746

Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr Val Asp Ala

1945 1950 1955

gtc atc gac agc gtg gtg gcc gac gcg cgc cgg tac gcc cgc gcc 15791

Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala Arg Ala

1960 1965 1970

aag agc cgg cgg cgg cgc atc gcc cgg cgg cac cgg agc acc ccc 15836

Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr Pro

1975 1980 1985

gcc atg cgc gcg gcg cga gcc ttg ctg cgc agg gcc agg cgc acg 15881

Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr

1990 1995 2000

gga cgc agg gcc atg ctc agg gcg gcc aga cgc gcg gcc tcc ggc 15926

Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly

2005 2010 2015

agc agc agc gcc ggc agg acc cgc aga cgc gcg gcc acg gcg gcg 15971

Ser Ser Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala

2020 2025 2030

gcg gcg gcc atc gcc agc atg tcc cgc ccg cgg cgc ggc aac gtg 16016

Ala Ala Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val

2035 2040 2045

tac tgg gtg cgc gac gcc gcc acc ggt gtg cgc gtg ccc gtg cgc 16061

Tyr Trp Val Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg

2050 2055 2060

acc cgc ccc cct cgc act tgaagatgct gacttcgcga tgttgatgtg 16109

Thr Arg Pro Pro Arg Thr

2065

tcccagcggc gaggagg atg tcc aag cgc aaa tac aag gaa gag atg ctc 16159

Met Ser Lys Arg Lys Tyr Lys Glu Glu Met Leu

2070 2075

cag gtc atc gcg cct gag atc tac ggc ccc gcg gcg gcg gtg aag 16204

Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys

2080 2085 2090

gag gaa aga aag ccc cgc aaa ctg aag cgg gtc aaa aag gac aaa 16249

Glu Glu Arg Lys Pro Arg Lys Leu Lys Arg Val Lys Lys Asp Lys

2095 2100 2105

aag gag gag gaa gat gac gga ctg gtg gag ttt gtg cgc gag ttc 16294

Lys Glu Glu Glu Asp Asp Gly Leu Val Glu Phe Val Arg Glu Phe

2110 2115 2120

gcc ccc cgg cgg cgc gtg cag tgg cgc ggg cgg aaa gtg aaa ccg 16339

Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg Lys Val Lys Pro

2125 2130 2135

gtg ctg cgg ccc ggc acc acg gtg gtc ttc acg ccc ggc gag cgt 16384

Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr Pro Gly Glu Arg

2140 2145 2150

tcc ggc tcc gcc tcc aag cgc tcc tac gac gag gtg tac ggg gac 16429

Ser Gly Ser Ala Ser Lys Arg Ser Tyr Asp Glu Val Tyr Gly Asp

2155 2160 2165

gag gac atc ctc gag cag gcg gcc gag cgt ctg ggc gag ttt gct 16474

Glu Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu Gly Glu Phe Ala

2170 2175 2180

tac ggc aag cgc agc cgc ccc gcg ccc ttg aaa gag gag gcg gtg 16519

Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu Lys Glu Glu Ala Val

2185 2190 2195

tcc atc ccg ctg gac cac ggc aac ccc acg ccg agc ctg aag ccg 16564

Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu Lys Pro

2200 2205 2210

gtg acc ctg cag cag gtg ctg ccg agc gcg gcg ccg cgc cgg ggc 16609

Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg Gly

2215 2220 2225

ttc aag cgc gag ggc ggc gag gat ctg tac ccg acc atg cag ctg 16654

Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu

2230 2235 2240

atg gtg ccc aag cgc cag aag ctg gag gac gtg ctg gag cac atg 16699

Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu His Met

2245 2250 2255

aag gtg gac ccc gag gtg cag ccc gag gtc aag gtg cgg ccc atc 16744

Lys Val Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile

2260 2265 2270

aag cag gtg gcc ccg ggc ctg ggc gtg cag acc gtg gac atc aag 16789

Lys Gln Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys

2275 2280 2285

atc ccc acg gag ccc atg gaa acg cag act gag ccc gtg aag ccc 16834

Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro

2290 2295 2300

agc acc agc acc atg gag gtg cag acg gat ccc tgg atg ccg gcg 16879

Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala

2305 2310 2315

ccg gct tcc acc acc acc acc cgc cga aga cgc aag tac ggc gcg 16924

Pro Ala Ser Thr Thr Thr Thr Arg Arg Arg Arg Lys Tyr Gly Ala

2320 2325 2330

gcc agc ctg ctg atg ccc aac tac gcg ctg cat cct tcc atc atc 16969

Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile

2335 2340 2345

ccc acg ccg ggc tac cgc ggc acg cgc ttc tac cgc ggc tac acc 17014

Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly Tyr Thr

2350 2355 2360

agc agc cgc cgc cgc aag acc acc acc cgc cgc cgc cgt cgt cgc 17059

Ser Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Arg

2365 2370 2375

agc cgc cgc agc agc acc gcg act tcc gcc ttg gtg cgg aga gtg 17104

Ser Arg Arg Ser Ser Thr Ala Thr Ser Ala Leu Val Arg Arg Val

2380 2385 2390

tac cgc agc ggg cgc gag cct ctg acc ctg ccg cgc gcg cgc tac 17149

Tyr Arg Ser Gly Arg Glu Pro Leu Thr Leu Pro Arg Ala Arg Tyr

2395 2400 2405

cac ccg agc atc gcc att taactaccgc ctcctacttg cagat atg gcc ctc 17201

His Pro Ser Ile Ala Ile Met Ala Leu

2410 2415

aca tgc cgc ctc cgc gtc ccc att acg ggc tac cga gga aga aag 17246

Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly Arg Lys

2420 2425 2430

ccg cgc cgt aga agg ctg acg ggg aac ggg ctg cgt cgc cat cac 17291

Pro Arg Arg Arg Arg Leu Thr Gly Asn Gly Leu Arg Arg His His

2435 2440 2445

cac cgg cgg cgg cgc gcc atc agc aag cgg ttg ggg gga ggc ttc 17336

His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe

2450 2455 2460

ctg ccc gcg ctg atc ccc atc atc gcc gcg gcg atc ggg gcg atc 17381

Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile

2465 2470 2475

ccc ggc ata gct tcc gtg gcg gtg cag gcc tct cag cgc cac 17423

Pro Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

2480 2485 2490

tgagacacag cttggaaaat ttgtaataaa aaatggactg acgctcctgg tcctgtgatg 17483

tgtgttttta g atg gaa gac atc aat ttt tcg tcc ctg gca ccg cga 17530

Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg

2495 2500

cac ggc acg cgg ccg ttt atg ggc acc tgg agc gac atc ggc aac 17575

His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Asn

2505 2510 2515

agc caa ctg aac ggg ggc gcc ttc aat tgg agc agt ctc tgg agc 17620

Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser

2520 2525 2530

ggg ctt aag aat ttc ggg tcc acg ctc aaa acc tat ggc aac aag 17665

Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr Gly Asn Lys

2535 2540 2545

gcg tgg aac agc agc aca ggg cag gcg ctg agg gaa aag ctg aaa 17710

Ala Trp Asn Ser Ser Thr Gly Gln Ala Leu Arg Glu Lys Leu Lys

2550 2555 2560

gag cag aac ttc cag cag aag gtg gtc gat ggc ctg gcc tcg ggc 17755

Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu Ala Ser Gly

2565 2570 2575

atc aac ggg gtg gtg gac ctg gcc aac cag gcc gtg cag aaa cag 17800

Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val Gln Lys Gln

2580 2585 2590

atc aac agc cgc ctg gac gcg gtc ccg ccc gct ggg tcc gtg gac 17845

Ile Asn Ser Arg Leu Asp Ala Val Pro Pro Ala Gly Ser Val Asp

2595 2600 2605

atg ccc cag gtg gag gag gag ctg cct ccc ctg gac aag cgc ggc 17890

Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp Lys Arg Gly

2610 2615 2620

gac aag cga ccg cgt ccc gac gct gag gag acg ctg ctg acg cac 17935

Asp Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu Thr His

2625 2630 2635

acg gac gag ccg ccc ccg tac gag gag gcg gtg aaa ctg ggt ctg 17980

Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly Leu

2640 2645 2650

ccc acc acg cgg ccc gtg gcg cct ctg gcc acc ggg gtg ctg aaa 18025

Pro Thr Thr Arg Pro Val Ala Pro Leu Ala Thr Gly Val Leu Lys

2655 2660 2665

ccc agc agc agc agc cag ccc gcg acc ctg gac ttg cct ccg cct 18070

Pro Ser Ser Ser Ser Gln Pro Ala Thr Leu Asp Leu Pro Pro Pro

2670 2675 2680

gct tcc cgc ccc tcc aca gtg gct aag ccc ctg ccg ccg gtg gcc 18115

Ala Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala

2685 2690 2695

gtc gcg tcg cgc gcc ccc cga ggc cgc ccc cag gcg aac tgg cag 18160

Val Ala Ser Arg Ala Pro Arg Gly Arg Pro Gln Ala Asn Trp Gln

2700 2705 2710

agc act ctg aac agc atc gtg ggt ctg gga gtg cag agt gtg aag 18205

Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys

2715 2720 2725

cgc cgc cgc tgc tat taaaagacac tgtagcgctt aacttgcttg tctgtgtgt 18259

Arg Arg Arg Cys Tyr

2730

atg tat gtc cgc cga cca gaa gga gga gga aga ggc gcg tcg ccg 18304

Met Tyr Val Arg Arg Pro Glu Gly Gly Gly Arg Gly Ala Ser Pro

2735 2740 2745

agt tgc aag atg gcc acc cca tcg atg ctg ccc cag tgg gcg tac 18349

Ser Cys Lys Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr

2750 2755 2760

atg cac atc gcc gga cag gac gct tcg gag tac ctg agt ccg ggt 18394

Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly

2765 2770 2775

ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg ggg 18439

Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly

2780 2785 2790

aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18484

Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr

2795 2800 2805

acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac 18529

Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp

2810 2815 2820

cgc gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc 18574

Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala

2825 2830 2835

gtg ggt gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac 18619

Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp

2840 2845 2850

atc cgc ggc gtg ctg gat cgg ggc ccc agc ttc aaa ccc tac tcc 18664

Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser

2855 2860 2865

ggc acc gcc tac aac gct ctg gct ccc aaa gct gct cct aat cct 18709

Gly Thr Ala Tyr Asn Ala Leu Ala Pro Lys Ala Ala Pro Asn Pro

2870 2875 2880

agt cag tgg tat gaa acc acc gag gga aca aat gga aac gct gac 18754

Ser Gln Trp Tyr Glu Thr Thr Glu Gly Thr Asn Gly Asn Ala Asp

2885 2890 2895

gcc act act act aaa cat agt ttt ggt ttg gct gcc atg aag ggc 18799

Ala Thr Thr Thr Lys His Ser Phe Gly Leu Ala Ala Met Lys Gly

2900 2905 2910

gat aat att gca gct gaa gga ctg caa ata gga aca gat tct acc 18844

Asp Asn Ile Ala Ala Glu Gly Leu Gln Ile Gly Thr Asp Ser Thr

2915 2920 2925

tct gga tct gat aca ccc atc tat gca gat aaa ctg tat cag cca 18889

Ser Gly Ser Asp Thr Pro Ile Tyr Ala Asp Lys Leu Tyr Gln Pro

2930 2935 2940

gag cct cag ata gga gag gaa tca tgg aca gac aac gat gga acc 18934

Glu Pro Gln Ile Gly Glu Glu Ser Trp Thr Asp Asn Asp Gly Thr

2945 2950 2955

aat gaa aaa ttc ggt gga aga gtt ctt aaa cct gac acc aac atg 18979

Asn Glu Lys Phe Gly Gly Arg Val Leu Lys Pro Asp Thr Asn Met

2960 2965 2970

aaa ccc tgc tat ggc tca ttt gcc aaa ccc act aat aac aaa ggt 19024

Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Asn Lys Gly

2975 2980 2985

ggt caa gca aaa caa aag gca acg gaa gga aca gct gtg gat tat 19069

Gly Gln Ala Lys Gln Lys Ala Thr Glu Gly Thr Ala Val Asp Tyr

2990 2995 3000

gat gtt gac atg aac ttt ttc gat ggt aga gat gca gct gct aac 19114

Asp Val Asp Met Asn Phe Phe Asp Gly Arg Asp Ala Ala Ala Asn

3005 3010 3015

ttt act cca gaa gta gtg ttg tat gct gaa aat gtg gat ttg gaa 19159

Phe Thr Pro Glu Val Val Leu Tyr Ala Glu Asn Val Asp Leu Glu

3020 3025 3030

act cca gac acc cat att gta tac aaa cca gga act tca gat ttg 19204

Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Ser Asp Leu

3035 3040 3045

agt tcc cat gtg aat ttg ggt caa cag tcc atg ccc aac aga ccc 19249

Ser Ser His Val Asn Leu Gly Gln Gln Ser Met Pro Asn Arg Pro

3050 3055 3060

aac tac att gga ttc aga gat aac ttt atc ggg ctt atg tac tac 19294

Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr

3065 3070 3075

aac agc act ggc aat atg ggt gta ctg gct ggt cag gcc tcc cag 19339

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln

3080 3085 3090

ctg aat gct gtg gtg gac ttg cag gac aga aac acc gaa ctg tcc 19384

Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser

3095 3100 3105

tac cag ctc ttg ctt gac tct ctg ggt gac aga acc agg tat ttc 19429

Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe

3110 3115 3120

agt atg tgg aat cag gcg gtg gac agt tat gac ccc gat gtg cgc 19474

Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg

3125 3130 3135

att att gaa aac cat ggt ata gaa gat gaa ctt ccc aac tac tgt 19519

Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu Pro Asn Tyr Cys

3140 3145 3150

ttc cca att gat gct gtg ggc atc acc aga acc tat caa ggc ata 19564

Phe Pro Ile Asp Ala Val Gly Ile Thr Arg Thr Tyr Gln Gly Ile

3155 3160 3165

aaa gta caa aat gga act ggc act gga acg aca tgg gaa aaa gat 19609

Lys Val Gln Asn Gly Thr Gly Thr Gly Thr Thr Trp Glu Lys Asp

3170 3175 3180

act gat gtt agt aca gcc aac gaa ata gga att ggc aac aat ctg 19654

Thr Asp Val Ser Thr Ala Asn Glu Ile Gly Ile Gly Asn Asn Leu

3185 3190 3195

gcc atg gag atc aac atc cag gcc aac ctg tgg cgg aac ttc ctc 19699

Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu

3200 3205 3210

tac gcg aac gtg gcg ctg tac ctg ccc gac tcc tac aag tac acg 19744

Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr

3215 3220 3225

ccg gcc aac atc acg ctg ccc acc aac acc aac acc tac gat tac 19789

Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr

3230 3235 3240

atg aac ggc cgc gtg gtg gcg ccc tcg ctg gtg gac gcc tac atc 19834

Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile

3245 3250 3255

aac atc ggg gcg cgc tgg tcg ctg gac ccc atg gac aac gtc aat 19879

Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn

3260 3265 3270

ccc ttc aac cac cac cgc aac gcg ggc ctg cgc tac cgc tcc atg 19924

Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met

3275 3280 3285

ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac atc cag gtg ccc 19969

Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro

3290 3295 3300

caa aag ttt ttc gcc atc aag agc ctc ctg ctc ctg ccc ggg tcc 20014

Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser

3305 3310 3315

tac acc tac gag tgg aac ttc cgc aag gac gtc aac atg atc ctg 20059

Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu

3320 3325 3330

cag agc tcc ctc ggc aac gac ctg cgc acg gac ggg gcc tcc atc 20104

Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile

3335 3340 3345

gcc ttc acc agc atc aac ctc tac gcc acc ttc ttc ccc atg gcg 20149

Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala

3350 3355 3360

cac aac acc gcc tcc acg ctc gag gcc atg ctg cgc aac gac acc 20194

His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr

3365 3370 3375

aac gac cag tcc ttc aac gac tac ctc tcg gcg gcc aac atg ctc 20239

Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu

3380 3385 3390

tac ccc atc ccg gcc aac gcc acc aac gtg ccc atc tcc atc ccc 20284

Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro

3395 3400 3405

tcg cgc aac tgg gcc gcc ttc cgc gga tgg tcc ttc acg cgc ctc 20329

Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu

3410 3415 3420

aag acc cgc gag acg ccc tcg ctc ggc tcc ggg ttc gac ccc tac 20374

Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr

3425 3430 3435

ttc gtc tac tcg ggc tcc atc ccc tac ctc gac ggc acc ttc tac 20419

Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr

3440 3445 3450

ctc aac cac acc ttc aag aag gtc tcc atc acc ttc gac tcc tcc 20464

Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser

3455 3460 3465

gtc agc tgg ccc ggc aac gac cgc ctc ctg acg ccc aac gag ttc 20509

Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe

3470 3475 3480

gaa atc aag cgc acc gtc gac gga gag ggg tac aac gtg gcc cag 20554

Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln

3485 3490 3495

tgc aac atg acc aag gac tgg ttc ctg gtc cag atg ctg gcc cac 20599

Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His

3500 3505 3510

tac aac atc ggc tac cag ggc ttc tac gtg ccc gag ggc tac aag 20644

Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys

3515 3520 3525

gac cgc atg tac tcc ttc ttc cgc aac ttc cag ccc atg agc cgc 20689

Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg

3530 3535 3540

cag gtc gtg gac gag gtc aac tac aag gac tac cag gcc gtc acc 20734

Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr

3545 3550 3555

ctg gcc tac cag cac aac aac tcg ggc ttc gtc ggc tac ctc gcg 20779

Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala

3560 3565 3570

ccc acc atg cgc cag ggc cag ccc tac ccc gcc aac tac ccc tac 20824

Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr

3575 3580 3585

ccg ctc atc ggc aag agc gcc gtc acc agc gtc acc cag aaa aag 20869

Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr Gln Lys Lys

3590 3595 3600

ttc ctc tgc gac cgg gtc atg tgg cgc atc ccc ttc tcc agc aac 20914

Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn

3605 3610 3615

ttc atg tcc atg ggc gcg ctc acc gac ctc ggc cag aac atg ctc 20959

Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu

3620 3625 3630

tat gcc aac tcc gcc cac gcg cta gac atg aat ttc gaa gtc gac 21004

Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp

3635 3640 3645

ccc atg gat gag tcc acc ctt ctc tat gtt gtc ttc gaa gtc ttc 21049

Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe

3650 3655 3660

gac gtc gtc cga gtg cac cag ccc cac cgc ggc gtc atc gag gcc 21094

Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala

3665 3670 3675

gtc tac ctg cgc acg ccc ttc tcg gcc ggc aac gcc acc acc 21136

Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr

3680 3685 3690

taagccgctc ttgcttcttg caag atg acg gcc tgt ggc tcc ggc gag cag 21187

Met Thr Ala Cys Gly Ser Gly Glu Gln

3695 3700

gag ctc agg gcc atc ctc cgc gac ctg ggc tgc ggg ccc tgc ttc 21232

Glu Leu Arg Ala Ile Leu Arg Asp Leu Gly Cys Gly Pro Cys Phe

3705 3710 3715

ctg ggc acc ttc gac aag cgc ttc ccg gga ttc atg gcc ccg cac 21277

Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala Pro His

3720 3725 3730

aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc gag acc ggg 21322

Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu Thr Gly

3735 3740 3745

ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc tcc cac acc 21367

Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg Ser His Thr

3750 3755 3760

tgc tac ctc ttc gac ccc ttc ggg ttc tca gac gag cgc ctc aag 21412

Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg Leu Lys

3765 3770 3775

cag atc tac cag ttc gag tac gag ggc ctg ctg cgc cgc agc gcc 21457

Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg Ser Ala

3780 3785 3790

ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag tcc acc cag 21502

Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser Thr Gln

3795 3800 3805

acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc tgc tgc 21547

Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys Cys

3810 3815 3820

atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc atg gac 21592

Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met Asp

3825 3830 3835

aag aac ccc acc atg aac ttg ctg acg ggg gtg ccc aac ggc atg 21637

Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met

3840 3845 3850

ctc cag tcg ccc cag gtg gaa ccc acc ctg cgc cgc aac cag gaa 21682

Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu

3855 3860 3865

gcg ctc tac cgc ttc ctc aac gcc cac tcc gcc tac ttt cgc tcc 21727

Ala Leu Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser

3870 3875 3880

cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac cgc atg aat 21772

His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn

3885 3890 3895

caa gac atg taaaccgtgt gtatgtgaat gctttattca tcataataaa 21821

Gln Asp Met

3900

cagcacatgt ttatgccacc ttctctgagg ctctgacttt atttagaaat cgaaggggtt 21881

ctgccggctc tcggcgtgcc ccgcgggcag ggatacgttg cggaactggt acttggacag 21941

ccacttgaac tcggggatca gcagcttggg cacggggagg tcggggaacg agtcgctcca 22001

cagcttgcgc gtgagttgca gggcgcccag caggtcgggc gcggagatct tgaaatcgca 22061

gttgggaccc gcgttctgcg cgcgagagtt acggtacacg gggttgcagc actggaacac 22121

catcagggcc gggtgcttca cgctcgccag caccgtcgcg tcggtgatgc cctccacgtc 22181

cagatcctcg gcgttggcca tcccgaaggg ggtcatcttg caggtctgcc gccccatgct 22241

gggcacgcag ccgggcttgt ggttgcaatc gcagtgcagg gggatcagca tcatctgggc 22301

ctgctcggag ctcatgcccg ggtacatggc cttcatgaaa gcctccagct ggcggaaggc 22361

ctgctgcgcc ttgccgccct cggtgaagaa gaccccacag gacttgctag agaactggtt 22421

ggtggcgcag cccgcgtcgt gcacgcagca gcgcgcgtcg ttgttggcca gctgcaccac 22481

gctgcgcccc cagcggttct gggtgatctt ggcccggtcg gggttctcct tcagcgcgcg 22541

ctgcccattc tcgctcgcca catccatctc gatcgtgtgc tccttctgga tcatcacggt 22601

cccgtgcagg caccgcagct tgccctcggc ctcggtgcac ccgtgcagcc acagcgcgca 22661

gccggtgctc tcccagttct tgtgggcgat ctgggagtgc gagtgcacga agccctgcag 22721

gaagcggccc atcatcgtgg tcagggtctt gttgctggtg aaggtcagcg ggatgccgcg 22781

gtgctcctcg ttcacataca ggtggcagat gcggcggtac acctcgccct gctcgggcat 22841

cagctggaag gcggacttca ggtcgctctc cacgcggtac cggtccatca gcagcgtcat 22901

cacttccatg cccttctccc aggccgaaac gatcggcagg ctcagggggt tcttcaccgt 22961

catcttagtc gccgccgccg aggtcagggg gtcgttctcg tccagggtct caaacactcg 23021

cttgccgtcc ttctcggtga tgcgcacggg ggggaaggcg aagcccacgg ccgccagctc 23081

ctcctcggcc tgcctttcgt cctcgctgtc ctggctgatg tcttgcaaag gcacatgctt 23141

ggtcttgcgg ggtttctttt tgggtggcag aggcggcggc ggagacgtgc tgggcgagcg 23201

cgagttttcg ctcaccacga ctatttcttc tccttggccg tcgtccgaga ccacgcggcg 23261

gtaggcatgc ctcttctggg gcagaggcgg aggcgacggg ctctcgcggt tcggcgggcg 23321

gctggcagag ccccttccgc gttcgggggt gcgctcctgg cggcgctgct ctgactgact 23381

tcctccgcgg ccggccatta tgttctccta gggagcaaca agc atg gag act cag 23436

Met Glu Thr Gln

cca tcg tcg cca aca tcg cca tct gcc ccc gcc gac gag aac cag 23481

Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Asp Glu Asn Gln

3905 3910 3915

cag cag aat gaa agc tta acc gcc ccg ccg ccc agc ccc acc tcc 23526

Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro Thr Ser

3920 3925 3930

gac gcc gca gcc cca gac atg caa gag atg gag aaa tcc atc gag 23571

Asp Ala Ala Ala Pro Asp Met Gln Glu Met Glu Lys Ser Ile Glu

3935 3940 3945

att gac ctg ggc tac gtg acg ccc gcg gag cac gag gag gag ctg 23616

Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu Glu Leu

3950 3955 3960

gca gcg cgc ttt tca gcc ccg gaa gag aac cac caa gag cag cca 23661

Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln Pro

3965 3970 3975

gag cag gaa gca gag agc gag cag cag cag gct ggg ctc gag cat 23706

Glu Gln Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly Leu Glu His

3980 3985 3990

ggc gac tac ctg agc ggg gca gag gac gtg ctc atc aag cat ctg 23751

Gly Asp Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His Leu

3995 4000 4005

gcc cgc caa tgc atc atc gtc aag gac gcg ctg ctc gac cgc gcc 23796

Ala Arg Gln Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala

4010 4015 4020

gag gtg ccc ctc agc gtg gcg gag ctc agc cgc gcc tac gag cgc 23841

Glu Val Pro Leu Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg

4025 4030 4035

aac ctc ttc tcg ccg cgc gtg ccc ccc aag cgc cag ccc aac ggc 23886

Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly

4040 4045 4050

acc tgc gag ccc aac ccg cgc ctc aac ttc tac ccg gtc ttc gcg 23931

Thr Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala

4055 4060 4065

gtg ccc gag gcc ctg gcc acc tac cac atc ttt ttc aag aac caa 23976

Val Pro Glu Ala Leu Ala Thr Tyr His Ile Phe Phe Lys Asn Gln

4070 4075 4080

aag atc ccc gtc tcc tgc cgc gcc aac cgc acc cgc gcc gac gcc 24021

Lys Ile Pro Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala

4085 4090 4095

ctg ctc aac ctg ggt ccc ggc gcc cgc cta cct gat atc gcc tcc 24066

Leu Leu Asn Leu Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser

4100 4105 4110

ttg gaa gag gtt ccc aag atc ttc gag ggt ctg ggc agc gac gag 24111

Leu Glu Glu Val Pro Lys Ile Phe Glu Gly Leu Gly Ser Asp Glu

4115 4120 4125

act cgg gcc gcg aac gct ctg caa gga agc gga gag gag cat gag 24156

Thr Arg Ala Ala Asn Ala Leu Gln Gly Ser Gly Glu Glu His Glu

4130 4135 4140

cac cac agc gcc ctg gtg gag ttg gaa ggc gac aac gcg cgc ctg 24201

His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn Ala Arg Leu

4145 4150 4155

gcg gtc ctc aag cgc acg gtc gag ctg acc cac ttc gcc tac ccg 24246

Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala Tyr Pro

4160 4165 4170

gcg ctc aac ctg ccc ccc aag gtc atg agc gcc gtc atg gac cag 24291

Ala Leu Asn Leu Pro Pro Lys Val Met Ser Ala Val Met Asp Gln

4175 4180 4185

gtg ctc atc aag cgc gcc tcg ccc ctc tcg gag gag gag atg cag 24336

Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu Met Gln

4190 4195 4200

gac ccc gag agc tcg gac gag ggc aag ccc gtg gtc agc gac gag 24381

Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp Glu

4205 4210 4215

cag ctg gcg cgc tgg ctg gga gcg agt agc acc ccc cag agc ctg 24426

Gln Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser Leu

4220 4225 4230

gaa gag cgg cgc aag ctc atg atg gcc gtg gtc ctg gtg acc gtg 24471

Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr Val

4235 4240 4245

gag ctg gag tgt ctg cgc cgc ttc ttc gcc gac gcg gag acc ctg 24516

Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu

4250 4255 4260

cgc aag gtc gag gag aac ctg cac tac ctc ttc agg cac ggg ttc 24561

Arg Lys Val Glu Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe

4265 4270 4275

gtg cgc cag gcc tgc aag atc tcc aac gtg gag ctg acc aac ctg 24606

Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu

4280 4285 4290

gtc tcc tac atg ggc atc ctg cac gag aac cgc ctg ggg cag aac 24651

Val Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn

4295 4300 4305

gtg ctg cac acc acc ctg cgc ggg gag gcc cgc cgc gac tac atc 24696

Val Leu His Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile

4310 4315 4320

cgc gac tgc gtc tac ctg tac ctc tgc cac acc tgg cag acg ggc 24741

Arg Asp Cys Val Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly

4325 4330 4335

atg ggc gtg tgg cag cag tgc ctg gag gag cag aac ctg aaa gag 24786

Met Gly Val Trp Gln Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu

4340 4345 4350

ctc tgc aag ctc ctg cag aag aac ctg aag gcc ctg tgg acc ggg 24831

Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys Ala Leu Trp Thr Gly

4355 4360 4365

ttc gac gag cgc acc acc gcc gcg gac ctg gcc gac ctc att ttc 24876

Phe Asp Glu Arg Thr Thr Ala Ala Asp Leu Ala Asp Leu Ile Phe

4370 4375 4380

ccc gag cgc ctg cgg ctg acg ctg cgc aac ggg ctg ccc gac ttt 24921

Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu Pro Asp Phe

4385 4390 4395

atg agc caa agc atg ttg caa aac ttt cgc tct ttc atc ctc gaa 24966

Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile Leu Glu

4400 4405 4410

cgc tcc ggg atc ctg ccc gcc acc tgc tcc gcg ctg ccc tcg gac 25011

Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro Ser Asp

4415 4420 4425

ttc gtg ccg ctg acc ttc cgc gag tgc ccc ccg ccg ctc tgg agc 25056

Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu Trp Ser

4430 4435 4440

cac tgc tac ttg ctg cgc ctg gcc aac tac ctg gcc tac cac tcg 25101

His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His Ser

4445 4450 4455

gac gtg atc gag gac gtc agc ggc gag ggt ctg ctc gag tgc cac 25146

Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys His

4460 4465 4470

tgt cgc tgc aac ctc tgc acg ccg cac cgc tcc ctg gcc tgc aac 25191

Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn

4475 4480 4485

ccc cag ctg ctg agc gag acc cag atc atc ggc acc ttc gag ttg 25236

Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu

4490 4495 4500

caa ggc ccc ggc gag gag ggc aag ggg ggt ctg aaa ctc acc ccg 25281

Gln Gly Pro Gly Glu Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro

4505 4510 4515

ggg ctg tgg acc tcg gcc tac ttg cgc aag ttc gtg ccc gag gac 25326

Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp

4520 4525 4530

tac cat ccc ttc gag atc agg ttc tac gag gac caa tcc cag ccg 25371

Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro

4535 4540 4545

ccc aag gcc gag ctg tcg gcc tgc gtc atc acc cag ggg gcc atc 25416

Pro Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile

4550 4555 4560

ctg gcc caa ttg caa gcc atc cag aaa tcc cgc caa gaa ttt ctg 25461

Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu

4565 4570 4575

ctg aaa aag ggc cac ggg gtc tac ttg gac ccc cag acc gga gag 25506

Leu Lys Lys Gly His Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu

4580 4585 4590

gag ctc aac ccc agc ttc ccc cag gat gcc ccg agg aag cag caa 25551

Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala Pro Arg Lys Gln Gln

4595 4600 4605

gaa gct gaa agt gga gct gcc gcc gcc gcc gga gga ttt gga gga 25596

Glu Ala Glu Ser Gly Ala Ala Ala Ala Ala Gly Gly Phe Gly Gly

4610 4615 4620

aga ctg gga gag cag tca ggc aga gga gat gga aga ctg gga cag 25641

Arg Leu Gly Glu Gln Ser Gly Arg Gly Asp Gly Arg Leu Gly Gln

4625 4630 4635

cac tca ggc aga gga gga cag cct gca aga cag tct gga gga gga 25686

His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Gly

4640 4645 4650

aga cga ggt gga gga gga ggc aga gga aga agc agc cgc cgc cag 25731

Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg Arg Gln

4655 4660 4665

acc gtc gtc ctc ggc gga gaa agc aag cag cac gga tac cat ctc 25776

Thr Val Val Leu Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu

4670 4675 4680

cgc tcc ggg tcg ggg tcg cgg cgg ccg ggc cca cag taggtgggac 25822

Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln

4685 4690 4695

gagaccgggc gcttcccgaa ccccaccacc cagaccggta agaaggagcg gcagggatac 25882

aagtcctggc gggggcacaa aaacgccatc gtctcctgct tgcaagcctg cgggggcaac 25942

atctccttca cccggcgcta cctgctcttc caccgcgggg tgaacttccc ccgcaacatc 26002

ttgcattact accgtcacct ccacagcccc tactactgtt tccaagaaga ggcagaaacc 26062

cagcagcagc agcagaaaac cagcggcagc tagaaaatcc acagcggcgg caggtggact 26122

gaggatcgcg gcgaacgagc cggcgcagac ccgggagctg aggaatcgga tctttcccac 26182

cctctatgcc atcttccagc agagtcgggg gcaggagcag gaactgaaag tcaagaaccg 26242

ttctctgcgc tcgctcaccc gcagttgtct gtatcacaag agcgaagacc aacttcagcg 26302

cactctcgag gacgccgagg ctctcttcaa caagtactgc gcgctcactc ttaaagagta 26362

gcccgcgccc gcccacacac ggaaaaaggc gggaattacg tcaccacctg cgcccttcgc 26422

catcatc atg agc aaa gag att ccc acg cct tac atg tgg agc tac 26468

Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr

4700 4705

cag ccc cag atg ggc ctg gcc gcc ggc gcc gcc cag gac tac tcc 26513

Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser

4710 4715 4720

acc cgc atg aac tgg ctc agt gcc ggg ccc gcg atg atc tca cgg 26558

Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg

4725 4730 4735

gtg aat gac atc ctc gcc cac cga aac cag ata ctc cta gaa cag 26603

Val Asn Asp Ile Leu Ala His Arg Asn Gln Ile Leu Leu Glu Gln

4740 4745 4750

tca gcg atc acc gcc acg ccc cgc cat cac ctt aat ccg cgt aat 26648

Ser Ala Ile Thr Ala Thr Pro Arg His His Leu Asn Pro Arg Asn

4755 4760 4765

tgg ccc gcc gcc ctg gtg tac cag gaa att ccc cag ccc acg acc 26693

Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr Thr

4770 4775 4780

gta cta ctt ccg cga gac gcc cag gcc gaa gtc cag ctg act aac 26738

Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr Asn

4785 4790 4795

tca ggt gtc cag ctg gcc ggc ggc gcc gcc ctg tgt cgt cac cgc 26783

Ser Gly Val Gln Leu Ala Gly Gly Ala Ala Leu Cys Arg His Arg

4800 4805 4810

ccc gct cag ggt ata aag cgg ctg gtg atc cga ggc aga ggc aca 26828

Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Gly Thr

4815 4820 4825

cag ctc aac gac gag gtg gtg agc tct tcg ctg ggt ctg cga cct 26873

Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg Pro

4830 4835 4840

gac gga gtc ttc caa ctc gcc gga tcg ggg aga tct tcc ttc acg 26918

Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe Thr

4845 4850 4855

cct cgt cag gcc gtc ctg act ttg gag agt tcg tcc tcg cag ccc 26963

Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln Pro

4860 4865 4870

cgc tcg ggc ggc atc ggc act ctc cag ttc gtg gag gag ttc act 27008

Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe Thr

4875 4880 4885

ccc tcg gtc tac ttc aac ccc ttc tcc ggc tcc ccc ggc cac tac 27053

Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr

4890 4895 4900

ccg gac gag ttc atc ccg aac ttc gac gcc atc agc gag tcg gtg 27098

Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val

4905 4910 4915

gac ggc tac gat tga atg tcc cat ggt ggc gcg gct gac cta gct 27143

Asp Gly Tyr Asp Met Ser His Gly Gly Ala Ala Asp Leu Ala

4920 4925 4930

cgg ctt cga cac ctg gac cac tgc cgc cgc ttc cgc tgc ttc gct 27188

Arg Leu Arg His Leu Asp His Cys Arg Arg Phe Arg Cys Phe Ala

4935 4940 4945

cgg gat ctc gcc gag ttt gcc tac ttt gag ctg ccc gag gag cac 27233

Arg Asp Leu Ala Glu Phe Ala Tyr Phe Glu Leu Pro Glu Glu His

4950 4955 4960

cct cag ggc ccg gcc cac gga gtg cgg atc gtc gtc gaa ggg ggc 27278

Pro Gln Gly Pro Ala His Gly Val Arg Ile Val Val Glu Gly Gly

4965 4970 4975

ctc gac tcc cac ctg ctt cgg atc ttc agc cag cga ccg atc ctg 27323

Leu Asp Ser His Leu Leu Arg Ile Phe Ser Gln Arg Pro Ile Leu

4980 4985 4990

gtc gag cgc gag caa gga cag acc cgt ctg acc ctg tac tgc atc 27368

Val Glu Arg Glu Gln Gly Gln Thr Arg Leu Thr Leu Tyr Cys Ile

4995 5000 5005

tgc aac cac ccc ggc ctg cat gaa agt ctt tgt tgt ctg ctg tgt 27413

Cys Asn His Pro Gly Leu His Glu Ser Leu Cys Cys Leu Leu Cys

5010 5015 5020

act gag tat aat aaa agc tgagatcagc gactactccg gactcgattg 27461

Thr Glu Tyr Asn Lys Ser

5025

tggtgttcct gctatcaacc ggtccctgtt cttcaccggg aacgagaccg agctccagct 27521

ccagtgtaag ccccacaaga agtatctcac ctggctgttc cagggctccc cgatcgccgt 27581

tgtcaaccac tgcgacaacg acggagtcct gctgagcggc cccgccaacc ttactttttc 27641

cacccgcaga agcaagctcc agctcttcca acccttcctc cccgggacct atcagtgcct 27701

ctcgggaccc tgccatcaca ccttccacct gatcccgaat accacagcgg cgctccccgc 27761

tactaacaac caaactaccc accaacgcca ccgtcgcgac ctttcctctg aatctaatac 27821

taccacccac accggaggtg agctccgagg tcaaccaacc tctgggattt actacggccc 27881

ctgggaggtg gtggggttaa tagcgctagg cctagttgcg ggtgggcttt tggctctctg 27941

ctacctatac ctcccttgct gttcttactt agtggtgctg tgttgctggt ttaagaa 27998

atg ggg aag atc acc cta gtg agc tgc ggt gcg ctg gtg gcg gtg 28043

Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val

5030 5035 5040

gtg ctt tcg att gtg gga ctg ggc ggc gcg gct gta gtg aag gag 28088

Val Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu

5045 5050 5055

aag gcc gat ccc tgc ttg cat ttc aat ccc gac aaa tgc cag ctg 28133

Lys Ala Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu

5060 5065 5070

agt ttt cag ccc gat ggc aat cgg tgc gcg gtg ctg atc aag tgc 28178

Ser Phe Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys

5075 5080 5085

gga tgg gaa tgc gag aac gtg aga atc gag tac aat aac aag act 28223

Gly Trp Glu Cys Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr

5090 5095 5100

cgg aac aat act ctc gcg tcc gtg tgg cag ccc ggg gac ccc gag 28268

Arg Asn Asn Thr Leu Ala Ser Val Trp Gln Pro Gly Asp Pro Glu

5105 5110 5115

tgg tac acc gtc tct gtc ccc ggt gct gac ggc tcc ccg cgc acc 28313

Trp Tyr Thr Val Ser Val Pro Gly Ala Asp Gly Ser Pro Arg Thr

5120 5125 5130

gtg aac aat act ttc att ttt gca cac atg tgc gac acg gtc atg 28358

Val Asn Asn Thr Phe Ile Phe Ala His Met Cys Asp Thr Val Met

5135 5140 5145

tgg atg agc aag cag tac gat atg tgg ccc ccc acg aag gag aac 28403

Trp Met Ser Lys Gln Tyr Asp Met Trp Pro Pro Thr Lys Glu Asn

5150 5155 5160

atc gtg gtc ttc tcc atc gct tac agc ctg tgc acg gcg cta atc 28448

Ile Val Val Phe Ser Ile Ala Tyr Ser Leu Cys Thr Ala Leu Ile

5165 5170 5175

acc gct atc gtg tgc ctg agc att cac atg ctc atc gct att cgc 28493

Thr Ala Ile Val Cys Leu Ser Ile His Met Leu Ile Ala Ile Arg

5180 5185 5190

ccc aga aat aat gcc gaa aaa gag aaa cag cca taacacgttt 28536

Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro

5195 5200 5205

tttcacacac ctttttcaga cc atg gcc tct gtt aaa ttt ttt gct tta 28585

Met Ala Ser Val Lys Phe Phe Ala Leu

5210

ttt gcc agt ctc att acc gtc att cat gga atg agt aat gag aaa 28630

Phe Ala Ser Leu Ile Thr Val Ile His Gly Met Ser Asn Glu Lys

5215 5220 5225

att act att tac act ggc act aat cac aca ttg aaa ggt cca gaa 28675

Ile Thr Ile Tyr Thr Gly Thr Asn His Thr Leu Lys Gly Pro Glu

5230 5235 5240

aaa tcc aca gaa gtt tca tgg tat tgt tat ttt aat gaa tca gat 28720

Lys Ser Thr Glu Val Ser Trp Tyr Cys Tyr Phe Asn Glu Ser Asp

5245 5250 5255

gta gct act gaa ctc tgt gga aac aac aac aaa aaa aat gag agc 28765

Val Ala Thr Glu Leu Cys Gly Asn Asn Asn Lys Lys Asn Glu Ser

5260 5265 5270

att act ctc atc aag ttt caa tgt gga tct gac tta acc ctc att 28810

Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp Leu Thr Leu Ile

5275 5280 5285

aac atc act aga gac tat gta ggt atg tat tat gga act aca gca 28855

Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly Thr Thr Ala

5290 5295 5300

ggc att tcg gac atg gaa ttt tat caa gtt tct gtg tct gaa ccc 28900

Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser Glu Pro

5305 5310 5315

acc acg cct aga atg acc aca acc aca aaa act aca cct act acc 28945

Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Thr Thr

5320 5325 5330

acc aca cag ctc act acc aat ggc ttt ttt gcc atg ctt caa gtg 28990

Thr Thr Gln Leu Thr Thr Asn Gly Phe Phe Ala Met Leu Gln Val

5335 5340 5345

gct gaa aat agc acc agc att caa ccc acc cca ccc agt gag gaa 29035

Ala Glu Asn Ser Thr Ser Ile Gln Pro Thr Pro Pro Ser Glu Glu

5350 5355 5360

att ccc aaa tcc atg att ggc att att gtt gct gta gtg gtg tgc 29080

Ile Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys

5365 5370 5375

atg ttg atc atc gcc ttg tgc atg gtg tac tat gcc ttc tgc tac 29125

Met Leu Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr

5380 5385 5390

aga aag cac aga ctg aac gac aag ctg gaa cac tta cta agt gtt 29170

Arg Lys His Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val

5395 5400 5405

gaa ttt taatttttta gaacc atg aag atc cta tgc ctt tta gtt ttt 29218

Glu Phe Met Lys Ile Leu Cys Leu Leu Val Phe

5410 5415 5420

tat atc att acc tct gct ctt tgt gaa tca gtg gat aaa gat gtt 29263

Tyr Ile Ile Thr Ser Ala Leu Cys Glu Ser Val Asp Lys Asp Val

5425 5430 5435

act att acc act ggt tct aat tat aca ctg aaa gga cca ccc tca 29308

Thr Ile Thr Thr Gly Ser Asn Tyr Thr Leu Lys Gly Pro Pro Ser

5440 5445 5450

ggt atg ctt tcg tgg tat tgc tat ttt gga act gac act gat caa 29353

Gly Met Leu Ser Trp Tyr Cys Tyr Phe Gly Thr Asp Thr Asp Gln

5455 5460 5465

act gaa tta tgc aat ttt caa aaa ggc aaa acc tca aac tct aaa 29398

Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys Thr Ser Asn Ser Lys

5470 5475 5480

atc tct aat tat caa tgc aat ggc act gat ctg ata cta ctc aat 29443

Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu Ile Leu Leu Asn

5485 5490 5495

gtc acg aaa gca tat ggt ggc agt tat tca tgc cct gga caa aac 29488

Val Thr Lys Ala Tyr Gly Gly Ser Tyr Ser Cys Pro Gly Gln Asn

5500 5505 5510

act gaa gaa atg att ttt tac aaa gtg gaa gtg gtt gat ccc act 29533

Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp Pro Thr

5515 5520 5525

act cca cct cca ccc gcc aca act act cac acc aca cac aca gaa 29578

Thr Pro Pro Pro Pro Ala Thr Thr Thr His Thr Thr His Thr Glu

5530 5535 5540

caa agc aca gca gag gca gca aag tta gcc ttg cag gtc caa gac 29623

Gln Ser Thr Ala Glu Ala Ala Lys Leu Ala Leu Gln Val Gln Asp

5545 5550 5555

agt tca ttt gtt ggc att acc cct aca cct gat cag cgg tgt ccg 29668

Ser Ser Phe Val Gly Ile Thr Pro Thr Pro Asp Gln Arg Cys Pro

5560 5565 5570

ggg ctg ctc gtc agc ggc att gtc ggt gtg ctt tcg gga tta gca 29713

Gly Leu Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala

5575 5580 5585

gtc ata atc atc tgc atg ttc att ttt gct tgc tgc tat aga agg 29758

Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg

5590 5595 5600

ctt tac cga caa aaa tca gac cca ctg ctg aac ctc tat gtt 29800

Leu Tyr Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val

5605 5610

taattttttc cagagcc atg aag gca gtt agc gct cta gtt ttt tgt tct 29850

Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser

5615 5620 5625

ttg att ggc att gtt ttt aat agt aaa att acc aga gtt agc ttt 29895

Leu Ile Gly Ile Val Phe Asn Ser Lys Ile Thr Arg Val Ser Phe

5630 5635 5640

att aaa cat gtt aat gta act gaa gga gat aac atc aca cta gca 29940

Ile Lys His Val Asn Val Thr Glu Gly Asp Asn Ile Thr Leu Ala

5645 5650 5655

ggt gta gaa ggt gct caa aac acc acc tgg aca aaa tac cat cta 29985

Gly Val Glu Gly Ala Gln Asn Thr Thr Trp Thr Lys Tyr His Leu

5660 5665 5670

gga tgg aga gat att tgc acc tgg aat gta act tat tat tgc ata 30030

Gly Trp Arg Asp Ile Cys Thr Trp Asn Val Thr Tyr Tyr Cys Ile

5675 5680 5685

gga gtt aat ctt acc att gtt aac gct aac caa tct cag aat ggg 30075

Gly Val Asn Leu Thr Ile Val Asn Ala Asn Gln Ser Gln Asn Gly

5690 5695 5700

tta att aaa gga cag agt gtt agt gtg acc agt gat ggg tac tat 30120

Leu Ile Lys Gly Gln Ser Val Ser Val Thr Ser Asp Gly Tyr Tyr

5705 5710 5715

acc cag cat agt ttt aac tac aac att act gtc ata cca ctg cct 30165

Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val Ile Pro Leu Pro

5720 5725 5730

acg cct agc cca cct agc act acc gca cag aca acc aca tac agt 30210

Thr Pro Ser Pro Pro Ser Thr Thr Ala Gln Thr Thr Thr Tyr Ser

5735 5740 5745

aca tca aat cag cct acc acc act aca gca gca gag gtt gcc agc 30255

Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val Ala Ser

5750 5755 5760

tcg tct ggg gtc cga gtg gca ttt ttg atg ttg gcc cca tct agc 30300

Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser Ser

5765 5770 5775

agt ccc act gct agt acc aat gag cag act act gaa ttt ttg tcc 30345

Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser

5780 5785 5790

act gtc gag agc cac acc aca gct acc tcc agt gcc ttc tct agc 30390

Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser

5795 5800 5805

acc gcc aat ctc tcc tcg ctt tcc tct aca cca atc agt ccc gct 30435

Thr Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala

5810 5815 5820

act act cct agc ccc gct cct ctt ccc act ccc ctg aag caa aca 30480

Thr Thr Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr

5825 5830 5835

gac ggc ggc atg caa tgg cag atc acc ctg ctc att gtg atc ggg 30525

Asp Gly Gly Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly

5840 5845 5850

ttg gtc atc ctg gcc gtg ttg cta tac tac atc ttc tgc cgc cgc 30570

Leu Val Ile Leu Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg

5855 5860 5865

att ccc aac gcg cac cgc aag ccg gcc tac aag ccc atc gtt atc 30615

Ile Pro Asn Ala His Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile

5870 5875 5880

ggg cag ccg gag ccg ctt cag gtg gaa ggg ggt cta agg aat ctt 30660

Gly Gln Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu

5885 5890 5895

ctc ttc tct ttt aca gta tgg tgattgaatt atg att cct aga caa ttc 30709

Leu Phe Ser Phe Thr Val Trp Met Ile Pro Arg Gln Phe

5900 5905

ttg atc act att ctt atc tgc ctc ctc caa gtc tgt gcc acc ctc 30754

Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln Val Cys Ala Thr Leu

5910 5915 5920

gct ctg gtg gcc aac gcc agt cca gac tgt att ggg ccc ttc gcc 30799

Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Pro Phe Ala

5925 5930 5935

tcc tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc 30844

Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser

5940 5945 5950

ata gtc tgc ctg ctt atc acc ttc ttc cag ttc att gac tgg atc 30889

Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Ile

5955 5960 5965

ttt gtg cgc atc gcc tac ctg cgc cac cac ccc cag tac cgc gac 30934

Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp

5970 5975 5980

cag cga gtg gcg cgg ctg ctc agg ctc ctc tgataagc atg cgg gct 30981

Gln Arg Val Ala Arg Leu Leu Arg Leu Leu Met Arg Ala

5985 5990 5995

ctg cta ctt ctc gcg ctt ctg ctg tta gtg ctc ccc cgt ccc gtc 31026

Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg Pro Val

6000 6005 6010

gac ccc cgg acc ccc acc cag tcc ccc gag gag gtc cgc aaa tgc 31071

Asp Pro Arg Thr Pro Thr Gln Ser Pro Glu Glu Val Arg Lys Cys

6015 6020 6025

aaa ttc caa gaa ccc tgg aaa ttc ctc aaa tgc tac cgc caa aaa 31116

Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys

6030 6035 6040

tca gac atg cat ccc agc tgg atc atg atc att ggg atc gtg aac 31161

Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn

6045 6050 6055

att ctg gcc tgc acc ctc atc tcc ttt gtg att tac ccc tgc ttt 31206

Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe

6060 6065 6070

gac ttt ggt tgg aac tcg cca gag gcg ctc tat ctc ccg cct gaa 31251

Asp Phe Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu

6075 6080 6085

cct gac aca cca cca cag caa cct cag gca cac gca cta cca cca 31296

Pro Asp Thr Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro

6090 6095 6100

cca cca cag cct agg cca caa tac atg ccc ata tta gac tat gag 31341

Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu

6105 6110 6115

gcc gag cca cag cga ccc atg ctc ccc gct att agt tac ttc aat 31386

Ala Glu Pro Gln Arg Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn

6120 6125 6130

cta acc ggc gga gat gac tgacccactg gccaacaaca acgtcaacga 31434

Leu Thr Gly Gly Asp Asp

6135

ccttctcctg gacatggacg gccgcgcctc ggagcagcga ctcgcccaac ttcgcattcg 31494

ccagcagcag gagagagccg tcaaggagct gcaggacggc atagccatcc accagtgcaa 31554

gaaaggcatc ttctgcctgg tgaaacaggc caagatctcc tacgaggtca cccagaccga 31614

ccatcgcctc tcctacgagc tcctgcagca gcgccagaag ttcacctgcc tggtcggagt 31674

caaccccatc gtcatcaccc agcagtcggg cgataccaag gggtgcatcc actgctcctg 31734

cgactccccc gactgcgtcc acactctgat caagaccctc tgcggcctcc gcgacctcct 31794

ccccatgaac taatcacccc cttatccagt gaaataaaga tcatattgat gattaaataa 31854

aaaaaataat catttgattt gaaataaaga tacaatcata ttgatgattt gagtttaata 31914

aaaataaaga atcacttact tgaaatctga taccaggtct ctgtccatgt tttctgccaa 31974

caccacttca ctcccctctt cccagctctg gtactgcagg ccccggcggg ctgcaaactt 32034

cctccacacc ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca ttttatcttc 32094

tatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac ttc gac ccc 32142

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro

6140 6145 6150

gtc tac ccc tac gat gca gac aac gca ccg acc gtg ccc ttc atc 32187

Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile

6155 6160 6165

aac ccc ccc ttc gtc tct tca gat gga ttc caa gag aag ccc ctg 32232

Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu

6170 6175 6180

ggg gtg ttg tcc ctg cga ctg gcc gac ccc gtc acc acc aag aac 32277

Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn

6185 6190 6195

ggg gaa atc acc ctc aag ctg gga gag ggg gtg gac ctc gac tcc 32322

Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp Leu Asp Ser

6200 6205 6210

tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc gcc gcc cct 32367

Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala Ala Ala Pro

6215 6220 6225

ctc agt ttt tcc aac aac acc att tcc ctt aac atg gat acc cct 32412

Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met Asp Thr Pro

6230 6235 6240

ctt tat acc aaa gat gga aaa tta tcc tta caa gtt tct cca cca 32457

Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln Val Ser Pro Pro

6245 6250 6255

tta aac ata tta aaa tca acc att ctg aac act tta gct gta gct 32502

Leu Asn Ile Leu Lys Ser Thr Ile Leu Asn Thr Leu Ala Val Ala

6260 6265 6270

tat gga tca ggt tta gga ctc agt ggt ggc act gct ctt gca gta 32547

Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala Leu Ala Val

6275 6280 6285

cag ttg gcc tct cca ctc acc ttt gat gaa aaa gga aat att aaa 32592

Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly Asn Ile Lys

6290 6295 6300

att aac cta gcc agt ggt cca tta aca gtt gat gca agt cga ctt 32637

Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala Ser Arg Leu

6305 6310 6315

agt atc aac tgc aaa aga ggg gtt act gtc act acc gca gga gat 32682

Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ala Gly Asp

6320 6325 6330

gca att gaa agc aac ata agc tgg cct aaa ggt ata aga ttt gaa 32727

Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile Arg Phe Glu

6335 6340 6345

ggt gat gcc ata gct gca aac att ggc aga gga ttg gaa ttt gga 32772

Gly Asp Ala Ile Ala Ala Asn Ile Gly Arg Gly Leu Glu Phe Gly

6350 6355 6360

acc act agt aca gag act gat gtc aca gat gca tac cca att caa 32817

Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro Ile Gln

6365 6370 6375

gtt aaa ttg ggt act ggt ctc acc ttt gac agt aca ggc gcc att 32862

Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile

6380 6385 6390

gtt gca tgg aac aaa gag gat gat aaa ctt aca tta tgg acc aca 32907

Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr

6395 6400 6405

gcc gac ccc tcg cca aat tgc aaa ata tac tct gaa aaa gat gct 32952

Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser Glu Lys Asp Ala

6410 6415 6420

aaa ctc aca ctt tgc ttg aca aaa tgt gga agc caa att ctg ggc 32997

Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly

6425 6430 6435

act gtg act gta ttg gca gtg aat aat gga agt ctc aac cca att 33042

Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn Pro Ile

6440 6445 6450

aca aac aca gta agc act gca ctt gtc tcc ctc aag ttt gat gca 33087

Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp Ala

6455 6460 6465

agt gga gtt ttg cta agc agc tcc aca tta gac aaa gaa tat tgg 33132

Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp

6470 6475 6480

aac ttc cga aag gga gat gtt aca cct gct gaa ccc tat act aat 33177

Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn

6485 6490 6495

gct ata ggt ttt atg cct aac ata aag gcc tat cct aaa aac aca 33222

Ala Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro Lys Asn Thr

6500 6505 6510

tct gca gct tca aaa agc cat att gtc agt caa gtt tat ctc aat 33267

Ser Ala Ala Ser Lys Ser His Ile Val Ser Gln Val Tyr Leu Asn

6515 6520 6525

ggg gat gaa acc aaa cct ctg atg ctg att att act ttt aat gaa 33312

Gly Asp Glu Thr Lys Pro Leu Met Leu Ile Ile Thr Phe Asn Glu

6530 6535 6540

act gag gat gca act tgc acc tac agt atc act ttt caa tgg aaa 33357

Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe Gln Trp Lys

6545 6550 6555

tgg gat agt act aag tac aca ggt aaa aca ctt gct acc agc tcc 33402

Trp Asp Ser Thr Lys Tyr Thr Gly Lys Thr Leu Ala Thr Ser Ser

6560 6565 6570

ttc acc ttc tcc tac att gcc caa gaa tgaatactgt attccaccct 33449

Phe Thr Phe Ser Tyr Ile Ala Gln Glu

6575 6580

gcatgccaac ccttcccacc ccactctgtc tatggaaaaa actctgaaac acaaaataaa 33509

ataaagttca agtgttttat tgattcaaca gttttacagg attcgagcag ttatttttcc 33569

tccaccctcc caggacatgg aatacaccac cctctccccc cgcacagcct tgaacatctg 33629

aatgccattg gcgatggaca tgcttttggt ctccacgttc cacacagttt cagagcgagc 33689

cagtctcggg tcggtcaggg agatgaaacc ctccgggcac tcccgcatct gcacctcaca 33749

gctcaacagc tgaggattgt cctcggtggt cgggatcacg gttatctgga agaagcagaa 33809

gagcggcggt gggaatcata gtccgcgaac gggatcggcc ggtggtgtcg catcaggccc 33869

cgcagcagtc gctgtcgccg ccgctccgtc aagctgctgc tcagggggtc cgggtccagg 33929

gactccctca gcatgatgcc cacggccctc agcatcagtc gtctggtgcg gcgggcgcag 33989

cagcgcatgc ggatctcgct caggtcgctg cagtacgtgc aacacaggac caccaggttg 34049

ttcaacagtc catagttcaa cacgctccag ccgaaactca tcgcgggaag gatgctaccc 34109

acgtggccgt cgtaccagat cctcaggtaa atcaagtggc gccccctcca gaacacgctg 34169

cccatgtaca tgatctcctt gggcatgtgg cggttcacca cctcccggta ccacatcacc 34229

ctctggttga acatgcagcc ccggatgatc ctgcggaacc acagggccag caccgccccg 34289

cccgccatgc agcgaagaga ccccgggtcc cgacaatggc aatggaggac ccaccgctcg 34349

tacccgtgga tcatctggga gctgaacaag tctatgttgg cacagcacag gcatatgctc 34409

atgcatctct tcagcactct cagctcctcg ggggtcaaaa ccatatccca gggcacgggg 34469

aactcttgca ggacagcgaa ccccgcagaa cagggcaatc ctcgcacata acttacattg 34529

tgcatggaca gggtatcgca atcaggcagc accgggtgat cctccaccag agaagcgcgg 34589

gtctcggtct cctcacagcg tggtaagggg gccggccgat acgggtgatg gcgggacgcg 34649

gctgatcgtg ttcgcgaccg tgtcatgatg cagttgcttt cggacatttt cgtacttgct 34709

gtagcagaac ctggtccggg cgctgcacac cgatcgccgg cggcggtccc ggcgcttgga 34769

acgctcggtg ttgaagttgt aaaacagcca ctctctcaga ccgtgcagca gatctagggc 34829

ctcaggagtg atgaagatcc catcatgcct gatggctcta atcacatcga ccaccgtgga 34889

atgggccaga cccagccaga tgatgcaatt ttgttgggtt tcggtgacgg cgggggaggg 34949

aagaacagga agaaccatga ttaactttta atccaaacgg tctcggagca cttcaaaatg 35009

aagatcgcgg agatggcacc tctcgccccc gctgtgttgg tggaaaataa cagccaggtc 35069

aaaggtgata cggttctcga gatgttccac ggtggcttcc agcaaagcct ccacgcgcac 35129

atccagaaac aagacaatag cgaaagcggg agggttctct aattcctcaa tcatcatgtt 35189

acactcctgc accatcccca gataattttc atttttccag ccttgaatga ttcgaactag 35249

ttcctgaggt aaatccaagc cagccatgat aaagagctcg cgcagagcgc cctccaccgg 35309

cattcttaag cacaccctca taattccaag atattctgct cctggttcac ctgcagcaga 35369

ttgacaagcg gaatatcaaa atctctgccg cgatccctaa gctcctccct cagcaataac 35429

tgtaagtact ctttcatatc ctctccgaaa tttttagcca taggaccgcc aggaatgaga 35489

ttaggacaag ccacattaca gataaaccga agtccccccc agtgagcatt gccaaatgta 35549

agattgaaat aagcatgctg gctagacccg gtgatatctt ccagataact ggacagaaaa 35609

tcgcccaggc aatttttaag aaaatcaaca aaagaaaaat cttccaggtg cacgtttagg 35669

gcctcgggaa caacgatgga gtaagtgcaa ggggtgcgtt ccagcatggt tagttagctg 35729

atctgtaaaa aaacaaaaaa taaaacatta aaccatgcta gcctggcgaa caggtgggta 35789

aatcgttctc tccagcacca ggcaggccac ggggtctccg gcgcgaccct cgtaaaaatt 35849

gtcgctatga ttgaaaacca tcacagagag acgttcccgg tggccggcgt gaatgattcg 35909

acaagatgaa tacacccccg gaacattggc gtccgcgagt gaaaaaaagc ggccgaggaa 35969

gcaataaggc actacaatgc tcagtctcaa gtccagcaaa gcgatgccat gcggatgaag 36029

cacaaaattc tcaggtgcgt acaaaatgta attactcccc tcctgcacag gcagcaaagc 36089

cccagatccc tccagataca catacaaagc ctcagcgtcc atagcttacc gagcagcagc 36149

acacaacagg cgcaagagtc agagaaaggc tgagctctaa cctgtccccc gctctctgct 36209

caatatatag cccagatcta cactgacgta aaggccaaag tctaaaaata cccgccaaat 36269

aatcacacac gcccagcaca cgcccagaaa ccggtgacac actcaaaaaa atacgcgcac 36329

ttcctcaaac gcccaaactg ccgtcatttc cgggttccca cgctacgtca tcagaattcg 36389

actttcaaat ccgtcgaccg ttaaacacgt cactcgcccc gcccctaacg gtcgccctcc 36449

tctcggccaa tcacagcccc gcatccccaa attcaaacgc ctcatttgca tattaacgcg 36509

cacaaaaagt ttgaggtata ttatttgatg atg 36542

<210> SEQ ID NO 29

<211> LENGTH: 190

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 29

Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Gln Thr Arg Gln Leu

1 5 10 15

Leu Glu Asn Ser Ser Glu Glu Val Ser Tyr Leu Trp Arg Phe Cys Phe

20 25 30

Gly Gly Pro Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg

35 40 45

Glu Gln Phe Glu Asp Ile Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser

50 55 60

Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Ser Ile Leu Arg Ala

65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe

85 90 95

Ala Phe Ile Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp

100 105 110

Tyr Arg Leu Asp Cys Leu Ala Val Ala Leu Trp Arg Thr Trp Arg Cys

115 120 125

Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Val Asp

130 135 140

Thr Leu Arg Ile Leu Ser Leu Gln Ser Pro Gln Glu His Gln Arg Arg

145 150 155 160

Gln Gln Pro Gln Gln Gln Gln Glu Glu Glu Glu Asp Arg Glu Glu Asn

165 170 175

Pro Arg Ala Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu

180 185 190

<210> SEQ ID NO 30

<211> LENGTH: 157

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 30

Met Met Gly Arg Met Thr Lys Ile Cys Val Phe Leu Arg Ser Ser Met

1 5 10 15

Ser Gly Ser Ala Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu Thr

20 25 30

Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly Ser

35 40 45

Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu Thr

50 55 60

Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala Ala

65 70 75 80

Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Leu Gly Ala Gly

85 90 95

Tyr Tyr Ser Ser Leu Val Ala Asn Ser Ser Ser Ala Asn Asn Pro Ala

100 105 110

Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu Ala

115 120 125

Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu Gln

130 135 140

Ala Glu Thr Arg Ala Ala Val Ala Thr Val Lys Thr Lys

145 150 155

<210> SEQ ID NO 31

<211> LENGTH: 392

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 31

Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln

1 5 10 15

Gln Gln Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln Gln

20 25 30

Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala Gly Gln Thr

35 40 45

Ser Gln Tyr Asp His Leu Ala Leu Glu Glu Gly Glu Gly Leu Ala Arg

50 55 60

Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met Lys Arg

65 70 75 80

Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg Asp Arg

85 90 95

Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His Ala Gly

100 105 110

Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg Asp Glu

115 120 125

Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg Ala His

130 135 140

Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val Lys Glu

145 150 155 160

Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr Leu Ile

165 170 175

Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp Leu Leu

180 185 190

Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala Gln Leu

195 200 205

Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg Glu Ala

210 215 220

Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp Leu Val

225 230 235 240

Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro Leu Ser

245 250 255

Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly Lys Tyr

260 265 270

Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp Lys Glu

275 280 285

Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val Leu Thr

290 295 300

Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His Arg Ala

305 310 315 320

Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu Met His

325 330 335

Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu Ser Tyr

340 345 350

Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg Ala Leu

355 360 365

Glu Ala Ala Ala Gly Pro Tyr Val Glu Glu Val Asp Asp Glu Val Asp

370 375 380

Glu Glu Gly Glu Tyr Leu Glu Asp

385 390

<210> SEQ ID NO 32

<211> LENGTH: 593

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 32

Met Gln Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu

1 5 10 15

Gln Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala

20 25 30

Met Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg

35 40 45

Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val

50 55 60

Pro Ser Arg Ser Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn

65 70 75 80

Ala Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val

85 90 95

Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val

100 105 110

Gln Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala

115 120 125

Gln Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala

130 135 140

Leu Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln

145 150 155 160

Glu Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu

165 170 175

Val Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln

180 185 190

Thr Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys

195 200 205

Asn Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala

210 215 220

Thr Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu

225 230 235 240

Val Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu

245 250 255

Gly Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val

260 265 270

Asp Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly

275 280 285

Gln Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr

290 295 300

Asn Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu

305 310 315 320

Glu Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met

325 330 335

Gln Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn

340 345 350

Met Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu

355 360 365

Met Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr

370 375 380

Asn Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr

385 390 395 400

Gly Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp

405 410 415

Val Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro

420 425 430

Leu Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser

435 440 445

Gly Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe

450 455 460

Pro Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly

465 470 475 480

Arg Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp

485 490 495

Ser Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile

500 505 510

Glu Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu

515 520 525

His Arg Asp Asp Pro Arg Ala Ser Gln Gly Ala Thr Ser Arg Gly Ser

530 535 540

Ala Ala Arg Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp

545 550 555 560

Asp Asp Glu Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly

565 570 575

Ser Gly Gly Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met

580 585 590

Met

<210> SEQ ID NO 33

<211> LENGTH: 542

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 33

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser

1 5 10 15

Val Met Gln Gln Ala Met Ala Ala Ala Met Gln Pro Pro Leu Glu Ala

20 25 30

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser

35 40 45

Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr

50 55 60

Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn

65 70 75 80

Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr

85 90 95

Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg

100 105 110

Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val

115 120 125

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser

130 135 140

Arg Lys Thr Pro Asn Gly Val Lys Val Asp Asp Asp Tyr Asp Gly Ser

145 150 155 160

Gln Asp Ile Leu Glu Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly

165 170 175

Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile

180 185 190

Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp

195 200 205

Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro

210 215 220

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His

225 230 235 240

Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser

245 250 255

Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu

260 265 270

Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala

275 280 285

Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala

290 295 300

Ala Ala Thr Ala Ala Val Ala Thr Ala Ala Thr Thr Asp Ala Asp Ala

305 310 315 320

Ala Thr Thr Thr Arg Gly Asp Thr Phe Ala Thr Gln Ala Glu Glu Ala

325 330 335

Ala Ala Leu Ala Ala Thr Asp Asp Ser Glu Ser Lys Ile Val Ile Lys

340 345 350

Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val Leu Ser Asp

355 360 365

Gly Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly

370 375 380

Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp

385 390 395 400

Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met

405 410 415

Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro

420 425 430

Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn

435 440 445

Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr

450 455 460

His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro

465 470 475 480

Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp

485 490 495

His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val

500 505 510

Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala

515 520 525

Leu Gly Ile Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe

530 535 540

<210> SEQ ID NO 34

<211> LENGTH: 194

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 34

Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg

1 5 10 15

Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His

20 25 30

Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys

35 40 45

Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val

50 55 60

Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr

65 70 75 80

Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala

85 90 95

Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr

100 105 110

Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr

115 120 125

Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ser

130 135 140

Ser Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala

145 150 155 160

Ala Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val

165 170 175

Arg Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro

180 185 190

Arg Thr

<210> SEQ ID NO 35

<211> LENGTH: 347

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 35

Met Ser Lys Arg Lys Tyr Lys Glu Glu Met Leu Gln Val Ile Ala Pro

1 5 10 15

Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg

20 25 30

Lys Leu Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Glu Asp Asp Gly

35 40 45

Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp

50 55 60

Arg Gly Arg Lys Val Lys Pro Val Leu Arg Pro Gly Thr Thr Val Val

65 70 75 80

Phe Thr Pro Gly Glu Arg Ser Gly Ser Ala Ser Lys Arg Ser Tyr Asp

85 90 95

Glu Val Tyr Gly Asp Glu Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu

100 105 110

Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Pro Ala Pro Leu Lys Glu

115 120 125

Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu

130 135 140

Lys Pro Val Thr Leu Gln Gln Val Leu Pro Ser Ala Ala Pro Arg Arg

145 150 155 160

Gly Phe Lys Arg Glu Gly Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu

165 170 175

Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu His Met Lys

180 185 190

Val Asp Pro Glu Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln

195 200 205

Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr

210 215 220

Glu Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr

225 230 235 240

Met Glu Val Gln Thr Asp Pro Trp Met Pro Ala Pro Ala Ser Thr Thr

245 250 255

Thr Thr Arg Arg Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro

260 265 270

Asn Tyr Ala Leu His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly

275 280 285

Thr Arg Phe Tyr Arg Gly Tyr Thr Ser Ser Arg Arg Arg Lys Thr Thr

290 295 300

Thr Arg Arg Arg Arg Arg Arg Ser Arg Arg Ser Ser Thr Ala Thr Ser

305 310 315 320

Ala Leu Val Arg Arg Val Tyr Arg Ser Gly Arg Glu Pro Leu Thr Leu

325 330 335

Pro Arg Ala Arg Tyr His Pro Ser Ile Ala Ile

340 345

<210> SEQ ID NO 36

<211> LENGTH: 77

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 36

Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly

1 5 10 15

Arg Lys Pro Arg Arg Arg Arg Leu Thr Gly Asn Gly Leu Arg Arg His

20 25 30

His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe

35 40 45

Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro

50 55 60

Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

65 70 75

<210> SEQ ID NO 37

<211> LENGTH: 242

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 37

Met Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg

1 5 10 15

Pro Phe Met Gly Thr Trp Ser Asp Ile Gly Asn Ser Gln Leu Asn Gly

20 25 30

Gly Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly

35 40 45

Ser Thr Leu Lys Thr Tyr Gly Asn Lys Ala Trp Asn Ser Ser Thr Gly

50 55 60

Gln Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val

65 70 75 80

Val Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn

85 90 95

Gln Ala Val Gln Lys Gln Ile Asn Ser Arg Leu Asp Ala Val Pro Pro

100 105 110

Ala Gly Ser Val Asp Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu

115 120 125

Asp Lys Arg Gly Asp Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu

130 135 140

Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu

145 150 155 160

Gly Leu Pro Thr Thr Arg Pro Val Ala Pro Leu Ala Thr Gly Val Leu

165 170 175

Lys Pro Ser Ser Ser Ser Gln Pro Ala Thr Leu Asp Leu Pro Pro Pro

180 185 190

Ala Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val

195 200 205

Ala Ser Arg Ala Pro Arg Gly Arg Pro Gln Ala Asn Trp Gln Ser Thr

210 215 220

Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg

225 230 235 240

Cys Tyr

<210> SEQ ID NO 38

<211> LENGTH: 959

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 38

Met Tyr Val Arg Arg Pro Glu Gly Gly Gly Arg Gly Ala Ser Pro Ser

1 5 10 15

Cys Lys Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His

20 25 30

Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln

35 40 45

Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg

50 55 60

Asn Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln

65 70 75 80

Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr

85 90 95

Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu

100 105 110

Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly

115 120 125

Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ala Leu Ala Pro

130 135 140

Lys Ala Ala Pro Asn Pro Ser Gln Trp Tyr Glu Thr Thr Glu Gly Thr

145 150 155 160

Asn Gly Asn Ala Asp Ala Thr Thr Thr Lys His Ser Phe Gly Leu Ala

165 170 175

Ala Met Lys Gly Asp Asn Ile Ala Ala Glu Gly Leu Gln Ile Gly Thr

180 185 190

Asp Ser Thr Ser Gly Ser Asp Thr Pro Ile Tyr Ala Asp Lys Leu Tyr

195 200 205

Gln Pro Glu Pro Gln Ile Gly Glu Glu Ser Trp Thr Asp Asn Asp Gly

210 215 220

Thr Asn Glu Lys Phe Gly Gly Arg Val Leu Lys Pro Asp Thr Asn Met

225 230 235 240

Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Asn Lys Gly Gly

245 250 255

Gln Ala Lys Gln Lys Ala Thr Glu Gly Thr Ala Val Asp Tyr Asp Val

260 265 270

Asp Met Asn Phe Phe Asp Gly Arg Asp Ala Ala Ala Asn Phe Thr Pro

275 280 285

Glu Val Val Leu Tyr Ala Glu Asn Val Asp Leu Glu Thr Pro Asp Thr

290 295 300

His Ile Val Tyr Lys Pro Gly Thr Ser Asp Leu Ser Ser His Val Asn

305 310 315 320

Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg

325 330 335

Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly

340 345 350

Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln

355 360 365

Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly

370 375 380

Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr

385 390 395 400

Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Ile Glu Asp Glu Leu

405 410 415

Pro Asn Tyr Cys Phe Pro Ile Asp Ala Val Gly Ile Thr Arg Thr Tyr

420 425 430

Gln Gly Ile Lys Val Gln Asn Gly Thr Gly Thr Gly Thr Thr Trp Glu

435 440 445

Lys Asp Thr Asp Val Ser Thr Ala Asn Glu Ile Gly Ile Gly Asn Asn

450 455 460

Leu Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu

465 470 475 480

Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro

485 490 495

Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn

500 505 510

Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly

515 520 525

Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His

530 535 540

His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly

545 550 555 560

Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile

565 570 575

Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe

580 585 590

Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu

595 600 605

Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala

610 615 620

Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met

625 630 635 640

Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala

645 650 655

Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile

660 665 670

Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr

675 680 685

Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro

690 695 700

Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr

705 710 715 720

Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val

725 730 735

Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile

740 745 750

Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met

755 760 765

Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly

770 775 780

Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser

785 790 795 800

Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val

805 810 815

Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn

820 825 830

Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro

835 840 845

Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr

850 855 860

Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile

865 870 875 880

Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly

885 890 895

Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe

900 905 910

Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu

915 920 925

Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu

930 935 940

Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr

945 950 955

<210> SEQ ID NO 39

<211> LENGTH: 207

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 39

Met Thr Ala Cys Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Leu Arg

1 5 10 15

Asp Leu Gly Cys Gly Pro Cys Phe Leu Gly Thr Phe Asp Lys Arg Phe

20 25 30

Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr

35 40 45

Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn

50 55 60

Pro Arg Ser His Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp

65 70 75 80

Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg

85 90 95

Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser

100 105 110

Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe Cys

115 120 125

Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met Asp

130 135 140

Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met Leu

145 150 155 160

Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala Leu

165 170 175

Tyr Arg Phe Leu Asn Ala His Ser Ala Tyr Phe Arg Ser His Arg Ala

180 185 190

Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Gln Asp Met

195 200 205

<210> SEQ ID NO 40

<211> LENGTH: 796

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 40

Met Glu Thr Gln Pro Ser Ser Pro Thr Ser Pro Ser Ala Pro Ala Asp

1 5 10 15

Glu Asn Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro Ser Pro

20 25 30

Thr Ser Asp Ala Ala Ala Pro Asp Met Gln Glu Met Glu Lys Ser Ile

35 40 45

Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu Glu Glu Leu

50 55 60

Ala Ala Arg Phe Ser Ala Pro Glu Glu Asn His Gln Glu Gln Pro Glu

65 70 75 80

Gln Glu Ala Glu Ser Glu Gln Gln Gln Ala Gly Leu Glu His Gly Asp

85 90 95

Tyr Leu Ser Gly Ala Glu Asp Val Leu Ile Lys His Leu Ala Arg Gln

100 105 110

Cys Ile Ile Val Lys Asp Ala Leu Leu Asp Arg Ala Glu Val Pro Leu

115 120 125

Ser Val Ala Glu Leu Ser Arg Ala Tyr Glu Arg Asn Leu Phe Ser Pro

130 135 140

Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro

145 150 155 160

Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr

165 170 175

Tyr His Ile Phe Phe Lys Asn Gln Lys Ile Pro Val Ser Cys Arg Ala

180 185 190

Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly Pro Gly Ala Arg

195 200 205

Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe Glu Gly

210 215 220

Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln Gly Ser Gly

225 230 235 240

Glu Glu His Glu His His Ser Ala Leu Val Glu Leu Glu Gly Asp Asn

245 250 255

Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu Leu Thr His Phe Ala

260 265 270

Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met Ser Ala Val Met Asp

275 280 285

Gln Val Leu Ile Lys Arg Ala Ser Pro Leu Ser Glu Glu Glu Met Gln

290 295 300

Asp Pro Glu Ser Ser Asp Glu Gly Lys Pro Val Val Ser Asp Glu Gln

305 310 315 320

Leu Ala Arg Trp Leu Gly Ala Ser Ser Thr Pro Gln Ser Leu Glu Glu

325 330 335

Arg Arg Lys Leu Met Met Ala Val Val Leu Val Thr Val Glu Leu Glu

340 345 350

Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr Leu Arg Lys Val Glu

355 360 365

Glu Asn Leu His Tyr Leu Phe Arg His Gly Phe Val Arg Gln Ala Cys

370 375 380

Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile

385 390 395 400

Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg

405 410 415

Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu

420 425 430

Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu Glu

435 440 445

Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn Leu Lys

450 455 460

Ala Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ala Asp Leu Ala

465 470 475 480

Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu Arg Asn Gly Leu

485 490 495

Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn Phe Arg Ser Phe Ile

500 505 510

Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys Ser Ala Leu Pro Ser

515 520 525

Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro Pro Pro Leu Trp Ser

530 535 540

His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu Ala Tyr His Ser Asp

545 550 555 560

Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu Glu Cys His Cys Arg

565 570 575

Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala Cys Asn Pro Gln Leu

580 585 590

Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu Leu Gln Gly Pro Gly

595 600 605

Glu Glu Gly Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr Ser

610 615 620

Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe Glu Ile

625 630 635 640

Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu Leu Ser Ala

645 650 655

Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu Gln Ala Ile Gln

660 665 670

Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly His Gly Val Tyr Leu

675 680 685

Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro Ser Phe Pro Gln Asp Ala

690 695 700

Pro Arg Lys Gln Gln Glu Ala Glu Ser Gly Ala Ala Ala Ala Ala Gly

705 710 715 720

Gly Phe Gly Gly Arg Leu Gly Glu Gln Ser Gly Arg Gly Asp Gly Arg

725 730 735

Leu Gly Gln His Ser Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly

740 745 750

Gly Gly Arg Arg Gly Gly Gly Gly Gly Arg Gly Arg Ser Ser Arg Arg

755 760 765

Gln Thr Val Val Leu Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu

770 775 780

Arg Ser Gly Ser Gly Ser Arg Arg Pro Gly Pro Gln

785 790 795

<210> SEQ ID NO 41

<211> LENGTH: 227

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 41

Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln

1 5 10 15

Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn

20 25 30

Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Leu

35 40 45

Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Ile Thr Ala Thr

50 55 60

Pro Arg His His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr

65 70 75 80

Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln

85 90 95

Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala

100 105 110

Ala Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile

115 120 125

Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu

130 135 140

Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser

145 150 155 160

Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser

165 170 175

Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe

180 185 190

Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr

195 200 205

Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp

210 215 220

Gly Tyr Asp

225

<210> SEQ ID NO 42

<211> LENGTH: 106

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 42

Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp

1 5 10 15

His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Ala

20 25 30

Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val

35 40 45

Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe

50 55 60

Ser Gln Arg Pro Ile Leu Val Glu Arg Glu Gln Gly Gln Thr Arg Leu

65 70 75 80

Thr Leu Tyr Cys Ile Cys Asn His Pro Gly Leu His Glu Ser Leu Cys

85 90 95

Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser

100 105

<210> SEQ ID NO 43

<211> LENGTH: 176

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 43

Met Gly Lys Ile Thr Leu Val Ser Cys Gly Ala Leu Val Ala Val Val

1 5 10 15

Leu Ser Ile Val Gly Leu Gly Gly Ala Ala Val Val Lys Glu Lys Ala

20 25 30

Asp Pro Cys Leu His Phe Asn Pro Asp Lys Cys Gln Leu Ser Phe Gln

35 40 45

Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys

50 55 60

Glu Asn Val Arg Ile Glu Tyr Asn Asn Lys Thr Arg Asn Asn Thr Leu

65 70 75 80

Ala Ser Val Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val

85 90 95

Pro Gly Ala Asp Gly Ser Pro Arg Thr Val Asn Asn Thr Phe Ile Phe

100 105 110

Ala His Met Cys Asp Thr Val Met Trp Met Ser Lys Gln Tyr Asp Met

115 120 125

Trp Pro Pro Thr Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser

130 135 140

Leu Cys Thr Ala Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met

145 150 155 160

Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro

165 170 175

<210> SEQ ID NO 44

<211> LENGTH: 206

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 44

Met Ala Ser Val Lys Phe Phe Ala Leu Phe Ala Ser Leu Ile Thr Val

1 5 10 15

Ile His Gly Met Ser Asn Glu Lys Ile Thr Ile Tyr Thr Gly Thr Asn

20 25 30

His Thr Leu Lys Gly Pro Glu Lys Ser Thr Glu Val Ser Trp Tyr Cys

35 40 45

Tyr Phe Asn Glu Ser Asp Val Ala Thr Glu Leu Cys Gly Asn Asn Asn

50 55 60

Lys Lys Asn Glu Ser Ile Thr Leu Ile Lys Phe Gln Cys Gly Ser Asp

65 70 75 80

Leu Thr Leu Ile Asn Ile Thr Arg Asp Tyr Val Gly Met Tyr Tyr Gly

85 90 95

Thr Thr Ala Gly Ile Ser Asp Met Glu Phe Tyr Gln Val Ser Val Ser

100 105 110

Glu Pro Thr Thr Pro Arg Met Thr Thr Thr Thr Lys Thr Thr Pro Thr

115 120 125

Thr Thr Thr Gln Leu Thr Thr Asn Gly Phe Phe Ala Met Leu Gln Val

130 135 140

Ala Glu Asn Ser Thr Ser Ile Gln Pro Thr Pro Pro Ser Glu Glu Ile

145 150 155 160

Pro Lys Ser Met Ile Gly Ile Ile Val Ala Val Val Val Cys Met Leu

165 170 175

Ile Ile Ala Leu Cys Met Val Tyr Tyr Ala Phe Cys Tyr Arg Lys His

180 185 190

Arg Leu Asn Asp Lys Leu Glu His Leu Leu Ser Val Glu Phe

195 200 205

<210> SEQ ID NO 45

<211> LENGTH: 203

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 45

Met Lys Ile Leu Cys Leu Leu Val Phe Tyr Ile Ile Thr Ser Ala Leu

1 5 10 15

Cys Glu Ser Val Asp Lys Asp Val Thr Ile Thr Thr Gly Ser Asn Tyr

20 25 30

Thr Leu Lys Gly Pro Pro Ser Gly Met Leu Ser Trp Tyr Cys Tyr Phe

35 40 45

Gly Thr Asp Thr Asp Gln Thr Glu Leu Cys Asn Phe Gln Lys Gly Lys

50 55 60

Thr Ser Asn Ser Lys Ile Ser Asn Tyr Gln Cys Asn Gly Thr Asp Leu

65 70 75 80

Ile Leu Leu Asn Val Thr Lys Ala Tyr Gly Gly Ser Tyr Ser Cys Pro

85 90 95

Gly Gln Asn Thr Glu Glu Met Ile Phe Tyr Lys Val Glu Val Val Asp

100 105 110

Pro Thr Thr Pro Pro Pro Pro Ala Thr Thr Thr His Thr Thr His Thr

115 120 125

Glu Gln Ser Thr Ala Glu Ala Ala Lys Leu Ala Leu Gln Val Gln Asp

130 135 140

Ser Ser Phe Val Gly Ile Thr Pro Thr Pro Asp Gln Arg Cys Pro Gly

145 150 155 160

Leu Leu Val Ser Gly Ile Val Gly Val Leu Ser Gly Leu Ala Val Ile

165 170 175

Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu Tyr Arg

180 185 190

Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val

195 200

<210> SEQ ID NO 46

<211> LENGTH: 288

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 46

Met Lys Ala Val Ser Ala Leu Val Phe Cys Ser Leu Ile Gly Ile Val

1 5 10 15

Phe Asn Ser Lys Ile Thr Arg Val Ser Phe Ile Lys His Val Asn Val

20 25 30

Thr Glu Gly Asp Asn Ile Thr Leu Ala Gly Val Glu Gly Ala Gln Asn

35 40 45

Thr Thr Trp Thr Lys Tyr His Leu Gly Trp Arg Asp Ile Cys Thr Trp

50 55 60

Asn Val Thr Tyr Tyr Cys Ile Gly Val Asn Leu Thr Ile Val Asn Ala

65 70 75 80

Asn Gln Ser Gln Asn Gly Leu Ile Lys Gly Gln Ser Val Ser Val Thr

85 90 95

Ser Asp Gly Tyr Tyr Thr Gln His Ser Phe Asn Tyr Asn Ile Thr Val

100 105 110

Ile Pro Leu Pro Thr Pro Ser Pro Pro Ser Thr Thr Ala Gln Thr Thr

115 120 125

Thr Tyr Ser Thr Ser Asn Gln Pro Thr Thr Thr Thr Ala Ala Glu Val

130 135 140

Ala Ser Ser Ser Gly Val Arg Val Ala Phe Leu Met Leu Ala Pro Ser

145 150 155 160

Ser Ser Pro Thr Ala Ser Thr Asn Glu Gln Thr Thr Glu Phe Leu Ser

165 170 175

Thr Val Glu Ser His Thr Thr Ala Thr Ser Ser Ala Phe Ser Ser Thr

180 185 190

Ala Asn Leu Ser Ser Leu Ser Ser Thr Pro Ile Ser Pro Ala Thr Thr

195 200 205

Pro Ser Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln Thr Asp Gly Gly

210 215 220

Met Gln Trp Gln Ile Thr Leu Leu Ile Val Ile Gly Leu Val Ile Leu

225 230 235 240

Ala Val Leu Leu Tyr Tyr Ile Phe Cys Arg Arg Ile Pro Asn Ala His

245 250 255

Arg Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Gln Pro Glu Pro Leu

260 265 270

Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe Thr Val Trp

275 280 285

<210> SEQ ID NO 47

<211> LENGTH: 91

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 47

Met Ile Pro Arg Gln Phe Leu Ile Thr Ile Leu Ile Cys Leu Leu Gln

1 5 10 15

Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile

20 25 30

Gly Pro Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys

35 40 45

Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp

50 55 60

Trp Ile Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg

65 70 75 80

Asp Gln Arg Val Ala Arg Leu Leu Arg Leu Leu

85 90

<210> SEQ ID NO 48

<211> LENGTH: 144

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 48

Met Arg Ala Leu Leu Leu Leu Ala Leu Leu Leu Leu Val Leu Pro Arg

1 5 10 15

Pro Val Asp Pro Arg Thr Pro Thr Gln Ser Pro Glu Glu Val Arg Lys

20 25 30

Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Arg Gln Lys

35 40 45

Ser Asp Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val Asn Ile

50 55 60

Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys Phe Asp Phe

65 70 75 80

Gly Trp Asn Ser Pro Glu Ala Leu Tyr Leu Pro Pro Glu Pro Asp Thr

85 90 95

Pro Pro Gln Gln Pro Gln Ala His Ala Leu Pro Pro Pro Pro Gln Pro

100 105 110

Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro Gln Arg

115 120 125

Pro Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly Asp Asp

130 135 140

<210> SEQ ID NO 49

<211> LENGTH: 443

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 49

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr

1 5 10 15

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro

20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser

35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu

50 55 60

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser

65 70 75 80

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr

85 90 95

Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu

100 105 110

Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu

115 120 125

Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly

130 135 140

Thr Ala Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys

145 150 155 160

Gly Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala

165 170 175

Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ala

180 185 190

Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile Arg Phe

195 200 205

Glu Gly Asp Ala Ile Ala Ala Asn Ile Gly Arg Gly Leu Glu Phe Gly

210 215 220

Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro Ile Gln Val

225 230 235 240

Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile Val Ala

245 250 255

Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro

260 265 270

Ser Pro Asn Cys Lys Ile Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu

275 280 285

Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Thr Val Leu

290 295 300

Ala Val Asn Asn Gly Ser Leu Asn Pro Ile Thr Asn Thr Val Ser Thr

305 310 315 320

Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser

325 330 335

Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr

340 345 350

Pro Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys

355 360 365

Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser

370 375 380

Gln Val Tyr Leu Asn Gly Asp Glu Thr Lys Pro Leu Met Leu Ile Ile

385 390 395 400

Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe

405 410 415

Gln Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Lys Thr Leu Ala Thr

420 425 430

Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu

435 440

<210> SEQ ID NO 50

<211> LENGTH: 29910

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1325

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (9)..(1514)

<223> OTHER INFORMATION: E1b\55K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23632)..(24192)

<223> OTHER INFORMATION: 22K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (25488)..(26114)

<223> OTHER INFORMATION: E3\CR1-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29500)..(29904)

<223> OTHER INFORMATION: E3\14.7K

<400> SEQUENCE: 50

cttgacaa atg gag tca aga aac cca ttt cag cag gga tta ccg tct gga 50

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly

1 5 10

ctg ctt agc agt agc ttt gtg gag aac atg gag gtg cca gcg cct gaa 98

Leu Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu

15 20 25 30

tgc aat ctc cgg cta ctt gcc agt aca gcc ggt aga cac gct gag gat 146

Cys Asn Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp

35 40 45

cct gag tct cca gtc acc cca gga aca cca acg ccg cca gca gcc gca 194

Pro Glu Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala

50 55 60

gca gca gca aga gga gga gga gga ccg aga aga gaa ccc gag agc cgg 242

Ala Ala Ala Arg Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg

65 70 75

tct gga ccc tcc ggt ggc gga gga gga gga gta gct gac ttg ttt ccc 290

Ser Gly Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro

80 85 90

gag ctg tgc cgg gtg ctg act agg tct tcc agt gga cgg gag agg ggg 338

Glu Leu Cys Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly

95 100 105 110

att aag cgg gag agg cat gag gag act agt cac aga act gaa ctg act 386

Ile Lys Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr

115 120 125

gtc agt ttg atg agc cgc agg cgc cca gaa tcg gtg tgg tgg cat gag 434

Val Ser Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu

130 135 140

gtt cag tcg cag ggg gta gat gag gtc tcg gtg atg cat gag aaa tat 482

Val Gln Ser Gln Gly Val Asp Glu Val Ser Val Met His Glu Lys Tyr

145 150 155

tcc cta gaa caa gtc aag act tgt tgg ttg gag ccc gag gat gat tgg 530

Ser Leu Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp

160 165 170

gag gta gcc atc agg aat tat gcc aag ctg gct ctg agg cca gac aag 578

Glu Val Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys

175 180 185 190

aag tac aag att acc aaa ctg att aat atc aga aat tcc tgc tac att 626

Lys Tyr Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile

195 200 205

tcg ggg aat ggg gcc gag gtg gag atc agt acc cag gag agg gtg gcc 674

Ser Gly Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala

210 215 220

ttc aga tgt tgt atg atg aat atg tac ccg ggg gtg gtg ggc atg gag 722

Phe Arg Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu

225 230 235

gga gtc acc ttt atg aac gcg agg ttc agg ggt gat ggg tat aat ggg 770

Gly Val Thr Phe Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly

240 245 250

gtg gtc ttt atg gcc aac acc aag ctg aca gtg cac ggt tgc tcc ttc 818

Val Val Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe

255 260 265 270

ttt ggc ttt aat aac atg tgc atc gag gcc tgg ggc agt gtt tca gtg 866

Phe Gly Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val

275 280 285

agg gga tgc agt ttt tca gcc aac tgg atg ggg gtc gtg ggc aga acc 914

Arg Gly Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr

290 295 300

aag agc aag gtg tca gtg aag aaa tgc ctg ttc gag agg tgc cac ctg 962

Lys Ser Lys Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu

305 310 315

ggg gtg atg agc gag ggc gaa gcc aaa gtc aaa cac tgc gcc tct acc 1010

Gly Val Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr

320 325 330

gag acg ggc tgc ttt gtg ctg atc aag ggc aat gcc aaa gtc aag cat 1058

Glu Thr Gly Cys Phe Val Leu Ile Lys Gly Asn Ala Lys Val Lys His

335 340 345 350

aat atg atc tgt ggg gcc tcg gat gag cgc ggc tac cag atg ctg acc 1106

Asn Met Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr

355 360 365

tgc gcc ggt ggg aac agc cat atg ctg gcc acc gtg cat gtg gcc tcg 1154

Cys Ala Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser

370 375 380

cac ccc cgc aag aca tgg ccc gag ttc gag cac aac gtc atg acc cgc 1202

His Pro Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg

385 390 395

tgc aat gtg cac ctg ggc tcc cgc cga ggc atg ttc atg cca tac cag 1250

Cys Asn Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln

400 405 410

tgc aac atg caa ttt gtg aag gtg ctg ctg gag ccc gat gcc atg tcc 1298

Cys Asn Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser

415 420 425 430

aga gtg agc ctg acg ggg gtg ttt gac atg aat gtg gag ctg tgg aaa 1346

Arg Val Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys

435 440 445

att ctg aga tat gat gaa tcc aag acc agg tgc cgg gcc tgc gaa tgc 1394

Ile Leu Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys

450 455 460

gga ggc aag cac gcc agg ctt cag ccc gtg tgt gtg gag gtg acg gag 1442

Gly Gly Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu

465 470 475

gac ctg cga ccc gat cat ttg gtg ttg tcc tgc aac ggg acg gag ttc 1490

Asp Leu Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe

480 485 490

ggc tcc agc ggg gaa gaa tct gac tagagtgagt agtgtttggg ggtgggtggg 1544

Gly Ser Ser Gly Glu Glu Ser Asp

495 500

agcctgcatg atgggcagaa tgactaaaat ctgtgttttt ctgcgcagca gcatgagcgg 1604

aagcgcctcc tttgagggag gggtattcag cccttatctg acggggcgtc tcccctcctg 1664

ggctggagtg cgtcagaatg tgatgggatc cacggtggac ggccggcccg tgcagcccgc 1724

gaactcttca accctgacct acgcgaccct gagctcctcg tccgtggacg cagctgccgc 1784

cgcagctgct gcttccgccg ccagcgccgt gcgcggaatg gccctgggtg ccggctacta 1844

cagctctctg gtggccaact cgagttccgc caataatccc gccagcctga acgaggagaa 1904

gctgctgctg ctgatggccc agctcgaggc cctgacccag cgcctgggcg agctgaccca 1964

gcaggtggct cagctgcagg cggagacgcg ggccgcggtt gccacggtga aaaccaaata 2024

aaaaatgaat caataaataa acggagacgg ttgttgattt taacacagag tcttgaatct 2084

ttatttgatt tttcgcgcgc ggtaggccct ggaccaccgg tctcgatcat tgagcacccg 2144

gtggatcttt tccaggaccc ggtagaggtg ggcttggatg ttgaggtaca tgggcatgag 2204

cccgtcccgg gggtggaggt agctccattg cagggcctcg tgctcggggg tggtgttgta 2264

aatcacccag tcatagcagg ggcgcagggc gtggtgctgc acgatgtcct tgaggaggag 2324

actgatggcc acgggcagcc ccttggtgta ggtgttgacg aacctgttga gctgggaggg 2384

atgcatgcgg ggggagatga gatgcatctt ggcctggatc ttgagattgg cgatgttccc 2444

gcccagatcc cgccgggggt tcatgttgtg caggaccacc agcacggtgt atccggtgca 2504

cttggggaat ttgtcatgca acttggaagg gaaggcgtga aagaatttgg agacgccctt 2564

gtgaccgccc aggttttcca tgcactcatc catgatgatg gcgatgggcc cgtgggcggc 2624

ggcctgggca aagacgtttc gggggtcgga cacatcgtag ttgtggtcct gggtgagctc 2684

gtcataggcc attttaatga atttggggcg gagggtgccc gactggggga caaaggtgcc 2744

ctcgatcccg ggggcgtagt tgccctcgca gatctgcatc tcccaggcct tgagctcgga 2804

gggggggatc atgtccacct gcggggcgat gaaaaaaacg gtttccgggg cgggggagat 2864

gagctgcgcc gaaagcaggt tccggagcag ctgggacttg ccgcagccgg tggggccgta 2924

gatgaccccg atgaccggct gcaggtggta gttgagggag agacagctgc cgtcctcgcg 2984

gaggaggggg gccacctcgt tcatcatctc gcgcacatgc atgttctcgc gcacgagttc 3044

cgccaggagg cgctcgcccc ccagcgagag gagctcttgc agcgaggcga agtttttcag 3104

cggcttgagt ccgtcggcca tgggcatttt ggagagggtc tgttgcaaga gttccagacg 3164

gtcccagagc tcggtgatgt gctctagggc atctcgatcc agcagacctc ctcgtttcgc 3224

gggttggggc gactgcggga gtagggcacc aggcgatggg cgtccagcga ggccagggtt 3284

cggtccttcc agggccgcag ggtccgcgtc agcgtggtct ccgtcacggt gaaggggtgc 3344

gcgccgggct gggcgcttgc gagggtgcgc ttcaggctca tccggctggt cgagaaccgc 3404

tcccggtcgg cgccctgcgc gtcggccagg tagcaattga gcatgagttc gtagttgagc 3464

gcctcggccg cgtggccctt ggcgcggagc ttacctttgg aagtgtgtcc gcagacggga 3524

cagaggaggg acttgagggc gtagagcttg ggggcgagga agacggactc gggggcgtag 3584

gcgtccgcgc cgcagctggc gcagacggtc tcgcactcca cgagccaggt gaggtcgggg 3644

cggtcggggt caaaaacgag gtttcctccg tgctttttga tgcgtttctt acctctggtc 3704

tccatgagct cgtgtccccg ctgggtgaca aagaggctgt ccgtgtcccc gtagaccgac 3764

tttatgggcc ggtcctcgag cggggtgccg cggtcctcgt cgtagaggaa ccccgcccac 3824

tccgagacga aggcccgggt ccaggccagc acgaaggagg ccacgtggga ggggtagcgg 3884

tcgttgtcca ccagcgggtc caccttctcc agggtatgca agcacatgtc cccctcgtcc 3944

acatccagga aggtgattgg cttgtaagtg taggccacgt gaccgggggt cccggccggg 4004

ggggtataaa agggggcggg cccctgctcg tcctcactgt cttccggatc gctgtccagg 4064

agcgccagct gttggggtag gtattccctc tcgaaggcgg gcatgacctc ggcactcagg 4124

ttgtcagttt ctagaaacga ggaggatttg atattgacgg tgccgttgga gacgcctttc 4184

atgagcccct cgtccatctg gtcagaaaag acgatctttt tgttgtcgag cttggtggcg 4244

aaggagccgt agagggcgtt ggagagcagc ttggcgatgg agcgcatggt ctggttcttt 4304

tccttgtcgg cgcgctcctt ggcggcgatg ttgagctgca cgtactcgcg cgccacgcac 4364

ttccattcgg ggaagacggt ggtgagctcg tcgggcacga ttctgacccg ccagccgcgg 4424

ttgtgcaggg tgatgaggtc cacgctggtg gccacctcgc cgcgcagggg ctcgttggtc 4484

cagcagaggc gcccgccctt gcgcgagcag aaggggggca gcgggtccag catgagctcg 4544

tcgggggggt cggcgtccac ggtgaagatg ccgggcagga gctcggggtc gaagtagctg 4604

atgcaggtgc ccagatcgtc cagcgccgct tgccagtcgc gcacggccag cgcgcgctcg 4664

taggggctga ggggcgtgcc ccagggcatg gggtgcgtga gcgcggaggc atacatgccg 4724

cagatgtcgt agacgtagag gggctcctct aggacgccga tgtaggtggg gtagcagcgc 4784

cccccgcgga tgctggcgcg cacgtagtcg tacagctcgt gcgagggcgc gaggaggccc 4844

gtgccgaggt tggagcgctg cggcttttcg gcgcggtaga cgatctggcg gaagatggcg 4904

tgggagttgg aggagatggt gggcctctgg aagatgttga agtgggcgtg gggcaggccg 4964

accgagtccc tgatgaagtg ggcgtaggag tcctgcagct tggcgacgag ctcggcggtg 5024

acgaggacgt ccagggcgca gtagtcgagg gtctcttgga tgatgtcata cttgagctgg 5084

cccttctgct tccacagctc gcggttgaga aggaactctt cgcggtcctt ccagtactct 5144

tcgaggggga acccgtcctg atcggcacgg taagagccca ccatgtagaa ctggttgacg 5204

gccttgtagg cgcagcagcc cttctccacg gggagggcgt aggcttgcgc ggccttgcgc 5264

agggaggtgt gggtgagggc gaaggtgtcg cgcaccatga ccttgaggaa ctggtgcttg 5324

aagtcgaggt cgtcgcagcc gccctgctcc cagagttgga agtccgtgcg cttcttgtag 5384

gcggggttgg gcaaagcgaa agtaacatcg ttgaagagga tcttgcccgc gcggggcatg 5444

aagttgcgag tgatgcggaa aggctggggc acctcggccc ggttgttgat gacctgggcg 5504

gcgaggacga tctcgtcgaa gccgttgatg ttgtgcccga cgatgtagag ttccacgaat 5564

cgcgggcggc ccttgacgtg gggcagcttc ttgagctcgt cgtaggtgag ctcggcgggg 5624

tcgctgagcc cgtgctgctc gagggcccag tcggcgacgt gggggttggc gctgaggaag 5684

gaagtccaga gatccacggc cagggcggtc tgcaagcggt cccggtactg acggaactgc 5744

tgacccacgg ccattttttc gggggtgacg cagtagaagg tgcgggggtc gccgtgccag 5804

cggtcccact tgagctggag ggcgaggtcg tgggcgagct cgacgagcgg cgggtccccg 5864

gagagtttca tgaccagcat gaaggggacg agctgcttgc cgaaggaccc catccaggtg 5924

taggtttcca catcgtaggt gaggaagagc ctttcggtgc gaggatgcga gccgatgggg 5984

aagaactgga tctcctgcca ccagttggag gaatggctgt tgatgtgatg gaagtagaaa 6044

tgccgacggc gcgccgagca ctcgtgcttg tgtttataca agcgtccgca gtgctcgcaa 6104

cgctgcacgg gatgcacgtg ctgcacgagc tgtacctggg ttcctttgac gaggaatttc 6164

agtgggcagt ggagcgctgg cggctgcatc tcgtgctgta ctacgtcctg gccatcggcg 6224

tggccatcgt ctgcctcgat ggtggtcatg ctgacgagcc cgcgcgggag gcaggtccag 6284

acctcggctc ggacgggtcg gagagcgagg acgagggcgc gcaggccgga gctgtccagg 6344

gtcctgagac gctgcggagt caggtcagtg ggcagcggcg gcgcgcggtt gacttgcagg 6404

agcttttcca gggcgcgcgg gaggtccaga tggtacttga tctccacggc gccgttggtg 6464

gcgacgtcca cggcttgcag ggtcccgtgc ccctggggcg ccaccaccgt gccccgtttc 6524

ttcttgggcg ctgcttccat gtcggtcaga agcggcggcg aggacgcgcg ccgggcggca 6584

ggggcggctc ggggcccgga ggcaggggcg gcaggggcac gtcggcgccg cgcgcgggca 6644

ggttctggta ctgcgcccgg agaagactgg cgtgagcgac gacgcggcgg ttgacgtcct 6704

ggatctgacg cctctgggtg aaggccacgg gacccgtgag tttgaacctg aaagagagtt 6764

cgacagaatc aatttcggta tcgttgacgg cggcctgccg caggatctct tgcacgtcgc 6824

ccgagttgtc ctggtaggcg atctcggtca tgaactgctc gatctcctcc tcctgaaggt 6884

ctccgcggcc ggcgcgctcg acggtggccg cgaggtcgtt ggagatgcgg cccatgagct 6944

gcgagaaggc gttcatgccg gcctcgttcc agacgcggct gtagaccacg gctccgtcgg 7004

ggtcgcgcgc gcgcatgacc acctgggcga ggttgagctc gacgtggcgc gtgaagaccg 7064

cgtagttgca gaggcgctgg tagaggtagt taagcgtggt ggcgatgtgc tcggtgacga 7124

agaagtacat gatccagcgg cggagcggca tctcgctgac gtcgcccagc gcctccaagc 7184

gttccatggc ctcgtaaaag tccacggcga agttgaaaaa ctgggagttg cgcgccgaga 7244

cggtcaactc ctcctccaga agacggatga gctcggcgat ggtggcgcgc acctcgcact 7304

cgaaggcccc cgggagttcc tccacttcct cttcttccat ctcctccact aacatctctt 7364

ctacttcctc ctcaggcggt ggtggcgggg gagggggcct gcgtcgccgg cggcgcacgg 7424

gcagacggtc gatgaaacgc tcgatggtct cgccgcgccg gcgtcgcatg gtctcggtga 7484

cggcgcgccc gtcctcgcgg ggccgcagcg tgaagacgcc gccgcgcatc tccaggtggc 7544

cgggggggtc cccgttgggc agggagaggg cgctgacgat gcatcttatc aattgccccg 7604

tagggactcc gcgcaaggac ctgagcgtct cgagatccac gggatctgaa aaccgttgaa 7664

cgaaggcttc gagccagtcg cagtcgcaag gtaggctgag cacggtttct tctggcgggt 7724

catgttgggg agcggggcgg gcgatgctgc tggtgatgaa gttgaaatag gcggttctga 7784

gacggcggat ggtggcgagg agcaccaggt ctttgggccc ggcttgctgg atgcgcagac 7844

ggtcggccat gccccaggcg tggtcctgac acctggccag gtccttgtag tagtcctgca 7904

tgagccgctc cacgggcacc tcctcctcgc ccgcgcggcc gtgcatgcgc gtgagcccga 7964

agccgcgctg gggctggacg agcgccaggt cggcgacgac gcgctcggcg aggatggcct 8024

gctgtatctg ggtgagggtg gtctggaagt cgtcaaagtc gacgaagcgg tggtaggctc 8084

cggtgttgat ggtgtaggag cagttggcca tgacggacca gttgacggtc tggtgaccgg 8144

gacgcacgag ctcgtggtac ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt 8204

tgcaggtgcg caccaggtac tgatagccga tgaggaagtg cggcggcggc tggcggtaga 8264

gcggccatcg ctcggtggcg ggggcgccgg gcgcgaggtc ctcgagcatg gtgcggtggt 8324

agccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag gcgcgcggga 8384

actcgcggac gcggttccag atgttgcgca gcggcaggaa gtagttcatg gtgggcacgg 8444

tctggcccgt gaggcgcgcg cagtcgtgga tgctctatac gggcaaaaac gaaagcggtc 8504

agcggctcga ctccgtggcc tggaggctaa gcgaacgggt tgggctgcgc gtgtaccccg 8564

gttcgaatct cgaatcaggc tggagccgca gctaacgtgg tactggcact cccgtctcga 8624

cccaagcctg caccaaccct ccaggatacg gaggcgggtc gttttgcaac ttttttcgga 8684

ggcctgaaaa ctagtaagcg cggaaagcgg ccgaccgcga tggctcgctg ccgtagtctg 8744

gagaagaatc gccagggttg cgttgcggtg tgccccggtt cgaggccggc cggattccgc 8804

ggctaacgag ggcgtggctg ccccgtcgtt tccaagaccc cctagccagc cgacttctcc 8864

agttacggag cgagcccctc ttttgttttg tttgtttttg ccagatgcat cccgtactgc 8924

ggcagatgcg cccccaccac cctccaccgc aacaacagcc ccctccacag ccggcgcttc 8984

tgcccccgcc ccagcagcaa cttccagcca cgaccgccgc ggccgccgtg agcggggctg 9044

gccagacttc tcagtatgac cacctggcct tggaagaggg cgaggggctg gcgcgcctgg 9104

gggcgtcgtc gccggagcgg cacccgcgcg tgcagatgaa aagggacgct cgcgaggcct 9164

acgtgcccaa gcagaacctg ttcagagaca ggagcggcga ggagcccgag gagatgcgcg 9224

cggcccggtt ccacgcgggg cgggagctgc gccgcggcct ggaccgaaag agggtgctga 9284

gggacgagga tttcgaggcg gacgagctga cggggatcag ccccgcgcgc gcgcacgtgg 9344

ccgcggccaa cctggtcacg gcgtacgagc agaccgtgaa ggaggagagc aacttccaaa 9404

aatccttcaa caaccacgtg cgcaccctga tcgcgcgcga ggaggtgacc ctgggcctga 9464

tgcacctgtg ggacctgctg gaggccatcg tgcagaaccc caccagcaag ccgctgacgg 9524

cgcagctgtt cctggtggtg caacatagtc gggacaacga ggcgttcagg gaggcgctgc 9584

tgaatatcac cgagcccgag ggccgctggc tcctggacct ggtgaacatt ctgcagagca 9644

tcgtggtgca ggagcgcggg ctgccgctgt ccgagaagct ggcggccatt aacttctcgg 9704

tgctgagtct gggcaagtac tacgctagga agatctacaa gaccccgtac gtgcccatag 9764

acaaggaggt gaagatcgac gggttttaca tgcgcatgac cctgaaagtg ctgaccctga 9824

gcgacgatct gggggtgtac cgcaacgaca ggatgcaccg cgcggtgagc gccagcaggc 9884

ggcgcgagct gagcgaccag gagctgatgc acagcctgca gcgggccctg accggggccg 9944

ggaccgaggg ggagagctac tttgacatgg gcgcggacct gcactggcag cccagccgcc 10004

gggccttgga ggcggcggca ggaccttatg tagaagaggt ggacgatgag gtggacgagg 10064

agggcgagta cctggaagac tgatggcgcg accgtatttt tgctagatgc aacaacaaca 10124

gccacctcct gatcccgcga tgcgggcggc gctgcagagc cagccgtccg gcattaactc 10184

ctcggacgat tggacccagg ccatgcaacg catcatggcg ctgacgaccc gcaaccccga 10244

agcctttaga cagcagcccc aggccaaccg gctctcggcc atcctggagg ccgtggtgcc 10304

ctcgcgctcc aaccccacgc acgagaaggt cctggccatc gtgaacgcgc tggtggagaa 10364

caaggccatc cgcggcgacg aggccggcct ggtgtacaac gcgctgctgg agcgcgtggc 10424

ccgctacaac agcaccaacg ttcagaccaa cctggaccgc atggtgaccg acgtgcgcga 10484

ggccgtggcc cagcgcgagc ggttccaccg cgagtccaac ctgggatcca tggtggcgct 10544

gaatgccttc ctcagcaccc agcccgccaa cgtgccccgg ggccaggagg actacaccaa 10604

cttcatcagt gccctgcgcc tgatggtgac cgaggtgccc cagagcgagg tgtaccagtc 10664

cgggccggac tacttcttcc agaccagtcg ccagggcttg cagaccgtga acctgagcca 10724

ggctttcaag aacttgcagg gattgtgggg cgtgcaggcc ccggtcgggg accgcgcgac 10784

ggtgtcgagc ctgctgacgc cgaactcgcg cctgctgctg ctgctggtgg cccccttcac 10844

ggacagcggc agcatcaacc gcaactcgta cctgggctac ctgattaacc tgtaccgcga 10904

ggccatcggc caggcgcacg tggacgagca gacctaccag gagatcaccc acgtgagccg 10964

cgccctgggc caggacgacc cgggcaacct ggaagccacc ctgaactttt tgctgaccaa 11024

ccggtcgcag aagatcccgc cccagtacgc gctcagcgcc gaggaggagc gcattctgcg 11084

atacgtgcag cagagcgtgg gcctgttcct gatgcaggag ggggccaccc ccagcgccgc 11144

gctcgacatg accgcgcgca acatggagcc cagcatgtac gccagcaacc gcccgttcat 11204

caataaactg atggactact tgcatcgggc ggccgccatg aactctgact atttcaccaa 11264

cgccatcctg aatccccact ggctcccgcc accggggttc tacacgggcg agtacgacat 11324

gcccgacccc aatgacgggt tcctgtggga cgatgtggac agcagcgtgt tctccccccg 11384

accgggtgct aacgagcgcc ccttgtggaa gaaggaaggc agcgaccgac gcccgtcctc 11444

ggcgctgtcc ggccgcgagg gtgctgccgc ggcggtgccc gaagccgcca gtcctttccc 11504

gagcttgccc ttctcgctaa acagcattcg cagcagcgag ctgggcagga tcacgcgccc 11564

gcgcttgctg ggcgaggagg agtacttgaa tgactcgctg ttgagacccg agcgggagaa 11624

gaacttcccc aataacggaa tagagagcct ggtggacaag atgagccgct ggaaaacgta 11684

tgcgcaggag cacagggacg atccccgggc gtcgcagggg gccacgagcc ggggcagcgc 11744

ggcccgtaaa cgccggtggc acgacaggca gcggggactg atgtgggacg atgaggattc 11804

cgccgacgac agcagcgtgt tggacttggg tgggagtggt ggtaacccgt tcgctcacct 11864

gcgcccccgc atcgggcgca tgatgtaaga aaccgaaaat aaatgatact caccaaggcc 11924

atggcgacca gcgtgcgttc gtttcttctc tgttgtttag tatgatgagg cgtgcgtacc 11984

cggagggtcc tcctccctcg tacgagagcg tgatgcagca ggcgatggcg gcggcgatgc 12044

agcccccgct ggaggctcct tacgtgcccc cgcggtacct ggcgcctacg gaggggcgga 12104

acagcattcg ttactcggag ctggcaccct tgtacgatac cacccggttg tacctggtgg 12164

acaacaagtc ggcggacatc gcctcgctga actaccagaa cgaccacagc aacttcctga 12224

ccaccgtggt gcagaacaat gacttcaccc ccacggaggc cagcacccag accatcaact 12284

ttgacgagcg ctcgcggtgg ggcggccagc tgaaaaccat catgcacacc aacatgccca 12344

acgtgaacga gttcatgtac agcaacaagt tcaaggcgcg ggtgatggtc tcccgcaaga 12404

cccccaacgg ggtcaaagta gatgatgact atgatggtag tcaggatatc ttggagtatg 12464

agtgggtgga gtttgagctg cccgaaggca acttctcggt gaccatgacc atcgacctga 12524

tgaacaacgc cattatcgac aattacttgg cggtggggcg gcagaacggg gtgctggaga 12584

gcgacatcgg cgtgaagttc gacacgcgca acttcaggct cggttgggac cccgtgaccg 12644

agctggtcat gccgggcgtg tacaccaacg aggccttcca ccccgacatc gtcctgctgc 12704

ccggctgcgg cgtggacttc accgagagcc gcctcagcaa cctgctgggc atccgcaagc 12764

ggcagccctt ccaggagggt ttccagatca tgtacgagga cctggagggg ggcaacatcc 12824

ccgcgctgct ggacgtcgaa gcctacgaga aaagcaagga ggaggccgcc gcagcggcga 12884

ccgcggccgt ggctactgct gcgaccaccg atgcagatgc agctactact accaggggcg 12944

atacattcgc cacccaggcg gaggaagcag ccgccctagc ggcgaccgat gatagtgaaa 13004

gtaagatagt catcaagccg gtggagaagg acagcaagga caggagctac aacgttctat 13064

cggatggaaa gaacaccgcc taccgcagct ggtacctggc ctacaactac ggcgaccctg 13124

agaagggcgt gcgctcctgg acgctgctca ccacctcgga cgtcacctgc ggcgtggagc 13184

aagtctactg gtcgctgccc gacatgatgc aagacccggt caccttccgc tccacgcgtc 13244

aagttagcaa ctacccggtg gtgggcgccg agctcctgcc cgtctactcc aagagcttct 13304

tcaacgagca ggccgtctac tcgcagcagc tgcgcgcctt cacctcgctc acgcacgtct 13364

tcaaccgctt ccccgagaac cagatcctcg tccgcccgcc cgcgcccacc attaccaccg 13424

tcagtgaaaa cgttcctgct ctcacagatc acgggaccct gccgctgcgc agcagtatcc 13484

ggggagtcca gcgcgtgacc gtcactgacg ccagacgccg cacctgcccc tacgtctaca 13544

aggccctggg catagtcgcg ccgcgcgtcc tctcgagccg caccttctaa aaaatgtcca 13604

ttctcatctc gcccagtaat aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg 13664

gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 13724

cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc 13784

aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc gcccgtctcc accgtggacg 13844

ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc ccgcgccaag agccggcggc 13904

ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 13964

gcagggccag gcgcacggga cgcagggcca tgctcagggc ggccagacgc gcggcctccg 14024

gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 14084

ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 14144

tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgctgactt cgcgatgttg 14204

atgtgtccca gcggcgagga ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt 14264

catcgcgcct gagatctacg gccccgcggc ggcggtgaag gaggaaagaa agccccgcaa 14324

actgaagcgg gtcaaaaagg acaaaaagga ggaggaagat gacggactgg tggagtttgt 14384

gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct 14444

gcggcccggc accacggtgg tcttcacgcc cggcgagcgt tccggctccg cctccaagcg 14504

ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggccg agcgtctggg 14564

cgagtttgct tacggcaagc gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 14624

cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc agcaggtgct 14684

gccgagcgcg gcgccgcgcc ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac 14744

catgcagctg atggtgccca agcgccagaa gctggaggac gtgctggagc acatgaaggt 14804

ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 14864

gggcgtgcag accgtggaca tcaagatccc cacggagccc atggaaacgc agactgagcc 14924

cgtgaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc cggcgccggc 14984

ttccaccacc accacccgcc gaagacgcaa gtacggcgcg gccagcctgc tgatgcccaa 15044

ctacgcgctg catccttcca tcatccccac gccgggctac cgcggcacgc gcttctaccg 15104

cggctacacc agcagccgcc gccgcaagac caccacccgc cgccgccgtc gtcgcagccg 15164

ccgcagcagc accgcgactt ccgccttggt gcggagagtg taccgcagcg ggcgcgagcc 15224

tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc atttaactac cgcctcctac 15284

ttgcagatat ggccctcaca tgccgcctcc gcgtccccat tacgggctac cgaggaagaa 15344

agccgcgccg tagaaggctg acggggaacg ggctgcgtcg ccatcaccac cggcggcggc 15404

gcgccatcag caagcggttg gggggaggct tcctgcccgc gctgatcccc atcatcgccg 15464

cggcgatcgg ggcgatcccc ggcatagctt ccgtggcggt gcaggcctct cagcgccact 15524

gagacacagc ttggaaaatt tgtaataaaa aatggactga cgctcctggt cctgtgatgt 15584

gtgtttttag atggaagaca tcaatttttc gtccctggca ccgcgacacg gcacgcggcc 15644

gtttatgggc acctggagcg acatcggcaa cagccaactg aacgggggcg ccttcaattg 15704

gagcagtctc tggagcgggc ttaagaattt cgggtccacg ctcaaaacct atggcaacaa 15764

ggcgtggaac agcagcacag ggcaggcgct gagggaaaag ctgaaagagc agaacttcca 15824

gcagaaggtg gtcgatggcc tggcctcggg catcaacggg gtggtggacc tggccaacca 15884

ggccgtgcag aaacagatca acagccgcct ggacgcggtc ccgcccgctg ggtccgtgga 15944

catgccccag gtggaggagg agctgcctcc cctggacaag cgcggcgaca agcgaccgcg 16004

tcccgacgct gaggagacgc tgctgacgca cacggacgag ccgcccccgt acgaggaggc 16064

ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa 16124

acccagcagc agcagccagc ccgcgaccct ggacttgcct ccgcctgctt cccgcccctc 16184

cacagtggct aagcccctgc cgccggtggc cgtcgcgtcg cgcgcccccc gaggccgccc 16244

ccaggcgaac tggcagagca ctctgaacag catcgtgggt ctgggagtgc agagtgtgaa 16304

gcgccgccgc tgctattaaa agacactgta gcgcttaact tgcttgtctg tgtgtatgta 16364

tgtccgccga ccagaaggag gaggaagagg cgcgtcgccg agttgcaaga tggccacccc 16424

atcgatgctg ccccagtggg cgtacatgca catcgccgga caggacgctt cggagtacct 16484

gagtccgggt ctggtgcagt tcgcccgcgc cacagacacc tacttcagtc tggggaacaa 16544

gtttaggaac cccacggtgg cgcccacgca cgatgtgacc accgaccgca gccagcggct 16604

gacgctgcgc ttcgtgcccg tggaccgcga ggacaacacc tactcgtaca aagtgcgcta 16664

cacgctggcc gtgggtgaca accgcgtgct ggacatggcc agcacctact ttgacatccg 16724

cggcgtgctg gatcggggcc ccagcttcaa accctactcc ggcaccgcct acaacgctct 16784

ggctcccaaa gctgctccta atcctagtca gtggtatgaa accaccgagg gaacaaatgg 16844

aaacgctgac gccactacta ctaaacatag ttttggtttg gctgccatga agggcgataa 16904

tattgcagct gaaggactgc aaataggaac agattctacc tctggatctg atacacccat 16964

ctatgcagat aaactgtatc agccagagcc tcagatagga gaggaatcat ggacagacaa 17024

cgatggaacc aatgaaaaat tcggtggaag agttcttaaa cctgacacca acatgaaacc 17084

ctgctatggc tcatttgcca aacccactaa taacaaaggt ggtcaagcaa aacaaaaggc 17144

aacggaagga acagctgtgg attatgatgt tgacatgaac tttttcgatg gtagagatgc 17204

agctgctaac tttactccag aagtagtgtt gtatgctgaa aatgtggatt tggaaactcc 17264

agacacccat attgtataca aaccaggaac ttcagatttg agttcccatg tgaatttggg 17324

tcaacagtcc atgcccaaca gacccaacta cattggattc agagataact ttatcgggct 17384

tatgtactac aacagcactg gcaatatggg tgtactggct ggtcaggcct cccagctgaa 17444

tgctgtggtg gacttgcagg acagaaacac cgaactgtcc taccagctct tgcttgactc 17504

tctgggtgac agaaccaggt atttcagtat gtggaatcag gcggtggaca gttatgaccc 17564

cgatgtgcgc attattgaaa accatggtat agaagatgaa cttcccaact actgtttccc 17624

aattgatgct gtgggcatca ccagaaccta tcaaggcata aaagtacaaa atggaactgg 17684

cactggaacg acatgggaaa aagatactga tgttagtaca gccaacgaaa taggaattgg 17744

caacaatctg gccatggaga tcaacatcca ggccaacctg tggcggaact tcctctacgc 17804

gaacgtggcg ctgtacctgc ccgactccta caagtacacg ccggccaaca tcacgctgcc 17864

caccaacacc aacacctacg attacatgaa cggccgcgtg gtggcgccct cgctggtgga 17924

cgcctacatc aacatcgggg cgcgctggtc gctggacccc atggacaacg tcaatccctt 17984

caaccaccac cgcaacgcgg gcctgcgcta ccgctccatg ctcctgggca acgggcgcta 18044

cgtgcccttc cacatccagg tgccccaaaa gtttttcgcc atcaagagcc tcctgctcct 18104

gcccgggtcc tacacctacg agtggaactt ccgcaaggac gtcaacatga tcctgcagag 18164

ctccctcggc aacgacctgc gcacggacgg ggcctccatc gccttcacca gcatcaacct 18224

ctacgccacc ttcttcccca tggcgcacaa caccgcctcc acgctcgagg ccatgctgcg 18284

caacgacacc aacgaccagt ccttcaacga ctacctctcg gcggccaaca tgctctaccc 18344

catcccggcc aacgccacca acgtgcccat ctccatcccc tcgcgcaact gggccgcctt 18404

ccgcggatgg tccttcacgc gcctcaagac ccgcgagacg ccctcgctcg gctccgggtt 18464

cgacccctac ttcgtctact cgggctccat cccctacctc gacggcacct tctacctcaa 18524

ccacaccttc aagaaggtct ccatcacctt cgactcctcc gtcagctggc ccggcaacga 18584

ccgcctcctg acgcccaacg agttcgaaat caagcgcacc gtcgacggag aggggtacaa 18644

cgtggcccag tgcaacatga ccaaggactg gttcctggtc cagatgctgg cccactacaa 18704

catcggctac cagggcttct acgtgcccga gggctacaag gaccgcatgt actccttctt 18764

ccgcaacttc cagcccatga gccgccaggt cgtggacgag gtcaactaca aggactacca 18824

ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc gtcggctacc tcgcgcccac 18884

catgcgccag ggccagccct accccgccaa ctacccctac ccgctcatcg gcaagagcgc 18944

cgtcaccagc gtcacccaga aaaagttcct ctgcgaccgg gtcatgtggc gcatcccctt 19004

ctccagcaac ttcatgtcca tgggcgcgct caccgacctc ggccagaaca tgctctatgc 19064

caactccgcc cacgcgctag acatgaattt cgaagtcgac cccatggatg agtccaccct 19124

tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg caccagcccc accgcggcgt 19184

catcgaggcc gtctacctgc gcacgccctt ctcggccggc aacgccacca cctaagccgc 19244

tcttgcttct tgcaagatga cggcctgtgg ctccggcgag caggagctca gggccatcct 19304

ccgcgacctg ggctgcgggc cctgcttcct gggcaccttc gacaagcgct tcccgggatt 19364

catggccccg cacaagctgg cctgcgccat cgtcaacacg gccggccgcg agaccggggg 19424

cgagcactgg ctggccttcg cctggaaccc gcgctcccac acctgctacc tcttcgaccc 19484

cttcgggttc tcagacgagc gcctcaagca gatctaccag ttcgagtacg agggcctgct 19544

gcgccgcagc gccctggcca ccgaggaccg ctgcgtcacc ctggaaaagt ccacccagac 19604

cgtgcagggt ccgcgctcgg ccgcctgcgg gctcttctgc tgcatgttcc tgcacgcctt 19664

cgtgcactgg cccgaccgcc ccatggacaa gaaccccacc atgaacttgc tgacgggggt 19724

gcccaacggc atgctccagt cgccccaggt ggaacccacc ctgcgccgca accaggaagc 19784

gctctaccgc ttcctcaacg cccactccgc ctactttcgc tcccaccgcg cgcgcatcga 19844

gaaggccacc gccttcgacc gcatgaatca agacatgtaa accgtgtgta tgtgaatgct 19904

ttattcatca taataaacag cacatgttta tgccaccttc tctgaggctc tgactttatt 19964

tagaaatcga aggggttctg ccggctctcg gcgtgccccg cgggcaggga tacgttgcgg 20024

aactggtact tggacagcca cttgaactcg gggatcagca gcttgggcac ggggaggtcg 20084

gggaacgagt cgctccacag cttgcgcgtg agttgcaggg cgcccagcag gtcgggcgcg 20144

gagatcttga aatcgcagtt gggacccgcg ttctgcgcgc gagagttacg gtacacgggg 20204

ttgcagcact ggaacaccat cagggccggg tgcttcacgc tcgccagcac cgtcgcgtcg 20264

gtgatgccct ccacgtccag atcctcggcg ttggccatcc cgaagggggt catcttgcag 20324

gtctgccgcc ccatgctggg cacgcagccg ggcttgtggt tgcaatcgca gtgcaggggg 20384

atcagcatca tctgggcctg ctcggagctc atgcccgggt acatggcctt catgaaagcc 20444

tccagctggc ggaaggcctg ctgcgccttg ccgccctcgg tgaagaagac cccacaggac 20504

ttgctagaga actggttggt ggcgcagccc gcgtcgtgca cgcagcagcg cgcgtcgttg 20564

ttggccagct gcaccacgct gcgcccccag cggttctggg tgatcttggc ccggtcgggg 20624

ttctccttca gcgcgcgctg cccattctcg ctcgccacat ccatctcgat cgtgtgctcc 20684

ttctggatca tcacggtccc gtgcaggcac cgcagcttgc cctcggcctc ggtgcacccg 20744

tgcagccaca gcgcgcagcc ggtgctctcc cagttcttgt gggcgatctg ggagtgcgag 20804

tgcacgaagc cctgcaggaa gcggcccatc atcgtggtca gggtcttgtt gctggtgaag 20864

gtcagcggga tgccgcggtg ctcctcgttc acatacaggt ggcagatgcg gcggtacacc 20924

tcgccctgct cgggcatcag ctggaaggcg gacttcaggt cgctctccac gcggtaccgg 20984

tccatcagca gcgtcatcac ttccatgccc ttctcccagg ccgaaacgat cggcaggctc 21044

agggggttct tcaccgtcat cttagtcgcc gccgccgagg tcagggggtc gttctcgtcc 21104

agggtctcaa acactcgctt gccgtccttc tcggtgatgc gcacgggggg gaaggcgaag 21164

cccacggccg ccagctcctc ctcggcctgc ctttcgtcct cgctgtcctg gctgatgtct 21224

tgcaaaggca catgcttggt cttgcggggt ttctttttgg gtggcagagg cggcggcgga 21284

gacgtgctgg gcgagcgcga gttttcgctc accacgacta tttcttctcc ttggccgtcg 21344

tccgagacca cgcggcggta ggcatgcctc ttctggggca gaggcggagg cgacgggctc 21404

tcgcggttcg gcgggcggct ggcagagccc cttccgcgtt cgggggtgcg ctcctggcgg 21464

cgctgctctg actgacttcc tccgcggccg gccattatgt tctcctaggg agcaacaagc 21524

atggagactc agccatcgtc gccaacatcg ccatctgccc ccgccgacga gaaccagcag 21584

cagaatgaaa gcttaaccgc cccgccgccc agccccacct ccgacgccgc agccccagac 21644

atgcaagaga tggagaaatc catcgagatt gacctgggct acgtgacgcc cgcggagcac 21704

gaggaggagc tggcagcgcg cttttcagcc ccggaagaga accaccaaga gcagccagag 21764

caggaagcag agagcgagca gcagcaggct gggctcgagc atggcgacta cctgagcggg 21824

gcagaggacg tgctcatcaa gcatctggcc cgccaatgca tcatcgtcaa ggacgcgctg 21884

ctcgaccgcg ccgaggtgcc cctcagcgtg gcggagctca gccgcgccta cgagcgcaac 21944

ctcttctcgc cgcgcgtgcc ccccaagcgc cagcccaacg gcacctgcga gcccaacccg 22004

cgcctcaact tctacccggt cttcgcggtg cccgaggccc tggccaccta ccacatcttt 22064

ttcaagaacc aaaagatccc cgtctcctgc cgcgccaacc gcacccgcgc cgacgccctg 22124

ctcaacctgg gtcccggcgc ccgcctacct gatatcgcct ccttggaaga ggttcccaag 22184

atcttcgagg gtctgggcag cgacgagact cgggccgcga acgctctgca aggaagcgga 22244

gaggagcatg agcaccacag cgccctggtg gagttggaag gcgacaacgc gcgcctggcg 22304

gtcctcaagc gcacggtcga gctgacccac ttcgcctacc cggcgctcaa cctgcccccc 22364

aaggtcatga gcgccgtcat ggaccaggtg ctcatcaagc gcgcctcgcc cctctcggag 22424

gaggagatgc aggaccccga gagctcggac gagggcaagc ccgtggtcag cgacgagcag 22484

ctggcgcgct ggctgggagc gagtagcacc ccccagagcc tggaagagcg gcgcaagctc 22544

atgatggccg tggtcctggt gaccgtggag ctggagtgtc tgcgccgctt cttcgccgac 22604

gcggagaccc tgcgcaaggt cgaggagaac ctgcactacc tcttcaggca cgggttcgtg 22664

cgccaggcct gcaagatctc caacgtggag ctgaccaacc tggtctccta catgggcatc 22724

ctgcacgaga accgcctggg gcagaacgtg ctgcacacca ccctgcgcgg ggaggcccgc 22784

cgcgactaca tccgcgactg cgtctacctg tacctctgcc acacctggca gacgggcatg 22844

ggcgtgtggc agcagtgcct ggaggagcag aacctgaaag agctctgcaa gctcctgcag 22904

aagaacctga aggccctgtg gaccgggttc gacgagcgca ccaccgccgc ggacctggcc 22964

gacctcattt tccccgagcg cctgcggctg acgctgcgca acgggctgcc cgactttatg 23024

agccaaagca tgttgcaaaa ctttcgctct ttcatcctcg aacgctccgg gatcctgccc 23084

gccacctgct ccgcgctgcc ctcggacttc gtgccgctga ccttccgcga gtgccccccg 23144

ccgctctgga gccactgcta cttgctgcgc ctggccaact acctggccta ccactcggac 23204

gtgatcgagg acgtcagcgg cgagggtctg ctcgagtgcc actgtcgctg caacctctgc 23264

acgccgcacc gctccctggc ctgcaacccc cagctgctga gcgagaccca gatcatcggc 23324

accttcgagt tgcaaggccc cggcgaggag ggcaaggggg gtctgaaact caccccgggg 23384

ctgtggacct cggcctactt gcgcaagttc gtgcccgagg actaccatcc cttcgagatc 23444

aggttctacg aggaccaatc ccagccgccc aaggccgagc tgtcggcctg cgtcatcacc 23504

cagggggcca tcctggccca attgcaagcc atccagaaat cccgccaaga atttctgctg 23564

aaaaagggcc acggggtcta cttggacccc cagaccggag aggagctcaa ccccagcttc 23624

ccccagg atg ccc cga gga agc agc aag aag ctg aaa gtg gag ctg ccg 23673

Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro

505 510 515

ccg ccg ccg gag gat ttg gag gaa gac tgg gag agc agt cag gca gag 23721

Pro Pro Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu

520 525 530

gag atg gaa gac tgg gac agc act cag gca gag gag gac agc ctg caa 23769

Glu Met Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln

535 540 545

gac agt ctg gag gag gaa gac gag gtg gag gag gag gca gag gaa gaa 23817

Asp Ser Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu

550 555 560

gca gcc gcc gcc aga ccg tcg tcc tcg gcg gag aaa gca agc agc acg 23865

Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr

565 570 575 580

gat acc atc tcc gct ccg ggt cgg ggt cgc ggc ggc cgg gcc cac agt 23913

Asp Thr Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser

585 590 595

agg tgg gac gag acc ggg cgc ttc ccg aac ccc acc acc cag acc ggt 23961

Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly

600 605 610

aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa aac gcc 24009

Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala

615 620 625

atc gtc tcc tgc ttg caa gcc tgc ggg ggc aac atc tcc ttc acc cgg 24057

Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg

630 635 640

cgc tac ctg ctc ttc cac cgc ggg gtg aac ttc ccc cgc aac atc ttg 24105

Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu

645 650 655 660

cat tac tac cgt cac ctc cac agc ccc tac tac tgt ttc caa gaa gag 24153

His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu

665 670 675

gca gaa acc cag cag cag cag cag aaa acc agc ggc agc tagaaaatcc 24202

Ala Glu Thr Gln Gln Gln Gln Gln Lys Thr Ser Gly Ser

680 685

acagcggcgg caggtggact gaggatcgcg gcgaacgagc cggcgcagac ccgggagctg 24262

aggaatcgga tctttcccac cctctatgcc atcttccagc agagtcgggg gcaggagcag 24322

gaactgaaag tcaagaaccg ttctctgcgc tcgctcaccc gcagttgtct gtatcacaag 24382

agcgaagacc aacttcagcg cactctcgag gacgccgagg ctctcttcaa caagtactgc 24442

gcgctcactc ttaaagagta gcccgcgccc gcccacacac ggaaaaaggc gggaattacg 24502

tcaccacctg cgcccttcgc catcatcatg agcaaagaga ttcccacgcc ttacatgtgg 24562

agctaccagc cccagatggg cctggccgcc ggcgccgccc aggactactc cacccgcatg 24622

aactggctca gtgccgggcc cgcgatgatc tcacgggtga atgacatcct cgcccaccga 24682

aaccagatac tcctagaaca gtcagcgatc accgccacgc cccgccatca ccttaatccg 24742

cgtaattggc ccgccgccct ggtgtaccag gaaattcccc agcccacgac cgtactactt 24802

ccgcgagacg cccaggccga agtccagctg actaactcag gtgtccagct ggccggcggc 24862

gccgccctgt gtcgtcaccg ccccgctcag ggtataaagc ggctggtgat ccgaggcaga 24922

ggcacacagc tcaacgacga ggtggtgagc tcttcgctgg gtctgcgacc tgacggagtc 24982

ttccaactcg ccggatcggg gagatcttcc ttcacgcctc gtcaggccgt cctgactttg 25042

gagagttcgt cctcgcagcc ccgctcgggc ggcatcggca ctctccagtt cgtggaggag 25102

ttcactccct cggtctactt caaccccttc tccggctccc ccggccacta cccggacgag 25162

ttcatcccga acttcgacgc catcagcgag tcggtggacg gctacgattg aatgtcccat 25222

ggtggcgcgg ctgacctagc tcggcttcga cacctggacc actgccgccg cttccgctgc 25282

ttcgctcggg atctcgccga gtttgcctac tttgagctgc ccgaggagca ccctcagggc 25342

ccggcccacg gagtgcggat cgtcgtcgaa gggggcctcg actcccacct gcttcggatc 25402

ttcagccagc gaccgatcct ggtcgagcgc gagcaaggac agacccgtct gaccctgtac 25462

tgcatctgca accaccccgg cctgc atg aaa gtc ttt gtt gtc tgc tgt gta 25514

Met Lys Val Phe Val Val Cys Cys Val

690 695

ctg agt ata ata aaa gct gag atc agc gac tac tcc gga ctc gat tgt 25562

Leu Ser Ile Ile Lys Ala Glu Ile Ser Asp Tyr Ser Gly Leu Asp Cys

700 705 710

ggt gtt cct gct atc aac cgg tcc ctg ttc ttc acc ggg aac gag acc 25610

Gly Val Pro Ala Ile Asn Arg Ser Leu Phe Phe Thr Gly Asn Glu Thr

715 720 725 730

gag ctc cag ctc cag tgt aag ccc cac aag aag tat ctc acc tgg ctg 25658

Glu Leu Gln Leu Gln Cys Lys Pro His Lys Lys Tyr Leu Thr Trp Leu

735 740 745

ttc cag ggc tcc ccg atc gcc gtt gtc aac cac tgc gac aac gac gga 25706

Phe Gln Gly Ser Pro Ile Ala Val Val Asn His Cys Asp Asn Asp Gly

750 755 760

gtc ctg ctg agc ggc ccc gcc aac ctt act ttt tcc acc cgc aga agc 25754

Val Leu Leu Ser Gly Pro Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser

765 770 775

aag ctc cag ctc ttc caa ccc ttc ctc ccc ggg acc tat cag tgc ctc 25802

Lys Leu Gln Leu Phe Gln Pro Phe Leu Pro Gly Thr Tyr Gln Cys Leu

780 785 790

tcg gga ccc tgc cat cac acc ttc cac ctg atc ccg aat acc aca gcg 25850

Ser Gly Pro Cys His His Thr Phe His Leu Ile Pro Asn Thr Thr Ala

795 800 805 810

gcg ctc ccc gct act aac aac caa act acc cac caa cgc cac cgt cgc 25898

Ala Leu Pro Ala Thr Asn Asn Gln Thr Thr His Gln Arg His Arg Arg

815 820 825

gac ctt tcc tct gaa tct aat act acc acc cac acc gga ggt gag ctc 25946

Asp Leu Ser Ser Glu Ser Asn Thr Thr Thr His Thr Gly Gly Glu Leu

830 835 840

cga ggt caa cca acc tct ggg att tac tac ggc ccc tgg gag gtg gtg 25994

Arg Gly Gln Pro Thr Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val

845 850 855

ggg tta ata gcg cta ggc cta gtt gcg ggt ggg ctt ttg gct ctc tgc 26042

Gly Leu Ile Ala Leu Gly Leu Val Ala Gly Gly Leu Leu Ala Leu Cys

860 865 870

tac cta tac ctc cct tgc tgt tct tac tta gtg gtg ctg tgt tgc tgg 26090

Tyr Leu Tyr Leu Pro Cys Cys Ser Tyr Leu Val Val Leu Cys Cys Trp

875 880 885 890

ttt aag aaa tgg gga aga tca ccc tagtgagctg cggtgcgctg gtggcggtgg 26144

Phe Lys Lys Trp Gly Arg Ser Pro

895

tgctttcgat tgtgggactg ggcggcgcgg ctgtagtgaa ggagaaggcc gatccctgct 26204

tgcatttcaa tcccgacaaa tgccagctga gttttcagcc cgatggcaat cggtgcgcgg 26264

tgctgatcaa gtgcggatgg gaatgcgaga acgtgagaat cgagtacaat aacaagactc 26324

ggaacaatac tctcgcgtcc gtgtggcagc ccggggaccc cgagtggtac accgtctctg 26384

tccccggtgc tgacggctcc ccgcgcaccg tgaacaatac tttcattttt gcacacatgt 26444

gcgacacggt catgtggatg agcaagcagt acgatatgtg gccccccacg aaggagaaca 26504

tcgtggtctt ctccatcgct tacagcctgt gcacggcgct aatcaccgct atcgtgtgcc 26564

tgagcattca catgctcatc gctattcgcc ccagaaataa tgccgaaaaa gagaaacagc 26624

cataacacgt tttttcacac acctttttca gaccatggcc tctgttaaat tttttgcttt 26684

atttgccagt ctcattaccg tcattcatgg aatgagtaat gagaaaatta ctatttacac 26744

tggcactaat cacacattga aaggtccaga aaaatccaca gaagtttcat ggtattgtta 26804

ttttaatgaa tcagatgtag ctactgaact ctgtggaaac aacaacaaaa aaaatgagag 26864

cattactctc atcaagtttc aatgtggatc tgacttaacc ctcattaaca tcactagaga 26924

ctatgtaggt atgtattatg gaactacagc aggcatttcg gacatggaat tttatcaagt 26984

ttctgtgtct gaacccacca cgcctagaat gaccacaacc acaaaaacta cacctactac 27044

caccacacag ctcactacca atggcttttt tgccatgctt caagtggctg aaaatagcac 27104

cagcattcaa cccaccccac ccagtgagga aattcccaaa tccatgattg gcattattgt 27164

tgctgtagtg gtgtgcatgt tgatcatcgc cttgtgcatg gtgtactatg ccttctgcta 27224

cagaaagcac agactgaacg acaagctgga acacttacta agtgttgaat tttaattttt 27284

tagaaccatg aagatcctat gccttttagt tttttatatc attacctctg ctctttgtga 27344

atcagtggat aaagatgtta ctattaccac tggttctaat tatacactga aaggaccacc 27404

ctcaggtatg ctttcgtggt attgctattt tggaactgac actgatcaaa ctgaattatg 27464

caattttcaa aaaggcaaaa cctcaaactc taaaatctct aattatcaat gcaatggcac 27524

tgatctgata ctactcaatg tcacgaaagc atatggtggc agttattcat gccctggaca 27584

aaacactgaa gaaatgattt tttacaaagt ggaagtggtt gatcccacta ctccacctcc 27644

acccgccaca actactcaca ccacacacac agaacaaagc acagcagagg cagcaaagtt 27704

agccttgcag gtccaagaca gttcatttgt tggcattacc cctacacctg atcagcggtg 27764

tccggggctg ctcgtcagcg gcattgtcgg tgtgctttcg ggattagcag tcataatcat 27824

ctgcatgttc atttttgctt gctgctatag aaggctttac cgacaaaaat cagacccact 27884

gctgaacctc tatgtttaat tttttccaga gccatgaagg cagttagcgc tctagttttt 27944

tgttctttga ttggcattgt ttttaatagt aaaattacca gagttagctt tattaaacat 28004

gttaatgtaa ctgaaggaga taacatcaca ctagcaggtg tagaaggtgc tcaaaacacc 28064

acctggacaa aataccatct aggatggaga gatatttgca cctggaatgt aacttattat 28124

tgcataggag ttaatcttac cattgttaac gctaaccaat ctcagaatgg gttaattaaa 28184

ggacagagtg ttagtgtgac cagtgatggg tactataccc agcatagttt taactacaac 28244

attactgtca taccactgcc tacgcctagc ccacctagca ctaccgcaca gacaaccaca 28304

tacagtacat caaatcagcc taccaccact acagcagcag aggttgccag ctcgtctggg 28364

gtccgagtgg catttttgat gttggcccca tctagcagtc ccactgctag taccaatgag 28424

cagactactg aatttttgtc cactgtcgag agccacacca cagctacctc cagtgccttc 28484

tctagcaccg ccaatctctc ctcgctttcc tctacaccaa tcagtcccgc tactactcct 28544

agccccgctc ctcttcccac tcccctgaag caaacagacg gcggcatgca atggcagatc 28604

accctgctca ttgtgatcgg gttggtcatc ctggccgtgt tgctatacta catcttctgc 28664

cgccgcattc ccaacgcgca ccgcaagccg gcctacaagc ccatcgttat cgggcagccg 28724

gagccgcttc aggtggaagg gggtctaagg aatcttctct tctcttttac agtatggtga 28784

ttgaattatg attcctagac aattcttgat cactattctt atctgcctcc tccaagtctg 28844

tgccaccctc gctctggtgg ccaacgccag tccagactgt attgggccct tcgcctccta 28904

cgtgctcttt gccttcatca cctgcatctg ctgctgtagc atagtctgcc tgcttatcac 28964

cttcttccag ttcattgact ggatctttgt gcgcatcgcc tacctgcgcc accaccccca 29024

gtaccgcgac cagcgagtgg cgcggctgct caggctcctc tgataagcat gcgggctctg 29084

ctacttctcg cgcttctgct gttagtgctc ccccgtcccg tcgacccccg gacccccacc 29144

cagtcccccg aggaggtccg caaatgcaaa ttccaagaac cctggaaatt cctcaaatgc 29204

taccgccaaa aatcagacat gcatcccagc tggatcatga tcattgggat cgtgaacatt 29264

ctggcctgca ccctcatctc ctttgtgatt tacccctgct ttgactttgg ttggaactcg 29324

ccagaggcgc tctatctccc gcctgaacct gacacaccac cacagcaacc tcaggcacac 29384

gcactaccac caccaccaca gcctaggcca caatacatgc ccatattaga ctatgaggcc 29444

gagccacagc gacccatgct ccccgctatt agttacttca atctaaccgg cggag atg 29502

Met

act gac cca ctg gcc aac aac aac gtc aac gac ctt ctc ctg gac atg 29550

Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp Met

900 905 910 915

gac ggc cgc gcc tcg gag cag cga ctc gcc caa ctt cgc att cgc cag 29598

Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln

920 925 930

cag cag gag aga gcc gtc aag gag ctg cag gac ggc ata gcc atc cac 29646

Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile His

935 940 945

cag tgc aag aaa ggc atc ttc tgc ctg gtg aaa cag gcc aag atc tcc 29694

Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Ser

950 955 960

tac gag gtc acc cag acc gac cat cgc ctc tcc tac gag ctc ctg cag 29742

Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln

965 970 975

cag cgc cag aag ttc acc tgc ctg gtc gga gtc aac ccc atc gtc atc 29790

Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile

980 985 990 995

acc cag cag tcg ggc gat acc aag ggg tgc atc cac tgc tcc tgc 29835

Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys

1000 1005 1010

gac tcc ccc gac tgc gtc cac act ctg atc aag acc ctc tgc ggc 29880

Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly

1015 1020 1025

ctc cgc gac ctc ctc ccc atg aac taatca 29910

Leu Arg Asp Leu Leu Pro Met Asn

1030

<210> SEQ ID NO 51

<211> LENGTH: 502

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 51

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ser Gly Leu Leu

1 5 10 15

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn

20 25 30

Leu Arg Leu Leu Ala Ser Thr Ala Gly Arg His Ala Glu Asp Pro Glu

35 40 45

Ser Pro Val Thr Pro Gly Thr Pro Thr Pro Pro Ala Ala Ala Ala Ala

50 55 60

Ala Arg Gly Gly Gly Gly Pro Arg Arg Glu Pro Glu Ser Arg Ser Gly

65 70 75 80

Pro Ser Gly Gly Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu

85 90 95

Cys Arg Val Leu Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys

100 105 110

Arg Glu Arg His Glu Glu Thr Ser His Arg Thr Glu Leu Thr Val Ser

115 120 125

Leu Met Ser Arg Arg Arg Pro Glu Ser Val Trp Trp His Glu Val Gln

130 135 140

Ser Gln Gly Val Asp Glu Val Ser Val Met His Glu Lys Tyr Ser Leu

145 150 155 160

Glu Gln Val Lys Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val

165 170 175

Ala Ile Arg Asn Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr

180 185 190

Lys Ile Thr Lys Leu Ile Asn Ile Arg Asn Ser Cys Tyr Ile Ser Gly

195 200 205

Asn Gly Ala Glu Val Glu Ile Ser Thr Gln Glu Arg Val Ala Phe Arg

210 215 220

Cys Cys Met Met Asn Met Tyr Pro Gly Val Val Gly Met Glu Gly Val

225 230 235 240

Thr Phe Met Asn Ala Arg Phe Arg Gly Asp Gly Tyr Asn Gly Val Val

245 250 255

Phe Met Ala Asn Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly

260 265 270

Phe Asn Asn Met Cys Ile Glu Ala Trp Gly Ser Val Ser Val Arg Gly

275 280 285

Cys Ser Phe Ser Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser

290 295 300

Lys Val Ser Val Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val

305 310 315 320

Met Ser Glu Gly Glu Ala Lys Val Lys His Cys Ala Ser Thr Glu Thr

325 330 335

Gly Cys Phe Val Leu Ile Lys Gly Asn Ala Lys Val Lys His Asn Met

340 345 350

Ile Cys Gly Ala Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala

355 360 365

Gly Gly Asn Ser His Met Leu Ala Thr Val His Val Ala Ser His Pro

370 375 380

Arg Lys Thr Trp Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn

385 390 395 400

Val His Leu Gly Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn

405 410 415

Met Gln Phe Val Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val

420 425 430

Ser Leu Thr Gly Val Phe Asp Met Asn Val Glu Leu Trp Lys Ile Leu

435 440 445

Arg Tyr Asp Glu Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly

450 455 460

Lys His Ala Arg Leu Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu

465 470 475 480

Arg Pro Asp His Leu Val Leu Ser Cys Asn Gly Thr Glu Phe Gly Ser

485 490 495

Ser Gly Glu Glu Ser Asp

500

<210> SEQ ID NO 52

<211> LENGTH: 187

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 52

Met Pro Arg Gly Ser Ser Lys Lys Leu Lys Val Glu Leu Pro Pro Pro

1 5 10 15

Pro Glu Asp Leu Glu Glu Asp Trp Glu Ser Ser Gln Ala Glu Glu Met

20 25 30

Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln Asp Ser

35 40 45

Leu Glu Glu Glu Asp Glu Val Glu Glu Glu Ala Glu Glu Glu Ala Ala

50 55 60

Ala Ala Arg Pro Ser Ser Ser Ala Glu Lys Ala Ser Ser Thr Asp Thr

65 70 75 80

Ile Ser Ala Pro Gly Arg Gly Arg Gly Gly Arg Ala His Ser Arg Trp

85 90 95

Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly Lys Lys

100 105 110

Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala Ile Val

115 120 125

Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg Arg Tyr

130 135 140

Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu His Tyr

145 150 155 160

Tyr Arg His Leu His Ser Pro Tyr Tyr Cys Phe Gln Glu Glu Ala Glu

165 170 175

Thr Gln Gln Gln Gln Gln Lys Thr Ser Gly Ser

180 185

<210> SEQ ID NO 53

<211> LENGTH: 209

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 53

Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu

1 5 10 15

Ile Ser Asp Tyr Ser Gly Leu Asp Cys Gly Val Pro Ala Ile Asn Arg

20 25 30

Ser Leu Phe Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys

35 40 45

Pro His Lys Lys Tyr Leu Thr Trp Leu Phe Gln Gly Ser Pro Ile Ala

50 55 60

Val Val Asn His Cys Asp Asn Asp Gly Val Leu Leu Ser Gly Pro Ala

65 70 75 80

Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Gln Leu Phe Gln Pro

85 90 95

Phe Leu Pro Gly Thr Tyr Gln Cys Leu Ser Gly Pro Cys His His Thr

100 105 110

Phe His Leu Ile Pro Asn Thr Thr Ala Ala Leu Pro Ala Thr Asn Asn

115 120 125

Gln Thr Thr His Gln Arg His Arg Arg Asp Leu Ser Ser Glu Ser Asn

130 135 140

Thr Thr Thr His Thr Gly Gly Glu Leu Arg Gly Gln Pro Thr Ser Gly

145 150 155 160

Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly Leu

165 170 175

Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys Cys

180 185 190

Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg Ser

195 200 205

Pro

<210> SEQ ID NO 54

<211> LENGTH: 135

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 54

Met Thr Asp Pro Leu Ala Asn Asn Asn Val Asn Asp Leu Leu Leu Asp

1 5 10 15

Met Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg

20 25 30

Gln Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Gly Ile Ala Ile

35 40 45

His Gln Cys Lys Lys Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile

50 55 60

Ser Tyr Glu Val Thr Gln Thr Asp His Arg Leu Ser Tyr Glu Leu Leu

65 70 75 80

Gln Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val

85 90 95

Ile Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys

100 105 110

Asp Ser Pro Asp Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu

115 120 125

Arg Asp Leu Leu Pro Met Asn

130 135

<210> SEQ ID NO 55

<211> LENGTH: 1440

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1325

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (882)..(1146)

<223> OTHER INFORMATION: E1a

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (1234)..(1439)

<223> OTHER INFORMATION: E1a

<400> SEQUENCE: 55

catcatcaat aatatacctc aaactttttg tgcgcgttaa tatgcaaatg aggcgtttga 60

atttggggat gcggggctgt gattggccga gaggagggcg accgttaggg gcggggcgag 120

tgacgttttg atgacgcgac cgtgaggagg agccagtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga aacctgcccg 600

aggagaaaat cctcatcgct tccgggaacg agattctgga actggtggta aatgccatga 660

tgggcgacga ccctccggag ccccccaccc catttgaggc gccttcgctg tacgatttgt 720

atgatctgga ggtggatgtg cccgagaacg accccaacga ggaggcggtg aatgatttgt 780

ttagcgatgc cgcgctgctg gctgccgagc aggctaatac ggactctggc tcagacagcg 840

attcctctct ccataccccg agacccggca gaggtgagaa a aag atc ccc gag ctt 896

Lys Ile Pro Glu Leu

1 5

aaa ggg gaa gag ctc gac ctg cgc tgc tat gag gaa tgc ttg cct ccg 944

Lys Gly Glu Glu Leu Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro

10 15 20

agc gat gat gag gag gac gag gag gcg att cga gct gca gcg agc gag 992

Ser Asp Asp Glu Glu Asp Glu Glu Ala Ile Arg Ala Ala Ala Ser Glu

25 30 35

gga gtg aaa gct gcg ggc gag agc ttt agc ctg gac tgt cct act ctg 1040

Gly Val Lys Ala Ala Gly Glu Ser Phe Ser Leu Asp Cys Pro Thr Leu

40 45 50

ccc gga cac ggc tgt aag tct tgt gaa ttt cat cgc atg aat act gga 1088

Pro Gly His Gly Cys Lys Ser Cys Glu Phe His Arg Met Asn Thr Gly

55 60 65

gat aag aat gtg atg tgt gcc ctg tgc tat atg aga gct tac aac cat 1136

Asp Lys Asn Val Met Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His

70 75 80 85

tgt gtt tac a gtaagtgtga ttaactttag ctgggaaggc agagggtgac 1186

Cys Val Tyr

tgggtgctga ctggtttatt tatgtatatg ttttttttta tgtgtag gt ccc gtc 1241

Ser Pro Val

90

tct gac gta gat gag acc ccc act tca gag tgt att tca tca ccc cca 1289

Ser Asp Val Asp Glu Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro

95 100 105

gaa att ggc gag gaa ccg ccc gaa gat atc att cat aga cca gtt gca 1337

Glu Ile Gly Glu Glu Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala

110 115 120

gtg aga gtc acc ggg cgg aga gca gct gtg gag agt ttg aat gac ttg 1385

Val Arg Val Thr Gly Arg Arg Ala Ala Val Glu Ser Leu Asn Asp Leu

125 130 135

cta cag ggt ggg gat gaa cct ttg gac ttg tgt acc cgg aaa cgc ccc 1433

Leu Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro

140 145 150 155

agg cac t 1440

Arg His

<210> SEQ ID NO 56

<211> LENGTH: 157

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 56

Lys Ile Pro Glu Leu Lys Gly Glu Glu Leu Asp Leu Arg Cys Tyr Glu

1 5 10 15

Glu Cys Leu Pro Pro Ser Asp Asp Glu Glu Asp Glu Glu Ala Ile Arg

20 25 30

Ala Ala Ala Ser Glu Gly Val Lys Ala Ala Gly Glu Ser Phe Ser Leu

35 40 45

Asp Cys Pro Thr Leu Pro Gly His Gly Cys Lys Ser Cys Glu Phe His

50 55 60

Arg Met Asn Thr Gly Asp Lys Asn Val Met Cys Ala Leu Cys Tyr Met

65 70 75 80

Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser Asp Val Asp Glu

85 90 95

Thr Pro Thr Ser Glu Cys Ile Ser Ser Pro Pro Glu Ile Gly Glu Glu

100 105 110

Pro Pro Glu Asp Ile Ile His Arg Pro Val Ala Val Arg Val Thr Gly

115 120 125

Arg Arg Ala Ala Val Glu Ser Leu Asn Asp Leu Leu Gln Gly Gly Asp

130 135 140

Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His

145 150 155

<210> SEQ ID NO 57

<211> LENGTH: 36643

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1295

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (1)..(127)

<223> OTHER INFORMATION: ITR

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (1602)..(2153)

<223> OTHER INFORMATION: E1b\19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (3481)..(3906)

<223> OTHER INFORMATION: pIX

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (3971)..(5592)

<223> OTHER INFORMATION: IVa2 complement (3971..5301,5580..5592)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (5074)..(13817)

<223> OTHER INFORMATION: pol complement (5074..8646,13809..13817)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (8448)..(13817)

<223> OTHER INFORMATION: pTP complement (8448..10376,13809..13817)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (10806)..(11984)

<223> OTHER INFORMATION: 52K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (12011)..(13780)

<223> OTHER INFORMATION: pIIIa

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (13857)..(15458)

<223> OTHER INFORMATION: penton

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (15466)..(16044)

<223> OTHER INFORMATION: pVII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (16092)..(17117)

<223> OTHER INFORMATION: V

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17141)..(17371)

<223> OTHER INFORMATION: pX

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17407)..(18180)

<223> OTHER INFORMATION: pVI

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (18290)..(21109)

<223> OTHER INFORMATION: hexon

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (21130)..(21756)

<223> OTHER INFORMATION: protease

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (21838)..(23373)

<223> OTHER INFORMATION: DBP complement (21838..23373)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23399)..(25819)

<223> OTHER INFORMATION: 100K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (26453)..(27133)

<223> OTHER INFORMATION: pVIII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27137)..(27454)

<223> OTHER INFORMATION: E3\12.5K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28028)..(28555)

<223> OTHER INFORMATION: E3\gp19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28589)..(29317)

<223> OTHER INFORMATION: E3\CR1-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29333)..(29953)

<223> OTHER INFORMATION: E3\CR1-gamma

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29976)..(30848)

<223> OTHER INFORMATION: E3\CR1-delta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (30860)..(31132)

<223> OTHER INFORMATION: E3\RID-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (31135)..(31572)

<223> OTHER INFORMATION: E3\RID-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (32227)..(33552)

<223> OTHER INFORMATION: fiber

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33646)..(34793)

<223> OTHER INFORMATION: E4\orf6/7 complement (33646..33897,34620..34793)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33897)..(34793)

<223> OTHER INFORMATION: E4\orf6 complement (33897..34793)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34699)..(35064)

<223> OTHER INFORMATION: E4\orf4 complement (34699..35064)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35076)..(35426)

<223> OTHER INFORMATION: E4\orf3 complement (35076..35426)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35426)..(35812)

<223> OTHER INFORMATION: E4\orf2 complement (35426..35812)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35865)..(36236)

<223> OTHER INFORMATION: E4\orf1 complement (35865..36236)

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (36517)..(36643)

<223> OTHER INFORMATION: ITR complement (36517..36643)

<400> SEQUENCE: 57

catcatcaaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gagccgtttg 60

aatttgggga tggaggaagg tgattggccg agaggagggc gaccgttagg ggcggggcga 120

gtgacgtttt gatgacgtgg tcgcgaggag gagccagttt gcaagttctc gtgggaaaag 180

tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240

aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300

gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360

gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420

tttccgcgta cggtgtcaaa gtccggtgtt tttacgtagg tgtcagctga tcgccagggt 480

atttaaacct gcgctctcca gtcaagaggc cactcttgag tgccagcgag aagagttttc 540

tcctccgcgc cgcgagtcag atctacactt tgaaagatga ggcacctgag agacctgccc 600

gatgagaaaa tcatcatcgc ttccgggaac gagattctgg aactggtggt aaatgccatg 660

atgggcgacg accctccgga gccccccacc ccatttgaga caccttcgct acacgatttg 720

tatgatctgg aggtggatgt gcccgaggac gaccccaacg aggaggcggt aaatgattta 780

tttagcgatg ccgcactgct agctgccgag gaggcttcga gccctagctc agacagcgac 840

tcttcactgc atacccctag acccggcaga ggtgagaaaa agatccccga gcttaaaggg 900

gaagagatgg acttgcgctg ctatgaggaa tgcttgcccc cgagcgatga tgaggacgag 960

caggcgatcc agaacgtagc gagccaggga gtgcaagccg ccagcgagag ctttgcgctg 1020

gactgcccgc ctctgcccgg acacggctgt aagtcttgtg aatttcatcg cttgaatact 1080

ggagataaag ctgtgttatg tgcactttgc tatatgagag cttacaacca ttgtgtttac 1140

agtaagtgtg attaagttga actttagagg gaggcagaga gcggggtgac tgggcgatga 1200

ctggtttatt tatgtatata tgttctttat ataggtcccg tctctgacgc agatgatgag 1260

acccccacta cagagtccat ttcgtcaccc ccagaaattg gcacatctcc acctgagaat 1320

attgttagac cagttcctgt tagagccact gggaggagag cagctgtgga atgtttggat 1380

gacttgctac agggtgggga tgaacctttg gacttgtgta cccggaaacg ccccaggcac 1440

taagtgccac acatgtgtgt ttacttgagg tgatgtcagt atttataggg tgtggagtgc 1500

aataaaaaat gtgttgactt taagtgcgtg gtttatgact caggggtggg gactgtgggt 1560

atataagcag gtgcagacct gtgtggttag ctcagagcgg c atg gag att tgg acg 1616

Met Glu Ile Trp Thr

1 5

gtc ttg gaa gac ttt cac aag act aga cag ctg cta gag aac gcc tcg 1664

Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu Leu Glu Asn Ala Ser

10 15 20

gac gga gtc tct tac ctg tgg aga ttc tgc ttc ggc ggc gac cta gct 1712

Asp Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Asp Leu Ala

25 30 35

aag cta gtc tac agg gcc aaa cag gat tat agg gaa caa ttt gag gtt 1760

Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg Glu Gln Phe Glu Val

40 45 50

att ttg aga gag tgt tct ggt ctt ttt gac gct ctt aac ttg ggc cat 1808

Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala Leu Asn Leu Gly His

55 60 65

cag tct cac ttt aac cag agg att tcg aga gcc ctt gac ttt act act 1856

Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala Leu Asp Phe Thr Thr

70 75 80 85

cct ggc aga acc act gcg gca gta gcc ttt ttt gct ttt ctt ctt gac 1904

Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Leu Leu Asp

90 95 100

aaa tgg agt caa gaa acc cat ttc agc agg gat tac cag ctg gat ttc 1952

Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Gln Leu Asp Phe

105 110 115

tta gca gta gct ttg tgg aga aca tgg aag tgc cag cgc ctg aat gca 2000

Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys Gln Arg Leu Asn Ala

120 125 130

atc tca ggc tac ttg ccg gta cag ccg cta gac act ctg agg atc ctg 2048

Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp Thr Leu Arg Ile Leu

135 140 145

aat ctc cag gag agt ccc agg gca cgc caa cgt cgc cag cag cag cag 2096

Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg Arg Gln Gln Gln Gln

150 155 160 165

cag gag gat caa gaa gag aac ccg aga gcc ggc ctg gac cct ccg gcg 2144

Gln Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly Leu Asp Pro Pro Ala

170 175 180

gag gag gag tagctgacct gtttcctgaa ctgcgccggg tgctgactag 2193

Glu Glu Glu

gtcttcgagt ggtcgggaga gggggattaa gcgggagagg catgatgaga ctaatcacag 2253

aactgaactg actgtgggtc tgatgagccg taagaggcca gaaacagtgt ggtggcatga 2313

ggtgcagtcg actggcacag atgaggtgtc agtcatgcat gagaagtttt ccctagaaca 2373

agtcaagact tgttggttgg agcctgagga tgattgggag gtagccatca ggaattatgc 2433

caagctggct ctcaggccag acaagaagta caagattact aagctgataa atatcagaaa 2493

tgcctgctac atctcaggga atggggctga agtggagatc tgtcttcagg atagagtggc 2553

tttcagatgc tgtatgatga atatgtaccc gggagtggtg ggcatggatg gggtcacctt 2613

tatgaacatg aggttcaggg gagatgggta taatggcacg gtctttatgg ccaatacgaa 2673

gctgacagtc catggctgct ccttctttgg gtttaataac acctgcattg aggcctgggg 2733

tcaggtcggt gtgaggggct gcagtttttc agccaactgg atgggggtcg tgggcaggac 2793

caaaagtgtg ctgtccgtga agaaatgctt gtttgagagg tgccacctgg gggtgatgag 2853

cgagggcgaa gccagaatcc gccactgcgc ctctaccgag acgggctgct tcgtgctgtg 2913

caagggcaat gctaagatca agcataatat gatttgtgga gcctcggacg agcgcggcta 2973

ccagatgctg acctgcgcca gtgggaacag ccatatgctg gccaccgtgc atgtggcttc 3033

ccattcccgc aagccctggc ctgagttcga gcacaatgtc atgaccaggt gcaatatgca 3093

tctggggtcc cgccgaggca tgttcatgcc ctaccagtgc aacctgaatt atgtgaaggt 3153

gctgctggag cccgatgcca tgtccagagt gagcctgacg ggggtgtttg acatgaatgt 3213

ggaggtgtgg aagattctga gatatgatga atccaagacc aggtgccgag cctgcgagtg 3273

cggagggaag catgccaggt tccagcccgt gtgtgtggag gtgacggagg acctgcgacc 3333

cgatcatttg gtgttgtcct gcaccgggac ggagttcggt tccagcgggg aagaatctga 3393

ctagagtgag tagtgttctg gggcggggag gacctgcatg agggccagaa tgactgaaat 3453

ctgtgctttt ctgtgtgttg cagcagc atg agc gga agc ggc tcc ttt gag gga 3507

Met Ser Gly Ser Gly Ser Phe Glu Gly

185 190

ggg gta ttc agc cct tat ctg acg ggg cgt ctc ccc tca tgg gcg gga 3555

Gly Val Phe Ser Pro Tyr Leu Thr Gly Arg Leu Pro Ser Trp Ala Gly

195 200 205

gtg cgt cag aat gtg atg gga tcc acg gtg gac ggc cgg ccc gtg cag 3603

Val Arg Gln Asn Val Met Gly Ser Thr Val Asp Gly Arg Pro Val Gln

210 215 220 225

ccc gcg aac tct tca acc ctg acc tat gca acc ctg agc tct tcg tcg 3651

Pro Ala Asn Ser Ser Thr Leu Thr Tyr Ala Thr Leu Ser Ser Ser Ser

230 235 240

gtg gac gca gct gcc gcc gca gct gct gca tct gcc gcc agc gcc gtg 3699

Val Asp Ala Ala Ala Ala Ala Ala Ala Ala Ser Ala Ala Ser Ala Val

245 250 255

cgc gga atg gcc atg ggc gcc ggc tac tac ggc act ctg gtg gcc aac 3747

Arg Gly Met Ala Met Gly Ala Gly Tyr Tyr Gly Thr Leu Val Ala Asn

260 265 270

tcg agt tcc acc aat aat ccc gcc agc ctg aac gag gag aag ctg ctg 3795

Ser Ser Ser Thr Asn Asn Pro Ala Ser Leu Asn Glu Glu Lys Leu Leu

275 280 285

ctg ctg atg gcc cag ctc gag gcc ttg acc cag cgc ctg ggc gag ctg 3843

Leu Leu Met Ala Gln Leu Glu Ala Leu Thr Gln Arg Leu Gly Glu Leu

290 295 300 305

acc cag cag gtg gct cag ctg cag gag cag acg cgg gcc gcg gtt gcc 3891

Thr Gln Gln Val Ala Gln Leu Gln Glu Gln Thr Arg Ala Ala Val Ala

310 315 320

acg gtg aaa tcc aaa taaaaaatga atcaataaat aaacggagac ggttgttgat 3946

Thr Val Lys Ser Lys

325

tttaacacag agtctgaatc tttatttgat ttttcgcgcg cggtaggccc tggaccaccg 4006

gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt gggcttggat 4066

gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt gcagggcctc 4126

gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg cgtggtgttg 4186

cacaatatct ttgaggagga gactgatggc cacgggcagc cctttggtgt aggtgtttac 4246

aaatctgttg agctgggagg gatgcatgcg gggggagatg aggtgcatct tggcctggat 4306

cttgagattg gcgatgttac cgcccagatc ccgcctgggg ttcatgttgt gcaggaccac 4366

cagcacggtg tatccggtgc acttggggaa tttatcatgc aacttagaag ggaaggcgtg 4426

aaagaatttg gcgacgccct tgtgcccgcc caggttttcc atgcactcat ccatgatgat 4486

ggcgatgggc ccgtgggcgg cggcctgggc aaaaacgttt cgggggtcgg acacatcata 4546

gttgtggtcc tgggtgagat catcataggc cattttaatg aatttggggc ggagggtgcc 4606

tgactggggg acaaaggtac cctcgatccc gggggcgtag ttcccctcgc agatctgcat 4666

ctcccaggct ttgagctcgg agggggggat catgtccacc tgcggggcga taaagaacac 4726

ggtttccggg gcgggggaga tgagctgcgc cgaaagcaag ttccggagca gctgggactt 4786

gccgcagccg gtggggccgt agatgacccc gatgaccggc tgcaggtggt agttgaggga 4846

gagacagctg ccgtcctcgc ggaggagggg ggccacctcg ttcatcatct cgcgcacatg 4906

catgttctcg cgcaccagtt ccgccaggag gcgctctccc cccagggata ggagctcctg 4966

gagcgaggcg aagtttttca gcggcttgag tccgtcggcc atgggcattt tggagagggt 5026

ctgttgcaag agttccaagc ggtcccagag ctcggtgatg tgctctacgg catctcgatc 5086

cagcagacct cctcgtttcg cgggttggga cgactgcggg agtagggcac cagacgatgg 5146

gcgtccagcg cagccagggt ccggtccttc cagggccgca gcgtccgcgt cagggtggtc 5206

tccgtcacgg tgaaggggtg cgcgccgggc tgggcgcttg cgagggtgcg cttcaggctc 5266

atccggctgg tcgaaaaccg ctcccgatcg gcgccctgcg cgtcggccag gtagcaattg 5326

accatgagtt cgtagttgag cgcctcggcc gcgtggcctt tggcgcggag cttacctttg 5386

gaagtctgcc cgcaggcggg acagaggagg gacttgaggg cgtagagctt gggggcgagg 5446

aagacggact cgggggcgta ggcatccgcg ccgcagtggg cgcagacggt ctcgcactcc 5506

acgagccagg tgaggtcggg ctggtcgggg tcaaaaacca gtttcccgcc gttctttttg 5566

atgcgtttct tacctttggt ctccatgagc tcgtgtcccc gctgggtgac aaagaggctg 5626

tccgtgtccc cgtagaccga ctttatgggc cggtcttcga gcggtgtgcc acggtcctcc 5686

tcgtagagga accccgccca ctccgagacg aaagcccggg tccaggccag cacgaaggag 5746

gccacgtggg acgggtagcg gtcgttgtcc accagcgggt ccactttttc cagggtatgc 5806

aaacacatgt ccccctcgtc cacatccagg aaggtgattg gcttgtaagt gtaggccacg 5866

tgaccggggg tcccggccgg gggggtataa aagggggcgg gcccctgctc gtcctcactg 5926

tcttccggat cgctgtccag gagcgccagc tgttggggta ggtattccct ctcgaaggcg 5986

ggcatgacct cggcactcag gttgtcagtt tctagaaacg aggaggattt gatattgacg 6046

gtgccggcgg agatgccttt caagagcccc tcgtccatct ggtcagaaaa gacgatcttt 6106

ttgttgtcga gtttggtggc gaaggagccg tagagggcgt tggaaaggag cttggcgata 6166

gagcgcatgg tctggttctt ttccttgtct gcgcgctcct tggccgcgat gttgagctgc 6226

acgtactcgc gcgccacgca cttccattcg gggaagacgg tggtcagctc gtcgggcacg 6286

attctgacct gccagccccg gttatgcagg gtgataaggt ccacactggt ggccacctcg 6346

ccgcgcaggg gctcattagt ccagcagagg cgcccgccct tgcgcgagca gaaggggggc 6406

agggggtcga gcatgacctc gtcggggggg tcggcatcga tggtgaagat gccgggcagg 6466

agatcggggt caaagtagct gatggaagtg gccagatcgt ccagggcagc ttgccattcg 6526

cgcacggcca gcgcgcgctc gtaggggctg aggggcgtgc cccagggcat gggatgggtg 6586

agcgcggagg cgtacatgcc gcagatgtcg tagacgtaga ggggctcctc gaggatgccg 6646

atgtaggtgg ggtagcagcg ccccccgcgg atgctggcgc gcacgtagtc atacagctcg 6706

tgcgagggcg cgaggagccc cgggcccagg ttggtgcgac tgggcttttc ggcgcggtag 6766

acgatctggc gaaagatggc atgcgagttg gaggagatgg tgggcctttg gaagatgttg 6826

aagtgggcgt gggggagacc gaccgagtcg cggatgaagt gggcgtagga gtcttgcagc 6886

ttggcgacga gctcggcggt gacgaggacg tccagagcgc agtagtcgag ggtctcctgg 6946

atgatgtcat acttgagctg tcccttttgt ttccacagct cgcggttgag aaggaactct 7006

tcgcggtcct tccagtactc ttcgaggggg aacccgtcct gatctgcacg gtaagagcct 7066

agcatgtaga actggttgac ggccttgtag gcgcagcagc ccttctccac ggggagggcg 7126

taggcctggg cggccttgcg cagggaggtg tgcgtgaggg cgaaggtgtc cctgaccatg 7186

actttgagga actggtgctt gaagtcgata tcgtcgcagc ccccctgctc ccagagctgg 7246

aagtccgtgc gcttcttgta ggcggggttg ggcaaagcga aagtaacatc gttgaagagg 7306

atcttgcccg cgcggggcat aaagttgcga gtgatgcgga acggttgggg cacctcggcc 7366

cggttgttga tgacctgggc ggcgagcacg atctcgtcga aaccgttgat gttgtggccc 7426

acgatgtaaa gttccacgaa tcgcgggcgg cccttgacgt ggggcagttt cttgagctcc 7486

tcgtaggtga gctcgtcggg gtcgctgagg ccgtgctgct cgagcgccca gtcggcgaga 7546

tgggggttgg cgcggaggaa ggaagtccag agatccacgg ccagggcggt ctgcaggcgg 7606

tcccggtact gacgaaactg ctgcccgacg gccatttttt cgggggtgac gcagtagaag 7666

gtgcgggggt ccccgtgcca acgatcccat ttgagctgga gggcaagatc gagggcgagc 7726

tcgacgaggc ggtcgtcccc ggagagtttc atgaccagca tgaaggggac gagctgcttg 7786

ccgaaggacc ccatccaggt gtaggtttcc acatcgtagg tgaggaagag cctttcggtg 7846

cgaggatgcg agccgatggg gaagaactgg atctcctgcc accaattgga ggaatggctg 7906

ttgatgtgat ggaagtagaa atgccgacgg cgcgccgaac actcgtgctt gtgtttatac 7966

aagcggccac agtgctcgca acgctgcacg ggatgcacgt gctgcacgag ctgtacctga 8026

gttcctttga cgaggaattt cagtgggaag tggagtcgtg gcgcctgcat ctcgtgctgt 8086

actacgtcgt ggtggtcggc ctggccctct tctgcctcga tggtggtcat gctgacgagc 8146

ccgcgcggga ggcaggtcca gacctcggcg cgagcgggtc ggagagcgag gacgagggcg 8206

cgcaggccgg agctgtccag ggtcctgaga cgctgcggag tcaggtcagt gggcagcggc 8266

ggcgcgcggt tgacttgcag gagtttttcc agggcgcgcg ggaggtccag atggtacttg 8326

atctccaccg cgccgttggt ggcgacgtcg atggcttgca gggtcccgtg cccctggggt 8386

gtgaccaccg tcccccgttt cttcttgggc ggctggggag gcggtgcctc ttccatggtt 8446

agaagcggcg gcgaggacgc gcgccgggcg gcaggggcgg ctctgggccc ggaggcaggg 8506

gcggcagggg cacgtcggcg ccgcgcgcgg gtaggttctg gtactgcgcc cggagaagac 8566

tggcgtgagc gacgacgcga cggttgacgt cctggatctg acgcctctgg gtgaaggcca 8626

cgggacccgt gagtttgaac ctgaaagaga gttcgacaga atcaatctcg gtatcgttga 8686

cggcggcctg ccgcaggatc tcttgcacgt cgcccgagtt gtcctggtag gcgatctcgg 8746

tcatgaactg ctcgatctcc tcctcctgaa ggtctccgcg gccggcgcgc tccacggtgg 8806

ccgcgaggtc gttggagatg cggcccatga gctgcgagaa ggcgttcatg cccgcctcgt 8866

tccagacgcg gctgtagacc acgacgccct cgggatcgcg ggcgcgcatg accacctggg 8926

cgaggttgag ctccacgtgg cgcgtgaaga ccgcgtagtt gcagaggcgc tggtaaaggt 8986

agttgagcgt ggtggcgatg tgctcggtga cgaagaaata catgatccag cggcggagcg 9046

gcatctcgct gacgtcgccc agcgcctcca aacgttccat ggcctcgtaa aagtccacgg 9106

cgaagttgaa aaactgggag ttgcgcgccg agacggtcaa ctcctcctcc agaagacgga 9166

tgagctcggc gatggtggcg cgcacctcgc gctcgaaggc ccccgggagt tcctccactt 9226

cctcttcttc ctcctccact aacatctctt ctacttcctc ctcaggcggt ggtggcgggg 9286

gagggggcct gcgtcgccgg cggcgcacgg gcagacggtc gatgaaacgc tcgatggtct 9346

cgccgcgccg gcgtcgcatg gtctcggtga cggcgcgccc gtcctcgcgg ggccgcagcg 9406

tgaagacgcc gccgcgcatc tccaggtggc cgggggggtc cccgttgggc agggagaggg 9466

cgctgacgat gcatcttatc aattgccccg tagggactcc gcgcaaggac ctgagcgtct 9526

cgagatccac gggatctgaa aaccgttgaa cgaaggcttc gagccagtcg cagtcgcaag 9586

gtaggctgag cacggtttct tctggcgggt catgttggtt ggagggagcg gggcgggcga 9646

tgctgctggt gatgaagttg aaataggcgg ttctgagacg gcggatggtg gcgaggagca 9706

ccaggtcttt gggcccggct tgctggatgc gcagacggtc ggccatgccc caggcgtggt 9766

cctgacacct ggccaggtcc ttgtagtagt cctgcatgag ccgctccacg ggcacctcct 9826

cctcgcctgc gcggccgtgc atgcgcgtga gcccgaagcc gcgctggggc tggacgagcg 9886

ccaggtcggc gacgacgcgc tcggcgagga tggcctgctg gatctgggtg agggtggtct 9946

ggaagtcgtc aaagtcgacg aagcggtggt aggctccggt gttgatggtg taggagcagt 10006

tggccatgac ggaccagttg acggtctggt ggcccggacg cacgagctcg tggtacttga 10066

ggcgcgagta ggcgcgcgtg tcgaagatgt agtcgttgca ggtgcgcacc aggtactggt 10126

agccgatgag gaagtgcggc ggcggctggc ggtagagcgg ccatcgctcg gtggcggggg 10186

cgccgggcgc gaggtcctcg agcatggtgc ggtggtagcc gtagatgtac ctggacatcc 10246

aggtgatgcc ggcggcggtg gtggaggcgc gcgggaactc gcggacgcgg ttccagatgt 10306

tgcgcagcgg caggaagtag ttcatggtgg gcacggtctg gcccgtgagg cgcgcgcagt 10366

cgtggatgct ctatacgggc aaaaacgaaa gcggtcagcg gctcgactcc gtggcctgga 10426

ggctaagcga acgggttggg ctgcgcgtgt accccggttc gaatctcgaa tcaggctgga 10486

gccgcagcta acgtggtact ggcactcccg tctcgaccca agcctgcacc aaccctccag 10546

gatacggagg cgggtcgttt tgcaactttt ttcggaggcc ggatgaaact agtaagcgcg 10606

gaaagcggcc gaccgcgatg gctcgctgcc gtagtctgga gaagaatcgc cagggttgcg 10666

ttgcggtgtg ccccggttcg aggccggccg gattccgcgg ctaacgaggg cgtggctgcc 10726

ccgtcgtttc caagacccct agccagccga cttctccagt tacggagcga gcccctcttt 10786

tgttttttgt ttttgccag atg cat ccc gta ctg cgg cag atg cgc ccc cac 10838

Met His Pro Val Leu Arg Gln Met Arg Pro His

330 335

cac cct cca ccg caa caa cag ccc cct cct cca cag ccg gcg ctt ctg 10886

His Pro Pro Pro Gln Gln Gln Pro Pro Pro Pro Gln Pro Ala Leu Leu

340 345 350

ccc ccg ccc cag cag cag cag caa ctt cca gcc acg acc gcc gcg gcc 10934

Pro Pro Pro Gln Gln Gln Gln Gln Leu Pro Ala Thr Thr Ala Ala Ala

355 360 365

gcc gtg agc ggg gct gga cag agt tat gac cac cag ctg gcc ttg gaa 10982

Ala Val Ser Gly Ala Gly Gln Ser Tyr Asp His Gln Leu Ala Leu Glu

370 375 380 385

gag ggc gag ggg ctg gcg cgc ctg ggg gcg tcg tcg ccg gag cgg cac 11030

Glu Gly Glu Gly Leu Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His

390 395 400

ccg cgc gtg cag atg aaa agg gac gct cgt gag gcc tac gtg ccc aag 11078

Pro Arg Val Gln Met Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys

405 410 415

cag aac ctg ttc aga gac agg agc ggc gag gag ccc gag gag atg cgc 11126

Gln Asn Leu Phe Arg Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg

420 425 430

gcg gcc cgg ttc cac gcg ggg cgg gag ctg cgg cgc ggc ctg gac cga 11174

Ala Ala Arg Phe His Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg

435 440 445

aag agg gtg ctg agg gac gag gat ttc gag gcg gac gag ctg acg ggg 11222

Lys Arg Val Leu Arg Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly

450 455 460 465

atc agc ccc gcg cgc gcg cac gtg gcc gcg gcc aac ctg gtc acg gcg 11270

Ile Ser Pro Ala Arg Ala His Val Ala Ala Ala Asn Leu Val Thr Ala

470 475 480

tac gag cag acc gtg aag gag gag agc aac ttc caa aaa tcc ttc aac 11318

Tyr Glu Gln Thr Val Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn

485 490 495

aac cac gtg cgc acc ctg atc gcg cgc gag gag gtg acc ctg ggc ctg 11366

Asn His Val Arg Thr Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu

500 505 510

atg cac ctg tgg gac ctg ctg gag gcc atc gtg cag aac ccc acc agc 11414

Met His Leu Trp Asp Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser

515 520 525

aag ccg ctg acg gcg cag ctg ttc ctg gtg gtg cag cac agt cgg gac 11462

Lys Pro Leu Thr Ala Gln Leu Phe Leu Val Val Gln His Ser Arg Asp

530 535 540 545

aac gag gcg ttc agg gag gcg ctg ctg aat atc acc gag ccc gag ggc 11510

Asn Glu Ala Phe Arg Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly

550 555 560

cgc tgg ctc ctg gac ctg gtg aac att ctg cag agc atc gtg gtg cag 11558

Arg Trp Leu Leu Asp Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln

565 570 575

gag cgc ggg ctg ccg ctg tcc gag aag ctg gcg gcc atc aac ttc tcg 11606

Glu Arg Gly Leu Pro Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser

580 585 590

gtg ctg agt ctg ggc aag tac tac gct agg aag atc tac aag acc ccg 11654

Val Leu Ser Leu Gly Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro

595 600 605

tac gtg ccc ata gac aag gag gtg aag atc gac ggg ttt tac atg cgc 11702

Tyr Val Pro Ile Asp Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg

610 615 620 625

atg acc ctg aaa gtg ctg acc ctg agc gac gat ctg ggg gtg tac cgc 11750

Met Thr Leu Lys Val Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg

630 635 640

aac gac agg atg cac cgc gcg gtg agc gcc agc agg cgg cgc gag ctg 11798

Asn Asp Arg Met His Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu

645 650 655

agc gac cag gag ctg atg cac agc ctg cag cgg gcc ctg acc ggg gcc 11846

Ser Asp Gln Glu Leu Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala

660 665 670

ggg acc gag ggg gag agc tac ttt gac atg ggc gcg gac ctg cac tgg 11894

Gly Thr Glu Gly Glu Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp

675 680 685

cag ccc agc cgc cgg gcc ttg gag gcg gca ggc ggt ccc ccc tac gta 11942

Gln Pro Ser Arg Arg Ala Leu Glu Ala Ala Gly Gly Pro Pro Tyr Val

690 695 700 705

gaa gag gtg gac gag gac gag gag ggc gag tac ctg gaa gac 11984

Glu Glu Val Asp Glu Asp Glu Glu Gly Glu Tyr Leu Glu Asp

710 715

tgatggcgcg accgtatttt tgctag atg caa caa cag cca cct cct gat ccc 12037

Met Gln Gln Gln Pro Pro Pro Asp Pro

720 725

gcg atg cgg gcg gcg ctg cag agc cag ccg tcc ggc att aac tcc tcg 12085

Ala Met Arg Ala Ala Leu Gln Ser Gln Pro Ser Gly Ile Asn Ser Ser

730 735 740

gac gat tgg acc cag gcc atg caa cgc atc atg gcg ctg acg acc cgc 12133

Asp Asp Trp Thr Gln Ala Met Gln Arg Ile Met Ala Leu Thr Thr Arg

745 750 755 760

aac ccc gaa gcc ttt aga cag cag ccc cag gcc aac cgg ctc tcg gcc 12181

Asn Pro Glu Ala Phe Arg Gln Gln Pro Gln Ala Asn Arg Leu Ser Ala

765 770 775

atc ctg gag gcc gtg gtg ccc tcg cgc tac aac ccc acg cac gag aag 12229

Ile Leu Glu Ala Val Val Pro Ser Arg Tyr Asn Pro Thr His Glu Lys

780 785 790

gtc ctg gcc atc gtg aac gcg ctg gtg gag aac aag gcc atc cgc ggc 12277

Val Leu Ala Ile Val Asn Ala Leu Val Glu Asn Lys Ala Ile Arg Gly

795 800 805

gac gag gcc ggc ctg gtg tac aac gcg ctg ctg gag cgc gtg gcc cgc 12325

Asp Glu Ala Gly Leu Val Tyr Asn Ala Leu Leu Glu Arg Val Ala Arg

810 815 820

tac aac agc acc aac gtg cag acc aac ctg gac cgc atg gtg acc gat 12373

Tyr Asn Ser Thr Asn Val Gln Thr Asn Leu Asp Arg Met Val Thr Asp

825 830 835 840

gtg cgc gag gcc gtg gcc cag cgc gag cgg ttc cac cgc gag tcc aac 12421

Val Arg Glu Ala Val Ala Gln Arg Glu Arg Phe His Arg Glu Ser Asn

845 850 855

ctg gga tcc atg gtg gcg ctg aac gcc ttc ctc agc acc cag ccc gcc 12469

Leu Gly Ser Met Val Ala Leu Asn Ala Phe Leu Ser Thr Gln Pro Ala

860 865 870

aac gtg ccc cgg ggc cag gag gac tac acc aac ttc atc agc gct ctg 12517

Asn Val Pro Arg Gly Gln Glu Asp Tyr Thr Asn Phe Ile Ser Ala Leu

875 880 885

cgc ctg atg gtg acc gag gtg ccc cag agc gag gtg tac cag tcc ggg 12565

Arg Leu Met Val Thr Glu Val Pro Gln Ser Glu Val Tyr Gln Ser Gly

890 895 900

ccg gac tac ttc ttc cag acc agt cgc cag ggc ttg cag acc gtg aac 12613

Pro Asp Tyr Phe Phe Gln Thr Ser Arg Gln Gly Leu Gln Thr Val Asn

905 910 915 920

ctg agc cag gct ttc aag aac ttg cag gga ttg tgg ggc gtg cag gcc 12661

Leu Ser Gln Ala Phe Lys Asn Leu Gln Gly Leu Trp Gly Val Gln Ala

925 930 935

ccg gtc ggg gac cgc gcg acg gtg tcg agc ctg ctg acg ccg aac tcg 12709

Pro Val Gly Asp Arg Ala Thr Val Ser Ser Leu Leu Thr Pro Asn Ser

940 945 950

cgc ctg ctg ctg ctg ctg gtg gcc ccc ttc acg gac agc ggc agc atc 12757

Arg Leu Leu Leu Leu Leu Val Ala Pro Phe Thr Asp Ser Gly Ser Ile

955 960 965

aac cgc aac tcg tac ctg ggc tac ctg att aac ttg tac cgc gag gcc 12805

Asn Arg Asn Ser Tyr Leu Gly Tyr Leu Ile Asn Leu Tyr Arg Glu Ala

970 975 980

atc ggc cag gcg cac gtg gac gag cag acc tac cag gag atc acc cac 12853

Ile Gly Gln Ala His Val Asp Glu Gln Thr Tyr Gln Glu Ile Thr His

985 990 995 1000

gtg agc cgc gcc ctg ggc cag gac gac ccg ggc aat ctg gaa gcc 12898

Val Ser Arg Ala Leu Gly Gln Asp Asp Pro Gly Asn Leu Glu Ala

1005 1010 1015

acc ctg aac ttt ttg ctg acc aac cgg tcg cag aag atc ccg ccc 12943

Thr Leu Asn Phe Leu Leu Thr Asn Arg Ser Gln Lys Ile Pro Pro

1020 1025 1030

cag tac gcg ctc agc gcc gag gag gag cgc att ctg cga tac gtg 12988

Gln Tyr Ala Leu Ser Ala Glu Glu Glu Arg Ile Leu Arg Tyr Val

1035 1040 1045

cag cag agc gtg ggc ctg ttc ctg atg cag gag ggg gcc acc ccc 13033

Gln Gln Ser Val Gly Leu Phe Leu Met Gln Glu Gly Ala Thr Pro

1050 1055 1060

agc gcc gcg ctc gac atg acc gcg cgc aac atg gag ccc agc atg 13078

Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met Glu Pro Ser Met

1065 1070 1075

tac gcc agc aac cgc ccg ttc atc aat aag ctg atg gac tac ttg 13123

Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met Asp Tyr Leu

1080 1085 1090

cat cgg gcg gcc gcc atg aac tcg gac tat ttc acc aac gcc atc 13168

His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn Ala Ile

1095 1100 1105

ctg aat ccc cac tgg ctc ccg ccg ccg ggg ttc tac acg ggc gag 13213

Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly Glu

1110 1115 1120

tac gac atg ccc gac ccc aat gac ggg ttc ctg tgg gac gat gtg 13258

Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val

1125 1130 1135

gac agc agc gtg ttc tcc ccc cga ccg ggt gct aac gag cgc ccc 13303

Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro

1140 1145 1150

ttg tgg aag aag gaa ggc agc gac cgt cgc ccg tcc tcg gcg ctg 13348

Leu Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu

1155 1160 1165

tcc ggc cgg gag ggt gct gcc gcg gcg gtg ccc gag gcc gcc agt 13393

Ser Gly Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser

1170 1175 1180

cct ttc ccg agc ttg ccc ttc tcg ctg aac agt att cgc agc agc 13438

Pro Phe Pro Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser

1185 1190 1195

gag ctg ggc agg atc acg cgt ccg cgc ttg ctg ggc gag gag gag 13483

Glu Leu Gly Arg Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu

1200 1205 1210

tac ttg aat gac tcg ctg ttg aga ccc gag cgg gag aaa aac ttc 13528

Tyr Leu Asn Asp Ser Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe

1215 1220 1225

ccc aac aac ggg ata gag agc ctg gtg gac aag atg agc cgc tgg 13573

Pro Asn Asn Gly Ile Glu Ser Leu Val Asp Lys Met Ser Arg Trp

1230 1235 1240

aag acg tat gcg cag gag cac agg gac gat ccg tcg cag ggg gcc 13618

Lys Thr Tyr Ala Gln Glu His Arg Asp Asp Pro Ser Gln Gly Ala

1245 1250 1255

acg agc cgg ggc agc gcg gcc cgt aaa cgc cgg tgg cac gac agg 13663

Thr Ser Arg Gly Ser Ala Ala Arg Lys Arg Arg Trp His Asp Arg

1260 1265 1270

cag cgg gga ctg atg tgg gac gat gag gat tcc gcc gac gac agc 13708

Gln Arg Gly Leu Met Trp Asp Asp Glu Asp Ser Ala Asp Asp Ser

1275 1280 1285

agc gtg ttg gac ttg ggt ggg agt ggt ggt aac ccg ttc gct cac 13753

Ser Val Leu Asp Leu Gly Gly Ser Gly Gly Asn Pro Phe Ala His

1290 1295 1300

ctg cgc ccc cgc atc ggg cgc atg atg taagaaaccg aaaataaatg 13800

Leu Arg Pro Arg Ile Gly Arg Met Met

1305

atactcacca aggccatggc gaccagcgtg cgttcgtttc ttctctgttg tttagt 13856

atg atg agg cgt gcg tac ccg gag ggt cct cct ccc tcg tac gag 13901

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu

1310 1315 1320

agc gtg atg cag cag gcg atg gcg gcg gcg gcg atg cag ccc ccg 13946

Ser Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro

1325 1330 1335

ctg gag gct cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag 13991

Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu

1340 1345 1350

ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat 14036

Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp

1355 1360 1365

acc acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc 14081

Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala

1370 1375 1380

tcg ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg 14126

Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val

1385 1390 1395

gtg cag aac aat gac ttc acc ccc acg gag gcc agc acc cag acc 14171

Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr

1400 1405 1410

atc aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc 14216

Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr

1415 1420 1425

atc atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc 14261

Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser

1430 1435 1440

aac aag ttc aag gcg cgg gtc atg gtc tcc cgc aag acc ccc aac 14306

Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn

1445 1450 1455

ggg gtc aca gta ggg gat gac tat gat ggt agt cag gat atc ttg 14351

Gly Val Thr Val Gly Asp Asp Tyr Asp Gly Ser Gln Asp Ile Leu

1460 1465 1470

gag tat gaa tgg gtg gag ttt gag ctg ccc gaa ggc aac ttc tcg 14396

Glu Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser

1475 1480 1485

gtg acc atg acc atc gac ctg atg aac aac gcc atc atc gac aat 14441

Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn

1490 1495 1500

tac ttg gcg gtg ggg cgg cag aac ggg gtg ctc gag agc gac atc 14486

Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile

1505 1510 1515

ggc gtg aag ttc gac acc agg aac ttc agg ctg ggc tgg gac ccc 14531

Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro

1520 1525 1530

gtg acc gag ctg gtc atg cca ggg gtg tac acc aac gag gcc ttc 14576

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe

1535 1540 1545

cat ccc gat att gtc ttg ctg ccc ggc tgc ggg gtg gac ttc acc 14621

His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr

1550 1555 1560

gag agc cgc ctc agc aac ctg ctg ggc att cgc aag agg cag ccc 14666

Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro

1565 1570 1575

ttc cag gag ggt ttc cag atc atg tac gag gat ctg gag ggg ggc 14711

Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly

1580 1585 1590

aac atc ccc gcg ctc ctg gat gtc gac gcc tat gag aaa agc aag 14756

Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys

1595 1600 1605

gag gaa gca gca gct gaa gca act gca gcg gta gcc acc gcc tct 14801

Glu Glu Ala Ala Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser

1610 1615 1620

acc gag gtc agg ggc gat aat ttt gca agc gct gcg gca gtg gca 14846

Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala

1625 1630 1635

gcg gcc gag gcg gct gaa acc gaa agt aag ata gtc att cag ccg 14891

Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro

1640 1645 1650

gtg gag aag gat agc aag gac agg agc tac aac gtg ctg ccg gac 14936

Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val Leu Pro Asp

1655 1660 1665

aag ata aac acc gcc tac cgc agc tgg tac ctg gcc tac aac tat 14981

Lys Ile Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr

1670 1675 1680

ggc gac ccc gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc 15026

Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr

1685 1690 1695

tcg gac gtc acc tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc 15071

Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro

1700 1705 1710

gac atg atg caa gac ccg gtc acc ttc cgc tcc acg cgt caa gtt 15116

Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val

1715 1720 1725

agc aac tac ccg gtg gtg ggc gcc gag ctc ctg ccc gtc tac tcc 15161

Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser

1730 1735 1740

aag agc ttc ttc aac gag cag gcc gtc tac tcg cag cag ctg cgc 15206

Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg

1745 1750 1755

gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc gag aac 15251

Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn

1760 1765 1770

cag atc ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15296

Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser

1775 1780 1785

gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc 15341

Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg

1790 1795 1800

agc agt atc cgg gga gtc cag cgc gtg acc gtt act gac gcc aga 15386

Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg

1805 1810 1815

cgc cgc acc tgc ccc tac gtc tac aag gcc ctg ggc ata gtc gcg 15431

Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala

1820 1825 1830

ccg cgc gtc ctt tcg agc cgc acc ttc taaaaaa atg tcc att ctc 15477

Pro Arg Val Leu Ser Ser Arg Thr Phe Met Ser Ile Leu

1835 1840 1845

atc tcg ccc agt aat aac acc ggt tgg ggc ctg cgc gcg ccc agc 15522

Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg Ala Pro Ser

1850 1855 1860

aag atg tac gga ggc gct cgc caa cgc tcc acg caa cac ccc gtg 15567

Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His Pro Val

1865 1870 1875

cgc gtg cgc ggg cac ttc cgc gct ccc tgg ggc gcc ctc aag ggc 15612

Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys Gly

1880 1885 1890

cgc gtg cgg tcg cgc acc acc gtc gac gac gtg atc gac cag gtg 15657

Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val

1895 1900 1905

gtg gcc gac gcg cgc aac tac acc ccc gcc gcc gcg ccc gtc tcc 15702

Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser

1910 1915 1920

acc gtg gac gcc gtc atc gac agc gtg gtg gcc gac gcg cgc cgg 15747

Thr Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg

1925 1930 1935

tac gcc cgc gcc aag agc cgg cgg cgg cgc atc gcc cgg cgg cac 15792

Tyr Ala Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His

1940 1945 1950

cgg agc acc ccc gcc atg cgc gcg gcg cga gcc ttg ctg cgc agg 15837

Arg Ser Thr Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg

1955 1960 1965

gcc agg cgc acg gga cgc agg gcc atg ctc agg gcg gcc aga cgc 15882

Ala Arg Arg Thr Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg

1970 1975 1980

gcg gcc tca ggc gcc agc gcc ggc agg acc cgg aga cgc gcg gcc 15927

Ala Ala Ser Gly Ala Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala

1985 1990 1995

acg gcg gcg gca gcg gcc atc gcc agc atg tcc cgc ccg cgg cga 15972

Thr Ala Ala Ala Ala Ala Ile Ala Ser Met Ser Arg Pro Arg Arg

2000 2005 2010

ggg aac gtg tac tgg gtg cgc gac gcc gcc acc ggt gtg cgc gtg 16017

Gly Asn Val Tyr Trp Val Arg Asp Ala Ala Thr Gly Val Arg Val

2015 2020 2025

ccc gtg cgc acc cgc ccc cct cgc act tgaagatgtt cacttcgcga 16064

Pro Val Arg Thr Arg Pro Pro Arg Thr

2030 2035

tgttgatgtg tcccagcggc gaggagg atg tcc aag cgc aaa ttc aag gaa 16115

Met Ser Lys Arg Lys Phe Lys Glu

2040

gag atg ctc cag gtc atc gcg cct gag atc tac ggc ccc gcg gcg 16160

Glu Met Leu Gln Val Ile Ala Pro Glu Ile Tyr Gly Pro Ala Ala

2045 2050 2055

gcg gtg aag gag gaa aga aag ccc cgc aaa atc aag cgg gtc aaa 16205

Ala Val Lys Glu Glu Arg Lys Pro Arg Lys Ile Lys Arg Val Lys

2060 2065 2070

aag gac aaa aag gaa gaa gat gat gtg gac gga ttg gtg gag ttt 16250

Lys Asp Lys Lys Glu Glu Asp Asp Val Asp Gly Leu Val Glu Phe

2075 2080 2085

gtg cgc gag ttc gcc ccc agg cgg cgc gtg cag tgg cgc ggg cgg 16295

Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln Trp Arg Gly Arg

2090 2095 2100

aag gtg caa ccg gtg ctg aga ccc ggc acc acc gtg gtt ttc acg 16340

Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val Val Phe Thr

2105 2110 2115

cca ggc gag cgc tcc ggc acc gcc tcc aag cgc tcc tac gac gag 16385

Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr Asp Glu

2120 2125 2130

gtg tac ggg gat gat gat att ctg gaa cag gcg gcc gag cgc ctg 16430

Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg Leu

2135 2140 2145

ggc gag ttt gct tac ggc aag cgc agc cgc tcc gcg ccg aag gaa 16475

Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu

2150 2155 2160

gag gcg gtg tcc atc ccg ctg gac cac ggc aac ccc acg ccg agc 16520

Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser

2165 2170 2175

ctc aag ccc gtg acc ctg cag cag gtg ctg ccg acc gcg gcg ccg 16565

Leu Lys Pro Val Thr Leu Gln Gln Val Leu Pro Thr Ala Ala Pro

2180 2185 2190

cga agg ggg ttc aag cgc gag ggc gag gat ctg tac ccc acc atg 16610

Arg Arg Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met

2195 2200 2205

cag ctg atg gtg ccc aag cgc cag aag ctg gaa gac gtg ctg gag 16655

Gln Leu Met Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu

2210 2215 2220

acc atg aag gtg gac ccg gac gtg cag ccc gag gtc aag gtg cgg 16700

Thr Met Lys Val Asp Pro Asp Val Gln Pro Glu Val Lys Val Arg

2225 2230 2235

ccc atc aag cag gtg gcc ccg ggc ctg ggc gtg cag acc gtg gac 16745

Pro Ile Lys Gln Val Ala Pro Gly Leu Gly Val Gln Thr Val Asp

2240 2245 2250

atc aag atc ccc acg gag ccc atg gaa acg cag acc gag ccc gtg 16790

Ile Lys Ile Pro Thr Glu Pro Met Glu Thr Gln Thr Glu Pro Val

2255 2260 2265

aag ccc agc acc agc acc atg gag gtg cag acg gat ccc tgg atg 16835

Lys Pro Ser Thr Ser Thr Met Glu Val Gln Thr Asp Pro Trp Met

2270 2275 2280

cca tcg gct cct agc cga aga ccc cgg cgc aag tac ggc gcg gcc 16880

Pro Ser Ala Pro Ser Arg Arg Pro Arg Arg Lys Tyr Gly Ala Ala

2285 2290 2295

agc ctg ctg atg ccc aac tac gcg ctg cat cct tcc atc atc ccc 16925

Ser Leu Leu Met Pro Asn Tyr Ala Leu His Pro Ser Ile Ile Pro

2300 2305 2310

acg cct ggc tac cgc ggc acg cgc ttc tac cgc ggt cat aca acc 16970

Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr Arg Gly His Thr Thr

2315 2320 2325

agc cgc cgc cgc aag acc acc act cgc cgc cgc cgt cgt cgc acc 17015

Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg Arg Arg Arg Thr

2330 2335 2340

gcc gct gca act acc cct gcc gcc ttg gtg cgg aga gtg tac cgc 17060

Ala Ala Ala Thr Thr Pro Ala Ala Leu Val Arg Arg Val Tyr Arg

2345 2350 2355

cgc ggc cgc gcg cct ctg acc ctg ccg cgc gcg cgc tac cac ccg 17105

Arg Gly Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr His Pro

2360 2365 2370

agc atc gcc att taaaactttc gcctgctttg cag atg gcc ctc aca tgc 17155

Ser Ile Ala Ile Met Ala Leu Thr Cys

2375 2380

cgc ctc cgc gtc ccc att acg ggc tac cga gga aga aaa ccg cgc 17200

Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly Arg Lys Pro Arg

2385 2390 2395

cgt aga agg ctg gcg ggg aac ggg atg cgt cgc cac cac cac cgg 17245

Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His His His Arg

2400 2405 2410

cgg cgg cgc gcc atc agc aag cgg ttg ggg gga ggc ttc ctg ccc 17290

Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe Leu Pro

2415 2420 2425

gcg ctg atc ccc atc atc gcc gcg gcg atc ggg gcg atc ccc ggc 17335

Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro Gly

2430 2435 2440

att gct tcc gtg gcg gtg cag gcc tct cag cgc cac tgagacacac 17381

Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

2445 2450 2455

ttggaaaaca tcttgtaata aacca atg gac tct gac gct cct ggt cct gtg 17433

Met Asp Ser Asp Ala Pro Gly Pro Val

2460

atg tgt ttt cgt aga cag atg gaa gac atc aat ttt tcg tcc ctg 17478

Met Cys Phe Arg Arg Gln Met Glu Asp Ile Asn Phe Ser Ser Leu

2465 2470 2475

gct ccg cga cac ggc acg cgg ccg ttc atg ggc acc tgg agc gac 17523

Ala Pro Arg His Gly Thr Arg Pro Phe Met Gly Thr Trp Ser Asp

2480 2485 2490

atc ggc acc agc caa ctg aac ggg ggc gcc ttc aat tgg agc agt 17568

Ile Gly Thr Ser Gln Leu Asn Gly Gly Ala Phe Asn Trp Ser Ser

2495 2500 2505

ctc tgg agc ggg ctt aag aat ttc ggg tcc acg ctt aaa acc tat 17613

Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser Thr Leu Lys Thr Tyr

2510 2515 2520

ggc agc aag gcg tgg aac agc acc aca ggg cag gcg ctg agg gaa 17658

Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln Ala Leu Arg Glu

2525 2530 2535

aag ctg aaa gag cag aac ttc cag cag aag gtg gtc gat ggg ctc 17703

Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val Asp Gly Leu

2540 2545 2550

gcc tcg ggc atc aac ggg gtg gtg gac ctg gcc aac cag gcc gtg 17748

Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln Ala Val

2555 2560 2565

cag cgg cag atc aac agc cgc ctg gac ccg gtg ccg ccc gcc ggc 17793

Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala Gly

2570 2575 2580

tcc gtg gag atg ccg cag gtg gag gag gag ctg cct ccc ctg gac 17838

Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp

2585 2590 2595

aag cgg ggc gag aag cga ccc cgc ccc gac gcg gag gag acg ctg 17883

Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu

2600 2605 2610

ctg acg cac acg gac gag ccg ccc ccg tac gag gag gcg gtg aaa 17928

Leu Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys

2615 2620 2625

ctg ggt ctg ccc acc acg cgg ccc atc gcg ccc ctg gcc acc ggg 17973

Leu Gly Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly

2630 2635 2640

gtg ctg aaa ccc gaa agt agt aag ccc gcg acc ctg gac ttg cct 18018

Val Leu Lys Pro Glu Ser Ser Lys Pro Ala Thr Leu Asp Leu Pro

2645 2650 2655

cct ccc gct tcc cgc ccc tcc aca gtg gct aag ccc ctg ccg ccg 18063

Pro Pro Ala Ser Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro

2660 2665 2670

gtg gcc gtg gcc cgc gcg cga ccc ggg ggc acc gcc cgc cct cat 18108

Val Ala Val Ala Arg Ala Arg Pro Gly Gly Thr Ala Arg Pro His

2675 2680 2685

gcg aac tgg cag agc act ctg aac agc atc gtg ggt ctg gga gtg 18153

Ala Asn Trp Gln Ser Thr Leu Asn Ser Ile Val Gly Leu Gly Val

2690 2695 2700

cag agt gtg aag cgc cgc cgc tgc tat taaacctacc gtagcgctta 18200

Gln Ser Val Lys Arg Arg Arg Cys Tyr

2705 2710

acttgcttgt ctgtgtgtgt atgtattatg tcgccgccgc cgctgtccgc cagaaggagg 18260

agtgaagagg cgcgtcgccg agttgcaag atg gcc acc cca tcg atg ctg ccc 18313

Met Ala Thr Pro Ser Met Leu Pro

2715 2720

cag tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac 18358

Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr

2725 2730 2735

ctg agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac 18403

Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr

2740 2745 2750

ttc agt ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg 18448

Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr

2755 2760 2765

cac gat gtg acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc 18493

His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe

2770 2775 2780

gtg ccc gtg gac cgc gag gac aac acc tac tcg tac aaa gtg cgc 18538

Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg

2785 2790 2795

tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc agc 18583

Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser

2800 2805 2810

acc tac ttt gac atc cgc ggc gtg ctg gac cgg ggc cct agc ttc 18628

Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe

2815 2820 2825

aaa ccc tac tcc ggc act gcc tac aac agc ctg gcc ccc aag gga 18673

Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly

2830 2835 2840

gca ccc aac act tgc cag tgg aag gac tct gac aac aaa atg cat 18718

Ala Pro Asn Thr Cys Gln Trp Lys Asp Ser Asp Asn Lys Met His

2845 2850 2855

acc ttt ggg gta gct gcc atg ccc ggt gtt act ggg aaa aag ata 18763

Thr Phe Gly Val Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile

2860 2865 2870

gaa gct gat ggg ctg cct att gga ata gat tca act tct gga act 18808

Glu Ala Asp Gly Leu Pro Ile Gly Ile Asp Ser Thr Ser Gly Thr

2875 2880 2885

gac acc ata att tat gct gat aaa act ttc caa ccc gag cca cag 18853

Asp Thr Ile Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln

2890 2895 2900

gtt gga aat gac agt tgg gtt gac acc aat ggt gca gag gaa aaa 18898

Val Gly Asn Asp Ser Trp Val Asp Thr Asn Gly Ala Glu Glu Lys

2905 2910 2915

tat gga ggc aga gct ctc aag gac act aca aat atg aaa ccc tgc 18943

Tyr Gly Gly Arg Ala Leu Lys Asp Thr Thr Asn Met Lys Pro Cys

2920 2925 2930

tat ggt tct ttc gcc aag ccc acc aac aaa gaa ggt ggt cag gct 18988

Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala

2935 2940 2945

aac tta aaa gat tcc gaa act gcc acc acc act cct aac tat gat 19033

Asn Leu Lys Asp Ser Glu Thr Ala Thr Thr Thr Pro Asn Tyr Asp

2950 2955 2960

ata gat ctg gct ttc ttt gac agc aaa aat atc gcc gct aac tac 19078

Ile Asp Leu Ala Phe Phe Asp Ser Lys Asn Ile Ala Ala Asn Tyr

2965 2970 2975

gat cca gat att gta atg tat aca gaa aat gtt gac ctg cag act 19123

Asp Pro Asp Ile Val Met Tyr Thr Glu Asn Val Asp Leu Gln Thr

2980 2985 2990

cca gat act cat att gta tac aaa cct gga aca gag gac acc agc 19168

Pro Asp Thr His Ile Val Tyr Lys Pro Gly Thr Glu Asp Thr Ser

2995 3000 3005

tct gaa tcc aat ttg ggt cag cag gcc atg ccc aac aga ccc aac 19213

Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn

3010 3015 3020

tat att ggt ttc agg gac aac ttt atc ggt ctc atg tac tac aac 19258

Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn

3025 3030 3035

agc act ggc aat atg ggg gtg ctg gcc ggt cag gcc tct cag ctg 19303

Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu

3040 3045 3050

aat gct gtg gtt gac ttg caa gac aga aac acc gag ctg tcc tac 19348

Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr

3055 3060 3065

cag ctc ttg ctt gac tct ctg ggt gac aga acc agg tat ttc agt 19393

Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser

3070 3075 3080

atg tgg aat cag gcg gtg gac agc tat gat cct gat gtg cgc att 19438

Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile

3085 3090 3095

att gaa aac cat ggt gtg gaa gat gaa ttg cca aac tat tgc ttt 19483

Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe

3100 3105 3110

cca ttg aat ggt gtg gga ttt aca gat act tac cag ggt gtt aaa 19528

Pro Leu Asn Gly Val Gly Phe Thr Asp Thr Tyr Gln Gly Val Lys

3115 3120 3125

gtt aaa aca gat gca aat gct gga aac aat gga aca cag tgg gac 19573

Val Lys Thr Asp Ala Asn Ala Gly Asn Asn Gly Thr Gln Trp Asp

3130 3135 3140

aaa gat gac acc aca gtt agc act gct aat gaa atc cat gga ggc 19618

Lys Asp Asp Thr Thr Val Ser Thr Ala Asn Glu Ile His Gly Gly

3145 3150 3155

aat cct ttt gcc atg gaa atc aac att cag gcc aac ctg tgg cgg 19663

Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg

3160 3165 3170

aac ttc ctc tat gcc aac gtg gcc ctc tat ttg cct gat aaa tac 19708

Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Lys Tyr

3175 3180 3185

aaa tac aca cct gcc aac atc acc ctg ccc acc aac acc aac acc 19753

Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr

3190 3195 3200

tac gat tac atg aac ggc cgg gtg gtg gcg ccc tcg ctg gtg gac 19798

Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp

3205 3210 3215

gcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg gac 19843

Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp

3220 3225 3230

aac gtg aat ccc ttc aac cac cac cgc aac gcg ggg ctg cgc tac 19888

Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr

3235 3240 3245

cgc tcc atg ctt ctg ggc aac ggg cgc tac gtg ccc ttc cac atc 19933

Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile

3250 3255 3260

cag gtg ccc cag aaa ttt ttc gcc atc aag agc ctc ctg ctc ctg 19978

Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu

3265 3270 3275

ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc aac 20023

Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn

3280 3285 3290

atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac ggg 20068

Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly

3295 3300 3305

gcc tcc atc tcc ttc acc agc atc aac ctc tac gcc acc ttc ttc 20113

Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe

3310 3315 3320

ccc atg gcg cac aac acg gcc tcc acg ctc gag gcc atg ctg cgc 20158

Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg

3325 3330 3335

aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg gcg gcc 20203

Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala

3340 3345 3350

aac atg ctc tat ccc atc ccg gcc aac gcc acc aac gtg ccc ata 20248

Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile

3355 3360 3365

tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc ggc tgg tcc ttc 20293

Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe

3370 3375 3380

acg cgc ctc aag acc aag gag acg ccc tcg ctg ggc tcc ggg ttc 20338

Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe

3385 3390 3395

gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac ggc 20383

Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly

3400 3405 3410

acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc acc ttc 20428

Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe

3415 3420 3425

gac tcc tcc gtc agc tgg ccc ggc aac gac cgg ctc ctg acg ccc 20473

Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro

3430 3435 3440

aac gag ttc gaa atc aag cgc acc gtc gac ggc gag ggc tac aat 20518

Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn

3445 3450 3455

gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc cag atg 20563

Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met

3460 3465 3470

ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gag 20608

Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu

3475 3480 3485

ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag ccc 20653

Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro

3490 3495 3500

atg agc cgc cag gtg gtg gac gag gtc aac tac aag gac tac cag 20698

Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln

3505 3510 3515

gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc gtc ggc 20743

Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly

3520 3525 3530

tac ctc gcg ccc acc atg cgc cag ggc cag ccc tac ccc gcc aac 20788

Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn

3535 3540 3545

tac cca tac ccg ctc atc ggc aag agc gcc gtc acc agc gtc acc 20833

Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr

3550 3555 3560

cag aaa aag ttc ctc tgc gac agg gtc atg tgg cgc atc ccc ttc 20878

Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe

3565 3570 3575

tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc cag 20923

Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln

3580 3585 3590

aac atg ctc tat gcc aac tcc gcc cac gcg cta gac atg aat ttc 20968

Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe

3595 3600 3605

gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc ttc 21013

Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe

3610 3615 3620

gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc gtc 21058

Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val

3625 3630 3635

atc gag gcc gtc tac ctg cgc acc ccc ttc tcg gcc ggt aac gcc 21103

Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala

3640 3645 3650

acc acc taaattgctt cttgcaagct atg gcc tgt gcc acg ggc tcc ggc 21153

Thr Thr Met Ala Cys Ala Thr Gly Ser Gly

3655 3660

gag cag gag ctc agg gcc atc atc cgc gac ctg ggc tgc ggg ccc 21198

Glu Gln Glu Leu Arg Ala Ile Ile Arg Asp Leu Gly Cys Gly Pro

3665 3670 3675

tac ttc ctg ggc acc ttc gat aag cgc ttc ccg gga ttc atg gcc 21243

Tyr Phe Leu Gly Thr Phe Asp Lys Arg Phe Pro Gly Phe Met Ala

3680 3685 3690

ccg cac aag ctg gcc tgc gcc atc gtc aac acg gcc ggc cgc gag 21288

Pro His Lys Leu Ala Cys Ala Ile Val Asn Thr Ala Gly Arg Glu

3695 3700 3705

acc ggg ggc gag cac tgg ctg gcc ttc gcc tgg aac ccg cgc tcg 21333

Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp Asn Pro Arg Ser

3710 3715 3720

aac acc tgc tac ctc ttc gac ccc ttc ggg ttc tcg gac gag cgc 21378

Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser Asp Glu Arg

3725 3730 3735

ctc aag cag atc tac cag ttc gag tac gag ggc ctg ctg cgt cgc 21423

Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu Arg Arg

3740 3745 3750

agc gcc ctg gcc acc gag gac cgc tgc gtc acc ctg gaa aag tcc 21468

Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys Ser

3755 3760 3765

acc cag acc gtg cag ggt ccg cgc tcg gcc gcc tgc ggg ctc ttc 21513

Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe

3770 3775 3780

tgc tgc atg ttc ctg cac gcc ttc gtg cac tgg ccc gac cgc ccc 21558

Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro

3785 3790 3795

atg gac aag aac ccc acc atg aac ttg ctg acg ggg gtg ccc aac 21603

Met Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn

3800 3805 3810

ggc atg ctc cag tcg ccc cag gtg gaa ccc acc ctg cgc cgc aac 21648

Gly Met Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn

3815 3820 3825

cag gag gcg ctc tac cgc ttc ctc aac tcc cac tcc gcc tac ttt 21693

Gln Glu Ala Leu Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe

3830 3835 3840

cgc tcc cac cgc gcg cgc atc gag aag gcc acc gcc ttc gac cgc 21738

Arg Ser His Arg Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg

3845 3850 3855

atg aac aat caa gac atg taaaccgtgt gtgtatgtta aaatgtcttt 21786

Met Asn Asn Gln Asp Met

3860

aataaacagc actttcatgt tacacatgca tctgagatgg tttattattt agaaatcgaa 21846

agggttctgc cgggtctcgg catggcccgc gggcagggac acgttgcgga actggtactt 21906

ggccagccac ttgaactcgg ggatcagcag tttcggcagc ggggtgtcgg ggaaggagtc 21966

ggtccacagc ttccgcgtca gttgcagggc gcccaggagg tcgggcgcgg agatcttgaa 22026

atcgcagttg ggacccgcgt tctgcgcgcg agagttgcgg tacacggggt tgcagcactg 22086

gaacaccatc agggccgggt gcttcacgct cgccagcacc gtcgcgtcgg tgatgctctc 22146

cacgtcgagg tcctcggcgt tggccatccc gaagggggtc atcttgcagg tctgcctgcc 22206

catggtgggc acgcagccgg gcttgtggtt gcaatcgcag tgcaggggga tcagcatcat 22266

ctgggcctgg tcggcgttca tccccgggta catggccttc atgaaagcct ccaattgcct 22326

aaaagcctgc tgggccttgg ctccctcggt gaagaagacc ccgcaagact tgctagagaa 22386

ctggttggta gcgcacccgg cgtcatgcac gcagcagcgc gcgtcgttgt tggccagctg 22446

caccacgctg cgcccccagc ggttctgggt gatcttggcc cggttggggt tctccttcag 22506

cgcgcgctgc ccgttctcgc tcgccacatc catctcgatc atgtgctcct tctggatcat 22566

ggtggtcccg tgcaggcacc gcagcttgcc ctcggtctcg gtgcaaccgt gcagccacag 22626

cgcgcacccg gtgcactccc agttcttgtg ggcgatctgt gaatgcgcgt gcacgaaccc 22686

ttgcaggaag cggcccatca tggtcgtcag ggtcttgttg ctagtgaagg tcagcggaat 22746

gccgcggtgc tcctcgttga tgtacaggtg gcagatgcgg cggtacacct cgccctgctc 22806

gggcatcagc tggaagttgg ctttcaggtc agtctccacg cggtagcggt ccatcagtat 22866

agtcatgatt tccatgccct tctcccaggc cgagacgatg ggcaggctca tagggttctt 22926

caccatcatc ttagcgctag ccgccgcagc cagggggtcg ctctcgtcca gggtctcaaa 22986

gctccgcttg ccgtccttct cggtgatccg cacggggggg tagctgaagc ccacggccgc 23046

cagctcctcc tcggcctgcc tttcgtcctc gctgtcctgg ctgacgtcct gcagaaccac 23106

atgcttggtc ttgcggggtt tcttcttggg tggcagtggc ggcggagatg cttgcggcga 23166

gggggagcgc gagttctcgc tcaccactac tatctcttcc tcttcttggt ccgaggccac 23226

gcggcggtag gtatgtctct tcgggggcag aggcggaggc gacgggctct cgccgccgcg 23286

acttggcgga tggctggcag agccccttcc gcgatcgggg gtgcgctccc ggcggcgctc 23346

tgactgactt cctccgcggc cggccattgt gttctcctag ggaacaacaa gc atg gag 23404

Met Glu

act cag cca tcg cca acc tcg cca tct gcc ccc acc acc gcc gac 23449

Thr Gln Pro Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr Ala Asp

3865 3870 3875

gag aag cag cag cag cag aat gaa agc tta acc gcc ccg ccg ccc 23494

Glu Lys Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro

3880 3885 3890

agc ccc gcc acc tcc gac gcg gcg gtc cca gac atg caa gag atg 23539

Ser Pro Ala Thr Ser Asp Ala Ala Val Pro Asp Met Gln Glu Met

3895 3900 3905

gag gaa tcc atc gag att gac ctg ggc tat gtg acg ccc gcg gag 23584

Glu Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu

3910 3915 3920

cac gag gag gag ctg gca gtg cgc ttt caa tcg tca agc cag gaa 23629

His Glu Glu Glu Leu Ala Val Arg Phe Gln Ser Ser Ser Gln Glu

3925 3930 3935

gat aaa gaa cag cca gag cag gaa gca gaa aac gag cag agt cag 23674

Asp Lys Glu Gln Pro Glu Gln Glu Ala Glu Asn Glu Gln Ser Gln

3940 3945 3950

gct ggg ctc gag cat ggc gac tac ctc cac ctg agc ggg ggg gag 23719

Ala Gly Leu Glu His Gly Asp Tyr Leu His Leu Ser Gly Gly Glu

3955 3960 3965

gac gcg ctc atc aag cat ctg gcc cgg cag gcc atc atc gtc aag 23764

Asp Ala Leu Ile Lys His Leu Ala Arg Gln Ala Ile Ile Val Lys

3970 3975 3980

gat gcg ctg ctc gac cgc acc gag gtg ccc ctc agc gtg gag gag 23809

Asp Ala Leu Leu Asp Arg Thr Glu Val Pro Leu Ser Val Glu Glu

3985 3990 3995

ctc agc cgc gcc tac gag ctc aac ctc ttc tcg ccg cgc gtg ccc 23854

Leu Ser Arg Ala Tyr Glu Leu Asn Leu Phe Ser Pro Arg Val Pro

4000 4005 4010

ccc aag cgc cag ccc aac ggc acc tgc gag ccc aac ccg cgc ctc 23899

Pro Lys Arg Gln Pro Asn Gly Thr Cys Glu Pro Asn Pro Arg Leu

4015 4020 4025

aac ttc tac ccg gtc ttc gcg gtg ccc gag gcc ctg gcc acc tac 23944

Asn Phe Tyr Pro Val Phe Ala Val Pro Glu Ala Leu Ala Thr Tyr

4030 4035 4040

cac atc ttt ttc aag aac caa aag atc ccc gtc tcc tgc cgc gcc 23989

His Ile Phe Phe Lys Asn Gln Lys Ile Pro Val Ser Cys Arg Ala

4045 4050 4055

aac cgc acc cgc gcc gac gcc ctt ttg aac ctg ggc ccc ggc gcc 24034

Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu Gly Pro Gly Ala

4060 4065 4070

cgc cta cct gat atc gcc tcc ttg gaa gag gtt ccc aag atc ttc 24079

Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro Lys Ile Phe

4075 4080 4085

gag ggt ctg ggc agc gac gag act cgg gcc gcg aac gct ctg caa 24124

Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala Leu Gln

4090 4095 4100

gga gaa gga gga gag cat gag cac cac agc gcc ctg gtc gag ttg 24169

Gly Glu Gly Gly Glu His Glu His His Ser Ala Leu Val Glu Leu

4105 4110 4115

gaa ggc gac aac gcg cgg ctg gcg gtg ctc aaa cgc acg gtc gag 24214

Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu

4120 4125 4130

ctg acc cat ttc gcc tac ccg gct ctg aac ctg ccc ccc aaa gtc 24259

Leu Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val

4135 4140 4145

atg agc gcg gtc atg gac cag gtg ctc atc aag cgc gcg tcg ccc 24304

Met Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro

4150 4155 4160

atc tcc gag gac gag atg cag gac ccc gag agc tcc gag gag ggc 24349

Ile Ser Glu Asp Glu Met Gln Asp Pro Glu Ser Ser Glu Glu Gly

4165 4170 4175

aag ccc gtg gtc agc gac gag cag ctg gcc cgg tgg ctg ggt cct 24394

Lys Pro Val Val Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Pro

4180 4185 4190

aat gct acc cct cag agt ttg gaa gag cgg cgc aag ctc atg atg 24439

Asn Ala Thr Pro Gln Ser Leu Glu Glu Arg Arg Lys Leu Met Met

4195 4200 4205

gcc gtg gtc ctg gtg acc gtg gag ctg gag tgt ctg cgc cgc ttc 24484

Ala Val Val Leu Val Thr Val Glu Leu Glu Cys Leu Arg Arg Phe

4210 4215 4220

ttc gcc gac gcg gag acc ctg cgc aag gtc gag gag aac ctg cac 24529

Phe Ala Asp Ala Glu Thr Leu Arg Lys Val Glu Glu Asn Leu His

4225 4230 4235

tac atc ttc agg cac ggg ttc gtg cgc cag gcc tgc aag att tcc 24574

Tyr Ile Phe Arg His Gly Phe Val Arg Gln Ala Cys Lys Ile Ser

4240 4245 4250

aac gtg gag ctg acc aac ctg gtc tcc tac atg ggc atc ttg cac 24619

Asn Val Glu Leu Thr Asn Leu Val Ser Tyr Met Gly Ile Leu His

4255 4260 4265

gag aac cga ctg ggg cag aac gtg ctg cac acc acc ctg cgc ggg 24664

Glu Asn Arg Leu Gly Gln Asn Val Leu His Thr Thr Leu Arg Gly

4270 4275 4280

gag gcc cgg cgc gac tac atc cgc gac tgc gtc tac ctc tac ctc 24709

Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys Val Tyr Leu Tyr Leu

4285 4290 4295

tgc cac acc tgg cag acg ggc atg ggc gtg tgg cag cag tgc ctg 24754

Cys His Thr Trp Gln Thr Gly Met Gly Val Trp Gln Gln Cys Leu

4300 4305 4310

gag gag cag aac ctg aaa gag ctc tgc aag ctc ctg cag aag aac 24799

Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu Gln Lys Asn

4315 4320 4325

ctc aag ggt ctg tgg acc ggg ttc gac gag cgc acc acc gcc tcg 24844

Leu Lys Gly Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr Ala Ser

4330 4335 4340

gac ctg gcc gac ctc atc ttc ccc gag cgc ctc agg ctg acg ctg 24889

Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr Leu

4345 4350 4355

cgc aac ggc ctg ccc gac ttt atg agc caa agc atg ttg caa aac 24934

Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn

4360 4365 4370

ttt cgc tct ttc atc ctc gaa cgc tcc gga atc ctg ccc gcc acc 24979

Phe Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr

4375 4380 4385

tgc tcc gcg ctg ccc tcg gac ttc gtg ccg ctg acc ttc cgc gag 25024

Cys Ser Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu

4390 4395 4400

tgc ccc ccg ccg ctg tgg agc cac tgc tac ctg ctg cgc ctg gcc 25069

Cys Pro Pro Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala

4405 4410 4415

aac tac ctg gcc tac cac tcg gac gtg atc gag gac gtc agc ggc 25114

Asn Tyr Leu Ala Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly

4420 4425 4430

gag ggc ctg ctc gag tgc cac tgc cgc tgc aac ctc tgc acg ccg 25159

Glu Gly Leu Leu Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro

4435 4440 4445

cac cgc tcc ctg gcc tgc aac ccc cag ctg ctg agc gag acc cag 25204

His Arg Ser Leu Ala Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln

4450 4455 4460

atc atc ggc acc ttc gag ttg caa ggg ccc agc gat gag ggc gag 25249

Ile Ile Gly Thr Phe Glu Leu Gln Gly Pro Ser Asp Glu Gly Glu

4465 4470 4475

gga tcg gcc aag ggg ggt ctg aaa ctc acc ccg ggg ctg tgg acc 25294

Gly Ser Ala Lys Gly Gly Leu Lys Leu Thr Pro Gly Leu Trp Thr

4480 4485 4490

tcg gcc tac ttg cgc aag ttc gtg ccc gag gac tac cat ccc ttc 25339

Ser Ala Tyr Leu Arg Lys Phe Val Pro Glu Asp Tyr His Pro Phe

4495 4500 4505

gag atc agg ttc tac gag gac caa tcc cag ccg ccc aag gcc gag 25384

Glu Ile Arg Phe Tyr Glu Asp Gln Ser Gln Pro Pro Lys Ala Glu

4510 4515 4520

ctg tcg gcc tgc gtc atc acc cag ggg gcc atc ctg gcc caa ttg 25429

Leu Ser Ala Cys Val Ile Thr Gln Gly Ala Ile Leu Ala Gln Leu

4525 4530 4535

caa gcc atc cag aaa tcc cgc caa gaa ttc ttg ctg aaa aag ggc 25474

Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu Leu Lys Lys Gly

4540 4545 4550

cgc ggg gtc tac ctc gac ccc cag acc ggt gag gag ctc aac ccc 25519

Arg Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu Leu Asn Pro

4555 4560 4565

ggc ttc ccc cag gat gcc ccg agg aaa caa gaa gct gaa agt gga 25564

Gly Phe Pro Gln Asp Ala Pro Arg Lys Gln Glu Ala Glu Ser Gly

4570 4575 4580

gct gcc gcc cgt gga gga ttt gga gga aga ctg gga gaa cag cag 25609

Ala Ala Ala Arg Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln Gln

4585 4590 4595

tca ggc aga gga gga gga gat gga gga aga ctg gga cag cac tca 25654

Ser Gly Arg Gly Gly Gly Asp Gly Gly Arg Leu Gly Gln His Ser

4600 4605 4610

ggc aga gga gga cag cct gca aga cag tct gga gga aga cga gga 25699

Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly

4615 4620 4625

gga ggc aga gga gga ggt gga aga agc agc cgc cgc cag acc gtc 25744

Gly Gly Arg Gly Gly Gly Gly Arg Ser Ser Arg Arg Gln Thr Val

4630 4635 4640

gtc ctc ggc ggg gga gaa agc aag cag cac gga tac cat ctc cgc 25789

Val Leu Gly Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu Arg

4645 4650 4655

tcc ggg tcg ggg tcc cgc tcg gcc cca cag tagatgggac gagaccgggc 25839

Ser Gly Ser Gly Ser Arg Ser Ala Pro Gln

4660 4665

gattcccgaa ccccaccacc cagaccggta agaaggagcg gcagggatac aagtcctggc 25899

gggggcacaa aaacgccatc gtctcctgct tgcaggcctg cgggggcaac atctccttca 25959

cccggcgcta cctgctcttc caccgcgggg taaacttccc ccgcaacatc ttgcattact 26019

accgtcacct ccacagcccc tactacttcc aagaagaggc agcagcagaa aaagaccagc 26079

agaaaaccag cagcagctag aaaatccaca gcggcggcag cgacaggtgg actgaggatc 26139

gcggcgaacg agccggcgca gacccgggag ctgaggaacc ggatctttcc caccctctat 26199

gccatcttcc agcagagtcg ggggcaggag caggaactga aagtcaagaa ccgttctctg 26259

cgctcgctca cccgcagttg tctgtatcac aagagcgaag accaacttca gcgcactctc 26319

gaggacgccg aggctctctt caacaagtac tgcgcgctca ctcttaaaga gtagcccgcg 26379

cccgcccagt cgcagaaaaa ggcgggaatt acgtcacctg tgcccttcgc cctagccgcc 26439

tccacccctc atc atg agc aaa gag att ccc acg cct tac atg tgg agc 26488

Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser

4670 4675 4680

tac cag ccc cag atg ggc ctg gcc gcc ggc gcc gcc cag gac tac 26533

Tyr Gln Pro Gln Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr

4685 4690 4695

tcc acc cgc atg aat tgg ctc agc gcc ggg ccc gcg atg atc tca 26578

Ser Thr Arg Met Asn Trp Leu Ser Ala Gly Pro Ala Met Ile Ser

4700 4705 4710

cgg gtg aat gac atc cgc gcc cac cga aac cag ata ctc cta gaa 26623

Arg Val Asn Asp Ile Arg Ala His Arg Asn Gln Ile Leu Leu Glu

4715 4720 4725

cag tca gcg ctc acc gcc acg ccc cgc aat cac ctc aat ccg cgt 26668

Gln Ser Ala Leu Thr Ala Thr Pro Arg Asn His Leu Asn Pro Arg

4730 4735 4740

aat tgg ccc gcc gcc ctg gtg tac cag gaa att ccc cag ccc acg 26713

Asn Trp Pro Ala Ala Leu Val Tyr Gln Glu Ile Pro Gln Pro Thr

4745 4750 4755

acc gta cta ctt ccg cga gac gcc cag gcc gaa gtc cag ctg act 26758

Thr Val Leu Leu Pro Arg Asp Ala Gln Ala Glu Val Gln Leu Thr

4760 4765 4770

aac tca ggt gtc cag ctg gcg ggc ggc gcc acc ctg tgt cgt cac 26803

Asn Ser Gly Val Gln Leu Ala Gly Gly Ala Thr Leu Cys Arg His

4775 4780 4785

cgc ccc gct cag ggt ata aag cgg ctg gtg atc cgg ggc aga ggc 26848

Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile Arg Gly Arg Gly

4790 4795 4800

aca cag ctc aac gac gag gtg gtg agc tct tcg ctg ggt ctg cga 26893

Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu Gly Leu Arg

4805 4810 4815

cct gac gga gtc ttc caa ctc gcc gga tcg ggg aga tct tcc ttc 26938

Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser Ser Phe

4820 4825 4830

acg cct cgt cag gcc gtg ctg act ttg gag agt tcg tcc tcg cag 26983

Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser Gln

4835 4840 4845

ccc cgc tcg ggc ggc atc ggc act ctc cag ttc gtg gag gag ttc 27028

Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe

4850 4855 4860

act ccc tcg gtc tac ttc aac ccc ttc tcc ggc tcc ccc ggc cac 27073

Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His

4865 4870 4875

tac ccg gac gag ttc atc ccg aac ttt gac gcc atc agc gag tcg 27118

Tyr Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser

4880 4885 4890

gtg gac ggc tac gat tga atg tcc cat ggt ggc gcg gct gac cta 27163

Val Asp Gly Tyr Asp Met Ser His Gly Gly Ala Ala Asp Leu

4895 4900 4905

gct cgg ctt cga cac ctg gac cac tgc cgc cgc ttt cgc tgc ttc 27208

Ala Arg Leu Arg His Leu Asp His Cys Arg Arg Phe Arg Cys Phe

4910 4915 4920

gct cgg gac ctc gcc gag ttc acc tac ttt gag ctg ccc gag gag 27253

Ala Arg Asp Leu Ala Glu Phe Thr Tyr Phe Glu Leu Pro Glu Glu

4925 4930 4935

cat cct cag ggc ccg gcc cac gga gtg cgg atc gtc gtc gaa ggg 27298

His Pro Gln Gly Pro Ala His Gly Val Arg Ile Val Val Glu Gly

4940 4945 4950

ggc cta gac tcc cac ctg ctt cgg atc ttc agc cag cgc ccg atc 27343

Gly Leu Asp Ser His Leu Leu Arg Ile Phe Ser Gln Arg Pro Ile

4955 4960 4965

ctg gtc gag cgc caa cag ggc aac acc ctc ctg acc ctc tac tgc 27388

Leu Val Glu Arg Gln Gln Gly Asn Thr Leu Leu Thr Leu Tyr Cys

4970 4975 4980

atc tgc gac cac ccc ggc ctg cat gaa agt ctt tgt tgt ctg ctg 27433

Ile Cys Asp His Pro Gly Leu His Glu Ser Leu Cys Cys Leu Leu

4985 4990 4995

tgt act gag tat aat aaa agc tgagatcagc gactactccg gactcaactg 27484

Cys Thr Glu Tyr Asn Lys Ser

5000

tggtgtttct gcatccatca atcggtcact gaccttcacc gggaacgaga ccgagctcca 27544

gctccagtgt aagccccaca agaagtacct cacctggctg taccagggct ccccgatcgc 27604

cgttgttaac cactgcgacg acgacggagt cctgctgaac ggtcccgcca accttacttt 27664

ttccacccgc agaagcaagc tcgagctctt ccaacccttc ctccccggga cctatcagtg 27724

catctcggga ccctgccatc acaccttcca cctgatcccg aataccacct cttccccagc 27784

gccgctcccc actaacaacc aaactaacca ccaccaacgc taccgacgcg acctcgttga 27844

atctaatacc acccacaccg gaggtgagct ccgaggtcct gaatcctctg ggatttatta 27904

cggcccctgg gaggtggtgg ggttaatagc tttaggctta gtagcgggtg ggcttttggc 27964

tctctgctac ctatacctcc cttgcttttc ctacttagtg gtgctttgtt gctggtttaa 28024

gaa atg ggg aag atc acc cta gtg tgc ggt gtg ctg gtg acg gtg 28069

Met Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val

5005 5010 5015

gtg ctt tcg att ctg gga ggg gga agc gcg gct gta gtg acg gag 28114

Val Leu Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu

5020 5025 5030

aag aag gcc gat ccc tgc ttg act ttc aac ccc gat aaa tgc cgg 28159

Lys Lys Ala Asp Pro Cys Leu Thr Phe Asn Pro Asp Lys Cys Arg

5035 5040 5045

ctg agt ttt cag ccc gat ggc aat cgg tgc gcg gtg ttg atc aag 28204

Leu Ser Phe Gln Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys

5050 5055 5060

tgc gga tgg gaa tgc gag agt gtg gcg att cag tat aaa aac aag 28249

Cys Gly Trp Glu Cys Glu Ser Val Ala Ile Gln Tyr Lys Asn Lys

5065 5070 5075

acg cgg aac aat act ctc gcg tcc aca tgg cag ccc ggg gac ccc 28294

Thr Arg Asn Asn Thr Leu Ala Ser Thr Trp Gln Pro Gly Asp Pro

5080 5085 5090

gag tgg tac acc gtc tct gtc cct ggt gct gac ggc tcc ctc cgc 28339

Glu Trp Tyr Thr Val Ser Val Pro Gly Ala Asp Gly Ser Leu Arg

5095 5100 5105

acg gtg aac aac act ttc att ttt gag cac atg tgc gaa acc gcc 28384

Thr Val Asn Asn Thr Phe Ile Phe Glu His Met Cys Glu Thr Ala

5110 5115 5120

atg ttc atg agc aag cag tac ggt atg tgg cct cca cga aaa gag 28429

Met Phe Met Ser Lys Gln Tyr Gly Met Trp Pro Pro Arg Lys Glu

5125 5130 5135

aat atc gtg gtc ttc tcc atc gct tac agc gcg tgc acg gtg cta 28474

Asn Ile Val Val Phe Ser Ile Ala Tyr Ser Ala Cys Thr Val Leu

5140 5145 5150

atc acc gcg atc gtg tgc ctg agc att cac atg ctc atc gct att 28519

Ile Thr Ala Ile Val Cys Leu Ser Ile His Met Leu Ile Ala Ile

5155 5160 5165

cgc ccc aga aat aat gcc gag aaa gag aaa cag cca taacacactt 28565

Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro

5170 5175

ttttcacaca cctttttcag acc atg gcc tct gtt aaa gtt ttg atc ata 28615

Met Ala Ser Val Lys Val Leu Ile Ile

5180 5185

gtc agt tta att gct tat gcc tct ggc tat gca aac att aaa aaa 28660

Val Ser Leu Ile Ala Tyr Ala Ser Gly Tyr Ala Asn Ile Lys Lys

5190 5195 5200

acc ctc tat gtt ggt tct gac tct ata tta gag ggt acc caa tcc 28705

Thr Leu Tyr Val Gly Ser Asp Ser Ile Leu Glu Gly Thr Gln Ser

5205 5210 5215

gga gct aag gtt tca tgg tat ttt tat aga acc aca act gat cca 28750

Gly Ala Lys Val Ser Trp Tyr Phe Tyr Arg Thr Thr Thr Asp Pro

5220 5225 5230

gtt aaa ctc tgt aag ggt gaa tta cca cgc acg cat aaa act cca 28795

Val Lys Leu Cys Lys Gly Glu Leu Pro Arg Thr His Lys Thr Pro

5235 5240 5245

ctt aca ttc agc tgc agc aat aat aat ctt aca ctt ttt tcg gtc 28840

Leu Thr Phe Ser Cys Ser Asn Asn Asn Leu Thr Leu Phe Ser Val

5250 5255 5260

act aaa caa tat gct ggc act tac tac agc aca aat ttt cat act 28885

Thr Lys Gln Tyr Ala Gly Thr Tyr Tyr Ser Thr Asn Phe His Thr

5265 5270 5275

ggg caa gat aaa tac tat act att aca gtg gcc gat cca acc act 28930

Gly Gln Asp Lys Tyr Tyr Thr Ile Thr Val Ala Asp Pro Thr Thr

5280 5285 5290

cct aga act act act aaa ccc acc aca act aaa aag ccc act aca 28975

Pro Arg Thr Thr Thr Lys Pro Thr Thr Thr Lys Lys Pro Thr Thr

5295 5300 5305

cct aaa aag cct acc aca ccc aaa acc act aag aca aca act aag 29020

Pro Lys Lys Pro Thr Thr Pro Lys Thr Thr Lys Thr Thr Thr Lys

5310 5315 5320

acc act acc aca gag cca acc aca acc agc acc aca ctt gct ata 29065

Thr Thr Thr Thr Glu Pro Thr Thr Thr Ser Thr Thr Leu Ala Ile

5325 5330 5335

act aca cac acg gag ctg acc tca caa gca act act gaa aat ggt 29110

Thr Thr His Thr Glu Leu Thr Ser Gln Ala Thr Thr Glu Asn Gly

5340 5345 5350

ttt gcc cta ttg caa aag ggg gat aac agt agc agc agt cct ctg 29155

Phe Ala Leu Leu Gln Lys Gly Asp Asn Ser Ser Ser Ser Pro Leu

5355 5360 5365

cct act acc ccc agt gag gaa ata cct aag tcc atg gtt ggc att 29200

Pro Thr Thr Pro Ser Glu Glu Ile Pro Lys Ser Met Val Gly Ile

5370 5375 5380

atc gct gcc gta gtg gtg tgt atg ctg att atc atc ttg tgc atg 29245

Ile Ala Ala Val Val Val Cys Met Leu Ile Ile Ile Leu Cys Met

5385 5390 5395

atg tac tat gcc tgc tac tac aga aaa cac agg ctg aac aat aag 29290

Met Tyr Tyr Ala Cys Tyr Tyr Arg Lys His Arg Leu Asn Asn Lys

5400 5405 5410

ctg gac ccc cta ctg aat gtt gat ttt taatttttta gaacc atg aag 29338

Leu Asp Pro Leu Leu Asn Val Asp Phe Met Lys

5415 5420

atc cta agc ctt ttt gtt ttt tct ata att att acc tct gct cta 29383

Ile Leu Ser Leu Phe Val Phe Ser Ile Ile Ile Thr Ser Ala Leu

5425 5430 5435

tgc aat tcc aac gat gac gtt act gtc gtt gtt gga tcc aat tac 29428

Cys Asn Ser Asn Asp Asp Val Thr Val Val Val Gly Ser Asn Tyr

5440 5445 5450

aca ctg aaa gga cct cca aaa ggt atg ctt tcg tgg tat tgc tgg 29473

Thr Leu Lys Gly Pro Pro Lys Gly Met Leu Ser Trp Tyr Cys Trp

5455 5460 5465

ttt gga aac gat gag caa caa aca gag ctc tgc aat ctt aaa aat 29518

Phe Gly Asn Asp Glu Gln Gln Thr Glu Leu Cys Asn Leu Lys Asn

5470 5475 5480

gga aaa gtc tca aac cca aaa atc tcc aat tat ata tgc aat ggc 29563

Gly Lys Val Ser Asn Pro Lys Ile Ser Asn Tyr Ile Cys Asn Gly

5485 5490 5495

act gat tta ata ctg ctc aat atc acg aaa gca tat gct ggc agt 29608

Thr Asp Leu Ile Leu Leu Asn Ile Thr Lys Ala Tyr Ala Gly Ser

5500 5505 5510

tat act tgt cct gga gac aat gat gac atg att ttt aaa aaa ttg 29653

Tyr Thr Cys Pro Gly Asp Asn Asp Asp Met Ile Phe Lys Lys Leu

5515 5520 5525

att gtg gtt gat ccc act acc cca gca cca ccc acc aca act acc 29698

Ile Val Val Asp Pro Thr Thr Pro Ala Pro Pro Thr Thr Thr Thr

5530 5535 5540

aag gca cat acc aca gac aca cag gaa acc act cca gag gca gaa 29743

Lys Ala His Thr Thr Asp Thr Gln Glu Thr Thr Pro Glu Ala Glu

5545 5550 5555

gta gca gag tta gca aag cag att cat gaa gat tcc ttt gtt gcc 29788

Val Ala Glu Leu Ala Lys Gln Ile His Glu Asp Ser Phe Val Ala

5560 5565 5570

aat acc ccc aca cac ccc gga ccg caa tgt cca ggg cca tta gtc 29833

Asn Thr Pro Thr His Pro Gly Pro Gln Cys Pro Gly Pro Leu Val

5575 5580 5585

agc ggc att gtc ggt gtg ctt tgc ggg tta gca gtt ata atc atc 29878

Ser Gly Ile Val Gly Val Leu Cys Gly Leu Ala Val Ile Ile Ile

5590 5595 5600

tgc atg ttc att ttt gct tgc tgc tac aga agg ctc cac cga caa 29923

Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg Arg Leu His Arg Gln

5605 5610 5615

aaa tca gac cca ctg ctg aac ctc tat gtt taatttttga ttttccagag 29973

Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val

5620 5625

cc atg aag gca ctt agc act tta gta ttt ttg tcc ttg att ggc att 30020

Met Lys Ala Leu Ser Thr Leu Val Phe Leu Ser Leu Ile Gly Ile

5630 5635 5640

gta ttc agt gct ggg ttt ttg aaa aat ctt acc att att gaa ggc 30065

Val Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Ile Glu Gly

5645 5650 5655

gaa aat gca aca ctg gta ggc atc agt ggt cag aat gtt agc tgg 30110

Glu Asn Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp

5660 5665 5670

tta aaa tac cat cta gat ggg tgg aaa cct att tgc acc tgg aat 30155

Leu Lys Tyr His Leu Asp Gly Trp Lys Pro Ile Cys Thr Trp Asn

5675 5680 5685

gtc agt gtg tac aca tgc cat ggt gtt aac ctc acc att acc aat 30200

Val Ser Val Tyr Thr Cys His Gly Val Asn Leu Thr Ile Thr Asn

5690 5695 5700

gcc acc caa gat cag aat ggc agg ttt aag ggt cag agt ttt act 30245

Ala Thr Gln Asp Gln Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr

5705 5710 5715

agc aac aat ggg tat gaa acc cat aac atg ttc atc tat gat gtc 30290

Ser Asn Asn Gly Tyr Glu Thr His Asn Met Phe Ile Tyr Asp Val

5720 5725 5730

act gtc ata tca aat aag act aca cct acc acc cag aca ccc act 30335

Thr Val Ile Ser Asn Lys Thr Thr Pro Thr Thr Gln Thr Pro Thr

5735 5740 5745

aca cac agc tca aca cat gcc atg cag acc act cag aca acc aca 30380

Thr His Ser Ser Thr His Ala Met Gln Thr Thr Gln Thr Thr Thr

5750 5755 5760

tac act aca tcc act gaa ccc acc acc act aca gca gag gta acc 30425

Tyr Thr Thr Ser Thr Glu Pro Thr Thr Thr Thr Ala Glu Val Thr

5765 5770 5775

agc aca gcg cct cag ccc cag gca ttg gct tta agg gct gca cag 30470

Ser Thr Ala Pro Gln Pro Gln Ala Leu Ala Leu Arg Ala Ala Gln

5780 5785 5790

tct agc agc atg act gct aaa acc aat gag cag act act gaa ttt 30515

Ser Ser Ser Met Thr Ala Lys Thr Asn Glu Gln Thr Thr Glu Phe

5795 5800 5805

ttg tcc act act cag agt cgc acc aca gct acc tcg agt gcc ttc 30560

Leu Ser Thr Thr Gln Ser Arg Thr Thr Ala Thr Ser Ser Ala Phe

5810 5815 5820

tct agc acc gcc aat ctc acc tcg ctt tcc tct atg cca atc agt 30605

Ser Ser Thr Ala Asn Leu Thr Ser Leu Ser Ser Met Pro Ile Ser

5825 5830 5835

aat gct act acc ctc ccc gct cct ctt ccc act cct ctg aag caa 30650

Asn Ala Thr Thr Leu Pro Ala Pro Leu Pro Thr Pro Leu Lys Gln

5840 5845 5850

tcc gag tct agc acg cag ctg cag atc acc ctg ctc att gtg atc 30695

Ser Glu Ser Ser Thr Gln Leu Gln Ile Thr Leu Leu Ile Val Ile

5855 5860 5865

ggg gtg gtc atc ctg gca gtg ctg ctc tac ttt atc ttc tgc cgc 30740

Gly Val Val Ile Leu Ala Val Leu Leu Tyr Phe Ile Phe Cys Arg

5870 5875 5880

cgc atc ccc aac gcg aaa ccg gcc tac aag ccc att gtt atc ggg 30785

Arg Ile Pro Asn Ala Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly

5885 5890 5895

acg ccg gag ccg ctt cag gtg gag gga ggt cta agg aat ctt ctc 30830

Thr Pro Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu

5900 5905 5910

ttc tct ttt aca gta tgg tgatttgaac t atg att cct aga cat ttc 30877

Phe Ser Phe Thr Val Trp Met Ile Pro Arg His Phe

5915 5920 5925

att atc act tct cta atc tgt gtg ctc caa gtc tgt gcc acc ctc 30922

Ile Ile Thr Ser Leu Ile Cys Val Leu Gln Val Cys Ala Thr Leu

5930 5935 5940

gct ctc gtg gct aac gcg agt cca gac tgc att gga gcg ttc gcc 30967

Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile Gly Ala Phe Ala

5945 5950 5955

tcc tac gtg ctc ttt gcc ttc atc acc tgc atc tgc tgc tgt agc 31012

Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys Cys Cys Ser

5960 5965 5970

ata gtc tgc ctg ctt atc acc ttc ttc cag ttc att gac tgg gtc 31057

Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp Trp Val

5975 5980 5985

ttt gtg cgc atc gcc tac ctg cgc cac cat ccc cag tac cgc gac 31102

Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg Asp

5990 5995 6000

cag aga gtg gct caa ctg ttg agg ctc atc tg atg ata agc atg cgg 31149

Gln Arg Val Ala Gln Leu Leu Arg Leu Ile Met Ile Ser Met Arg

6005 6010 6015

gct ctg cta cta ctc gcg ctt gca cta gct ccc ctc gcc gcc ccc 31194

Ala Leu Leu Leu Leu Ala Leu Ala Leu Ala Pro Leu Ala Ala Pro

6020 6025 6030

ttc tcc ctc aaa tcc ccc acc caa tcc cct gaa gag gtt cga aaa 31239

Phe Ser Leu Lys Ser Pro Thr Gln Ser Pro Glu Glu Val Arg Lys

6035 6040 6045

tgt aaa ttc caa gaa ccc tgg aaa ttc ctc aaa tgc tac cag ctc 31284

Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr Gln Leu

6050 6055 6060

aaa tca gaa atg cac ccc agc tgg atc atg atc att gga atc gtg 31329

Lys Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly Ile Val

6065 6070 6075

aac atc ctt gcc tgt acc ctc atc tcc ttt gtg att tac ccc tgc 31374

Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys

6080 6085 6090

ttt aac ttt ggg tgg aac gca ccc gag gcg ctc tgg ctc ccg cct 31419

Phe Asn Phe Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro Pro

6095 6100 6105

gat ccc gac aca cca cca cag cag cag caa aat cag gca cat gca 31464

Asp Pro Asp Thr Pro Pro Gln Gln Gln Gln Asn Gln Ala His Ala

6110 6115 6120

cca cca cag cct agg cca caa tac atg ccc atc tta gac tat gag 31509

Pro Pro Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu

6125 6130 6135

gcc gag cca cag cga gcc atg ctt cct gct att agt tac ttc aat 31554

Ala Glu Pro Gln Arg Ala Met Leu Pro Ala Ile Ser Tyr Phe Asn

6140 6145 6150

cta acc ggc gga gat gac tgaccccatg gccaacaaca ccgtcaacga 31602

Leu Thr Gly Gly Asp Asp

6155

cctcctggac atggacggcc gcgcctcgga gcagcgactc gcccaactcc gcatccgcca 31662

gcagcaggag agagccgtca aggagctgca ggatgcggtg gccatccacc agtgcaagag 31722

aggcatcttc tgcctggtga aacaggccaa gatctccttc gaggtcactt ccaccgacca 31782

tcgcctctcc tacgagctcc tgcagcagcg ccagaagttc acctgcctgg tcggagtcaa 31842

ccccatcgtc atcacccagc agtctggcga taccaagggg tgcatccact gctcctgcga 31902

ctcccccgag tgcgttcaca ccctgatcaa gaccctctgc ggcctccgcg acctcctccc 31962

catgaactaa tcaactaacc cactacccct ttaccctcca gtaaaaataa agattaaaga 32022

gttgatgatt gaattgatca ataaagaatc acttacttga aatctgaaac caggtctctg 32082

tccatgtttt ctgccagcag cacttcactc ccctcttccc agctctggta ctgcaggccc 32142

cggcgggctg caaacttcct ccacactctg aaggggatgt caaattcctc ctgtccctca 32202

atcttcattt ttatcttcta tcag atg tcc aaa aag cgc gcg cgg gtg gat 32253

Met Ser Lys Lys Arg Ala Arg Val Asp

6160 6165

gat ggc ttc gac ccc gtg tac ccc tac gat gca gac aac gca ccg 32298

Asp Gly Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro

6170 6175 6180

act gtg ccc ttc atc aac cct ccc ttc gtc tct tca gat gga ttc 32343

Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe

6185 6190 6195

caa gaa aag ccc ctg ggg gtg ttg tcc ctg cgc ctg gcc gac ccc 32388

Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro

6200 6205 6210

gtc acc acc aag aac ggg gct gtc acc ctc aag ctg ggg gat ggg 32433

Val Thr Thr Lys Asn Gly Ala Val Thr Leu Lys Leu Gly Asp Gly

6215 6220 6225

gtg gac ctc gac gac tcg gga aaa ctc atc tcc aaa aat gcc acc 32478

Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser Lys Asn Ala Thr

6230 6235 6240

aag gcc act gca cct ctc agt att tcc aac ggc acc att tcc ctt 32523

Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Gly Thr Ile Ser Leu

6245 6250 6255

aac atg gat acc cct ttt tac acc agc gat gga aaa tta tcc tta 32568

Asn Met Asp Thr Pro Phe Tyr Thr Ser Asp Gly Lys Leu Ser Leu

6260 6265 6270

caa gtt tat cca cca tta aac ata tta aga aca ccc att atc aac 32613

Gln Val Tyr Pro Pro Leu Asn Ile Leu Arg Thr Pro Ile Ile Asn

6275 6280 6285

aca tta gct tta gca tat aag tca ggt tta ggt gta agt gac aaa 32658

Thr Leu Ala Leu Ala Tyr Lys Ser Gly Leu Gly Val Ser Asp Lys

6290 6295 6300

tcc ctt gct gtt caa tta gct tct cca ctc act ttt gat gaa aaa 32703

Ser Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys

6305 6310 6315

gga aat ata gaa att aac ttg gga agt ggg cca tta aca gtt gca 32748

Gly Asn Ile Glu Ile Asn Leu Gly Ser Gly Pro Leu Thr Val Ala

6320 6325 6330

tca aat caa ctt agc att aac tgt caa aga ggt gta cat atc act 32793

Ser Asn Gln Leu Ser Ile Asn Cys Gln Arg Gly Val His Ile Thr

6335 6340 6345

act gca gga gat gcg ctt gaa aca aac ata agt tgg gct aaa ggc 32838

Thr Ala Gly Asp Ala Leu Glu Thr Asn Ile Ser Trp Ala Lys Gly

6350 6355 6360

ata aga ttt gaa agt gat gcc ata gct gca aac att ggc aaa gga 32883

Ile Arg Phe Glu Ser Asp Ala Ile Ala Ala Asn Ile Gly Lys Gly

6365 6370 6375

tta gaa ttc gga acc act agt aca gag tcc gat gtc aca gat gca 32928

Leu Glu Phe Gly Thr Thr Ser Thr Glu Ser Asp Val Thr Asp Ala

6380 6385 6390

ttt cca att caa gtt aaa ttg ggc act ggc ctt aca ttt gac agc 32973

Phe Pro Ile Gln Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser

6395 6400 6405

aca gga gct att gtt gct tgg aac aaa gag gat gac aag ctt aca 33018

Thr Gly Ala Ile Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr

6410 6415 6420

cta tgg acc aca gct gac cca tct cca aac tgt cac ata tat tct 33063

Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser

6425 6430 6435

gca aaa gat gct aag ctt act ctc tgt tta aca aag tgt ggc agt 33108

Ala Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser

6440 6445 6450

cag ata ctg ggc act gtt tcc ctc ata gct gtt aat act ggc agc 33153

Gln Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asn Thr Gly Ser

6455 6460 6465

tta aat cca ata acg gga aca gta cat act gct ctt gtt tca ctt 33198

Leu Asn Pro Ile Thr Gly Thr Val His Thr Ala Leu Val Ser Leu

6470 6475 6480

aaa ttt gat gca agt gga gtt ttg caa gcc agt tca aca ttg gac 33243

Lys Phe Asp Ala Ser Gly Val Leu Gln Ala Ser Ser Thr Leu Asp

6485 6490 6495

aaa gaa tat tgg aat ttc aga aaa gat gat gta aca cct gca gat 33288

Lys Glu Tyr Trp Asn Phe Arg Lys Asp Asp Val Thr Pro Ala Asp

6500 6505 6510

ccc tac act aat gct ata ggt ttt atg ccc aac ctt aat gca tat 33333

Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Asn Ala Tyr

6515 6520 6525

cca aaa aac aca agt gca gct gca aag agt cac att gtg ggg aaa 33378

Pro Lys Asn Thr Ser Ala Ala Ala Lys Ser His Ile Val Gly Lys

6530 6535 6540

gta tac ctc aat ggt gat gta gcc aaa cca ctg gac ttg ata att 33423

Val Tyr Leu Asn Gly Asp Val Ala Lys Pro Leu Asp Leu Ile Ile

6545 6550 6555

aca ttt aat gaa acc agc gat gaa acc tgt acc tat tgc att aac 33468

Thr Phe Asn Glu Thr Ser Asp Glu Thr Cys Thr Tyr Cys Ile Asn

6560 6565 6570

ttt cag tgg cag tgg gga gct gat gaa tat aaa aat gaa acg ctt 33513

Phe Gln Trp Gln Trp Gly Ala Asp Glu Tyr Lys Asn Glu Thr Leu

6575 6580 6585

gct gtc agt tca ttc acc ttc tcc tac att gct aaa gaa taaaccctgc 33562

Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu

6590 6595

atgccaaccc ctttcccaac tactatggaa aactctgaag cagaaaaaat aaaagttcaa 33622

gtgttttatt gattcaacag ttttcacaga attcgagtag ttattttccc tccaccctcc 33682

caactcatgg aatacaccac cctctcccca cgcacagcct taaacatctg aatgccattg 33742

gtaatggaca tggttttggt ctccacattc cacacagttt cagagcgagc cagtctcggg 33802

tcggtcaggg agatgaaacc ctccgggcac tcctgcatct gcacctcaaa gttcagtagc 33862

tgagggctgt cctcggtggt cgggatcaca gttatctgga agaagagcga tgagagtcat 33922

aatccgcgaa cgggatcggg cggttgtggc gcatcagacc ccgcagcagt cgctgtctgc 33982

gccgctccgt caagctgctg ctcaaagggt ccgggtccag ggactccctg cgcatgatgc 34042

cgatggccct gagcatcagt cgcctggtgc ggcgggcgca gcagcggatg cggatctcac 34102

tcaggtcgga gcagtacgtg cagcacagca ccaccaagtt gttcaacagt ccatagttca 34162

acgtgctcca gccaaaactc atctgtggaa ctatgctgcc cacatgtcca tcgtaccaga 34222

tcctgatgta aatcaggtgg cgccccctcc agaacacact gcccatgtac atgatctcct 34282

tgggcatatg caggttcacc acctcccggt accacatcac ccgctggttg aacatgcagc 34342

cctggataat tctgcggaac cagatggcca gcaccgcccc gcccgccatg cagcgcaggg 34402

accccgggtc ctgacagtgg cagtggagga cccaccgctc gcggccgtgg atcaactggg 34462

agctgaacag gtctatgttg gcacagcaca ggcacacgct catgcatgtc ttcagcactc 34522

tcagttcctc gggggtcagg accatgtccc agggcacggg gaactcttgc aggacagtga 34582

acccggcaga acagggcagc cctcgcacac aacttacatt gtgcatggac agggtatcgc 34642

aatcaggcag caccggatga tcctccacca gagaagcgcg ggtctcggtc tcctcacaac 34702

gaggtaaggg ggccggcggt tggtacggat gatggcggga tgacgctaat cgtgttctgg 34762

atcgtgtcat gatggagctt cttcctgaca tcttcgtatt tcatgtagca gaacctggtc 34822

cgggcactgc acaccgctcg ccggcgacgg tctcggcgct tcgagcgctc ggtgttgaag 34882

ttgtaaaaca gccactccct cagagcgtgc agtatctctt gagcctcttg ggtgatgaaa 34942

atcccatccg ccctgatggc tctgatcaca tcgaccacgg tggaatgggc cagacccagc 35002

cagatgatgc aattttgttg ggtttcggtg acggcggggg agggaagaac aggaagaacc 35062

atgattaact ttattccaaa cggtctcgga gcacttcaaa atgcaggtcg cggagatggc 35122

acctctcgcc cccactgtgt tgatggaaaa taacagccag gtcaaaggtg acacggttct 35182

cgagatgttc cacggtggct tccagcaaag cctccacgcg cacatccaga aacaagagga 35242

cagcgaaagc gggagcgttc tctaattcct caatcatcat attacactcc tgcaccatcc 35302

ccagataatt ttcatttttc cagccttgaa tgatttgaac tagttcctga ggtaaatcca 35362

agccagccat gataaaaagc tcgcgcagag cgccctccac cggcattctt aagcacaccc 35422

tcataattcc aagagattct gctcctggtt cacctgcagc agattaacaa ggggaatatc 35482

aaaatctctg ccgcgatctc taagctcctc cctcagcaat aactgcaagt actctttcat 35542

atcttctccg aaatttttag ccatagggcc gccaggaatg agagcagggc aagccacatt 35602

acagataaag cgaagtcctc cccagtgagc attgccaaat gtaagattga aataagcatg 35662

ctggctagac ccggtgatat cttccagata actggacaga aaatcaggca agcaattttt 35722

aagaaaatca acaaaagaaa agtcgtccag gtgcaagttt agagcctcag gaacaacgat 35782

ggaataagtg caaggagtgc gttccagcat ggttagtgtt tttttggtga tctgtagaac 35842

aaaaaataaa catgcaatat taaaccatgc tagcctggcg aacaggtggg taaatcactc 35902

tttccagcac caggcaggct acggggtctc cggcgcgacc ctcgtagaag ctgtcgccat 35962

gattgaaaag catcaccgaa agactttccc ggtggccggc atggatgatt cgcgaagacg 36022

cgtacactcc gggaacattg gcatccgtga gtgaaaaaaa tcgccccaag aagccccgag 36082

gcactacaat gctcaacctt aattccagca gagcgacccc atgcggatga agcacaaaat 36142

tggtaggtgc gtaaaaaatg taattactcc cctcctgcac aggcagcaaa gcccccgctc 36202

cctccagaaa cacatacaaa gcctcagcgt ccatagctta ccgagcacgg caggcgcaag 36262

attcagagaa aaggctgagc tctaacctga ctgcccgctc ctgagctcaa tatatagccc 36322

taacctacac tgacgtaaag gccaaagtct aaaaataccc gccaaaatga cacacacgcc 36382

cagcacacgc ccagaaaccg gtgacacact caaaaaaata cgtgcgcttc ctcaaacgcc 36442

caaaccggcg tcatttccgg gttcccacgc tacgtcaccg ctcagcgact ttcaaattcc 36502

gtcgaccgtt aaacacgtca ctcgccccgc ccctaacggt cgccctcctc tcggccaatc 36562

accttcctcc atccccaaat tcaaacggct catttgcata ttaacgcgca caaaaagttt 36622

gaggtatatt atttgatgat g 36643

<210> SEQ ID NO 58

<211> LENGTH: 184

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 58

Met Glu Ile Trp Thr Val Leu Glu Asp Phe His Lys Thr Arg Gln Leu

1 5 10 15

Leu Glu Asn Ala Ser Asp Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe

20 25 30

Gly Gly Asp Leu Ala Lys Leu Val Tyr Arg Ala Lys Gln Asp Tyr Arg

35 40 45

Glu Gln Phe Glu Val Ile Leu Arg Glu Cys Ser Gly Leu Phe Asp Ala

50 55 60

Leu Asn Leu Gly His Gln Ser His Phe Asn Gln Arg Ile Ser Arg Ala

65 70 75 80

Leu Asp Phe Thr Thr Pro Gly Arg Thr Thr Ala Ala Val Ala Phe Phe

85 90 95

Ala Phe Leu Leu Asp Lys Trp Ser Gln Glu Thr His Phe Ser Arg Asp

100 105 110

Tyr Gln Leu Asp Phe Leu Ala Val Ala Leu Trp Arg Thr Trp Lys Cys

115 120 125

Gln Arg Leu Asn Ala Ile Ser Gly Tyr Leu Pro Val Gln Pro Leu Asp

130 135 140

Thr Leu Arg Ile Leu Asn Leu Gln Glu Ser Pro Arg Ala Arg Gln Arg

145 150 155 160

Arg Gln Gln Gln Gln Gln Glu Asp Gln Glu Glu Asn Pro Arg Ala Gly

165 170 175

Leu Asp Pro Pro Ala Glu Glu Glu

180

<210> SEQ ID NO 59

<211> LENGTH: 142

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 59

Met Ser Gly Ser Gly Ser Phe Glu Gly Gly Val Phe Ser Pro Tyr Leu

1 5 10 15

Thr Gly Arg Leu Pro Ser Trp Ala Gly Val Arg Gln Asn Val Met Gly

20 25 30

Ser Thr Val Asp Gly Arg Pro Val Gln Pro Ala Asn Ser Ser Thr Leu

35 40 45

Thr Tyr Ala Thr Leu Ser Ser Ser Ser Val Asp Ala Ala Ala Ala Ala

50 55 60

Ala Ala Ala Ser Ala Ala Ser Ala Val Arg Gly Met Ala Met Gly Ala

65 70 75 80

Gly Tyr Tyr Gly Thr Leu Val Ala Asn Ser Ser Ser Thr Asn Asn Pro

85 90 95

Ala Ser Leu Asn Glu Glu Lys Leu Leu Leu Leu Met Ala Gln Leu Glu

100 105 110

Ala Leu Thr Gln Arg Leu Gly Glu Leu Thr Gln Gln Val Ala Gln Leu

115 120 125

Gln Glu Gln Thr Arg Ala Ala Val Ala Thr Val Lys Ser Lys

130 135 140

<210> SEQ ID NO 60

<211> LENGTH: 393

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 60

Met His Pro Val Leu Arg Gln Met Arg Pro His His Pro Pro Pro Gln

1 5 10 15

Gln Gln Pro Pro Pro Pro Gln Pro Ala Leu Leu Pro Pro Pro Gln Gln

20 25 30

Gln Gln Gln Leu Pro Ala Thr Thr Ala Ala Ala Ala Val Ser Gly Ala

35 40 45

Gly Gln Ser Tyr Asp His Gln Leu Ala Leu Glu Glu Gly Glu Gly Leu

50 55 60

Ala Arg Leu Gly Ala Ser Ser Pro Glu Arg His Pro Arg Val Gln Met

65 70 75 80

Lys Arg Asp Ala Arg Glu Ala Tyr Val Pro Lys Gln Asn Leu Phe Arg

85 90 95

Asp Arg Ser Gly Glu Glu Pro Glu Glu Met Arg Ala Ala Arg Phe His

100 105 110

Ala Gly Arg Glu Leu Arg Arg Gly Leu Asp Arg Lys Arg Val Leu Arg

115 120 125

Asp Glu Asp Phe Glu Ala Asp Glu Leu Thr Gly Ile Ser Pro Ala Arg

130 135 140

Ala His Val Ala Ala Ala Asn Leu Val Thr Ala Tyr Glu Gln Thr Val

145 150 155 160

Lys Glu Glu Ser Asn Phe Gln Lys Ser Phe Asn Asn His Val Arg Thr

165 170 175

Leu Ile Ala Arg Glu Glu Val Thr Leu Gly Leu Met His Leu Trp Asp

180 185 190

Leu Leu Glu Ala Ile Val Gln Asn Pro Thr Ser Lys Pro Leu Thr Ala

195 200 205

Gln Leu Phe Leu Val Val Gln His Ser Arg Asp Asn Glu Ala Phe Arg

210 215 220

Glu Ala Leu Leu Asn Ile Thr Glu Pro Glu Gly Arg Trp Leu Leu Asp

225 230 235 240

Leu Val Asn Ile Leu Gln Ser Ile Val Val Gln Glu Arg Gly Leu Pro

245 250 255

Leu Ser Glu Lys Leu Ala Ala Ile Asn Phe Ser Val Leu Ser Leu Gly

260 265 270

Lys Tyr Tyr Ala Arg Lys Ile Tyr Lys Thr Pro Tyr Val Pro Ile Asp

275 280 285

Lys Glu Val Lys Ile Asp Gly Phe Tyr Met Arg Met Thr Leu Lys Val

290 295 300

Leu Thr Leu Ser Asp Asp Leu Gly Val Tyr Arg Asn Asp Arg Met His

305 310 315 320

Arg Ala Val Ser Ala Ser Arg Arg Arg Glu Leu Ser Asp Gln Glu Leu

325 330 335

Met His Ser Leu Gln Arg Ala Leu Thr Gly Ala Gly Thr Glu Gly Glu

340 345 350

Ser Tyr Phe Asp Met Gly Ala Asp Leu His Trp Gln Pro Ser Arg Arg

355 360 365

Ala Leu Glu Ala Ala Gly Gly Pro Pro Tyr Val Glu Glu Val Asp Glu

370 375 380

Asp Glu Glu Gly Glu Tyr Leu Glu Asp

385 390

<210> SEQ ID NO 61

<211> LENGTH: 590

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 61

Met Gln Gln Gln Pro Pro Pro Asp Pro Ala Met Arg Ala Ala Leu Gln

1 5 10 15

Ser Gln Pro Ser Gly Ile Asn Ser Ser Asp Asp Trp Thr Gln Ala Met

20 25 30

Gln Arg Ile Met Ala Leu Thr Thr Arg Asn Pro Glu Ala Phe Arg Gln

35 40 45

Gln Pro Gln Ala Asn Arg Leu Ser Ala Ile Leu Glu Ala Val Val Pro

50 55 60

Ser Arg Tyr Asn Pro Thr His Glu Lys Val Leu Ala Ile Val Asn Ala

65 70 75 80

Leu Val Glu Asn Lys Ala Ile Arg Gly Asp Glu Ala Gly Leu Val Tyr

85 90 95

Asn Ala Leu Leu Glu Arg Val Ala Arg Tyr Asn Ser Thr Asn Val Gln

100 105 110

Thr Asn Leu Asp Arg Met Val Thr Asp Val Arg Glu Ala Val Ala Gln

115 120 125

Arg Glu Arg Phe His Arg Glu Ser Asn Leu Gly Ser Met Val Ala Leu

130 135 140

Asn Ala Phe Leu Ser Thr Gln Pro Ala Asn Val Pro Arg Gly Gln Glu

145 150 155 160

Asp Tyr Thr Asn Phe Ile Ser Ala Leu Arg Leu Met Val Thr Glu Val

165 170 175

Pro Gln Ser Glu Val Tyr Gln Ser Gly Pro Asp Tyr Phe Phe Gln Thr

180 185 190

Ser Arg Gln Gly Leu Gln Thr Val Asn Leu Ser Gln Ala Phe Lys Asn

195 200 205

Leu Gln Gly Leu Trp Gly Val Gln Ala Pro Val Gly Asp Arg Ala Thr

210 215 220

Val Ser Ser Leu Leu Thr Pro Asn Ser Arg Leu Leu Leu Leu Leu Val

225 230 235 240

Ala Pro Phe Thr Asp Ser Gly Ser Ile Asn Arg Asn Ser Tyr Leu Gly

245 250 255

Tyr Leu Ile Asn Leu Tyr Arg Glu Ala Ile Gly Gln Ala His Val Asp

260 265 270

Glu Gln Thr Tyr Gln Glu Ile Thr His Val Ser Arg Ala Leu Gly Gln

275 280 285

Asp Asp Pro Gly Asn Leu Glu Ala Thr Leu Asn Phe Leu Leu Thr Asn

290 295 300

Arg Ser Gln Lys Ile Pro Pro Gln Tyr Ala Leu Ser Ala Glu Glu Glu

305 310 315 320

Arg Ile Leu Arg Tyr Val Gln Gln Ser Val Gly Leu Phe Leu Met Gln

325 330 335

Glu Gly Ala Thr Pro Ser Ala Ala Leu Asp Met Thr Ala Arg Asn Met

340 345 350

Glu Pro Ser Met Tyr Ala Ser Asn Arg Pro Phe Ile Asn Lys Leu Met

355 360 365

Asp Tyr Leu His Arg Ala Ala Ala Met Asn Ser Asp Tyr Phe Thr Asn

370 375 380

Ala Ile Leu Asn Pro His Trp Leu Pro Pro Pro Gly Phe Tyr Thr Gly

385 390 395 400

Glu Tyr Asp Met Pro Asp Pro Asn Asp Gly Phe Leu Trp Asp Asp Val

405 410 415

Asp Ser Ser Val Phe Ser Pro Arg Pro Gly Ala Asn Glu Arg Pro Leu

420 425 430

Trp Lys Lys Glu Gly Ser Asp Arg Arg Pro Ser Ser Ala Leu Ser Gly

435 440 445

Arg Glu Gly Ala Ala Ala Ala Val Pro Glu Ala Ala Ser Pro Phe Pro

450 455 460

Ser Leu Pro Phe Ser Leu Asn Ser Ile Arg Ser Ser Glu Leu Gly Arg

465 470 475 480

Ile Thr Arg Pro Arg Leu Leu Gly Glu Glu Glu Tyr Leu Asn Asp Ser

485 490 495

Leu Leu Arg Pro Glu Arg Glu Lys Asn Phe Pro Asn Asn Gly Ile Glu

500 505 510

Ser Leu Val Asp Lys Met Ser Arg Trp Lys Thr Tyr Ala Gln Glu His

515 520 525

Arg Asp Asp Pro Ser Gln Gly Ala Thr Ser Arg Gly Ser Ala Ala Arg

530 535 540

Lys Arg Arg Trp His Asp Arg Gln Arg Gly Leu Met Trp Asp Asp Glu

545 550 555 560

Asp Ser Ala Asp Asp Ser Ser Val Leu Asp Leu Gly Gly Ser Gly Gly

565 570 575

Asn Pro Phe Ala His Leu Arg Pro Arg Ile Gly Arg Met Met

580 585 590

<210> SEQ ID NO 62

<211> LENGTH: 534

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 62

Met Met Arg Arg Ala Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser

1 5 10 15

Val Met Gln Gln Ala Met Ala Ala Ala Ala Met Gln Pro Pro Leu Glu

20 25 30

Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn

35 40 45

Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu

50 55 60

Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln

65 70 75 80

Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe

85 90 95

Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser

100 105 110

Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn

115 120 125

Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val

130 135 140

Ser Arg Lys Thr Pro Asn Gly Val Thr Val Gly Asp Asp Tyr Asp Gly

145 150 155 160

Ser Gln Asp Ile Leu Glu Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu

165 170 175

Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile

180 185 190

Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser

195 200 205

Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp

210 215 220

Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe

225 230 235 240

His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu

245 250 255

Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln

260 265 270

Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro

275 280 285

Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala

290 295 300

Ala Glu Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly

305 310 315 320

Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Ala Ala Glu Ala Ala Glu

325 330 335

Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp

340 345 350

Arg Ser Tyr Asn Val Leu Pro Asp Lys Ile Asn Thr Ala Tyr Arg Ser

355 360 365

Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser

370 375 380

Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val

385 390 395 400

Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser

405 410 415

Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro

420 425 430

Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln

435 440 445

Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu

450 455 460

Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser

465 470 475 480

Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser

485 490 495

Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg

500 505 510

Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg Val

515 520 525

Leu Ser Ser Arg Thr Phe

530

<210> SEQ ID NO 63

<211> LENGTH: 193

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 63

Met Ser Ile Leu Ile Ser Pro Ser Asn Asn Thr Gly Trp Gly Leu Arg

1 5 10 15

Ala Pro Ser Lys Met Tyr Gly Gly Ala Arg Gln Arg Ser Thr Gln His

20 25 30

Pro Val Arg Val Arg Gly His Phe Arg Ala Pro Trp Gly Ala Leu Lys

35 40 45

Gly Arg Val Arg Ser Arg Thr Thr Val Asp Asp Val Ile Asp Gln Val

50 55 60

Val Ala Asp Ala Arg Asn Tyr Thr Pro Ala Ala Ala Pro Val Ser Thr

65 70 75 80

Val Asp Ala Val Ile Asp Ser Val Val Ala Asp Ala Arg Arg Tyr Ala

85 90 95

Arg Ala Lys Ser Arg Arg Arg Arg Ile Ala Arg Arg His Arg Ser Thr

100 105 110

Pro Ala Met Arg Ala Ala Arg Ala Leu Leu Arg Arg Ala Arg Arg Thr

115 120 125

Gly Arg Arg Ala Met Leu Arg Ala Ala Arg Arg Ala Ala Ser Gly Ala

130 135 140

Ser Ala Gly Arg Thr Arg Arg Arg Ala Ala Thr Ala Ala Ala Ala Ala

145 150 155 160

Ile Ala Ser Met Ser Arg Pro Arg Arg Gly Asn Val Tyr Trp Val Arg

165 170 175

Asp Ala Ala Thr Gly Val Arg Val Pro Val Arg Thr Arg Pro Pro Arg

180 185 190

Thr

<210> SEQ ID NO 64

<211> LENGTH: 342

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 64

Met Ser Lys Arg Lys Phe Lys Glu Glu Met Leu Gln Val Ile Ala Pro

1 5 10 15

Glu Ile Tyr Gly Pro Ala Ala Ala Val Lys Glu Glu Arg Lys Pro Arg

20 25 30

Lys Ile Lys Arg Val Lys Lys Asp Lys Lys Glu Glu Asp Asp Val Asp

35 40 45

Gly Leu Val Glu Phe Val Arg Glu Phe Ala Pro Arg Arg Arg Val Gln

50 55 60

Trp Arg Gly Arg Lys Val Gln Pro Val Leu Arg Pro Gly Thr Thr Val

65 70 75 80

Val Phe Thr Pro Gly Glu Arg Ser Gly Thr Ala Ser Lys Arg Ser Tyr

85 90 95

Asp Glu Val Tyr Gly Asp Asp Asp Ile Leu Glu Gln Ala Ala Glu Arg

100 105 110

Leu Gly Glu Phe Ala Tyr Gly Lys Arg Ser Arg Ser Ala Pro Lys Glu

115 120 125

Glu Ala Val Ser Ile Pro Leu Asp His Gly Asn Pro Thr Pro Ser Leu

130 135 140

Lys Pro Val Thr Leu Gln Gln Val Leu Pro Thr Ala Ala Pro Arg Arg

145 150 155 160

Gly Phe Lys Arg Glu Gly Glu Asp Leu Tyr Pro Thr Met Gln Leu Met

165 170 175

Val Pro Lys Arg Gln Lys Leu Glu Asp Val Leu Glu Thr Met Lys Val

180 185 190

Asp Pro Asp Val Gln Pro Glu Val Lys Val Arg Pro Ile Lys Gln Val

195 200 205

Ala Pro Gly Leu Gly Val Gln Thr Val Asp Ile Lys Ile Pro Thr Glu

210 215 220

Pro Met Glu Thr Gln Thr Glu Pro Val Lys Pro Ser Thr Ser Thr Met

225 230 235 240

Glu Val Gln Thr Asp Pro Trp Met Pro Ser Ala Pro Ser Arg Arg Pro

245 250 255

Arg Arg Lys Tyr Gly Ala Ala Ser Leu Leu Met Pro Asn Tyr Ala Leu

260 265 270

His Pro Ser Ile Ile Pro Thr Pro Gly Tyr Arg Gly Thr Arg Phe Tyr

275 280 285

Arg Gly His Thr Thr Ser Arg Arg Arg Lys Thr Thr Thr Arg Arg Arg

290 295 300

Arg Arg Arg Thr Ala Ala Ala Thr Thr Pro Ala Ala Leu Val Arg Arg

305 310 315 320

Val Tyr Arg Arg Gly Arg Ala Pro Leu Thr Leu Pro Arg Ala Arg Tyr

325 330 335

His Pro Ser Ile Ala Ile

340

<210> SEQ ID NO 65

<211> LENGTH: 77

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 65

Met Ala Leu Thr Cys Arg Leu Arg Val Pro Ile Thr Gly Tyr Arg Gly

1 5 10 15

Arg Lys Pro Arg Arg Arg Arg Leu Ala Gly Asn Gly Met Arg Arg His

20 25 30

His His Arg Arg Arg Arg Ala Ile Ser Lys Arg Leu Gly Gly Gly Phe

35 40 45

Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile Gly Ala Ile Pro

50 55 60

Gly Ile Ala Ser Val Ala Val Gln Ala Ser Gln Arg His

65 70 75

<210> SEQ ID NO 66

<211> LENGTH: 258

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 66

Met Asp Ser Asp Ala Pro Gly Pro Val Met Cys Phe Arg Arg Gln Met

1 5 10 15

Glu Asp Ile Asn Phe Ser Ser Leu Ala Pro Arg His Gly Thr Arg Pro

20 25 30

Phe Met Gly Thr Trp Ser Asp Ile Gly Thr Ser Gln Leu Asn Gly Gly

35 40 45

Ala Phe Asn Trp Ser Ser Leu Trp Ser Gly Leu Lys Asn Phe Gly Ser

50 55 60

Thr Leu Lys Thr Tyr Gly Ser Lys Ala Trp Asn Ser Thr Thr Gly Gln

65 70 75 80

Ala Leu Arg Glu Lys Leu Lys Glu Gln Asn Phe Gln Gln Lys Val Val

85 90 95

Asp Gly Leu Ala Ser Gly Ile Asn Gly Val Val Asp Leu Ala Asn Gln

100 105 110

Ala Val Gln Arg Gln Ile Asn Ser Arg Leu Asp Pro Val Pro Pro Ala

115 120 125

Gly Ser Val Glu Met Pro Gln Val Glu Glu Glu Leu Pro Pro Leu Asp

130 135 140

Lys Arg Gly Glu Lys Arg Pro Arg Pro Asp Ala Glu Glu Thr Leu Leu

145 150 155 160

Thr His Thr Asp Glu Pro Pro Pro Tyr Glu Glu Ala Val Lys Leu Gly

165 170 175

Leu Pro Thr Thr Arg Pro Ile Ala Pro Leu Ala Thr Gly Val Leu Lys

180 185 190

Pro Glu Ser Ser Lys Pro Ala Thr Leu Asp Leu Pro Pro Pro Ala Ser

195 200 205

Arg Pro Ser Thr Val Ala Lys Pro Leu Pro Pro Val Ala Val Ala Arg

210 215 220

Ala Arg Pro Gly Gly Thr Ala Arg Pro His Ala Asn Trp Gln Ser Thr

225 230 235 240

Leu Asn Ser Ile Val Gly Leu Gly Val Gln Ser Val Lys Arg Arg Arg

245 250 255

Cys Tyr

<210> SEQ ID NO 67

<211> LENGTH: 940

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 67

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala

1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala

20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro

35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu

50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr

65 70 75 80

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met

85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser

100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly

115 120 125

Ala Pro Asn Thr Cys Gln Trp Lys Asp Ser Asp Asn Lys Met His Thr

130 135 140

Phe Gly Val Ala Ala Met Pro Gly Val Thr Gly Lys Lys Ile Glu Ala

145 150 155 160

Asp Gly Leu Pro Ile Gly Ile Asp Ser Thr Ser Gly Thr Asp Thr Ile

165 170 175

Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val Gly Asn Asp

180 185 190

Ser Trp Val Asp Thr Asn Gly Ala Glu Glu Lys Tyr Gly Gly Arg Ala

195 200 205

Leu Lys Asp Thr Thr Asn Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys

210 215 220

Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Leu Lys Asp Ser Glu Thr

225 230 235 240

Ala Thr Thr Thr Pro Asn Tyr Asp Ile Asp Leu Ala Phe Phe Asp Ser

245 250 255

Lys Asn Ile Ala Ala Asn Tyr Asp Pro Asp Ile Val Met Tyr Thr Glu

260 265 270

Asn Val Asp Leu Gln Thr Pro Asp Thr His Ile Val Tyr Lys Pro Gly

275 280 285

Thr Glu Asp Thr Ser Ser Glu Ser Asn Leu Gly Gln Gln Ala Met Pro

290 295 300

Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met

305 310 315 320

Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser

325 330 335

Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser

340 345 350

Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser

355 360 365

Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile

370 375 380

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu

385 390 395 400

Asn Gly Val Gly Phe Thr Asp Thr Tyr Gln Gly Val Lys Val Lys Thr

405 410 415

Asp Ala Asn Ala Gly Asn Asn Gly Thr Gln Trp Asp Lys Asp Asp Thr

420 425 430

Thr Val Ser Thr Ala Asn Glu Ile His Gly Gly Asn Pro Phe Ala Met

435 440 445

Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn

450 455 460

Val Ala Leu Tyr Leu Pro Asp Lys Tyr Lys Tyr Thr Pro Ala Asn Ile

465 470 475 480

Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val

485 490 495

Val Ala Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp

500 505 510

Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn

515 520 525

Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val

530 535 540

Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu

545 550 555 560

Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp

565 570 575

Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp

580 585 590

Gly Ala Ser Ile Ser Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe Phe

595 600 605

Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn

610 615 620

Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met

625 630 635 640

Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro

645 650 655

Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys

660 665 670

Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val

675 680 685

Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His

690 695 700

Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro

705 710 715 720

Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr

725 730 735

Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp

740 745 750

Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly

755 760 765

Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg

770 775 780

Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys

785 790 795 800

Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe

805 810 815

Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala

820 825 830

Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Thr Ser Val Thr

835 840 845

Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser

850 855 860

Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met

865 870 875 880

Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu Val Asp

885 890 895

Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp

900 905 910

Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr

915 920 925

Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr

930 935 940

<210> SEQ ID NO 68

<211> LENGTH: 209

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 68

Met Ala Cys Ala Thr Gly Ser Gly Glu Gln Glu Leu Arg Ala Ile Ile

1 5 10 15

Arg Asp Leu Gly Cys Gly Pro Tyr Phe Leu Gly Thr Phe Asp Lys Arg

20 25 30

Phe Pro Gly Phe Met Ala Pro His Lys Leu Ala Cys Ala Ile Val Asn

35 40 45

Thr Ala Gly Arg Glu Thr Gly Gly Glu His Trp Leu Ala Phe Ala Trp

50 55 60

Asn Pro Arg Ser Asn Thr Cys Tyr Leu Phe Asp Pro Phe Gly Phe Ser

65 70 75 80

Asp Glu Arg Leu Lys Gln Ile Tyr Gln Phe Glu Tyr Glu Gly Leu Leu

85 90 95

Arg Arg Ser Ala Leu Ala Thr Glu Asp Arg Cys Val Thr Leu Glu Lys

100 105 110

Ser Thr Gln Thr Val Gln Gly Pro Arg Ser Ala Ala Cys Gly Leu Phe

115 120 125

Cys Cys Met Phe Leu His Ala Phe Val His Trp Pro Asp Arg Pro Met

130 135 140

Asp Lys Asn Pro Thr Met Asn Leu Leu Thr Gly Val Pro Asn Gly Met

145 150 155 160

Leu Gln Ser Pro Gln Val Glu Pro Thr Leu Arg Arg Asn Gln Glu Ala

165 170 175

Leu Tyr Arg Phe Leu Asn Ser His Ser Ala Tyr Phe Arg Ser His Arg

180 185 190

Ala Arg Ile Glu Lys Ala Thr Ala Phe Asp Arg Met Asn Asn Gln Asp

195 200 205

Met

<210> SEQ ID NO 69

<211> LENGTH: 807

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 69

Met Glu Thr Gln Pro Ser Pro Thr Ser Pro Ser Ala Pro Thr Thr Ala

1 5 10 15

Asp Glu Lys Gln Gln Gln Gln Asn Glu Ser Leu Thr Ala Pro Pro Pro

20 25 30

Ser Pro Ala Thr Ser Asp Ala Ala Val Pro Asp Met Gln Glu Met Glu

35 40 45

Glu Ser Ile Glu Ile Asp Leu Gly Tyr Val Thr Pro Ala Glu His Glu

50 55 60

Glu Glu Leu Ala Val Arg Phe Gln Ser Ser Ser Gln Glu Asp Lys Glu

65 70 75 80

Gln Pro Glu Gln Glu Ala Glu Asn Glu Gln Ser Gln Ala Gly Leu Glu

85 90 95

His Gly Asp Tyr Leu His Leu Ser Gly Gly Glu Asp Ala Leu Ile Lys

100 105 110

His Leu Ala Arg Gln Ala Ile Ile Val Lys Asp Ala Leu Leu Asp Arg

115 120 125

Thr Glu Val Pro Leu Ser Val Glu Glu Leu Ser Arg Ala Tyr Glu Leu

130 135 140

Asn Leu Phe Ser Pro Arg Val Pro Pro Lys Arg Gln Pro Asn Gly Thr

145 150 155 160

Cys Glu Pro Asn Pro Arg Leu Asn Phe Tyr Pro Val Phe Ala Val Pro

165 170 175

Glu Ala Leu Ala Thr Tyr His Ile Phe Phe Lys Asn Gln Lys Ile Pro

180 185 190

Val Ser Cys Arg Ala Asn Arg Thr Arg Ala Asp Ala Leu Leu Asn Leu

195 200 205

Gly Pro Gly Ala Arg Leu Pro Asp Ile Ala Ser Leu Glu Glu Val Pro

210 215 220

Lys Ile Phe Glu Gly Leu Gly Ser Asp Glu Thr Arg Ala Ala Asn Ala

225 230 235 240

Leu Gln Gly Glu Gly Gly Glu His Glu His His Ser Ala Leu Val Glu

245 250 255

Leu Glu Gly Asp Asn Ala Arg Leu Ala Val Leu Lys Arg Thr Val Glu

260 265 270

Leu Thr His Phe Ala Tyr Pro Ala Leu Asn Leu Pro Pro Lys Val Met

275 280 285

Ser Ala Val Met Asp Gln Val Leu Ile Lys Arg Ala Ser Pro Ile Ser

290 295 300

Glu Asp Glu Met Gln Asp Pro Glu Ser Ser Glu Glu Gly Lys Pro Val

305 310 315 320

Val Ser Asp Glu Gln Leu Ala Arg Trp Leu Gly Pro Asn Ala Thr Pro

325 330 335

Gln Ser Leu Glu Glu Arg Arg Lys Leu Met Met Ala Val Val Leu Val

340 345 350

Thr Val Glu Leu Glu Cys Leu Arg Arg Phe Phe Ala Asp Ala Glu Thr

355 360 365

Leu Arg Lys Val Glu Glu Asn Leu His Tyr Ile Phe Arg His Gly Phe

370 375 380

Val Arg Gln Ala Cys Lys Ile Ser Asn Val Glu Leu Thr Asn Leu Val

385 390 395 400

Ser Tyr Met Gly Ile Leu His Glu Asn Arg Leu Gly Gln Asn Val Leu

405 410 415

His Thr Thr Leu Arg Gly Glu Ala Arg Arg Asp Tyr Ile Arg Asp Cys

420 425 430

Val Tyr Leu Tyr Leu Cys His Thr Trp Gln Thr Gly Met Gly Val Trp

435 440 445

Gln Gln Cys Leu Glu Glu Gln Asn Leu Lys Glu Leu Cys Lys Leu Leu

450 455 460

Gln Lys Asn Leu Lys Gly Leu Trp Thr Gly Phe Asp Glu Arg Thr Thr

465 470 475 480

Ala Ser Asp Leu Ala Asp Leu Ile Phe Pro Glu Arg Leu Arg Leu Thr

485 490 495

Leu Arg Asn Gly Leu Pro Asp Phe Met Ser Gln Ser Met Leu Gln Asn

500 505 510

Phe Arg Ser Phe Ile Leu Glu Arg Ser Gly Ile Leu Pro Ala Thr Cys

515 520 525

Ser Ala Leu Pro Ser Asp Phe Val Pro Leu Thr Phe Arg Glu Cys Pro

530 535 540

Pro Pro Leu Trp Ser His Cys Tyr Leu Leu Arg Leu Ala Asn Tyr Leu

545 550 555 560

Ala Tyr His Ser Asp Val Ile Glu Asp Val Ser Gly Glu Gly Leu Leu

565 570 575

Glu Cys His Cys Arg Cys Asn Leu Cys Thr Pro His Arg Ser Leu Ala

580 585 590

Cys Asn Pro Gln Leu Leu Ser Glu Thr Gln Ile Ile Gly Thr Phe Glu

595 600 605

Leu Gln Gly Pro Ser Asp Glu Gly Glu Gly Ser Ala Lys Gly Gly Leu

610 615 620

Lys Leu Thr Pro Gly Leu Trp Thr Ser Ala Tyr Leu Arg Lys Phe Val

625 630 635 640

Pro Glu Asp Tyr His Pro Phe Glu Ile Arg Phe Tyr Glu Asp Gln Ser

645 650 655

Gln Pro Pro Lys Ala Glu Leu Ser Ala Cys Val Ile Thr Gln Gly Ala

660 665 670

Ile Leu Ala Gln Leu Gln Ala Ile Gln Lys Ser Arg Gln Glu Phe Leu

675 680 685

Leu Lys Lys Gly Arg Gly Val Tyr Leu Asp Pro Gln Thr Gly Glu Glu

690 695 700

Leu Asn Pro Gly Phe Pro Gln Asp Ala Pro Arg Lys Gln Glu Ala Glu

705 710 715 720

Ser Gly Ala Ala Ala Arg Gly Gly Phe Gly Gly Arg Leu Gly Glu Gln

725 730 735

Gln Ser Gly Arg Gly Gly Gly Asp Gly Gly Arg Leu Gly Gln His Ser

740 745 750

Gly Arg Gly Gly Gln Pro Ala Arg Gln Ser Gly Gly Arg Arg Gly Gly

755 760 765

Gly Arg Gly Gly Gly Gly Arg Ser Ser Arg Arg Gln Thr Val Val Leu

770 775 780

Gly Gly Gly Glu Ser Lys Gln His Gly Tyr His Leu Arg Ser Gly Ser

785 790 795 800

Gly Ser Arg Ser Ala Pro Gln

805

<210> SEQ ID NO 70

<211> LENGTH: 227

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 70

Met Ser Lys Glu Ile Pro Thr Pro Tyr Met Trp Ser Tyr Gln Pro Gln

1 5 10 15

Met Gly Leu Ala Ala Gly Ala Ala Gln Asp Tyr Ser Thr Arg Met Asn

20 25 30

Trp Leu Ser Ala Gly Pro Ala Met Ile Ser Arg Val Asn Asp Ile Arg

35 40 45

Ala His Arg Asn Gln Ile Leu Leu Glu Gln Ser Ala Leu Thr Ala Thr

50 55 60

Pro Arg Asn His Leu Asn Pro Arg Asn Trp Pro Ala Ala Leu Val Tyr

65 70 75 80

Gln Glu Ile Pro Gln Pro Thr Thr Val Leu Leu Pro Arg Asp Ala Gln

85 90 95

Ala Glu Val Gln Leu Thr Asn Ser Gly Val Gln Leu Ala Gly Gly Ala

100 105 110

Thr Leu Cys Arg His Arg Pro Ala Gln Gly Ile Lys Arg Leu Val Ile

115 120 125

Arg Gly Arg Gly Thr Gln Leu Asn Asp Glu Val Val Ser Ser Ser Leu

130 135 140

Gly Leu Arg Pro Asp Gly Val Phe Gln Leu Ala Gly Ser Gly Arg Ser

145 150 155 160

Ser Phe Thr Pro Arg Gln Ala Val Leu Thr Leu Glu Ser Ser Ser Ser

165 170 175

Gln Pro Arg Ser Gly Gly Ile Gly Thr Leu Gln Phe Val Glu Glu Phe

180 185 190

Thr Pro Ser Val Tyr Phe Asn Pro Phe Ser Gly Ser Pro Gly His Tyr

195 200 205

Pro Asp Glu Phe Ile Pro Asn Phe Asp Ala Ile Ser Glu Ser Val Asp

210 215 220

Gly Tyr Asp

225

<210> SEQ ID NO 71

<211> LENGTH: 106

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 71

Met Ser His Gly Gly Ala Ala Asp Leu Ala Arg Leu Arg His Leu Asp

1 5 10 15

His Cys Arg Arg Phe Arg Cys Phe Ala Arg Asp Leu Ala Glu Phe Thr

20 25 30

Tyr Phe Glu Leu Pro Glu Glu His Pro Gln Gly Pro Ala His Gly Val

35 40 45

Arg Ile Val Val Glu Gly Gly Leu Asp Ser His Leu Leu Arg Ile Phe

50 55 60

Ser Gln Arg Pro Ile Leu Val Glu Arg Gln Gln Gly Asn Thr Leu Leu

65 70 75 80

Thr Leu Tyr Cys Ile Cys Asp His Pro Gly Leu His Glu Ser Leu Cys

85 90 95

Cys Leu Leu Cys Thr Glu Tyr Asn Lys Ser

100 105

<210> SEQ ID NO 72

<211> LENGTH: 176

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 72

Met Gly Lys Ile Thr Leu Val Cys Gly Val Leu Val Thr Val Val Leu

1 5 10 15

Ser Ile Leu Gly Gly Gly Ser Ala Ala Val Val Thr Glu Lys Lys Ala

20 25 30

Asp Pro Cys Leu Thr Phe Asn Pro Asp Lys Cys Arg Leu Ser Phe Gln

35 40 45

Pro Asp Gly Asn Arg Cys Ala Val Leu Ile Lys Cys Gly Trp Glu Cys

50 55 60

Glu Ser Val Ala Ile Gln Tyr Lys Asn Lys Thr Arg Asn Asn Thr Leu

65 70 75 80

Ala Ser Thr Trp Gln Pro Gly Asp Pro Glu Trp Tyr Thr Val Ser Val

85 90 95

Pro Gly Ala Asp Gly Ser Leu Arg Thr Val Asn Asn Thr Phe Ile Phe

100 105 110

Glu His Met Cys Glu Thr Ala Met Phe Met Ser Lys Gln Tyr Gly Met

115 120 125

Trp Pro Pro Arg Lys Glu Asn Ile Val Val Phe Ser Ile Ala Tyr Ser

130 135 140

Ala Cys Thr Val Leu Ile Thr Ala Ile Val Cys Leu Ser Ile His Met

145 150 155 160

Leu Ile Ala Ile Arg Pro Arg Asn Asn Ala Glu Lys Glu Lys Gln Pro

165 170 175

<210> SEQ ID NO 73

<211> LENGTH: 243

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 73

Met Ala Ser Val Lys Val Leu Ile Ile Val Ser Leu Ile Ala Tyr Ala

1 5 10 15

Ser Gly Tyr Ala Asn Ile Lys Lys Thr Leu Tyr Val Gly Ser Asp Ser

20 25 30

Ile Leu Glu Gly Thr Gln Ser Gly Ala Lys Val Ser Trp Tyr Phe Tyr

35 40 45

Arg Thr Thr Thr Asp Pro Val Lys Leu Cys Lys Gly Glu Leu Pro Arg

50 55 60

Thr His Lys Thr Pro Leu Thr Phe Ser Cys Ser Asn Asn Asn Leu Thr

65 70 75 80

Leu Phe Ser Val Thr Lys Gln Tyr Ala Gly Thr Tyr Tyr Ser Thr Asn

85 90 95

Phe His Thr Gly Gln Asp Lys Tyr Tyr Thr Ile Thr Val Ala Asp Pro

100 105 110

Thr Thr Pro Arg Thr Thr Thr Lys Pro Thr Thr Thr Lys Lys Pro Thr

115 120 125

Thr Pro Lys Lys Pro Thr Thr Pro Lys Thr Thr Lys Thr Thr Thr Lys

130 135 140

Thr Thr Thr Thr Glu Pro Thr Thr Thr Ser Thr Thr Leu Ala Ile Thr

145 150 155 160

Thr His Thr Glu Leu Thr Ser Gln Ala Thr Thr Glu Asn Gly Phe Ala

165 170 175

Leu Leu Gln Lys Gly Asp Asn Ser Ser Ser Ser Pro Leu Pro Thr Thr

180 185 190

Pro Ser Glu Glu Ile Pro Lys Ser Met Val Gly Ile Ile Ala Ala Val

195 200 205

Val Val Cys Met Leu Ile Ile Ile Leu Cys Met Met Tyr Tyr Ala Cys

210 215 220

Tyr Tyr Arg Lys His Arg Leu Asn Asn Lys Leu Asp Pro Leu Leu Asn

225 230 235 240

Val Asp Phe

<210> SEQ ID NO 74

<211> LENGTH: 207

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 74

Met Lys Ile Leu Ser Leu Phe Val Phe Ser Ile Ile Ile Thr Ser Ala

1 5 10 15

Leu Cys Asn Ser Asn Asp Asp Val Thr Val Val Val Gly Ser Asn Tyr

20 25 30

Thr Leu Lys Gly Pro Pro Lys Gly Met Leu Ser Trp Tyr Cys Trp Phe

35 40 45

Gly Asn Asp Glu Gln Gln Thr Glu Leu Cys Asn Leu Lys Asn Gly Lys

50 55 60

Val Ser Asn Pro Lys Ile Ser Asn Tyr Ile Cys Asn Gly Thr Asp Leu

65 70 75 80

Ile Leu Leu Asn Ile Thr Lys Ala Tyr Ala Gly Ser Tyr Thr Cys Pro

85 90 95

Gly Asp Asn Asp Asp Met Ile Phe Lys Lys Leu Ile Val Val Asp Pro

100 105 110

Thr Thr Pro Ala Pro Pro Thr Thr Thr Thr Lys Ala His Thr Thr Asp

115 120 125

Thr Gln Glu Thr Thr Pro Glu Ala Glu Val Ala Glu Leu Ala Lys Gln

130 135 140

Ile His Glu Asp Ser Phe Val Ala Asn Thr Pro Thr His Pro Gly Pro

145 150 155 160

Gln Cys Pro Gly Pro Leu Val Ser Gly Ile Val Gly Val Leu Cys Gly

165 170 175

Leu Ala Val Ile Ile Ile Cys Met Phe Ile Phe Ala Cys Cys Tyr Arg

180 185 190

Arg Leu His Arg Gln Lys Ser Asp Pro Leu Leu Asn Leu Tyr Val

195 200 205

<210> SEQ ID NO 75

<211> LENGTH: 291

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 75

Met Lys Ala Leu Ser Thr Leu Val Phe Leu Ser Leu Ile Gly Ile Val

1 5 10 15

Phe Ser Ala Gly Phe Leu Lys Asn Leu Thr Ile Ile Glu Gly Glu Asn

20 25 30

Ala Thr Leu Val Gly Ile Ser Gly Gln Asn Val Ser Trp Leu Lys Tyr

35 40 45

His Leu Asp Gly Trp Lys Pro Ile Cys Thr Trp Asn Val Ser Val Tyr

50 55 60

Thr Cys His Gly Val Asn Leu Thr Ile Thr Asn Ala Thr Gln Asp Gln

65 70 75 80

Asn Gly Arg Phe Lys Gly Gln Ser Phe Thr Ser Asn Asn Gly Tyr Glu

85 90 95

Thr His Asn Met Phe Ile Tyr Asp Val Thr Val Ile Ser Asn Lys Thr

100 105 110

Thr Pro Thr Thr Gln Thr Pro Thr Thr His Ser Ser Thr His Ala Met

115 120 125

Gln Thr Thr Gln Thr Thr Thr Tyr Thr Thr Ser Thr Glu Pro Thr Thr

130 135 140

Thr Thr Ala Glu Val Thr Ser Thr Ala Pro Gln Pro Gln Ala Leu Ala

145 150 155 160

Leu Arg Ala Ala Gln Ser Ser Ser Met Thr Ala Lys Thr Asn Glu Gln

165 170 175

Thr Thr Glu Phe Leu Ser Thr Thr Gln Ser Arg Thr Thr Ala Thr Ser

180 185 190

Ser Ala Phe Ser Ser Thr Ala Asn Leu Thr Ser Leu Ser Ser Met Pro

195 200 205

Ile Ser Asn Ala Thr Thr Leu Pro Ala Pro Leu Pro Thr Pro Leu Lys

210 215 220

Gln Ser Glu Ser Ser Thr Gln Leu Gln Ile Thr Leu Leu Ile Val Ile

225 230 235 240

Gly Val Val Ile Leu Ala Val Leu Leu Tyr Phe Ile Phe Cys Arg Arg

245 250 255

Ile Pro Asn Ala Lys Pro Ala Tyr Lys Pro Ile Val Ile Gly Thr Pro

260 265 270

Glu Pro Leu Gln Val Glu Gly Gly Leu Arg Asn Leu Leu Phe Ser Phe

275 280 285

Thr Val Trp

290

<210> SEQ ID NO 76

<211> LENGTH: 91

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 76

Met Ile Pro Arg His Phe Ile Ile Thr Ser Leu Ile Cys Val Leu Gln

1 5 10 15

Val Cys Ala Thr Leu Ala Leu Val Ala Asn Ala Ser Pro Asp Cys Ile

20 25 30

Gly Ala Phe Ala Ser Tyr Val Leu Phe Ala Phe Ile Thr Cys Ile Cys

35 40 45

Cys Cys Ser Ile Val Cys Leu Leu Ile Thr Phe Phe Gln Phe Ile Asp

50 55 60

Trp Val Phe Val Arg Ile Ala Tyr Leu Arg His His Pro Gln Tyr Arg

65 70 75 80

Asp Gln Arg Val Ala Gln Leu Leu Arg Leu Ile

85 90

<210> SEQ ID NO 77

<211> LENGTH: 146

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 77

Met Ile Ser Met Arg Ala Leu Leu Leu Leu Ala Leu Ala Leu Ala Pro

1 5 10 15

Leu Ala Ala Pro Phe Ser Leu Lys Ser Pro Thr Gln Ser Pro Glu Glu

20 25 30

Val Arg Lys Cys Lys Phe Gln Glu Pro Trp Lys Phe Leu Lys Cys Tyr

35 40 45

Gln Leu Lys Ser Glu Met His Pro Ser Trp Ile Met Ile Ile Gly Ile

50 55 60

Val Asn Ile Leu Ala Cys Thr Leu Ile Ser Phe Val Ile Tyr Pro Cys

65 70 75 80

Phe Asn Phe Gly Trp Asn Ala Pro Glu Ala Leu Trp Leu Pro Pro Asp

85 90 95

Pro Asp Thr Pro Pro Gln Gln Gln Gln Asn Gln Ala His Ala Pro Pro

100 105 110

Gln Pro Arg Pro Gln Tyr Met Pro Ile Leu Asp Tyr Glu Ala Glu Pro

115 120 125

Gln Arg Ala Met Leu Pro Ala Ile Ser Tyr Phe Asn Leu Thr Gly Gly

130 135 140

Asp Asp

145

<210> SEQ ID NO 78

<211> LENGTH: 442

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 78

Met Ser Lys Lys Arg Ala Arg Val Asp Asp Gly Phe Asp Pro Val Tyr

1 5 10 15

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro

20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser

35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Ala Val Thr Leu

50 55 60

Lys Leu Gly Asp Gly Val Asp Leu Asp Asp Ser Gly Lys Leu Ile Ser

65 70 75 80

Lys Asn Ala Thr Lys Ala Thr Ala Pro Leu Ser Ile Ser Asn Gly Thr

85 90 95

Ile Ser Leu Asn Met Asp Thr Pro Phe Tyr Thr Ser Asp Gly Lys Leu

100 105 110

Ser Leu Gln Val Tyr Pro Pro Leu Asn Ile Leu Arg Thr Pro Ile Ile

115 120 125

Asn Thr Leu Ala Leu Ala Tyr Lys Ser Gly Leu Gly Val Ser Asp Lys

130 135 140

Ser Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly

145 150 155 160

Asn Ile Glu Ile Asn Leu Gly Ser Gly Pro Leu Thr Val Ala Ser Asn

165 170 175

Gln Leu Ser Ile Asn Cys Gln Arg Gly Val His Ile Thr Thr Ala Gly

180 185 190

Asp Ala Leu Glu Thr Asn Ile Ser Trp Ala Lys Gly Ile Arg Phe Glu

195 200 205

Ser Asp Ala Ile Ala Ala Asn Ile Gly Lys Gly Leu Glu Phe Gly Thr

210 215 220

Thr Ser Thr Glu Ser Asp Val Thr Asp Ala Phe Pro Ile Gln Val Lys

225 230 235 240

Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile Val Ala Trp

245 250 255

Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro Ser

260 265 270

Pro Asn Cys His Ile Tyr Ser Ala Lys Asp Ala Lys Leu Thr Leu Cys

275 280 285

Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser Leu Ile Ala

290 295 300

Val Asn Thr Gly Ser Leu Asn Pro Ile Thr Gly Thr Val His Thr Ala

305 310 315 320

Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Gln Ala Ser Ser

325 330 335

Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Asp Asp Val Thr Pro

340 345 350

Ala Asp Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Asn Ala

355 360 365

Tyr Pro Lys Asn Thr Ser Ala Ala Ala Lys Ser His Ile Val Gly Lys

370 375 380

Val Tyr Leu Asn Gly Asp Val Ala Lys Pro Leu Asp Leu Ile Ile Thr

385 390 395 400

Phe Asn Glu Thr Ser Asp Glu Thr Cys Thr Tyr Cys Ile Asn Phe Gln

405 410 415

Trp Gln Trp Gly Ala Asp Glu Tyr Lys Asn Glu Thr Leu Ala Val Ser

420 425 430

Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu

435 440

<210> SEQ ID NO 79

<211> LENGTH: 30070

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1295

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (7)..(1494)

<223> OTHER INFORMATION: E1b\55K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23633)..(24196)

<223> OTHER INFORMATION: 22K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (25511)..(26143)

<223> OTHER INFORMATION: E3\CR1-alpha

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29668)..(30069)

<223> OTHER INFORMATION: E3\14.7K

<400> SEQUENCE: 79

tgacaa atg gag tca aga aac cca ttt cag cag gga tta cca gct gga 48

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly

1 5 10

ttt ctt agc agt agc ttt gtg gag aac atg gaa gtg cca gcg cct gaa 96

Phe Leu Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu

15 20 25 30

tgc aat ctc agg cta ctt gcc ggt aca gcc gct aga cac tct gag gat 144

Cys Asn Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp

35 40 45

cct gaa tct cca gga gag tcc cag ggc acg cca acg tcg cca gca gca 192

Pro Glu Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala

50 55 60

gca gca gga gga tca aga aga gaa ccc gag agc cgg cct gga ccc tcc 240

Ala Ala Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser

65 70 75

ggc gga gga gga gta gct gac ctg ttt cct gaa ctg cgc cgg gtg ctg 288

Gly Gly Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu

80 85 90

act agg tct tcg agt ggt cgg gag agg ggg att aag cgg gag agg cat 336

Thr Arg Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His

95 100 105 110

gat gag act aat cac aga act gaa ctg act gtg ggt ctg atg agc cgt 384

Asp Glu Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg

115 120 125

aag agg cca gaa aca gtg tgg tgg cat gag gtg cag tcg act ggc aca 432

Lys Arg Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr

130 135 140

gat gag gtg tca gtc atg cat gag aag ttt tcc cta gaa caa gtc aag 480

Asp Glu Val Ser Val Met His Glu Lys Phe Ser Leu Glu Gln Val Lys

145 150 155

act tgt tgg ttg gag cct gag gat gat tgg gag gta gcc atc agg aat 528

Thr Cys Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn

160 165 170

tat gcc aag ctg gct ctc agg cca gac aag aag tac aag att act aag 576

Tyr Ala Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys

175 180 185 190

ctg ata aat atc aga aat gcc tgc tac atc tca ggg aat ggg gct gaa 624

Leu Ile Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu

195 200 205

gtg gag atc tgt ctt cag gat aga gtg gct ttc aga tgc tgt atg atg 672

Val Glu Ile Cys Leu Gln Asp Arg Val Ala Phe Arg Cys Cys Met Met

210 215 220

aat atg tac ccg gga gtg gtg ggc atg gat ggg gtc acc ttt atg aac 720

Asn Met Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn

225 230 235

atg agg ttc agg gga gat ggg tat aat ggc acg gtc ttt atg gcc aat 768

Met Arg Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn

240 245 250

acg aag ctg aca gtc cat ggc tgc tcc ttc ttt ggg ttt aat aac acc 816

Thr Lys Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr

255 260 265 270

tgc att gag gcc tgg ggt cag gtc ggt gtg agg ggc tgc agt ttt tca 864

Cys Ile Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser

275 280 285

gcc aac tgg atg ggg gtc gtg ggc agg acc aaa agt gtg ctg tcc gtg 912

Ala Asn Trp Met Gly Val Val Gly Arg Thr Lys Ser Val Leu Ser Val

290 295 300

aag aaa tgc ttg ttt gag agg tgc cac ctg ggg gtg atg agc gag ggc 960

Lys Lys Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly

305 310 315

gaa gcc aga atc cgc cac tgc gcc tct acc gag acg ggc tgc ttc gtg 1008

Glu Ala Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val

320 325 330

ctg tgc aag ggc aat gct aag atc aag cat aat atg att tgt gga gcc 1056

Leu Cys Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala

335 340 345 350

tcg gac gag cgc ggc tac cag atg ctg acc tgc gcc agt ggg aac agc 1104

Ser Asp Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Ser Gly Asn Ser

355 360 365

cat atg ctg gcc acc gtg cat gtg gct tcc cat tcc cgc aag ccc tgg 1152

His Met Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro Trp

370 375 380

cct gag ttc gag cac aat gtc atg acc agg tgc aat atg cat ctg ggg 1200

Pro Glu Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly

385 390 395

tcc cgc cga ggc atg ttc atg ccc tac cag tgc aac ctg aat tat gtg 1248

Ser Arg Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val

400 405 410

aag gtg ctg ctg gag ccc gat gcc atg tcc aga gtg agc ctg acg ggg 1296

Lys Val Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly

415 420 425 430

gtg ttt gac atg aat gtg gag gtg tgg aag att ctg aga tat gat gaa 1344

Val Phe Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu

435 440 445

tcc aag acc agg tgc cga gcc tgc gag tgc gga ggg aag cat gcc agg 1392

Ser Lys Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg

450 455 460

ttc cag ccc gtg tgt gtg gag gtg acg gag gac ctg cga ccc gat cat 1440

Phe Gln Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro Asp His

465 470 475

ttg gtg ttg tcc tgc acc ggg acg gag ttc ggt tcc agc ggg gaa gaa 1488

Leu Val Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu

480 485 490

tct gac tagagtgagt agtgttctgg ggcggggagg acctgcatga gggccagaat 1544

Ser Asp

495

gactgaaatc tgtgcttttc tgtgtgttgc agcagcatga gcggaagcgg ctcctttgag 1604

ggaggggtat tcagccctta tctgacgggg cgtctcccct catgggcggg agtgcgtcag 1664

aatgtgatgg gatccacggt ggacggccgg cccgtgcagc ccgcgaactc ttcaaccctg 1724

acctatgcaa ccctgagctc ttcgtcggtg gacgcagctg ccgccgcagc tgctgcatct 1784

gccgccagcg ccgtgcgcgg aatggccatg ggcgccggct actacggcac tctggtggcc 1844

aactcgagtt ccaccaataa tcccgccagc ctgaacgagg agaagctgct gctgctgatg 1904

gcccagctcg aggccttgac ccagcgcctg ggcgagctga cccagcaggt ggctcagctg 1964

caggagcaga cgcgggccgc ggttgccacg gtgaaatcca aataaaaaat gaatcaataa 2024

ataaacggag acggttgttg attttaacac agagtctgaa tctttatttg atttttcgcg 2084

cgcggtaggc cctggaccac cggtctcgat cattgagcac ccggtggatt ttttccagga 2144

cccggtagag gtgggcttgg atgttgaggt acatgggcat gagcccgtcc cgggggtgga 2204

ggtagctcca ttgcagggcc tcgtgctcgg gggtggtgtt gtaaatcacc cagtcatagc 2264

aggggcgcag ggcgtggtgt tgcacaatat ctttgaggag gagactgatg gccacgggca 2324

gccctttggt gtaggtgttt acaaatctgt tgagctggga gggatgcatg cggggggaga 2384

tgaggtgcat cttggcctgg atcttgagat tggcgatgtt accgcccaga tcccgcctgg 2444

ggttcatgtt gtgcaggacc accagcacgg tgtatccggt gcacttgggg aatttatcat 2504

gcaacttaga agggaaggcg tgaaagaatt tggcgacgcc cttgtgcccg cccaggtttt 2564

ccatgcactc atccatgatg atggcgatgg gcccgtgggc ggcggcctgg gcaaaaacgt 2624

ttcgggggtc ggacacatca tagttgtggt cctgggtgag atcatcatag gccattttaa 2684

tgaatttggg gcggagggtg cctgactggg ggacaaaggt accctcgatc ccgggggcgt 2744

agttcccctc gcagatctgc atctcccagg ctttgagctc ggaggggggg atcatgtcca 2804

cctgcggggc gataaagaac acggtttccg gggcggggga gatgagctgc gccgaaagca 2864

agttccggag cagctgggac ttgccgcagc cggtggggcc gtagatgacc ccgatgaccg 2924

gctgcaggtg gtagttgagg gagagacagc tgccgtcctc gcggaggagg ggggccacct 2984

cgttcatcat ctcgcgcaca tgcatgttct cgcgcaccag ttccgccagg aggcgctctc 3044

cccccaggga taggagctcc tggagcgagg cgaagttttt cagcggcttg agtccgtcgg 3104

ccatgggcat tttggagagg gtctgttgca agagttccaa gcggtcccag agctcggtga 3164

tgtgctctac ggcatctcga tccagcagac ctcctcgttt cgcgggttgg gacgactgcg 3224

ggagtagggc accagacgat gggcgtccag cgcagccagg gtccggtcct tccagggccg 3284

cagcgtccgc gtcagggtgg tctccgtcac ggtgaagggg tgcgcgccgg gctgggcgct 3344

tgcgagggtg cgcttcaggc tcatccggct ggtcgaaaac cgctcccgat cggcgccctg 3404

cgcgtcggcc aggtagcaat tgaccatgag ttcgtagttg agcgcctcgg ccgcgtggcc 3464

tttggcgcgg agcttacctt tggaagtctg cccgcaggcg ggacagagga gggacttgag 3524

ggcgtagagc ttgggggcga ggaagacgga ctcgggggcg taggcatccg cgccgcagtg 3584

ggcgcagacg gtctcgcact ccacgagcca ggtgaggtcg ggctggtcgg ggtcaaaaac 3644

cagtttcccg ccgttctttt tgatgcgttt cttacctttg gtctccatga gctcgtgtcc 3704

ccgctgggtg acaaagaggc tgtccgtgtc cccgtagacc gactttatgg gccggtcttc 3764

gagcggtgtg ccacggtcct cctcgtagag gaaccccgcc cactccgaga cgaaagcccg 3824

ggtccaggcc agcacgaagg aggccacgtg ggacgggtag cggtcgttgt ccaccagcgg 3884

gtccactttt tccagggtat gcaaacacat gtccccctcg tccacatcca ggaaggtgat 3944

tggcttgtaa gtgtaggcca cgtgaccggg ggtcccggcc gggggggtat aaaagggggc 4004

gggcccctgc tcgtcctcac tgtcttccgg atcgctgtcc aggagcgcca gctgttgggg 4064

taggtattcc ctctcgaagg cgggcatgac ctcggcactc aggttgtcag tttctagaaa 4124

cgaggaggat ttgatattga cggtgccggc ggagatgcct ttcaagagcc cctcgtccat 4184

ctggtcagaa aagacgatct ttttgttgtc gagtttggtg gcgaaggagc cgtagagggc 4244

gttggaaagg agcttggcga tagagcgcat ggtctggttc ttttccttgt ctgcgcgctc 4304

cttggccgcg atgttgagct gcacgtactc gcgcgccacg cacttccatt cggggaagac 4364

ggtggtcagc tcgtcgggca cgattctgac ctgccagccc cggttatgca gggtgataag 4424

gtccacactg gtggccacct cgccgcgcag gggctcatta gtccagcaga ggcgcccgcc 4484

cttgcgcgag cagaaggggg gcagggggtc gagcatgacc tcgtcggggg ggtcggcatc 4544

gatggtgaag atgccgggca ggagatcggg gtcaaagtag ctgatggaag tggccagatc 4604

gtccagggca gcttgccatt cgcgcacggc cagcgcgcgc tcgtaggggc tgaggggcgt 4664

gccccagggc atgggatggg tgagcgcgga ggcgtacatg ccgcagatgt cgtagacgta 4724

gaggggctcc tcgaggatgc cgatgtaggt ggggtagcag cgccccccgc ggatgctggc 4784

gcgcacgtag tcatacagct cgtgcgaggg cgcgaggagc cccgggccca ggttggtgcg 4844

actgggcttt tcggcgcggt agacgatctg gcgaaagatg gcatgcgagt tggaggagat 4904

ggtgggcctt tggaagatgt tgaagtgggc gtgggggaga ccgaccgagt cgcggatgaa 4964

gtgggcgtag gagtcttgca gcttggcgac gagctcggcg gtgacgagga cgtccagagc 5024

gcagtagtcg agggtctcct ggatgatgtc atacttgagc tgtccctttt gtttccacag 5084

ctcgcggttg agaaggaact cttcgcggtc cttccagtac tcttcgaggg ggaacccgtc 5144

ctgatctgca cggtaagagc ctagcatgta gaactggttg acggccttgt aggcgcagca 5204

gcccttctcc acggggaggg cgtaggcctg ggcggccttg cgcagggagg tgtgcgtgag 5264

ggcgaaggtg tccctgacca tgactttgag gaactggtgc ttgaagtcga tatcgtcgca 5324

gcccccctgc tcccagagct ggaagtccgt gcgcttcttg taggcggggt tgggcaaagc 5384

gaaagtaaca tcgttgaaga ggatcttgcc cgcgcggggc ataaagttgc gagtgatgcg 5444

gaacggttgg ggcacctcgg cccggttgtt gatgacctgg gcggcgagca cgatctcgtc 5504

gaaaccgttg atgttgtggc ccacgatgta aagttccacg aatcgcgggc ggcccttgac 5564

gtggggcagt ttcttgagct cctcgtaggt gagctcgtcg gggtcgctga ggccgtgctg 5624

ctcgagcgcc cagtcggcga gatgggggtt ggcgcggagg aaggaagtcc agagatccac 5684

ggccagggcg gtctgcaggc ggtcccggta ctgacgaaac tgctgcccga cggccatttt 5744

ttcgggggtg acgcagtaga aggtgcgggg gtccccgtgc caacgatccc atttgagctg 5804

gagggcaaga tcgagggcga gctcgacgag gcggtcgtcc ccggagagtt tcatgaccag 5864

catgaagggg acgagctgct tgccgaagga ccccatccag gtgtaggttt ccacatcgta 5924

ggtgaggaag agcctttcgg tgcgaggatg cgagccgatg gggaagaact ggatctcctg 5984

ccaccaattg gaggaatggc tgttgatgtg atggaagtag aaatgccgac ggcgcgccga 6044

acactcgtgc ttgtgtttat acaagcggcc acagtgctcg caacgctgca cgggatgcac 6104

gtgctgcacg agctgtacct gagttccttt gacgaggaat ttcagtggga agtggagtcg 6164

tggcgcctgc atctcgtgct gtactacgtc gtggtggtcg gcctggccct cttctgcctc 6224

gatggtggtc atgctgacga gcccgcgcgg gaggcaggtc cagacctcgg cgcgagcggg 6284

tcggagagcg aggacgaggg cgcgcaggcc ggagctgtcc agggtcctga gacgctgcgg 6344

agtcaggtca gtgggcagcg gcggcgcgcg gttgacttgc aggagttttt ccagggcgcg 6404

cgggaggtcc agatggtact tgatctccac cgcgccgttg gtggcgacgt cgatggcttg 6464

cagggtcccg tgcccctggg gtgtgaccac cgtcccccgt ttcttcttgg gcggctgggg 6524

aggcggtgcc tcttccatgg ttagaagcgg cggcgaggac gcgcgccggg cggcaggggc 6584

ggctctgggc ccggaggcag gggcggcagg ggcacgtcgg cgccgcgcgc gggtaggttc 6644

tggtactgcg cccggagaag actggcgtga gcgacgacgc gacggttgac gtcctggatc 6704

tgacgcctct gggtgaaggc cacgggaccc gtgagtttga acctgaaaga gagttcgaca 6764

gaatcaatct cggtatcgtt gacggcggcc tgccgcagga tctcttgcac gtcgcccgag 6824

ttgtcctggt aggcgatctc ggtcatgaac tgctcgatct cctcctcctg aaggtctccg 6884

cggccggcgc gctccacggt ggccgcgagg tcgttggaga tgcggcccat gagctgcgag 6944

aaggcgttca tgcccgcctc gttccagacg cggctgtaga ccacgacgcc ctcgggatcg 7004

cgggcgcgca tgaccacctg ggcgaggttg agctccacgt ggcgcgtgaa gaccgcgtag 7064

ttgcagaggc gctggtaaag gtagttgagc gtggtggcga tgtgctcggt gacgaagaaa 7124

tacatgatcc agcggcggag cggcatctcg ctgacgtcgc ccagcgcctc caaacgttcc 7184

atggcctcgt aaaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgagacggtc 7244

aactcctcct ccagaagacg gatgagctcg gcgatggtgg cgcgcacctc gcgctcgaag 7304

gcccccggga gttcctccac ttcctcttct tcctcctcca ctaacatctc ttctacttcc 7364

tcctcaggcg gtggtggcgg gggagggggc ctgcgtcgcc ggcggcgcac gggcagacgg 7424

tcgatgaaac gctcgatggt ctcgccgcgc cggcgtcgca tggtctcggt gacggcgcgc 7484

ccgtcctcgc ggggccgcag cgtgaagacg ccgccgcgca tctccaggtg gccggggggg 7544

tccccgttgg gcagggagag ggcgctgacg atgcatctta tcaattgccc cgtagggact 7604

ccgcgcaagg acctgagcgt ctcgagatcc acgggatctg aaaaccgttg aacgaaggct 7664

tcgagccagt cgcagtcgca aggtaggctg agcacggttt cttctggcgg gtcatgttgg 7724

ttggagggag cggggcgggc gatgctgctg gtgatgaagt tgaaataggc ggttctgaga 7784

cggcggatgg tggcgaggag caccaggtct ttgggcccgg cttgctggat gcgcagacgg 7844

tcggccatgc cccaggcgtg gtcctgacac ctggccaggt ccttgtagta gtcctgcatg 7904

agccgctcca cgggcacctc ctcctcgcct gcgcggccgt gcatgcgcgt gagcccgaag 7964

ccgcgctggg gctggacgag cgccaggtcg gcgacgacgc gctcggcgag gatggcctgc 8024

tggatctggg tgagggtggt ctggaagtcg tcaaagtcga cgaagcggtg gtaggctccg 8084

gtgttgatgg tgtaggagca gttggccatg acggaccagt tgacggtctg gtggcccgga 8144

cgcacgagct cgtggtactt gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg 8204

caggtgcgca ccaggtactg gtagccgatg aggaagtgcg gcggcggctg gcggtagagc 8264

ggccatcgct cggtggcggg ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag 8324

ccgtagatgt acctggacat ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac 8384

tcgcggacgc ggttccagat gttgcgcagc ggcaggaagt agttcatggt gggcacggtc 8444

tggcccgtga ggcgcgcgca gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag 8504

cggctcgact ccgtggcctg gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt 8564

tcgaatctcg aatcaggctg gagccgcagc taacgtggta ctggcactcc cgtctcgacc 8624

caagcctgca ccaaccctcc aggatacgga ggcgggtcgt tttgcaactt ttttcggagg 8684

ccggatgaaa ctagtaagcg cggaaagcgg ccgaccgcga tggctcgctg ccgtagtctg 8744

gagaagaatc gccagggttg cgttgcggtg tgccccggtt cgaggccggc cggattccgc 8804

ggctaacgag ggcgtggctg ccccgtcgtt tccaagaccc ctagccagcc gacttctcca 8864

gttacggagc gagcccctct tttgtttttt gtttttgcca gatgcatccc gtactgcggc 8924

agatgcgccc ccaccaccct ccaccgcaac aacagccccc tcctccacag ccggcgcttc 8984

tgcccccgcc ccagcagcag cagcaacttc cagccacgac cgccgcggcc gccgtgagcg 9044

gggctggaca gagttatgac caccagctgg ccttggaaga gggcgagggg ctggcgcgcc 9104

tgggggcgtc gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgtgagg 9164

cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 9224

gcgcggcccg gttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagagggtgc 9284

tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 9344

tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 9404

aaaaatcctt caacaaccac gtgcgcaccc tgatcgcgcg cgaggaggtg accctgggcc 9464

tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccaccagc aagccgctga 9524

cggcgcagct gttcctggtg gtgcagcaca gtcgggacaa cgaggcgttc agggaggcgc 9584

tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 9644

gcatcgtggt gcaggagcgc gggctgccgc tgtccgagaa gctggcggcc atcaacttct 9704

cggtgctgag tctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 9764

tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 9824

tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagca 9884

ggcggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 9944

ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcactgg cagcccagcc 10004

gccgggcctt ggaggcggca ggcggtcccc cctacgtaga agaggtggac gaggacgagg 10064

agggcgagta cctggaagac tgatggcgcg accgtatttt tgctagatgc aacaacagcc 10124

acctcctgat cccgcgatgc gggcggcgct gcagagccag ccgtccggca ttaactcctc 10184

ggacgattgg acccaggcca tgcaacgcat catggcgctg acgacccgca accccgaagc 10244

ctttagacag cagccccagg ccaaccggct ctcggccatc ctggaggccg tggtgccctc 10304

gcgctacaac cccacgcacg agaaggtcct ggccatcgtg aacgcgctgg tggagaacaa 10364

ggccatccgc ggcgacgagg ccggcctggt gtacaacgcg ctgctggagc gcgtggcccg 10424

ctacaacagc accaacgtgc agaccaacct ggaccgcatg gtgaccgatg tgcgcgaggc 10484

cgtggcccag cgcgagcggt tccaccgcga gtccaacctg ggatccatgg tggcgctgaa 10544

cgccttcctc agcacccagc ccgccaacgt gccccggggc caggaggact acaccaactt 10604

catcagcgct ctgcgcctga tggtgaccga ggtgccccag agcgaggtgt accagtccgg 10664

gccggactac ttcttccaga ccagtcgcca gggcttgcag accgtgaacc tgagccaggc 10724

tttcaagaac ttgcagggat tgtggggcgt gcaggccccg gtcggggacc gcgcgacggt 10784

gtcgagcctg ctgacgccga actcgcgcct gctgctgctg ctggtggccc ccttcacgga 10844

cagcggcagc atcaaccgca actcgtacct gggctacctg attaacttgt accgcgaggc 10904

catcggccag gcgcacgtgg acgagcagac ctaccaggag atcacccacg tgagccgcgc 10964

cctgggccag gacgacccgg gcaatctgga agccaccctg aactttttgc tgaccaaccg 11024

gtcgcagaag atcccgcccc agtacgcgct cagcgccgag gaggagcgca ttctgcgata 11084

cgtgcagcag agcgtgggcc tgttcctgat gcaggagggg gccaccccca gcgccgcgct 11144

cgacatgacc gcgcgcaaca tggagcccag catgtacgcc agcaaccgcc cgttcatcaa 11204

taagctgatg gactacttgc atcgggcggc cgccatgaac tcggactatt tcaccaacgc 11264

catcctgaat ccccactggc tcccgccgcc ggggttctac acgggcgagt acgacatgcc 11324

cgaccccaat gacgggttcc tgtgggacga tgtggacagc agcgtgttct ccccccgacc 11384

gggtgctaac gagcgcccct tgtggaagaa ggaaggcagc gaccgtcgcc cgtcctcggc 11444

gctgtccggc cgggagggtg ctgccgcggc ggtgcccgag gccgccagtc ctttcccgag 11504

cttgcccttc tcgctgaaca gtattcgcag cagcgagctg ggcaggatca cgcgtccgcg 11564

cttgctgggc gaggaggagt acttgaatga ctcgctgttg agacccgagc gggagaaaaa 11624

cttccccaac aacgggatag agagcctggt ggacaagatg agccgctgga agacgtatgc 11684

gcaggagcac agggacgatc cgtcgcaggg ggccacgagc cggggcagcg cggcccgtaa 11744

acgccggtgg cacgacaggc agcggggact gatgtgggac gatgaggatt ccgccgacga 11804

cagcagcgtg ttggacttgg gtgggagtgg tggtaacccg ttcgctcacc tgcgcccccg 11864

catcgggcgc atgatgtaag aaaccgaaaa taaatgatac tcaccaaggc catggcgacc 11924

agcgtgcgtt cgtttcttct ctgttgttta gtatgatgag gcgtgcgtac ccggagggtc 11984

ctcctccctc gtacgagagc gtgatgcagc aggcgatggc ggcggcggcg atgcagcccc 12044

cgctggaggc tccttacgtg cccccgcggt acctggcgcc tacggagggg cggaacagca 12104

ttcgttactc ggagctggca cccttgtacg ataccacccg gttgtacctg gtggacaaca 12164

agtcggcgga catcgcctcg ctgaactacc agaacgacca cagcaacttc ctgaccaccg 12224

tggtgcagaa caatgacttc acccccacgg aggccagcac ccagaccatc aactttgacg 12284

agcgctcgcg gtggggcggc cagctgaaaa ccatcatgca caccaacatg cccaacgtga 12344

acgagttcat gtacagcaac aagttcaagg cgcgggtcat ggtctcccgc aagaccccca 12404

acggggtcac agtaggggat gactatgatg gtagtcagga tatcttggag tatgaatggg 12464

tggagtttga gctgcccgaa ggcaacttct cggtgaccat gaccatcgac ctgatgaaca 12524

acgccatcat cgacaattac ttggcggtgg ggcggcagaa cggggtgctc gagagcgaca 12584

tcggcgtgaa gttcgacacc aggaacttca ggctgggctg ggaccccgtg accgagctgg 12644

tcatgccagg ggtgtacacc aacgaggcct tccatcccga tattgtcttg ctgcccggct 12704

gcggggtgga cttcaccgag agccgcctca gcaacctgct gggcattcgc aagaggcagc 12764

ccttccagga gggtttccag atcatgtacg aggatctgga ggggggcaac atccccgcgc 12824

tcctggatgt cgacgcctat gagaaaagca aggaggaagc agcagctgaa gcaactgcag 12884

cggtagccac cgcctctacc gaggtcaggg gcgataattt tgcaagcgct gcggcagtgg 12944

cagcggccga ggcggctgaa accgaaagta agatagtcat tcagccggtg gagaaggata 13004

gcaaggacag gagctacaac gtgctgccgg acaagataaa caccgcctac cgcagctggt 13064

acctggccta caactatggc gaccccgaga agggcgtgcg ctcctggacg ctgctcacca 13124

cctcggacgt cacctgcggc gtggagcaag tctactggtc gctgcccgac atgatgcaag 13184

acccggtcac cttccgctcc acgcgtcaag ttagcaacta cccggtggtg ggcgccgagc 13244

tcctgcccgt ctactccaag agcttcttca acgagcaggc cgtctactcg cagcagctgc 13304

gcgccttcac ctcgctcacg cacgtcttca accgcttccc cgagaaccag atcctcgtcc 13364

gcccgcccgc gcccaccatt accaccgtca gtgaaaacgt tcctgctctc acagatcacg 13424

ggaccctgcc gctgcgcagc agtatccggg gagtccagcg cgtgaccgtt actgacgcca 13484

gacgccgcac ctgcccctac gtctacaagg ccctgggcat agtcgcgccg cgcgtccttt 13544

cgagccgcac cttctaaaaa aatgtccatt ctcatctcgc ccagtaataa caccggttgg 13604

ggcctgcgcg cgcccagcaa gatgtacgga ggcgctcgcc aacgctccac gcaacacccc 13664

gtgcgcgtgc gcgggcactt ccgcgctccc tggggcgccc tcaagggccg cgtgcggtcg 13724

cgcaccaccg tcgacgacgt gatcgaccag gtggtggccg acgcgcgcaa ctacaccccc 13784

gccgccgcgc ccgtctccac cgtggacgcc gtcatcgaca gcgtggtggc cgacgcgcgc 13844

cggtacgccc gcgccaagag ccggcggcgg cgcatcgccc ggcggcaccg gagcaccccc 13904

gccatgcgcg cggcgcgagc cttgctgcgc agggccaggc gcacgggacg cagggccatg 13964

ctcagggcgg ccagacgcgc ggcctcaggc gccagcgccg gcaggacccg gagacgcgcg 14024

gccacggcgg cggcagcggc catcgccagc atgtcccgcc cgcggcgagg gaacgtgtac 14084

tgggtgcgcg acgccgccac cggtgtgcgc gtgcccgtgc gcacccgccc ccctcgcact 14144

tgaagatgtt cacttcgcga tgttgatgtg tcccagcggc gaggaggatg tccaagcgca 14204

aattcaagga agagatgctc caggtcatcg cgcctgagat ctacggcccc gcggcggcgg 14264

tgaaggagga aagaaagccc cgcaaaatca agcgggtcaa aaaggacaaa aaggaagaag 14324

atgatgtgga cggattggtg gagtttgtgc gcgagttcgc ccccaggcgg cgcgtgcagt 14384

ggcgcgggcg gaaggtgcaa ccggtgctga gacccggcac caccgtggtt ttcacgccag 14444

gcgagcgctc cggcaccgcc tccaagcgct cctacgacga ggtgtacggg gatgatgata 14504

ttctggaaca ggcggccgag cgcctgggcg agtttgctta cggcaagcgc agccgctccg 14564

cgccgaagga agaggcggtg tccatcccgc tggaccacgg caaccccacg ccgagcctca 14624

agcccgtgac cctgcagcag gtgctgccga ccgcggcgcc gcgaaggggg ttcaagcgcg 14684

agggcgagga tctgtacccc accatgcagc tgatggtgcc caagcgccag aagctggaag 14744

acgtgctgga gaccatgaag gtggacccgg acgtgcagcc cgaggtcaag gtgcggccca 14804

tcaagcaggt ggccccgggc ctgggcgtgc agaccgtgga catcaagatc cccacggagc 14864

ccatggaaac gcagaccgag cccgtgaagc ccagcaccag caccatggag gtgcagacgg 14924

atccctggat gccatcggct cctagccgaa gaccccggcg caagtacggc gcggccagcc 14984

tgctgatgcc caactacgcg ctgcatcctt ccatcatccc cacgcctggc taccgcggca 15044

cgcgcttcta ccgcggtcat acaaccagcc gccgccgcaa gaccaccact cgccgccgcc 15104

gtcgtcgcac cgccgctgca actacccctg ccgccttggt gcggagagtg taccgccgcg 15164

gccgcgcgcc tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc atttaaaact 15224

ttcgcctgct ttgcagatgg ccctcacatg ccgcctccgc gtccccatta cgggctaccg 15284

aggaagaaaa ccgcgccgta gaaggctggc ggggaacggg atgcgtcgcc accaccaccg 15344

gcggcggcgc gccatcagca agcggttggg gggaggcttc ctgcccgcgc tgatccccat 15404

catcgccgcg gcgatcgggg cgatccccgg cattgcttcc gtggcggtgc aggcctctca 15464

gcgccactga gacacacttg gaaaacatct tgtaataaac caatggactc tgacgctcct 15524

ggtcctgtga tgtgttttcg tagacagatg gaagacatca atttttcgtc cctggctccg 15584

cgacacggca cgcggccgtt catgggcacc tggagcgaca tcggcaccag ccaactgaac 15644

gggggcgcct tcaattggag cagtctctgg agcgggctta agaatttcgg gtccacgctt 15704

aaaacctatg gcagcaaggc gtggaacagc accacagggc aggcgctgag ggaaaagctg 15764

aaagagcaga acttccagca gaaggtggtc gatgggctcg cctcgggcat caacggggtg 15824

gtggacctgg ccaaccaggc cgtgcagcgg cagatcaaca gccgcctgga cccggtgccg 15884

cccgccggct ccgtggagat gccgcaggtg gaggaggagc tgcctcccct ggacaagcgg 15944

ggcgagaagc gaccccgccc cgacgcggag gagacgctgc tgacgcacac ggacgagccg 16004

cccccgtacg aggaggcggt gaaactgggt ctgcccacca cgcggcccat cgcgcccctg 16064

gccaccgggg tgctgaaacc cgaaagtagt aagcccgcga ccctggactt gcctcctccc 16124

gcttcccgcc cctccacagt ggctaagccc ctgccgccgg tggccgtggc ccgcgcgcga 16184

cccgggggca ccgcccgccc tcatgcgaac tggcagagca ctctgaacag catcgtgggt 16244

ctgggagtgc agagtgtgaa gcgccgccgc tgctattaaa cctaccgtag cgcttaactt 16304

gcttgtctgt gtgtgtatgt attatgtcgc cgccgccgct gtccgccaga aggaggagtg 16364

aagaggcgcg tcgccgagtt gcaagatggc caccccatcg atgctgcccc agtgggcgta 16424

catgcacatc gccggacagg acgcttcgga gtacctgagt ccgggtctgg tgcagttcgc 16484

ccgcgccaca gacacctact tcagtctggg gaacaagttt aggaacccca cggtggcgcc 16544

cacgcacgat gtgaccaccg accgcagcca gcggctgacg ctgcgcttcg tgcccgtgga 16604

ccgcgaggac aacacctact cgtacaaagt gcgctacacg ctggccgtgg gcgacaaccg 16664

cgtgctggac atggccagca cctactttga catccgcggc gtgctggacc ggggccctag 16724

cttcaaaccc tactccggca ctgcctacaa cagcctggcc cccaagggag cacccaacac 16784

ttgccagtgg aaggactctg acaacaaaat gcataccttt ggggtagctg ccatgcccgg 16844

tgttactggg aaaaagatag aagctgatgg gctgcctatt ggaatagatt caacttctgg 16904

aactgacacc ataatttatg ctgataaaac tttccaaccc gagccacagg ttggaaatga 16964

cagttgggtt gacaccaatg gtgcagagga aaaatatgga ggcagagctc tcaaggacac 17024

tacaaatatg aaaccctgct atggttcttt cgccaagccc accaacaaag aaggtggtca 17084

ggctaactta aaagattccg aaactgccac caccactcct aactatgata tagatctggc 17144

tttctttgac agcaaaaata tcgccgctaa ctacgatcca gatattgtaa tgtatacaga 17204

aaatgttgac ctgcagactc cagatactca tattgtatac aaacctggaa cagaggacac 17264

cagctctgaa tccaatttgg gtcagcaggc catgcccaac agacccaact atattggttt 17324

cagggacaac tttatcggtc tcatgtacta caacagcact ggcaatatgg gggtgctggc 17384

cggtcaggcc tctcagctga atgctgtggt tgacttgcaa gacagaaaca ccgagctgtc 17444

ctaccagctc ttgcttgact ctctgggtga cagaaccagg tatttcagta tgtggaatca 17504

ggcggtggac agctatgatc ctgatgtgcg cattattgaa aaccatggtg tggaagatga 17564

attgccaaac tattgctttc cattgaatgg tgtgggattt acagatactt accagggtgt 17624

taaagttaaa acagatgcaa atgctggaaa caatggaaca cagtgggaca aagatgacac 17684

cacagttagc actgctaatg aaatccatgg aggcaatcct tttgccatgg aaatcaacat 17744

tcaggccaac ctgtggcgga acttcctcta tgccaacgtg gccctctatt tgcctgataa 17804

atacaaatac acacctgcca acatcaccct gcccaccaac accaacacct acgattacat 17864

gaacggccgg gtggtggcgc cctcgctggt ggacgcctac atcaacatcg gggcgcgctg 17924

gtcgctggac cccatggaca acgtgaatcc cttcaaccac caccgcaacg cggggctgcg 17984

ctaccgctcc atgcttctgg gcaacgggcg ctacgtgccc ttccacatcc aggtgcccca 18044

gaaatttttc gccatcaaga gcctcctgct cctgcccggg tcctacacct acgagtggaa 18104

cttccgcaag gacgtcaaca tgatcctgca gagctccctc ggcaacgacc tgcgcacgga 18164

cggggcctcc atctccttca ccagcatcaa cctctacgcc accttcttcc ccatggcgca 18224

caacacggcc tccacgctcg aggccatgct gcgcaacgac accaacgacc agtccttcaa 18284

cgactacctc tcggcggcca acatgctcta tcccatcccg gccaacgcca ccaacgtgcc 18344

catatccatc ccctcgcgca actgggccgc cttccgcggc tggtccttca cgcgcctcaa 18404

gaccaaggag acgccctcgc tgggctccgg gttcgacccc tacttcgtct actcgggctc 18464

catcccctac ctcgacggca ccttctacct caaccacacc ttcaagaagg tctccatcac 18524

cttcgactcc tccgtcagct ggcccggcaa cgaccggctc ctgacgccca acgagttcga 18584

aatcaagcgc accgtcgacg gcgagggcta caatgtggcc cagtgcaaca tgaccaagga 18644

ctggttcctg gtccagatgc tggcccacta caacatcggc taccagggct tctacgtgcc 18704

cgagggctac aaggaccgca tgtactcctt cttccgcaac ttccagccca tgagccgcca 18764

ggtggtggac gaggtcaact acaaggacta ccaggccgtc accctggcct accagcacaa 18824

caactcgggc ttcgtcggct acctcgcgcc caccatgcgc cagggccagc cctaccccgc 18884

caactaccca tacccgctca tcggcaagag cgccgtcacc agcgtcaccc agaaaaagtt 18944

cctctgcgac agggtcatgt ggcgcatccc cttctccagc aacttcatgt ccatgggcgc 19004

gctcaccgac ctcggccaga acatgctcta tgccaactcc gcccacgcgc tagacatgaa 19064

tttcgaagtc gaccccatgg atgagtccac ccttctctat gttgtcttcg aagtcttcga 19124

cgtcgtccga gtgcaccagc cccaccgcgg cgtcatcgag gccgtctacc tgcgcacccc 19184

cttctcggcc ggtaacgcca ccacctaaat tgcttcttgc aagctatggc ctgtgccacg 19244

ggctccggcg agcaggagct cagggccatc atccgcgacc tgggctgcgg gccctacttc 19304

ctgggcacct tcgataagcg cttcccggga ttcatggccc cgcacaagct ggcctgcgcc 19364

atcgtcaaca cggccggccg cgagaccggg ggcgagcact ggctggcctt cgcctggaac 19424

ccgcgctcga acacctgcta cctcttcgac cccttcgggt tctcggacga gcgcctcaag 19484

cagatctacc agttcgagta cgagggcctg ctgcgtcgca gcgccctggc caccgaggac 19544

cgctgcgtca ccctggaaaa gtccacccag accgtgcagg gtccgcgctc ggccgcctgc 19604

gggctcttct gctgcatgtt cctgcacgcc ttcgtgcact ggcccgaccg ccccatggac 19664

aagaacccca ccatgaactt gctgacgggg gtgcccaacg gcatgctcca gtcgccccag 19724

gtggaaccca ccctgcgccg caaccaggag gcgctctacc gcttcctcaa ctcccactcc 19784

gcctactttc gctcccaccg cgcgcgcatc gagaaggcca ccgccttcga ccgcatgaac 19844

aatcaagaca tgtaaaccgt gtgtgtatgt taaaatgtct ttaataaaca gcactttcat 19904

gttacacatg catctgagat ggtttattat ttagaaatcg aaagggttct gccgggtctc 19964

ggcatggccc gcgggcaggg acacgttgcg gaactggtac ttggccagcc acttgaactc 20024

ggggatcagc agtttcggca gcggggtgtc ggggaaggag tcggtccaca gcttccgcgt 20084

cagttgcagg gcgcccagga ggtcgggcgc ggagatcttg aaatcgcagt tgggacccgc 20144

gttctgcgcg cgagagttgc ggtacacggg gttgcagcac tggaacacca tcagggccgg 20204

gtgcttcacg ctcgccagca ccgtcgcgtc ggtgatgctc tccacgtcga ggtcctcggc 20264

gttggccatc ccgaaggggg tcatcttgca ggtctgcctg cccatggtgg gcacgcagcc 20324

gggcttgtgg ttgcaatcgc agtgcagggg gatcagcatc atctgggcct ggtcggcgtt 20384

catccccggg tacatggcct tcatgaaagc ctccaattgc ctaaaagcct gctgggcctt 20444

ggctccctcg gtgaagaaga ccccgcaaga cttgctagag aactggttgg tagcgcaccc 20504

ggcgtcatgc acgcagcagc gcgcgtcgtt gttggccagc tgcaccacgc tgcgccccca 20564

gcggttctgg gtgatcttgg cccggttggg gttctccttc agcgcgcgct gcccgttctc 20624

gctcgccaca tccatctcga tcatgtgctc cttctggatc atggtggtcc cgtgcaggca 20684

ccgcagcttg ccctcggtct cggtgcaacc gtgcagccac agcgcgcacc cggtgcactc 20744

ccagttcttg tgggcgatct gtgaatgcgc gtgcacgaac ccttgcagga agcggcccat 20804

catggtcgtc agggtcttgt tgctagtgaa ggtcagcgga atgccgcggt gctcctcgtt 20864

gatgtacagg tggcagatgc ggcggtacac ctcgccctgc tcgggcatca gctggaagtt 20924

ggctttcagg tcagtctcca cgcggtagcg gtccatcagt atagtcatga tttccatgcc 20984

cttctcccag gccgagacga tgggcaggct catagggttc ttcaccatca tcttagcgct 21044

agccgccgca gccagggggt cgctctcgtc cagggtctca aagctccgct tgccgtcctt 21104

ctcggtgatc cgcacggggg ggtagctgaa gcccacggcc gccagctcct cctcggcctg 21164

cctttcgtcc tcgctgtcct ggctgacgtc ctgcagaacc acatgcttgg tcttgcgggg 21224

tttcttcttg ggtggcagtg gcggcggaga tgcttgcggc gagggggagc gcgagttctc 21284

gctcaccact actatctctt cctcttcttg gtccgaggcc acgcggcggt aggtatgtct 21344

cttcgggggc agaggcggag gcgacgggct ctcgccgccg cgacttggcg gatggctggc 21404

agagcccctt ccgcgatcgg gggtgcgctc ccggcggcgc tctgactgac ttcctccgcg 21464

gccggccatt gtgttctcct agggaacaac aagcatggag actcagccat cgccaacctc 21524

gccatctgcc cccaccaccg ccgacgagaa gcagcagcag cagaatgaaa gcttaaccgc 21584

cccgccgccc agccccgcca cctccgacgc ggcggtccca gacatgcaag agatggagga 21644

atccatcgag attgacctgg gctatgtgac gcccgcggag cacgaggagg agctggcagt 21704

gcgctttcaa tcgtcaagcc aggaagataa agaacagcca gagcaggaag cagaaaacga 21764

gcagagtcag gctgggctcg agcatggcga ctacctccac ctgagcgggg gggaggacgc 21824

gctcatcaag catctggccc ggcaggccat catcgtcaag gatgcgctgc tcgaccgcac 21884

cgaggtgccc ctcagcgtgg aggagctcag ccgcgcctac gagctcaacc tcttctcgcc 21944

gcgcgtgccc cccaagcgcc agcccaacgg cacctgcgag cccaacccgc gcctcaactt 22004

ctacccggtc ttcgcggtgc ccgaggccct ggccacctac cacatctttt tcaagaacca 22064

aaagatcccc gtctcctgcc gcgccaaccg cacccgcgcc gacgcccttt tgaacctggg 22124

ccccggcgcc cgcctacctg atatcgcctc cttggaagag gttcccaaga tcttcgaggg 22184

tctgggcagc gacgagactc gggccgcgaa cgctctgcaa ggagaaggag gagagcatga 22244

gcaccacagc gccctggtcg agttggaagg cgacaacgcg cggctggcgg tgctcaaacg 22304

cacggtcgag ctgacccatt tcgcctaccc ggctctgaac ctgcccccca aagtcatgag 22364

cgcggtcatg gaccaggtgc tcatcaagcg cgcgtcgccc atctccgagg acgagatgca 22424

ggaccccgag agctccgagg agggcaagcc cgtggtcagc gacgagcagc tggcccggtg 22484

gctgggtcct aatgctaccc ctcagagttt ggaagagcgg cgcaagctca tgatggccgt 22544

ggtcctggtg accgtggagc tggagtgtct gcgccgcttc ttcgccgacg cggagaccct 22604

gcgcaaggtc gaggagaacc tgcactacat cttcaggcac gggttcgtgc gccaggcctg 22664

caagatttcc aacgtggagc tgaccaacct ggtctcctac atgggcatct tgcacgagaa 22724

ccgactgggg cagaacgtgc tgcacaccac cctgcgcggg gaggcccggc gcgactacat 22784

ccgcgactgc gtctacctct acctctgcca cacctggcag acgggcatgg gcgtgtggca 22844

gcagtgcctg gaggagcaga acctgaaaga gctctgcaag ctcctgcaga agaacctcaa 22904

gggtctgtgg accgggttcg acgagcgcac caccgcctcg gacctggccg acctcatctt 22964

ccccgagcgc ctcaggctga cgctgcgcaa cggcctgccc gactttatga gccaaagcat 23024

gttgcaaaac tttcgctctt tcatcctcga acgctccgga atcctgcccg ccacctgctc 23084

cgcgctgccc tcggacttcg tgccgctgac cttccgcgag tgccccccgc cgctgtggag 23144

ccactgctac ctgctgcgcc tggccaacta cctggcctac cactcggacg tgatcgagga 23204

cgtcagcggc gagggcctgc tcgagtgcca ctgccgctgc aacctctgca cgccgcaccg 23264

ctccctggcc tgcaaccccc agctgctgag cgagacccag atcatcggca ccttcgagtt 23324

gcaagggccc agcgatgagg gcgagggatc ggccaagggg ggtctgaaac tcaccccggg 23384

gctgtggacc tcggcctact tgcgcaagtt cgtgcccgag gactaccatc ccttcgagat 23444

caggttctac gaggaccaat cccagccgcc caaggccgag ctgtcggcct gcgtcatcac 23504

ccagggggcc atcctggccc aattgcaagc catccagaaa tcccgccaag aattcttgct 23564

gaaaaagggc cgcggggtct acctcgaccc ccagaccggt gaggagctca accccggctt 23624

cccccagg atg ccc cga gga aac aag aag ctg aaa gtg gag ctg ccg ccc 23674

Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro

500 505 510

gtg gag gat ttg gag gaa gac tgg gag aac agc agt cag gca gag gag 23722

Val Glu Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu

515 520 525

gag gag atg gag gaa gac tgg gac agc act cag gca gag gag gac agc 23770

Glu Glu Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser

530 535 540

ctg caa gac agt ctg gag gaa gac gag gag gag gca gag gag gag gtg 23818

Leu Gln Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val

545 550 555

gaa gaa gca gcc gcc gcc aga ccg tcg tcc tcg gcg ggg gag aaa gca 23866

Glu Glu Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala

560 565 570

agc agc acg gat acc atc tcc gct ccg ggt cgg ggt ccc gct cgg ccc 23914

Ser Ser Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro

575 580 585 590

cac agt aga tgg gac gag acc ggg cga ttc ccg aac ccc acc acc cag 23962

His Ser Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln

595 600 605

acc ggt aag aag gag cgg cag gga tac aag tcc tgg cgg ggg cac aaa 24010

Thr Gly Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys

610 615 620

aac gcc atc gtc tcc tgc ttg cag gcc tgc ggg ggc aac atc tcc ttc 24058

Asn Ala Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe

625 630 635

acc cgg cgc tac ctg ctc ttc cac cgc ggg gta aac ttc ccc cgc aac 24106

Thr Arg Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn

640 645 650

atc ttg cat tac tac cgt cac ctc cac agc ccc tac tac ttc caa gaa 24154

Ile Leu His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Phe Gln Glu

655 660 665 670

gag gca gca gca gaa aaa gac cag cag aaa acc agc agc agc 24196

Glu Ala Ala Ala Glu Lys Asp Gln Gln Lys Thr Ser Ser Ser

675 680

tagaaaatcc acagcggcgg cagcgacagg tggactgagg atcgcggcga acgagccggc 24256

gcagacccgg gagctgagga accggatctt tcccaccctc tatgccatct tccagcagag 24316

tcgggggcag gagcaggaac tgaaagtcaa gaaccgttct ctgcgctcgc tcacccgcag 24376

ttgtctgtat cacaagagcg aagaccaact tcagcgcact ctcgaggacg ccgaggctct 24436

cttcaacaag tactgcgcgc tcactcttaa agagtagccc gcgcccgccc agtcgcagaa 24496

aaaggcggga attacgtcac ctgtgccctt cgccctagcc gcctccaccc ctcatcatga 24556

gcaaagagat tcccacgcct tacatgtgga gctaccagcc ccagatgggc ctggccgccg 24616

gcgccgccca ggactactcc acccgcatga attggctcag cgccgggccc gcgatgatct 24676

cacgggtgaa tgacatccgc gcccaccgaa accagatact cctagaacag tcagcgctca 24736

ccgccacgcc ccgcaatcac ctcaatccgc gtaattggcc cgccgccctg gtgtaccagg 24796

aaattcccca gcccacgacc gtactacttc cgcgagacgc ccaggccgaa gtccagctga 24856

ctaactcagg tgtccagctg gcgggcggcg ccaccctgtg tcgtcaccgc cccgctcagg 24916

gtataaagcg gctggtgatc cggggcagag gcacacagct caacgacgag gtggtgagct 24976

cttcgctggg tctgcgacct gacggagtct tccaactcgc cggatcgggg agatcttcct 25036

tcacgcctcg tcaggccgtg ctgactttgg agagttcgtc ctcgcagccc cgctcgggcg 25096

gcatcggcac tctccagttc gtggaggagt tcactccctc ggtctacttc aaccccttct 25156

ccggctcccc cggccactac ccggacgagt tcatcccgaa ctttgacgcc atcagcgagt 25216

cggtggacgg ctacgattga atgtcccatg gtggcgcggc tgacctagct cggcttcgac 25276

acctggacca ctgccgccgc tttcgctgct tcgctcggga cctcgccgag ttcacctact 25336

ttgagctgcc cgaggagcat cctcagggcc cggcccacgg agtgcggatc gtcgtcgaag 25396

ggggcctaga ctcccacctg cttcggatct tcagccagcg cccgatcctg gtcgagcgcc 25456

aacagggcaa caccctcctg accctctact gcatctgcga ccaccccggc ctgc atg 25513

Met

685

aaa gtc ttt gtt gtc tgc tgt gta ctg agt ata ata aaa gct gag atc 25561

Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu Ile

690 695 700

agc gac tac tcc gga ctc aac tgt ggt gtt tct gca tcc atc aat cgg 25609

Ser Asp Tyr Ser Gly Leu Asn Cys Gly Val Ser Ala Ser Ile Asn Arg

705 710 715

tca ctg acc ttc acc ggg aac gag acc gag ctc cag ctc cag tgt aag 25657

Ser Leu Thr Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys Lys

720 725 730

ccc cac aag aag tac ctc acc tgg ctg tac cag ggc tcc ccg atc gcc 25705

Pro His Lys Lys Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile Ala

735 740 745

gtt gtt aac cac tgc gac gac gac gga gtc ctg ctg aac ggt ccc gcc 25753

Val Val Asn His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro Ala

750 755 760 765

aac ctt act ttt tcc acc cgc aga agc aag ctc gag ctc ttc caa ccc 25801

Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Glu Leu Phe Gln Pro

770 775 780

ttc ctc ccc ggg acc tat cag tgc atc tcg gga ccc tgc cat cac acc 25849

Phe Leu Pro Gly Thr Tyr Gln Cys Ile Ser Gly Pro Cys His His Thr

785 790 795

ttc cac ctg atc ccg aat acc acc tct tcc cca gcg ccg ctc ccc act 25897

Phe His Leu Ile Pro Asn Thr Thr Ser Ser Pro Ala Pro Leu Pro Thr

800 805 810

aac aac caa act aac cac cac caa cgc tac cga cgc gac ctc gtt gaa 25945

Asn Asn Gln Thr Asn His His Gln Arg Tyr Arg Arg Asp Leu Val Glu

815 820 825

tct aat acc acc cac acc gga ggt gag ctc cga ggt cct gaa tcc tct 25993

Ser Asn Thr Thr His Thr Gly Gly Glu Leu Arg Gly Pro Glu Ser Ser

830 835 840 845

ggg att tat tac ggc ccc tgg gag gtg gtg ggg tta ata gct tta ggc 26041

Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu Gly

850 855 860

tta gta gcg ggt ggg ctt ttg gct ctc tgc tac cta tac ctc cct tgc 26089

Leu Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro Cys

865 870 875

ttt tcc tac tta gtg gtg ctt tgt tgc tgg ttt aag aaa tgg gga aga 26137

Phe Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly Arg

880 885 890

tca ccc tagtgtgcgg tgtgctggtg acggtggtgc tttcgattct gggaggggga 26193

Ser Pro

895

agcgcggctg tagtgacgga gaagaaggcc gatccctgct tgactttcaa ccccgataaa 26253

tgccggctga gttttcagcc cgatggcaat cggtgcgcgg tgttgatcaa gtgcggatgg 26313

gaatgcgaga gtgtggcgat tcagtataaa aacaagacgc ggaacaatac tctcgcgtcc 26373

acatggcagc ccggggaccc cgagtggtac accgtctctg tccctggtgc tgacggctcc 26433

ctccgcacgg tgaacaacac tttcattttt gagcacatgt gcgaaaccgc catgttcatg 26493

agcaagcagt acggtatgtg gcctccacga aaagagaata tcgtggtctt ctccatcgct 26553

tacagcgcgt gcacggtgct aatcaccgcg atcgtgtgcc tgagcattca catgctcatc 26613

gctattcgcc ccagaaataa tgccgagaaa gagaaacagc cataacacac ttttttcaca 26673

cacctttttc agaccatggc ctctgttaaa gttttgatca tagtcagttt aattgcttat 26733

gcctctggct atgcaaacat taaaaaaacc ctctatgttg gttctgactc tatattagag 26793

ggtacccaat ccggagctaa ggtttcatgg tatttttata gaaccacaac tgatccagtt 26853

aaactctgta agggtgaatt accacgcacg cataaaactc cacttacatt cagctgcagc 26913

aataataatc ttacactttt ttcggtcact aaacaatatg ctggcactta ctacagcaca 26973

aattttcata ctgggcaaga taaatactat actattacag tggccgatcc aaccactcct 27033

agaactacta ctaaacccac cacaactaaa aagcccacta cacctaaaaa gcctaccaca 27093

cccaaaacca ctaagacaac aactaagacc actaccacag agccaaccac aaccagcacc 27153

acacttgcta taactacaca cacggagctg acctcacaag caactactga aaatggtttt 27213

gccctattgc aaaaggggga taacagtagc agcagtcctc tgcctactac ccccagtgag 27273

gaaataccta agtccatggt tggcattatc gctgccgtag tggtgtgtat gctgattatc 27333

atcttgtgca tgatgtacta tgcctgctac tacagaaaac acaggctgaa caataagctg 27393

gaccccctac tgaatgttga tttttaattt tttagaacca tgaagatcct aagccttttt 27453

gttttttcta taattattac ctctgctcta tgcaattcca acgatgacgt tactgtcgtt 27513

gttggatcca attacacact gaaaggacct ccaaaaggta tgctttcgtg gtattgctgg 27573

tttggaaacg atgagcaaca aacagagctc tgcaatctta aaaatggaaa agtctcaaac 27633

ccaaaaatct ccaattatat atgcaatggc actgatttaa tactgctcaa tatcacgaaa 27693

gcatatgctg gcagttatac ttgtcctgga gacaatgatg acatgatttt taaaaaattg 27753

attgtggttg atcccactac cccagcacca cccaccacaa ctaccaaggc acataccaca 27813

gacacacagg aaaccactcc agaggcagaa gtagcagagt tagcaaagca gattcatgaa 27873

gattcctttg ttgccaatac ccccacacac cccggaccgc aatgtccagg gccattagtc 27933

agcggcattg tcggtgtgct ttgcgggtta gcagttataa tcatctgcat gttcattttt 27993

gcttgctgct acagaaggct ccaccgacaa aaatcagacc cactgctgaa cctctatgtt 28053

taatttttga ttttccagag ccatgaaggc acttagcact ttagtatttt tgtccttgat 28113

tggcattgta ttcagtgctg ggtttttgaa aaatcttacc attattgaag gcgaaaatgc 28173

aacactggta ggcatcagtg gtcagaatgt tagctggtta aaataccatc tagatgggtg 28233

gaaacctatt tgcacctgga atgtcagtgt gtacacatgc catggtgtta acctcaccat 28293

taccaatgcc acccaagatc agaatggcag gtttaagggt cagagtttta ctagcaacaa 28353

tgggtatgaa acccataaca tgttcatcta tgatgtcact gtcatatcaa ataagactac 28413

acctaccacc cagacaccca ctacacacag ctcaacacat gccatgcaga ccactcagac 28473

aaccacatac actacatcca ctgaacccac caccactaca gcagaggtaa ccagcacagc 28533

gcctcagccc caggcattgg ctttaagggc tgcacagtct agcagcatga ctgctaaaac 28593

caatgagcag actactgaat ttttgtccac tactcagagt cgcaccacag ctacctcgag 28653

tgccttctct agcaccgcca atctcacctc gctttcctct atgccaatca gtaatgctac 28713

taccctcccc gctcctcttc ccactcctct gaagcaatcc gagtctagca cgcagctgca 28773

gatcaccctg ctcattgtga tcggggtggt catcctggca gtgctgctct actttatctt 28833

ctgccgccgc atccccaacg cgaaaccggc ctacaagccc attgttatcg ggacgccgga 28893

gccgcttcag gtggagggag gtctaaggaa tcttctcttc tcttttacag tatggtgatt 28953

tgaactatga ttcctagaca tttcattatc acttctctaa tctgtgtgct ccaagtctgt 29013

gccaccctcg ctctcgtggc taacgcgagt ccagactgca ttggagcgtt cgcctcctac 29073

gtgctctttg ccttcatcac ctgcatctgc tgctgtagca tagtctgcct gcttatcacc 29133

ttcttccagt tcattgactg ggtctttgtg cgcatcgcct acctgcgcca ccatccccag 29193

taccgcgacc agagagtggc tcaactgttg aggctcatct gatgataagc atgcgggctc 29253

tgctactact cgcgcttgca ctagctcccc tcgccgcccc cttctccctc aaatccccca 29313

cccaatcccc tgaagaggtt cgaaaatgta aattccaaga accctggaaa ttcctcaaat 29373

gctaccagct caaatcagaa atgcacccca gctggatcat gatcattgga atcgtgaaca 29433

tccttgcctg taccctcatc tcctttgtga tttacccctg ctttaacttt gggtggaacg 29493

cacccgaggc gctctggctc ccgcctgatc ccgacacacc accacagcag cagcaaaatc 29553

aggcacatgc accaccacag cctaggccac aatacatgcc catcttagac tatgaggccg 29613

agccacagcg agccatgctt cctgctatta gttacttcaa tctaaccggc ggag atg 29670

Met

act gac ccc atg gcc aac aac acc gtc aac gac ctc ctg gac atg gac 29718

Thr Asp Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met Asp

900 905 910

ggc cgc gcc tcg gag cag cga ctc gcc caa ctc cgc atc cgc cag cag 29766

Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln Gln

915 920 925

cag gag aga gcc gtc aag gag ctg cag gat gcg gtg gcc atc cac cag 29814

Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His Gln

930 935 940

tgc aag aga ggc atc ttc tgc ctg gtg aaa cag gcc aag atc tcc ttc 29862

Cys Lys Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Ser Phe

945 950 955 960

gag gtc act tcc acc gac cat cgc ctc tcc tac gag ctc ctg cag cag 29910

Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln Gln

965 970 975

cgc cag aag ttc acc tgc ctg gtc gga gtc aac ccc atc gtc atc acc 29958

Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile Thr

980 985 990

cag cag tct ggc gat acc aag ggg tgc atc cac tgc tcc tgc gac tcc 30006

Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp Ser

995 1000 1005

ccc gag tgc gtt cac acc ctg atc aag acc ctc tgc ggc ctc cgc 30051

Pro Glu Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu Arg

1010 1015 1020

gac ctc ctc ccc atg aac t 30070

Asp Leu Leu Pro Met Asn

1025

<210> SEQ ID NO 80

<211> LENGTH: 496

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 80

Met Glu Ser Arg Asn Pro Phe Gln Gln Gly Leu Pro Ala Gly Phe Leu

1 5 10 15

Ser Ser Ser Phe Val Glu Asn Met Glu Val Pro Ala Pro Glu Cys Asn

20 25 30

Leu Arg Leu Leu Ala Gly Thr Ala Ala Arg His Ser Glu Asp Pro Glu

35 40 45

Ser Pro Gly Glu Ser Gln Gly Thr Pro Thr Ser Pro Ala Ala Ala Ala

50 55 60

Gly Gly Ser Arg Arg Glu Pro Glu Ser Arg Pro Gly Pro Ser Gly Gly

65 70 75 80

Gly Gly Val Ala Asp Leu Phe Pro Glu Leu Arg Arg Val Leu Thr Arg

85 90 95

Ser Ser Ser Gly Arg Glu Arg Gly Ile Lys Arg Glu Arg His Asp Glu

100 105 110

Thr Asn His Arg Thr Glu Leu Thr Val Gly Leu Met Ser Arg Lys Arg

115 120 125

Pro Glu Thr Val Trp Trp His Glu Val Gln Ser Thr Gly Thr Asp Glu

130 135 140

Val Ser Val Met His Glu Lys Phe Ser Leu Glu Gln Val Lys Thr Cys

145 150 155 160

Trp Leu Glu Pro Glu Asp Asp Trp Glu Val Ala Ile Arg Asn Tyr Ala

165 170 175

Lys Leu Ala Leu Arg Pro Asp Lys Lys Tyr Lys Ile Thr Lys Leu Ile

180 185 190

Asn Ile Arg Asn Ala Cys Tyr Ile Ser Gly Asn Gly Ala Glu Val Glu

195 200 205

Ile Cys Leu Gln Asp Arg Val Ala Phe Arg Cys Cys Met Met Asn Met

210 215 220

Tyr Pro Gly Val Val Gly Met Asp Gly Val Thr Phe Met Asn Met Arg

225 230 235 240

Phe Arg Gly Asp Gly Tyr Asn Gly Thr Val Phe Met Ala Asn Thr Lys

245 250 255

Leu Thr Val His Gly Cys Ser Phe Phe Gly Phe Asn Asn Thr Cys Ile

260 265 270

Glu Ala Trp Gly Gln Val Gly Val Arg Gly Cys Ser Phe Ser Ala Asn

275 280 285

Trp Met Gly Val Val Gly Arg Thr Lys Ser Val Leu Ser Val Lys Lys

290 295 300

Cys Leu Phe Glu Arg Cys His Leu Gly Val Met Ser Glu Gly Glu Ala

305 310 315 320

Arg Ile Arg His Cys Ala Ser Thr Glu Thr Gly Cys Phe Val Leu Cys

325 330 335

Lys Gly Asn Ala Lys Ile Lys His Asn Met Ile Cys Gly Ala Ser Asp

340 345 350

Glu Arg Gly Tyr Gln Met Leu Thr Cys Ala Ser Gly Asn Ser His Met

355 360 365

Leu Ala Thr Val His Val Ala Ser His Ser Arg Lys Pro Trp Pro Glu

370 375 380

Phe Glu His Asn Val Met Thr Arg Cys Asn Met His Leu Gly Ser Arg

385 390 395 400

Arg Gly Met Phe Met Pro Tyr Gln Cys Asn Leu Asn Tyr Val Lys Val

405 410 415

Leu Leu Glu Pro Asp Ala Met Ser Arg Val Ser Leu Thr Gly Val Phe

420 425 430

Asp Met Asn Val Glu Val Trp Lys Ile Leu Arg Tyr Asp Glu Ser Lys

435 440 445

Thr Arg Cys Arg Ala Cys Glu Cys Gly Gly Lys His Ala Arg Phe Gln

450 455 460

Pro Val Cys Val Glu Val Thr Glu Asp Leu Arg Pro Asp His Leu Val

465 470 475 480

Leu Ser Cys Thr Gly Thr Glu Phe Gly Ser Ser Gly Glu Glu Ser Asp

485 490 495

<210> SEQ ID NO 81

<211> LENGTH: 188

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 81

Met Pro Arg Gly Asn Lys Lys Leu Lys Val Glu Leu Pro Pro Val Glu

1 5 10 15

Asp Leu Glu Glu Asp Trp Glu Asn Ser Ser Gln Ala Glu Glu Glu Glu

20 25 30

Met Glu Glu Asp Trp Asp Ser Thr Gln Ala Glu Glu Asp Ser Leu Gln

35 40 45

Asp Ser Leu Glu Glu Asp Glu Glu Glu Ala Glu Glu Glu Val Glu Glu

50 55 60

Ala Ala Ala Ala Arg Pro Ser Ser Ser Ala Gly Glu Lys Ala Ser Ser

65 70 75 80

Thr Asp Thr Ile Ser Ala Pro Gly Arg Gly Pro Ala Arg Pro His Ser

85 90 95

Arg Trp Asp Glu Thr Gly Arg Phe Pro Asn Pro Thr Thr Gln Thr Gly

100 105 110

Lys Lys Glu Arg Gln Gly Tyr Lys Ser Trp Arg Gly His Lys Asn Ala

115 120 125

Ile Val Ser Cys Leu Gln Ala Cys Gly Gly Asn Ile Ser Phe Thr Arg

130 135 140

Arg Tyr Leu Leu Phe His Arg Gly Val Asn Phe Pro Arg Asn Ile Leu

145 150 155 160

His Tyr Tyr Arg His Leu His Ser Pro Tyr Tyr Phe Gln Glu Glu Ala

165 170 175

Ala Ala Glu Lys Asp Gln Gln Lys Thr Ser Ser Ser

180 185

<210> SEQ ID NO 82

<211> LENGTH: 211

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 82

Met Lys Val Phe Val Val Cys Cys Val Leu Ser Ile Ile Lys Ala Glu

1 5 10 15

Ile Ser Asp Tyr Ser Gly Leu Asn Cys Gly Val Ser Ala Ser Ile Asn

20 25 30

Arg Ser Leu Thr Phe Thr Gly Asn Glu Thr Glu Leu Gln Leu Gln Cys

35 40 45

Lys Pro His Lys Lys Tyr Leu Thr Trp Leu Tyr Gln Gly Ser Pro Ile

50 55 60

Ala Val Val Asn His Cys Asp Asp Asp Gly Val Leu Leu Asn Gly Pro

65 70 75 80

Ala Asn Leu Thr Phe Ser Thr Arg Arg Ser Lys Leu Glu Leu Phe Gln

85 90 95

Pro Phe Leu Pro Gly Thr Tyr Gln Cys Ile Ser Gly Pro Cys His His

100 105 110

Thr Phe His Leu Ile Pro Asn Thr Thr Ser Ser Pro Ala Pro Leu Pro

115 120 125

Thr Asn Asn Gln Thr Asn His His Gln Arg Tyr Arg Arg Asp Leu Val

130 135 140

Glu Ser Asn Thr Thr His Thr Gly Gly Glu Leu Arg Gly Pro Glu Ser

145 150 155 160

Ser Gly Ile Tyr Tyr Gly Pro Trp Glu Val Val Gly Leu Ile Ala Leu

165 170 175

Gly Leu Val Ala Gly Gly Leu Leu Ala Leu Cys Tyr Leu Tyr Leu Pro

180 185 190

Cys Phe Ser Tyr Leu Val Val Leu Cys Cys Trp Phe Lys Lys Trp Gly

195 200 205

Arg Ser Pro

210

<210> SEQ ID NO 83

<211> LENGTH: 134

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 83

Met Thr Asp Pro Met Ala Asn Asn Thr Val Asn Asp Leu Leu Asp Met

1 5 10 15

Asp Gly Arg Ala Ser Glu Gln Arg Leu Ala Gln Leu Arg Ile Arg Gln

20 25 30

Gln Gln Glu Arg Ala Val Lys Glu Leu Gln Asp Ala Val Ala Ile His

35 40 45

Gln Cys Lys Arg Gly Ile Phe Cys Leu Val Lys Gln Ala Lys Ile Ser

50 55 60

Phe Glu Val Thr Ser Thr Asp His Arg Leu Ser Tyr Glu Leu Leu Gln

65 70 75 80

Gln Arg Gln Lys Phe Thr Cys Leu Val Gly Val Asn Pro Ile Val Ile

85 90 95

Thr Gln Gln Ser Gly Asp Thr Lys Gly Cys Ile His Cys Ser Cys Asp

100 105 110

Ser Pro Glu Cys Val His Thr Leu Ile Lys Thr Leu Cys Gly Leu Arg

115 120 125

Asp Leu Leu Pro Met Asn

130

<210> SEQ ID NO 84

<211> LENGTH: 880

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1295

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (7)..(571)

<223> OTHER INFORMATION: E1a

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (665)..(870)

<223> OTHER INFORMATION: E1a

<400> SEQUENCE: 84

tgaaag atg agg cac ctg aga gac ctg ccc gat gag aaa atc atc atc 48

Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile

1 5 10

gct tcc ggg aac gag att ctg gaa ctg gtg gta aat gcc atg atg ggc 96

Ala Ser Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly

15 20 25 30

gac gac cct ccg gag ccc ccc acc cca ttt gag aca cct tcg cta cac 144

Asp Asp Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His

35 40 45

gat ttg tat gat ctg gag gtg gat gtg ccc gag gac gac ccc aac gag 192

Asp Leu Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu

50 55 60

gag gcg gta aat gat tta ttt agc gat gcc gca ctg cta gct gcc gag 240

Glu Ala Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu

65 70 75

gag gct tcg agc cct agc tca gac agc gac tct tca ctg cat acc cct 288

Glu Ala Ser Ser Pro Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro

80 85 90

aga ccc ggc aga ggt gag aaa aag atc ccc gag ctt aaa ggg gaa gag 336

Arg Pro Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu

95 100 105 110

atg gac ttg cgc tgc tat gag gaa tgc ttg ccc ccg agc gat gat gag 384

Met Asp Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu

115 120 125

gac gag cag gcg atc cag aac gta gcg agc cag gga gtg caa gcc gcc 432

Asp Glu Gln Ala Ile Gln Asn Val Ala Ser Gln Gly Val Gln Ala Ala

130 135 140

agc gag agc ttt gcg ctg gac tgc ccg cct ctg ccc gga cac ggc tgt 480

Ser Glu Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys

145 150 155

aag tct tgt gaa ttt cat cgc ttg aat act gga gat aaa gct gtg tta 528

Lys Ser Cys Glu Phe His Arg Leu Asn Thr Gly Asp Lys Ala Val Leu

160 165 170

tgt gca ctt tgc tat atg aga gct tac aac cat tgt gtt tac a 571

Cys Ala Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr

175 180 185

gtaagtgtga ttaagttgaa ctttagaggg aggcagagag cggggtgact gggcgatgac 631

tggtttattt atgtatatat gttctttata tag gt ccc gtc tct gac gca gat 684

Ser Pro Val Ser Asp Ala Asp

190 195

gat gag acc ccc act aca gag tcc att tcg tca ccc cca gaa att ggc 732

Asp Glu Thr Pro Thr Thr Glu Ser Ile Ser Ser Pro Pro Glu Ile Gly

200 205 210

aca tct cca cct gag aat att gtt aga cca gtt cct gtt aga gcc act 780

Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val Arg Ala Thr

215 220 225

ggg agg aga gca gct gtg gaa tgt ttg gat gac ttg cta cag ggt ggg 828

Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu Gln Gly Gly

230 235 240

gat gaa cct ttg gac ttg tgt acc cgg aaa cgc ccc agg cac 870

Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg His

245 250 255

taagtgccac 880

<210> SEQ ID NO 85

<211> LENGTH: 257

<212> TYPE: PRT

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Synthetic Construct

<400> SEQUENCE: 85

Met Arg His Leu Arg Asp Leu Pro Asp Glu Lys Ile Ile Ile Ala Ser

1 5 10 15

Gly Asn Glu Ile Leu Glu Leu Val Val Asn Ala Met Met Gly Asp Asp

20 25 30

Pro Pro Glu Pro Pro Thr Pro Phe Glu Thr Pro Ser Leu His Asp Leu

35 40 45

Tyr Asp Leu Glu Val Asp Val Pro Glu Asp Asp Pro Asn Glu Glu Ala

50 55 60

Val Asn Asp Leu Phe Ser Asp Ala Ala Leu Leu Ala Ala Glu Glu Ala

65 70 75 80

Ser Ser Pro Ser Ser Asp Ser Asp Ser Ser Leu His Thr Pro Arg Pro

85 90 95

Gly Arg Gly Glu Lys Lys Ile Pro Glu Leu Lys Gly Glu Glu Met Asp

100 105 110

Leu Arg Cys Tyr Glu Glu Cys Leu Pro Pro Ser Asp Asp Glu Asp Glu

115 120 125

Gln Ala Ile Gln Asn Val Ala Ser Gln Gly Val Gln Ala Ala Ser Glu

130 135 140

Ser Phe Ala Leu Asp Cys Pro Pro Leu Pro Gly His Gly Cys Lys Ser

145 150 155 160

Cys Glu Phe His Arg Leu Asn Thr Gly Asp Lys Ala Val Leu Cys Ala

165 170 175

Leu Cys Tyr Met Arg Ala Tyr Asn His Cys Val Tyr Ser Pro Val Ser

180 185 190

Asp Ala Asp Asp Glu Thr Pro Thr Thr Glu Ser Ile Ser Ser Pro Pro

195 200 205

Glu Ile Gly Thr Ser Pro Pro Glu Asn Ile Val Arg Pro Val Pro Val

210 215 220

Arg Ala Thr Gly Arg Arg Ala Ala Val Glu Cys Leu Asp Asp Leu Leu

225 230 235 240

Gln Gly Gly Asp Glu Pro Leu Asp Leu Cys Thr Arg Lys Arg Pro Arg

245 250 255

His

<210> SEQ ID NO 86

<211> LENGTH: 36528

<212> TYPE: DNA

<213> ORGANISM: Unknown

<220> FEATURE:

<223> OTHER INFORMATION: Simian adenovirus A1309

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (1)..(129)

<223> OTHER INFORMATION: ITR

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (1599)..(2177)

<223> OTHER INFORMATION: E1b\19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (3456)..(3926)

<223> OTHER INFORMATION: pIX

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (3991)..(5612)

<223> OTHER INFORMATION: IVa2 complement (3991..5321,5600..5612)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (5094)..(13857)

<223> OTHER INFORMATION: pol complement (5094..8660,13849..13857)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (8462)..(13857)

<223> OTHER INFORMATION: pTP complement (8462..10393,13849..13857)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (10850)..(12022)

<223> OTHER INFORMATION: 52K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (12049)..(13815)

<223> OTHER INFORMATION: pIIIa

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (13897)..(15516)

<223> OTHER INFORMATION: penton

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (15523)..(16104)

<223> OTHER INFORMATION: pVII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (16152)..(17195)

<223> OTHER INFORMATION: V

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17223)..(17453)

<223> OTHER INFORMATION: pX

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (17525)..(18253)

<223> OTHER INFORMATION: pVI

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (18348)..(21164)

<223> OTHER INFORMATION: hexon

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (21189)..(21809)

<223> OTHER INFORMATION: protease

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (21898)..(23433)

<223> OTHER INFORMATION: DBP complement (21898...23433)

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (23456)..(25843)

<223> OTHER INFORMATION: 100K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (26469)..(27149)

<223> OTHER INFORMATION: pVIII

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (27153)..(27470)

<223> OTHER INFORMATION: E3\12.5K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28035)..(28565)

<223> OTHER INFORMATION: E3\gp19K

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (28598)..(29215)

<223> OTHER INFORMATION: E3\CR1-beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29231)..(29845)

<223> OTHER INFORMATION: E3\CR1-gamma

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (29863)..(30741)

<223> OTHER INFORMATION: E3\CR1-delta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (31033)..(31464)

<223> OTHER INFORMATION: E3\RID=beta

<220> FEATURE:

<221> NAME/KEY: CDS

<222> LOCATION: (32137)..(33411)

<223> OTHER INFORMATION: fiber

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33523)..(34676)

<223> OTHER INFORMATION: E4\orf6/7 complement (33523..33774,34506..34676)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (33774)..(34676)

<223> OTHER INFORMATION: E4\orf6 complement (33774..34676)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34585)..(34947)

<223> OTHER INFORMATION: E4\orf4 complement (34585..34947)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (34960)..(35310)

<223> OTHER INFORMATION: E4\orf3 complement (34960..35310)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35310)..(35696)

<223> OTHER INFORMATION: E4\orf2 complement (35310..35696)

<220> FEATURE:

<221> NAME/KEY: misc_feature

<222> LOCATION: (35739)..(36110)

<223> OTHER INFORMATION: E4\orf1 complement (35739..36110)

<220> FEATURE:

<221> NAME/KEY: repeat_region

<222> LOCATION: (36400)..(36528)

<223> OTHER INFORMATION: ITR complement (36400..36528)

<400> SEQUENCE: 86

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggtatttga 60

atttggggat gcggggcggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120

tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600

gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660

gtgacgaccc tccggagccc cctaccccat ttgaagcgcc ttcgctgtac gatttgtatg 720

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960

aggaggcgat tcgagctgca gcgaaccagg gagtgaaaac agcgagcgag ggctttagcc 1020

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140

acagtaagtg tgattaactt tagctgggga ggcagagggt gactgggtgc tgactggttt 1200

atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgtagatgag acccccacta 1260

cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattcata 1320

gaccagttgc agtgagagtc accgggcgta gagcagctgt ggagagtttg gatgacttgc 1380

tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440

cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500

tccgtgttga ctttaagtgc gtggtttatg actcaggggt ggggactgtg ggtatataag 1560

caggtgcaga cctgtgtggt cagttcagag caggactc atg gag atc tgg aca gtc 1616

Met Glu Ile Trp Thr Val

1 5

ttg gaa gac ttt cac cag act aga cag ctg cta gag aac tca tcg gag 1664

Leu Glu Asp Phe His Gln Thr Arg Gln Leu Leu Glu Asn Ser Ser Glu

10 15 20

gga gtc tct tac ctg tgg aga ttc tgc ttc ggt ggg gct cta gct aag 1712

Gly Val Ser Tyr Leu Trp Arg Phe Cys Phe Gly Gly Ala Leu Ala Lys

25 30 35

cta gtc cat agg gcc aaa cag gat tat aag gat caa ttt gag gat att 1760

Leu Val His Arg Ala Lys Gln Asp Tyr Lys Asp Gln Phe Glu Asp Ile

40 45 50

ttg aga gag tgt cct ggt att ttt gac tct ctc aac ttg ggc cat cag 1808

Leu Arg Glu Cys Pro Gly Ile Phe Asp Ser Leu Asn Leu Gly His Gln

55 60 65 70

tct cac ttt aac cag agt att ctg aga gcc ctt gac ttt tca act cct 1856

Ser His Phe Asn Gln Ser Ile Leu Arg Ala Leu Asp Phe Ser Thr Pro

75 80 85

ggc aga act acc gcc gcg gta gcc ttt ttt gcc ttt atc ctt gac aaa 1904

Gly Arg Thr Thr Ala Ala Val Ala Phe Phe Ala Phe Ile Leu Asp Lys

90 95 100

tgg agt caa gaa acc cat ttc agc agg gat tac cgt ctg gac tgc tta 1952

Trp Ser Gln Glu Thr His Phe Ser Arg Asp Tyr Arg Leu Asp Cys Leu

105 110 115

gca gta gct ttg tgg aga aca tgg agg tgc cag cgc ctg aat gca atc 2000

Ala Val Ala Leu Trp Arg Thr Trp Arg Cys Gln Arg Leu Asn Ala Ile

120 125 130

tcc ggc tac ttg cca gta cag ccg gta gac acg ctg agg atc ctg agt 2048

Ser Gly Tyr Leu Pro Val Gln Pro Val Asp Thr Leu Arg Ile Leu Ser

135 140 145 150

ctc cag tca ccc cag gaa cac caa cgc cgc cag cag ccg cag cag gag 2096

Leu Gln Ser Pro Gln Glu His Gln Arg Arg Gln Gln Pro Gln Gln Glu

155 160 165

cag cag caa gag gag gag gag gag gac cga gaa gag aac ccg aga gcc 2144

Gln Gln Gln Glu Glu Glu Glu Glu Asp Arg Glu Glu Asn Pro Arg Ala

170 175 180

ggt ctg gac cct ccg gtg gcg gag gag gag gag tagctgactt gtttcccgag 2197

Gly Leu Asp Pro Pro Val Ala Glu Glu Glu Glu

185 190

ctgcgccggg tgctgactag gtcttccagt ggacgggaga gggggattaa gcgggagagg 2257

catgaggaga ctagccatag aactgaactg actgtcagtc tgatgagccg caggcgtcca 2317

gaatcggtgt ggtggcatga ggttcagtcg caggggatag atgaggtctc agtgatgcat 2377

gagaaatatt ccctagaaca agtcaagact tgttggttgg agcctgagga tgattgggag 2437

gtagccatca ggaattatgc caagctagcg ctgaagccag acaagaagta caagattacc 2497

aagttgatta atatcagaaa ttcctgctac atttcaggga atggggccga ggtggagatc 2557

agtacccagg agagggtggc cttcagatgc tgcatgatga atatgtaccc gggggtggtg 2617

ggcatggagg gagtcacctt tatgaacgcg aggttcaggg gtgatgggta taatggggtg 2677

gtctttatgg ccaacaccaa gctgacagtg cacggatgct ccttctttgg cttcaataac 2737

atgtgcatcg aggcctgggg cagtgtttca gtgaggggat gcagcttttc agccaactgg 2797

atgggggttg tgggcagaac caagagcatg gtgtcagtga agaaatgtct gttcgagagg 2857

tgccacctgg gggtgatgag cgagggcgaa gccaaagtca aacactgcgc ctctaccgag 2917

acgggctgct ttgtgctgat caagggcaat gccaaagtca agcataacat gatctgtggg 2977

gcctctgatg agcgcggcta ccagatgctg acctgcgccg gtgggaacag ccatatgctg 3037

gccactgtgc atgtgtcctc gcacccccgc aagacatggc ccgagttcga gcacaacgtc 3097

atgacccgct gcaatgtgca cctgggctcc cgccgaggca tgttc