GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:03:23 Sequence gi568815577f:36064734_36393568 : 328835 bp : 45.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5201 5341 141 1 0 64 62 136 0.958 6.73 1.02 Intr + 5359 5671 313 0 1 113 51 776 0.915 72.16 1.03 Intr + 6217 6324 108 2 0 79 113 5 0.786 2.36 1.04 Term + 7713 8149 437 1 2 59 49 422 0.567 30.85 1.05 PlyA + 8410 8415 6 1.05 2.05 PlyA - 9545 9540 6 1.05 2.04 Term - 13320 13103 218 1 2 86 42 86 0.034 1.11 2.03 Intr - 30566 30492 75 0 0 100 84 46 0.363 4.89 2.02 Intr - 33526 33391 136 1 1 52 74 71 0.364 2.24 2.01 Init - 39908 39906 3 2 0 91 89 0 0.241 0.40 2.00 Prom - 40469 40430 40 -3.66 3.00 Prom + 44862 44901 40 -2.46 3.01 Init + 48977 49034 58 1 1 104 99 33 0.922 7.47 3.02 Intr + 50301 50347 47 1 2 66 94 6 0.109 -2.87 3.03 Intr + 62615 62732 118 1 1 64 42 116 0.378 4.64 3.04 Term + 65739 66643 905 1 2 76 48 286 0.373 15.89 3.05 PlyA + 67050 67055 6 1.05 4.00 Prom + 67063 67102 40 -7.06 4.01 Init + 67717 68053 337 0 1 99 16 366 0.315 27.94 4.02 Intr + 70404 70748 345 1 0 47 39 699 0.333 56.26 4.03 Intr + 73092 73199 108 1 0 82 103 47 0.969 5.86 4.04 Intr + 81343 81775 433 2 1 98 57 315 0.384 22.10 4.05 Term + 85923 86043 121 0 1 131 55 27 0.371 1.65 4.06 PlyA + 86623 86628 6 1.05 5.00 Prom + 86936 86975 40 -2.86 5.01 Init + 87443 87469 27 1 0 71 111 11 0.563 1.35 5.02 Intr + 90786 90959 174 2 0 49 99 34 0.340 0.74 5.03 Intr + 92007 92210 204 2 0 50 115 47 0.337 3.00 5.04 Intr + 92767 92862 96 0 0 82 87 44 0.635 3.91 5.05 Term + 96950 96982 33 1 0 78 42 61 0.209 -1.91 5.06 PlyA + 97291 97296 6 1.05 6.00 Prom + 97755 97794 40 -6.76 6.01 Init + 98103 98274 172 2 1 104 88 68 0.666 8.00 6.02 Term + 100774 100823 50 2 2 49 49 102 0.469 -0.13 6.03 PlyA + 101548 101553 6 -0.45 7.02 PlyA - 102757 102752 6 1.05 7.01 Sngl - 104803 104237 567 1 0 73 42 839 0.959 73.95 7.00 Prom - 111274 111235 40 -6.06 8.00 Prom + 114937 114976 40 -3.16 8.01 Init + 123596 123799 204 1 0 73 49 88 0.172 0.29 8.02 Intr + 126001 126162 162 0 0 53 37 85 0.091 0.07 8.03 Intr + 134337 134518 182 2 2 105 80 94 0.982 8.97 8.04 Intr + 135598 135768 171 1 0 57 96 254 0.997 22.16 8.05 Intr + 143982 144171 190 0 1 92 76 212 0.637 19.99 8.06 Intr + 147241 147364 124 0 1 85 52 56 0.819 1.86 8.07 Intr + 149348 149457 110 0 2 93 88 22 0.949 2.70 8.08 Intr + 149709 149823 115 2 1 53 63 127 0.985 6.72 8.09 Intr + 154639 154759 121 2 1 115 86 22 0.996 4.25 8.10 Intr + 158498 158617 120 2 0 95 97 10 0.805 2.21 8.11 Intr + 160832 160934 103 2 1 44 97 134 0.978 10.08 8.12 Intr + 162953 163144 192 1 0 80 74 157 0.678 13.19 8.13 Intr + 165717 166401 685 2 1 82 94 603 0.999 51.44 8.14 Intr + 168071 168342 272 0 2 93 68 196 0.987 15.36 8.15 Intr + 172529 172731 203 1 2 98 49 208 0.310 15.98 8.16 Intr + 175032 175222 191 0 2 113 105 236 0.998 27.03 8.17 Intr + 180315 181944 1630 1 1 96 96 2257 0.987 214.62 8.18 Intr + 182784 182895 112 0 1 109 94 26 0.999 5.68 8.19 Intr + 183647 183835 189 1 0 100 69 54 0.957 4.48 8.20 Intr + 185009 185101 93 1 0 41 86 52 0.531 0.46 8.21 Intr + 186429 186551 123 2 0 109 98 -6 0.870 3.28 8.22 Intr + 189039 189176 138 2 0 71 78 118 0.819 9.76 8.23 Intr + 195944 195999 56 1 2 81 85 2 0.078 -3.02 8.24 Intr + 199015 199081 67 1 1 68 72 89 0.302 4.21 8.25 Intr + 205280 205424 145 1 1 77 78 107 0.966 8.46 8.26 Intr + 212288 212367 80 0 2 83 84 56 0.560 3.97 8.27 Intr + 213242 213351 110 1 2 97 97 78 0.994 8.68 8.28 Intr + 213476 213622 147 2 0 105 52 128 0.845 10.25 8.29 Intr + 220791 220926 136 0 1 95 61 75 0.812 6.07 8.30 Intr + 224023 224078 56 0 2 74 99 51 0.897 2.58 8.31 Intr + 227371 227500 130 1 1 114 98 23 0.797 6.60 8.32 Term + 228587 228838 252 1 0 95 41 168 0.998 8.44 8.33 PlyA + 229367 229372 6 1.05 9.02 PlyA - 230084 230079 6 -0.45 9.01 Sngl - 231008 230577 432 2 0 43 55 613 0.970 47.78 9.00 Prom - 249077 249038 40 -0.56 10.02 PlyA - 249830 249825 6 1.05 10.01 Sngl - 255713 255342 372 2 0 92 42 243 0.525 14.23 10.00 Prom - 263464 263425 40 -8.56 11.00 Prom + 265788 265827 40 -3.06 11.01 Init + 271698 271746 49 2 1 86 58 7 0.220 -3.38 11.02 Intr + 272141 272273 133 0 1 53 100 72 0.763 4.60 11.03 Intr + 272999 273131 133 2 1 69 -1 112 0.149 1.05 11.04 Intr + 274046 274188 143 1 2 25 92 64 0.263 -0.35 11.05 Intr + 276666 276813 148 0 1 49 75 157 0.690 10.74 11.06 Intr + 280179 280298 120 2 0 9 98 65 0.340 0.29 11.07 Intr + 284578 284675 98 0 2 99 91 27 0.903 2.91 11.08 Intr + 291887 291991 105 2 0 30 98 77 0.767 2.23 11.09 Intr + 295222 295344 123 1 0 81 81 42 0.833 2.50 11.10 Intr + 295451 295525 75 2 0 49 86 89 0.864 3.43 11.11 Intr + 304255 305143 889 1 1 42 105 541 0.971 42.52 11.12 Intr + 307641 307798 158 2 2 44 85 125 0.964 6.61 11.13 Term + 310410 310563 154 0 1 71 39 136 0.990 4.39 11.14 PlyA + 311228 311233 6 -0.45 12.00 Prom + 311578 311617 40 -4.26 12.01 Init + 320619 320718 100 2 1 41 96 196 0.719 14.12 12.02 Intr + 321429 321529 101 1 2 1 59 150 0.676 3.23 12.03 Intr + 322865 322997 133 1 1 38 101 82 0.992 4.72 12.04 Term + 326818 326939 122 2 2 59 32 174 0.961 7.54 12.05 PlyA + 328315 328320 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18113 18166 54 1 0 63 107 46 0.857 5.28 S.002 Term + 18698 18997 300 1 0 96 44 117 0.979 3.32 S.003 Init - 106316 106250 67 2 1 61 75 64 0.815 3.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_1|332_aa MVQPRPARAGPAPARSAAGRVTHGCAPTTARLEQSLEHAAGLPGLSQVFRAPRSAMSSGI HVALVTGGNKGIGLAIVRDLCRLFSGDVVLTARDVTRGQAAVQQLQAEGLSPRFHQLDID DLQSIRALRDFLRKEYGGLDVLVNNAGIAFKVADPTPFHIQAEVTMKTNFFGTRDVCTEL LPLIKPQGRVVNVSSIMSVRALKSCSPELQQKFRSETITEEELVGLMNKFVEDTKKGVHQ KEGWPSSAYGVTKIGVTVLSRIHARKLSEQRKGDKILLNACCPGWVRTDMAGPKATKSPE EGAETPVYLALLPPDAEGPHGQFVSEKRVEQW >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_1|999_bp atggttcagccccgccccgctagggcggggcctgcgcctgcgcgctcagcggccgggcgt gtaacccacgggtgcgcgcccacgaccgccagactcgagcagtctctggaacacgctgcg gggctcccgggcctgagccaggtgttccgcgcgccccgttcagccatgtcgtccggcatc catgtagcgctggtgactggaggcaacaagggcatcggcttggccatcgtgcgcgacctg tgccggctgttctcgggggacgtggtgctcacggcgcgggacgtgacgcggggccaggcg gccgtacagcagctgcaggcggagggcctgagcccgcgcttccaccagctggacatcgac gatctgcagagcatccgcgccctgcgcgacttcctgcgcaaggagtacgggggcctggac gtgctggtcaacaacgcgggcatcgccttcaaggttgctgatcccacaccctttcatatt caagctgaagtgacgatgaaaacaaatttctttggtacccgagatgtgtgcacagaatta ctccctctaataaaaccccaagggagagtggtgaacgtatctagcatcatgagcgtcaga gcccttaaaagctgcagcccagagctgcagcagaagttccgcagtgagaccatcactgag gaggagctggtggggctcatgaacaagtttgtggaggatacaaagaagggagtgcaccag aaggagggctggcccagcagcgcatacggggtgacgaagattggcgtcaccgttctgtcc aggatccacgccaggaaactgagtgagcagaggaaaggggacaagatcctcctgaatgcc tgctgcccagggtgggtgagaactgacatggcgggacccaaggccaccaagagcccagaa gaaggtgcagagacccctgtgtacttggcccttttgcccccagatgctgagggtccccat ggacaatttgtttcagagaagagagttgaacagtggtga >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_2|143_aa MGRRSPSALKYTPGEYTLLPCEEGACFCFTFSHDCNFPEASPAMQNSMLLERGPHPDPEK GFLDLVQERIRDAALFAAKVPSPVYLRLTPRTHYKWYWDLRVQRRQFSELLQPEGALWLQ AGMVALGCTAELSSAFQCCNSRG >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_2|432_bp atgggtcgaaggagtcccagtgctttaaaatacactccaggggagtacacgttactacct tgtgaagagggtgcctgcttctgcttcaccttcagccatgattgtaactttcctgaggcc tccccagccatgcagaactcaatgttactggaaaggggtcctcatccagaccctgagaaa ggattcttggatctcgtgcaagaaagaattagggatgctgctttgtttgcagccaaagtg cccagtcctgtttatctgagactaacacctcgtacacattataagtggtattgggacctg cgtgtacagagacgtcaattctcggaattgttgcagcctgaaggtgccttatggctgcag gccggcatggttgctctgggctgtacagcagagctaagttctgcattccagtgctgcaac tcgaggggctga >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_3|375_aa MAGGNLLGGHSTGNGKNDSAPWKGVQPCWRLDPRQVLRQSLDADEGTAVDIQSFRPQKDF GRRRTLEECHVTGKGGTGTKMSNRVVCREASHAGSWYTASGPQLNAQLEGWLSQVQSTKR PARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGPSHHVPLSRCALSSVDIYRTPL YDLRIDQKIYGELWKTGMFERMSLQTDEDERSIEMHLPYTAKAMESHKDEFTIIPLLVGA LSQKNRNSELFSKYLADPSNLFVVSSDFCHWGQRFRHSYYDESQGEICRSIEHLDKMGMS IIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITELQKNGMNMSFSFLNYAQSSQCRNW QDSSVSYAAGALTVH >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_3|1128_bp atggcggggggtaaccttctagggggtcattccaccggaaacgggaagaacgactcagcc ccctggaaaggcgtgcagccctgctggcgccttgatcccaggcaggtgctcagacagtcg ctggatgcagatgagggaacggctgtggacattcagtccttcagaccacaaaaggacttt ggaaggagaagaaccctggaagaatgccatgttactggcaaaggcggcacaggcaccaag atgtccaaccgagtggtctgccgagaagccagtcacgccgggagctggtacacagcctca ggaccgcagctgaatgcacagctagaaggttggctttcacaagtacagtctacaaaaaga cctgctagagccattattgccccccatgcaggatatacgtactgtgggtcttgtgctgcc catgcttataaacaagtggatccgtctattacccggagaattttcatccttgggccttct catcatgtgcccctctctcgatgtgcactttccagtgtggatatatataggacacccctg tatgaccttcgtattgaccaaaagatttacggagaactatggaagacaggaatgtttgaa cgcatgtctctgcagacagatgaagatgaacgcagtattgaaatgcatttgccttataca gctaaagccatggaaagccataaggatgagtttaccattattcctttactggttggagct ctgagtcaaaagaacaggaattcggaacttttcagtaaatatctagcggatcctagtaat ctctttgtggtttcttctgatttctgccattggggtcaaaggttccgtcacagttactat gatgaatcccagggggagatttgtagatccattgaacatctagataaaatgggtatgagt attatagaacaattagaccctgtatcttttagcaattacttgaagaaataccataatact atatgtggaagacatcccattggggtgttattaaatgctatcacagagctccagaagaat ggaatgaatatgagtttttcgtttttgaattatgcccagtcgagccagtgtagaaactgg caagacagttcagtgagttatgcagctggagcactcacggtccactga >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_4|447_aa MPAAQSWVCCQTYVTLQRPFEKPHLDQKLKLIGEYGLQNKCEVWRVKFTLVKIHKAAREL LTLDEKDPRRLFEGIALLQWLVRTGVLDEGKMKLDYILGLKINDFLEIPTDPGGPKPGPP STQVPRAPRSAMSSCSRVALVTGANRGIGLAIARELCRQFSGDVVLTARDVARGQAAVQQ LQAEGLSPRFHQLDIDDLQSIRALRDFLRKEYGGLNVLVNNAAVAFKSDDPMPFDIKAEM TLKTNFFATRNMCNELLPIMKPHGRVVNISSLQCLRAFENCSEDLQERFHSETLTEGDLV DLMKKFVEDTKNEVHEREGWPNSPYGVSKLGVTVLSRILARRLDEKRKADRILVNACCPG PVKTDMDGKDSIRTVEEGAETPVYLALLPPDATEPQGQLVHDKVVQNCCSCHKVTLPLLT APAIGFSSKKNQLDWIRVVTVHLTPCA >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_4|1344_bp atgccagcggcccagagctgggtttgttgccaaacttatgtgaccctgcagagacccttc gagaaacctcatctcgaccaaaagctaaagctgattggtgagtatgggctccagaacaaa tgtgaggtctggagggtcaaatttaccctggtcaagatccataaggccgcccgggagctg ctgacgcttgatgagaaggacccacggcgtctgttcgaaggcattgccctgctgcagtgg ctggtccgcactggggtgctggatgagggcaagatgaagctggattacatcctgggcctg aagatcaatgatttcttagagattcctacagacccaggtggtccgaagcccggtccgccc tccacgcaggtgccccgcgctccccgctcagccatgtcgtcctgcagccgcgtggcgctg gtgaccggggccaacaggggcatcggcttggccatcgcgcgcgaactgtgccgacagttc tctggggatgtggtgctcaccgcgcgggacgtggcgcggggccaggcggccgtgcagcag ctgcaggcggagggcctgagcccgcgcttccaccaactggacatcgacgacttgcagagc atccgcgccctgcgcgacttcctgcgcaaggagtacggggggctcaacgtactggtcaac aacgcggccgtcgccttcaagagtgatgatccaatgccctttgacattaaagctgagatg acactgaagacaaatttttttgccactagaaacatgtgcaacgagttactgccgataatg aaacctcatgggagagtggtgaatatcagtagtttgcagtgtttaagggcttttgaaaac tgcagtgaagatctgcaggaaaggttccacagtgagacactcacagaaggagacctggtg gatctcatgaaaaagtttgtggaggacacaaaaaatgaggtgcatgagagggaaggctgg cccaactcaccttatggggtgtccaagttgggggtcacggtcttatcgaggatcctggcc aggcgtctggatgagaagaggaaagctgacaggattctggtgaatgcgtgctgcccagga ccagtgaagacagacatggatgggaaagacagcatcaggactgtggaggagggggctgag acccctgtctacttggccctcttgcctccagatgccactgagccacaaggccagttggtc catgacaaagttgtgcaaaactgctgcagctgccacaaggtgaccttgcctctgttgact gctccagctataggtttttcctctaaaaagaaccagctggattggatcagagttgttact gtgcatctcaccccctgtgcctga >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_5|177_aa MLTFSSTYLLIDHCLAFTNSRPGLNISKGRSGVKRSEATAEMTAFIHKWPTSGWLEPQLG GDRACAQPRPKRRGPASRLRPDPRGGEEPLPLVLRLFLGGSGSRPRRSPAQSLSPSPEGS SAPHVSAPANRATAAGSRLLAGTVTRSVALHSRLQPHNWEDFARIFLSCASIAIIPF >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_5|534_bp atgctcacctttagttctacctacttgctcattgaccactgcctcgccttcacgaactcc agacctggtctgaacatttctaaaggacgctcgggagttaaaaggagcgaggccactgca gaaatgacagccttcatccacaagtggcccacttcgggttggctggagccacagctgggt ggggacagggcctgtgctcagccccgccccaagcgccgaggcccggcttccaggctccgc cccgacccgcgcggtggggaagagccgctgccgctggttctccggctcttcctcggcggt tctggctcccggccgcgccgcagccctgcccagtctctctcccccagcccggagggctcc tccgcgccgcacgtgagcgcgcccgccaacagggccactgccgcgggttcccggctgctc gcggggactgttacccgcagtgttgccttacactcgcgactgcagccccacaattgggaa gactttgctaggatttttttgagttgtgcaagcatcgccatcatcccattttag >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_6|73_aa MPSRQGMYLAVGRGGGKKFREVDGKQALVLEGPEECVGQVEAARLLKEVVCRRVLLRGVP NPQAADRYQSVAY >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_6|222_bp atgcccagccgccagggaatgtatttagcagttggtagaggaggaggaaagaagttcaga gaggtagatgggaaacaagctctggtgctggaggggccagaggagtgtgtgggccaagtg gaggctgctaggctgcttaaagaagtggtctgccgcagagtgctactgagaggggtcccc aacccccaggccgctgaccggtaccagtccgtggcctattag >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_7|188_aa MIDIIGVTKGKGYKGVTSCWHTKKLPHKTHRGLCKVAWIGAWHPDHVAFSMACAGQKGDH HHTEINKKICKIDQGYLTEDSKPIKNNASTDYDLSDKSIDLLGGFVHYGEVTNNSVMLKG CVVGTKKRVLILCTSLLVQTRRWALEKTDLKFTDTTSKFGYGRFQTMEEKKAFMGPLEKD RIAKEEGA >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_7|567_bp atgatcgacatcatcggggtgaccaagggcaaaggctacaaaggggtcaccagttgttgg cacaccaagaagctgccccacaagacccaccgaggcctgtgcaaggtggcctggattggg gcatggcatcctgaccatgtggccttctccatggcatgtgctgggcagaaaggcgaccat caccacactgagatcaacaagaagatctgtaagattgaccagggctaccttactgaggac agcaaaccgatcaagaacaatgcctccacggactatgacctgtctgacaagagcatcgac cttctgggtggctttgtccactatggtgaagtgaccaataactctgttatgctgaaaggc tgtgtggtgggaaccaagaagcgagtgctcattctctgcacatctttgctggtgcagacc agacgatgggctctggagaagactgaccttaagttcactgacaccacctccaagtttggc tatggccgcttccagaccatggaggagaagaaagcattcatgggaccactcgagaaagac cgaattgcaaaggaagaaggagcttaa >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_8|2202_aa MALGSLAFCLFLQLISCHRHSLKLAKQPGHAISVKVECHTMSTWEVLAKCWSVPFSSCLK GTNSAPWQLQKNISHMIRILSLLASQFGFLDSETVLEVFAELFLLEKILGKAWMEDGDEK GSALQSNLRYSLLPRRLLISKRLAQCLHPALPSGVHLKALETYEIIFKIVGTKWLAKDLF LYSCGLFPLLAHAAVSVRPVLLTLYEKYFLPLQKLLLPSLQAFIVGLLPGLEEGSEISDR TDALLLRLSLVVGKEVFYTALWGSVLASPSIRLPASVFVVGHINRDAPGREQKYMLGTNH QLTDSNERAIPLLRSDIVRILSAATQTLLRRDMSLNRRLYAWLLGSDIKGNTVVPESEIS NSYEDQSSYFFEKYSKDLLVEGLAEILHQKFIDADVEERHHAYLKPFRVLISLLDKPEIG PQVVGNLFLEVIRAFYSYCRDALGSDLKLSYTQSGNSLISAIKENRNASEIVKTVNLLIT SLSTDFLWDYMTRCFEECFRPVKQRYSVRNSVSPPPTVSELCALLVFLLDVIPLELYSEV QTQYLPQVLGCLVQPLAEDMEALSLPELTHALKTCFKVLSKVQMPPSYLDTESTSGTSSP VKGENGKIILETKAVIPGDEDASFPPLKSEDSGIGLSASSPELSEHLRVPRVSLERDDVW KKGGSMQRTFLCIQELIANFASKNIFGVQLTASGEESKSEEPAGKRDRDGTQSLAANDSS RKNSWEPKPITVPQFKQMLSDLFTARGSPFKTKSSESPSSSPSSPARKNGGEWDVEKVVI DLGGSREERREAFAAACHLLLDCATFPVYLSEEETEQLCATLFQLPGAGDSSFPSWLKSL MTICCCVTDCYLQNVAISTLLEVINHSQSLALVIEDKMKRYKSSGHNPFFGKLQMVTVPP IAPGILKVIAEKTDFYQRVARVLWNQLNKETREHHVTCVELFYRLHCLAPTANICEDIIC HALLDPDKVSLSGRHLHPAAPRPLLSLFVVLDSLACTDGAIGAAAQGWLVRALSLGDVAR ILEPVLLLLLQPKTQRTSIHCLKQENSADDLHRWFNRKKTSFREACAVPEPQESGSEEHL PLSQFTTVDREAIWAEVEKEPEKYPLRGELSEEELPYYVELPDRTAHGAPDSSEHTESAD TSSCHTDSENTSSFSSPSHDLQELSNEENCCAPIPMGGRAYPKRSALLAAFQSESFKAGA KLSLVRVDSDKTQASESFSSDEEADLELQALTTSRLLKQQRERQEAVEALFKHILLYLQP YDSRRVLYAFSVLEAVLKTNPKEFIEAVSRTSMDTSSTAHLNLISNLLARHQEALIGQSF YGKLQTQVPNVCPHSLLLELLTYLCLSFLRSYYPCYLKVSHRDILGNRDVQVKSVEVLIR IMMQLVSVAKSSEGKNVEFIHSLLQRCKVQEFVLLSLSASMYTSQKRYGLATAHHGRALP EDSLFEESLINLGQDQIWSEHPLQIELLKLLQVLIVLEHHLGRAHEEAENQPDLSREWQR ALNFQQAISALQYVQPHPLTSQGLLVSAVVRGLQPAYGYGMHPAWVSLVTHSLPYFGKSL GWTVTPFVVQICKNLDDLVKQYESESVKLSVSTTSKRENISPDYPLTLLEGLTTISHFCL LEQANQNKKTMAAGDPANLRNARNAILEELPRTVNTMALLWNVLRKEETQKRPVDLLGAT KGSSSVYFKTTKNYLPHLTSVESPQVPDVAAGMSLAILHLGTTTIRQKILDFLNPLTAHL GVQLTAAVAAVWSRKKAQRHSKMKIIPTASASQLTLVDLVCALSTLQTDTLLHLVKEVVK RPPQVKGGDEKSPLVDIPVLQFCYAFLQSMLNDFVTRTPNLENKKDQKDLQEITQKILEA VGNIAGSSLEQTSWLSRNLEVKAQPQASLEESDAEEDLYDAAAASAMVSSSAPSVYSVQA LSLLAEVLASLLDMVYRSDEKEKAVPLISRLLYYVFPYLRNHSAYNAPSFRAGAQLLSSL SGYAYTKRAWRKEVLELFLDPAFFQMDTSCVQMLVDRPSPPESWVDGTQSVLCQSCGTLN LPGDALWRDDAHICLREIQTFTQLEEDLKDEDESLSYRWAFIPEVDTEGPAFLSDVEENH QECKPHTVRILELLKLKFGEISSSDEITMKSEFPLLRQHSVSSIRQLMPFFMTLNGAFKT QRQLPADSPGTPFLDFPVTDSPRILKQLEECIEYDFLEHPEC >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_8|6609_bp atggctctcggcagccttgccttctgcctgtttctccagcttatctcttgccaccgccac tccctcaagctggcaaagcaaccgggtcatgccatttcagtcaaggtggaatgccacacc atgagcacttgggaagtcctggccaagtgctggtctgttccattttcgtcctgtctaaag ggcactaacagtgccccctggcagttgcagaagaatatttctcatatgatacgtatatta agcttactggccagccagtttgggtttctggatagtgaaactgtcctggaagtgtttgct gagcttttcttgctggagaagattcttggcaaagcctggatggaggatggagatgagaaa ggtagtgctcttcagagtaacctgaggtactccttgttgccaagacggctcctcatcagc aaaagattagctcagtgtttgcaccctgccctgcccagtggtgtccacttaaaagctctg gaaacctacgagattatctttaaaatcgtggggaccaaatggctggccaaggacttgttt ctgtacagctgcgggttatttcctctcctggcacacgcggcggtgtcggtgaggccggtg ctgctcaccctgtacgagaagtacttcctcccactgcagaagctgctcctgcccagtctg caggccttcatcgtgggcctgctgcccggccttgaagagggctccgagatctccgacaga acggatgctctgctcctgagactgtcgctggtggttggcaaagaggtgttttacaccgcc ctctgggggagcgtcctggccagcccgtccatccgcctccctgcctcagtcttcgtggtg ggccacatcaacagggatgcccccggccgggagcagaagtacatgctggggaccaatcac caactcacggattccaatgagagagccatccccctcctcagatctgacatcgtgcgcatt ctctcagccgccacccagaccctactgagaagggacatgtccctgaacagaagactgtat gcatggttactaggctcagacataaaaggaaataccgttgtgccagaatctgaaatctca aattcttatgaagaccagtcgtcttatttttttgaaaaatactccaaggatcttttagtt gagggtttggctgagatattgcatcagaagttcatagatgctgacgtggaggaacgccat catgcatacctgaagccttttcgcgtcctcatcagtctgcttgacaagccagaaataggg cctcaagtggttgggaatttgtttctcgaagtcatcagggccttttattcttactgcaga gatgcccttggctctgatcttaaacttagctacacccagagtggaaattcgctgataagt gcaatcaaggaaaacagaaatgcctctgagattgtcaaaacggtaaatttgctgataact tctctaagcacagactttctctgggattatatgacaaggtgttttgaggaatgctttaga ccagtgaagcagcgttacagcgtgaggaacagcgtcagccctccccccacggtctcggag ctctgcgccctcctggtcttcctgctggatgtcattcctttggaactttactctgaggtg caaacccagtatctccctcaggtgctcggctgcctggtgcagcctcttgctgaggacatg gaggccttaagtttacctgaactcacgcatgccttgaagacgtgtttcaaggtgctcagc aaagtccagatgcctccttcctacctcgacacggagtccaccagcggaacctcgagtcca gtaaaaggtgaaaacggcaaaataattttggaaacaaaggcagtgattcccggtgacgaa gatgcttcgtttccccctctgaagtctgaggacagtgggatcgggctcagtgcctcgtca ccggagctctctgagcacttgagggttcctcgagtttctctggaaagggacgacgtttgg aagaagggcgggagcatgcagaggacgtttctttgcatccaagagctaatcgccaacttt gccagcaagaacatttttggagtacagctgacagcgtcaggagaagaaagcaagtccgag gagcctgcagggaagagggacagggatgggacgcagagcctggcagccaatgattccagc aggaagaactcttgggagcccaagcccatcactgtgcctcagttcaagcagatgctgtca gacttgttcacagcacgagggtctccattcaagacaaaaagttcagagtcaccatcgtct tcgcccagcagccctgccaggaaaaacgggggagaatgggatgttgagaaggtggtcatt gacctggggggttccagggaggaacgcagggaggcctttgccgccgcctgccacctgctg ctggattgtgccactttccctgtctacctgtccgaggaagagaccgagcagctctgtgca acgctcttccagctgccaggagccggtgattccagttttccatcttggctgaagtccctc atgactatttgctgctgtgtgactgactgctacctccagaacgtggccatttccactctg ctggaagtgataaaccattcccagtccctggcgcttgtcattgaagacaagatgaaacgc tataagagctctggacacaaccctttttttggcaagctgcagatggtgacggttcctccc attgctccagggatattgaaagtcattgcagagaaaacagatttctatcagagggtggct cgtgtgctttggaatcagctgaacaaagagacccgggagcatcacgtcacctgcgtagaa ttgttctaccggctgcactgcctggcccctacggccaacatctgcgaggacatcatctgc catgccctcctggaccctgacaaggtgagcctttctggccgccacctccatcctgcagca ccccggcccctgctgtccttgtttgtcgtgctggacagcctggcctgcacggatggtgcc atcggtgcggcagcccagggctggctggtgcgtgcgctctccctcggggacgtggctcgc atcctcgaacccgtgctcctgctgctgctgcagccaaaaacccagagaacctccatccac tgcctcaagcaggagaactcggccgatgacttgcaccgttggtttaacaggaagaaaacc tctttcagagaggcatgcgcagtgcccgagcctcaggagagcggctctgaagagcacctg cctctgagccagttcaccacagtggaccgtgaagccatttgggccgaagtggagaaggag cccgagaagtacccgctgcgaggcgagctgagcgaggaagagctgccctactacgtggag cttccagacaggacggcccacggcgccccggacagcagcgagcacaccgagtctgcagat acaagctcctgccacacggacagcgagaacacgtcctccttctcctccccttcccacgac ctgcaggagctgagcaacgaagagaactgctgtgcacccatccccatggggggcagggcg taccccaagcgctcggccctgctggcggccttccagtcagaaagcttcaaggctggggcc aagttaagcctggtgcgggtggactcggacaagacgcaggcttctgagtcgttctccagc gacgaggaggcggacttggagctccaggccctcaccacatccaggctgctaaagcagcag cgggaaaggcaggaggccgtcgaggccttgttcaagcacatcctgctctacctgcagccc tacgactctcggcgggtcctctatgccttctcggtgctggaggctgtgctcaaaaccaac cctaaggaattcatcgaggctgtgtccaggactagcatggataccagctccaccgcgcac ctcaacctcatctccaacctcctcgctcgccaccaggaggccctcattggccagagtttc tacggaaagctccagacccaggtccccaacgtgtgcccccactctctgctcctggagctg ctcacctacctctgcctgagcttcctgcgctcctactacccttgctatttgaaggtctcg caccgagacattctcggcaaccgggacgtgcaggtcaaaagtgtcgaggttttgatcagg ataatgatgcagctggtctcagtggccaagtcttcggaagggaagaacgtggagttcatc cacagcttgctgcagaggtgcaaagttcaggagtttgtcctgctctccctgtcggcgtcc atgtacacgagccagaagcgctacgggctggccaccgcccaccacggcagggccctgcca gaggacagcctctttgaggagagtctcattaacttgggtcaggaccagatctggagtgag cacccgctgcagattgagctgctgaagctgctgcaggtgctgattgtcttggaacaccac ctgggtcgggcccatgaggaggcggaaaaccagcccgacctgtcccgggagtggcagaga gccctgaacttccagcaggccatcagcgccctgcagtacgtgcagccccaccccctcacc tcccagggtcttctggtctctgcggtggtgaggggtctgcagcccgcctacggttacggc atgcatccggcctgggtgagcttggtcacgcattccttgccctacttcggaaagtccctg ggctggacggtgacaccctttgttgtccagatttgcaaaaacttggatgacttggtcaag cagtatgaaagcgaatctgtgaagctctctgtcagcacaacctccaagagggaaaacatt tctccagattatccactcacccttctagaaggtctaacgaccattagtcatttttgtctt ttggaacaagccaaccaaaacaaaaagaccatggctgcaggtgatcctgccaacttgagg aatgccagaaatgccattttggaagagctgcctcgaactgttaacaccatggcccttctc tggaatgttctcagaaaggaggagactcaaaagagacctgtcgatctcctaggggccacg aagggatcctcttccgtttactttaaaaccaccaaaaattatctgccacatctgacatca gttgagagcccccaggtgccagatgtggccgcagggatgagccttgccatcctgcattta ggtaccacgaccataagacaaaaaattttagacttcttaaaccccttgacggcccatctt ggggttcagttgacagcggctgttgcggcagtgtggagcagaaagaaagcccagcgtcac agtaagatgaagattatcccaacggcaagtgcatcccagctaacccttgtcgacttggtg tgtgcactcagcaccctgcagactgacacgctgctgcacctggtgaaggaggtggtgaag aggccaccccaagtcaaagggggtgatgagaaatcgcccctagtggacattcctgtgttg cagttttgctatgcttttctccaaagcatgctgaatgactttgtaacaagaactcccaac ctggaaaacaagaaggaccaaaaagacctgcaggaaatcactcagaaaatcctagaagct gtggggaacattgccggctcttccttggagcaaaccagctggctaagcagaaacctggaa gtgaaggcccaacctcaggcctctctagaagaatctgatgctgaggaggacctgtatgat gctgctgcagcttcagcaatggtgtcttcatccgccccgtcggtgtacagcgtgcaagcc ctctctctcctggcagaggtactggcttccctcctggacatggtttatcgaagtgatgag aaggagaaagctgtgccgttaatctcccgtctgctttactatgtttttccatacttacgc aaccacagtgcctacaatgctcccagcttccgggctggcgctcagctgctgagctccctg agtggctatgcctacacaaagcgagcctggaggaaggaggtcctggagctgtttctcgac cccgctttctttcagatggatacttcctgtgttcagatgctcgtggataggcccagcccc cctgagtcctgggttgatggcactcagagtgtcctctgccagagctgtgggacactgaac ttgcctggagatgccctgtggagggatgatgcccacatctgcctcagagagattcagaca ttcacacagcttgaagaagatctaaaagatgaagatgagtcattgagttataggtgggca tttattccagaagtggacacagagggccctgccttcctgtcggatgtagaggagaatcac caagaatgcaaaccccacactgtcaggattctagaacttctaaaattaaagtttggggaa atcagtagctctgatgagatcaccatgaagagtgaattcccgcttctgcgccaacattct gtttccagcatcaggcagttgatgccattcttcatgactctaaatggtgcatttaagacc cagagacagctgcctgctgatagcccaggaactccattcttggactttcctgtcacagat agcccaaggatcttaaaacaactggaagaatgcatcgaatatgattttctggaacatcca gaatgttaa >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_9|143_aa MRAAAGQQRRAVRLSGWADTSAAGRATGASTWGNLPTHVREKEPQDLFYKYSRIREIELK SRYGLVPFASVRFEDPRDAEDAIYGRNGLLPSGSWQDLKDHTREAGDACYTDVQKDGVGM VGCLRKEDMEYALRQLDDQIPLS >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_9|432_bp atgcgggctgcggctggccagcagcgtcgggcggtgcggttgtccggctgggcggacacg agcgcggctgggagggcgacgggtgcatctacgtggggaaaccttccgacccacgtgcgc gagaaggagccgcaggacctgttctacaagtacagccgcatccgcgagatcgagctcaag agccggtacggccttgtgcccttcgcctccgtgcgcttcgaggaccctcgagatgcagag gatgctatttatggaagaaatggacttcttccatcaggcagctggcaggacctgaaggat cacacgcgagaagctggggatgcctgttacacggatgtgcagaaggatggagtggggatg gttgggtgtctcagaaaagaagacatggaatatgccctgcgtcaactggatgaccaaatt ccactctcatga >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_10|123_aa MAAPSPGPAAAPAAASAPQAATAHCRPPFACPPALLGHGPLHPSRLLTALRRIPRGGCAA ILSELQSQERPPPQPDWEVAERLWSPMGTAHFRGGAHVWPLPPTSASRQCRIGSLGKMNC LTG >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_10|372_bp atggccgccccctccccgggccccgccgcagcccccgccgccgcgtcggccccacaagcc gccaccgcccactgccggccccccttcgcttgcccgcccgccctcctgggacacggcccg ctccacccctcgcggctgctcaccgcgctgaggcgtatcccgcggggtggctgcgccgcc atcttgagcgagctacaaagccaggaacggcctccgccgcaacccgactgggaggtggcg gaacgactgtggagccctatgggtaccgcccacttccggggaggggcgcacgtctggccc ctcccacccacttccgcctccaggcaatgccgaattgggagcttggggaagatgaattgc ctgactggttaa >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_11|775_aa MGFHHVGQADLKLLTSDNAYDPDVNAKQIWIDKTVINDHICLTFTDNGNGMTSDKLHKML SFGFSDKVTMNGHVPVGLYGNGFKSGSMRLGKDAIVFTKNGESMSMINLAESKASLAAIL EHSLFSTEQKLLAELDAIIGKKGTRIIIWNLRSYKNATEFDFEKDKYDIRIPEDLDEITG KKGYKKQERMDQIAPESDYSLRSKTVRITFGFNCRNKDHYGIMMYHRNRLIKAYEKVGCQ LRANNMGVGVVGIIECNFLKPTHNKQDFDYTNEYRLTITALGEKLNDYWNEMKVKKNTEY PLNLPVEDIQKRPDQTWVQCDACLKWRKLPDGMDQLPEKWYCSNNPDPQFRNCEVPEEPE DEDLVHPTYEKTYKKTLKRRLSTRSSILNAKNRRLSSQFENSVYKGDDDDEDVIILEENS TPKPAVDHDIDMKSEQSHVEQGGVQVEFVGDSEPCGQTGSTSTSSSRCDQGNTAATQTEV PSLVVKKEETVEDEIDVRNDAVILPSCVEAEAKIHETQETTDKSADDAGCQLQELRNQLL LVTEEKENYKRQCHMFTDQIKVLQQRILEMNDKYVKKETCHQSTETDAVFLLESINGKSE SPDHMVSQYQQALEEIERLKKQCSALQHVKAECSQCSNNESKSEMDEMAVQLDDVFRQLD KCSIERDQYKSEVELLEMEKSQIRSQCEELKTEVEQLKSTNQQTATDVSTSSNIEESVNH MDGESLKLRSLRVNVGQLLAMIVPDLDLQQVNYDVDVVDEILGQVVEQMSEISST >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_11|2328_bp atggggtttcaccatgttggccaggctgatctcaaactcctgacctcagataatgcttat gatcctgatgtgaacgctaaacaaatatggattgacaaaacagtgataaatgaccatata tgcttgacattcaccgacaatgggaatggtatgacttctgataaattacataaaatgcta agctttggcttcagtgacaaagtcaccatgaatggtcatgtcccagttggattatatggg aatggcttcaagtcgggttctatgcgtctgggtaaagacgcaatcgtttttaccaaaaat ggagaaagcatgagcatgattaatttagcagaatcaaaagccagccttgctgcaattctg gaacattctctgttttccacggaacagaagttactggcagaacttgatgctattataggc aagaaggggacgaggatcatcatttggaatcttagaagctacaaaaatgcaacagagttc gattttgaaaaggataaatatgatatcagaattcccgaggatttagatgagataacaggg aagaaggggtacaagaagcaggaaaggatggaccagattgcccctgagagtgactattcc ctgaggtctaaaacagtgagaattacctttggattcaactgcagaaataaagatcattat gggataatgatgtatcacagaaatagactcatcaaagcttatgaaaaagttggatgtcag ttaagggcaaacaacatgggtgttggagtggttggaattatagagtgtaatttccttaag ccaactcataataaacaagatttcgactatactaatgagtacagacttacaataacagca ctaggagaaaagctgaatgattactggaatgaaatgaaagtgaagaaaaatacagaatat cctctaaatttgccagttgaagatatacagaagcgtcctgatcagacatgggttcagtgt gatgcctgtctaaagtggcggaaattacctgatgggatggatcaacttcctgaaaaatgg tattgctccaataaccctgacccacagttcagaaattgtgaggttccagaagaacctgaa gatgaggatttggtacatcccacttatgaaaaaacctacaaaaagaccttgaaacggaga ctttctactcgttcctcaattttgaatgcaaagaatcggagattgagtagtcagtttgaa aattcagtttataaaggtgatgatgatgatgaagatgtcatcatcttagaagaaaacagt acccccaaacctgcagtagatcatgatattgacatgaaatcagaacagagtcacgttgag caaggtggtgttcaggttgagtttgtgggtgacagtgaaccttgtggccagactggttca acaagcacctcatcatcccgatgcgaccagggaaatactgcagctacccagactgaagta ccaagtttagttgttaaaaaagaagaaactgttgaagacgagatagacgtaagaaatgat gcagtgattctgccctcctgtgtagaagctgaagcaaagatacatgaaacccaggaaacc accgataaatctgcagatgatgcaggctgccaattacaagaactgagaaaccagctactc cttgtcactgaggaaaaagagaattataaaagacagtgtcatatgtttactgatcaaatc aaagtgttacaacagaggatactagaaatgaatgacaagtatgttaagaaagaaacttgc catcagtccactgaaaccgatgctgtatttttacttgaaagtattaatggcaaatctgaa agtccagaccatatggtatctcagtatcagcaagctttggaagaaatagaaaggctgaaa aaacaatgtagtgctttgcaacatgtaaaggctgaatgcagccagtgttccaataatgag agtaaaagtgaaatggatgagatggctgtgcagcttgacgatgtgtttagacaactggac aaatgcagtattgagagggaccagtataaaagtgaggttgaattgctggaaatggaaaag tcacaaatccgttcacagtgtgaagaactcaaaactgaagtagaacagttaaaatctaca aatcaacagacggcaacagatgtttcaacatcaagtaacattgaggagtctgtaaatcat atggatggagaaagcctcaaactccgatctcttcgagttaacgtaggacaactgctggct atgattgtgcctgatcttgatcttcagcaagtgaattacgatgttgatgtagttgatgag attttaggacaagttgttgaacaaatgagtgaaatcagtagtacttaa >gi568815577f:36064734_36393568|GENSCAN_predicted_peptide_12|151_aa MRSRPCGGGACGAREAARAAREVTVPLTVRVPPAWHNKEPVYSLDFQHGTAGRIHRLASA GVDTNVRIWKVEKGPDGKAIVEFLSNLARHTKAVNVVRFSPTGEILASGGDDAVILLWKV NDNKEPEQIAFQDEDEAQLNKENWTVVKTLR >gi568815577f:36064734_36393568|GENSCAN_predicted_CDS_12|456_bp atgaggtcccgcccatgcgggggcggggcctgcggcgcgcgggaagcggcgcgcgctgcg cgggaggtgacggtgcctctgactgtccgggtccctccagcctggcacaacaaggagccc gtgtacagcctggacttccagcatgggacggctgggaggatccacagactggcgtctgcc ggcgtggacaccaatgtcaggatctggaaggtagaaaagggaccagatggaaaagccatc gtggaatttttgtccaatcttgctcgtcataccaaagccgtcaatgttgtgcgtttttct ccaactggggaaattttagcatcgggaggagatgatgctgtcatcctattgtggaaggtg aatgataacaaggagccggagcagatcgcttttcaggatgaggacgaggcccagctgaac aaggagaactggacggttgtgaagactctgcggtaa