GENSCAN 1.0	Date run:  3-Nov-116	Time: 10:47:33

Sequence gi568815575f:54822247_55031017 : 208771 bp : 41.07% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2515   2666  152  2  2   79   94   228 0.639  21.46
 1.02 Intr +   3743   3816   74  1  2   40   71    93 0.027   0.09
 1.03 Intr +   7484   8331  848  2  2   40    0   314 0.000   7.88
 1.04 Intr +  10985  11093  109  0  1  104   36    47 0.001  -0.48
 1.05 Intr +  16922  17042  121  2  1   29   42   136 0.002   2.68
 1.06 Intr +  19774  19890  117  0  0   75   92    54 0.092   4.24
 1.07 Term +  23708  24016  309  1  0   53   44   242 0.223  10.38
 1.08 PlyA +  24031  24036    6                               1.05

 2.02 PlyA -  25548  25543    6                               1.05
 2.01 Sngl -  33621  32953  669  0  0   79   50   227 0.843  13.92
 2.00 Prom -  35398  35359   40                              -6.15

 3.02 PlyA -  35568  35563    6                               1.05
 3.01 Sngl -  36809  36549  261  2  0   88   42   251 0.932  15.41
 3.00 Prom -  37971  37932   40                              -3.65

 4.02 PlyA -  37982  37977    6                              -3.94
 4.01 Sngl -  39695  38100 1596  2  0   22   45   471 0.506  31.29
 4.00 Prom -  49434  49395   40                              -6.55

 5.02 PlyA -  49444  49439    6                               1.05
 5.01 Sngl -  57574  56564 1011  1  0   70   38   337 0.931  23.64
 5.00 Prom -  58525  58486   40                              -6.15

 6.02 PlyA -  58694  58689    6                               1.05
 6.01 Sngl -  59937  59608  330  0  0   65   44   409 0.872  29.97
 6.00 Prom -  62715  62676   40                              -3.65

 7.00 Prom +  65795  65834   40                              -3.95
 7.01 Init +  66954  67069  116  2  2   60   35   103 0.242   1.93
 7.02 Intr +  68575  68727  153  1  0   66   35   115 0.302   2.27
 7.03 Intr +  72565  72706  142  1  1   53   82   114 0.765   6.73
 7.04 Intr +  77902  78031  130  0  1   67   44    72 0.127   0.05
 7.05 Intr +  95469  95917  449  1  2   67   86   164 0.201   6.24
 7.06 Intr +  98776  98931  156  0  0    9   21   187 0.605   3.39
 7.07 Intr +  99243  99414  172  2  1   26   74    66 0.466  -2.41
 7.08 Intr + 100378 101522 1145  2  2  101  100  1119 0.994 102.39
 7.09 Intr + 102424 102487   64  0  1  104   72    68 0.982   4.17
 7.10 Intr + 102743 102822   80  0  2  107   67    97 0.979   7.85
 7.11 Intr + 103346 103437   92  1  2   73   85    44 0.906   0.47
 7.12 Intr + 104164 104243   80  1  2  108   96    27 0.877   3.78
 7.13 Intr + 104337 104481  145  1  1  113    5    67 0.483  -0.68
 7.14 Intr + 105421 105535  115  1  1   33   90   102 0.671   4.43
 7.15 Term + 106357 108774 2418  0  0  100   40  2700 0.816 250.75
 7.16 PlyA + 108987 108992    6                               1.05

 8.18 PlyA - 109124 109119    6                               1.05
 8.17 Term - 111216 111157   60  0  0  103   52    64 0.730   1.13
 8.16 Intr - 111639 111575   65  1  2   47   92    81 0.861   1.92
 8.15 Intr - 112763 112701   63  0  0  124  115     5 0.895   4.57
 8.14 Intr - 112929 112885   45  1  0  140   81    45 0.912   6.46
 8.13 Intr - 115478 115349  130  2  1   60   79   126 0.106   8.25
 8.12 Intr - 123297 123193  105  0  0   63   94   175 0.903  15.09
 8.11 Intr - 126975 126829  147  0  0   61   81   162 0.944  12.31
 8.10 Intr - 128290 128061  230  2  2   59   75    60 0.326  -1.53
 8.09 Intr - 128742 128596  147  1  0   90   34   111 0.617   5.19
 8.08 Intr - 129287 129191   97  2  1   77   38    42 0.541  -3.24
 8.07 Intr - 129866 129659  208  1  1   47   91   318 0.848  26.16
 8.06 Intr - 134028 133907  122  0  2   94  110   158 0.999  16.97
 8.05 Intr - 136116 136060   57  0  0  109   86    41 0.947   4.16
 8.04 Intr - 136679 136605   75  2  0   87   86   132 0.999  11.69
 8.03 Intr - 137647 137581   67  0  1   97   45    93 0.999   3.89
 8.02 Intr - 138671 138578   94  0  1   87  100    82 0.941   7.40
 8.01 Init - 141104 141011   94  2  1   91   95    73 0.627   8.89
 8.00 Prom - 148575 148536   40                              -3.65

 9.02 PlyA - 149339 149334    6                               1.05
 9.01 Sngl - 151197 150829  369  0  0   83   41   227 0.466  13.36
 9.00 Prom - 161407 161368   40                              -5.75

10.00 Prom + 167703 167742   40                              -6.75
10.01 Init + 168268 168317   50  0  2   28   98    77 0.912   3.17
10.02 Intr + 171684 171798  115  0  1   83   77    93 0.687   7.23
10.03 Intr + 177593 177832  240  1  0   52   78   144 0.231   6.62
10.04 Intr + 178177 178333  157  0  1  -20   55   236 0.527   8.16
10.05 Intr + 179300 179383   84  0  0  128   59   117 0.999  11.67
10.06 Intr + 180005 180185  181  0  1  106   47   179 0.923  13.60
10.07 Intr + 180716 180862  147  2  0   95   84   180 0.940  16.73
10.08 Intr + 181553 181622   70  2  1   83   95    52 0.994   3.67
10.09 Term + 184272 185189  918  2  0  107   46   521 0.980  41.02
10.10 PlyA + 185502 185507    6                               1.05

11.11 PlyA - 186581 186576    6                               1.05
11.10 Term - 187097 186934  164  0  2   80   54   133 0.976   6.22
11.09 Intr - 191402 191240  163  2  1   42  106   128 0.865   8.63
11.08 Intr - 192769 192501  269  2  2   87   80   308 0.999  26.23
11.07 Intr - 193496 193332  165  0  0   86  111   172 0.999  18.31
11.06 Intr - 195419 195240  180  0  0   64   80   156 0.844  11.32
11.05 Intr - 198258 198074  185  2  2   93  115   134 0.999  15.11
11.04 Intr - 199028 198806  223  0  1   93   79   191 0.992  14.96
11.03 Intr - 201621 201511  111  1  0   69  105    69 0.942   6.13
11.02 Intr - 202594 202472  123  2  0   60   47   135 0.981   6.24
11.01 Init - 203754 203574  181  0  1   77   94   209 0.600  17.88

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   3846   3655  192  0  0   85   48    89 0.857   1.04
S.002 Intr -   4095   3985  111  0  0   80   75   112 0.908   8.76
S.003 Sngl +   7770   8462  693  2  0   37   38   221 0.841   7.95
S.004 Sngl -  11623  11354  270  1  0   69   43   165 0.954   5.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_1|576_aa
XLSTPAHKPPLSSPTPHHLLPDIARPVGSGIMADFPVQYDVVMEDIIRAVQCVMGSVVVP
VEEGFRAVILKELEFKLNQEEVESLNRPITGSDIVAVINSLPTKKSPGPDGFTAEFYQRY
KEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKLGRDTTKKENFRPISLMNIDAKILS
KILAKRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKA
FDKIQQHFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLS
PLLFNIVLEVLARAVRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISS
PSSTLWNNSGESEHLCHAPDLSRKAFSFSLQYDTSRHPSSLGQARKLDYRPSKSQCLEPG
TPRACLVLYTLVAKLIYNDYFIHYFAPRGLPPMEKNVVFVIDISGFMFGTKMKQDHNFSL
AREQNWTENEFDKLMEVDFRRWVITNSSELKEHVLTQCKEANNLEKRLDELLTRITSVEK
NINDLMELKNTAGELREAYTSFNSCNDQAEERITVI

>gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_1|1731_bp
ngactctcaaccccagcacacaaaccacccctcagcagccccacaccacaccacctactg
cccgacattgcaagaccagttggctcaggcatcatggctgacttcccggttcagtacgat
gtggtcatggaggacatcattagagccgtgcagtgtgtcatgggttctgtcgtggtgcca
gtggaggagggtttcagggccgtcatcctaaaggagttggaatttaaactaaaccaggag
gaagttgaatctctgaatagaccaataacaggctctgatattgtggcagtaatcaatagc
ttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtac aaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaatc ctccctaactcattttatgaggccagcatcattctgataccaaagctgggcagagacaca accaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcagt aaaatactggcaaaacgaatccagcagcacatcaaaaaacttatccaccatgatcaagtg ggcttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgtcatccag catataaacagaaccaaagacaaaaaccacatgattatctcaatagatgctgaaaaggcc tttgacaaaattcaacaacacttcatgctaaaaactctcaataaattaggtattgatggg acgtatttcaaaataataagagctatctatgacaaacccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaacgggcacaagacagggatgccccctctca ccactcctattcaacatagtgttggaagttctggccagggcagttaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattatccctgtttgcagatgacatg attgtatatctagaaaaccccattgtttcagcccaaaatctccttaagctgataagtagt ccttccagtactttgtggaataacagtggtgaaagcgagcatctgtgtcatgctccagac cttagcagaaaggctttcagtttttctcttcagtatgatactagtaggcacccctctagc ctagggcaggcccgaaagttggattataggccatccaagagccaatgcctagaacctggg accccaagagcctgcttggtgctctacacccttgtggccaagctgatttataatgactat ttcattcactactttgcccccagaggccttccacctatggagaagaatgtggtttttgtt attgacataagtggcttcatgtttggtaccaagatgaaacaggatcacaacttctcacta gcaagggaacaaaactggacagagaatgagtttgacaaattgatggaagtagacttcaga aggtgggtaataacaaactcctccgagctaaaggagcatgttctaacccaatgcaaggaa gctaacaaccttgaaaaaaggttagatgaattgctaactagaataaccagtgtagagaag aacataaatgacctgatggagctgaaaaacacagcaggagaacttcgtgaagcatacaca agtttcaatagctgcaatgatcaagcagaagaaaggataacagtgatttaa >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_2|222_aa MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLHTNNRQTESHIMSELPFTIAS KGIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVERINILKMAILPKVIY RFNAIPIKLPMTFFTGLEKTTLKFIWNQKRACMAKSILSQKNKAGGITLPDFKLYYKATV TKTAWYWYQNRDIDQWNRIEPSEILPHIYNYLIFEKPDKNKQ >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_2|669_bp atgattgtatatctagaaaaccccattgtctcagctcaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttacac accaataacagacaaacagagagccatatcatgagtgaactgccattcacaattgcttca 
aagggaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccg tgctcatgggtagaaagaatcaatatattgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacaggattggaaaaaact actttaaagttcatatggaaccaaaaaagagcctgcatggccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagaatagag ccctcagaaatattgccacatatctacaactatctgatctttgagaaacctgacaaaaac aagcaatag >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_3|86_aa MGKKHSRKTGNSRNKSTSPPPKEHSSSPAMGQSWTENDFDELREEGFRRSNYSEQKEEVL INGKEVKNLEKKLDEWLTRITNAEKS >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_3|261_bp atggggaaaaaacacagcagaaaaaccggaaactctagaaataagagcacctctcctcct ccaaaggaacacagctcctcaccagcaatgggacaaagctggacagagaatgactttgac gagttgagagaagaaggcttcagaagatcaaactactccgagcaaaaggaggaagttcta atcaatggcaaagaagttaaaaaccttgaaaaaaaattagatgaatggctaactagaata accaatgcagagaagtcctaa >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_4|531_aa MLKTLTKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTKQGCSLSPLLFNTEL EVFARAIRQEKEIKGIQLGKEEVKLSLFADDMILYLENPIVSAQNLLKLISNFSKVSGYK INVQKSQALLYANNTQTDSQTRSELPFTIASKRIKHLGIQLTRDVKDLFKENCKPLLNEI KEDTNKWKNIPCSWAGRINIVKMAKLPKVFYRFNAIPIKLPMTFFTELEKTTLKFIQNQK RARIAKSILSQKNKVGDITLPDFKLNYKATVIKTAWYWYQNRDIDQWNRTEPSEIMLHIY NHLIFDKPEKNKQWGKDSLFNKRCWENWLAICRKLKLDPFLIPYTKINSRWMKDLHVRPK TIKTLEENLGNKIQDIGMGKDFMSKTPKAMATKDKIDKWELIKLKSFCTAKETTIRVNRQ PTEWEKIFAIYSSDKWLISRIYDELKQTYKKKTNNPIKKWVKDMNRHFSKEDIYAAKRHM KKCSSSLAIREMQIKTKMRYHLTPVRTVIIKKLGNTRCWRGCGEIGTLLHC >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_4|1596_bp atgttaaaaactctcactaaattaggtattgatgggatgtatctcaaaataataagagct atctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttgaaaacgggcacaaaacagggatgctctctctcaccactcctattcaacacagagttg gaagtttttgccagggcaatcaggcaggagaaggaaataaagggcattcaattaggaaaa gaggaagtcaaattgtccctgtttgcagatgacatgattttatatctagaaaaccccatc 
gtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcactcttatacgccaataacacacaaacagacagtcaa acccggagtgaactcccattcacaattgcttcaaagagaataaaacacctaggaatccaa cttacaagggatgtgaaggacctcttcaaagagaactgcaaaccactgctcaatgaaata aaagaggatacaaacaaatggaagaacattccatgctcatgggcaggaagaatcaatatc gtgaaaatggccaaattacccaaagtattttatagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaaattcatacagaaccaaaaa agagcacgcattgccaagtcaatcctaagccaaaagaacaaagttggagacatcacgcta cctgacttcaaactaaactacaaggctacagtaatcaaaacagcatggtactggtaccaa aacagagatatagaccaatggaacagaacagagccttcagaaataatgctgcatatctac aaccatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaacggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcctttc cttataccttatacaaaaattaattcaagatggatgaaagacttacatgttagacctaaa accataaaaaccctagaagaaaacctaggcaataaaattcaggacataggcatgggcaag gacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggag ctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacagaatgggagaaaatttttgcgatctactcatctgacaaatggctaatatccaga atctatgatgaactcaaacaaacttacaagaaaaaaacaaacaatcccatcaaaaagtgg gtgaaggacatgaatagacacttctcaaaagaagacatttatgcagccaaaagacacatg aaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaaaatgagatac catctcacaccagttagaacagtgatcattaaaaagttaggaaacaccaggtgctggaga ggatgtggagaaataggaacacttttacactgttga >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_5|336_aa MDKSLDTYTLPRLNQEEVDSLNRTITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKKGNLPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFFPGMQGWFNIRKSINVIQHINRTKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMILYLENPIVSAQNLLKLISNFSKV SGYKINVQKSQAFLYTNNRQRAKSWVNSHSQLLQRE >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_5|1011_bp atggataaatccctcgacacatacaccctcccaagactaaatcaggaagaagttgactct ctgaatagaacaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg 
gtaccattccttctgaaactattccaatcaatagaaaaaaagggaaacctccctaactca ttttatgaggccagcatcatcctgataccaaagccaggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttctttcct gggatgcaaggctggttcaatatacgcaaatcaataaatgtcatccagcatataaacaga accaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt cagcagcccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggcatt caattaggaaaagaggaagtcaaattatccctgtttgcagatgacatgattttatatcta gaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacag agagccaaatcatgggtgaactcccattcacaattgcttcaaagagaataa >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_6|109_aa MWKKQSRKTGNSKKQSASPPPKECSSSPATEQSRTENDFDELREEGFRRSNYFKLQEEIQ TKGKEVENFEKNLDECITKITNTEKSLKELMELKAKARELREECRSLRS >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_6|330_bp atgtggaaaaaacagagcagaaaaaccggaaactctaaaaagcagagtgcctctcctcct ccaaaggaatgcagctcctcaccagcaacggaacaaagccggacagagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactacttcaagctacaggaggaaattcaa accaaaggcaaggaagttgaaaactttgaaaaaaatttagacgaatgtataactaaaata accaatacagagaagtccttaaaggagctgatggagctgaaagccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagctga >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_7|1818_aa MKEKILRAAREKVQVTYKGKLIRLTADLSAETLQARREGDMDEAGSRHSQQTNTGRENQI PYVLSHKWELNNENTWTQEGEQHSPGPIGGWHAPPVVLVTGLPSLLQPLPTPVPTTWEPE SCTANATAIARATPAAQAAGVIERQNKLLKEKQKKITGQPRVSFQWSTHLSQAVWSVAQK VLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMTVYLENPIVSAQNLLKLISNFSKVSG YKINVQKSQAFLYTNNRQTESQIMSELLFTIASKRIKYLGIQLTRDVKDLFKENYKPLLN EIKEDTKKWKNIPCSWLGRINIVKMAILPKRCLDPVRTGGSRRQNPLWATESNDLDSVRA VKGRDADLVGTAERKDREILQTKGRSCTLADVERGGSGVGWGRGGAETYRRAEGGGLLKE GKMKRNAAERREGGVLGRGDIQTETTEEDSVLLMHTLLAATKDSLAMDPPVVNRPKKSKT 
KKAPIKTITKAAPAAPPVPAANEIATNKPKITWQALNLPVITQISQALPTTEVTNTQASS VTAQPKKANKMKRVTAKAAQGSQSPTGHEGGTIQLKSPLQVLKLPVISQNIHAPIANESA SSQALITSIKPKKASKAKKAANKAIASATEVSLAATATHTATTQGQITNETASIHTTAAS IRTKKASKARKTIAKVINTDTEHIEALNVTDAATRQIEASVVAIRPKKSKGKKAASRGPN SVSEISEAPLATQIVTNQALAATLRVKRGSRARKAATKARATESQTPNADQGAQAKIASA QTNVSALETQVAAAVQALADDYLAQLSLEPTTRTRGKRNRKANKLVKYLLVKDQTKIPIK RSDMLRDVIQEYDEYFPEIIERASYTLEKMFRVNLKEIDKQSSLYILISTQESSAGILGT TKDTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLRPGYDWALSALAVRVVLWQERM VLGLHQSGGLVERVGYLEYKRVPNSRPPEYEFFWGLRSYHETSKMKVLKFACRVQKKDPK DWAVQYREAVEMEVQAAAVAVAEAEARAEARAQMGIGEEAVAGPWNWDDMDIDCLTREEL GDDAQAWSRFSFEIEARAQENADASTNVNFSRGASTRAGFSDGASISFNGAPSSSGGFSG GPGITFGVAPSTSASFSNTASISFGGTLSTSSSFSSAASISFGCAHSTSTSFSSEASISF GGMPCTSASFSGGVSSSFSGPLSTSATFSGGASSGFGGTLSTTAGFSGVLSTSTSFGSAP TTSTVFSSALSTSTGFGGILSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTL STSVSFGGSSSTSANFGGTLSTSICFDGSPSTGAGFGGALNTSASFGSVLNTSTGFGGAM STSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSP STSAGFGGALNTNASFGCAVSTSASFSGAVSTSACFSGAPITNPGFGGAFSTSAGFGGAL STAADFGGTPSNSIGFGAAPSTSVSFGGAHGTSLCFGGAPSTSLCFGSASNTNLCFGGPP STSACFSGATSPSFCDGPSTSTGFSFGNGLSTNAGFGGGLNTSAGFGGGLGTSAGFSGGL STSSGFDGGLGTSAGFGGGPGTSTGFGGGLGTSAGFSGGLGTSAGFGGGLVTSDGFGGGL GTNASFGSTLGTSAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFTGEPST STGFSSGPSSIVGFSGGPSTGVGFCSGPSTSGFSGGPSTGAGFGGGPNTGAGFGGGPSTS AGFGSGAASLGACGFSYG >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_7|5457_bp atgaaggaaaaaatattaagggcagccagagagaaagtccaggtcacttacaaagggaag ctgatcagactaacagcagacctctcagcagaaaccctgcaagccagaagagagggggac atggatgaagctggaagccgtcattctcagcaaaccaacacaggaagagaaaaccaaatt ccttatgttctctctcataaatgggagttgaacaatgagaacacatggactcaggaaggg gaacaacactcaccagggcctattggtggttggcatgcaccacctgtggtcctggtgact ggcttgcccagcctgttgcagccactgccaacaccagtgcctactacttgggagccagag agttgtactgccaatgctactgccatcgcacgtgccacacctgctgctcaggcagcaggc gtcattgaaagacagaataaactcttaaaggagaagcagaaaaagataactggacaacca agggtcagcttccagtggagcacacacttaagtcaggcagtttggtctgtggcccaaaag 
gtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaatta ggaaaagaggaagtcaaattgtccctgtttgcagatgacatgactgtatatctagaaaac cccattgtctcagctcaaaatctccttaagctgataagcaacttcagcaaagtctcagga tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagag agccaaatcatgagtgaactcctattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaggatacaaagaaatggaagaacattccatgctcatggctaggaagaatc aatatcgtgaaaatggccatactgcccaagcggtgcctggacccagttcgcacgggagga agtaggaggcagaatcccctttgggccacagaaagtaacgatctggactcggttcgggcc gtgaaagggagggatgcagacctagttgggaccgcagaaaggaaggatcgagaaatcctt cagacgaaagggcggagctgcacacttgcagacgtggaaagaggggggagtggtgttggt tggggaaggggtggtgcagagacgtaccgcagagcggagggagggggcttattaaaggaa gggaagatgaagagaaatgctgcagagcggagggaggggggtgtattgggaaggggcgat atacagactgagaccacagaagaggacagtgtcctgctgatgcataccctgttggcggca accaaggactccctggccatggacccaccagttgtcaaccggcctaagaaaagcaagacc aagaaggcccctataaagactattactaaggctgcacctgctgcccctccagtcccagct gccaatgagattgccaccaacaagcccaaaataacttggcaggctttaaacctgccagtc attacccagatcagccaggctttacctaccactgaggtaaccaatactcaggcttcttca gtcactgctcagcctaagaaagccaacaagatgaagagagttactgccaaggcagcccaa ggctcccaatccccaactggccatgagggtggcactatacagctgaagtcacccttgcag gtcctaaagctaccagtcatctcacagaatattcacgctccaattgccaatgagtcagcc agttcccaagccttgataacctctatcaagcctaagaaagcttccaaggctaagaaggct gcaaataaggccatagctagtgccaccgaggtctcgctggctgcaactgccacccataca gctaccacccaaggccaaattaccaatgagacagccagtatccacaccacagcagcctcc atccgaaccaagaaagcctccaaagccaggaagacaattgctaaggtcataaatactgac actgagcatatagaggctctaaatgtcactgacgcagctaccaggcagattgaggcctca gtagtggctatcaggcccaaaaaatccaagggcaagaaggctgccagcaggggcccaaat tctgtctctgagatctctgaggccccacttgccactcagatagtcacaaaccaagccctg gcagccaccctgcgggtcaagagagggtctagggctcggaaggctgccactaaggctcgg gcaactgaaagccagactccaaatgctgaccaaggggcccaggccaagatagcctctgct cagaccaacgtaagtgcccttgagactcaggttgctgctgctgtccaggccctggcagat gactatctggctcagttgagcctggagcccacaaccaggacccggggcaaaagaaaccga 
aaggctaataagttggtgaaatacctgttggttaaggaccagacaaagatccccatcaaa cgctcagacatgctgagggatgtcatccaagaatatgatgaatatttcccagaaatcatt gaacgagcaagctacactctggagaagatgtttcgagtcaatctgaaagaaattgataag caaagtagcttgtatattctcatcagcactcaggaatcctctgcaggcatactgggaacg accaaggacacacccaagctgggtctcctcatggtgattctgagtgtcatttttatgaat ggcaacaaggccagtgaggctgtcatctgggaggtgctgcgcaagttggggctgcgccct gggtatgactgggctctctcagcgcttgctgtccgtgttgtcctttggcaagagaggatg gtcctaggattgcatcagtctggtggtctggtggagcgggtggggtacctggagtacaag agggtccctaacagcagaccacctgaatatgagttcttctggggcttgcgctcctaccac gagactagcaagatgaaagtcctcaagtttgcatgcagggtgcagaagaaagaccccaag gactgggctgtgcagtaccgcgaggcagtggagatggaagtccaagctgcagctgtggct gtggctgaggctgaagccagggctgaggcaagagcccaaatggggattggagaggaagct gtggctgggccctggaattgggatgacatggatatcgactgcctaacaagggaagagtta ggcgatgatgctcaggcctggagcagattttcatttgaaattgaggccagagcccaagaa aatgcagatgccagcaccaacgtcaacttcagcagaggagctagtaccagggctggcttc agcgatggtgctagtattagcttcaatggtgcacccagctccagtggtggcttcagtggt ggacctggcattacctttggtgttgcacccagcaccagtgccagcttcagcaatacagcc agcattagctttggtggtacactgagcactagctccagcttcagcagcgcagccagcatt agctttggttgtgcacacagcaccagcactagtttcagcagtgaagccagcattagcttt ggtggcatgccttgtaccagtgccagctttagtggtggagtcagctctagttttagtggc ccactcagcaccagtgccactttcagtggtggagccagctctggctttggaggcacactc agcaccacggctggctttagtggtgtactcagcactagcaccagctttggcagtgcaccc acaacgagcacagtcttcagtagtgcgcttagcaccagcactggctttggaggcatactc agcaccagtgtctgttttggtggctctcccagctccagtggtagctttggtggtacactc agtaccagtatctgcttcggtggctctccctgcaccagcactggctttggaggcacactt agcaccagtgtctcctttggtggctcttccagcaccagtgccaattttggtggtacacta agtaccagcatctgctttgatggctctcccagcactggtgctggctttggtggtgctctc aacaccagtgccagctttggcagtgtgctcaacaccagtactggttttggtggtgctatg agcaccagtgctgactttggcggtacactaagcaccagtgtctgctttggtggctctcct ggcaccagtgtcagctttggcagtgcactcaacaccaatgctggttatggtggtgctgtc agcaccaacactgactttggtggtacactaagcaccagcgtctgttttggtggctctccc agcaccagtgctggctttggtggtgcactcaacaccaatgccagctttggctgtgccgtc 
agcaccagtgccagcttcagtggtgctgtcagcaccagtgcttgcttcagtggtgcacca atcaccaaccctggctttggcggtgcatttagcaccagtgctggcttcggtggggcactt agtaccgctgctgacttcggtggtactcccagcaacagcattggctttggtgctgctccc agcaccagtgtcagctttggtggtgctcatggcaccagcctctgttttggtggagctccc agcaccagcctctgctttggcagtgcatctaatactaacctatgctttggtggccctcct agcaccagtgcctgctttagtggtgctaccagccctagtttttgtgatggacccagcacc agtaccggtttcagctttggcaatgggttaagcaccaatgctggatttggtggtggactg aacaccagtgctggctttggtggtggcctaggcaccagtgctggcttcagtggtggccta agcacaagttctggctttgatggtgggctaggtaccagcgctggcttcggtggaggacca ggcaccagcactggttttggtggtggactgggcaccagtgctggcttcagtggcggactg ggcaccagtgctggctttggtggtggactggtcactagtgatggctttggtggtggactg ggcaccaatgctagtttcggcagcacacttggcaccagtgctggctttagtggtggcctc agcaccagcgatggctttggcagtaggcctaatgccagcttcgacagaggactgagtacc atcattggctttggcagtggttccaacaccagcactggctttactggcgaacccagcacc agcacgggcttcagtagtggacccagttctattgttggcttcagcggtggaccaagcact ggtgttggcttctgcagtggaccaagcaccagtggcttcagcggtggaccgagcacagga gctggcttcggcggtggaccaaacactggtgctggctttggtggtggaccgagcaccagt gctggctttggcagtggagccgccagtcttggtgcctgtggcttctcgtatggctag >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_8|601_aa MVIMVGLPARGKTYISTKLTRYLNWIGTPTKVFNLGQYRREAVSYKNYEFFLPDNMEALQ IRKQCALAALKDVHNYLSHEEGHVAVFDATNTTRERRSLILQFAKEHGYKVFFIESICND PGIIAENIRQVKLGSPDYIDCDREKVLEDFLKRIECYEVNYQPLDEELDSHLSYIKIFDV GTRYMVNRVQDHIQSRTVYYLMNIHVTPRSIYLCRHGESELNIRGRIGGDSGLSVRGKQW VTVSGTYSCPERPTLESLFFPILPSRPYPNAGLWGGHSCHKFMDEEAESSPRAGGQDETE PGLKAGPTGLPSAPPLEMGGAGATLTLLTEHQAQFTKHLPVEVALTCAAPTVYQAVVWVL PIHYLKCSPQTAGQMRQCSLDMRPERGRDVSEGTRDHYAYALANFIQSQGISSLKVWTSH MKRTIQTAEALGVPYEQWKALNEIDAGVCEEMTYEEIQEHYPEEFALRDQDKYRYRYPKG ESYEDLVQRLEPVIMELERQENVLVICHQAVMRCLLAYFLDKSSGALSAAPAGCGEWAPD ELPYLKCPLHTVLKLTPVAYGCKVESIYLNVEAVNTHREKPENVDITREPEEALDTVPAH Y >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_8|1806_bp atggtgatcatggtgggtttaccagctcgaggcaagacctatatctccacaaagctcaca cgatatctcaactggataggaacaccaactaaagtgtttaatttaggccagtatcgacga 
gaggcagtgagctacaagaactatgaattctttcttccagacaacatggaagccctgcaa atcaggaagcagtgcgccctggcagccctgaaggatgttcacaactatctcagccatgag gaaggtcatgttgcggtttttgatgccaccaacactaccagagaacgacggtcactgatc ctgcagtttgcaaaagaacatggttacaaggtgtttttcattgagtccatttgtaatgac cctggcataattgcagaaaacatcaggcaagtgaaacttggcagccctgattatatagac tgtgaccgggaaaaggttctggaagactttctaaagagaattgagtgctatgaggtcaac taccaacccttggatgaggaactggacagccacctgtcctacatcaagatcttcgacgtg ggcacacgctacatggtgaaccgagtgcaggatcacatccagagccgcacagtctactac ctcatgaatatccatgtcacacctcgctccatctacctttgccgacatggcgagagtgaa ctcaacatcagaggccgcatcggaggtgactctggcctctcagttcgcggcaagcagtgg gtgactgtcagtggcacatacagctgccctgagagacccacccttgaatcactcttcttc ccaattctgccctcccgaccttatcccaacgcaggtctgtggggtgggcattcctgccac aagtttatggatgaagaagcagagagcagccccagggcaggcggccaggatgagacagag ccaggactgaaagccgggcctactggcctgccctcagctcccccactggagatgggggga gctggcgccaccttgacgcttctcacagagcaccaggcccagtttacaaagcacctcccc gtcgaggtagctctcacttgcgcagcacccactgtgtaccaggctgtggtctgggtgctc cccatccactatctaaagtgcagcccccagacagcaggccagatgagacaatgctctttg gatatgaggccagagaggggcagggatgtgtccgagggcacaagggaccattatgcctat gccctggccaacttcattcagtcccagggcatcagctccctgaaggtgtggaccagtcac atgaagaggaccatccagacagctgaggccctgggtgtcccctatgagcagtggaaggcc ctgaatgagattgatgcgggtgtctgtgaggagatgacctatgaagaaatccaggaacat taccctgaagaatttgcactgcgagaccaagataaatatcgctaccgctatcccaaggga gagtcctatgaggatctggttcagcgtctggagccagtgataatggagctagaacgacag gagaatgtactggtgatctgccaccaggctgtcatgcggtgcctcctggcctatttcctg gataaaagttcaggtgctctcagtgctgcccctgcagggtgtggggagtgggcccctgat gagcttccatatctcaagtgccctctgcacacagtgctcaaactcactcctgtggcttat ggctgcaaagtggaatccatctacctgaatgtggaggccgtgaacacacaccgggagaag cctgagaatgtggacatcacccgggaacctgaggaagccctggatactgtcccagcccac tactga >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_9|122_aa MDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKKIETQKTLQKINESRSWFFERINKI DRPLARLIKRKREKKQIDAIKNDKGDITTDPTEIQTTVREYYEHLYANKLENLEEMDTFL DT >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_9|369_bp 
atggacaccctaacatcacaattaaaagaactagaaaagcaagagcaaacacattcaaaa gctagcagaaggcaagaaataactaagatcagagcagaactgaagaaaatagagacacaa aaaacccttcaaaaaattaatgaatccaggagctggttttttgaaaggatcaacaaaatt gatagaccgctagcaagactaataaagaggaaaagggagaagaagcaaatagacgcaata aaaaatgataaaggggatatcaccaccgatcccacagaaatacaaactaccgtcagagaa tactacgaacacctctatgcaaataaactagaaaatctagaagaaatggatacattcctc gacacataa >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_10|653_aa MLKETPGPDILPGNAASLLLPLLCGIQIFCNLVWVSSPISGDILGVAPNDITAHKPKALL LSLISVERLLLLPPPTMNFPLPSPTRALSASYSWGASAAAICELRWAVRRTEGGALAAEQ FRSGFRRLKSGKMPFMLRVVSWNINGIRRPLQGVANQEPSNCAAVAVGRILDELDADIVC LQETKVTRDALTEPLAIVEGYNSYFSFSRNRSGYSGVATFCKDNATPVAAEEGLSGLFAT QNGDVGCYGNMDEFTQEELRALDSEGRALLTQHKIRTWEGKEKTLTLINVYCPHADPGRP ERLVFKMRFYRLLQIRAEALLAAGSHVIILGDLNTAHRPIDHWDAVNLECFEEDPGRKWM DSLLSNLGCQSASHVGPFIDSYRCFQPKQEGAFTCWSAVTGARHLNYGSRLDYVLGDRTL VIDTFQASFLLPEVMGSDHCPVGAVLSVSSVPAKQCPPLCTRFLPEFAGTQLKILRFLVP LEQSPVLEQSTLQHNNQTRVQTCQNKAQVRSTRPQPSQVGSSRGQKNLKSYFQPSPSCPQ ASPDIELPSLPLMSALMTPKTPEEKAVAKVVKGQAKTSEAKDEKELRTSFWKSVLAGPLR TPLCGGHREPCVMRTVKKPGPNLGRRFYMCARPRGPPTDPSSRCNFFLWSRPS >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_10|1962_bp atgttaaaggagacaccaggtccagatattttacctggaaatgctgccagcctgctgctg ccgctgctgtgtggaatccagatcttctgcaacctggtttgggtgagctctcccatctct ggagacatcttaggagtcgcaccgaatgacatcactgcccacaagccaaaggccctcctc ctaagtcttatctcagtggagcgactcttactgctccctcctccaaccatgaacttccct ttgcccagtccaacgcgagcacttagcgccagctattcctggggcgcaagcgctgcagcc atctgtgaacttaggtgggcggtccgccgcactgagggcggggctttggcagctgagcag ttccgctcagggttccgtcgcttaaaaagtggcaaaatgccgtttatgttgcgcgtggtg agctggaacatcaatgggattcggagacccctgcaaggggtggcaaatcaggaacccagc aactgtgccgccgtggccgtggggcgcattttggacgagctggatgcggatatcgtctgt ctccaggaaaccaaagtgaccagggatgcactgacagagcccctggctatcgttgagggt tataactcctatttcagcttcagccgcaaccgtagcggctattctggtgtagccaccttc tgtaaggacaatgctaccccagtggctgctgaagaaggcctgagtggcctgtttgccacc cagaatggggatgttggttgctatggaaacatggatgagtttacccaagaggaactccgg 
gctctggatagtgagggcagggccctcctcacacagcataagatccgcacatgggaaggt aaggagaagaccttgaccctaatcaacgtgtactgcccccatgcggaccctgggaggcct gagcggctagtctttaagatgcgcttctatcgtttgctgcaaatccgagcagaagccctc ctggcggcaggcagccatgtgatcattctgggtgacctgaatacagcccaccgccccatt gaccactgggatgcagtcaacctggaatgctttgaagaggacccagggcgcaagtggatg gacagcttgctcagtaacttggggtgccagtctgcctctcatgtagggcccttcatcgat agctaccgctgcttccaaccaaagcaggagggggccttcacctgctggtcagcagtcact ggcgcccgccatctcaactatggctcccggcttgactatgtgctgggggacaggaccctg gtcatagacacctttcaggcctctttcctgctgcctgaggtgatgggctctgaccactgc cctgtgggtgcagtcttgagtgtgtcctctgtgcctgcaaaacagtgcccacctctgtgc acccgcttcctccctgagtttgcaggcacccagctcaagatccttcgcttcctagttcct ctcgaacaaagtcctgtgttggagcagtcgacgctgcagcacaacaatcaaacccgggta cagacatgccaaaacaaagcccaagtgcgctcaaccaggcctcagcccagtcaggttggc tctagcagaggccagaaaaacctgaagagctactttcagccctcccctagctgtccccaa gcctctcctgacatagagctgcctagcctaccactgatgagcgccctcatgaccccgaag actccagaagagaaggcagtggccaaagtggtgaaggggcaggccaagacttcagaagcc aaagatgagaaggagttacggacctcattctggaagtctgtgctggcggggcccttgcgc acacccctctgtgggggccacagggagccatgtgtgatgcgtactgtgaagaagccagga cccaacttgggccgccgcttctacatgtgtgccaggccccggggtcctcccactgacccc tcctcccggtgcaacttcttcctctggagcaggcccagctga >gi568815575f:54822247_55031017|GENSCAN_predicted_peptide_11|587_aa MVTAAMLLQCCPVLARGPTSLLGKVVKTHQFLFGIGRCPILATQGPNCSQIHLKATKAGG DSPSWAKGHCPFMLSELQDGKSKIVQKAAPEVQEDVKAFKTDLPSSLVSVSLRKPFSGPQ EQEQISGKVTHLIQNNMPGNYVFSYDQFFRDKIMEKKQDHTYRVFKTVNRWADAYPFAQH FSEASVASKDVSVWCSNDYLGMSRHPQVLQATQETLQRHGAGAGGTRNISGTSKFHVELE QELAELHQKDSALLFSSCFVANDSTLFTLAKILPGCEIYSDAGNHASMIQGIRNSGAAKF VFRHNDPDHLKKLLEKSNPKIPKIVAFETVHSMDGAICPLEELCDVSHQYGALTFVDEVH AVGLYGSRGAGIGERDGIMHKIDIISGTLGKAFGCVGGYIASTRDLVDMVRSYAAGFIFT TSLPPMVLSGALESVRLLKGEEGQALRRAHQRNVKHMRQLLMDRGLPVIPCPSHIIPIRV GNAALNSKLCDLLLSKHGIYVQAINYPTVPRGEELLRLAPSPHHSPQMMEDFVEKLLLAW TAVGLPLQDVSVAACNFCRRPVHFELMSEWERSYFGNMGPQYVTTYA >gi568815575f:54822247_55031017|GENSCAN_predicted_CDS_11|1764_bp atggtgactgcagccatgctgctacagtgctgcccagtgcttgcccggggccccacaagc 
ctcctaggcaaggtggttaagactcaccagttcctgtttggtattggacgctgtcccatc ctggctacccaaggaccaaactgttctcaaatccaccttaaggcaacaaaggctggagga gattctccatcttgggcgaagggccactgtcccttcatgctgtcggaactccaggatggg aagagcaagattgtgcagaaggcagccccagaagtccaggaagatgtgaaggctttcaag acagatctgcctagctccctggtctcagtcagcctaaggaagccattttccggtccccag gagcaggagcagatctctgggaaggtcacacacctgattcagaacaatatgcctggaaac tatgtcttcagttatgaccagtttttcagggacaagatcatggagaagaaacaggatcac acctaccgtgtgttcaagactgtgaaccgctgggctgatgcatatccctttgcccaacat ttctctgaggcatctgtggcctcaaaggatgtgtccgtctggtgtagtaatgattacctg ggcatgagccgacaccctcaggtcttgcaagccacacaggagaccctgcagcgtcatggt gctggagctggtggcacccgcaacatctcaggcaccagtaagtttcatgtggagcttgag caggagctggctgagctgcaccagaaggactcagccctgctcttctcctcctgctttgtt gccaatgactctactctcttcaccttggccaagatcctgccagggtgcgagatttactca gacgcaggcaaccatgcttccatgatccaaggtatccgtaacagtggagcagccaagttt gtcttcaggcacaatgaccctgaccacctaaagaaacttctagagaagtctaaccctaag atacccaaaattgtggcctttgagactgtccactccatggatggtgccatctgtcccctc gaggagttgtgtgatgtgtcccaccagtatggggccctgaccttcgtggatgaggtccat gctgtaggactgtatgggtcccggggcgctgggattggggagcgtgatggaattatgcat aagattgacatcatctctggaactcttggcaaggcctttggctgtgtgggcggctacatt gccagcacccgtgacttggtggacatggtgcgctcctatgctgcaggcttcatctttacc acttctctgccccccatggtgctctctggagctctagaatctgtgcggctgctcaaggga gaggagggccaagccctgaggcgagcccaccagcgcaatgtcaagcacatgcgccagcta ctcatggacaggggccttcctgtcatcccctgccccagccacatcatccccatccgggtg ggcaatgcagcactcaacagcaagctctgtgatctcctgctctccaagcatggcatctat gtgcaggccatcaactacccaactgtcccccggggtgaagagctcctgcgcttggcaccc tccccccaccacagccctcagatgatggaagattttgtggagaagctgctgctggcttgg actgcggtggggctgcccctccaggatgtgtctgtggctgcctgcaatttctgtcgccgt cctgtacactttgagctcatgagtgagtgggaacgttcctacttcgggaacatggggccc cagtatgtcaccacctatgcctga