GENSCAN 1.0 Date run: 6-Nov-116 Time: 20:18:05 Sequence gi568815592f:33355101_33556519 : 201419 bp : 48.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 2321 2316 6 1.05 1.01 Sngl - 11128 10448 681 1 0 73 36 760 0.996 65.49 1.00 Prom - 24815 24776 40 -4.86 2.03 PlyA - 26388 26383 6 1.05 2.02 Term - 36509 36304 206 0 2 105 42 105 0.926 5.13 2.01 Init - 36996 36570 427 0 1 80 80 103 0.603 3.86 2.00 Prom - 40815 40776 40 -3.06 3.02 PlyA - 43371 43366 6 1.05 3.01 Sngl - 45456 44959 498 0 0 87 47 520 0.991 43.85 3.00 Prom - 46949 46910 40 -3.16 4.00 Prom + 47719 47758 40 -9.85 4.01 Init + 48174 48267 94 2 1 40 93 8 0.610 -2.91 4.02 Intr + 48385 48435 51 2 0 78 92 73 0.938 5.58 4.03 Intr + 48629 49029 401 0 2 94 94 282 0.729 23.62 4.04 Intr + 49752 50531 780 2 0 76 53 755 0.981 62.48 4.05 Intr + 51096 51386 291 2 0 114 81 271 0.982 26.33 4.06 Intr + 51492 51565 74 2 2 54 76 89 0.983 2.50 4.07 Intr + 51700 51775 76 1 1 75 86 95 0.917 7.52 4.08 Intr + 55293 55445 153 2 0 38 64 87 0.496 1.57 4.09 Intr + 57408 57489 82 2 1 95 99 7 0.902 1.71 4.10 Intr + 57598 57693 96 2 0 96 81 26 0.821 2.68 4.11 Intr + 58096 58196 101 2 2 69 98 45 0.988 3.43 4.12 Intr + 58309 58457 149 0 2 103 64 100 0.999 8.13 4.13 Intr + 58636 58731 96 1 0 98 94 94 0.999 10.12 4.14 Intr + 58941 59009 69 0 0 76 93 49 0.756 2.50 4.15 Intr + 59143 59266 124 1 1 79 94 30 0.745 3.29 4.16 Intr + 59377 59444 68 0 2 98 117 53 0.878 6.90 4.17 Intr + 59625 59736 112 0 1 103 2 27 0.759 -4.02 4.18 Intr + 59871 60044 174 2 0 51 84 80 0.730 4.04 4.19 Intr + 60135 60229 95 2 2 98 82 139 0.889 13.06 4.20 Intr + 60490 60570 81 1 0 79 94 79 0.799 6.35 4.21 Term + 60710 60998 289 2 1 123 47 73 0.904 1.55 4.22 PlyA + 61320 61325 6 1.05 5.06 PlyA - 61470 61465 6 1.05 5.05 Term - 61676 61550 127 1 1 65 45 103 0.905 1.46 5.04 Intr - 61869 61820 50 0 2 116 89 -16 0.749 -1.22 5.03 Intr - 62042 61997 46 1 1 87 81 58 0.813 3.41 5.02 Intr - 62210 62151 60 1 0 92 84 77 0.973 5.55 5.01 Init - 62568 62381 188 0 2 28 91 294 0.968 20.23 5.00 Prom - 64273 64234 40 -5.86 6.00 Prom + 64393 64432 40 -11.14 6.01 Init + 65165 65231 67 1 1 52 94 64 0.454 3.16 6.02 Intr + 68377 68498 122 2 2 123 105 163 0.884 21.71 6.03 Intr + 70698 70803 106 2 1 87 64 126 0.996 9.89 6.04 Intr + 73491 73660 170 1 2 96 89 20 0.526 2.57 6.05 Intr + 76845 76932 88 2 1 90 66 5 0.505 -1.96 6.06 Intr + 77061 77152 92 1 2 105 109 48 0.991 8.31 6.07 Intr + 77585 77745 161 1 2 126 23 162 0.563 12.29 6.08 Intr + 80052 80205 154 0 1 86 69 78 0.679 5.77 6.09 Intr + 80415 80513 99 2 0 95 67 81 0.983 7.01 6.10 Intr + 82568 83191 624 1 0 66 113 761 0.982 68.84 6.11 Intr + 83319 83463 145 2 1 75 52 195 0.991 14.46 6.12 Intr + 83675 83819 145 0 1 83 85 192 0.999 17.74 6.13 Intr + 85629 85865 237 0 0 106 61 285 0.996 24.33 6.14 Intr + 86073 86274 202 0 1 83 82 188 0.968 16.99 6.15 Intr + 86481 86659 179 2 2 42 99 218 0.921 17.02 6.16 Intr + 87353 87394 42 2 0 109 94 77 0.995 7.76 6.17 Intr + 87789 88860 1072 0 1 89 96 704 0.922 61.53 6.18 Intr + 89344 89517 174 0 0 73 63 260 0.983 22.14 6.19 Intr + 91475 91686 212 1 2 32 94 396 0.982 32.31 6.20 Intr + 92743 92833 91 1 1 81 44 176 0.999 12.50 6.21 Intr + 94077 94200 124 2 1 122 96 -62 0.086 -1.84 6.22 Term + 99930 101422 1493 1 2 121 42 779 0.869 67.33 6.23 PlyA + 102420 102425 6 1.05 7.09 PlyA - 103071 103066 6 1.05 7.08 Term - 111338 111213 126 2 0 96 44 69 0.648 1.58 7.07 Intr - 143620 143561 60 1 0 90 71 49 0.348 2.33 7.06 Intr - 147707 147422 286 1 1 66 45 175 0.394 8.34 7.05 Intr - 157407 157341 67 1 1 105 61 8 0.098 -2.24 7.04 Intr - 164391 164265 127 0 1 121 92 10 0.032 4.95 7.03 Intr - 177759 177575 185 1 2 58 35 100 0.032 1.21 7.02 Intr - 178654 178568 87 2 0 106 101 -13 0.077 1.74 7.01 Init - 184704 184635 70 0 1 52 110 48 0.377 4.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 9827 9813 15 2 0 144 49 13 0.844 1.24 S.002 Init - 9981 9925 57 0 0 103 81 121 0.824 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_1|226_aa MSVPLLNDAATVSGAERETAVVIFLHGLGDTGHSWADALSTIRLPHVKYICSHEPRIPVT LNMKMVMPSWFDLMGLSPDAPEDEAGIKKAAENIKALIEHEMKNGIPANQIILGGFSQGR ALSLYMALTCPHPLAGILALSCWPPLHRAFPQAANGSAKDLAILQCHGELDPMVPVRFGA LMAEKLRSVVTPARVQFQTYLGVMHSSCPQEMAAVKEFLEKLLPPV >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_1|681_bp atgtctgtgcccctgctcaacgatgctgccaccgtgtctggagctgagcgggaaacggcc gtggttatttttttacatggacttggagacacagggcacagctgggctgacgccctctcc accattcggctccctcacgtcaagtacatctgttcccatgagcctaggatccctgtgacc ctcaacatgaagatggtgatgccctcctggtttgacctgatggggctgagtccagatgcc ccagaggacgaggctggcatcaagaaggcagcagagaacatcaaggccttgattgagcat gaaatgaagaacgggatccctgccaatcaaatcatcctgggaggcttttcacagggccgg gccctgtccctctacatggccctcacctgcccccaccctctggctggcatcctggctttg agctgctggccgcctctgcaccgggccttcccccaggcagctaatggcagtgccaaggac ctggccatcctccagtgccatggggagctggaccccatggtgcccgtacggtttggggcc ctgatggctgagaagctccggtctgttgtcacacctgccagggtccagttccagacatac ctgggtgtcatgcacagctcctgtcctcaggagatggcagctgtgaaggaatttcttgag aagctgctgcctcctgtctaa >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_2|210_aa MTKADREPGARQHRSGQPSSPSPGQDTCAAPTHLRIHVHLGPTRAGGMQWKRTGCRGSWD SRSLARSRSRVENSHGSGRYQPRATPATFEFQRPPPTLTALCTQGPIVAVATAEGQSQRS PPPEATPRPTGALSFLFRRRRRAAIVLDFRSAPRRRSLRQKMRGRRRTNATGQAKLVAAG GEGAGTLEERNDACANEDPCRETGGTSASW >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_2|633_bp atgacaaaggccgaccgggagccgggggcgcgacagcatcggagcggtcagccttcgtcc ccatccccagggcaggacacctgcgccgcccctactcacctgcggatccatgtccacctc ggtcccacacgcgccgggggaatgcagtggaagagaactgggtgccggggatcctgggac tcgcgttctctcgcccgctcgcgaagcagggtagagaactcgcacggctccggccgctac cagccccgcgccacacccgccacttttgaattccaacggccaccacccactctcaccgcg ctctgcacgcagggaccaatcgtcgctgtcgccacagccgagggccaatcgcagcgttct ccgccacccgaagccacaccccgcccgacaggcgccttgtcttttctgtttcgcaggcgc aggagagcggcaatagtgctggacttccgctcggctccccgccgtcgctcgctacgtcag aaaatgcgtggacgtcgccgcacgaacgcaactggccaagcgaaactggtggcggccgga ggagaaggggcggggacgctggaggaaagaaatgacgcgtgcgcaaacgaggacccgtgc cgggagacaggcgggactagcgcctcctggtga >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_3|165_aa MPPKFDPNEIKVVYLRCTRGEVGATSALAPKIGPLGLSPKKVGDDIAKATGDWKGLRITV KLTIENRQAQIEVVPSASAPIIKALKKPPRDRKKQKNIKHNGNITFDEIVNIARQMRHRS LARELSGTIKEIPGTAQSMGCNVDGHHPHDIIDDINSGAVECPAS >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_3|498_bp atgccaccgaagttcgaccccaacgagatcaaggtcgtatacctgaggtgcaccagaggt gaagtcggtgccacttctgccctggcccccaagatcggccccctgggtctgtctccaaaa aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg aaactgaccattgagaacagacaggcccagattgaggtggtgccttctgcctctgccccg atcatcaaagccctcaagaaaccaccaagagacagaaagaaacagaaaaacattaaacac aatgggaatatcacttttgatgagatcgtcaacattgctcgacagatgcggcaccgatcc ttagccagagaactctctggaaccattaaagagattccggggactgcccagtctatgggc tgtaatgttgatggccaccaccctcatgacatcatagatgacatcaacagtggtgctgtg gaatgcccagctagttaa >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_4|1151_aa MIILPLLSPISWAAQKVSKKTGPRCSTAIATGLKNQKPVPAVPVQKSGTSGVPPMAGGKK PSKRPAWDLKGQLCDLNAELKRCRERTQTLDQENQQLQDQLRDAQQQVKALGTERTTLEG HLAKVQAQAEQGQQELKNLRACVLELEERLSTQEGLVQELQKKQVELQEERRGLMSQLEE KERRLQTSEAALSSSQAEVASLRQETVAQAALLTEREERLHGLEMERRRLHNQLQELKGN IRVFCRVRPVLPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSDERRGTLSGAPAPPTRH DFSFDRVFPPGSGQDEVFEEIAMLVQSALDGYPVCIFAYGQTGSGKTFTMEGGPGGDPQL EGLIPRALRHLFSVAQELSGQGWTYSFVASYVEIYNETVRDLLATGTRKGQGGECEIRRA GPGSEELTVTNARYVPVSCEKEVDALLHLARQNRAVARTAQNERSSRSHSVFQLQISGEH SSRGLQCGAPLSLVDLAGSERLDPGLALGPGERERLRETQAINSSLSTLGLVIMALSNKE SHVPYRNSKLTYLLQNSLGGSAKMLMFVNISPLEENVSESLNSLRFASKPWKHGAGRPRE AARTPADSLAPTADGRRPVKEGLSPPRCRAQEGTRVRKRAVDSAREVCLVQFEDDSQFLV LWKDISPAALPGEELLCCVCRSETVVPGNRLVSCEKCRHAYHQDCHVPRAPAPGEGEGTS WVCRQCVFAIATKRGGALKKGPYARAMLGMKLSLPYGLKGLDWDAGHLSNRQQSYCYCGG PGEWNLKMLQCRSCLQWFHEACTQCLSKPLLYGDRFYEFECCVCRGGPEKVRRLQLRWVD VAHLVLYHLSVCCKKKYFDFDREILPFTSENWDSLLLGELSDTPKGERSSRLLSALNSHK DRFISGREIKKRKCLFGLHARMPPPVEPPTGDGALTRSLGPGGGVSRPLGKRRRPEPEPL RRRQKGKVEELGPPSAVRNQPEPQEQRERAHLQRALQASVSPPSPSPNQSYQGSSGYNFR PTDARCLPSSPIRMFASFHPSASTAGTSGDSGPPDRSPLELHIGFPTDIPKSAPHSMTAS SSSVSSPSPGLPRRSAPPSPLCRSLSPGTGGGVRGGVGYLSRGDPVRVLARRVRPDGSVQ YLVEWGGGGIF >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_4|3456_bp atgatcattctacctttgctctctcccatctcctgggcagctcaaaaagtttccaagaag acaggaccccggtgttccacagctattgccacagggttgaagaaccagaagccagttcct gctgttcctgtccagaagtctggcacatcaggtgttcctcccatggcaggagggaagaaa cccagcaaacgtccagcctgggacttaaagggtcagttatgtgacctaaatgcagaacta aaacggtgccgtgagaggactcaaacgttggaccaagagaaccagcagcttcaggaccag ctcagagatgcccagcagcaggtcaaggccctggggacagagcgcacaacactggagggg catttagccaaggtacaggcccaggctgagcagggccaacaggagctgaagaacttgcgt gcttgtgtcctggagctggaagagcggctgagcacgcaggagggcttggtgcaagagctt cagaaaaaacaggtggaattgcaggaagaacggaggggactgatgtcccaactagaggag aaggagaggaggctgcagacatcagaagcagccctgtcaagcagccaagcagaggtggca tctctgcggcaggagactgtggcccaggcagccttactgactgagcgggaagaacgtctt catgggctagaaatggagcgccggcgactgcacaaccagctgcaggaactcaagggcaac atccgtgtattctgccgggtccgccctgtcctgccgggggagcccactccaccccctggc ctcctcctgtttccctctggccctggtgggccctctgatcctccaacccgccttagcctc tcccggtctgacgagcggcgtgggaccctgagtggggcaccagctcccccaactcgccat gatttttcctttgaccgggtattcccaccaggaagtggacaggatgaagtgtttgaagag attgccatgcttgtccagtcagccctggatggctatccagtatgcatctttgcctatggc cagacaggcagtggcaagaccttcacaatggagggtgggcctgggggagacccccagttg gaggggctgatccctcgggccctgcggcacctcttctctgtggctcaggagctgagtggt cagggctggacctacagctttgtagcaagctacgtagagatctacaatgagactgtccgg gacctgctggccactggaacccggaagggtcaagggggcgagtgtgagattcgccgtgca gggccagggagtgaggagctcactgtcaccaatgctcgatatgtccctgtctcctgtgag aaagaagtggacgccctgcttcatctggcccgccagaatcgggctgtggcccgcacagcc cagaatgaacggtcatcacgcagccacagtgtattccagctacagatttctggggagcac tccagccgaggcctgcagtgtggggcccccctcagtcttgtggacctggccgggagtgag cgacttgaccccggcttagccctcggccccggggagcgggaacgccttcgggaaacacag gccattaacagcagcctgtccacgctggggctggttatcatggccctgagcaacaaggag tcccacgtgccttaccggaacagcaaactgacctacctgctgcagaactctctgggtggt agtgctaagatgctcatgtttgtgaacatttctccactggaagagaacgtctccgagtcc ctcaactctctacgctttgcctccaagccctggaagcacggggcgggacgtccacgggaa gcggcgcgcacgcccgccgactccctcgcgccaaccgccgacggccgccgcccggtgaag gaggggctcagtcctcccaggtgccgcgcgcaggaggggacacgcgtgcgcaaaagggcg gtggacagtgctagggaggtgtgtctggtccagtttgaggatgattcgcagtttctggtt ctatggaaagacattagccctgctgccctccctggagaggaactcctctgttgtgtctgt cgctctgagactgtggtccctgggaaccggctggtcagctgtgagaagtgtcgccatgct tatcaccaggactgccatgttcccagggctccagcccctggagagggagagggcacatcc tgggtatgccgccagtgtgtctttgcgatcgccaccaagaggggaggtgccctgaagaag ggcccctatgcccgggccatgctgggtatgaagctttctctgccatatggactgaagggg ctggactgggatgctggacatctgagcaaccgacagcagagttactgttactgtggtggc cctggggagtggaacctgaaaatgctgcagtgccggagctgcctgcagtggttccatgag gcctgcacccagtgtctgagcaagcccctcctctatggggacaggttctatgaatttgaa tgctgtgtgtgtcgcgggggccctgagaaagtccggagactacagcttcgctgggtggat gtggcccatcttgtcctgtatcacctcagtgtttgctgtaagaagaaatactttgatttt gatcgtgagatcctccccttcacttctgagaattgggacagtttgctcctgggggagctt tcagacacccccaaaggagaacgttcttccaggctcctctctgctcttaacagccacaag gaccgtttcatttcagggagagagattaagaagaggaaatgtttgtttggtctccatgct cggatgcctccccctgtggagccccctactggagatggagcactcaccaggtcactgggc cctgggggaggggtctcacgtcccctggggaagcgccggaggccggagccagagcccctg aggaggaggcagaaggggaaagtggaggagctggggccaccctcagcagtgcgcaatcag cccgagccccaggagcagagggagcgggctcatctgcagagggcactgcaggcctcagtg tctccaccatcccccagccctaaccagagttaccagggcagcagcggctacaacttccgg cccacagatgcccgctgcctgcccagcagccccatccggatgtttgcttccttccaccct tctgccagcaccgcagggacctctggggacagtggacccccagacaggtcacccctggaa cttcacattggtttccccacagacatccctaaaagtgccccccactcgatgactgcctca tcttcctcagtttcatccccatccccaggtcttcctagacgctcagcacccccttctccc ctgtgccgtagtttgtctcctgggactgggggaggagtccgaggtggggttggttacctg tcccgaggggaccctgtccgggtccttgctcggagagtacggcctgatggctctgtgcag tacctggttgagtggggaggagggggcatcttctga >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_5|156_aa MPALLPVASRLLLLPRVLLTMASGSPPTQPSPASDSGSGYVPGSVSAAFVTCPNEKVAKE IARAVVEKRLAACVNLIPQITSIYEWKGKIEEDSEVLMMIKTQSSLVPALTDFVRSVHPY EVAEVIALPVEQGNFPYLQWVRQVTESVSDSITVLP >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_5|471_bp atgccggcgctgctgcctgtggcctcccgccttttgttgctaccccgagtcttgctgacc atggcctctggaagccctccgacccagccctcgccggcctcggattccggctctggctac gttccgggctcggtctctgcagcctttgttacttgccccaacgagaaggtcgccaaggag atcgccagggccgtggtggagaagcgcctagcagcctgcgtcaacctcatccctcagatt acatccatctatgagtggaaagggaagatcgaggaagacagtgaggtgctgatgatgatt aaaacccaaagttccttggtcccagctttgacagattttgttcgttctgtgcacccttac gaagtggccgaggtaattgcattgcctgtggaacaggggaactttccgtacctgcagtgg gtgcgccaggtcacagagtcagtttctgactctatcacagtcctgccatga >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_6|1932_aa MSRSRASIHRGSIPAMSYAPFRDVRGPSMHRTQYVHSPYDRPGWNPRFCIISGNQLLMLD EDEIHPLLIRDRRSESSRNKLLRRTVSVPVEGRPHGEHGGCWASHGLVGQSAALLVHVTS DPEHAYPWLGLTSCQASVCGNVWPYEQDFLCPWWQGAPAQVPCPLLPAASLSAVAALPAA FRGVEYHLGRSRRKSVPGGKQYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKSQPKLDRTS SFRQILPRFRSADHDRYRGWSMWDEIDVMARLMQSFKESHSHESLLSPSSAAEALELNLD EDSIIKPVHSSILGQEFCFEVTTSSGTKCFACRSAAERDKWIENLQRAVKPNKDNSRRVD NVLKLWIIEARELPPKKRYYCELCLDDMLYARTTSKPRSASGDTVFWGEHFEFNNLPAVR ALRLHLYRDSDKKRKKDKAGYVGLVTVPVATLAGRHFTEQWYPVTLPTGSGGSGGMGSGG GGGSGGGSGGKGKGGCPAVRLKARYQTMSILPMELYKEFAEYVTNHYRMLCAVLEPALNV KGKEEVASALVHILQSTGKAKDFLSDMAMSEVDRFMEREHLIFRENTLATKAIEEYMRLI GQKYLKDAIGEFIRALYESEENCEVDPIKCTASSLAEHQANLRMCCELALCKVVNSHCVF PRELKEVFASWRLRCAERGREDIADRLISASLFLRFLCPAIMSPSLFGLMQEYPDEQTSR TLTLIAKVIQNLANFSKFTSKEDFLGFMNEFLELEWGSMQQFLYEISNLDTLTNSSSFEG YIDLGRELSTLHALLWEVLPQLSKEALLKLGPLPRLLNDISTALRNPNIQRQPSRQSERP RPQPVVLRGPSAEMQGYMMRDLNSSIDLQSFMARGLNSSMDMARLPSPTKEKPPPPPPGG GKDLFYVSRPPLARSSPAYCTSSSDITEPEQKMLSVNKSVSMLDLQGDGPGGRLNSSSVS NLAAVGDLLHSSQASLTAALGLRPAPAGRLSQGSGSSITAAGMRLSQMGVTTDGVPAQQL RIPLSFQNPLFHMAADGPGPPGGHGGGGGHGPPSSHHHHHHHHHHRGGEPPGDTFAPFHG YSKSEDLSSGVPKPPAASILHSHSYSDEFGPSGTDFTRRQLSLQDNLQHMLSPPQITIGP QRPAPSGPGGGSGGGSGGGGGGQPPPLQRGKSQQLTVSAAQKPRPSSGNLLQSPEPSYGP ARPRQQSLSKEGSIGGSGGSGGGGGGGLKPSITKQHSQTPSTLNPTMPASERTVAWVSNM PHLSADIESAHIEREEYKLKEYSKSMDESRLDRVKEYEEEIHSLKERLHMSNRKLEEYER RLLSQEEQTSKILMQYQARLEQSEKRLRQQQAEKDSQIKSIIGRLMLVEEELRRDHPAMA EPLPEPKKRLLDAQCMSVTPHFTRASLGAGGGFVNGVEAMMGWRILAIGAVLTAAAFIPR GVYPQALLLFPILVTFEEAMETPTPLPPVPASPTCNPAPRTIQIEFPQHSSSLLESLNRH RLEGKFCDVSLLVQGRELRAHKAVLAAASPYFHDKLLLGDAPRLTLPSVIEADAFEGLLQ LIYSGRLRLPLDALPAHLLVASGLQMWQVVDQCSEILRELETSGGGISARGGNSYHALLS TTSSTGGWCIRSSPFQTPVQSSASTESPASTESPVGGEGSELGEVLQIQVEEEEEEEEDD DDEDQGSATLSQTPQPQRVSGVFPRPHGPHPLPMTATPRKLPEGESAPLELPAPPALPPK IFYIKQEPFEPKEEISGSGTQPGGAKEETKVFSGGDTEGNGELGFLLPSGPGPTSGGGGP SWKPVDLHGNEILSGGGGPGGAGQAVHGPVKLGGTPPADGKRFGCLCGKRFAVKPKRDRH IMLTFSLRPFGCGICNKRFKLKHHLTEHMKTHAGALHACPHCGRRFRVHACFLRHRDLCK GQGWATAHWTYK >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_6|5799_bp atgagcaggtctcgagcctccatccatcgggggagcatccccgcgatgtcctatgccccc ttcagagatgtacggggaccctctatgcaccgaacccaatacgttcattccccgtatgat cgtcctggttggaaccctcggttctgcatcatctcggggaaccagctgctcatgctggat gaggatgagatacaccccctactgatccgggaccggaggagcgagtccagtcgcaacaaa ctgctgagacgcacagtctccgtgccggtggaggggcggccccacggcgagcatgggggc tgttgggccagccacgggcttgtggggcagtccgcggccttgctggtccatgtcacctct gaccctgagcatgcatatccctggctggggctcacctcctgtcaggcttctgtgtgtggg aatgtgtggccctatgagcaggattttctgtgtccctggtggcagggggctcctgctcag gttccttgccccctccttcccgctgccagcctctccgccgtcgctgctcttcctgctgct ttccggggggtagaataccacttgggtcgctcgaggaggaagagtgtcccaggggggaag cagtacagcatggagggtgcccctgctgcgcccttccggccctcgcaaggcttcctgagc cgacggctaaaaagctccatcaaacgaacgaagtcacaacccaaacttgaccggaccagc agctttcgccagatcctgcctcgcttccgaagtgctgaccatgaccggtacaggggctgg agcatgtgggatgagattgatgtaatggcccggctgatgcaaagctttaaggagtcacac tctcatgagtccttgctgagtcctagcagtgcagctgaggcattggagctcaacttggat gaagattccattatcaagccagtgcacagctccatcctgggccaggagttctgttttgag gtaacaacttcatcaggaacaaaatgctttgcctgtcggtctgcggccgaaagagacaaa tggattgagaatctgcagcgggcagtaaagcccaacaaggacaacagccgccgggtagac aatgtgctaaagctgtggatcatagaggcccgggagctgccccccaagaagcggtactac tgtgagctctgcctggatgacatgctgtatgcacgcaccacctccaagccccgctctgcc tctggggacaccgtcttctggggcgagcacttcgagtttaacaacctgccggctgtccgt gccctgcggctgcatctgtaccgtgactcagacaaaaagcgcaagaaggacaaggcaggc tatgtcggcctggtgactgtgccagtggccaccctggctgggcgccacttcacagagcag tggtaccctgtaaccctgccaacaggcagtgggggatctgggggcatgggttcgggaggg ggagggggctcggggggtggctcagggggcaagggcaaaggaggttgcccggctgtgcgg ctgaaagcacgttaccagacaatgagcatcttgcccatggagctatataaagagtttgca gagtatgtcaccaaccattatcggatgctgtgtgcagtcttggagcccgccctgaatgtc aaaggcaaggaggaggttgccagtgcactagttcacatcctgcagagtacaggcaaggcc aaggacttcctttcagacatggccatgtctgaggtagaccggttcatggaacgggagcac ctcatattccgcgagaacacgcttgccactaaagccatagaagagtatatgagactgatt ggtcagaaatacctcaaggatgccattggagaattcatccgtgctctgtatgaatctgag gaaaactgcgaggtagaccctatcaagtgcacagcatccagtttggcagagcaccaggcc aacctgcgaatgtgctgtgagttggccctgtgcaaggtggtcaactcccactgcgtgttc ccgagggagctgaaggaggtgtttgcttcgtggcggctgcgctgcgcagagcgaggccgg gaggacatcgcagacaggcttatcagcgcctcactcttcctgcgcttcctctgcccagcg attatgtcgcccagtctctttgggcttatgcaggagtacccagatgagcagacctcacga accctcaccctcattgccaaggtcatccagaacctggccaacttttccaagtttacctca aaggaggactttctgggcttcatgaatgagtttctggagctggaatggggttccatgcag cagtttttgtatgagatctccaatctggacacgctaaccaacagcagtagctttgagggt tacatcgacttgggccgagagctctccacactgcatgccctactctgggaggtgctgccc cagctcagcaaggaagccctcctgaagctgggtccactgccccggctcctcaacgacatc agcacagctctgaggaaccccaacatccaaaggcagccaagccgccagagtgagcggccc cggcctcagcctgtggtactgcgggggccatcggctgagatgcagggctacatgatgcgg gacctcaacagctccatcgaccttcagtccttcatggctcgaggcctcaacagctctatg gacatggctcgcctcccctccccaaccaaggaaaagccacccccaccaccgcctggtggt ggtaaagacctgttctatgtaagccgtccacccctggcccgttcctcaccagcatactgc acgagcagctcggacatcacagagccagagcagaagatgctgagtgtcaacaagagtgtg tccatgctggacttacagggtgatgggcctggtggccgcctcaacagcagcagtgtttcg aacctggcggccgtaggggacctgctgcactcaagccaggcctcgctgacagcagccttg gggctacggcctgcgcctgccggacgcctctcccaggggagtggctcatccatcacggcg gctggcatgcgcctcagccagatgggtgtcaccacagacggtgtccctgcccagcaactg cgaatccccctctccttccagaaccctctcttccacatggctgctgatgggccaggtccc ccaggcggccatggagggggcggtggccatggcccaccttcctcccatcaccaccaccac caccatcaccaccaccgaggtggagagccccctggggacacctttgccccattccatggc tatagcaagagtgaggacctctcttccggggtccccaagccccctgctgcctccatcctt catagccacagctacagtgatgagtttggaccctctggcactgacttcacccgtcggcag ctttcactccaggacaacctgcagcacatgctgtcccctccccagatcaccattggtccc cagaggccagccccctcagggcctggaggtgggagcggtgggggcagcggtgggggtggc gggggccagccgcctccattgcagaggggcaagtctcagcagttgacagtcagcgcagcc cagaaaccccggccatccagcgggaatctattgcagtccccagagccaagttatggcccc gcccgtccacggcaacagagcctcagcaaggagggcagcattgggggcagcgggggcagc ggtggcggagggggtggggggctgaagccctccatcaccaagcagcattctcagacacca tccacattgaaccccacaatgccagcctctgagcggacagtggcctgggtctccaacatg cctcacctgtcggctgacatcgagagtgcccacatcgagcgggaagagtacaagctcaag gagtactcaaaatcgatggatgagagccggctggatagggtgaaggagtacgaggaggag attcactcactgaaagagcggctgcacatgtccaaccggaagctggaagagtatgagcgg aggctgctgtcccaggaagaacaaaccagcaaaatcctgatgcagtatcaggcccgactg gagcagagtgagaagaggctaaggcagcagcaggcagagaaggattcccagatcaagagc atcattggcaggctgatgctggtggaggaggagctgcgccgggaccaccccgccatggct gagccgctgccagaacccaagaagaggctgctcgacgctcagtgtatgtctgtcaccccc catttcaccagagcgtccttaggggctgggggtgggtttgttaatggggtggaggcaatg atgggttggaggatcttggctataggggctgtgctgactgcagcagccttcatcccgcgt ggagtctacccccaagcccttctcctcttcccaattcttgtcaccttcgaggaggccatg gaaaccccaacacctttgccgcctgtacccgcctccccgacctgcaacccagccccacgg acaatccagatcgagttcccacagcatagctcgtctctgctggaatctctgaaccgccac aggctagagggaaagttctgtgatgtgtccctcctggtgcagggccgggaacttagggct cataaagcagtgttagctgctgcctctccttacttccatgacaagctgcttctgggggat gcgcctcgtctcactctaccgagtgtcattgaagccgatgccttcgaggggctgctccag ctcatttattcagggcgtctccgcctgccactggatgctcttcctgctcatctccttgtg gccagtggccttcaaatgtggcaggtagtagatcagtgctcagaaattcttagagaatta gaaacttcaggtggtggaatttcagcccgtggaggaaactcctaccatgcccttctttcc actacatcctctacaggaggctggtgcattcgctcttcgcctttccagaccccagtacag tcctctgcttctactgaaagccctgcttccactgagagccctgtgggaggggagggaagt gaactgggagaagtgctgcaaattcaggtggaagaagaagaggaggaggaggaagatgat gatgatgaggaccaggggtcagccacactctctcagactcctcagccccagagagtatca ggggtttttccccgtcctcatggaccccacccactgcccatgactgctactccccgaaag cttccagagggtgagagtgcaccacttgagcttcctgcccctcctgcactgccccccaaa atcttctacattaagcaggaacccttcgagcctaaggaggagatatcaggaagcggaact cagcctggaggagcaaaggaggaaaccaaagtgttttctggaggggacactgaagggaat ggggagctagggttcttgttgccttcagggccagggccaacatctgggggagggggtcca tcctggaaaccagtggatcttcatgggaatgaaatcctgtcagggggtggaggacctggg ggagcaggccaggccgtgcatgggcctgtgaagctaggggggacaccccctgcagatgga aaacgctttggttgcctgtgtgggaagcggtttgcagtgaagccaaagcgtgaccggcac atcatgctgaccttcagccttcggccttttggctgtggcatctgcaacaagcgcttcaag ctgaagcaccatctgacagagcacatgaagacccatgctggagccctgcatgcctgtccc cactgtggccgtcggttccgagtccatgcctgttttctccgccaccgggacctatgcaag ggccagggctgggccactgcccactggacttacaagtga >gi568815592f:33355101_33556519|GENSCAN_predicted_peptide_7|335_aa MNGKGLKESLPAAQIDFARGITTGGTTGTCHCVRSWWVLGLTDLKNEAADPHDSGAQLAS PSGSRTGAAGGAACQSCAVRSHSSALGWSMGLAALEQGVVLVREARAAQEPMEWRYEQNF IFSNLNQSSCIGMDGLRERSWYQICRQLGTEVFTCMAHNTAGDGEGPTLGCSPGVTEDEA SGAGRPLRVWGHTHLELALARDRLPRLSLHIFLQAEGAGSDLGQPREGLPQCGGGLKGSS STVRVGVEAEQTPRASEGCQHDVTSHQHFGKPRQVHREAKGEEDEEDLYLVLEQAQHHME ATKAWGFHPLKPQPKLYIGPFQSWLELLGHRTPGP >gi568815592f:33355101_33556519|GENSCAN_predicted_CDS_7|1008_bp atgaatggaaaaggcttgaaggaaagcctgccagccgctcagattgactttgcaaggggg atcaccacaggtgggactacaggcacatgccactgtgtccggagttggtgggttcttggt ctgactgacttgaagaacgaagccgcggaccctcatgactcaggagcccagctggcttca cccagtggatcccgcactggggctgcaggtggagctgcctgccagtcctgcgccgtgcgc tcgcactcctcagcccttgggtggtcaatgggactggccgccctggagcagggggtggtg ctcgtccgggaggctcgggcggcacaggagcccatggagtggcgttatgaacagaacttt attttttccaacctaaaccaatcatcctgcattgggatggatggattgagggaaagatca tggtaccagatctgcaggcagctggggacagaagtcttcacttgcatggcccataacact gcgggggatggagaagggccgaccctcggatgctcacctggagtaacggaggatgaggcc agtggcgctggccggccactccgagtgtggggccacacccacctggaactcgcgctggcc cgtgatcgtctcccacgcctctccctccacatcttcctgcaagcagagggagccggctcc gacctcggccagcccagagaggggctcccacagtgcggcggcgggctgaagggctcttca agcacggtcagagtgggcgtcgaggccgagcagacaccaagagcgagcgagggctgccag cacgatgtcacctctcaccagcactttgggaagccaaggcaggtacacagagaagcaaag ggtgaagaagatgaagaggacctttatttagtgttagaacaggctcaacaccacatggaa gctaccaaggcttggggcttccaccctctgaagccacagcccaagctgtacattggcccc tttcagtcatggctggagttgctgggacacaggacaccaggcccctag