GENSCAN 1.0    Date run: 8-Nov-116    Time: 07:27:33

Sequence gi568815591f:100331426_100533551 : 202126 bp : 50.15% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.03  Intr -   1156   1017  140  0  2   50  111    93 0.210   7.91
1.02  Intr -   4938   4653  286  1  1   37   70   199 0.010   9.40
1.01  Init -  11561  11444  118  2  1  111   92    37 0.148   6.77
1.00  Prom -  17719  17680   40                              -4.46

2.00  Prom +  18063  18102   40                              -5.26
2.01  Init +  20996  21053   58  1  1  104  109   168 0.976  19.97
2.02  Intr +  21163  21489  327  2  0  -99   75   342 0.838   9.87
2.03  Intr +  21572  21698  127  0  1   76   97    69 0.690   6.34
2.04  Intr +  25325  25458  134  2  2   91   96    24 0.207   3.79
2.05  Intr +  26795  26941  147  0  0   84  105    96 0.119  11.11
2.06  Intr +  27265  27654  390  2  0  128   96   279 0.965  27.40
2.07  Intr +  27912  28112  201  1  0   76  115    18 0.061   2.56
2.08  Intr +  41223  41250   28  1  1   86   89     8 0.076  -2.13
2.09  Intr +  42171  42295  125  0  2  104   75   117 0.899  12.23
2.10  Intr +  42619  43008  390  2  0  122   96   251 0.027  24.00
2.11  Intr +  58463  58681  219  0  0   93  115    27 0.361   4.17
2.12  Intr +  66454  66487   34  2  1  126  109    27 0.813   5.88
2.13  Intr +  67866  67915   50  0  2  104  113    31 0.836   5.52
2.14  Intr +  68156  68187   32  0  2   82  121     8 0.863   1.35
2.15  Term +  68360  68482  123  1  0   90   41    60 0.909  -0.12
2.16  PlyA +  68650  68655    6                               1.05

3.14  PlyA -  69467  69462    6                               1.05
3.13  Term -  69911  69589  323  0  2   66   37   315 0.980  19.18
3.12  Intr -  71151  71091   61  0  1  118   99    25 0.995   4.91
3.11  Intr -  72360  72269   92  1  2  112   61    59 0.990   5.31
3.10  Intr -  72819  72753   67  0  1   86   97    24 0.990   1.58
3.09  Intr -  73668  73588   81  0  0   74   75    74 0.892   4.53
3.08  Intr -  75373  75269  105  1  0   45   89    74 0.661   3.61
3.07  Intr -  75878  75803   76  1  1   98  117   -49 0.716  -1.48
3.06  Intr -  77237  77114  124  0  1   94   58   155 0.296  12.74
3.05  Intr -  78119  78003  117  0  0  122   75    19 0.948   4.44
3.04  Intr -  84672  84550  123  1  0  126   97    70 0.981  12.26
3.03  Intr -  87764  87686   79  2  1  112   47    84 0.600   5.82
3.02  Intr -  88372  88205  168  1  0   59   76    82 0.501   4.24
3.01  Init -  96949  96731  219  1  0   85   63   193 0.821  15.13
3.00  Prom -  97587  97548   40                              -7.96

4.00  Prom +  97983  98022   40                              -7.36
4.01  Init +  98603 100264 1662  1  0   91  107  1176 0.653 111.67
4.02  Intr + 101494 101712  219  0  0  116   79   263 0.998  26.70
4.03  Intr + 101838 101964  127  2  1   99  116   199 0.999  24.05
4.04  Term + 102077 102129   53  0  2  131   42    59 0.999   3.19
4.05  PlyA + 102662 102667    6                               1.05

5.05  PlyA - 103862 103857    6                               1.05
5.04  Term - 104108 103935  174  2  0   63   41   109 0.643   1.46
5.03  Intr - 104342 104206  137  0  2   98   59   219 0.997  20.29
5.02  Intr - 104639 104423  217  2  1   97   72   391 0.950  36.48
5.01  Init - 104949 104716  234  0  0   76   99   153 0.496  13.40
5.00  Prom - 107237 107198   40                              -7.46

6.08  PlyA - 108235 108230    6                               1.05
6.07  Term - 112124 111956  169  1  1   68   42   170 0.965   7.75
6.06  Intr - 114960 114788  173  0  2   31  105    95 0.474   4.24
6.05  Intr - 132261 131938  324  0  0  119   82   229 0.943  21.37
6.04  Intr - 135743 135579  165  2  0   66  -17   297 0.066  17.16
6.03  Intr - 136175 136127   49  1  1  115   95    72 0.931   9.28
6.02  Intr - 143015 142849  167  2  2  119   82   160 0.960  17.26
6.01  Init - 146613 145852  762  0  0   94  110   437 0.999  41.20
6.00  Prom - 150433 150394   40                              -6.36

7.00  Prom + 150803 150842   40                              -9.55
7.01  Init + 153887 153954   68  1  2   69   94    89 0.975   8.14
7.02  Intr + 155396 155757  362  2  2  116   94   303 0.998  28.56
7.03  Intr + 156727 158241 1515  2  0  107   79   971 0.996  87.71
7.04  Intr + 159089 159304  216  0  0   51   51   128 0.610   3.88
7.05  Intr + 159561 159670  110  1  2  132   94    90 0.999  14.00
7.06  Term + 162221 162478  258  1  0   50   54   386 0.753  26.95
7.07  PlyA + 163680 163685    6                               1.05

8.00  Prom + 176339 176378   40                              -6.66
8.01  Init + 181490 181594  105  1  0   74   80    98 0.734   7.82
8.02  Term + 186228 186374  147  2  0   78   43    97 0.789   2.10
8.03  PlyA + 188593 188598    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -   5027   4422  606  2  0   67   54   260 0.906  14.70
S.002 Term +  27695  27750   56  0  2  110   54    38 0.895   0.32
S.003 Term +  42619  43118  500  2  2  122   38   333 0.960  26.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_1|182_aa
MDVGWDLGEDCWQEHLHTAFSFDCWPCSQNGDRVPSVNPVGGGAPRSYLQVASAECWAAA
PAVHVGEPVHAGGLHTERGADPVIGLYLVHRGGACQTPTVGNRQTPTLGIHARPRRRATT
SLLTLLLAFGKNAVRTEPAKAIKPIDRKSVHQICSGPVVPSLSTAVKELVENSLDAGATN
IX

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_1|546_bp
atggatgttggctgggacctcggtgaggactgctggcaagaacacctacacacggccttt
tcctttgactgctggccttgctcacagaatggtgaccgggttcccagtgtgaacccagtg
ggcgggggggccccgcgctcctacctgcaagtggccagtgccgagtgctgggccgccgct
cctgccgtgcatgttggggagccagtacatgcaggtgggctccacacggagaggggcgcc
gaccccgtgatagggctttacctggtacatcggggtggcgcgtgccagacaccaacggtc
ggaaaccgccagacaccaacgctcggaatccacgccaggccacgacggagggcgactacc
tcccttctgaccctgctgctggcgttcggaaaaaacgcagtccgtacagaacctgctaag
gccatcaaacctattgatcggaagtcagtccatcagatttgctctgggccggtggtaccg
agtctaagcactgcggtgaaggagttagtagaaaacagtctggatgctggtgccactaat
attgnn

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_2|794_aa
MGRRTLALPWVLLTLRVTAVRMEATELSSFTIRCGFLESGSISLVTVSRGGPDGAGGTTL
AVLHPELGIQQWAPARQARWETQSSISLILEGSGASSPCANTTFCCKFASFPEGSWEACG
SLPPSSDPGLSVPPTPAPILRADLAGILGVSGVLLFDCGYLLHLLCRQKHREASSGSRTC
GRRTSLCTSAKSSWTYRSGRLSWQSIKGTHLTITQALRQPLHRAPLLPGQLCWSPRPLEK
NKAMGRPLLLPLLLLLQPPAFLQPGGSTGSGPSYLYGVTQPKHLSASMGGSVEIPFSFYY
PWELAIVPNVRISWRRGHFHGQSFYSTRPPSIHKDYVNRLFLNWTEGQESGFLRISNLRK
EDQSVYFCRVELDTRRSGRQQLQSIKGTKLTITQAVTTTTTWRPSSTTTIAGLRVTESKG
HSESWHLSLDTAIRVALAVAVLKTVILGLLCLLLLWWRRRKGSFWISQGLRAPLLPGRLC
WSPRPLEKNKAMGRPLLLPLLPLLLPPAFLQPSGSTGSGPSYLYGVTQPKHLSASMGGSV
EIPFSFYYPWELATAPDVRISWRRGHFHRQSFYSTRPPSIHKDYVNRLFLNWTEGQKSGF
LRISNLQKQDQSVYFCRVELDTRSSGRQQWQSIEGTKLSITQAVTTTTQRPSSMTTTWRL
SSTTTTTGLRVTQGKRRSDSWHISLETAVGVAVAVTVLGIMILGLICLLRWRRRKGQQRT
KATTPAREPFQNTEEPYENIRNEGQNTDPKLNPKDDGIVYASLALSSSTSPRAPPSHRPL
KSPQNETLYSVLKA

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_2|2385_bp
atggggcgccggaccctggccctgccctgggtgctgctgaccctgcgtgtcactgcagtt
cggatggaggccaccgagctctcgtccttcaccatccgttgtgggttcctggagtctggc
tccatctccctggtgactgtgagcagggggggccccgatggtgctggggggaccacgctg
gctgtgttgcacccggaacttggcatccagcaatgggcccctgctcgccaggcccgctgg
gaaacccagagcagcatctctctcatcctggaaggctctggggccagcagcccctgcgcc
aacaccaccttctgctgcaagtttgcgtccttccctgagggctcctgggaggcctgtggg
agcctcccgcccagctcagacccagggctctctgtcccgccgactcctgcccccattctg
cgggcagacctggccgggatcttgggggtctcaggagtccttctctttgactgtggctac
ctccttcatctgctgtgccgacagaagcaccgggaggcttcctcaggatctcgaacctgc
ggaaggaggaccagtctgtgtacttctgccaagtccagctggacatacagatcagggagg
ctgtcgtggcagtccatcaaggggacccacctcaccatcacccaggccctcaggcagccc
ctccacagggcccctctcctgcctggacagctctgctggtctccccgtcccctggagaag
aacaaggccatgggtcggcccctgctgctgcccctgctgctcctgctgcagccgccagca
tttctgcagcctggtggctccacaggatctggtccaagctacctttatggggtcactcaa
ccaaaacacctctcagcctccatgggtggctctgtggaaatccccttctccttctattac
ccctgggagttagccatagttcccaacgtgagaatatcctggagacggggccacttccac
gggcagtccttctacagcacaaggccgccttccattcacaaggattatgtgaaccggctc
tttctgaactggacagagggtcaggagagcggcttcctcaggatctcaaacctgcggaag
gaggaccagtctgtgtatttctgccgagtcgagctggacacccggagatcagggaggcag
cagttgcagtccatcaaggggaccaaactcaccatcacccaggctgtcacaaccaccacc
acctggaggcccagcagcacaaccaccatagccggcctcagggtcacagaaagcaaaggg
cactcagaatcatggcacctaagtctggacactgccatcagggttgcattggctgtcgct
gtgctcaaaactgtcattttgggactgctgtgcctcctcctcctgtggtggaggagaagg
aaaggaagcttctggatcagccaagggctcagggcccctctcctgcctggacggctctgc
tggtctccccgtcccctggagaagaacaaggccatgggtcggcccctgctgctgccccta
ctgcccttgctgctgccgccagcatttctgcagcctagtggctccacaggatctggtcca
agctacctttatggggtcactcaaccaaaacacctctcagcctccatgggtggctctgtg
gaaatccccttctccttctattacccctgggagttagccacagctcccgacgtgagaata
tcctggagacggggccacttccacaggcagtccttctacagcacaaggccgccttccatt
cacaaggattatgtgaaccggctctttctgaactggacagagggtcagaagagcggcttc
ctcaggatctccaacctgcagaagcaggaccagtctgtgtatttctgccgagttgagctg
gacacacggagctcagggaggcagcagtggcagtccatcgaggggaccaaactctccatc
acccaggctgtcacgaccaccacccagaggcccagcagcatgactaccacctggaggctc
agtagcacaaccaccacaaccggcctcagggtcacacagggcaaacgacgctcagactct
tggcacataagtctggagactgctgtgggggtggcagtggctgtcactgtgctcggaatc
atgattttgggactgatctgcctcctcaggtggaggagaaggaaaggtcagcagcggact
aaagccacaaccccagccagggaacccttccaaaacacagaggagccatatgagaatatc
aggaatgaaggacaaaatacagatcccaagctaaatcccaaggatgacggcatcgtctat
gcttcccttgccctctccagctccacctcacccagagcacctcccagccaccgtcccctc
aagagcccccagaacgagaccctgtactctgtcttaaaggcctaa

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_3|544_aa
MADLPQLRKRRRCFQLRMRLGPASAPPSGGASAQGVLVPFLGGGKPGLRCEGLAESLRQL
VPPIAFRSSLLLKEETPGISSPETEARISLPKASLKKKEEKATMKNVPSREQEKKRKAQI
NKQAEKKEKEKSSLTNAEFEEIVQIVLQKSLQECLEDEKVEKTQGGHEHRQEDRLKKTVQ
DHSQIRDQQKGEISGFGQCLVWVQCSFPNCGKWRRLCGNIDPSVLPDNWSCDQNTADVQY
NRCDIPEETWTGLESDVAYASYIPGSIIWAKQYGYPWWPGMIESDPDLGEYFLFTSHLDS
LPSKYHVTFFGETVSRAWIPVNMLKNFQELSLELSVMKKRRNDCSQKLGVALMMAQEAEQ
ISIQERVNLFGFWSRFNGSNSNGERKDLQLSGLNSPGSCLEKKEKEEELEKEEGEKTDPI
LPIRKRVKIQTQKTKPRGPKKKFKAPQSKALAASFSEGKEVRTVPKNLGLSACKGACPSS
AKEEPRHREPLTQEAGSVPLEDEASSDLDLEQLMEDVGRELGQSGELQHSNSDGEDFPVA
LFGK

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_3|1635_bp
atggcggatttacctcagctgcggaaacgcagacgctgtttccagttgcgcatgcgcctc
ggccccgcgagcgcaccgccctcgggtggagctagtgctcagggcgtcctcgtgcctttt
cttggtggcgggaaacctgggttgaggtgtgaggggcttgcggagtcgctgcggcagctg
gttccgcccatcgcctttaggtcctccttgctcctgaaggaggagaccccggggatcagt
tccccagagacagaggccaggataagcctgccaaaggccagtttaaagaagaaagaggaa
aaagcaaccatgaagaatgttccaagcagggaacaggagaaaaaaagaaaggcacaaatc
aacaagcaagcagagaagaaagaaaaggaaaaatcaagtcttaccaatgcagaatttgag
gagattgtccagattgttctgcagaagtcccttcaggagtgcttggaagatgagaaggtg
gagaaaactcaaggtggacatgagcacagacaggaagaccgactaaagaaaacagttcag
gatcattctcagatcagggaccagcaaaaaggagagataagtggttttggtcaatgtctg
gtctgggtccagtgttccttcccaaactgtgggaaatggaggcggctgtgtgggaacatt
gacccctcagttctcccagataattggtcctgtgatcagaacacagcagatgtgcagtat
aatcgctgtgatattcctgaggagacctggacagggcttgagagtgatgtggcctatgcc
tcctacatcccaggatccatcatctgggccaagcaatacggttacccctggtggccaggc
atgatagaatctgatcctgacttaggggaatattttctttttacttcccatcttgattcc
ctgccgtctaagtaccatgtgacgttttttggagaaacagtttctcgtgcatggatccca
gtcaacatgctaaagaacttccaggagctgtccctggagctatcagtcatgaaaaagcgc
agaaatgactgcagccagaaactgggggtggccctgatgatggctcaagaggcagaacag
atcagcattcaggaacgggttaacttgtttggtttctggagccgattcaacggatctaac
agtaatggggaaagaaaagacttacagctctctggtttgaacagcccaggatcctgctta
gagaaaaaggagaaagaggaagagttggaaaaggaggaaggagagaaaacagacccaatt
ttgcccattcgtaagcgagtcaaaatacagacccaaaaaaccaagccaagaggccctaag
aaaaaatttaaagctccccagagcaaggccttggcagccagcttttcagagggaaaagaa
gttagaacagtgccaaagaacctgggcctatcagcgtgtaagggggcctgcccctcatct
gcgaaagaagagcccagacaccgggaacccctgacccaggaggctggaagtgtccccctt
gaggacgaagcctccagtgacctggacctggagcaactcatggaagatgttgggagagag
ctggggcagagcggggagctgcagcacagcaacagtgatggcgaggacttccccgtggcg
ctgtttgggaagtag

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_4|686_aa
MAAEKEPFLVPAPPPPLKDESGGGGGPTVPPHQEAASGELRGGTERGPGRCAPSAGSPAA
AVGRESPGAAATSSSGPQAQQHRGGGPQAQSHGEARLSDPPGRAAPPDVGEERRGGGGTE
LGPPAPPRPRNGYQPHRPPGGGGGKRRNSCNVGGGGGGFKHPAFKRRRRVNSDCDSVLPS
NFLLGGNIFDPLNLNSLLDEEVSRTLNAETPKSSPLPAKGRDPVEILIPKDITDPLSLNT
CTDEGHVVLASPLKTGRKRHRHRGQHHQQQQAAGGSESHPVPPTAPLTPLLHGEGASQQP
RHRGQNRDAPQPYELNTAINCRDEVVSPLPSALQGPSGSLSAPPAASVISAPPSSSSRHR
KRRRTSSKSEAGARGGGQGSKEKGRGSWGGRHHHHHPLPAAGFKKQQRKFQYGNYCKYYG
YRNPSCEDGRLRVLKPEWFRGRDVLDLGCNVGHLTLSIACKWGPSRMVGLDIDSRLIHSA
RQNIRHYLSEELRLPPQTLEGDPGAEGEEGTTTVRKRSCFPASLTASRGPIAAPQVPLDG
ADTSVFPNNVVFVTGNYVLDRDDLVEAQTPEYDVVLCLSLTKWVHLNWGDEGLKRMFRRI
YRHLRPGGILVLEPQPWSSYGKRKTLTETIYKNYYRIQLKPEQFSSYLTSPDVGFSSYEL
VATPHNTSKGFQRPVYLFHKARSPSH

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_4|2061_bp
atggcggcggagaaggagccgtttctggtgccggccccgccgccgccgctcaaagatgag
tcgggcggagggggcggccccacggtgccaccgcaccaagaggccgcctctggggagctc
cgcggcgggacggagcgtggtccgggtcgttgcgcgccatctgcggggtccccagccgct
gcggtcggtcgggaaagccccggggccgcggccacctcctccagtggtccccaggcgcag
cagcaccgagggggcggcccccaggcgcagtcgcatggggaggcccgcctgtcggatccc
ccggggcgagccgctcccccggacgtgggggaggagcgccggggagggggcgggacagag
ctgggtccccctgctcctcctcgaccccgcaatggctatcagccccaccggccacctggg
gggggcgggggcaagaggagaaatagctgtaatgtagggggaggcgggggaggcttcaaa
catccggccttcaagaggcgcaggcgggtgaattcggactgtgactctgtgttaccctcc
aacttcctcctggggggcaatatctttgatcccctgaacctgaatagcctcctggatgag
gaagtgagccgcactctcaacgcggagacccctaagtcatccccccttccggccaaaggg
cgagatccggtggagatcctcatccccaaagatattactgacccgctcagtctcaatact
tgcactgatgagggccatgtagttcttgcttcgccactcaagactggtcggaagcggcat
agacaccggggacagcaccaccagcagcagcaggcagccggagggagtgagagtcacccc
gtgccgcccacagcccctctcacccccttactccacggggagggcgcctcacagcagccg
cggcacaggggccagaaccgggatgccccccaaccctatgaactcaacacagccatcaac
tgcagggatgaagtggtgtctccccttccatctgctctgcagggtccctcaggctcccta
tcagcccctccagctgcctcagttatctctgcacccccatcttcctcctcccgacatcgc
aaacgtcgcaggacttccagcaagtcggaggcaggggctaggggtggaggccagggttcc
aaggaaaagggccgagggagttggggaggccgccaccaccaccaccacccactgcctgca
gcaggcttcaaaaagcaacagcgcaagttccagtatgggaattattgcaaatactatggg
taccgcaatccttcctgtgaggatgggcgccttcgggtgttgaagcctgagtggtttcgg
ggccgggacgtcctagatctgggctgcaatgtgggccatctgaccctgagcattgcctgc
aagtggggcccgtcccgcatggtgggcctggatatcgattcccggctcatccattctgcc
cgccaaaacatccgacactacctttccgaggagctgcgtctcccaccccagactttggaa
ggggacccgggggcagagggtgaggaagggaccaccaccgttcgaaagaggagctgcttc
ccagcctcgctgactgccagccggggtcccatcgctgccccccaagtgcccttggatgga
gcggacacatcagtcttccccaacaatgttgtcttcgtcacgggtaattatgtgctggat
cgagatgacctggtggaggcccaaacacctgagtatgatgtggtgctctgcctcagcctc
accaagtgggtgcatctgaactggggagacgagggcctgaagcgcatgtttcgccggatc
taccggcacctacgccctgggggcatcctggtcctagagccccaaccctggtcgtcgtat
ggcaagagaaagactcttacagaaacgatctacaagaactactaccgaatccaattgaag
ccagagcagttcagttcctacctgacatccccagacgtgggcttctccagctatgagctt
gtggccacaccccacaacacctctaaaggcttccagcgtcctgtgtacctgttccacaag
gcccgatcccccagccactaa
>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_5|253_aa
MMMGCGESELKSADGEEAAAVPGPPPEPQVPQLRAPVPEPGLDLSLSPRPDSPQPRHGSP
GRRKGRAERRGAARQRRQVRFRLTPPSPVRSEPQPAVPQELEMPVLKSSLALGLELRAAA
GSHFDAAKAVEEQLRKSFQIRCGLEESVSEGLNVPRSKRLFRDLVSLQVPEEQVLNAALR
EKLALLPPQARAPHPKEPPGPGPDMTILCDPETLFYESPHLTLDGLPPLRLQLRPRPSED
TFLMHRTLRRWEA

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_5|762_bp
atgatgatgggttgtggggagtcagagctgaagtcggcggacggggaagaagccgcggcg
gtcccggggccacccccggagccccaagtcccgcaactccgagccccagtgcccgagccc
ggcctggacttgagcctgagcccgcggcccgacagccctcagccgcggcacggcagcccc
gggcggcggaaggggcgggcggagcggcggggcgcggctcggcagcggcggcaggtccgc
ttccgcctgacgccgccctccccggtgcggtccgagccgcagcctgcggtgccgcaggag
ctggagatgcccgtgctgaagagcagcctggccttgggcctggagctgcgggccgcagcc
gggagccactttgatgctgcgaaggccgtggaggaacagctgagaaagtcgttccagatc
cgctgcggcctggaggagagcgtgtccgaggggctgaacgtgccgcgctccaagcggctc
ttccgggacctggtgagcctgcaggtgccggaggaacaggttctgaatgccgcgctcagg
gagaaattggctctcctgccgccacaggctcgagccccgcacccaaaggagccacctggg
cctgggccagacatgaccatcttgtgtgacccagaaacgctattttatgaatctccacac
ctgaccctggacggtctgccccctctccgacttcaactccggccccgcccttcagaggac
accttcctcatgcaccggacactgaggcgatgggaagcgtag

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_6|602_aa
MSGGKKKSSFQITSVTTDYEGPGSPGASDPPTPQPPTGPPPRLPNGEPSPDPGGKGTPRN
GSPPPGAPSSRFRVVKLPHGLGEPYRRGRWTCVDVYERDLEPHSFGGLLEGIRGASGGAG
GRSLDSRLELASLGLGAPTPPSGLSQGPTSWLRPPPTSPGPQARSFTGGLGQLVVPSKAK
AEKPPLSASSPQQRPPEPETGESAGTSRAATPLPSLRVEAEAGGSGARTPPLSRRKAVDM
RLRMELGAPEEMGQVPPLDSRPSSPALYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAI
SGHLDSDDDSGSGSLVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQIRELAERNAALEQ
ENGLLRALASPEQLAQLPSSGVFFWRQKIKPTISGHPDSKKHSLKKMEKTLQVVETLRLV
ELPKEAKPKLGESPELADPCVLAKTTEETEVELGQQGQSLLQLPRTAVKSVSTLMVSALQ
SGWQMCSWKPPRDGDTAKVKVSGKTLREVQLCLFPVTAEIVQIATYYILDLLNSDLGIRK
EVAGKQSLSWTVEKPHTNISLSNRLSWTVEKPHTNISLSNPEASMESLSTPRPPNISCVL
IE

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_6|1809_bp
atgagcgggggcaagaagaagagtagtttccaaatcaccagcgtcaccacggactatgag
ggccctgggagcccaggggcttcggatccccctaccccacagcccccaaccgggcccccg
ccccgcctgcccaatggggagcccagccccgatccggggggcaagggcaccccccggaat
ggctccccaccacctggggccccttcctcccgtttccgggtggtgaagctgccccacggc
ctgggagagccttatcgccgcggtcgctggacgtgtgtggatgtttatgagcgagacctg
gagccccacagcttcggcggactcctggagggaattcgaggggcctcagggggcgccggg
ggcagatctttggattccaggttggagctggccagcctcggcctgggcgcccccacccca
ccgtcaggcctgtctcagggccccacctcctggctccgtccaccccccacctctcctgga
cctcaggcccgctccttcactgggggactgggccagctggtggtgcccagcaaagccaag
gcagagaaacccccactgtcggcctcctcaccccagcagcgccccccagagcctgagacc
ggtgagagtgcgggcacatcccgggctgccacgcccctgccctctctgagggtggaagcg
gaggctgggggctcaggggccaggacccctccactgtcccggaggaaagctgtagacatg
cggctgcggatggagttgggtgctccagaagagatggggcaggtgcccccacttgactct
cgccccagctccccagccctctacttcacccacgatgccagcctggttcacaaatctcca
gaccccttcggagcagtagcagctcagaagttcagcctggcccactccatgttggccatc
agtggtcacctagacagcgacgatgatagtggctccggaagcctggttggcattgacaac
aaaatcgagcaagccatggacttggtgaagtcccacctcatgtttgcggtccgggaggag
gtggaggtgctgaaggagcagatccgggaattggcggagcggaacgctgcgctggagcag
gagaatgggctgctgcgcgccctggccagcccggagcagctggctcagctgccctcctcg
ggggttttcttctggaggcaaaaaattaaaccaaccatctcaggacaccctgactccaag
aaacactcattgaagaagatggagaagactctccaggtggttgagactttgaggttggtc
gagctcccaaaagaggctaagcccaagttgggtgagtcccccgagctggcagatccctgc
gtgttggccaagactacagaggagaccgaggtggagctgggccaacagggccaatcccta
ctgcagctgccgaggacggccgtcaagtctgtctccacgctcatggtctctgccctgcag
agcggctggcagatgtgcagctggaagccccccagagatggggatactgccaaggtgaag
gtttcggggaagaccctcagggaggttcagctgtgtttgtttcctgtgactgctgaaatt
gtacagattgccacatactacatcctagacctcctgaattctgatttggggatcaggaag
gaggttgctgggaagcagagcctctcttggactgtggaaaaacctcacaccaacatctcc
ttgtccaatcgcctctcttggactgtggaaaaacctcacaccaacatctccttgtccaat
ccagaggcttccatggaatcactctccaccccaagaccccccaacatctcctgtgtgctc
attgagtaa

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_7|842_aa
MNLLYRKTKLEWRQHKEEEAKRSSSKEVAPAGSAGPAAGQGPGVRVRDIASLRRSLRMGF
MTMPASQEHTPHPCRSAMAPRSLSCHSVGSMDSVGGGPGGASGGLTEDSSTRRPPAKPRR
HPSTKLSMVGPGSGAETPPSKKAGSQKPTPEGRESSRKVPPQKPRRSPNTQLSVSFDESC
PPGPSPRGGNLPLQRLTRGSRVAGDPDVGAQEEPVYIEMVGDVFRGGGRSGGGLAGPPLG
GGGPTPPAGADSDSEESEAIYEEMKYPLPEEAGEGRANGPPPLTATSPPQQPHALPPHAH
RRPASALPSRRDGTPTKTTPCEIPPPFPNLLQHRPPLLAFPQAKSASRTPGDGVSRLPVL
CHSKEPAGSTPAPQVPARERETPPPPPPPPAANLLLLGPSGRARSHSTPLPPQGSGQPRG
ERELPNSHSMICPKAAGAPAAPPAPAALLPGPPKDKAVSYTMVYSAVKVTTHSVLPAGPP
LGAGEPKTEKEISVLHGMLCTSSRPPVPGKTSPHGGAMGAAAGVLHHRGCLASPHSLPDP
TVGPLTPLWTYPATAAGLKRPPAYESLKAGGVLNKGCGVGAPSPMVKIQLQEQGTDGGAF
ASISCAHVIASAGTPEEEEEEVGAATFGAGWALQRKVLYGGRKAKELDTEVEDGARAWNG
SAEGPGKVEREDRGPGTSGIPVRSQGAEGLLARIHHGDRGGSRTALPIPCQTFPACHRNG
DFTGGYRLGRSASTSGVRQVVLHTPRPCSQPRDALSQPHPALPLPLPLPPQPARERDGKL
LEVIERKRCVCKEIKARHRPDRGLCKQESMPILPSWRRGPEPRKSGTPPCRRQHTVLWDT
AI

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_7|2529_bp
atgaacctcctctaccgaaaaaccaagctggagtggaggcagcacaaggaagaggaggcc
aagaggagctccagtaaggaggtggcccccgctggctcggctgggcccgcggccggccag
gggcctggggtccgcgtgcgggacatcgcctcgctgcggcgctccctcaggatgggtttc
atgacgatgcccgcctcccaggagcacaccccgcacccctgccgcagcgccatggcccca
cgctccctctcctgccactcggtgggcagcatggacagtgtcgggggtggccctggcggg
gccagtgggggcctcacagaggacagcagcacccgaagaccccctgccaagccccggaga
caccccagcaccaagctcagcatggtggggcctgggtctggggcagagacgccccccagc
aagaaagcaggctcacagaagccaaccccagagggccgagagtccagccggaaggttcct
ccgcagaagcccaggcgaagccctaacacccagctctctgtctccttcgatgagtcctgc
cccccaggcccctctcctcgaggggggaacctgcctcttcagcgcctcactagggggtcc
cgagtagctggggaccctgatgtgggtgcccaggaagagcctgtgtacattgagatggtg
ggggacgtctttaggggaggaggacgaagtggaggaggcctggctgggccccctcttggg
ggtgggggcccgacccctccagcgggcgccgactcggactctgaagagagtgaggccatc
tatgaagagatgaagtacccgctgccggaagaggctggggaaggccgggccaatggccct
ccaccattgacggcaacatccccgccacaacagcctcacgcccttccgccccatgcccac
cgccgcccagcttcagccctcccgagccggagggacgggacgcccaccaagaccactcct
tgtgaaatccccccgcccttccccaacctccttcagcaccggcctccactcctggccttc
ccccaagccaagtctgcttcccgaacccctggcgatggggtctcaaggctacctgtcctc
tgccactccaaggagccagccggctccaccccagctccccaagtgcctgcacgggagcgg
gagacgcctcccccaccgcctccacctcctgctgccaacctgctgctgctgggaccatcg
ggccgggcccggagccactcgacaccgttgccaccccagggctctggccagccccggggg
gagcgggagctccccaactcccacagcatgatctgccctaaggcggcgggggcgccggca
gccccccctgccccggccgccttgctccccggcccccccaaggacaaggccgtgtcttac
accatggtgtactcggcggtcaaggtgaccacgcactctgtcctgccagctggtccaccc
ctgggtgctggggagccaaagacggagaaggagatctcggtcctccatgggatgctgtgt
accagctcaaggccccctgtgccagggaagaccagcccccacggtggggccatgggcgca
gcagctggggtcctccaccaccgcggctgcctggcctccccccacagccttccggaccca
actgtaggccccctgaccccgctgtggacctacccagccacagcagctgggctcaagaga
ccccctgcctatgagagcctcaaggctgggggggtgctgaataagggctgtggtgtgggg
gccccatcccccatggtcaagatccagctgcaggagcaagggaccgatgggggtgctttt
gccagcatctcctgtgcccacgtcatcgccagcgcagggacaccagaggaggaagaagag
gaggtgggcgccgcgacatttggggcaggctgggccctgcagaggaaggtcctctatgga
gggagaaaagcaaaggagttggacacagaggtcgaggacggtgcccgggcctggaatggc
agtgccgagggtccaggcaaggtggagcgtgaggacaggggccctgggacatcggggatc
ccagtgagaagccagggggcagagggactgctggccaggatccaccatggagaccgagga
gggagccgcaccgcgctgcccattccctgccagaccttccccgcctgccaccgcaatgga
gacttcacgggaggctaccgcctggggcgctccgcctccacctccggagtccggcaggtc
gtgctccacacaccccggccctgcagccagcccagggatgccctgagccagccccacccc
gcgctgccgctgcctctgcccctgccgccccagccggcccgcgagcgtgacgggaagctg
ctggaggtgatcgagcgcaagcgctgcgtgtgcaaggagatcaaggcgcgccaccgcccg
gaccgaggcctctgcaagcaggagagcatgcccatcctccccagctggcggcggggaccc
gagccccgcaagtccggcaccccgccctgccgccggcagcacacggtcctctgggacacc
gccatctga

>gi568815591f:100331426_100533551|GENSCAN_predicted_peptide_8|83_aa
MQEAMRAMKDERDSGQFPANLKFLDFQFQPTYVLMDVILIVIDNIMLICLGLEVSKIPQV
LLRRGCNGVVRYNWQKCKDLSHQ

>gi568815591f:100331426_100533551|GENSCAN_predicted_CDS_8|252_bp
atgcaggaggctatgcgggccatgaaagatgaaagagacagtggccagttcccagccaac
ctcaaattcttagattttcagttccagcccacctatgtccttatggatgtcatcctgatc
gtcattgacaatatcatgctgatttgccttggacttgaagtatcaaagatcccccaggtc
ttgctgagacgtggatgtaatggggttgtgagatataactggcagaaatgcaaggacctg
tcacatcagtag