GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:57:06 Sequence gi568815583f:73898013_74145005 : 246993 bp : 49.92% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 976 971 6 -0.45 1.02 Term - 1487 1372 116 0 2 99 55 66 0.287 3.13 1.01 Init - 3162 3099 64 0 1 74 52 59 0.245 2.00 1.00 Prom - 3238 3199 40 -3.86 2.06 PlyA - 3977 3972 6 1.05 2.05 Term - 9345 9251 95 1 2 110 48 104 0.919 6.59 2.04 Intr - 11518 11424 95 0 2 93 -31 102 0.330 -1.59 2.03 Intr - 13901 13820 82 0 1 98 91 19 0.398 1.90 2.02 Intr - 13981 13951 31 1 1 72 72 37 0.278 -1.80 2.01 Init - 15309 15307 3 0 0 108 81 0 0.242 1.30 2.00 Prom - 17172 17133 40 -3.36 3.00 Prom + 19561 19600 40 -5.96 3.01 Init + 28772 29873 1102 1 1 111 101 2276 0.239 223.75 3.02 Intr + 44842 44950 109 2 1 118 110 91 0.991 13.64 3.03 Intr + 48405 48542 138 0 0 129 81 209 0.999 23.88 3.04 Intr + 49055 49211 157 2 1 120 47 338 0.999 32.91 3.05 Intr + 49795 49890 96 0 0 98 86 192 0.990 20.21 3.06 Intr + 51447 51562 116 2 2 144 63 111 0.618 13.45 3.07 Intr + 56227 56434 208 1 1 5 68 146 0.244 3.38 3.08 Term + 65271 65291 21 2 0 136 48 19 0.782 1.01 3.09 PlyA + 65390 65395 6 1.05 4.00 Prom + 65599 65638 40 -5.26 4.01 Init + 75536 75862 327 1 0 74 45 194 0.955 9.03 4.02 Term + 75915 76196 282 2 0 82 37 181 0.976 7.83 4.03 PlyA + 76534 76539 6 1.05 5.09 PlyA - 80938 80933 6 1.05 5.08 Term - 86118 85925 194 1 2 140 43 279 0.988 26.18 5.07 Intr - 86859 86647 213 1 0 66 55 221 0.925 15.29 5.06 Intr - 87501 87306 196 0 1 70 94 214 0.940 19.19 5.05 Intr - 90790 90587 204 1 0 72 70 421 0.981 38.10 5.04 Intr - 91245 91096 150 0 0 71 100 136 0.797 13.46 5.03 Intr - 92445 92339 107 1 2 112 116 5 0.998 5.63 5.02 Intr - 94200 94079 122 2 2 32 117 193 0.993 16.74 5.01 Init - 94383 94313 71 0 2 79 39 178 0.851 10.52 5.00 Prom - 97923 97884 40 -9.16 6.00 Prom + 98623 98662 40 -6.26 6.01 Init + 99886 99918 33 0 0 54 59 9 0.038 -5.27 6.02 Intr + 99992 100464 473 1 2 110 109 736 0.809 69.78 6.03 Intr + 124816 125396 581 1 2 77 121 1186 0.995 113.14 6.04 Intr + 126845 126915 71 0 2 132 85 69 0.987 9.90 6.05 Intr + 127409 127534 126 1 0 114 63 3 0.651 1.28 6.06 Intr + 134560 134703 144 0 0 80 94 168 0.995 17.08 6.07 Intr + 135144 135402 259 2 1 91 91 146 0.915 12.24 6.08 Intr + 136466 136518 53 0 2 100 109 97 0.991 11.63 6.09 Intr + 137160 137747 588 2 0 29 28 303 0.604 11.22 6.10 Intr + 138616 138679 64 0 1 94 60 54 0.800 1.49 6.11 Intr + 139820 139943 124 0 1 101 50 45 0.863 1.64 6.12 Intr + 142380 142461 82 0 1 68 70 53 0.764 1.14 6.13 Intr + 144977 145127 151 1 1 117 37 199 0.999 17.44 6.14 Intr + 146051 146119 69 0 0 86 101 12 0.727 1.45 6.15 Term + 146209 146996 788 2 2 111 54 1006 0.990 92.99 6.16 PlyA + 148738 148743 6 1.05 7.00 Prom + 149746 149785 40 -7.36 7.01 Init + 154593 154648 56 2 2 77 61 76 0.123 2.88 7.02 Intr + 157967 158063 97 2 1 67 37 113 0.101 4.11 7.03 Intr + 164328 164396 69 2 0 104 82 38 0.257 4.18 7.04 Intr + 164758 164940 183 0 0 96 4 306 0.168 22.98 7.05 Term + 165677 165778 102 1 0 85 47 158 0.658 9.68 7.06 PlyA + 168380 168385 6 1.05 8.33 PlyA - 168983 168978 6 1.05 8.32 Term - 173025 172898 128 1 2 59 49 145 0.995 6.14 8.31 Intr - 173265 173112 154 0 1 57 101 151 0.669 12.95 8.30 Intr - 173458 173355 104 2 2 87 97 141 0.999 14.79 8.29 Intr - 173828 173729 100 2 1 30 64 82 0.666 -0.32 8.28 Intr - 174297 174206 92 1 2 85 101 84 0.998 9.11 8.27 Intr - 174805 174730 76 1 1 117 87 54 0.998 7.29 8.26 Intr - 175045 174929 117 1 0 20 -19 197 0.797 2.86 8.25 Intr - 176204 176124 81 2 0 86 46 163 0.826 11.73 8.24 Intr - 176989 176481 509 2 2 76 68 560 0.567 45.49 8.23 Intr - 177478 177391 88 1 1 94 86 93 0.999 9.24 8.22 Intr - 177778 177671 108 1 0 126 78 31 0.988 6.38 8.21 Intr - 178024 177887 138 1 0 17 94 153 0.846 9.46 8.20 Intr - 179112 179006 107 1 2 86 96 142 0.997 14.73 8.19 Intr - 179290 179209 82 1 1 48 81 83 0.969 2.81 8.18 Intr - 179441 179391 51 2 0 111 105 17 0.930 4.80 8.17 Intr - 179741 179703 39 2 0 119 21 57 0.561 0.52 8.16 Intr - 180677 180597 81 2 0 78 111 76 0.832 8.73 8.15 Intr - 182723 182604 120 2 0 116 76 9 0.901 3.19 8.14 Intr - 190427 190327 101 0 2 -8 115 130 0.731 5.93 8.13 Intr - 191119 191018 102 2 0 123 31 82 0.571 6.15 8.12 Intr - 192194 192103 92 1 2 101 28 -9 0.284 -5.96 8.11 Intr - 193439 193319 121 0 1 114 82 166 0.988 18.15 8.10 Intr - 197767 197686 82 1 1 87 61 67 0.314 3.11 8.09 Intr - 198829 198660 170 2 2 90 50 33 0.358 -0.63 8.08 Intr - 199181 199101 81 0 0 71 92 57 0.617 4.01 8.07 Intr - 199461 199342 120 1 0 -25 102 151 0.675 5.67 8.06 Intr - 199648 199511 138 2 0 59 24 102 0.518 1.34 8.05 Intr - 199819 199780 40 1 1 81 85 74 0.348 4.20 8.04 Intr - 200402 200295 108 2 0 134 -7 85 0.560 4.08 8.03 Intr - 202393 202011 383 2 2 73 94 130 0.428 6.73 8.02 Intr - 209871 209749 123 1 0 80 43 48 0.375 0.06 8.01 Init - 219915 219876 40 0 1 73 100 35 0.357 3.59 8.00 Prom - 228656 228617 40 -4.36 9.00 Prom + 232444 232483 40 -4.76 9.01 Sngl + 234743 236980 2238 1 0 77 46 2829 0.998 269.20 9.02 PlyA + 238771 238776 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 61848 61949 102 2 0 79 94 75 0.826 7.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_1|59_aa MLTLVPKQASEAQLRIPRAVEGSPHTERKHSPGIKWEHGPGQGALGVQPGALNDPEDWI >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_1|180_bp atgctcaccttggtcccgaagcaggcctcggaggcacagctacgcatccccagagctgtg gaaggttcccctcacacagagcggaagcacagccctggcatcaagtgggagcatggccca gggcaaggagctctgggagtgcagcctggggccttgaatgacccggaggactggatttga >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_2|101_aa MRHQPVVELAAGRASKGHYQLRDLAGTTVWELAGARSSRSHASCRGPDETLSFISREARG KGNNHHNSNANPTKDELESSPVKYQEQQLLMARTTELRNSC >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_2|306_bp atgaggcaccagccagtagttgagctggctgcaggtagagcgtccaagggccattaccag ctcagggatctagctgggacaactgtgtgggagttggctggggccaggtccagcaggagt catgccagctgcagaggccctgatgaaactctgtccttcatctctcgtgaagcccgtggc aagggcaacaaccaccacaacagtaatgctaaccccaccaaagatgaacttgagagcagc ccggtgaagtatcaggagcagcagctgctcatggctagaacaactgagctgagaaattca tgttga >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_3|648_aa MALARGSRQLGALVWGACLCVLVHGQQAQPGQGSDPARWRQLIQWENNGQVYSLLNSGSE YVPAGPQRSESSSRVLLAGAPQAQQRRSHGSPRRRQAPSLPLPGRVGSDTVRGQARHPFG FGQVPDNWREVAVGDSTGMARARTSVSQQRHGGSASSVSASAFASTYRQQPSYPQQFPYP QAPFVSQYENYDPASRTYDQGFVYYRPAGGGVGAGAAAVASAGVIYPYQPRARYEEYGGG EELPEYPPQGFYPAPERPYVPPPPPPPDGLDRRYSHSLYSEGTPGFEQAYPDPGPEAAQA HGGDPRLGWYPPYANPPPEAYGPPRALEPPYLPVRSSDTPPPGGERNGAQQGRLSVGSVY RPNQNGRGLPDLVPDPNYVQASTYVQRAHLYSLRCAAEEKCLASTAYAPEATDYDVRVLL RFPQRVKNQGTADFLPNRPRHTWEWHSCHQHYHSMDEFSHYDLLDAATGKKVAEGHKASF CLEDSTCDFGNLKRYACTSHTQGLSPGCYDTYNADIDCQWIDITDVQPGNYILKVHVNPK YIVLESDFTNNVVRCNIHYTGRYVSATNCKIVHRGGFHPKEKRDRERREKTAVTRRDQGS PHRFLADLSGPYQCSFTALPSSDGPSGKEMQLLPPQGAGFPQIQEIIN >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_3|1947_bp atggctctggcccgaggcagccggcagctgggggccctggtgtggggcgcctgcctgtgc gtgctggtgcacgggcagcaggcgcagcccgggcagggctcggaccccgcccgctggcgg cagctgatccagtgggagaacaacgggcaggtgtacagcttgctcaactcgggctcagag tacgtgccggccggacctcagcgctccgagagtagctcccgggtgctgctggccggcgcg ccccaggcccagcagcggcgcagccacgggagcccccggcgtcggcaggcgccgtccctg cccctgccggggcgcgtgggctcggacaccgtgcgcggccaggcgcggcacccattcggc tttggccaggtgcccgacaactggcgcgaggtggccgtcggggacagcacgggcatggcc cgggcccgcacctccgtctcccagcaacggcacgggggctccgcctcctcggtctcggct tcggccttcgccagcacctaccgccagcagccctcctacccgcagcagttcccctacccg caggcgcccttcgtcagccagtacgagaactacgaccccgcgtcgcggacctacgaccag ggtttcgtgtactaccggcccgcgggcggcggcgtgggcgcgggggcggcggccgtggcc tcggcgggggtcatctacccctaccagccccgggcgcgctacgaggagtacggcggcggc gaagagctgcccgagtacccgcctcagggcttctacccggcccccgagaggccctacgtg ccgccgccgccgccgccccccgacggcctggaccgccgctactcgcacagtctgtacagc gagggcacccccggcttcgagcaggcctaccctgaccccggtcccgaggcggcgcaggcc catggcggagacccacgcctgggctggtacccgccctacgccaacccgccgcccgaggcg tacgggccgccgcgcgcgctggagccgccctacctgccggtgcgcagctccgacacgccc ccgccgggtggggagcggaacggcgcgcagcagggccgcctcagcgtgggcagcgtgtac cggcccaaccagaacggccgcggtctccctgacttggtcccagaccccaactatgtgcaa gcatccacttatgtgcagagagcccacctgtactccctgcgctgtgctgcggaggagaag tgtctggccagcacagcctatgcccctgaggccaccgactacgatgtgcgggtgctactg cgcttcccccagcgcgtgaagaaccagggcacagcagacttcctccccaaccggccacgg cacacctgggagtggcacagctgccaccagcattaccacagcatggacgagttcagccac tacgacctactggatgcagccacaggcaagaaggtggccgagggccacaaggccagtttc tgcctggaggacagcacctgtgacttcggcaacctcaagcgctatgcatgcacctctcat acccagggcctgagcccaggctgctatgacacctacaatgcggacatcgactgccagtgg atcgacataaccgacgtgcagcctgggaactacatcctcaaggtgcacgtgaacccaaag tatattgttttggagtctgacttcaccaacaacgtggtgagatgcaacattcactacaca ggtcgctacgtttctgcaacaaactgcaaaattgtccacagaggaggctttcacccaaag gagaagagagatagagagagaagagagaagacagccgtgacgcgcagggaccagggcagc cctcaccgcttcctggctgacctgtcagggccctatcagtgcagcttcacggcactgccc agcagtgacggcccctcgggaaaggagatgcagctgctcccacctcagggggcagggttt cctcagatccaggagataatcaactaa >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_4|202_aa MAHCSLDLLLVRGDSAPAALALGASSALVPTLAALEEPFSPLLHCGSPFLGWPRLEPAPL ACGEVWRESRGREPGLRAALAGQREFRVGVGSAGPALGAAGRPARKPRAALAAFPRGRAQ DLQPAMPEPPPSSVGSCAAQASAMSAAPCSMEPNPIDHPRTEECRHTARDWQAVPPAAPV WDPLGEASWAPESVGDLENLYV >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_4|609_bp atggctcactgcagcctcgacctcctcctggtgagaggtgacagcgcgcctgcagccctc gctctcggcgcctcctcggccttggtgcccactctggccgcgcttgaggagcccttcagc ccactgctgcactgtgggagcccctttctgggctggccaaggctggagccggctccctta gcttgcggggaggtgtggagggagagccgcgggcgggaaccggggctgcgcgcggcactt gctggccagcgcgagttccgggtgggcgtgggctccgcgggccccgcactcggtgcggcc ggccggccggcccgcaagccccgggcagccttagctgccttcccgcggggcagggctcaa gacctgcagcccgccatgcctgagcctccccccagctccgtgggctcctgtgcggcccaa gcctccgcgatgagcgccgccccctgctccatggagcccaatcccatcgaccacccaaga actgaggagtgcaggcacacggcgcgggactggcaggcagttccacctgcagcccccgtg tgggatccactgggtgaagccagctgggctcctgagtctgttggggacttggagaacctt tatgtctag >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_5|418_aa MRRRKRAPAGLLFLRPRRRGWPRGSGYRALPLGDFDRFQQSSFGFLGSQKGCLSPERGGV GTGADVPQSWPSCLCHGLISFLGFLLLLVTFPISGWFALKIVPTYERMIVFRLGRIRTPQ GPGMVLLLPFIDSFQRVDLRTRAFNVPPCKLASKDGAVLSVGADVQFRIWDPVLSVMTVK DLNTATRMTAQNAMTKALLKRPLREIQMEKLKISDQLLLEINDVTRAWGLEVDRVELAVE AVLQPPQDSPAGPNLDSTLQQLALHFLGGSMNSMAGGAPSPGPADTVEMVSEVEPPAPQV GARSSPKQPLAEGLLTALQPFLSEALVSQVGACYQFNVVLPSGTQSAYFLDLTTGRGRVG HGVPDGIPDVVVEMAEADLRALLCRELRPLGAYMSGRLKVKGDLAMAMKLEAVLRALK >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_5|1257_bp atgcgacggcggaagcgggccccggccggcctcctcttcctgcgcccgcgccgccgcggg tggccgcgcgggtctgggtaccgggcgctgcccctgggtgattttgaccgcttccagcag tcgagcttcggctttctgggctcgcagaagggctgcttgtccccggagcggggcggcgtg gggacaggggccgatgtaccccagagctggccctcctgcctctgtcatggcctcatcagt ttcctggggttcttgctgctgttggtcaccttccccatttctggctggtttgccctgaag attgtgcccacctacgagcggatgattgtgttccgcctgggccggatccgcaccccccag ggacctggcatggttctgctcttgcccttcattgactcctttcagagggtggatctgagg acacgagccttcaacgtccctccctgcaagctggcctctaaggacggggctgtgctgtcc gtgggagccgatgtccagtttcgcatctgggacccggtgctgtcggtgatgactgtgaaa gacctgaacacagccacacgcatgacagcccagaacgccatgaccaaggccctgctcaag aggccgctgcgggagatccagatggagaagctcaagatcagcgaccagcttctgctggag atcaacgatgtgaccagggcctgggggctggaggtagaccgcgtggagctggcagtggag gccgtgctccagccgccccaggacagcccagctgggcccaacctggacagcaccctccag cagctggccctgcacttcctgggaggaagcatgaactcaatggcaggaggtgccccgtcc ccggggccagcagacaccgtggagatggtgagtgaagttgagccacctgcccctcaagtt ggtgccaggtccagtccgaagcagcctctggcggaggggctactgactgctctacagccc ttcctgtctgaggccctggtcagccaagtcggggcctgctaccagttcaatgtcgtcctg cccagcggcacccaaagcgcctacttcctggacctcactacaggacgaggaagagtggga cacggggtgcctgatggcatccctgatgtggtggtggagatggccgaggcagacctgcgg gccctgctatgcagagagctgcggcccctgggggcctacatgagtggacggctgaaggtg aagggcgacctggctatggccatgaagctggaggctgtcctcagggccttgaagtag >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_6|1201_aa MEYEVLLQGPARAPASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQ APWPLGADTPALDNVFFESLQRRLSVYRQIVDAQAVCTRCKESADFWCFECEQLLCAKCF EAHQWFLKHEARPLAELRNQSVREFLDGTRKTNNIFCSNPNHRTPTLTSIYCRGCSKPLC CSCALLDSSHSELKCDISAEIQQRQEELDAMTQALQEQDSAFGAVHAQMHAAVGQLGRAR AETEELIRERVRQVVAHVRAQERELLEAVDARYQRDYEEMASRLGRLDAVLQRIRTGSAL VQRMKCYASDQEVLDMHGFLRQALCRLRQEEPQSLQAAVRTDGFDEFKVRLQDLSSCITQ GKDAAVSKKASPEAASTPRDPIDVDLMVLGKERGMVPGVRFRSQLPGFCWCDSRATIHVT KQTVSGFKPEEAERVKAQVQALGLAEAQPMAVVQSVPGAHPVPVYAFSIKGPSYGEDVSN TTTAQKRKCSQTQCPRKVIKMESEEGKEARLARSSPEQPRPSTSKAVSPPHLDGPPSPRS PVIGSEVFLPNSNHVASGAGEAEERVVVISSSEDSDAENSCMEPMETAEPQSSPAHSSPA HSSPAHSSPVQSLLRAQGASSLPCGTYHPPAWPPHQPAEQAATPDAEPHSEPPDHQERPA VHRGIRYLLYRAQRAIRLRHALRLHPQLHRAPIRTWSPHVVQASTPAITGPLNHPANAQE HPAQLQRGISPPHRIRGAVRSRSRSLRGSSHLSQWLNNFFALPFSSMASQLDMSSVGGPH APQLIPDTHLPTLIEGRRALSHAPDFSYCPGVADTQHSHLPARPAWPVTFQQLRMDAHSK SKSFPVTVPLYAPPTPSYPNLRNKAKSSRELDDSSSESSDLQLEGPSTLRVLDENLADPQ AEDRPLVFFDLKIDNERSELHLGADVWLLPADPKVRSLDAQKISQLAAVNRESKFRVVIQ PEAFFSIYSKAVSLEVGLQHFLSFLSSMRRPILACYKLWGPGLPNFFRALEDINRLWEFQ EAISGFLAALPLIRERVPGASSFKLKNLAQTYLARNMSERSAMAAVLAMRDLCRLLEVSP GPQLAQHVYPFSSLQCFASLQPLVQAAVLPRAEARLLALHNVSFMELLSAHRRDRQGGLK KYSRYLSLQTTTLPPAQPAFNLQALGTYFEGLLEGPALARAEGVSTPLAGRGLAERASQQ S >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_6|3606_bp atggaatatgaggtcctactccagggtccagctcgagcccccgcttcggaggaggagttc cagtttctgcgctgccagcaatgccaggcggaagccaagtgcccgaagctgctgccttgt ctgcacacgctgtgctcaggatgcctggaggcgtcgggcatgcagtgccccatctgccag gcgccctggcccctaggtgcagacacacccgccctggataacgtctttttcgagagtctg cagcggcgcctgtcggtgtaccggcagattgtggatgcgcaggctgtgtgcacccgctgc aaagagtcggccgacttctggtgctttgagtgcgagcagctcctctgcgccaagtgcttc gaggcacaccagtggttcctcaagcacgaggcccggcccctagcagagctgcgcaaccag tcggtgcgtgagttcctggacggcacccgcaagaccaacaacatcttctgctccaacccc aaccaccgcacccctacgctgaccagcatctactgccgaggatgttccaagccgctgtgc tgctcgtgcgcgctccttgacagcagccacagtgagctcaagtgcgacatcagcgcagag atccagcagcgacaggaggagctggacgccatgacgcaggcgctgcaggagcaggatagt gcctttggcgcggttcacgcgcagatgcacgcggccgtcggccagctgggccgcgcgcgt gccgagaccgaggagctgatccgcgagcgcgtgcgccaggtggtagctcacgtgcgggct caggagcgcgagctgctggaggctgtggacgcgcggtaccagcgcgactacgaggagatg gccagtcggctgggccgcctggatgctgtgctgcagcgcatccgcacgggcagcgcgctg gtgcagaggatgaagtgctacgcctcggaccaggaggtgctggacatgcacggtttcctg cgccaggcgctctgccgcctgcgccaggaggagccccagagcctgcaagctgccgtgcgc accgatggcttcgacgagttcaaggtgcgcctgcaggacctcagctcttgcatcacccag gggaaagatgcagctgtatccaagaaagccagcccagaggctgccagcactcccagggac cctattgacgttgacctgatggttcttggaaaggaacggggaatggtgccaggagtcaga tttcgatcccagttgcccgggttctgctggtgtgattccagggccaccatacatgtaaca aagcagactgtctcagggttcaagcccgaggaggcagagagagtgaaggcccaggttcag gccctggggctggctgaagcccagcctatggctgtggtacagtcagtgcccggggcacac cccgtgccagtgtacgccttctccatcaaaggcccttcctatggagaggatgtctccaat acaacgacagcccagaagaggaagtgcagccagacccagtgccccaggaaggtcatcaag atggagtctgaggaggggaaggaggcaaggttggctcggagctccccggagcagcccagg cccagcacctccaaggcagtctcaccaccccacctggatggaccgcctagccccaggagc cccgtcataggaagtgaggtcttcctgcccaacagcaaccacgtggccagtggcgccggg gaggcagaggaacgcgttgtggtgatcagcagctcggaagactcagatgccgaaaactcg tgcatggagcccatggagaccgccgagccacagtcctcgccagcccactcctcgccagcc cactcctcgccagcccactcctcgccagtccagtctctgctgagagcacaaggagcctcc agcctgccctgtggcacataccaccccccagcttggcctccccaccagcccgctgagcag gctgccacccccgatgctgagcctcacagcgagcctcctgatcaccaggagcgccctgcc gtccaccgtgggatccgctacctgttgtacagagcacagagagccatccgccttcgccat gccctccgcttgcaccctcaattgcatcgggcccctattcggacttggtctccccatgtg gtccaagccagcactcctgccatcacagggcccctcaaccatcctgccaatgcccaggaa catcctgcccagctgcaaaggggcatcagcccaccccaccggatacgaggggctgtgcga tcccgcagccgctccctccggggctcctcccatttatcccagtggctcaacaactttttt gccctccccttctcctccatggcttcccagcttgacatgtcttccgtgggtgggccccat gctccccagctcatcccagacacccacctgcccaccctcattgagggcagacgtgctctc agccacgcccctgacttcagttactgcccaggggtagctgatacccaacactcgcacctg cctgccagaccagcatggccagtgaccttccaacagctccgcatggatgcccatagcaaa tctaagtcttttccagtgactgtccctctctatgctcccccgacacccagttatcctaat ttgcggaataaggccaagtcctcccgagagctggatgacagcagcagtgagtccagtgac ctccagctggaaggccccagcaccctcagggtcctggacgagaaccttgctgacccccaa gcagaagacagacctctggttttctttgacctcaagattgacaatgaaagatcagagctg catttgggggctgatgtgtggctactgccagcagaccccaaggtgaggtctctagatgcc cagaagattagccagctggctgcggtgaaccgggaaagcaagttccgcgtggtcatccag cctgaagccttcttcagcatctactccaaggccgtgtccctggaggtggggctgcagcac ttcctcagctttctgagctccatgcgccgccctatcttggcctgctacaagctgtggggg cctggcctcccaaacttcttccgggccctggaggacattaacaggctgtgggaattccag gaggccatctcgggcttcctggctgccctgcctctcatccgggagcgtgtgcccggggcc agcagcttcaaactcaagaacctggcccagacctacctggcgagaaacatgagcgagcgc agcgccatggctgccgtgctggccatgcgtgacctgtgccgcctcctcgaggtctccccg ggcccccagctggcccagcatgtctaccccttcagtagcctgcagtgctttgcctccctg cagcccctggtgcaggcagctgtgctgccccgggctgaggcccgcctcctggccctacac aacgtgagcttcatggagctgctgagtgcacaccgccgtgaccggcaggggggcctgaag aagtacagccgctatctaagcctgcagaccaccacgttgccccctgcccagcctgctttc aacctgcaggctctgggcacctactttgaaggcctgttggagggtccggcgctggcacgg gcagaaggagtctccaccccacttgctggccgtggcttggcagagagggcctcccagcag agctga >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_7|168_aa MVRRPSTGSGQRAPPAPRKTLFKVPSWKQEQPLRDYKPAGPLILDFSDTKTVAVIPGGSG GGNVGVWASQDKGNASKAEENVSDSFMHSMDPQLEQQMETTQSLVDSYVTIVNKTVWDLM VGLTPKTIMHLMINNTKEFIFSELLANLYLHGDKNMLMEESAEQAQRS >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_7|507_bp atggtccgcaggcccagcactgggagcggccagcgggccccgccagccccccgcaagaca ctgttcaaggtgccatcttggaagcaggagcagcccttgcgagactacaagcctgctggc cccttgatcttggacttctcagacaccaaaactgtggctgtgatcccaggtggcagcggt ggcggcaatgttggggtgtgggcctcccaggacaaggggaatgccagcaaggctgaggag aatgtctccgacagcttcatgcactccatggacccacagctggagcagcaaatggagacc actcagagcctggtggactcctatgtgaccattgtcaacaagaccgtgtgggacctcatg gttggtctcacgcccaagaccatcatgcacctcatgatcaacaacaccaaggagtttatc ttctcggagctgctggccaacctgtacttgcatggggacaagaacatgctgatggaggag tcggcagagcaggcacagcggtcatga >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_8|1291_aa MRPGFVVSVGMVEGVPGEPDPYPVLCYTPWPGHDLPASRAALAQGRSWSLHRMIGQDSGR RRSRRQHFAPGTSSGLRSAPGLTRAGPAPPEAVSPSHVIVDSADLAGPEKEIPGPWLPRA MYEAPGVKRAWAAGAGMRGRQWLRKRVEVVCTGRSANTVCAGVRAAGLVEKSPPPSLSRM GRRFRFCGDLDCPDRVLAEISTLAKMVECTGSTLGGGGYKKILKLTADAKFVEEHEGKKD IDSPTFLSSASSWWSSSSAYGSRRPGGFELKLIGQQGESGDVKATVAVLSFILSGAAKHS VDGKSLASELQQLGLPKEHAASPCCCYEEKQSPLQKHLRVCSLRTWSLAARVAEGTAETV DPSAAPKTSVWSSRVCWGRLLATISPSVNETDTCGMEDRRLGVQALACTVPHDGLGWRHP EEGGTHSGGSPLEHLEPKKLSLTFAIPGTRPLIMMKVDPNPLRGLKEQRFLAEHDVLAHF LDYPQKVYRTTWGHHICLPAPSAVLQALAVAIQLGGHLADPLLQVDPLSSCGAEVQRNLV PAAVNPYVMIFLPALFRVLVLAAVFGQLKEYQQRKSPGIPAGAKTKKKKTDSSPETTTSG GCHSPGDSQYQELAVALESSSVTISQLNENIESLKQQKKQVEHQLEEAKKTNNEIHKAQM ERLETINILTLEKADLKTTLYHTKRAARHFEEESKDLAGRLQYSLQRIQELERALCAVST QQQEEDRGHCLSSPDQNFSLFTIQSSSCREAVLQRWLQQTIKERALLNAHVTQVTESLKQ VQLERDEYAKHIKGERARWQERMWKMSVEARTLKEEKKRDIHRIQELERSLSELKNQMAE PPSLAPPAVTSVVEQLQDEAKHLRQEVEGLEGKLQSQVENNQALSLLSKEQKQRLQEQEE MLREQEAQRVREQERLCEQNERLREQQKTLQEQGERLRKQEQRLRKQEERLRKEEERLQK QEKRLWDQEERLWKKEERLQKQEERLALSQNHKLDKQLAEPQCSFEDLNNEKKSALQLEQ QVKELQEKLDEVKEMQYMATYQQLTSEKEALHRQLLLQTQFVDQLQQQEAWGKAEHLEAA SHQNQQLETQLSLVALPGEGDGGQHLDSEEEEAPRPTPNIPEDLESREATEPEAEAPAPG SGGEFVCGESYRALKEAMVKLKGRESFTVYESQGAVPNTRHQEMEDVIRLAQKEEEMKVK LLELQELVLPLVGNHEGHGKFLIAAQNPADEPTPGAPAPQELGAAGEQDVFYEVSLDNNV EPAPGAAREGSPHDNPTVQQIVQLSPVMQDT >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_8|3876_bp atgagaccaggcttcgtggtgtcagtgggaatggtggaaggtgtacctggagagcctgat ccatacccagtgctctgctacaccccatggcctggccacgacctccctgcctccagagca gccctggcccagggaagaagctggtctctgcatcgaatgatagggcaggattctgggaga cggaggagcagacgccagcatttcgcccccggcacctccagcgggctgcggtccgctccc ggcctcactcgagcaggtcccgccccgccagaggccgttagtccaagtcacgtgatcgtc gactcagctgacctggcgggaccggaaaaagaaattcccgggccctggcttcctcgcgcg atgtatgaggcaccaggggtgaagcgggcatgggctgctggagcgggaatgagggggcgc cagtggctccggaaacgggttgaggttgtctgcactggccgctccgcaaacacagtgtgt gcgggcgtcagggcagcaggtctggtagagaaaagtccaccgccatccctctcccgaatg gggcggaggtttcggttctgtggtgatctggactgtcctgaccgggtcctggcagagatc agcacgctggccaagatggttgagtgcacagggtctacactgggtggaggagggtataag aagatcctgaaactcacagctgatgccaagtttgtggaggagcatgagggaaagaaagac attgacagtcccaccttcctgtcctctgccagctcctggtggagcagtagcagtgcctat ggctccaggaggcctgggggctttgagctgaagttaatagggcaacagggagagtcaggc gatgtgaaggccacagtggcagtgctgagtttcatcctctccggtgcggccaagcacagt gtcgatggcaaatccttggccagtgaactgcagcagctggggctgcccaaagagcacgcg gccagcccgtgctgctgttatgaggagaagcaaagccccttgcagaagcacttgcgggtc tgcagcctacgcacgtggtccctggcagcacgagtggcagaagggacagcagagactgtg gacccctcagctgcacctaagacctctgtgtggtctagccgtgtgtgctgggggaggctt ttggccaccatttccccctctgtaaatgagaccgatacctgtggcatggaggacaggagg ctgggtgtgcaggccctggcgtgcacagtgccacatgacggcctgggctggcgccaccct gaagaaggtggcacccactctggcgggtctcccttggagcacctggagcccaagaagctg tccctcacctttgcaatcccagggactcgccccctgatcatgatgaaggtggacccaaac ccgctgaggggcctgaaggagcaaaggttcttggctgagcatgatgttcttgcccatttc cttgattatcctcaaaaggtttatagaaccacttggggacatcacatctgtctaccagcc ccctctgctgtcctgcaggctctggcggtggccatccagcttggtggccatctggctgat ccactcctccaggtggaccctctgtcctcatgtggtgcagaagtacagaggaacctggtg ccggccgcagtcaacccctatgtgatgatctttctacccgccctctttcgtgtcttggtg ctggctgctgtctttggccagctaaaagaatatcagcaaaggaagagccctggtattcca gcaggagcaaagacaaaaaagaaaaaaactgacagtagccctgagacaaccacttccggt ggttgccactcacctggggatagccagtaccaagaactagcagtagccctggagtcaagc tcagtgacaatcagtcaactcaatgaaaacatagaatcattgaaacagcagaagaaacaa gtggaacatcagctggaagaagcaaagaaaacaaacaatgaaatacacaaagcacaaatg gagcggttagagacaatcaacatcctcacattggaaaaggcagacttgaagaccaccctt taccatactaaacgtgctgcccgacacttcgaagaagagtccaaggatctggctggccgc ctgcaatactccttacagcgtattcaagaattggagcgggctctctgtgctgtgtctaca cagcagcaggaagaggacagggggcactgcctgagctccccagatcaaaacttctcactc ttcaccatccagtcctcgagctgcagagaagcggtcctccagcggtggttacagcagacc ataaaggagcgggcgctgctgaacgcacacgtgacacaggtgacagagtcactaaaacaa gtccagctagagcgagacgaatatgctaaacacataaaaggagagagggcccggtggcag gagaggatgtggaaaatgtcggtggaggctcgaacattgaaggaagagaagaagcgtgac atacatcggatacaggagctggagaggagcttgtccgaactcaaaaaccagatggctgag cccccatccctggcgcccccagcagtgacctctgtggtggaacagctacaagatgaggcc aaacacctgaggcaggaggtggaaggtctggagggaaagctccagtcccaggtggaaaac aatcaggccttgagtctccttagcaaggaacaaaagcagagactccaggagcaggaggag atgctccgagagcaggaggcgcagagagtgcgggagcaggagagactgtgtgaacaaaac gagaggcttcgggagcagcagaagacgctacaggagcagggtgagaggctgcgaaagcag gagcagaggctacgcaaacaggaggagaggctgcgaaaggaggaggagaggctgcaaaag caggaaaagaggctgtgggaccaggaggagaggctgtggaagaaggaggagaggctacaa aagcaggaggagaggctcgcgctctcccagaaccacaagctcgacaagcagctggccgag ccacagtgcagcttcgaggatctgaacaacgagaaaaagagcgcactgcagttggagcag caagtaaaggagctgcaggagaagctagacgaggtgaaggagatgcagtacatggccacc tatcagcagctgacctctgagaaggaggcgctgcacaggcagttactgctgcagacccag ttcgtggaccagctgcagcagcaggaagcttggggcaaagcggagcacctagaagctgcc agccaccagaaccaacagctagagacccagctaagcctcgtggctctccctggagaagga gatggaggacaacatctggacagtgaggaggaggaggcgcctcggcccacgccaaacatc ccagaggacctggagagccgggaggccacggagccagaggcagaggccccagccccaggg agtgggggtgagtttgtgtgtggggagagctaccgggccctgaaggaggccatggtgaag ctgaaagggagagagtccttcaccgtatatgaaagccagggggcagtgccaaacacgcgg caccaggagatggaggatgtcatcaggctggcccagaaggaggaggagatgaaggtgaag ctgctggagctgcaagagttggtgttgccccttgtgggcaaccatgaggggcatggcaaa ttcctcatcgctgcccagaaccctgctgatgagcccactccaggggccccagccccccag gaacttggggctgccggtgagcaggatgttttttatgaagtgagcctggacaacaacgtg gagcctgcaccaggagcggccagggagggttctccccatgacaaccccactgtacagcag atcgtgcagctgtctcctgtcatgcaggacacctag >gi568815583f:73898013_74145005|GENSCAN_predicted_peptide_9|745_aa MFPLRALWLVWALLGVAGSCPEPCACVDKYAHQFADCAYKELREVPEGLPANVTTLSLSA NKITVLRRGAFADVTQVTSLWLAHNEVRTVEPGALAVLSQLKNLDLSHNFISSFPWSDLR NLSALQLLKMNHNRLGSLPRDALGALPDLRSLRINNNRLRTLAPGTFDALSALSHLQLYH NPFHCGCGLVWLQAWAASTRVSLPEPDSIACASPPALQGVPVYRLPALPCAPPSVHLSAE PPLEAPGTPLRAGLAFVLHCIADGHPTPRLQWQLQIPGGTVVLEPPVLSGEDDGVGAEEG EGEGDGDLLTQTQAQTPTPAPAWPAPPATPRFLALANGSLLVPLLSAKEAGVYTCRAHNE LGANSTSIRVAVAATGPPKHAPGAGGEPDGQAPTSERKSTAKGRGNSVLPSKPEGKIKGQ GLAKVSILGETETEPEEDTSEGEEAEDQILADPAEEQRCGNGDPSRYVSNHAFNQSAELK PHVFELGVIALDVAEREARVQLTPLAARWGPGPGGAGGAPRPGRRPLRLLYLCPAGGGAA VQWSRVEEGVNAYWFRGLRPGTNYSVCLALAGEACHVQVVFSTKKELPSLLVIVAVSVFL LVLATVPLLGAACCHLLAKHPGKPYRLILRPQAPDPMEKRIAADFDPRASYLESEKSYPA GGEAGGEEPEDVQGEGLDEDAEQGDPSGDLQREESLAACSLVESQSKANQEEFEAGSEYS DRLPLGAEAVNIAQEINGNYRQTAG >gi568815583f:73898013_74145005|GENSCAN_predicted_CDS_9|2238_bp atgttcccccttcgggccctgtggttggtctgggcgcttctaggagtggccggatcatgc ccggagccgtgcgcctgcgtggacaagtacgctcaccagttcgcggactgcgcttacaaa gagttgcgtgaggtgccggaaggactgcctgccaacgtgacgacgcttagtctgtccgcg aacaagatcactgtgctgcggcgcggggccttcgccgacgtcacacaggtcacgtcgctg tggctggcgcacaatgaggtgcgcaccgtggagccaggcgcactggccgtgctgagtcag ctcaagaacctcgatctgagccacaacttcatatccagctttccgtggagcgacctgcgc aacctgagcgcgctgcagctgctcaaaatgaaccacaaccgcctgggctctctgccccgg gacgcactcggtgcgctacccgacctgcgttccctgcgcatcaacaacaaccggctgcgt acgctggcgcctggcaccttcgacgcgcttagcgcgctgtcacacttgcaactctatcac aatcccttccactgcggctgcggccttgtgtggctgcaggcctgggccgcgagcacccgg gtgtccttacccgagcccgactccattgcttgtgcctcgcctcccgcgctgcagggggtg ccggtgtaccgcctgcccgccctgccctgtgcaccgcccagcgtgcatctgagtgccgag ccaccgcttgaagcacccggcaccccactgcgcgcaggactggcgttcgtgttacactgc atcgccgacggccaccctacgcctcgcctgcaatggcaacttcagatccccggtggcacc gtagtcttagagccaccggttctgagcggggaggacgacggggttggggcggaggaagga gagggagaaggagatggggatttgctgacgcagacccaagcccaaacgccgactccagca cccgcttggccggcgcccccagccacaccgcgcttcctggccctcgcaaatggctccctg ttggtgcccctcctgagtgccaaggaggcgggcgtctacacttgccgtgcacacaatgag ctgggcgccaactctacgtcaatacgcgtggcggtggcagcaaccgggcccccaaaacac gcgcctggcgccgggggagaacccgacggacaggccccgacctctgagcgcaagtccaca gccaagggccggggcaacagcgtcctgccttccaaacccgagggcaaaatcaaaggccaa ggcctggccaaggtcagcattctcggggagaccgagacggagccggaggaggacacaagt gagggagaggaggccgaagaccagatcctcgcggacccggcggaggagcagcgctgtggc aacggggacccctctcggtacgtttctaaccacgcgttcaaccagagcgcagagctcaag ccgcacgtcttcgagctgggcgtcatcgcgctggatgtggcggagcgcgaggcgcgggtg cagctgactccgctggctgcgcgctggggccctgggcccggcggggctggcggagccccg cgacccgggcggcgacccctgcgcctactctatctgtgtccagcggggggcggcgcggca gtgcagtggtcccgcgtagaggaaggcgtcaacgcctactggttccgcggcctgcggccg ggtaccaactactccgtgtgcctggcgctggcgggcgaagcctgccacgtgcaagtggtg ttttccaccaagaaggagctcccatcgctgctggtcatagtggcagtgagcgtattcctc ctggtgctggccacagtgccccttctgggcgccgcctgctgccatctgctggctaaacac ccgggcaagccctaccgtctgatcctgcggcctcaggcccctgaccctatggagaagcgc atcgccgcagacttcgacccgcgtgcttcgtacctcgagtccgagaaaagctacccggca ggcggcgaggcgggcggcgaggagccagaggacgtgcagggggagggccttgatgaagac gcggagcagggagacccaagtggggacctgcagagagaggagagcctggcggcctgctca ctggtggagtcccagtccaaggccaaccaagaggagttcgaggcgggctctgagtacagc gatcggctgcccctgggcgccgaggcggtcaacatcgcccaggagattaatggcaactac aggcagacggcaggctga