GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:44:16 Sequence gi568815597r:151304805_151542200 : 237396 bp : 45.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 1559 1332 228 2 0 78 78 386 0.572 34.67 1.04 Intr - 2997 2770 228 0 0 36 71 284 0.970 19.57 1.03 Intr - 5451 5407 45 0 0 86 94 28 0.676 1.81 1.02 Intr - 11705 10769 937 1 1 112 83 989 0.640 91.99 1.01 Init - 12224 12214 11 2 2 92 31 5 0.274 -4.78 1.00 Prom - 14001 13962 40 -4.96 2.13 PlyA - 16561 16556 6 1.05 2.12 Term - 21569 21372 198 2 0 60 50 94 0.815 0.20 2.11 Intr - 22634 22467 168 2 0 14 99 166 0.384 10.44 2.10 Intr - 38374 37481 894 1 0 92 59 160 0.118 5.01 2.09 Intr - 38638 38538 101 2 2 88 110 112 0.518 13.23 2.08 Intr - 39078 38877 202 0 1 52 70 139 0.544 7.36 2.07 Intr - 39474 39351 124 2 1 14 98 -7 0.211 -6.51 2.06 Intr - 39732 39613 120 2 0 93 7 141 0.543 6.11 2.05 Intr - 40043 39924 120 1 0 85 72 134 0.620 11.11 2.04 Intr - 40384 40302 83 1 2 97 100 72 0.997 7.74 2.03 Intr - 41157 41124 34 2 1 115 119 30 0.997 7.03 2.02 Intr - 41529 41401 129 2 0 86 88 45 0.300 4.11 2.01 Init - 50611 50577 35 1 2 114 53 27 0.163 -0.54 2.00 Prom - 51867 51828 40 -4.26 3.12 PlyA - 52676 52671 6 1.05 3.11 Term - 59901 59739 163 2 1 102 43 190 0.999 13.31 3.10 Intr - 60240 60122 119 0 2 73 94 138 0.999 12.16 3.09 Intr - 60477 60385 93 0 0 70 116 61 0.819 7.26 3.08 Intr - 60880 60759 122 2 2 90 49 232 0.659 19.71 3.07 Intr - 61042 60964 79 1 1 79 109 104 0.999 10.72 3.06 Intr - 61649 61471 179 0 2 81 46 233 0.994 18.04 3.05 Intr - 62100 61918 183 1 0 113 85 195 0.999 21.46 3.04 Intr - 63515 63395 121 2 1 129 77 147 0.899 17.77 3.03 Intr - 64385 64236 150 2 0 79 10 225 0.898 14.16 3.02 Intr - 64750 64638 113 2 2 105 66 136 0.999 13.20 3.01 Init - 65095 64909 187 1 1 75 86 154 0.957 11.13 3.00 Prom - 88959 88920 40 -1.86 4.00 Prom + 93125 93164 40 -4.06 4.01 Init + 94784 94923 140 1 2 62 100 77 0.975 3.92 4.02 Intr + 95177 95383 207 2 0 111 78 253 0.998 24.79 4.03 Intr + 95638 95784 147 1 0 83 12 130 0.949 4.25 4.04 Intr + 95960 96041 82 2 1 49 111 99 0.999 7.94 4.05 Intr + 96435 96551 117 2 0 65 76 212 0.999 18.36 4.06 Intr + 96738 96826 89 2 2 73 109 110 0.848 10.37 4.07 Term + 97013 97025 13 2 1 95 44 14 0.823 -4.63 4.08 PlyA + 97115 97120 6 1.05 5.20 PlyA - 97944 97939 6 1.05 5.19 Term - 101660 99998 1663 1 1 107 48 1305 0.981 117.20 5.18 Intr - 101827 101803 25 2 1 108 87 -26 0.701 -3.62 5.17 Intr - 102219 102107 113 2 2 73 98 55 0.708 5.02 5.16 Intr - 102487 102431 57 0 0 104 94 6 0.625 0.80 5.15 Intr - 103436 103296 141 1 0 119 108 87 0.999 13.27 5.14 Intr - 103777 103605 173 1 2 101 1 108 0.953 2.14 5.13 Intr - 104024 103890 135 2 0 80 98 138 0.997 14.76 5.12 Intr - 106967 106821 147 2 0 77 79 100 0.935 8.43 5.11 Intr - 107592 107492 101 1 2 81 72 27 0.895 0.23 5.10 Intr - 118747 118593 155 0 2 81 94 113 0.979 10.92 5.09 Intr - 119482 119145 338 1 2 67 123 199 0.923 15.62 5.08 Intr - 120257 120151 107 0 2 96 75 33 0.712 2.73 5.07 Intr - 123210 123019 192 1 0 94 82 152 0.960 14.66 5.06 Intr - 123609 123319 291 1 0 46 82 182 0.589 10.51 5.05 Intr - 124907 124799 109 2 1 60 110 -1 0.973 -0.84 5.04 Intr - 126037 125862 176 2 2 100 109 85 0.916 11.46 5.03 Intr - 136282 136124 159 2 0 118 115 85 0.997 14.16 5.02 Intr - 137433 137277 157 0 1 44 92 188 0.037 14.38 5.01 Init - 154244 154146 99 2 0 69 33 166 0.936 7.46 5.00 Prom - 161313 161274 40 -4.06 6.00 Prom + 161542 161581 40 -3.16 6.01 Init + 175220 175245 26 1 2 81 78 19 0.021 -0.64 6.02 Intr + 198067 198376 310 1 1 116 31 99 0.088 3.32 6.03 Intr + 206274 206397 124 2 1 -10 36 199 0.044 5.06 6.04 Intr + 213702 214588 887 1 2 119 71 802 0.125 72.64 6.05 Intr + 215362 215462 101 0 2 102 84 104 0.999 10.31 6.06 Intr + 215610 215679 70 0 1 116 89 23 0.713 4.38 6.07 Intr + 215792 215887 96 1 0 81 116 110 0.989 13.31 6.08 Intr + 218630 218757 128 1 2 103 64 134 0.993 11.98 6.09 Intr + 219422 219554 133 2 1 36 97 182 0.990 14.55 6.10 Intr + 219870 220100 231 2 0 97 1 278 0.944 17.97 6.11 Intr + 220838 220986 149 1 2 103 55 132 0.999 10.43 6.12 Intr + 222171 222303 133 0 1 63 63 197 0.979 15.35 6.13 Intr + 224546 224764 219 1 0 107 -12 223 0.814 12.70 6.14 Intr + 225105 225311 207 2 0 78 78 309 0.996 28.17 6.15 Intr + 225685 225942 258 0 0 98 59 383 0.972 34.06 6.16 Intr + 227598 227768 171 2 0 75 60 195 0.995 15.54 6.17 Intr + 229171 229332 162 0 0 57 78 200 0.980 16.07 6.18 Intr + 230238 230327 90 2 0 77 91 144 0.998 13.79 6.19 Intr + 230796 230921 126 2 0 73 105 11 0.502 2.18 6.20 Intr + 230976 231094 119 2 2 84 46 105 0.981 5.16 6.21 Intr + 231433 231541 109 1 1 76 94 209 0.999 20.59 6.22 Intr + 231926 232089 164 1 2 37 60 288 0.967 19.67 6.23 Intr + 235412 235622 211 2 1 50 95 111 0.290 6.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 137400 137277 124 0 1 64 92 201 0.909 18.43 S.002 Init + 213734 214588 855 1 0 107 71 799 0.863 74.74 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_1|483_aa MVKPLEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEV AQKACQEVLEKVKLLHGGVAVSSRGTPLELVNGDGVDSEIRCLDDPPAQIREEEDEMGAA VASGTAKGARRRRQNNSAKQSWLLRLFESKLFDISMAISYLYNSKEPGVQAYIGNRLFCF RNEDVDFYLPQLLNMYIHMDEDVGDAIKPYIVHRCRQSINFSLQCALLLGAYSSDMHIST QRHSRGTKLRKLILSDELKPAHRKRELPSLSPAPDTGLSPSKRTHQRSKSDATASISLSS NLKRTASNPKVENEDEELSSSTESIDNSFSSPVRLAPEREFIKSLMAIGKRLATLPTKEQ KTQRLISELSLLNHKLPARVWLPTAGFDHHVVRVPHTQAVVLNSKDKAPYLIYVEVLECE NFDTTSVPARIPENRIRSTRSVENLPECGITHEQRAGSFSTVPNYDNDDEAWSVDDIGEL QVE >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_1|1449_bp atggtgaaacccttggaagctcgaagtctggctgtggccatgggagatacagtagtggag cctgcccccttgaagccaacttctgagcccacttctggcccaccagggaataatgggggg tccctgctaagtgtcatcacggagggggtcggggaactatcagtgattgaccctgaggtg gcccagaaggcctgccaggaggtgttggagaaagtcaagcttttgcatggaggcgtggca gtctctagcagaggcaccccactggagttggtcaatggggatggtgtggacagtgagatc cgttgcctagatgatccacctgcccagatcagggaggaggaagatgagatgggggccgct gtggcctcaggcacagccaaaggagcaagaagacggcggcagaacaactcagctaaacag tcttggctgctgaggctgtttgagtcaaaactgtttgacatctccatggccatttcatac ctgtataactccaaggagcctggagtacaagcctacattggcaaccggctcttctgcttt cgcaacgaggacgtggacttctatctgccccagttgcttaacatgtacatccacatggat gaggacgtgggtgatgccattaagccctacatagtccaccgttgccgccagagcattaac ttttccctccagtgtgccctgttgcttggggcctattcttcagacatgcacatttccact caacgacactcccgtgggaccaagctacggaagctgatcctctcagatgagctaaagcca gctcacaggaagagggagctgccctccttgagcccggcccctgacacagggctgtctccc tccaaaaggactcaccagcgctctaagtcagatgccactgccagcataagtctcagcagc aacctgaaacgaacagccagcaaccctaaagtggagaatgaggatgaggagctctcctcc agcaccgagagtattgataattcattcagttcccctgttcgactggctcctgagagagaa ttcatcaagtccctgatggcgatcggcaagcggctggccacgctccccaccaaagagcag aaaacacagaggctgatctcagagctctccctgctcaaccataagctccctgcccgagtc tggctgcccactgctggctttgaccaccacgtggtccgtgtaccccacacacaggctgtt gtcctcaactccaaggacaaggctccctacctgatttatgtggaagtccttgaatgtgaa aactttgacaccaccagtgtccctgcccggatccccgagaaccgaattcggagtacgagg tccgtagaaaacttgcccgaatgtggtattacccatgagcagcgagctggcagcttcagc actgtgcccaactatgacaacgatgatgaggcctggtcggtggatgacataggcgagctg caagtggag >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_2|735_aa MPGRLHLLTGKFPHAGMAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQN KVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEEHT DTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYCYSGI RRKTLVSMPPLPGLDLKGSESVSTKSPSNSTLLSQPEMGPEVTPAPRDELVEAACALTCD WAERILKRSFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGL ENPEGGAHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLP LLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPA LPGPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKA AKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGEGN SAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVSVIKG SRSQKEAFPLAKGEAAPRAAPQPGPGAASAAAVREAQAANGTRERSLLLRRQCRQSNRDC PHPLRGGPPEDQLNLELRRKMCCEQGEPQLPQAVQDRRVFLKAGASFEEVPIRLHLVDYS GAATKRDEQACVEIE >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_2|2208_bp atgcccggccgactccacctcttgacagggaagttccctcatgccgggatggcagaagat gagcctgatgctaagagccccaagactgggggaagggcccccccaggtggtgctgaggct ggggaacctaccacccttcttcagaggctccgaggtaccatttccaaggccgtgcagaac aaagtagaggggatcctgcaagatgtacagaaattttctgacaatgacaagctgtatctc taccttcagctcccctcaggacccaccactggagacaaaagctcagagccaagtacactg agcaatgaggagtacatgtatgcctataggtggatccgcaaccacctggaagagcacact gacacctgtctgccaaagcaaagtgtttatgatgcctatcggaagtactgtgagagtctt gcctgttgccgcccactcagcacagccaactttggcaagatcatcagagagatcttccct gacatcaaagctcgaaggcttggtggccggggccagtccaaatattgctacagtggcata aggaggaagaccttggtgtctatgccacccctgcctggacttgacctaaagggttctgag agtgtaagtaccaaatcaccttccaattccactcttctctcccagccagaaatgggccca gaagtaaccccagcacctcgagatgaactggtggaggcagcgtgtgccctgacctgtgac tgggcagagcggatcctgaaacggtccttcagttccatcgttgaggtcgcccgcttcctg ctacagcagcatctcatctctgcccgatctgcacatgcccatgtgcttaaggccatgggg ctcgctgaagaggacgaacatgcacctcgggaacggtcatctaaaccaaagaatggttta gagaacccagagggtggagcccacaagaagccagagagactggcccagcctcctaaggat ctggaagcccgaactggggccggtcctctcgcacgtggagagcggaagaagagtgtagtt gagagctcggccccaggagccaataacctgcaggttaatgccctagtggctcggctgcct ctgctccttccccgggcccctcgctcactaattccgccaatcccagtctctccacctatt ctggcccccaggctttcttcaggtgccctgaaagtggctacactgcctctgtctagtagg gccggggcacccccagcagctgtgcccatcattaacatgatcttaccaactgttcctgct ttgcctggacctggacctgggcctgggcgagctccacctgggggactcactcagccccgg ggcacagagaacagagaggtaggcataggtggtgaccaaggaccacatgacaagggtgtc aagaggacagctgaagtacctgtgagtgaggccagtgggcaggctccaccagctaaagca gcaaagcaggatatagaggatacagcaagtgatgccaaaaggaaacgggggcgccctcga aaaaagtcaggtggaagtggggaaaggaattctacccctctcaagtcagcagctgccatg gaatctgcccagtcctcaaggttaccatgggagacatggggctcaggaggggaaggcaac tcagctggaggggcagagaggccagggccaatgggagaggctgaaaagggggcagtactt gcccagggtcagggagatggtactgtttccaaaggaggaaggggccccggttcccagcat accaaagaagcagaagataaaattcccttggtcccctcaaaagtgagtgtcatcaagggc agcagaagccaaaaggaggcttttcctttggcaaagggagaggcggcgccgcgggcagcc ccgcagccggggcctggtgcagcctccgcggccgctgtcagggaagcgcaggcggccaat ggaacccgggagcggtcgctgctgctgaggcggcagtgtcggcagtccaaccgcgactgc ccgcaccccctccgcgggggtcccccagaggatcaactaaaccttgaactaagaagaaaa atgtgttgtgagcagggggagcctcagctgcctcaggccgttcaggacagaagggtgttt ctgaaggccggagcaagttttgaagaagtccctatcagattacacttggttgactactcc ggagcagccactaagagggatgaacaggcctgcgtggaaattgaatga >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_3|502_aa MRLEWGPRPAALPWPAGMCAAERAEGAFTLQSVAQPMRPIASTATKCGNCGPGYSTPLEA MKGPREEIVYLPCIYRNTGTEAPDYLATVDVDPKSPQYCQVIHRLPMPNLKDELHHSGWN TCSSCFGDSTKSRTKLVLPSLISSRIYVVDVIEPKDIHAKCELAFLHTSHCLASGEVMIS SLGDVKGNGKGGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVL RDGFNPADVEAGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCAL SSTIQRFYKNEGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHG DLRQYDISDPQRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMI QLSLDGKRLYITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEP LGPALAHELRYPGGDCSSDIWI >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_3|1509_bp atgaggctggagtggggacctaggccagccgcactgccgtggcccgctgggatgtgtgct gcagaacgtgcggagggagccttcaccctccagagcgtggcccagccaatgcgccccatt gcttccacagctacgaaatgtgggaattgtggacccggctactccacccctctggaggcc atgaaaggacccagggaagagatcgtctacctgccctgcatttaccgaaacacaggcact gaggccccagattatctggccactgtggatgttgaccccaagtctccccagtattgccag gtcatccaccggctgcccatgcccaacctgaaggacgagctgcatcactcaggatggaac acctgcagcagctgcttcggtgatagcaccaagtcgcgcaccaagctggtgctgcccagt ctcatctcctctcgcatctatgtggtggacgtcattgagcccaaggacatccatgccaag tgcgaactggcctttctccacaccagccactgcctggccagcggggaagtgatgatcagc tccctgggagacgtcaagggcaatggcaaagggggttttgtgctgctggatggggagacg ttcgaggtgaaggggacatgggagagacctgggggtgctgcaccgttgggctatgacttc tggtaccagcctcgacacaatgtcatgatcagcactgagtgggcagctcccaatgtctta cgagatggcttcaaccccgctgatgtggaggctggactgtacgggagccacttatatgta tgggactggcagcgccatgagattgtgcagaccctgtctctaaaagatgggcttattccc ttggagatccgcttcctgcacaacccagacgctgcccaaggctttgtgggctgcgcactc agctccaccatccagcgcttctacaagaacgagggaggtacatggtcagtggagaaggtg atccaggtgccccccaagaaagtgaagggctggctgctgcccgaaatgccaggcctgatc accgacatcctgctctccctggacgaccgcttcctctacttcagcaactggctgcatggg gacctgaggcagtatgacatctctgacccacagagaccccgcctcacaggacagctcttc ctcggaggcagcattgttaagggaggccctgtgcaagtgctggaggacgaggaactaaag tcccagccagagcccctagtggtcaagggaaaacgggtggctggaggccctcagatgatc cagctcagcctggatgggaagcgcctctacatcaccacgtcgctgtacagtgcctgggac aagcagttttaccctgatctcatcagggaaggctctgtgatgctgcaggttgatgtagac acagtaaaaggagggctgaagttgaaccccaacttcctggtggacttcgggaaggagccc cttggcccagcccttgcccatgagctccgctaccctgggggcgattgtagctctgacatc tggatttga >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_4|264_aa MEAFLGSRSGLWAGGPAPGQFYRIPSTPDSFMDPASALYRGPITRTQNPMVTGTSVLGVK FEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDYADFQYLKQVLGQMVIDEE LLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGYADGESFLGYVDMLGVAYEAP SLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCMRVLYYRDARSYNRFQIATVTEK GVEIEGPLSTETNWDIAHMISGFE >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_4|795_bp atggaagcgtttttggggtcgcggtccggactttgggcggggggtccggccccaggacag ttttaccgcattccgtccactcccgattccttcatggatccggcgtctgcactttacaga ggtccaatcacgcggacccagaaccccatggtgaccgggacctcagtcctcggcgttaag ttcgagggcggagtggtgattgccgcagacatgctgggatcctacggctccttggctcgt ttccgcaacatctctcgcattatgcgagtcaacaacagtaccatgctgggtgcctctggc gactacgctgatttccagtatttgaagcaagttctcggccagatggtgattgatgaggag cttctgggagatggacacagctatagtcctagagctattcattcatggctgaccagggcc atgtacagccggcgctcgaagatgaaccctttgtggaacaccatggtcatcggaggctat gctgatggagagagcttcctcggttatgtggacatgcttggtgtagcctatgaagcccct tcgctggccactggttatggtgcatacttggctcagcctctgctgcgagaagttctggag aagcagccagtgctaagccagaccgaggcccgcgacttagtagaacgctgcatgcgagtg ctgtactaccgagatgcccgttcttacaaccggtttcaaatcgccactgtcaccgaaaaa ggtgttgaaatagagggaccattgtctacagagaccaactgggatattgcccacatgatc agtggctttgaatga >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_5|1445_aa MGPVRAGVGAGGRRCAPPPTASGGAARGRGGRGAHILFECFSYSMADTDLFMECEEEELE PWQKISDVIEDSVVEDYNSVDKTTTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGA QNSDSTKKTLVTLIANNNAGNPLVQQGGQPLILTQNPAPGLGTMVTQPVLRPVQVMQNAN HVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVGIVLNVQQGQTVRPITLVPAPGTQFV KPTVGVPQVFSQMTPVRPGSTMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTS TTPTATQPTSLGQLAVQSPGQSNQTTNPKLVSIASFVTVKRPGVTGENSNEVAKLVNTLN TIPSLGQSPGPVVVSNNSSAHGSQRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQFR VTEALRGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVASTPSSTPIPALS PPTKVPEPNENVGDAVQTKLIMLVDDFYYGRDGGKVAQLTNFPKVATSFRCPHCTKRLKN NIRFMNHMKHHVELDQQNGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTTKCKIC EWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVDVHFRMIHEDTRHLLCPYCL KVFKNGNAFQQHYMRHQKRNVYHCNKCRLQFLFAKDKIEHKLQHHKTFRKPKQLEGLKPG TKVTIRASRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFLYPPVQRSIQKRAVRKM SVMGRQTCLECSFEIPDFPNHFPTYVHCSLCRYSTCCSRAYANHMINNHVPRKSPKYLAL FKNSVSGIKLACTSCTFVTSVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHGQTRDRV HDRNVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPASTATPPPTPTHPQ ALALPPLATEGAECLNVDDQDEGSPVTQEPELASGGGGSGGVGKKEQLSVKKLRVVLFAL CCNTEQAAEHFRNPQRRIRRWLRRFQASQGENLEGKYLSFEAEEKLAEWVLTQREQQLPV NEETLFQKATKIGRSLEGGFKISYEWAVRFMLRHHLTPHARRAVAHTLPKDVAENAGLFI DFVQRQIHNQDLPLSMIVAIDEISLFLDTEVLSSDDRKENALQTVGTGEPWCDVVLAILA DGTVLPTLVFYRGQMDQPANMPDSILLEAKESGYSDDEIMELWSTRVWQKHTACQRSKGM LVMDCHRTHLSEEVLAMLSASSTLPAVVPAGCSSKIQPLDVCIKRTVKNFLHKKWKEQAR EMADTACDSDVLLQLVLVWLGEVLGVIGDCPELVQRSFLVASVLPGPDGNINSPTRNADM QEELIASLEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETESFYGFEEADL DLMEI >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_5|4338_bp atggggccggtacgcgcgggggtgggggcggggggccggcggtgcgcgcccccgcccacc gcgagtggcggcgcggcccgcggcaggggcggccgcggggcccatatcttatttgaatgt ttttcttatagcatggcggacaccgacctgttcatggaatgtgaggaggaggagttggag ccatggcagaaaatcagtgatgtcattgaggactctgtagttgaagattataattcagtg gataaaactaccacagtttctgtgagccagcagccagtctcggctccagtgcccatcgct gcccatgcttctgttgctgggcacctctctacatccaccaccgttagtagcagcggggca cagaacagcgacagtacaaagaagactcttgtcacactaattgccaacaacaatgctggc aatcctttggtccagcaaggtggacagccactcatcctgacccagaatccagccccaggt ctgggcacaatggttactcaaccagtattgaggcctgttcaggtcatgcagaatgccaat catgtgactagttcccctgtggcctcacaaccaatatttatcactacgcagggatttcct gtaaggaatgtccggcctgtacaaaatgcaatgaatcaggttgggattgtgctgaacgta cagcaaggccaaacggttagaccaattacactagttccagccccaggtacccagtttgtt aagccgacagttggagttccacaagtgttctcccagatgacccctgtgaggccaggctcc acaatgcctgtgaggcccaccaccaacaccttcaccaccgtcatcccggccactcttacc attcgaagcaccgtcccacagtcccagtcccagcagaccaagtccactcccagcacttct accactcccactgccacacagccaacctcactggggcaactagctgttcagtctccaggc cagtcaaaccagaccacgaatcccaagctagtgagcattgccagctttgtcactgtgaag cgacctggtgttacaggcgaaaatagcaatgaagtggccaaattggtgaatacccttaac accatcccttccctgggccagagtcctgggccagtggtggtgtccaacaacagctctgct catggctctcaaagaaccagcggacctgagtcttcaatgaaagtgacctcttccatccca gtatttgacctccaggatggtggacggaaaatatgtccacgatgtaatgctcaatttcgt gttactgaagctttgagaggtcacatgtgttactgttgcccagaaatggttgaataccag aagaaaggaaagtccctggattcagaacccagtgtcccatcagcagcaaagcccccatcc cctgagaaaacagctcctgttgcttccacaccctcttctacacctattcctgctctgtca ccgcctaccaaagtaccagaaccaaatgagaacgtgggcgatgccgtccagaccaaactc attatgcttgtagatgacttctactatggacgggatggtggcaaagtagcccagctcaca aatttccctaaggtcgccacatctttccgatgcccacattgtaccaaaaggctaaaaaac aatattcgattcatgaaccatatgaaacaccacgtagaactcgatcagcagaacggtgag gtagatggtcacactatctgccagcactgttaccgccagttttccactcccttccagctt cagtgccacttggaaaatgttcatagtccctatgaatctactaccaagtgcaagatctgt gaatgggcgtttgaaagtgagccactatttctccagcatatgaaggatactcataagcct ggagagatgccttatgtttgccaggtgtgtcaatatcgctcctcactctactctgaggta gatgtccattttcggatgatccatgaggatacccggcatctgctctgcccttattgcctg aaggtcttcaaaaatggcaatgcattccaacagcattacatgaggcaccagaagagaaat gtttatcactgcaacaaatgccggctgcagtttctctttgccaaggacaaaattgaacac aagcttcaacaccataaaaccttccgtaaacccaagcagctggagggcttgaaaccaggc accaaggtgacaatccgggcttcccgagggcagccacgaactgttcctgtatcctctaat gatacacctcccagcgccttgcaggaggcagcaccgctgacctcctcaatggaccctctg cctgtcttcctttatccccctgtccagcgcagcatccagaagagagctgttaggaaaatg agtgtcatgggccggcagacatgcctggagtgcagcttcgagatcccagacttccctaat catttccctacttacgtacactgctctctgtgtcgctatagcacctgctgttctcgagct tatgccaaccacatgatcaacaatcatgttccacggaagagccccaagtatttggctttg tttaaaaattctgtgagtggaatcaagctggcctgcacttcatgtacctttgttacctct gtgggcgatgctatggccaagcatttggtattcaacccctctcacagatccagcagcatc ctgccacggggactcacttggatagctcactcaaggcatggccagactcgtgaccgagtg catgaccggaacgtgaagaatatgtaccctcctccttccttccccactaacaaagctgcc actgtgaaatctgcgggggccaccccagctgagcctgaagagctactaactcccttagcc ccagcactcccatcaccagcctcaactgcaaccccaccaccaacccccactcacccgcag gctttagcccttccaccgctggctacagagggagccgaatgtctgaatgttgatgatcag gatgaagggagcccagtcacccaagaacctgagctagcatcaggtggtggtggtagtggt ggagttggcaaaaaggagcagctgtctgtgaagaagcttcgagtagtactgtttgctcta tgctgcaatacagaacaggcagctgaacacttccgaaatccccagcgacgtattcgccgt tggcttcgacgtttccaggcctcccagggggagaatctagagggcaaatatctgagcttt gaggcagaagagaaactggctgagtgggtgctaacccagcgcgaacaacagctacctgta aatgaggagaccttgttccagaaggccaccaaaataggacgttctttggaaggggggttt aagatctcctatgagtgggctgtgcgtttcatgctgcggcaccacctgactccccatgcc cggcgagctgtggcccacaccctacctaaggatgtagcagagaatgcaggactcttcatt gattttgtacaacggcagattcacaaccaggacttacccttgtctatgattgtggctatt gatgagatctctttgttcctggatacagaggtgctgagcagtgatgatcgaaaggagaat gccctgcagacagtgggcacaggggaaccttggtgtgatgtagtcctagccattctggca gatggcactgtccttcccaccctggttttctacagagggcagatggatcagcctgctaac atgccagactccatattgctagaggcaaaggagagtggctacagtgatgacgagatcatg gagctgtggtcaactcgagtgtggcagaagcacacagcttgccagcgcagcaaaggcatg cttgtgatggactgtcatcgcactcacttgtcagaagaggtactggctatgcttagtgcc tctagcactttgcctgcagtggtcccagcaggctgtagctccaaaattcagccattagat gtatgcatcaaaagaactgtcaagaacttcctgcataaaaaatggaaggaacaggctcgg gaaatggcagatactgcatgtgattctgatgtcctgcttcagctggtgcttgtctggctg ggtgaagtgctaggtgtcattggggactgtccagagctagttcagcgctccttcctggtg gctagtgttctgcctggccccgatggcaacattaactcacctacaagaaatgctgacatg caggaggagctaattgcctccctagaggagcaactgaagctgagtggggaacattctgag tcttccactccacgacccagatcatctcctgaagagacaattgagcctgaaagtcttcac cagctctttgagggtgaaagtgagaccgagtctttctatggctttgaagaagctgaccta gatctgatggagatttga >gi568815597r:151304805_151542200|GENSCAN_predicted_peptide_6|1408_aa MVVCAYSTRPLLEWSPESVQVVWGADTETKLGVQEAYWLATPVKDKEGGRNGQRKTLDRL PYPDKVKEGRSKIGQEKPQAVVQILPSGSLPSGKHQGKGYHIEKSTVGRNGQSPEDALAD GSRDLGRRSLATIRDAPKMASAPLAAPPRRSAARLLLFMEQAPNMAEPRGPVDHGVQIRF ITEPVSGAEMGTLRRGGRRPAKDARASTYGVAVRVQGIAGQPFVVLNSGEKGGDSFGVQI KGANDQGASGALSSDLELPENPYSQVKGFPAPSQSSTSDEEPGAYWNGKLLRSHSQASLA GPGPVDPSNRSNSMLELAPKVASPGSTIDTAPLSSVDSLINKFDSQLGGQARGRTGRRTR MLPPEQRKRSKSLDSRLPRDTFEERERQSTNHWTSSTKYDNHVGTSKQPAQSQNLSPLSG FSRSRQTQDWVLQSFEEPRRSAQDPTMLQFKSTPDLLRDQQEAAPPGSVDHMKATIYGIL REGSSESETSVRRKVSLVLEKMQPLVMVSSGSTKAVAGQGELTRKVEELQRKLDEEVKKR QKLEPSQVGLERQLEEKTEECSRLQELLERRKGEAQQSNKELQNMKRLLDQGEDLRHGLE TQVMELQNKLKHVQGPEPAKEVLLKDLLETRELLEEVLEGKQRVEEQLRLRERELTALKG ALKEEVASRDQEVEHVRQQYQRDTEQLRRSMQDATQACDKSRDHAVLEAERQKMSALVRG LQRELEETSEETGHWQSMFQKNKEDLRATKQELLQLRMEKEEMEEELGEKIEVLQRELEQ ARASAGDTRQVEVLKKELLRTQEELKELQAERQSQEVAGRHRDRELEKQLAVLRVEADRG RELEEQNLQLQKTLQQLRQDCEEASKARGAKMVAEAEATVLGQRRAAVETTLRETQEEND EFRRRILGLEQQLKETRGLVDGGEAVEARLRDKLQRLEAEKQQLEEALNASQEEEGSLAA AKRALEARLEEAQRGLARLGQEQQTLNRALEEEGKQREVLRRGKAELEEQKRLLDRTVDR LNKELEKIGEDSKQALQQLQAQLEDYKEKARREVADAQRQAKDWASEAEKTSGGLSRLQD EIQRLRQALQASQAERDTARLDKELLAQRLQGLEQEAENKKRSQDDRARQLKGLEEKVSR LETELDEEKNTVELLTDRVNRGRDQVDQLRTELMQERSARQDLECDKISLERQVMGEGRI LRDGPQENKDLKTRLASSEGFQKPSASLSQLESQNQLLQERLQAEEREKTVLQSTNRKLE RKVKELSIQIEDERQHVNDQKDQLSLRVKALKRQVDEAEEEIERLDGLRKKAQREVEEQH EVNEQLQARIKSLEKDSCSLRNRELGARPIFSPAPDGPIQGGGSARARPVGARQRGGQVA CVLRPRGKMNGTRNWCTLVDVHPEDQAA >gi568815597r:151304805_151542200|GENSCAN_predicted_CDS_6|4224_bp atggtggtgtgcgcctatagtaccagaccacttctggagtggtcacctgagtcagttcag gtggtctggggagcagatactgagacaaagttaggagtgcaagaggcttactggctggca acacctgtgaaagataaagaaggaggcaggaatgggcagagaaagactttggataggctt ccctatcctgataaagtaaaggaaggcagaagcaaaatcgggcaagaaaaacctcaagct gtggtgcagatccttccaagtggcagtctacccagtggaaagcaccagggcaaaggttat cacatagagaaatcaacagtgggcagaaacggccagagcccagaagatgctctggccgac ggctcccgggatcttggccggcggtcactcgcgaccatccgcgacgcccccaaaatggcc tccgcgcccctcgccgccccgccccgacgctccgcagcccgactcctcctatttatggag caggcacccaacatggctgagccccggggccccgtagaccatggagtccagattcgcttc atcacagagccagtgagtggtgcagagatgggcactctacgtcgaggtggacgacgccca gctaaggatgcaagagccagtacctacggggttgctgtgcgtgtgcagggaatcgctggg cagccctttgtggtgctcaacagtggggagaaaggcggtgactcctttggggtccaaatc aagggggccaatgaccaaggggcctcaggagctctgagctcagatttggaactccctgag aacccctactctcaggtcaagggatttcctgccccctcgcagagcagcacatctgatgag gagcctggggcctactggaatggaaagctactccgttcccactcccaggcctcactggca ggccctggcccagtggatcctagtaacagaagcaacagcatgctggagctagccccgaaa gtggcttccccaggtagcaccattgacactgctcccctgtcttcagtggactcactcatc aacaagtttgacagtcaacttggaggccaggcccggggtcggactggccgccgaacacgg atgctaccccctgaacagcgcaaacggagcaagagcctggacagccgcctcccacgggac acctttgaggaacgggagcgccagtccaccaaccactggacctctagcacaaaatatgac aaccatgtgggcacttcgaagcagccagcccagagccagaacctgagtcctctcagtggc tttagccgttctcgtcagactcaggactgggtccttcagagttttgaggagccgcggagg agtgcacaggaccccaccatgctgcagttcaaatcaactccagacctccttcgagaccag caggaggcagccccaccaggcagtgtggaccatatgaaggccaccatctatggcatcctg agggagggaagctcagaaagtgaaacctctgtgaggaggaaggttagtttggtgctggag aagatgcagcctctagtgatggtttcttctggttctactaaggccgtggcagggcagggt gagcttacccgaaaagtggaggagctacagcgaaagctggatgaagaggtgaagaagcgg cagaagctagagccatcccaagttgggctggagcggcagctggaggagaaaacagaagag tgcagccgactgcaggagctgctggagaggaggaagggggaggcccagcagagcaacaag gagctccagaacatgaagcgcctcttggaccagggtgaagatttacgacatgggctggag acccaggtgatggagctgcagaacaagctgaaacatgtccagggtcctgagcctgctaag gaggtgttactgaaggacctgttagagacccgggaacttctggaagaggtcttggagggg aaacagcgagtagaggagcagctgaggctgcgggagcgggagttgacagccctgaagggg gccctgaaagaggaggtagcctcccgtgaccaggaggtggaacatgtccggcagcagtac cagcgagacacagagcagctccgcaggagcatgcaagatgcaacccaggcatgtgacaag agcagggaccatgcagtgctggaggccgagaggcagaagatgtcagcccttgtgcgaggg ctgcagagggagctggaggagacttcagaggagacagggcattggcagagtatgttccag aagaacaaggaggatcttagagccaccaagcaggaactcctgcagctgcgaatggagaag gaggagatggaagaggagcttggagagaagatagaggtcttgcagagggaattagagcag gcccgagctagtgctggagatactcgccaggttgaggtgctcaagaaggagctgctccgg acacaggaggagcttaaggaactgcaggcagaacggcagagccaggaggtggctgggcga caccgggaccgggagttggagaagcagctggcggtcctgagggtcgaggctgatcgaggt cgggagctggaagaacagaacctccagctacaaaagaccctccagcaactgcgacaggac tgtgaagaggcttccaaggcaaggggagctaagatggtggccgaggcagaggcaacagtg ctggggcagcggcgggccgcagtggagacgacgcttcgggagacccaggaggaaaatgac gaattccgccggcgcatcctgggtttggagcagcagctgaaggagactcgaggtctggtg gatggtggggaagcggtggaggcacgactacgggacaagctgcagcggctggaggcagag aaacagcagctggaggaggccctgaatgcgtcccaggaagaggaggggagtctggcagca gccaagcgggcactggaggcacgcctagaggaggctcagcgggggctggcccgcctgggg caggagcagcagacactgaaccgggccctggaggaggaagggaagcagcgggaggtgctc cggcgaggcaaggctgagctggaggagcagaagcgtttgctggacaggactgtggaccga ctgaacaaggagttggagaagatcggggaggactctaagcaagccctgcagcagctccag gcccagctggaggattataaggaaaaggcccggcgggaggtggcagatgcccagcgccag gccaaggattgggccagtgaggctgagaagacctctggaggactgagccgacttcaggat gagatccagaggctgcggcaggccctgcaggcatcccaggctgagcgggacacagcccgg ctggacaaagagctactggcccagcgactgcaggggctggagcaagaggcagagaacaag aagcgttcccaggacgacagggcccggcagctgaagggtctcgaggaaaaagtctcacgg ctggaaacagagttagatgaggagaagaacaccgtggagctgctaacagatcgggtgaat cgtggccgggaccaggtggatcagctgaggacagagctcatgcaggaaaggtctgctcgg caggacctggagtgtgacaaaatctccttggagagacaggtgatgggggaggggaggatt cttagggatggaccccaggagaacaaggacctgaagacccggttggccagctcagaaggc ttccagaagcctagtgccagcctctctcagcttgagtcccagaatcagttgttgcaggag cggctacaggctgaagagagggagaagacagttctgcagtctaccaatcgaaaactggag cggaaagttaaagaactatccatccagattgaagacgagcggcagcatgtcaatgaccag aaagaccagctaagcctgagggtgaaggctttgaagcgtcaggtggatgaagcagaagag gaaattgagcgactggacggcctgaggaagaaggcccagcgtgaggtggaggagcagcat gaggtcaatgaacagctccaggcccggatcaagtctctggagaaggactcctgttctctg cgaaaccgcgaactgggggcgcggcctatcttcagccccgcccctgatgggcctatacaa gggggcggttccgcgcgcgcccgcccagttggagccagacagcggggtggacaagtggcg tgtgtgctgcgaccccgagggaagatgaacgggacgcggaactggtgtaccctggtggac gtgcacccagaggaccaggcggcg