GENSCAN 1.0    Date run: 10-Feb-117    Time: 15:05:15

Sequence gi568815591r:142771710_142985634 : 213925 bp : 45.36% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    340    499  160  2  1   92   91   238 0.761  23.55
 1.02 Intr +   1557   1810  254  0  2   39   89   403 0.702  32.58
 1.03 Intr +   2210   2346  137  0  2   78   60   138 0.998  10.29
 1.04 Term +   2647   2799  153  0  0  115   42   226 0.999  18.62
 1.05 PlyA +   2830   2835    6                                1.05

 2.00 Prom +   6622   6661   40                               -4.36
 2.01 Init +   7099   7188   90  0  0   83   25   140 0.003   7.69
 2.02 Intr +  15247  15355  109  0  1   54   82    34 0.357  -0.74
 2.03 Intr +  19985  20371  387  0  0  103  109   565 0.876  54.86
 2.04 Intr +  20813  20830   18  0  0  113  114     1 0.774   1.78
 2.05 Intr +  20983  21089  107  2  2   90  103   148 0.974  16.43
 2.06 Intr +  24778  24954  177  0  0   33   99    81 0.142   3.82
 2.07 Intr +  25132  25186   55  0  1   48  111    29 0.130  -0.25
 2.08 Intr +  29332  29718  387  2  0   92  109   568 0.931  54.06
 2.09 Intr +  30235  30252   18  2  0  119  131    14 0.976   5.38
 2.10 Intr +  30396  30502  107  1  2   71   91   167 0.984  15.23
 2.11 Intr +  70737  70815   79  2  1   84   73    40 0.034   1.22
 2.12 Intr +  78253  78330   78  2  0   52   59    75 0.115   0.52
 2.13 Intr +  83385  83676  292  1  1   20  105   231 0.321  14.19
 2.14 Term +  85858  86098  241  1  1  100   41    63 0.261  -1.70
 2.15 PlyA +  86298  86303    6                                1.05

 3.00 Prom +  87201  87240   40                               -2.56
 3.01 Init +  91564  91618   55  0  1   91  116    84 0.986  11.05
 3.02 Intr +  91922  91986   65  0  2   81   84    77 0.999   5.04
 3.03 Intr +  92257  93140  884  0  2   86   77   603 0.740  49.13
 3.04 Intr +  93833  93921   89  2  2   75  105    54 0.761   5.41
 3.05 Intr +  94251  94607  357  1  0   92  105   322 0.993  29.53
 3.06 Intr +  94772  94896  125  0  2   91   85   205 0.973  20.80
 3.07 Intr +  95197  95359  163  0  1   25  105   193 0.930  14.25
 3.08 Intr +  95899  96013  115  2  1   96  101   182 0.998  19.81
 3.09 Intr +  96288  96340   53  0  2   92   75    69 0.627   4.55
 3.10 Intr +  96532  96651  120  2  0   82   92   251 0.999  25.37
 3.11 Intr +  96783  97030  248  1  2   83   56   379 0.991  31.38
 3.12 Intr +  97265  97438  174  1  0  110   94   220 0.931  24.94
 3.13 Intr +  98108  98257  150  1  0   78   83    39 0.870   2.76
 3.14 Intr +  98505  98698  194  2  2   82   47   189 0.950  12.49
 3.15 Intr +  98821  98976  156  1  0  108   63   119 0.993  10.53
 3.16 Term +  99087  99195  109  0  1   77   54   113 0.993   4.78
 3.17 PlyA +  99359  99364    6                               -1.95

 4.33 PlyA -  99521  99516    6                                1.05
 4.32 Term - 100280  99998  283  1  1   79   42   227 0.978  12.10
 4.31 Intr - 100769 100663  107  2  2  107  101   126 0.999  14.81
 4.30 Intr - 102007 101739  269  2  2    8   99   659 0.910  56.25
 4.29 Intr - 102433 102367   67  1  1  108   89    76 0.969   8.18
 4.28 Intr - 102947 102782  166  1  1  133   86   267 0.999  30.96
 4.27 Intr - 103271 103195   77  2  2   91   84    46 0.990   2.81
 4.26 Intr - 103455 103369   87  0  0   94   78   204 0.952  20.17
 4.25 Intr - 103971 103759  213  0  0  133   61   324 0.999  33.11
 4.24 Intr - 104195 104049  147  2  0   92   64   259 0.995  24.33
 4.23 Intr - 104874 104699  176  1  2  107   69   208 0.999  20.46
 4.22 Intr - 105146 105030  117  0  0   88   86   183 0.759  18.54
 4.21 Intr - 105570 105433  138  1  0   69   77    99 0.992   7.34
 4.20 Intr - 106064 105942  123  0  0   48   72   198 0.940  14.76
 4.19 Intr - 106317 106220   98  2  2   89   94   111 0.531  11.45
 4.18 Intr - 113834 113680  155  2  2   42   94    51 0.331  -0.03
 4.17 Intr - 117324 117168  157  2  1  129   86   -20 0.113   1.91
 4.16 Intr - 125330 125252   79  0  1  150   82    14 0.150   5.61
 4.15 Intr - 137099 136840  260  1  2  102   36   196 0.086  12.91
 4.14 Intr - 137980 137781  200  1  2  119  101   100 0.552  12.55
 4.13 Intr - 141041 140773  269  0  2   -4  109   398 0.252  29.95
 4.12 Intr - 142997 142931   67  2  1  122  107    58 0.988   9.58
 4.11 Intr - 143337 143172  166  2  1  126   86   244 0.999  27.96
 4.10 Intr - 143674 143598   77  1  2  105   84    40 0.999   3.61
 4.09 Intr - 143859 143773   87  0  0   48   78   108 0.975   5.97
 4.08 Intr - 154092 153820  273  0  0   84   60   269 0.598  21.43
 4.07 Intr - 156525 156379  147  0  0   80   64   223 0.982  19.53
 4.06 Intr - 157157 156982  176  0  2   93   83   224 0.999  22.06
 4.05 Intr - 157411 157313   99  2  0  103   86   148 0.968  16.18
 4.04 Intr - 157856 157719  138  0  0   65   77   104 0.993   7.44
 4.03 Intr - 158471 158349  123  0  0   70  110   149 0.982  15.86
 4.02 Intr - 158737 158640   98  0  2   86   59    43 0.940   0.85
 4.01 Init - 161750 161623  128  2  2   92   80    25 0.849   1.31
 4.00 Prom - 162527 162488   40                               -8.76

 5.00 Prom + 162847 162886   40                               -6.56
 5.01 Init + 167913 168045  133  2  1   50   96   212 0.974  16.51
 5.02 Term + 168643 168878  236  2  2  118   46    82 0.422   3.48
 5.03 PlyA + 169130 169135    6                                1.05

 6.18 PlyA - 169221 169216    6                                1.05
 6.17 Term - 169704 169543  162  0  0  120   32   103 0.988   5.84
 6.16 Intr - 170820 170725   96  0  0   80   96    93 0.977   9.51
 6.15 Intr - 171335 171166  170  0  2   71   85    83 0.551   5.97
 6.14 Intr - 171634 171567   68  0  2   73   15   132 0.545   2.95
 6.13 Intr - 172174 172074  101  1  2  101   80    36 0.959   3.01
 6.12 Intr - 172691 172614   78  2  0   96   26    84 0.841   2.75
 6.11 Intr - 173032 172934   99  1  0   91   77    87 0.992   8.21
 6.10 Intr - 174608 174498  111  2  0   68   34   173 0.986  10.48
 6.09 Intr - 180916 180800  117  1  0   21   77    76 0.539   0.46
 6.08 Intr - 182247 182002  246  0  0   85   95   258 0.899  23.86
 6.07 Intr - 182663 182475  189  2  0  117   46   161 0.748  14.58
 6.06 Intr - 182818 182756   63  1  0   93  110    76 0.951   9.21
 6.05 Intr - 186719 186595  125  0  2   88   78    95 0.997   8.80
 6.04 Intr - 189395 189219  177  0  0   83   68   118 0.824   9.19
 6.03 Intr - 189792 189651  142  0  1  114   73    28 0.808   3.83
 6.02 Intr - 190163 190086   78  2  0  116  107   -25 0.571   1.95
 6.01 Init - 206894 206829   66  2  0   78   -1    66 0.079  -2.23
 6.00 Prom - 207664 207625   40                               -0.86

 7.02 PlyA - 207780 207775    6                                1.05
 7.01 Term - 213128 213028  101  0  2   88   41   122 0.560   5.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_1|234_aa
XAAPFDDDDKIVGGYICEENSVPYQVSLNSGYHFCGGSLISEQWVVSAGHCYKSRIQVRL
GEHNIEVLEGNEQFINAAKIIRHPKYNSRTLDNDILLIKLSSPAVINSRVSAISLPTAPP
AAGTESLISGWGNTLSSGADYPDELQCLDAPVLSQAECEASYPGKITNNMFCVGFLEGGK
DSCQGDSGGPVVSNGELQGIVSWGYGCAQKNRPGVYTKVYNYVDWIKDTIAANS

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_1|705_bp
nttgctgccccctttgatgatgatgacaagatcgttgggggctacatctgtgaggagaat
tctgtcccctaccaggtgtccttgaattctggctaccacttctgcggtggctccctcatc
agcgaacagtgggtggtgtcagcaggtcactgctacaagtcccgcatccaggtgagactg
ggagagcacaacatcgaagtcctggaggggaatgaacagttcatcaatgcggccaagatc
atccgccaccccaaatacaacagccggactctggacaatgacatcctgctgatcaagctc
tcctcacctgccgtcatcaattcccgcgtgtccgccatctctctgcccactgcccctcca
gctgctggcaccgagtccctcatctccggctggggcaacactctgagttctggtgccgac
tacccagacgagctgcagtgcctggatgctcctgtgctgagccaggctgagtgtgaagcc
tcctaccctggaaagattaccaacaacatgttctgtgtgggcttcctcgagggaggcaag
gattcctgccagggtgattctggtggccctgtggtctccaatggagagctccaaggaatt
gtctcctggggctatggctgtgcccagaagaacaggcctggagtctacaccaaggtctac
aactatgtggactggattaaggacaccatagctgccaacagctaa

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_2|714_aa
MDIQGAIQSITDEHVECFGAIVMLAPVAHMIRHREKWVHSVPFRVAIFLCANYGYTFGSG
TRLTVVEDLNKVFPPEVAVFEPSEAEISHTQKATLVCLATGFFPDHVELSWWVNGKEVHS
GVSTDPQPLKEQPALNDSRYCLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRA
KPVTQIVSAEAWGRADCGFTSVSYQQGVLSATILYEILLGKATLYAVLVSALVLMAMGSG
HWAIQGPPRGKRGLRQGPQGCANTGELFFGEGSRLTVLGKEAVGAPESSERAGWAEAVST
DTQYFGPGTRLTVLEDLKNVFPPKVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWW
VNGKEVHSGVSTDPQPLKEQPALNDSRYCLSSRLRVSATFWQNPRNHFRCQVQFYGLSEN
DEWTQDRAKPVTQIVSAEAWGRADCGFTSESYQQGVLSATILYEILLGKATLYAVLVSAL
VLMAMSPYENFQVALQAKPYCDFAILGKMNRDVAGFEDKGEDLKLRNTVASKNHTGKTAA
LQQRAAGPGTQLGDGDLDAGPQDLPAPHLWSSPCRQRQVHPGIPGTLGAERTRAGCNGVP
GLEKTRVAPCELQEPRVHCEASGGADLQRLRPTGRCKDQPSNHTRHKTSNRFALGTVLSL
LDSAACFVSLNCSCLGSVWGHALYYVFPVDSPLGSPLSLHSLIPLLDIVEEKHA

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_2|2145_bp
atggacatccagggggccatccagagcatcactgatgagcatgttgaatgctttggggcc
atcgtcatgttggccccagttgcacacatgatccgtcacagggaaaagtgggtccacagt
gtcccttttagagtggctatattcttatgtgctaactatggctacaccttcggttcgggg
accaggttaaccgttgtagaggacctgaacaaggtgttcccacccgaggtcgctgtgttt
gagccatcagaagcagagatctcccacacccaaaaggccacactggtgtgcctggccaca
ggcttcttccctgaccacgtggagctgagctggtgggtgaatgggaaggaggtgcacagt
ggggtcagcacggacccgcagcccctcaaggagcagcccgccctcaatgactccagatac
tgcctgagcagccgcctgagggtctcggccaccttctggcagaacccccgcaaccacttc
cgctgtcaagtccagttctacgggctctcggagaatgacgagtggacccaggatagggcc
aaacccgtcacccagatcgtcagcgccgaggcctggggtagagcagactgtggctttacc
tcggtgtcctaccagcaaggggtcctgtctgccaccatcctctatgagatcctgctaggg
aaggccaccctgtatgctgtgctggtcagcgcccttgtgttgatggccatggggagtgga
cactgggcaatccagggccctcctcgagggaagcggggtttgcgccagggtccccagggc
tgtgcgaacaccggggagctgttttttggagaaggctctaggctgaccgtactgggtaag
gaggcggttggggctccggagagctccgagagggcgggatgggcagaggctgtgagcaca
gatacgcagtattttggcccaggcacccggctgacagtgctcgaggacctgaaaaacgtg
ttcccacccaaggtcgctgtgtttgagccatcagaagcagagatctcccacacccaaaag
gccacactggtgtgcctggccacaggcttctaccccgaccacgtggagctgagctggtgg
gtgaatgggaaggaggtgcacagtggggtcagcacagacccgcagcccctcaaggagcag
cccgccctcaatgactccagatactgcctgagcagccgcctgagggtctcggccaccttc
tggcagaacccccgcaaccacttccgctgtcaagtccagttctacgggctctcggagaat
gacgagtggacccaggatagggccaaacctgtcacccagatcgtcagcgccgaggcctgg
ggtagagcagactgtggcttcacctccgagtcttaccagcaaggggtcctgtctgccacc
atcctctatgagatcttgctagggaaggccaccttgtatgccgtgctggtcagtgccctc
gtgctgatggccatgagcccatatgaaaacttccaagtggccctccaggcaaagccatat
tgtgattttgctatcttaggaaaaatgaacagagatgttgctggatttgaagataaagga
gaagacctcaaactaaggaatacagtggcttccaagaaccacactgggaaaaccgcggct
ctacagcagcgggcggcgggacccgggacccagcttggcgacggcgatctcgacgcgggc
ccccaggatctcccggcgccccacctctggagcagcccctgccgccagcgtcaggtccac
cccggaatcccagggactctcggcgccgaacggacccgggccgggtgcaacggggtcccc
ggactggagaagacgcgggtggcaccgtgcgagctccaggagccccgggtccactgcgag
gcctcggggggcgcagacctgcagagactgcggccaacgggaaggtgtaaggaccagccc
agcaaccacaccagacataaaacctcgaaccgctttgcccttgggactgttctgtccctg
ctggactctgcagcctgctttgtctcactaaattgttcctgtctgggctcggtttggggc
catgccctttactatgtgttcccagtggacagtcccctgggttcccctctgtcccttcat
agcctcatccctttgcttgacatcgtcgaagagaagcatgcatga

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_3|1018_aa
MVCSLWVLLLVSSVLALEEVLLDTTGETSEIGWLTYPPGGWDEVSVLDDQRRLTRTFEAC
HVAGAPPGTGQDNWLQTHFVERRGAQRAHIRLHFSVRACSSLGVSGGTCRETFTLYYRQA
EEPDSPDSVSSWHLKRWTKVDTIAADESFPSSSSSSSSSSSSAAWAVGPHGAGQRAGLQL
NVKERSFGPLTQRGFYVAFQDTGACLALVAVRLFSYTCPAVLRSFASFPETQASGAGGAS
LVAAVGTCVAHAEPEEDGVGGQAGGSPPRLHCNGEGKWMVAVGGCRCQPGYQPARGDKAC
QGESPLVLHLPDTSHPPPAFGLLSEHPKIHFICKSHAPNPAAPVCPCLEGFYRASSDPPE
APCTGPPSAPQELWFEVQGSALMLHWRLPRELGGRGDLLFNVVCKECEGRQEPASGGGGT
CHRCRDEVHFDPRQRGLTESRVLVGGLRAHVPYILEVQAVNGVSELSPDPPQAAAINVST
SHEVPSAVPVVHQVSRASNSITVSWPQPDQTNGNILDYQLRYYDQAEDESHSFTLTSETN
TATVTQLSPGHIYGFQVRARTAAGHGPYGGKVYFQTLPQGELSSQLPERLSLVIGSILGA
LAFLLLAAITVLAVVFQRKRRGTGYTEQLQQYSSPGLGVKYYIDPSTYEDPCQAIRELAR
EVDPAYIKIEEVIGTGSFGEVRQGRLQPRGRREQTVAIQALWAGGAESLQMTFLGRAAVL
GQFQHPNILRLEGVVTKSRPLMVLTEFMELGPLDSFLRQREGQFSSLQLVAMQRGVAAAM
QYLSSFAFVHRSLSAHSVLVNSHLVCKVARLGHSPQGPSCLLRWAAPEVIAHGKHTTSSD
VWSFGILMWEVMSYGERPYWDMSEQEVLNAIEQEFRLPPPPGCPPGLHLLMLDTWQKDRA
RRPHFDQLVAAFDKMIRKPDTLQAGGDPGERPSQALLTPVALDFPCLDSPQAWLSAIGLE
CYQDNFSKFGLCTFSDVAQLSLEDLPALGITLAGHQKKLLHHIQLLQQHLRQQGSVEV

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_3|3057_bp
atggtgtgtagcctatgggtgctgctcctggtgtcttcagttctggctctggaagaggta
ttgctggacaccaccggagagacatctgagattggctggctcacctacccaccagggggg
tgggacgaggtgagtgttctggacgaccagcgacgcctgactcggacctttgaggcatgt
catgtggcaggggcccctccaggcaccgggcaggacaattggttgcagacacactttgtg
gagcggcgcggggcccagagggcgcacattcgactccacttctctgtgcgggcatgctcc
agcctgggtgtgagcggcggcacctgccgggagaccttcaccctttactaccgtcaggct
gaggagcccgacagccctgacagcgtttcctcctggcacctcaaacgctggaccaaggtg
gacacaattgcagcagacgagagctttccctcctcctcctcctcctcctcctcctcttct
tcctctgcagcgtgggctgtgggaccccacggggctgggcagcgggctggactgcaactg
aacgtcaaagagcggagctttgggcctctcacccaacgcggcttctacgtggccttccag
gacacgggggcctgcctggccctggtcgctgtcaggctcttctcctacacctgccctgcc
gtgctccgatcctttgcttcctttccagagacgcaggccagtggggctgggggggcctcc
ctggtggcagctgtgggcacctgtgtggctcatgcagagccagaggaggatggagtaggg
ggccaggcaggaggcagcccccccaggctgcactgcaacggggagggcaagtggatggta
gctgtcgggggctgccgctgccagcctggataccaaccagcacgaggagacaaggcctgc
caaggtgagagcccactcgtcttgcacttgcccgacacctcccacccacccccagccttt
gggctcctctctgaacaccccaaaattcattttatctgcaaaagtcacgctcccaaccca
gcagcccccgtttgcccctgcctggagggcttctaccgggccagttccgacccaccagag
gccccctgcactggtcctccatcggctccccaggagctttggtttgaggtgcaaggctca
gcactcatgctacactggcgcctgcctcgggagctggggggtcgaggggacctgctcttc
aatgtcgtgtgcaaggagtgtgaaggccgccaggaacctgccagcggtggtgggggcact
tgtcaccgctgcagggatgaggtccacttcgaccctcgccagagaggcctgactgagagc
cgagtgttagtggggggactccgggcacacgtaccctacatcttagaggtgcaggctgtt
aatggggtgtctgagctcagccctgaccctcctcaggctgcagccatcaatgtcagcacc
agccatgaagtgccctctgctgtccctgtggtgcaccaggtgagccgggcatccaacagc
atcacggtgtcctggccgcagcccgaccagaccaatgggaacatcctggactatcagctc
cgctactatgaccaggcagaagacgaatcccactccttcaccctgaccagcgagaccaac
actgccaccgtgacacagctgagccctggccacatctatggtttccaggtgcgggcccgg
actgctgccggccacggcccctacgggggcaaagtctatttccagacacttcctcaaggg
gagctgtcttcccagcttccagaaagactctccttggtgatcggctccatcctgggggct
ttggccttcctcctgctggcagccatcaccgtgctggcggtcgtcttccagcggaagcgg
cgtgggactggctacacagagcagctgcagcaatacagcagcccaggactcggggtgaag
tattacatcgacccctccacctacgaggacccctgtcaggccatccgagaacttgcccgg
gaagtcgatcctgcttatatcaagattgaggaggtcattgggacaggctcttttggagaa
gtgcgccagggccgcctgcagccacggggacggagggagcagactgtggccatccaggcc
ctgtgggccgggggcgccgaaagcctgcagatgaccttcctgggccgggccgcagtgctg
ggtcagttccagcaccccaacatcctgcggctggagggcgtggtcaccaagagccgaccc
ctcatggtgctgacggagttcatggagcttggccccctggacagcttcctcaggcagcgg
gagggccagttcagcagcctgcagctggtggccatgcagcggggagtggctgctgccatg
cagtacctgtccagctttgccttcgtccatcgctcgctgtctgcccacagcgtgctggtg
aatagccacttggtgtgcaaggtggcccgtcttggccacagtcctcagggcccaagttgt
ttgcttcgctgggcagccccagaggtcattgcacatggaaagcatacaacatccagtgat
gtctggagctttgggatactcatgtgggaagtgatgagttatggagaacggccttactgg
gacatgagtgagcaggaggtactaaatgcaatagagcaggagttccggctgcccccgcct
ccaggctgtcctcctggattacatctacttatgttggacacttggcagaaggaccgtgcc
cggcggcctcattttgaccagctggtggctgcatttgacaagatgatccgcaagccagat
accctgcaggctggcggggacccaggggaaaggccttcccaggcccttctgacccctgtg
gccctggactttccttgtctggactcaccccaggcctggctttcagccattggactggag
tgctaccaggacaacttctccaagtttggcctctgtaccttcagtgatgtggctcagctc
agcctagaagacctgcctgccctgggcatcaccctggctggccaccagaagaagctgctg
caccacatccagctccttcagcaacacctgaggcagcagggctcagtggaggtctga

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_4|1588_aa
MGGFLPKAEGPGSQLQKLLPSFLVREQDWDQHLDKLHMLQQKRILESPLLRASKENDLSV
LRQLLLDCTCDVRQRGALGETALHIAALYDNLEAALVLMEAAPELVFEPTTCEAFAGQTA
LHIAVVNQNVNLVRALLTRRASVSARATGTAFRHSPRNLIYFGEHPLSFAACVNSEEIVR
LLIEHGADIRAQDSLGNTVLHILILQPNKTFACQMYNLLLSYDGHGDHLQPLDLVPNHQG
LTPFKLAGVEGNTVMFQHLMQKRRHIQWTYGPLTSILYDLTEIDSWGEELSFLELVVSSD
KREWDKQHPSVLVTPSFTLLPCQARQILEQTPVKELVSFKWNKYGRPYFCILAALYLLYM
ICFTTCCVYRPLKFRGGNRTHSRDITILQQKLLQEAYETREDIIRLVGELVSIVGAVIIL
LLEIPDIFRVGASRYFGKTILGGPFHVIIITYASLVLVTMVMRLTNTNGEVVPMSFALVL
GWCSVMYFTRGFQMLGPFTIMIQKMIFGDLMRFCWLMAVVILGFASAFYIIFQTEDPTSL
GQFYDYPMALFTTFELFLTVIDAPANYDVDLPFMFSIVNFAFTIIATLLMLNLFIAMMGD
THWRVAQERDELWRAQIAFGSLYQWPQCLFEPHGSFRVLPVSPASFQVVATTVMLERKLP
RCLWPRSGICGCEFGLGDRWFLRVENHNDQNPLRVLRYVEVFKNSDKEDDQEHPSEKQPS
GAESGTLARASLALPTSSLSRTASQSSSHRGWEILRQNTLGHLNLGLNLSELGEPEGQEG
GNPRFHGLCANPPGLRAWSEATTRAEKRSGHALIIMDKIIERKSDSLRSHTSLIIFGAWE
REPGEASPAPKEPALHPMGLSLPKEKGLILCLWSKFCRWFQRRESWAQSRDEQNLLQQKR
IWESPLLLAAKDNDVQALNKLLKYEDCKVHQRGAMGETALHIAALYDNLEAAMVLMEAAP
ELVFEPMTSELYEGQTALHIAVVNQNMNLVRALLARRASVSARATGTAFRRSPCNLIYFD
LSVSTGEHPLSFAACVNSEEIVRLLIEHGADIRAQDSLGNTVLHILILQPNKTFACQMYN
LLLSYDRHGDHLQPLDLVPNHQGLTPFKLAGVEGNTVMFQHLMQKRKHTQWTYGPLTSTL
YDLTEIDSSGDEQSLLELIITTKKREARQILDQTPVKELVSLKWKRYGRPYFCMLGAIYL
LYIICFTMCCIYRPLKPRTNNRTSPRDNTLLQQKLLQEAYMTPKDDIRLVGELVTVIGAI
IILLVEVPDIFRMGVTRFFGQTILGGPFHVLIITYAFMVLVTMVMRLISASGEVVPMSFA
LVLGWCNVMYFARGFQMLGPFTIMIQKMIFGDLMRFCWLMAVVILGFASAFYIIFQTEDP
EELGHFYDYPMALFSTFELFLTIIDGPANYNVDLPFMYSITYAAFAIIATLLMLNLLIAM
MGDTHWRVAHERDELWRAQIVATTVMLERKLPRCLWPRSGICGREYGLGDRWFLRVEDRQ
DLNRQRIQRYAQAFHTRGSEDLDKDSVEKLELGCPFSPHLSLPMPSVSRSTSRSSANWER
LRQGTLRRDLRGIINRGLEDGESWEYQI

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_4|4767_bp
atggggggttttctacctaaggcagaagggcccgggagccaactccagaaacttctgccc
tcctttctggtcagagaacaagactgggaccagcacctggacaagcttcatatgctgcag
cagaagaggattctagagtctccactgcttcgagcatccaaggaaaatgacctgtctgtt
cttaggcaacttctactggactgcacctgtgacgttcgacaaagaggagccctgggggag
acggcgctgcacatagcagccctctatgacaacttggaggcggccttggtgctgatggag
gctgccccagagctggtctttgagcccaccacatgtgaggcttttgcaggtcagactgca
ctgcacatcgctgttgtgaaccagaatgtgaacctggtgcgtgccctgctcacccgcagg
gccagtgtctctgccagagccacaggcactgccttccgccatagtccccgcaacctcatc
tactttggggagcaccctttgtcctttgctgcctgtgtgaacagcgaggagatcgtgcgg
ctgctcattgagcatggagctgacatcagggcccaggactccctgggaaacacagtatta
cacatcctcatcctccagcccaacaaaacctttgcctgccagatgtacaacctgctgctg
tcctatgatggacatggggaccacctgcagcccctggaccttgtgcccaatcaccagggt
ctcacccccttcaagctggctggagtggagggtaacactgtgatgttccagcacctgatg
cagaagcggaggcacatccagtggacgtatggacccctgacctccattctctacgacctc
acggagatcgactcctggggagaggagctgtccttcctggagcttgtggtctcctctgat
aaacgagagtgggataaacaacatccttctgtgttggtcaccccaagtttcactctactt
ccatgccaggctcgccaaattctggaacagaccccagtgaaggagctggtgagcttcaag
tggaacaagtatggccggccgtacttctgcatcctggctgccttgtacctgctctacatg
atctgctttactacgtgctgcgtctaccgcccccttaagtttcgtggtggcaaccgcact
cattctcgagacatcaccatcctccagcaaaaactactacaggaggcctatgagacacgt
gaagatatcatcaggctggtgggggagctggtgagcatcgttggggctgtgatcatcctg
ctcctagagattccagacatcttcagggttggtgcctctcgctattttggaaagacgatt
cttggggggccattccatgtcatcatcatcacctatgcctccctggtgctggtgaccatg
gtgatgcggctcaccaacaccaatggggaggtggtgcccatgtcctttgccctggtgctg
ggctggtgcagtgtcatgtatttcactcgaggattccagatgctgggtcccttcaccatc
atgatccagaagatgatttttggagacctaatgcgtttctgctggctgatggctgtggtc
atcttgggatttgcctccgcgttctatatcattttccagacagaggacccaaccagtctg
gggcaattctatgactaccccatggcactgttcaccacctttgagctttttctcactgtt
attgatgcacctgccaactacgacgtggacttgcccttcatgttcagcattgtcaacttc
gccttcaccatcattgccacactgctcatgctcaacttgttcatcgccatgatgggcgac
acccactggagggtggcccaggagagggatgagctctggagggcccagattgcctttggt
tccctttatcagtggcctcagtgcctctttgaacctcatggctctttcagagttcttcct
gtatctcctgcctccttccaggtcgtggccaccacagtgatgctggagcggaagctgcct
cgctgcctgtggcctcgctccgggatctgtgggtgcgaattcgggctgggggaccgctgg
ttcctgcgggttgagaaccacaatgatcagaatcctctgcgagtgcttcgctatgtggaa
gtgttcaagaactcagacaaggaggatgaccaggagcatccatctgagaaacagccctct
ggggctgagagtgggactctagccagagcctctttggctcttccaacttcctccctgtcc
cggaccgcgtcccagagcagcagtcaccgaggctgggagatccttcgtcaaaacaccctg
gggcacttgaatcttggactgaaccttagtgaacttggtgaacctgaggggcaagaagga
ggcaatcccaggttccatggcctctgtgccaaccccccaggtctcagggcttggtcagag
gccaccacgagggcagaaaagaggtcaggccatgcactcatcattatggataaaatcatt
gagaggaagagcgactcactcagaagtcacacaagtcttattatttttggagcctgggaa
agggaaccgggagaggccagccccgcccccaaggagccggccctacaccccatgggtttg
tcactgcccaaggagaaagggctaattctctgcctatggagcaagttctgcagatggttc
cagagacgggagtcctgggcccagagccgagatgagcagaacctgctgcagcagaagagg
atctgggagtctcctctccttctagctgccaaagataatgatgtccaggccctgaacaag
ttgctcaagtatgaggattgcaaggtgcaccagagaggagccatgggggaaacagcgcta
cacatagcagccctctatgacaacctggaggccgccatggtgctgatggaggctgccccg
gagctggtctttgagcccatgacatctgagctctatgagggtcagactgcactgcacatc
gctgttgtgaaccagaacatgaacctggtgcgagccctgcttgcccgcagggccagtgtc
tctgccagagccacaggcactgccttccgccgtagtccctgcaacctcatctactttgat
ctctctgtgtccacaggggagcaccctttgtcctttgctgcctgtgtgaacagtgaggag
atcgtgcggctgctcattgagcatggagctgacatccgggcccaggactccctgggaaac
acagtgttacacatcctcatcctccagcccaacaaaacctttgcctgccagatgtacaac
ctgttgctgtcctacgacagacatggggaccacctgcagcccctggacctcgtgcccaat
caccagggtctcacccctttcaagctggctggagtggagggtaacactgtgatgtttcag
cacctgatgcagaagcggaagcacacccagtggacgtatggaccactgacctcgactctc
tatgacctcacagagatcgactcctcaggggatgagcagtccctgctggaacttatcatc
accaccaagaagcgggaggctcgccagatcctggaccagacgccggtgaaggagctggtg
agcctcaagtggaagcggtacgggcggccgtacttctgcatgctgggtgccatatatctg
ctgtacatcatctgcttcaccatgtgctgcatctaccgccccctcaagcccaggaccaat
aaccgcacgagcccccgggacaacaccctcttacagcagaagctacttcaggaagcctac
atgacccctaaggacgatatccggctggtcggggagctggtgactgtcattggggctatc
atcatcctgctggtagaggttccagacatcttcagaatgggggtcactcgcttctttgga
cagaccatccttgggggcccattccatgtcctcatcatcacctatgccttcatggtgctg
gtgaccatggtgatgcggctcatcagtgccagcggggaggtggtacccatgtcctttgca
ctcgtgctgggctggtgcaacgtcatgtacttcgcccgaggattccagatgctaggcccc
ttcaccatcatgattcagaagatgatttttggcgacctgatgcgattctgctggctgatg
gctgtggtcatcctgggctttgcttcagccttctatatcatcttccagacagaggacccc
gaggagctaggccacttctacgactaccccatggccctgttcagcaccttcgagctgttc
cttaccatcatcgatggcccagccaactacaacgtggacctgcccttcatgtacagcatc
acctatgctgcctttgccatcatcgccacactgctcatgctcaacctcctcattgccatg
atgggcgacactcactggcgagtggcccatgagcgggatgagctgtggagggcccagatt
gtggccaccacagtgatgctggagcggaagctgcctcgctgcctgtggcctcgctccggg
atctgcggacgggagtatggcctgggagaccgctggttcctgcgggtggaagacaggcaa
gatctcaaccggcagcggatccaacgctacgcacaggccttccacacccggggctctgag
gatttggacaaagactcagtggaaaaactagagctgggctgtcccttcagcccccacctg
tcccttcctatgccctcagtgtctcgaagtacctcccgcagcagtgccaattgggaaagg
cttcggcaagggaccctgaggagagacctgcgtgggataatcaacaggggtctggaggac
ggggagagctgggaatatcagatctga

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_5|122_aa
MPPLAPQLCRAVFLVPILLLLQVKPLNGSPGPKDGSQTEKTPSADQNQEQFEEHFVASSV
GEMWQVVDMAQQEEDQSSKTAAVHKHSFHLSFCFSLASVMVFSGGPLRRTFPNIQLCFML
TH

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_5|369_bp
atgcctcccctggccccccagctctgcagggcagtgttcctggttcctatcttgctgctg
ctgcaggtgaagcctctgaacgggagcccaggccccaaagatgggagccagacagagaaa
acgccctctgcagaccagaatcaagaacagttcgaagagcactttgtggcctcctcagtg
ggtgagatgtggcaggtggtggacatggcccagcaggaagaagaccagtcgtccaagacg
gcagctgttcacaagcactctttccacctcagcttctgctttagtctggccagtgtcatg
gttttctcaggagggccattgaggcggacattcccaaatatccaactctgcttcatgctc
actcactga

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_6|695_aa
MEYYAAMKKNDIMSFAGTCMQLEGGDQSEEEPRERSQAGGMGTLWSQESTPEERLPVEGS
RPWAVARRVLTAILILGLLLCFSVLLFYNFQNCGPRPCETSVCLDLRDHYLASGNTSVAP
CTDFFSFACGRAKETNNSFQELATKNKNRLRRILEVQNSWHPGSGEEKAFQFYNSCMDTL
AIEAAGTGPLRQVIEEIDQPEFDVPLKQDQEQKIYAQIFREYLTYLNQLGTLLGGDPSKV
QEHSSLSISITSRLFQFLRPLEQRRAQGKLFQMVTIDQLKEMAPAIDWLSCLQATFTPMS
LSPSQSLVVHDVEYLKNMSQLVEEMLLKQRFAAGGIGEIMEMEESLSTVDLGGKGNLGEG
RLSHMILGLVVTLSPALDSQFQEARRKLSQKLRELTEQPPMPARPRWMKCVEETGTFFEP
TLAALFVREAFGPSTRSAAMKLFTAIRDALITRLRNLPWMNEETQNMAQDKVAQLQVEMG
ASEWALKPELARQEYNDIQLGSSFLQSVLSCVRSLRARIVQSFLQPHPQHRAVNFGAAGS
IMAHELLHIFYQLLLPGGCLACDNHALQEAHLCLKRHYAAFPLPSRTSFNDSLTFLENAA
DVGGLAIALQAYSKRLLRHHGETVLPSLDLSPQQIFFRSYAQVMCRKPSPQDSHDTHSPP
HLRVHGPLSSTPAFARYFRCARGALLNPSSRCQLW

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_6|2088_bp
atggaatactatgcagccatgaaaaagaatgatatcatgtcctttgcgggaacatgtatg
cagctggaaggtggggaccaaagtgaggaagagccgagggaacgcagccaggcaggtgga
atgggaactctctggagccaagagagcactccagaagagaggctgcccgtggaagggagc
aggccatgggcagtggccaggcgggtgctgacagctatcctgattttgggcctgctcctt
tgtttttctgtgcttttgttctacaacttccagaactgtggccctcgcccctgtgagaca
tctgtgtgtttggatctccgggatcattacctggcctctgggaacacaagtgtggccccc
tgcaccgacttcttcagctttgcctgtggaagggccaaagagaccaataattcttttcag
gagcttgccacaaagaacaaaaaccgacttcggagaatactggaggtccagaattcctgg
cacccaggctctggggaggagaaagccttccagttctacaactcctgcatggatacactt
gccattgaagctgcagggactggtcccctcagacaagttattgaggagatagaccagcca
gagtttgatgttcccctcaagcaagatcaagaacagaagatctatgcccagatctttcgg
gaatacctgacttacctgaatcagctgggaaccttgctgggaggagacccaagcaaggtg
caagaacactcttccttgtcaatctccatcacttcacggctgttccagtttctgaggccc
ctggagcagcggcgggcacagggcaagctcttccagatggtcactatcgaccagctcaag
gaaatggcccccgccatcgactggttgtcctgcttgcaagcgacattcacaccgatgtcc
ctgagcccttctcagtccctcgtggtccatgacgtggaatatttgaaaaacatgtcacaa
ctggtggaggagatgctgctaaagcagaggttcgccgcaggtgggattggggagatcatg
gaaatggaggagagcctgagcaccgtagatcttgggggcaaaggaaaccttggggaaggc
aggctgagccacatgatcttagggctggtggtgaccctttctccagccctggacagtcaa
ttccaggaggcacgcagaaagctcagccagaaactgcgggaactgacagagcaaccaccc
atgcctgcccgcccacgatggatgaagtgcgtggaggagacaggcacgttcttcgagccc
acgctggcggctttgtttgttcgtgaggcctttggcccgagcacccgaagtgctgccatg
aaattattcactgcgatccgggatgccctcatcactcgcctcagaaaccttccctggatg
aatgaggagacccagaacatggcccaggacaaggttgctcaactgcaggtggagatgggg
gcttcagaatgggccctgaagccagagctggcccgacaagaatacaacgatatacagctt
ggatcgagcttcctgcagtctgtcctgagctgtgtccggtccctccgagctagaattgtc
cagagcttcttgcagcctcacccccaacacagagccgtgaactttggcgctgctggcagc
atcatggcccacgagctgttgcacatcttctaccagctcttactgcctgggggctgcctc
gcctgtgacaaccatgccctccaggaagctcacctgtgcctgaagcgccattatgctgcc
tttccattacctagcagaacctccttcaatgactccctcacattcttagagaatgctgca
gacgttggggggctagccatcgcgctgcaggcatacagcaagaggctgttacggcaccat
ggggagactgtcctgcccagcctggacctcagcccccagcagatcttctttcgaagctat
gcccaggtgatgtgtaggaagcccagcccccaggactctcacgacactcacagccctcca
cacctccgagtccacgggcccctcagcagcaccccagcctttgccaggtatttccgctgt
gcacgtggtgctctcttgaacccctccagccgctgccagctctggtaa

>gi568815591r:142771710_142985634|GENSCAN_predicted_peptide_7|33_aa
XLWNLAGGDSSTTMDTRVGREHYLEKRYRQQAS

>gi568815591r:142771710_142985634|GENSCAN_predicted_CDS_7|102_bp
ngcctctggaacctggcaggaggagactcctcaaccaccatggacactcgagttggcagg
gagcactacttagagaagcggtacaggcagcaagccagctga