GENSCAN 1.0    Date run: 3-Nov-116    Time: 14:37:40

Sequence gi568815587r:72477387_72774207 : 296821 bp : 51.42% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   3020   3015    6                               1.05
 1.02 Term -  12274  12215   60  1  0  113   44    47 0.309   0.90
 1.01 Init -  30648  30517  132  0  0   50   31   156 0.058   6.21
 1.00 Prom -  32064  32025   40                              -3.21

 2.00 Prom +  36614  36653   40                              -0.41
 2.01 Init +  36870  36875    6  2  0   36   94     0 0.166  -3.79
 2.02 Term +  43556  43705  150  1  0  109   43   155 0.913  11.32
 2.03 PlyA +  44183  44188    6                               1.05

 3.03 PlyA -  49573  49568    6                               1.05
 3.02 Term -  55305  55190  116  1  2   38   47   107 0.031   0.64
 3.01 Init -  68414  68321   94  2  1   74   52    64 0.543   1.89
 3.00 Prom -  68889  68850   40                               3.89

 4.00 Prom +  77821  77860   40                              -1.91
 4.01 Init +  79640  79713   74  1  2   82   45    66 0.155   2.19
 4.02 Term +  93642  93747  106  0  1  158   47     3 0.039   1.38
 4.03 PlyA +  95591  95596    6                               1.05

 5.33 PlyA -  98777  98772    6                               1.05
 5.32 Term - 100208  99998  211  1  1  107   42   414 0.984  35.79
 5.31 Intr - 100953 100847  107  0  2   21   89   180 0.694  10.91
 5.30 Intr - 101623 101511  113  2  2   92   78   109 0.988  10.90
 5.29 Intr - 101997 101898  100  0  1   80   24   196 0.927  12.58
 5.28 Intr - 102222 102148   75  0  0  103   59   149 0.923  13.71
 5.27 Intr - 103238 103191   48  2  0   66   54    80 0.769   1.76
 5.26 Intr - 103587 103500   88  2  1   93  103    47 0.998   7.17
 5.25 Intr - 104093 103971  123  1  0  136   72   231 0.999  26.51
 5.24 Intr - 104561 104491   71  2  2  142   38   117 0.990  10.57
 5.23 Intr - 105180 105058  123  0  0  108  101   164 0.999  20.99
 5.22 Intr - 106129 106052   78  1  0  112  100    54 0.998   9.24
 5.21 Intr - 106927 106815  113  2  2  104   56   305 0.999  29.50
 5.20 Intr - 107342 107165  178  2  1   86   57   432 0.998  39.91
 5.19 Intr - 107558 107486   73  1  1   64   78   132 0.699   9.70
 5.18 Intr - 108048 107985   64  1  1  129   76    69 0.999   8.07
 5.17 Intr - 108207 108168   40  0  1  121   92    17 0.998   3.68
 5.16 Intr - 108795 108684  112  2  1  101  109   234 0.999  27.68
 5.15 Intr - 111528 111398  131  0  2  124   70   170 0.996  18.80
 5.14 Intr - 111854 111789   66  2  0   86   94    98 0.994   9.79
 5.13 Intr - 112406 112365   42  2  0  120   86    27 0.796   4.62
 5.12 Intr - 112595 112521   75  2  0  103  100    70 0.914   9.91
 5.11 Intr - 112858 112806   53  2  2  113   83    71 0.995   8.22
 5.10 Intr - 113194 113041  154  1  1   46   77   186 0.901  13.46
 5.09 Intr - 113970 113911   60  0  0  109   84    79 0.995   9.02
 5.08 Intr - 119262 119207   56  1  2  106   91    98 0.955  11.09
 5.07 Intr - 120294 120124  171  1  0   62  116    73 0.509   7.93
 5.06 Intr - 121222 121123  100  1  1   74   49    59 0.528   0.78
 5.05 Intr - 125148 125027  122  1  2  108   64    33 0.340   3.62
 5.04 Intr - 126980 126940   41  1  2  103   80    18 0.923   0.85
 5.03 Intr - 127840 127752   89  1  2   69  105    75 0.969   6.57
 5.02 Intr - 131365 131276   90  1  0  137   66    99 0.904  13.29
 5.01 Init - 137624 137610   15  2  0   92   98    19 0.765   3.48
 5.00 Prom - 143700 143661   40                               1.19

 6.00 Prom + 145117 145156   40                              -2.31
 6.01 Init + 149053 149208  156  0  0  104  107    -6 0.134   1.94
 6.02 Term + 155630 155731  102  1  0   95   44    93 0.129   4.18
 6.03 PlyA + 156632 156637    6                               1.05

 7.00 Prom + 159161 159200   40                              -5.91
 7.01 Init + 160087 160412  326  0  2   39   75   116 0.320   2.21
 7.02 Intr + 161327 161424   98  2  2  -18   81   164 0.814   5.25
 7.03 Intr + 164851 165034  184  2  1   56   57   171 0.036  10.16
 7.04 Intr + 166096 166132   37  1  1  119   75    -8 0.035  -0.25
 7.05 Intr + 170901 171014  114  2  0   79   53    91 0.129   5.85
 7.06 Intr + 171497 171703  207  1  0   57   30    95 0.115   0.40
 7.07 Intr + 178007 178094   88  1  1   86   89     7 0.691   0.64
 7.08 Intr + 179204 179354  151  0  1   81   16   119 0.423   3.73
 7.09 Term + 180889 181057  169  1  1   88   54    55 0.466  -0.23
 7.10 PlyA + 181320 181325    6                              -1.75

 8.42 PlyA - 182177 182172    6                               1.05
 8.41 Term - 183114 183048   67  2  1   96   40    40 0.188  -2.30
 8.40 Intr - 185943 185733  211  1  1   75  100   108 0.432   9.20
 8.39 Intr - 189007 188923   85  1  1   67   46    18 0.016  -4.61
 8.38 Intr - 193965 193789  177  0  0   44   84   109 0.490   6.73
 8.37 Intr - 202674 202572  103  2  1  112   65    -8 0.366  -0.02
 8.36 Intr - 203357 203161  197  2  2   73  119   103 0.638  10.63
 8.35 Intr - 207473 207230  244  1  1   94   59    84 0.265   4.13
 8.34 Intr - 208510 208197  314  1  2   58   50    74 0.332  -3.87
 8.33 Intr - 208805 208656  150  2  0  100   65   118 0.989  11.57
 8.32 Intr - 210158 210053  106  1  1    6   81    98 0.477   1.62
 8.31 Intr - 210352 210302   51  0  0  122   89    52 0.993   7.21
 8.30 Intr - 211151 211069   83  2  2   95  107    53 0.998   6.83
 8.29 Intr - 216084 215939  146  1  2   48  114   250 0.986  24.11
 8.28 Intr - 216419 216306  114  0  0   49   69   220 0.982  17.12
 8.27 Intr - 217711 217594  118  1  1  109   94   233 0.999  26.54
 8.26 Intr - 218069 218001   69  2  0   95  100    86 0.990  10.37
 8.25 Intr - 218242 218156   87  1  0  116   72   152 0.854  17.06
 8.24 Intr - 218479 218332  148  0  1  127   27   257 0.920  24.25
 8.23 Intr - 219268 219163  106  2  1   93   81   246 0.994  24.17
 8.22 Intr - 219809 219597  213  0  0  101   80   430 0.999  42.71
 8.21 Intr - 220100 219937  164  1  2   76   89   373 0.999  36.33
 8.20 Intr - 220263 220212   52  1  1   83   83    91 0.918   6.55
 8.19 Intr - 220720 220525  196  1  1   76   21   175 0.963   9.11
 8.18 Intr - 221721 221619  103  2  1   74  105   144 0.999  15.38
 8.17 Intr - 222166 222031  136  2  1   97   92   146 0.999  16.03
 8.16 Intr - 224397 224263  135  1  0   70   98   150 0.988  15.25
 8.15 Intr - 225293 225165  129  0  0   88   38    76 0.712   3.77
 8.14 Intr - 225693 225519  175  0  1  126   99   165 0.999  21.43
 8.13 Intr - 227019 226766  254  1  2   61   77   372 0.831  31.09
 8.12 Intr - 228504 228235  270  1  0   88   55   149 0.745   9.65
 8.11 Intr - 229988 229789  200  1  2   95   89   204 0.984  20.72
 8.10 Intr - 232590 232484  107  0  2  105   60   328 0.847  31.21
 8.09 Intr - 233201 232999  203  0  2   95   33   293 0.977  24.03
 8.08 Intr - 233755 233635  121  1  1  100   40   215 0.897  18.47
 8.07 Intr - 234113 234044   70  1  1  113  100    74 0.950  10.78
 8.06 Intr - 234953 234810  144  1  0   88   84    37 0.733   3.21
 8.05 Intr - 235182 235052  131  0  2   21   89   196 0.580  13.00
 8.04 Intr - 235857 235790   68  1  2   99   81    66 0.710   6.12
 8.03 Intr - 236935 236766  170  0  2   76  109    11 0.121   2.11
 8.02 Intr - 249786 249234  553  1  1  126  105   498 0.526  47.82
 8.01 Init - 262502 262493   10  2  1  100   80     7 0.038   1.88
 8.00 Prom - 263083 263044   40                              -2.91

 9.12 PlyA - 275861 275856    6                               1.05
 9.11 Term - 277756 277511  246  1  0   43   53   482 0.972  36.33
 9.10 Intr - 278367 278315   53  1  2  122  113    14 0.959   6.42
 9.09 Intr - 280068 279910  159  1  0   93    3   101 0.575   2.57
 9.08 Intr - 280498 280381  118  1  1  106  100   228 0.999  26.34
 9.07 Intr - 281247 281144  104  1  2  110   86    96 0.995  11.99
 9.06 Intr - 281995 281848  148  1  1   98   50   323 0.934  29.82
 9.05 Intr - 287476 287352  125  2  2   82    7    91 0.233   1.21
 9.04 Intr - 289351 289229  123  2  0  113   59    -6 0.294   0.06
 9.03 Intr - 290945 290780  166  2  1   45   77    67 0.333   1.35
 9.02 Intr - 294412 294317   96  1  0  128   59    23 0.254   4.11
 9.01 Init - 294709 294602  108  1  0   79   18   106 0.067   0.93

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  48165  48241   77  1  2  105   42    72 0.891   2.60
S.002 Term +  62539  62708  170  2  2   65   48    80 0.837   0.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_1|63_aa
MDKDPIFHSIKEKSYNNSNMDDEVQAYEVSDGNEKRIGNWSKGHVFGEQVVFGYMNKFFS
GDL

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_1|192_bp
atggacaaggaccccatctttcactcaattaaggaaaagtcttacaacaatagtaatatg
gatgatgaagtccaggcttatgaggtctcagatggaaatgagaaacgtattgggaactgg
agcaaaggtcatgtttttggggaacaggtggtatttggttacatgaacaagttctttagt
ggtgatttgtga

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_2|51_aa
MEVTSLGQPAKLDMADNVFDDQCVGCVEEMEKRVPQLLKEELRVNTNLNID

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_2|156_bp
atggaggtgaccagcctgggacagccagccaagctggacatggctgacaatgtgtttgat
gaccagtgcgtgggttgtgttgaagaaatggagaaaagggtgccccagctgttaaaagaa
gaactgagagtgaatacaaatttaaacattgactag

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_3|69_aa
MEIYPEVETYEMEMAVKYTKKELGRAFQIEEASITLIPKPGKDTMKKEYFRPISLMNMDT
KIFNKVLAN

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_3|210_bp
atggaaatctacccagaggtagagacatatgagatggaaatggcagtaaaatatacgaag
aaggagctgggaagagcattccagatagaagaagccagcatcaccctgattccaaaacct
ggcaaagatacaatgaaaaaagaatacttcaggccaatatccctgatgaacatggacaca
aaaatcttcaacaaagtactagcaaactga

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_4|59_aa
MQPAKTPNCATGISTGNTTCNLAPMCSPEPVLSIMAALLCPRPSPPPAGFIIPVYGQKN

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_4|180_bp
atgcagccagccaaaacgcccaactgtgccacgggaatttctactggaaacactacctgt
aacttggcacctatgtgcagtccagagcctgtcttgagcataatggctgctctgctctgc
ccccgaccctcaccacctccagcggggttcatcatcccagtttatggacagaaaaactaa

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_5|993_aa
MEREGDALLSLGSVIDISGLQRAVKEALSAVLPRVETVYTYLLDGESQLVCEDPPHELPQ
EGKVRWVDIMRHDQGHMPGGRGTDLQAAVEPEALAVPEEVGVWGGCYSRKPWTVGKQQQA
RQFLQAQLSGGSMRDEEPRTINHQGVTFGANSGQASFLPESGALKVASCPLFREAIISQK
RLGCNGLGFSDLPGKPLARLVAPLAPDTQVLVMPLADKEAGAVAAVILVHCGQLSDNEEW
SLQAVEKHTLVALRRVQVLQQRGPREAPRAVQNPPEGTAEDQKGGAAYTDRDRKILQLCG
ELYDLDASSLQLKVLQYLQQETRASRCCLLLVSEDNLQLSCKVIGDKVLGEEVSFPLTGC
LGQVVEDKKSIQLKDLTSEDVQQLQSMLGCELQAMLCVPVISRATDQVVALACAFNKLEG
DLFTDEDEHVIQHCFHYTSTVLTSTLAFQKEQKLKCECQALLQVAKNLFTHLDDVSVLLQ
EIITEARNLSNAEICSVFLLDQNELVAKVFDGGVVDDESYEIRIPADQGIAGHVATTGQI
LNIPDAYAHPLFYRGVDDSTGFRTRNILCFPIKNENQEVIGVAELVNKINGPWFSKFDED
LATAFSIYCGISIAHSLLYKKVNEAQYRSHLANEMMMYHMKVSDDEYTKLLHDGIQPVAA
IDSNFASFTYTPRSLPEDDTSMAILSMLQDMNFINNYKIDCPTLARFCLMVKKGYRDPPY
HNWMHAFSVSHFCYLLYKNLELTNYLEDIEIFALFISCMCHDLDHRGTNNSFQVASKSVL
AALYSSEGSVMERHHFAQAIAILNTHGCNIFDHFSRKDYQRMLDLMRDIILATDLAHHLR
IFKDLQKMAEVGYDRNNKQHHRLLLCLLMTSCDLSDQTKGWKTTRKIAEKAMGNRPMEMM
DREKAYIPELQISFMEHIAMPIYKLLQDLFPKAAELYERVASNREHWTKVSHKFTIRGLP
SNNSLDFLDEEYEVPDLDGTRAPINGCCSLDAE

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_5|2982_bp
atggagcgggagggggacgccttgctgagtctgggctctgtcatcgacatttcaggcctg
caacgtgctgtcaaggaggccctgtcagctgtgctcccccgagtggaaactgtctacacc
tacctactggatggtgagtcccagctggtgtgtgaggaccccccacatgagctgccccag
gaggggaaagtccgatgggtggatatcatgagacatgaccaaggccacatgccaggaggg
agagggactgacctacaggctgccgtggagcctgaggctctagcagtgcccgaggaggtg
ggggtgtggggagggtgctactccaggaaaccctggactgtgggcaaacagcagcaggcc
aggcagttcctacaagcacaattaagtggaggcagcatgagggatgaagaacccaggaca
attaatcatcaaggagtgacatttggtgcaaactcaggccaggcctcgttcctccctgag
tcgggggctctcaaggtggcatcatgtcctcttttcagggaggctatcatctcccagaag
cggctgggctgcaatgggctgggcttctcagacctgccagggaagcccttggccaggctg
gtggctccactggctcctgatacccaagtgctggtcatgccgctagcggacaaggaggct
ggggccgtggcagctgtcatcttggtgcactgtggccagctgagtgataatgaggaatgg
agcctgcaggcggtggagaagcataccctggtcgccctgcggagggtgcaggtcctgcag
cagcgcgggcccagggaggctccccgagccgtccagaaccccccggaggggacggcggaa
gaccagaagggcggggcggcgtacaccgaccgcgaccgcaagatcctccaactgtgcggg
gaactctacgacctggatgcctcttccctgcagctcaaagtgctccaatacctgcagcag
gagacccgggcatcccgctgctgcctcctgctggtgtcggaggacaatctccagctttct
tgcaaggtcatcggagacaaagtgctcggggaagaggtcagctttcccttgacaggatgc
ctgggccaggtggtggaagacaagaagtccatccagctgaaggacctcacctccgaggat
gtacaacagctgcagagcatgttgggctgtgagctgcaggccatgctctgtgtccctgtc
atcagccgggccactgaccaggtggtggccttggcctgcgccttcaacaagctagaagga
gacttgttcaccgacgaggacgagcatgtgatccagcactgcttccactacaccagcacc
gtgctcaccagcaccctggccttccagaaggaacagaaactcaagtgtgagtgccaggct
cttctccaagtggcaaagaacctcttcacccacctggatgacgtctctgtcctgctccag
gagatcatcacggaggccagaaacctcagcaacgcagagatctgctctgtgttcctgctg
gatcagaatgagctggtggccaaggtgttcgacgggggcgtggtggatgatgagagctat
gagatccgcatcccggccgatcagggcatcgcgggacacgtggcgaccacgggccagatc
ctgaacatccctgacgcatatgcccatccgcttttctaccgcggcgtggacgacagcacc
ggcttccgcacgcgcaacatcctctgcttccccatcaagaacgagaaccaggaggtcatc
ggtgtggccgagctggtgaacaagatcaatgggccatggttcagcaagttcgacgaggac
ctggcgacggccttctccatctactgcggcatcagcatcgcccattctctcctatacaaa
aaagtgaatgaggctcagtatcgcagccacctggccaatgagatgatgatgtaccacatg
aaggtctccgatgatgagtataccaaacttctccatgatgggatccagcctgtggctgcc
attgactccaattttgcaagtttcacctatacccctcgttccctgcccgaggatgacacg
tccatggccatcctgagcatgctgcaggacatgaatttcatcaacaactacaaaattgac
tgcccgaccctggcccggttctgtttgatggtgaagaagggctaccgggatcccccctac
cacaactggatgcacgccttttctgtctcccacttctgctacctgctctacaagaacctg
gagctcaccaactacctcgaggacatcgagatctttgccttgtttatttcctgcatgtgt
catgacctggaccacagaggcacaaacaactctttccaggtggcctcgaaatctgtgctg
gctgcgctctacagctctgagggctccgtcatggagaggcaccactttgctcaggccatc
gccatcctcaacacccacggctgcaacatctttgatcatttctcccggaaggactatcag
cgcatgctggatctgatgcgggacatcatcttggccacagacctggcccaccatctccgc
atcttcaaggacctccagaagatggctgaggtgggctacgaccgaaacaacaagcagcac
cacagacttctcctctgcctcctcatgacctcctgtgacctctctgaccagaccaagggc
tggaagactacgagaaagatcgcggagaaggccatgggcaacaggccgatggagatgatg
gaccgggagaaggcctatatccctgagctgcaaatcagcttcatggagcacattgcaatg
cccatctacaagctgttgcaggacctgttccccaaagcggcagagctgtacgagcgcgtg
gcctccaaccgtgagcactggaccaaggtgtcccacaagttcaccatccgcggcctccca
agtaacaactcgctggacttcctggatgaggagtacgaggtgcctgatctggatggcact
agggcccccatcaatggctgctgcagccttgatgctgagtga

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_6|85_aa
MGHCRGHWLLMCRIQEEVAHLAARSTHLRMLDKGPKSAAYSAAPPASPQDLLAPQASSPL
LDTGICTNPPITYQCPGGLDSIAAS

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_6|258_bp
atggggcattgtagaggccactggctcctgatgtgcagaatccaggaggaagtagcacat
ctggcagctaggagcactcacttgagaatgttagataagggcccgaaaagtgctgcctac
agcgcggcccctccagcctcaccccaggacttgctggccccccaggccagctcacccctc
ctggacactggaatctgcaccaacccgccaataacctaccagtgccctggaggcctggac
tccatagccgccagttag

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_7|457_aa
MCDSDRKGTVLKGGKNPWAEWNWRLPGGSIWEDPSYVSPKAAIRDSLREVPRPALGLSEQ
TISGGLESAMRGTQCLGLRVVDTGGAGCAFREAFLEEELLLAGMGQGRRTLEFGNNCLKD
DEDDNVNDNEDNSSQRQWDKKARGPPYLQAVGAWLRRRRRLVRLEEDLLAPPEELDNSDE
DVVQHQDHARSRPPARRPGRCARTARRTKPGNGGKHLPEVLPMATSPDPALCPLAMAPDV
GVALTERGCGHGQTHILKHMLTQLLHRGTGGHACFCASSSMPTLAGVIFSTPKSDSASPP
PIWLPIASRTIGALVLAFKAFQPSILLTPQEPGPGRWMVQEPKWWLVLMHASQTRAERTI
GIALADANFIQTEGKVTHGLSSPFSEVVITWENPGQQVWGKRLWRKQGQIFPPSYPPGSQ
KTTLIQAGWSCGSDPSQDFPWKSPPPQPGGEMPDSHL

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_7|1374_bp
atgtgtgactcagacaggaagggcactgtgctcaaaggaggtaagaatccctgggctgag
tggaactggagacttcctggaggtagcatctgggaggacccttcatatgtcagtcctaag
gcagcaattagggacagtctgcgggaagtgcccaggccagccctagggctgtcagagcaa
accatcagtggaggactagagtctgcaatgagggggactcagtgcctcgggctcagggta
gtggacactggaggagctggctgtgcattcagagaagccttcttggaggaagagctactt
ctggctggcatgggacaaggaagaaggaccctggaatttggaaataactgcctaaaagat
gatgaagatgacaatgtcaacgacaatgaagataattccagccagagacaatgggataaa
aaagcgcgggggcccccctacctgcaggctgtcggcgcatggctgcggcggcggcggcgg
ctcgtccggcttgaggaagacctgctggccccgcctgaggaattggacaacagcgatgag
gatgtggtgcagcaccaggaccatgctcgcagccgcccgcccgcccgccggcccggccgc
tgcgctcggacggccaggaggaccaaacccgggaatggcgggaagcacctgccagaggtg
ctgcccatggccaccagtccagatccagctctgtgtccgctggctatggctccagatgtg
ggggtagccctcacagagagaggatgtggccacgggcagacacacatcctcaagcacatg
ctcacacagctcctacacagaggcacaggtggccacgcctgcttctgtgcctcctcttcg
atgcccacactggctggagtcatcttttccacacctaaatctgattctgcctcaccccca
cccatatggcttcctattgccagcaggacaattggagccctggtcctggccttcaaggcc
ttccagcccagcattctcctcactcctcaagaacctggcccaggtagatggatggtacaa
gagccaaaatggtggctcgtgctcatgcatgcaagccagacacgagcagaaaggactatc
ggcatagcccttgctgatgcaaatttcattcaaactgagggcaaagtcacccatgggctc
tcctcgcctttctcggaggtagtgattacctgggaaaatcctggccagcaagtctggggc
aagaggttatggaggaaacaagggcagatcttcccacctagctaccctcctggcagccag
aagaccaccctcatccaagcaggctggagctgtggttctgatccctcccaggactttccc
tggaagtcaccacctcctcagcctggaggagaaatgccagactcccacctgtga

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_8|2059_aa
MAGALDKALPVRPPLAGTMAEAGDAALSVAEWLRALHLEQYTGLFEQHGLVWATECQGLS
DTRLMDMGMLLPGHRRRILAGLLRAHTSPAPAPRPTPRPVPMKRHIFRSPPVPATPPEPL
PTTTEDEGLPAAPPIPPRRSCLPPTCFTTPSTAAPDPVLPPLPAKRHLAELSVPPVPPRT
GPPRLLVSLPTKEEESLLPSLSSPPQPQSEEPLSTLPQGPPQPPSPPPCPPEIPPKPVRL
FPEFDDSDYDEVPEEGPGAPARVMTKKEEPPPSRVPRAVRVASLLSEGEELSGDDQGDEE
EDDHAYEGVPNGGWHTSSLSLSLPSTIAAPHPMDGPPGGSTPVTPVIKAGWLDKNPPQGS
YIYQKRWVRLDTDHLRYFDSNKDAYSKRFISVACISHVAAIGDQKFEVITNNRTFAFRAE
SDVERKEWMQALQQAMAEQRARARLSSAYLLGVPGSEQPDRAGSLELRGFKNKLYVAVVG
DKVQLYKNLEEYHLGIGITFIDMSVGNVKEVDRRSFDLTTPYRIFSFSADSELEKEQWLE
AMQGAIAEALSTSEVAERIWAAAPNRFCADCGAPQPDWASINLCVVICKRCAGEHRGLGA
GVSKVRSLKMDRKVWTETLIEVGMPLLATDTWGLPHSTAVWVSQPPPYLSDHSTLELEET
LCDNLVGGDFGKAIFGKVSKLAKTKFLPGTAPGCQLARLTSCTPPQLFLQLGNGAGNRFW
AANVPPSEALQPSSSPSTRRCHLEAKYREGKYRRYHPLFGNQEELDKALCAAVTTTDLAE
TQALLGCGAGINCFSGDPEAPTPLALAEQAGQTLQMEFLRNNRTTVWAPQDGWQLCWCLC
ASEPPGFVFLGMLRALEYMRVQSVSVLLQVPRLDSMKPLEKHYSVVLPTVSHSGFLYKTA
SAGKLLQDRRAREEFSRRWCVLGDGVLSYFENERAVTPNGEIRASEIVCLAVPPPDTHGF
EHTFEVYTEGERLYLFGLESAEQAHEWVKCIAKAFVPPLAEDLLARDFERLGRLPYKAGL
SLQRAQEGWFSLSGSELRAVFPEGPCEEPLQLRKLQELSIQGDSENQVLVLVERRRTLYI
QGERRLDFMGWLGAIQKAAASMGDTLSEQQLGDSDIPVIVYRCVDYITQCGLTSEGIYRK
CGQTSKTQRLLESLRQDARSVHLKEGEQHVDDVSSALKRFLRDLPDGLFTRAQRLTWLEA
SEIEDEEEKVSRYRELLVRLPPVNRATVKALISHLYCVQCFSDTNQMNVHNLAIVFGPTL
FQTDGQDYKAGRVVEDLINHYVVVFSVDEEELRKQREEITAIVKMRVAGTASGTQHAGDF
ICTVYLEEKKAETEQHIKVPASMTAEELTLEILDRRNVGIREKDYWTCFEVNEREEAERP
LHFAEKVLPILHGLGTDSHLVVKKHQAMEAMLLYLASRVGDTKHGMMKFREDRSLLGLGL
PSGGFHDRYFILNSSCLRLYKEVRSHRPEKEWPIKSLKVYLGVKKKLRPPTCWGFTVVHE
TEKHEKQQCQAFGIMWVRVQPCRYLCCDTQMELREWFATFLFVQHDGLVWPSEPSRVSRA
VPEVRLGSVSLIPLRGSENEMRRSVAAFTADPLSASLVHTIQRSPFCLALEAGSPLCLAL
GAGSLADWSAPRLAVESGCSSCSSFSGAPAPLGLHKIPVGLLPSCSFCATSEHRSPSLAL
GFRRWKPSVQTPLMLQGLIQAGNAGPFLASTDPGWTQAPVLLQSESESQLGPGCGVSVTQ
AWQPCSSPLLETQGQSHRSLRKLRRSTFGSPVAPVPTPPEKGDWGQRGQSNLSSHHDLQP
ALRVGSGSEDASDSAEKPSWGAHHLLCQGFACVIPFYSKTQEEDDWCGELWGKRWFRAEW
EVRGLLARKIPSVLDWRVVVHDSIAAFIHSTTQQELLSSCLVPTWLVTGDTEQTLTESLF
SGSAQSDEGERNGKLQPYRELLMVTLKEFQGLPARSSCTDATTEVQRAQGLLKNVILRTQ
CALEPPKPPVHSSQLALRVPWCALPVLVTLAMEMDPKALALRFIRDRHTQTQAINFVGLR
ESREKANEIKDVQMHSEGA

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_8|6180_bp
atggctggagctctggacaaagccctgccagtcaggcctccgctggcaggaaccatggca
gaggctggggatgctgcgctatcggtggccgagtggctgcgggcattgcacctggagcag
tacacggggctctttgagcagcatggcctggtgtgggccactgagtgccaaggcctcagc
gacacccgcctgatggacatgggcatgctactccctggtcaccgccgccgcatcctggct
ggcctgctccgtgcccatacctcaccggcccctgcaccccgccccaccccacggcctgtg
cccatgaagcgccacatcttccgctcaccacctgtgcctgccactccacccgagccgctg
cccaccactacagaggatgaggggctccccgctgccccacccatcccgccccggaggagc
tgccttccgcccacctgcttcaccaccccatccacagctgccccagaccctgtgctgccc
ccgctgcctgctaagcggcatttggcagagctgagcgttccacccgtgccgccccgcacc
ggacccccccgcctgctggtgagcctgcccactaaggaggaggagtcattgctgccatca
ttatcatcccctccccagccacagtctgaggagcccctgtccaccctcccccaggggcct
ccccagcctccctctccacctccctgccccccggagatacctccaaagccggtacgcctg
ttcccagagttcgatgactctgactacgatgaggtcccagaggaggggccgggggcccca
gccagagtgatgaccaagaaggaggagcccccaccgagccgagtcccacgggccgtgcgc
gtggccagtctgctgagcgagggagaggaactgtctggggacgaccaaggggatgaggaa
gaggatgaccacgcctatgagggcgtccccaatggcggatggcataccagcagcctgagc
ttgtccttgcccagcacaatagctgcgccacaccccatggacgggccgcctgggggctcc
acccccgtcacaccagtcatcaaggctggctggctggacaagaacccaccgcagggatct
tacatctatcagaaacgatgggtgagactggatactgatcacctgcgatactttgacagt
aacaaggacgcttactctaagcgctttatctctgtggcctgcatctcccacgtggctgcc
atcggggaccagaagtttgaagtgatcacaaacaaccgaacctttgccttccgggcagag
agtgatgtggagcggaaggagtggatgcaggccctgcagcaggccatggctgagcagcgt
gcccgggcccggctctctagcgcttatctgctgggagttccaggctcagagcagcctgac
cgcgctggcagcctggagcttcgtggcttcaagaataagctgtacgtggccgtggtcggg
gacaaagtgcagctctacaagaatctagaggagtaccacctgggcattggcatcaccttc
atcgacatgagcgtgggcaacgtgaaggaagtggaccggcgcagcttcgacctcaccacg
ccctaccgcatcttcagcttctctgctgactcagagctagagaaggagcagtggttggag
gccatgcagggagccatcgctgaggccctgtctacctcggaggtggccgagcgcatctgg
gctgcagcccccaacaggttctgtgctgactgcggggctcctcagcctgactgggcctcc
atcaacctctgtgttgttatctgcaagcgctgtgcaggggagcaccgtggcctgggcgct
ggcgtctccaaggtgcggagcctgaagatggacaggaaggtgtggacagaaacacttatc
gaggtggggatgcccctgcttgccaccgatacttggggtctgccccattcaacagctgtc
tgggtctcccagccccctccctatctcagtgaccacagcaccttggagctggaagagacc
ctttgtgataatctagtgggtggggatttcggaaaagcaatttttggcaaagtcagcaaa
ctggccaaaaccaagtttctgcccggcacggcccctggctgtcagctggcccgtctgacc
tcctgcacacccccacagctcttcttacagctggggaatggcgctgggaaccgcttctgg
gcagccaacgtgccccccagtgaggccctgcagcccagcagcagccccagcacccggcgg
tgccacctggaggccaagtaccgtgagggcaagtaccgccgctaccacccgctctttggc
aaccaggaggagctggacaaggccctgtgtgctgcagtcaccaccacagacctggctgag
acccaggcgctcctgggctgtggggctgggatcaactgcttctcgggggaccctgaggcc
cccacgcccctggctcttgcagagcaggcggggcagacgctgcagatggaattccttcgg
aacaaccggaccacagtgtgggcccctcaggatggctggcagctgtgctggtgtttgtgc
gcctctgagccccccggatttgtgtttctggggatgctgagagctttggagtacatgcgt
gtgcagagtgtgtctgtgttgctgcaggtgcctcggctggactcgatgaagcccctggaa
aagcactactcagttgtcctgccgaccgtgagccacagtggcttcctctacaagactgcc
tctgccggcaagctgctacaggaccgccgggcccgggaagagttcagccggcgctggtgt
gtccttggtgacggggtcctgagctactttgagaatgagcgggcagtgacccccaatgga
gagattcgggccagcgagattgtgtgcctggcagtgccccctcctgacacccatggcttt
gagcacacctttgaggtgtacacggagggagaacggctgtacctgtttgggctggagagt
gcggagcaggctcatgagtgggtcaagtgtattgctaaggcattcgtgcctcccctagcc
gaggatctgctggcccgggattttgagcggctgggacgcctaccctacaaagctggcctg
agcctacagcgggcccaggagggctggttctctctcagtggctcggagctccgtgctgtc
ttcccggaggggccctgcgaagagccgctgcaactacggaaactgcaggagctttccatc
cagggggacagtgagaaccaggtgctggtgctggtggagcgaaggaggacactgtacata
cagggcgagcggcggctggacttcatgggttggctgggggccatccagaaagcagccgcc
agcatgggggacacgctgtcggagcagcagcttggggactcggatatcccggtgatcgtg
taccgctgtgtggactacatcacgcagtgcggcctgacctccgagggcatctaccgcaag
tgtgggcagacatcgaagacacagcggctgctggagagcctgcggcaggatgcgcgctct
gtgcacctcaaggagggcgagcagcacgtggatgatgtttcctcggcgctcaagcgcttc
ctgcgcgacctgcctgatgggctcttcactcgcgcccagcgcctaacctggctggaggcc
tcagagattgaggacgaggaggagaaggtctccaggtaccgagagctgctggtgcggctg
ccccctgtcaaccgggccacagtgaaggcccttatcagccacctgtactgtgttcagtgc
ttctcagacacgaaccagatgaacgtgcacaacctggcaattgtgtttgggcccacgctc
ttccagacagatgggcaggactacaaggctggccgtgtggtggaagacctcattaaccac
tatgtggtggtgtttagtgtggatgaggaagagctcaggaagcagcgggaggagatcact
gccattgtgaagatgcgcgtggctggcactgccagtgggacccagcatgccggtgacttc
atctgcacagtgtatctggaagagaagaaggcagagactgagcagcatatcaaggtccca
gcatccatgactgctgaggagctcaccctggagatcctggatcgccggaacgtgggcatc
agggagaaggactattggacctgctttgaggtcaacgagagggaggaggcagagcgcccc
ctgcactttgcggagaaggtgctgcccatcctgcacgggctgggcacggacagccacctg
gtggtgaagaagcaccaggccatggaggccatgctgctgtacctggccagccgtgtcggt
gacaccaagcatggcatgatgaagttccgtgaggaccgcagcctcctgggcctgggcctg
ccctcaggtggcttccacgatcgctacttcatcctcaacagcagctgcttgcggctctac
aaggaggtccggagtcaccggcctgagaaggagtggcctattaagagtctcaaagtctac
ctgggagtgaagaagaaactcaggccacccacctgctggggcttcacagtggtgcatgag
acagagaaacatgagaagcagcagtgtcaggcctttggcatcatgtgggtccgcgtccaa
ccctgcaggtacctctgctgtgacacacagatggagctccgggagtggttcgctaccttt
ctgtttgtgcagcatgacggcctggtgtggccctcagagccctcacgcgtgtcccgggca
gtgcctgaggtccggctgggtagtgtgtcactgatcccccttcgaggtagtgaaaatgaa
atgcgccggagtgtggctgccttcaccgcggaccctctgtctgcctctttggttcacacc
attcaaaggagccccttctgcctggccctggaagctgggagtcccctctgcctggccctg
ggagctgggagcctggctgattggagtgctccccggctggcagtggagagtggctgcagc
agctgcagcagcttcagcggggcgccagcccctctgggccttcacaaaatacctgtgggt
ctcttgccttcctgcagcttctgcgcaacgtctgagcacaggagcccatccttggctcta
ggattccgccgctggaagccttctgttcagacaccccttatgctccaaggcctgatccag
gctgggaatgccggtcctttcctggccagcactgaccctggctggacacaagcccctgtc
ctgctgcagtccgagtctgagtcccagctggggcctgggtgtggtgtgtcagtgactcag
gcctggcagccctgcagcagccccctgctggagacacaaggtcagagccacaggagcctg
aggaaattgagaagaagcacctttgggtcacctgtggccccagtgcctacccctccagag
aaaggggactgggggcagaggggtcagtccaacctcagttcccaccacgatcttcagcca
gcccttagagttggcagtgggagtgaagatgcaagtgatagtgccgagaaaccatcatgg
ggcgcccaccacctgctgtgccaaggctttgcatgtgtcatcccattttattccaagacc
caggaggaagatgactggtgcggggagttatgggggaagaggtggttcagggctgagtgg
gaggttcggggcctcctggcaaggaagatccctagtgtgctggattggagggtggtggtg
catgattccattgctgcattcattcattcaaccactcaacaggaacttctcagtagctgc
ttggtgcccacttggcttgtcaccggggacacagagcagacactgactgagtccctgttc
tcagggagtgcccagtctgatgaaggagaaagaaatggaaagctgcaaccctacagggaa
ctcttaatggtcacacttaaagagtttcagggactcccagcacggtcctcttgtactgat
gcaactactgaagttcagagagcacaggggctgctgaagaatgtcattctcagaacgcag
tgtgccctggagccccccaagccacctgttcattcatcccaactggccttgagggtgccc
tggtgtgccctgcctgtgcttgtcaccctggccatggagatggacccaaaagcccttgct
ctccgcttcattagagacaggcacacccagacgcaggcaatcaattttgtcgggttgcgt
gagagcagggagaaagctaatgagatcaaggatgtgcaaatgcactcagaaggtgcctag

>gi568815587r:72477387_72774207|GENSCAN_predicted_peptide_9|481_aa
MGSGRHPWAAQGVVLWGLLGMKPGTLLESSGGVSPLDRSGEVQGRRPASATFITAAELGL
LLWSLEVEGRAEWRALKPAAGVWVMGGEAGAERAEGHWRWGSDLLCTVGQALHLSEPPLL
VQHGRPSPWVLGETAAPPWSGESTFSNAQHWQDIPENTVRGRERGGVSPGAAKMAYQATP
DGNIWCGHSLSQGLIAGGSWYRPAHLCRMECCDVPAETLYDVLHDIEYRKKWDSNVIETF
DIARLTVNADVGYYSWRCPKPLKNRDVITLRSWLPMGADYIIMNYSVKHPKYPPRKDLVR
AVSIQTGYLIQSTGPKSCVITYLAQVDPKGLSLLSLGMAQPPLWLLSQVAMSLSAEIQIP
VAIGHLDISLRMSSRHLSLVVSSSLPKWVVNKSSQFLAPKAMKKMYKACLKYPEWKQKHL
PHFKPWLHPEQSPLPSLALSELSVQHADSLENIDESAVAESREERMGGAGGEGSDDDTSL
T

>gi568815587r:72477387_72774207|GENSCAN_predicted_CDS_9|1446_bp
atggggagcgggagacatccgtgggctgcccagggtgtggtgctctgggggctgttgggg
atgaagcctggaactttactggagagctctgggggcgtcagtccactggatagaagtgga
gaggtacagggcaggaggccagcctctgcaacgttcatcactgctgcagagctgggcctg
ctgctctggagcctggaggtggagggaagggcagagtggagggccctgaagccagcagct
ggtgtgtgggtcatgggaggagaagcaggagcagaaagagcagaggggcactggcgctgg
ggctcggacctgctgtgtactgtgggccaggctttgcatctctcagagcctcctctcctg
gtccagcatggtcgaccatctccctgggtcttgggagagactgctgcccctccctggagt
ggagagtccaccttctctaatgctcagcactggcaagacatcccagagaacacagtcagg
gggcgggagagagggggagtttctccgggggcagccaagatggcctaccaggcaacacct
gacggcaacatctggtgtggacactcactcagccagggcctaatagctgggggttcctgg
taccggcccgcccacctgtgccggatggagtgctgtgatgtgccagccgagacactctac
gacgtcctacacgacattgagtaccgcaagaaatgggacagcaacgtcattgagactttt
gacatcgcccgcttgacagtcaacgctgacgtgggctattactcctggaggtgtcccaag
cccctgaagaaccgtgatgtcatcaccctccgctcctggctccccatgggcgctgattac
atcattatgaactactcagtcaaacatcccaaatacccacctcggaaagacttggtccga
gctgtgtccatccagacgggctacctcatccagagcacagggcccaagagctgcgtcatc
acctacctggcccaggtggaccccaaaggccttagtcttctctccctggggatggctcag
ccacccctgtggcttctgtcacaagtggccatgtctctctcagctgagattcagattcct
gtggccatcggccacctggacatctcccttcggatgtcttccaggcacctaagcctcgtt
gtgtccagctccttacccaagtgggtggtgaataaatcttctcagttcctggctcccaag
gccatgaagaagatgtacaaggcgtgcctcaagtaccccgagtggaaacagaagcacctg
cctcacttcaagccgtggctgcacccggagcagagcccgttgccgagcctggcgctgtcg
gagctgtcggtgcagcatgcggactcactggagaacatcgacgagagcgcggtggccgag
agcagagaggagcggatgggcggcgcgggcggcgagggcagcgacgacgacacctcgctc
acctga