GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:18:57

Sequence gi568815579f:38519269_38735121 : 215853 bp : 52.01% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +     53    186  134  1  2 -120   76   377 0.866  15.65
 1.02 Intr +   3760   3847   88  1  1   98   94   100 0.893  12.07
 1.03 Intr +   3949   4041   93  0  0   66   58   152 0.999  10.66
 1.04 Intr +   6064   6234  171  0  0   80   11   368 0.995  28.95
 1.05 Intr +   7725   7784   60  2  0   90   91    63 0.957   6.22
 1.06 Intr +   8379   8516  138  2  0  138   94   288 0.997  35.57
 1.07 Intr +   9038   9150  113  1  2  107   85   141 0.999  15.38
 1.08 Intr +   9331   9427   97  1  1  116   44    83 0.984   7.21
 1.09 Intr +   9683   9789  107  1  2   78   57   191 0.898  14.51
 1.10 Intr +  13222  13273   52  1  1   96  100    88 0.994  10.20
 1.11 Intr +  13403  13468   66  1  0   70   92    44 0.853   2.59
 1.12 Intr +  15452  15551  100  1  1  107   44   115 0.997   9.18
 1.13 Intr +  15873  15952   80  1  2   84   92    97 0.992   9.47
 1.14 Intr +  16048  16124   77  0  2  111   96    61 0.999   8.01
 1.15 Intr +  16729  16802   74  1  2  109   81    85 0.999   9.44
 1.16 Intr +  18612  18692   81  1  0  119   77    41 0.786   6.21
 1.17 Intr +  24079  24167   89  2  2   74   94   129 0.999  12.29
 1.18 Intr +  24264  24392  129  2  0   89   91   272 0.999  29.00
 1.19 Intr +  24503  24607  105  1  0   84   52    96 0.908   6.51
 1.20 Intr +  27177  27258   82  2  1    6   92   140 0.977   5.91
 1.21 Intr +  28965  29152  188  1  2   65   98   411 0.973  39.73
 1.22 Intr +  41845  42186  342  0  0  115   71   831 0.673  80.28
 1.23 Intr +  43934  43988   55  1  1  100   60    20 0.566  -0.56
 1.24 Intr +  45617  46503  887  0  2   33   64  1559 0.983 139.72
 1.25 Intr +  47643  47719   77  2  2  130   72   124 0.999  13.81
 1.26 Intr +  48505  48649  145  1  1  136   95   164 0.957  22.69
 1.27 Intr +  51339  51425   87  2  0  117   48   100 0.929   9.56
 1.28 Intr +  52751  53002  252  1  0   94   86   283 0.995  26.86
 1.29 Intr +  53909  54039  131  1  2   86   69   267 0.984  24.60
 1.30 Intr +  56651  56693   43  2  1  160  100    54 0.998  12.53
 1.31 Intr +  58650  58780  131  2  2   83   93   238 0.999  23.80
 1.32 Intr +  58876  58936   61  1  1  108   66   143 0.995  13.33
 1.33 Intr +  60714  60860  147  2  0  119   64   217 0.999  23.34
 1.34 Intr +  61102  61236  135  0  0  106  101   396 0.999  43.97
 1.35 Intr +  65675  65831  157  1  1   95   56   393 0.995  36.90
 1.36 Intr +  66670  66734   65  2  2   94   92   125 0.999  12.43
 1.37 Intr +  66823  66923  101  0  2   91   58   147 0.642  11.41
 1.38 Intr +  67257  67308   52  0  1  113  110    30 0.640   7.10
 1.39 Term +  68057  68152   96  1  0   66   47    34 0.424  -4.63
 1.40 PlyA +  68270  68275    6                               1.05

 2.27 PlyA -  68396  68391    6                               1.05
 2.26 Term -  68549  68480   70  1  1  141   47    75 0.688   6.41
 2.25 Intr -  74069  74014   56  2  2   75   91     2 0.374  -2.53
 2.24 Intr -  76287  76217   71  1  2   59  115    50 0.851   4.19
 2.23 Intr -  76461  76372   90  1  0  114   17   135 0.849   9.46
 2.22 Intr -  76733  76671   63  0  0   62   81    57 0.749   1.58
 2.21 Intr -  77218  77044  175  1  1   70  117   208 0.929  21.93
 2.20 Intr -  77869  77766  104  2  2   73   90    69 0.989   5.99
 2.19 Intr -  78116  78058   59  1  2   96   72    31 0.562   1.32
 2.18 Intr -  80717  80638   80  2  2   86   75    36 0.574   0.94
 2.17 Intr -  80885  80809   77  0  2   76   77   131 0.361  10.53
 2.16 Intr -  86223  86086  138  1  0   93   44    41 0.169   1.14
 2.15 Intr -  86462  86300  163  2  1   62   95    70 0.981   5.16
 2.14 Intr -  86947  86905   43  0  1  110  116     3 0.964   4.03
 2.13 Intr -  88643  88596   48  1  0   72  103   123 0.793  10.48
 2.12 Intr -  88902  88722  181  1  1   86   71    32 0.268   0.74
 2.11 Intr -  90406  90328   79  1  1   91   72    20 0.422   0.32
 2.10 Intr -  90868  90641  228  1  0   88   78   145 0.899  12.09
 2.09 Intr -  92037  91950   88  2  1   85   76   126 0.998  11.57
 2.08 Intr -  93474  93343  132  2  0   95   96   187 0.999  20.47
 2.07 Intr -  94684  94612   73  2  1   98  105    50 0.998   6.66
 2.06 Intr -  94817  94775   43  2  1   97   92    33 0.978   2.90
 2.05 Intr -  95177  95122   56  0  2   40  110    29 0.075  -0.51
 2.04 Intr -  96991  96927   65  0  2   63   72    43 0.122  -0.95
 2.03 Intr -  98176  98086   91  2  1  119   24    93 0.191   5.55
 2.02 Intr -  98357  98300   58  2  1   57   81    92 0.997   4.35
 2.01 Init -  98627  98529   99  2  0   74  100   126 0.995  12.62
 2.00 Prom -  99194  99155   40                              -7.89

 3.00 Prom +  99230  99269   40                              -7.79
 3.01 Init +  99567  99622   56  2  2   79  117    77 0.999   8.45
 3.02 Intr +  99715 100059  345  1  0  -14   41   318 0.918  11.96
 3.03 Intr + 101069 101167   99  2  0  115   87   186 0.985  20.92
 3.04 Intr + 104809 104956  148  1  1  107   49   224 0.971  21.15
 3.05 Intr + 106760 106834   75  1  0   76   28    95 0.206   2.51
 3.06 Intr + 113162 113228   67  1  1   42   84    94 0.169   3.37
 3.07 Intr + 113333 113410   78  0  0   93   64   115 0.727   9.62
 3.08 Intr + 115725 115850  126  1  0  104   79   202 0.986  22.06
 3.09 Term + 117621 117652   32  1  2  156   32    22 0.931   1.70
 3.10 PlyA + 117665 117670    6                               1.05

 4.00 Prom + 119695 119734   40                              -5.01
 4.01 Init + 128478 128639  162  2  0   79   91   358 0.996  34.90
 4.02 Intr + 139335 139528  194  2  2    1   49   103 0.030  -3.49
 4.03 Term + 141978 142161  184  0  1   98   39   136 0.652   7.04
 4.04 PlyA + 146150 146155    6                               1.05

 5.04 PlyA - 153989 153984    6                               1.05
 5.03 Term - 155688 155574  115  2  1   72   48   101 0.372   2.85
 5.02 Intr - 159808 159786   23  1  2   75   93    14 0.277  -2.58
 5.01 Init - 162025 161903  123  1  0   97   74    70 0.614   6.63
 5.00 Prom - 168308 168269   40                              -5.01

 6.00 Prom + 170842 170881   40                              -2.21
 6.01 Init + 173554 173677  124  0  1  104   42    52 0.157   2.53
 6.02 Intr + 177760 177879  120  2  0   76   75    38 0.527   2.27
 6.03 Intr + 178622 178736  115  0  1  120   58    52 0.157   5.41
 6.04 Intr + 178996 179044   49  1  1   53   72    20 0.065  -3.93
 6.05 Intr + 181332 181446  115  2  1  115   94   145 0.956  18.32
 6.06 Intr + 181734 181853  120  1  0   81  105   165 0.956  18.47
 6.07 Intr + 185666 185752   87  0  0  104   56   196 0.894  18.44
 6.08 Intr + 186776 186863   88  0  1  111  115    86 0.999  13.13
 6.09 Intr + 188849 188927   79  2  1  130  113   101 0.999  16.85
 6.10 Intr + 190127 190208   82  1  1  101  110   174 0.999  20.61
 6.11 Intr + 192008 192093   86  0  2  140  105    92 0.978  16.14
 6.12 Intr + 195201 195293   93  2  0  117   86   255 0.915  28.86
 6.13 Intr + 197818 198048  231  0  0   77   97   486 0.944  46.90
 6.14 Intr + 198659 198806  148  1  1   93   76   312 0.997  30.82
 6.15 Intr + 202270 202420  151  2  1   80   70   420 0.902  39.13
 6.16 Intr + 204346 204454  109  1  1   82   78   135 0.992  12.69
 6.17 Intr + 204669 204809  141  2  0   67   41   356 0.856  29.86
 6.18 Intr + 204889 205071  183  0  0   90   86   308 0.999  31.30
 6.19 Intr + 205163 205297  135  1  0  110   78   254 0.999  27.87
 6.20 Intr + 206456 206635  180  1  0   84   69   465 0.941  44.78
 6.21 Intr + 207689 207835  147  1  0  103  105   403 0.999  44.44
 6.22 Intr + 208678 208758   81  0  0   74   84   180 0.999  16.53
 6.23 Intr + 209048 209113   66  1  0   47   72    62 0.457   0.09
 6.24 Intr + 209212 209295   84  0  0  125  -16    70 0.499   0.81
 6.25 Intr + 209728 209886  159  0  0  112  105   306 0.997  35.40
 6.26 Term + 210006 210164  159  2  0   61   54   330 0.973  25.15
 6.27 PlyA + 211235 211240    6                               1.05

 7.08 PlyA - 211289 211284    6                               1.05
 7.07 Term - 211610 211584   27  2  0  118   45    30 0.949   0.06
 7.06 Intr - 211755 211697   59  1  2  102   99    20 0.733   3.59
 7.05 Intr - 211955 211839  117  0  0    0   65   281 0.805  17.94
 7.04 Intr - 214513 214435   79  1  1   79   64   131 0.973   9.42
 7.03 Intr - 214936 214874   63  1  0   52   94    36 0.446   0.01
 7.02 Intr - 215117 215029   89  0  2   19   68    42 0.171  -4.51
 7.01 Intr - 215602 215545   58  1  1   64  105    64 0.623   4.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  95208  95122   87  0  0   20  110    86 0.844   2.74
S.002 Term -  98176  98082   95  2  2  119   45    94 0.802   6.49
S.003 Init + 113180 113228   49  1  1   93   84    43 0.809   5.45
S.004 Sngl - 171209 171003  207  2  0   63   49   193 0.880   8.39

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_1|1695_aa EEQLRLEAKAEAQEGELLVRDEFSVLCRDLYALYPLLIRYVDNNRAQWLTEPNPSAEELF RMVGEIFIYWSKSHNFKREEQNFVVQNEINNMSFLTADNKSKMAKSGGSDQERTKKKRRG DRYSVQTSLIVATLKKMLPIGLNMCAPTDQDLITLAKTRYALKDTDEEVREFLHNNLHLQ GKVEGSPSLRWQMALYRGVPGREEDADDPEKIVRRVQEVSAVLYYLDQTEHPYKSKKAVW HKLLSKQRRRAVVACFRMTPLYNLPTHRACNMFLESYKAAWILTEDHSFEDRMIDDLSKA GEQEEEEEEVEEKKPDPLHQLVLHFSRTALTEKSKLDEDYLYMAYADIMAKSCHLEEGGE NGEAEEEVEVSFEEKQMEKQRLLYQQARLHTRGAAEMVLQMISACKGETGAMVSSTLKLG ISILNGGNAEVQQKMLDYLKDKKEVGFFQSIQALMQTCSVLDLNAFERQNKAEGLGMVNE DGTGEKVMADDEFTQDLFRFLQLLCEGHNNDFQNYLRTQTGNTTTINIIICTVDYLLRLQ ESISDFYWYYSGKDVIEEQGKRNFSKAMSVAKQVFNSLTEYIQGPCTGNQQSLAHSRLWD AVVGFLHVFAHMMMKLAQDSSQIELLKELLDLQKDMVVMLLSLLEGNVVNGMIARQMVDM
LVESSSNVEMILKFFDMFLKLKDIVGSEAFQDYVTDPRGLISKKDFQKAMDSQKQFSGPE IQFLLSCSEADENEMINCEEFANRFQEPARDIGFNVAVLLTNLSEHVPHDPRLHNFLELA ESILEYFRPYLGRIEIMGASRRIERIYFEISETNRAQWEMPQASTYAHVALEHLWGSIII AATALSLLSEPPLTAPYPVCRPSLQVKESKRQFIFDVVNEGGEAEKMELFVSFCEDTIFE MQIAAQISEPEGEPETDEDEGAGAAEAGAEGAEEGAAGLEGTAATAAAGATARVVAAAGR ALRGLSYRSLRRRVRRLRRLTAREAATAVAALLWAAVTRAGAAGAGAAAGALGLLWGSLF GGGLVEGAKKVTVTELLAGMPDPTSDEVHGEQPAGPGGDADGEGASEGAGDAAEGAGDEE EAVHEAGPGGADGAVAVTDGGPFRPEGAGGLGDMGDTTPAEPPTPEGSPILKRKLGVDGV EEELPPEPEPEPEPELEPEKADAENGEKEEVPEPTPEPPKKQAPPSPPPKKEEAGGEFWG ELEVQRVKFLNYLSRNFYTLRFLALFLAFAINFILLFYKVSDSPPGEDDMEGSAAGDVSG AGSGGSSGWGLGAGEEAEGDEDENMVYYFLEESTGYMEPALRCLSLLHTLVAFLCIIGYN CLKVPLVIFKREKELARKLEFDGLYITEQPEDDDVKGQWDRLVLNTPSFPSNYWDKFVKR KVLDKHGDIYGRERIAELLGMDLATLEITAHNERKPNPPPGLLTWLMSIDVKYQIWKFGV IFTDNSFLYLGWYMVMSLLGHYNNFFFAAHLLDIAMGVKTLRTILSSVTHNGKQLVMTVG LLAVVVYLYTVVAFNFFRKFYNKSEDEDEPDMKCDDMMTCYLFHMYVGVRAGGGIGDEIE DPAGDEYELYRVVFDITFFFFVIVILLAIIQGLIIDAFGELRDQQEQVKEDMETKCFICG IGSDYFDTTPHGFETHTLEEHNLANYMFFLMYLINKDETEHTGQESYVWKMYQERCWDFF PAGDCFRKQYEDQLS >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_1|5088_bp gaggagcagctgcgcctggaggccaaggcggaggcccaggagggcgagctgctggtgcgg gacgagttctctgtgctctgccgggacctctacgccctgtatccgctgctcatccgctac gtggacaacaacagggcgcagtggctgacggagccgaatcccagcgcggaggagctgttc aggatggtgggcgagatcttcatctactggtccaagtcccacaacttcaagcgcgaggag cagaactttgtggtccagaatgagatcaacaacatgtccttcctgactgctgacaacaaa agcaaaatggctaagtccggtggctcggaccaggaacgcaccaagaagaagcgccggggg gaccggtactctgtgcagacgtcactgatcgtggccacactgaagaagatgctgcccatc ggcctgaatatgtgtgcgcccaccgaccaagacctcatcacgctggccaagacccgttac gccctgaaagacacagatgaggaggtccgggaatttctgcacaacaaccttcaccttcag ggaaaggtcgaaggctccccgtctctgcgctggcagatggctctgtaccggggcgtcccg ggtcgcgaggaggacgccgatgaccccgagaaaatcgtgcgcagagtccaggaagtgtca gccgtgctctactacctggaccagaccgagcacccttacaagtctaagaaggccgtgtgg cacaagcttttgtccaaacagcgccggcgggcagtcgtggcctgtttccgtatgacgccc ctgtacaacctgcccacgcaccgggcatgtaacatgttcctggagagctacaaggctgca 
tggatcctgactgaagaccacagttttgaggaccgcatgatagatgacctttcaaaagct ggggagcaggaggaggaggaggaagaggtggaagagaagaagccagaccccctgcaccag ttggtcctgcacttcagccgcactgccctgacggaaaagagcaaactggatgaggattac ctgtacatggcctatgctgatatcatggcaaagagctgccacctggaggagggaggggag aacggtgaagctgaagaggaggttgaggtctcctttgaggagaaacagatggagaagcag aggctcttgtaccagcaagcacggctgcacacccggggggcggccgagatggtgctgcag atgatcagtgcctgcaaaggagagacaggtgccatggtgtcctccaccctgaagctgggc atctccatcctcaatggaggcaatgctgaggtccagcagaaaatgctggattatcttaag gacaagaaggaagttggcttcttccagagtatccaggcactgatgcaaacatgcagcgtc ctggatctcaatgcctttgagagacagaacaaggccgaggggctgggcatggtgaatgag gatggcactggagagaaggtcatggcggatgatgaattcacacaagacctgttccgattc ctacaattgctctgtgaggggcacaataatgatttccagaactacctacggacacagaca gggaacacgaccactattaacatcatcatttgcactgtggactacctcctgcggctgcag gaatccatcagcgacttctactggtactactcgggcaaggatgtcattgaagagcagggc aagaggaacttctccaaagccatgtcggtggctaagcaggtgttcaacagcctcactgag tacatccagggtccctgcaccgggaaccagcagagcctggcgcacagtcgcctatgggac gcagtggtgggattcctgcacgtgttcgcccacatgatgatgaagctcgctcaggactca agccagatcgagctgctgaaggagctgctggatctgcagaaggacatggtggtgatgttg ctgtcgctactagaagggaacgtggtgaacggcatgatcgcccggcagatggtggacatg ctcgtggaatcctcatccaatgtggagatgatcctcaagttcttcgacatgttcctgaaa ctcaaggacattgtgggctctgaagccttccaggactacgtaacggatccccgtggcctc atctccaagaaggacttccagaaggccatggacagccagaagcagttcagcggtccagaa atccagttcctgctttcgtgctccgaagcggatgagaacgaaatgatcaactgcgaagag ttcgccaaccgcttccaggagccagcacgcgacatcggcttcaacgtggcggtgctgctg accaacctgtcggagcatgtgccgcatgaccctcgcctgcacaacttcctggagctggcc gagagcatccttgagtacttccgcccctacctgggccgcatcgagatcatgggcgcgtca cgccgcatcgagcgcatctacttcgagatctcagagaccaaccgcgcccagtgggagatg ccccaggcatccacgtatgcccatgtggccttggagcacctgtggggcagtataataata gctgccactgcgctgtcgctgctgtccgagcccccgctgacggcgccctatcctgtctgc cgcccctcgcttcaggtgaaggagtccaagcgccagttcatcttcgacgtggtgaacgag ggcggcgaggctgagaagatggagctcttcgtgagtttctgcgaggacaccatcttcgag atgcagatcgccgcgcagatctcggagcccgagggcgagccggagaccgacgaggacgag 
ggcgcgggcgcggcggaggcgggcgcggaaggcgcggaggagggcgcggcggggctcgag ggcacggcggccacggcggcggcgggggcgacggcgcgggttgtggcggccgcaggccgg gccctgcgaggcctcagctaccgcagcctgcggcggcgcgtgcggcggctgcggcggctt acggcccgcgaggcggccaccgcagtggcggcgctgctctgggcagcagtgacgcgcgct ggggccgctggcgcgggggcggcggcgggcgcgctgggcctgctctggggctcgctgttc ggcggcggcctggtggagggcgccaagaaggtgacggtgaccgagctcctggcaggcatg cccgaccccaccagcgacgaggtgcacggcgagcagccggccgggccgggcggagacgca gacggcgagggtgccagcgagggcgctggagacgccgcggagggcgctggagacgaggag gaggcggtgcacgaggccgggccgggcggtgccgacggggcggtggccgtgaccgatggg ggccccttccggcccgaaggggctggcggtctcggggacatgggggacacgacgcctgcg gaaccgcccacacccgagggctctcccatcctcaagaggaaattgggggtggatggagtg gaggaggagctcccgccagagccagagcccgagccggaaccagagctggagccggagaaa gccgatgccgagaatggggagaaggaagaagttcccgagcccacaccagagccccccaag aagcaagcacctccctcaccccctccaaagaaggaggaagctggaggcgaattctgggga gaactggaggtgcagagggtgaagttcctgaactacctgtcccggaacttttacaccctg cggttccttgccctcttcttggcatttgccatcaacttcatcttgctgttttataaggtc tcagactctccaccaggggaggacgacatggaaggctcagctgctggggatgtgtcaggt gcaggctctggtggcagctctggctggggcttgggggccggagaggaggcagagggcgat gaggatgagaacatggtgtactacttcctggaggaaagcacaggctacatggaacccgcc ctgcggtgtctgagcctcctgcatacactggtggcctttctctgcatcattggctataat tgtctcaaggtgcccctggtaatctttaagcgggagaaggagctggcccggaagctggag tttgatggcctgtacatcacggagcagcctgaggacgatgacgtgaaggggcagtgggac cgactggtgctcaacacgccgtctttccctagcaactactgggacaagtttgtcaagcgc aaggtcctggacaaacatggggacatctacgggcgggagcggattgctgagctactgggc atggacctggccacactagagatcacagcccacaatgagcgcaagcccaacccgccgcca gggctgctgacctggctcatgtccatcgatgtcaagtaccagatctggaagttcggggtc atcttcacagacaactccttcctgtacctgggctggtatatggtgatgtccctcttggga cactacaacaacttcttctttgctgcccatctcctggacatcgccatgggggtcaagacg ctgcgcaccatcctgtcctctgtcacccacaatgggaaacagctggtgatgaccgtgggc cttctggcggtggtcgtctacctgtacaccgtggtggccttcaacttcttccgcaagttc tacaacaagagcgaggatgaggatgaacctgacatgaagtgtgatgacatgatgacgtgt tacctgtttcacatgtacgtgggtgtccgggctggcggaggcattggggacgagatcgag 
gaccccgcgggtgacgaatacgagctctacagggtggtcttcgacatcaccttcttcttc ttcgtcatcgtcatcctgttggccatcatccagggtctgatcatcgacgcttttggtgag ctccgagaccaacaagagcaagtgaaggaggatatggagaccaagtgcttcatctgtgga atcggcagtgactactttgatacgacaccgcatggcttcgagactcacacgctggaggag cacaacctggccaattacatgtttttcctgatgtatttgataaacaaggatgagacagaa cacacgggtcaggagtcttatgtctggaagatgtaccaagagagatgttgggatttcttc ccagctggtgattgtttccgtaagcagtatgaggaccagcttagctga >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_2|809_aa MDVVDPDIFNRDPRDHYDLLQRLGGGTYGEVFKARDKVSGDLVALKMVKMEPDDDVSTLQ KEILILKTCRHANIVAYHGSYLWLQKLWICMEFCGAGSLQDIYQVTGSLSELQISYVCRE VLQGANILINDAGEVRLADFGISAQIGATLARRLSFIGTPYWMAPEVAAVALKGGYNELC DIWSLGITAIELAELQPPLFDVHPLRVLFLMTKSGYQPPRLKEKGKWYERVTGGRLDSFS PVPKALYQSSLSTPALSVHTIPLDTDTSSLRQHQLVSQPGLNRGLILDLLDKLKNPGKGP SIGDIEDEEPELPPAIPRRIRSTHRSSSLGIPDADCCRRHMEFRKLRGMETRPPANTVRC CPPTPHHTACFPEATADPFPTPQARLQPPRDLRSSSPRKQLSESSDDDYDDVDIPTPAED TPPPLPPKPKFRSPSDEGPGSMGDDGQLSPGVLVRCASGPPPNSPRPGPPPSTSSPHLTA HSEPSLWNPPSRELDKPPLLPPKKEKMKRKVRFSSAEDLHLWTGLGMPDQHLLLGAEEGI FILNRNDQEATLEMLFPSRTTWVYSINNVLMSLSGLAVGSRKNMVSTKIQDTKGCRACCV AEGASSGGPFLCGALETSVVLLQWYQPMNKFLLVRQVLFPLPTPLSVFALLTGPGSELPA VCIGVSPGRPGKSVLFHTVRFGALSCWLGEMSTEHRGPVQVTQVEEDMVMVLMDGSVKLV TPEGSPVRGLRTPEIPMTEAVEAVAMVGGQLQAFWKHGVQVWALGSDQLLQELRDPTLTF RLLGSPRPVVVETRPVDDPTAPSNLYIQE >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_2|2430_bp atggacgtcgtggaccctgacattttcaatagagacccccgggaccactatgacctgcta cagcggctgggtggcggcacgtatggggaagtctttaaggctcgagacaaggtgtcaggg gacctggtggcactgaagatggtgaagatggagcctgatgatgatgtctccacccttcag aaggaaatcctcatattgaaaacttgccggcacgccaacatcgtggcctaccatgggagt tatctctggttgcagaaactctggatctgcatggaattctgtggggctggttctctccag gacatctaccaagtgacaggctccctgtcagagctccagattagctatgtctgccgggaa gtgctccagggagctaacatcctcatcaatgatgctggggaggtcagattggctgacttt ggcatctcggcccagattggggctacactggccagacgcctctctttcattgggacaccc tactggatggctccggaagtggcagctgtggccctgaagggaggatacaatgagctgtgt gacatctggtccctgggcatcacggccatcgaactggccgagctacagccaccgctcttt 
gatgtgcaccctctcagagttctcttcctcatgaccaagagtggctaccagcctccccga ctgaaggaaaaaggcaaatggtacgagagagtaacaggagggaggctagattcctttagt ccagtccccaaggccctgtaccagtcctctctgtccacaccagccctctctgtccacacc atccctctggatacggacacctcttccctccgccagcatcaactggtatcccagcctggg ctgaatcgaggcctgatcctggatcttcttgacaaactgaagaatcccgggaaaggaccc tccattggggacattgaggatgaggagcccgagctaccccctgctatccctcggcggatc agatccacccaccgctccagctctctggggatcccagatgcagactgctgtcggcggcac atggagttcaggaagctccgaggaatggagaccagacccccagccaacaccgtgagatgc tgtccccccactccccaccacaccgcctgcttccctgaggccactgctgacccttttcct accccacaggctcgcctacagcctcctcgagacctcaggagcagcagccccaggaagcaa ctgtcagagtcgtctgacgatgactatgacgacgtggacatccccacccctgcagaggac acacctcctccacttccccccaagcccaagttccgttctccatcagacgagggtcctggg agcatgggggatgatgggcagctgagcccgggggtgctggtccggtgtgccagtgggccc ccaccaaacagcccccgtcctgggcctcccccatccaccagcagcccccacctcaccgcc cattcagaaccctcactctggaacccaccctcccgggagcttgacaagcccccacttctg ccccccaagaaggaaaagatgaagagaaaggtgaggttcagctctgctgaggacttacac ctctggacggggctggggatgcctgaccagcacctgctcctgggggcagaggaaggcatc ttcatcctgaaccggaatgaccaggaggccacgctggaaatgctctttcctagccggact acgtgggtgtactccatcaacaacgttctcatgtctctctcaggtctggctgtgggtagc aggaagaacatggtttccaccaagatccaggacaccaaaggctgccgggcgtgctgtgtg gcggagggtgcgagctctgggggcccgttcctgtgcggtgcattggagacgtccgttgtc ctgcttcagtggtaccagcccatgaacaaattcctgcttgtccggcaggtgctgttccca ctgccgacgcctctgtccgtgttcgcgctgctgaccgggccaggctctgagctgcccgct gtgtgcatcggcgtgagccccgggcggccggggaagtcggtgctcttccacacggtgcgc tttggcgcgctctcttgctggctgggcgagatgagcaccgagcacaggggacccgtgcag gtgacccaggtagaggaagatatggtgatggtgttgatggatggctctgtgaagctggtg accccggaggggtccccagtccggggacttcgcacacctgagatccccatgaccgaagcg gtggaggccgtggctatggttggaggtcagcttcaggccttctggaagcatggagtgcag gtgtgggctctaggctcggatcagctgctacaggagctgagagaccctaccctcactttc cgtctgcttggctcccccaggcctgtagtggtggagacacgcccagtggatgatcctact gctcccagcaacctctacatccaggaatga >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_3|341_aa MAARNTTRAHAARAPGASRGACALGSSKIFLTLSVQLEWRECARANCACEEEVTWQPESV 
AAAARAFPFTAPKELERQQRRRFRFHHLFLFPSLRTPCRVSVSLQPWLWKATEVMAMFEQ MRANVGKLLKGIDRYNPENLATLERYVETQAKENAYDLEANLAVLKLYQFNPAFFQTTVT AQILLKALTNLPHTDFTLCKCMIDQAHVSFQHWGRGQEERPIRQILYLGDLLETCHFQAF WQALDENMDLLEGITGFEDSVRKFICHVVGITYQHIDRWLLAEMLGDLSDSQLKVWMSKY GWSADESGQIFICSQEESIKPKNIVEKIDFDSVSSIMASSQ >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_3|1026_bp atggccgcgcgcaatacaacgcgcgcgcacgccgccagagctccgggtgcttcccgaggc gcctgcgcactgggatccagtaagatttttctcacgctgtctgtccagcttgagtggcgc gaatgcgcacgcgccaattgcgcctgcgaggaagaagtcacgtggcagccggaaagcgtg gcggctgctgctagagcctttccctttaccgcacccaaggagctggagcgacaacaacga cgtcgtttccgtttccaccacctcttcctgttcccgtccttgaggacgccgtgccgggtc agtgttagcctccagccctggttgtggaaggcgacagaagtcatggcgatgtttgagcag atgagagccaacgtgggcaagttgctcaagggtatcgacaggtacaatcctgagaacctg gccaccctggagcgctatgtagagacgcaggccaaggaaaatgcctatgatctggaagcc aacctggctgtcctgaagctgtaccagttcaacccagccttctttcagaccacggtcacc gcccagatcctgctgaaggccctcaccaacttgccgcacacagacttcaccctgtgcaag tgcatgatcgaccaggcacatgtatccttccagcactggggccgggggcaagaagaacgg ccaatccgacagattttgtacctcggggacctgctggagacctgccatttccaggccttc tggcaagccctggatgaaaacatggacctcttggaaggtataactggctttgaagactct gtccgaaagtttatctgccatgttgtgggtatcacttaccagcacattgaccgctggctg ctggccgagatgctcggggatctgtcggacagccagctaaaggtgtggatgagcaaatac ggctggagtgccgacgagtcggggcagatcttcatctgtagccaagaagagagcattaaa cccaagaacattgtggagaagattgactttgacagtgtgtccagcatcatggcctcctcc cagtaa >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_4|179_aa MVDYHAANQSYQYGPSSAGNGAGGGGSMGDYMAQEDDWDRDLLLDPAWEKQQRKGQRFED FCDLPVLAYEIQSVSKFKMSGRANLPGRFKGSQMNRPFLQAVYEPVMIQIFKSWKVSRRP PGQAKPELTSKVGTPRRQNMNQPNKSCSYLCDDFGSRHVYLIKALVTLNLEFYGTVYYS >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_4|540_bp atggtggactaccacgcggcgaaccagtcgtaccagtacggccccagcagcgcgggcaat ggcgctggcggcgggggcagcatgggcgactacatggcccaggaggacgactgggaccgg gacctgctgctggacccggcctgggagaagcagcagcgcaagggtcagagatttgaagac ttctgtgatcttccagttctggcctatgaaatacagtcagtttcaaaattcaaaatgtct 
ggaagagccaacttgccaggccggttcaagggcagtcagatgaatcgacccttcttacag gctgtctacgaaccagtcatgattcagatattcaagtcctggaaggtatccagacggcct cctggacaggccaaaccagaactgacttctaaagtgggaaccccaagacggcagaacatg aatcagccaaacaaaagctgcagttatctctgtgatgatttcggcagtcggcacgtgtat ttgattaaagcgttggtaactttgaatctggagttttatggcactgtgtactactcataa >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_5|86_aa MGTDDDKSQRWSISCASDTIFTEWLSHLSKATQAQAASSSTEKDWIAPRCLDEGETPKDS YNSGLLNQRYHEFSPFAIPYSPRPEW >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_5|261_bp atgggcacagatgacgacaagagccaacgttggagcatttcctgtgcatcagataccatc ttcacagagtggttaagtcatctgtccaaagccacacaagctcaggcagccagcagcagc actgagaaagactggattgctccaagatgcctggatgaaggtgaaacacccaaggactcg tacaactctgggctactgaatcagcgatatcacgagttctctcctttcgccataccctat tctccaagacctgaatggtag >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_6|1043_aa MGSVRGHSRQGGSSGGGTAPSPSKQLMPQRSDKPYTVFQSQVRKLRLIACLRPQWQKQDV NPHLCDAQARALVYYPEKMSTGLLGSWAPRLLAACHQGWQCSGAVKVQAPYFPTPGARTW WGLHPHVAEEEMEVRKTFTAWCNSHLRKAGTQIENIDEDFRDGLKLMLLLEVISGERLPK PERGKMRVHKINNVNKALDFIASKGVKLVSIGAEEIVDGNAKMTLGMIWTIILRFAIQDI SVEETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKD DPVTNLNNAFEVAEKYLDIPKMLDAEDIVGTLRPDEKAIMTYVSCFYHAFSGAQKAETAA NRICKVLAVNQENEHLMEDYEKLASDLLEWIRRTIPWLEDRVPQKTIQEMQQKLEDFRDY RRVHKPPKVQEKCQLEINFNTLQTKLRLSNRPAFMPSEGKMVSDINNGWQHLEQAEKGYE EWLLNEIRRLERLDHLAEKFRQKASIHEAWTDGKEAMLKHRDYETATLSDIKALIRKHEA FESDLAAHQDRVEQIAAIAQELNELDYYDSHNVNTRCQKICDQWDALGSLTHSRREALEK TEKQLEAIDQLHLEYAKRAAPFNNWMESAMEDLQDMFIVHTIEEIEGLISAHDQFKSTLP DADREREAILAIHKEAQRIAESNHIKLSGSNPYTTVTPQIINSKWEKVQQLVPKRDHALL EEQSKQQSNEHLRRQFASQANVVGPWIQTKMEEIGRISIEMNGTLEDQLSHLKQYERSIV DYKPNLDLLEQQHQLIQEALIFDNKHTNYTMEHIRVGWEQLLTTIARTINEVENQILTRD AKGISQEQMQEFRASFNHFDKDHGGALGPEEFKACLISLGYDVENDRQKQTGSMDSDDFR ALLISTGYSLSFLVLGDTPPPEQRSRACLFLCCICRGGGEAEFNRIMSLVDPNHSGLVTF QAFIDFMSRETTDTDTADQVIASFKVLAGDKNFITAEELRRELPPDQAEYCIARMAPYQG PDAVPGALDYKSFSTALYGESDL >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_6|3132_bp 
atgggcagcgtcagaggccacagcaggcaaggtggaagttcaggaggtgggacggcgccc tccccctcaaagcaactgatgccccagcggagcgacaaaccctacacggtatttcagagc caagtgaggaaactgaggcttattgcttgcctgaggcctcagtggcagaagcaagatgtg aatccgcacctgtgtgatgcccaagccagagctcttgtttactacccagagaagatgagc acagggctgctgggctcctgggctccacgcctgcttgctgcctgccatcaaggctggcag tgctctggggctgtaaaggtgcaggcgccgtacttcccaacccctggagctcgcacatgg tggggtcttcatccccacgttgcagaggaggaaatggaagttaggaagaccttcacggca tggtgcaactcccacctgcggaaggcaggcacacagatcgagaacattgatgaggacttc cgagacgggctcaagctcatgctgctcctggaggtcatatcaggggagcggttacctaag ccggagcgggggaagatgagagtgcacaaaatcaacaatgtgaacaaagcgctggacttt attgccagcaaaggcgtcaagctggtctccatcggggcagaagagattgtggacggcaac gcaaagatgaccctgggaatgatctggaccatcatccttaggttcgccatccaggacatc tccgtggaagagacctcggccaaggaagggctccttctctggtgccagagaaagacagcc ccgtataagaacgtcaatgtgcagaacttccacatcagctggaaggatggtcttgccttc aatgccctgatccaccggcacagaccagagctgattgagtatgacaagctgaggaaggac gaccctgtcaccaacctgaacaatgccttcgaagtggctgagaaatacctcgacatcccc aagatgctggatgcagaggatattgtgggcactctgaggccagatgagaaggccatcatg acttacgtgtcctgcttctaccacgctttctcgggggctcagaaggctgaaactgccgcc aaccggatctgtaaggtgctggctgtcaaccaagagaacgagcacctgatggaggactac gagaagctggccagcgacctcctggagtggatccggcgcaccatcccctggctggaggac cgtgtgccccaaaagactatccaggagatgcagcagaagctggaggacttccgcgactac cggcgtgtgcacaagccgcccaaggtgcaggagaagtgccagctggagatcaacttcaac acgctgcagaccaagctgcgcctcagcaaccggcccgccttcatgccctccgagggcaag atggtctcggacatcaacaatggctggcagcacttggagcaggctgagaagggctacgag gagtggctgctgaatgagatccgcaggctggagcggctcgaccacctggcagagaagttc cggcagaaggcctccatccacgaggcctggactgacgggaaggaagccatgctgaagcac cgggactacgagacggccacactatcggacatcaaagccctcattcgcaagcacgaggcc ttcgagagcgacctggctgcgcaccaggaccgcgtggagcagatcgccgccattgcccag gagctcaacgagctggattactacgactcccacaatgtcaacacccggtgccagaagatc tgtgaccagtgggacgccctcggctctctgacacatagtcgcagggaagccctggagaaa acagagaagcagctggaggccatcgaccagctgcacctggaatacgccaagcgcgcggcc cccttcaacaactggatggagagcgccatggaggacctccaggacatgttcatcgtccat 
accatcgaggagattgagggcctgatctcagcccatgaccagttcaagtccaccctgccg gacgccgatagggagcgcgaggccatcctggccatccacaaggaggcccagaggatcgct gagagcaaccacatcaagctgtcgggcagcaacccctacaccaccgtcaccccgcaaatc atcaactccaagtgggagaaggtgcagcagctggtgccaaaacgggaccatgccctcctg gaggagcagagcaagcagcagtccaacgagcacctgcgccgccagttcgccagccaggcc aatgttgtggggccctggatccagaccaagatggaggagatcgggcgcatctccattgag atgaacgggaccctggaggaccagctgagccacctgaagcagtatgaacgcagcatcgtg gactacaagcccaacctggacctgctggagcagcagcaccagctcatccaggaggccctc atcttcgacaacaagcacaccaactataccatggagcacatccgcgtgggctgggagcag ctgctcaccaccattgcccgcaccatcaacgaggtggagaaccagatcctcacccgcgac gccaagggcatcagccaggagcagatgcaggagttccgggcgtccttcaaccacttcgac aaggatcatggcggggcgctggggcccgaggagttcaaggcctgcctcatcagcctgggc tacgacgtggagaacgaccggcagaagcagacaggcagcatggactccgatgacttcagg gctctgcttatctccacaggatacagcctgagcttcctcgtcctcggggacactcctccg cccgagcagcgcagccgtgcctgcctcttcctctgctgcatctgccggggtgggggtgag gccgagttcaaccgcatcatgagcctggtcgaccccaaccatagcggccttgtgaccttc caagccttcatcgacttcatgtcgcgggagaccaccgacacggacacggctgaccaggtc atcgcttccttcaaggtcttagcaggggacaagaacttcatcacagctgaggagctgcgg agagagctgccccccgaccaggccgagtactgcatcgcccgcatggcgccataccagggc cctgacgccgtgcccggtgccctcgactacaagtccttctccacggccttgtatggcgag agcgacctgtga >gi568815579f:38519269_38735121|GENSCAN_predicted_peptide_7|163_aa EEELNASQLQALLSIALEPGPIPPPPERSGSGPVSSCCSVSGYMGGSAWHGQSLALHHFQ QLWGYLLEWQAIFNKFDEDTSGTMNSYELRLALNAAGFHLNNQLTQTLTSRYRDSRLRVD FERFVSCVAHLTCIFCHCSQHLDGGEGVICLTHRQWMEVATFS >gi568815579f:38519269_38735121|GENSCAN_predicted_CDS_7|492_bp gaggaagaactcaatgcctctcagctccaggccttactaagcattgccctggagcctggg cccatacctccacccccagagagatcgggctcaggacctgtgagcagctgctgcagtgtt tcggggtacatggggggcagtgcctggcatgggcaaagcctggccttacaccacttccag cagctctggggctacctcctggagtggcaggccatatttaacaagttcgatgaggacacc tctggaaccatgaactcctacgagctgaggctggcactgaatgcagcaggcttccacctg aacaaccagctgacccagaccctcaccagccgctaccgggatagccgtctgcgtgtggac ttcgagcggttcgtgtcctgtgtggcccacctcacctgcatcttctgccactgcagccag 
cacctggatgggggtgagggggtcatctgcctgacccacagacagtggatggaggtggcc accttctcctag