GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:24:44 Sequence gi568815591f:92347730_92556559 : 208830 bp : 40.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 33379 33766 388 0 1 32 77 300 0.540 20.31 1.02 Intr + 34817 35157 341 0 2 5 77 168 0.343 1.67 1.03 Intr + 38780 38914 135 1 0 48 88 107 0.947 6.54 1.04 Intr + 40069 40155 87 0 0 84 91 25 0.758 1.65 1.05 Intr + 40246 40312 67 0 1 92 106 66 0.999 6.36 1.06 Intr + 42242 42387 146 0 2 72 71 83 0.996 4.08 1.07 Intr + 43437 43615 179 2 2 82 98 177 0.985 15.90 1.08 Intr + 44512 44563 52 1 1 61 76 3 0.940 -5.61 1.09 Intr + 48636 48747 112 2 1 63 97 165 0.948 13.93 1.10 Intr + 49994 50130 137 0 2 25 68 144 0.977 5.47 1.11 Intr + 50483 51205 723 1 0 83 27 677 0.098 51.80 1.12 Intr + 55876 56238 363 0 0 57 50 279 0.027 15.46 1.13 Intr + 65265 65390 126 2 0 75 19 100 0.541 1.76 1.14 Intr + 66218 66358 141 1 0 69 47 91 0.609 2.73 1.15 Intr + 74313 74370 58 2 1 112 78 30 0.699 1.94 1.16 Term + 74552 74760 209 0 2 85 55 172 0.664 10.12 1.17 PlyA + 74843 74848 6 -5.80 2.07 PlyA - 74906 74901 6 1.05 2.06 Term - 75341 75000 342 2 0 89 46 192 0.720 8.63 2.05 Intr - 75628 75401 228 1 0 64 48 131 0.293 3.84 2.04 Intr - 77935 77906 30 1 0 103 82 25 0.431 0.81 2.03 Intr - 84753 84602 152 1 2 0 94 75 0.371 -1.74 2.02 Intr - 85700 85426 275 1 2 84 78 372 0.961 32.06 2.01 Init - 91833 91730 104 0 2 38 22 78 0.103 -4.04 2.00 Prom - 93671 93632 40 -8.65 3.00 Prom + 93685 93724 40 -10.55 3.01 Sngl + 94288 94485 198 0 0 68 49 275 0.932 16.77 3.02 PlyA + 94916 94921 6 1.05 4.00 Prom + 95242 95281 40 -10.45 4.01 Init + 100001 100249 249 1 0 118 99 147 0.598 16.03 4.02 Intr + 102972 103031 60 2 0 74 121 9 0.496 0.81 4.03 Intr + 106773 106956 184 2 1 85 97 187 0.999 17.74 4.04 Term + 108643 108833 191 2 2 113 32 107 0.987 4.23 4.05 PlyA + 109074 109079 6 1.05 5.02 PlyA - 109095 109090 6 1.05 5.01 Sngl - 122652 121036 1617 0 0 75 43 469 0.336 34.16 5.00 Prom - 122834 122795 40 -10.84 6.30 PlyA - 123393 123388 6 1.05 6.29 Term - 124651 123861 791 2 2 -1 46 460 0.736 25.38 6.28 Intr - 125187 124676 512 2 2 48 -25 274 0.913 3.78 6.27 Intr - 125872 125539 334 2 1 45 88 231 0.151 12.51 6.26 Intr - 128051 127811 241 2 1 17 63 152 0.092 1.80 6.25 Intr - 128344 128141 204 1 0 42 77 118 0.801 4.57 6.24 Intr - 129600 129528 73 2 1 93 64 118 0.992 8.39 6.23 Intr - 130023 129938 86 0 2 111 -42 78 0.315 -5.20 6.22 Intr - 133948 133028 921 1 0 53 65 311 0.012 15.80 6.21 Intr - 136149 135961 189 0 0 117 33 74 0.299 3.66 6.20 Intr - 138748 138698 51 1 0 78 87 38 0.503 0.89 6.19 Intr - 139812 139743 70 2 1 107 27 40 0.741 -1.93 6.18 Intr - 141694 141564 131 1 2 82 87 89 0.894 6.77 6.17 Intr - 143773 143543 231 1 0 102 -2 199 0.829 9.35 6.16 Intr - 146667 146564 104 1 2 104 61 73 0.869 5.17 6.15 Intr - 146900 146758 143 1 2 84 52 143 0.999 9.38 6.14 Intr - 149048 148984 65 2 2 80 63 73 0.999 0.60 6.13 Intr - 152109 151975 135 0 0 95 95 14 0.824 2.64 6.12 Intr - 153944 153778 167 0 2 37 95 130 0.975 7.36 6.11 Intr - 154350 154161 190 0 1 87 53 62 0.778 0.84 6.10 Intr - 155466 155312 155 1 2 56 90 54 0.600 1.27 6.09 Intr - 157173 157003 171 1 0 64 54 228 0.990 15.99 6.08 Intr - 158615 158519 97 2 1 23 64 108 0.855 0.56 6.07 Intr - 159397 159265 133 0 1 59 115 71 0.470 6.63 6.06 Intr - 161682 161649 34 1 1 84 51 29 0.388 -4.84 6.05 Intr - 163974 163851 124 0 1 90 101 55 0.820 6.24 6.04 Intr - 170313 169547 767 1 2 52 40 330 0.443 14.86 6.03 Intr - 170526 170412 115 0 1 72 74 40 0.384 0.10 6.02 Intr - 174516 174373 144 0 0 43 115 68 0.362 4.56 6.01 Init - 180706 180578 129 1 0 74 91 206 0.774 17.70 6.00 Prom - 181010 180971 40 -17.12 7.00 Prom + 181020 181059 40 -10.35 7.01 Init + 181085 181195 111 1 0 110 98 124 0.926 15.87 7.02 Intr + 181747 181937 191 0 2 82 90 120 0.996 9.06 7.03 Intr + 184675 184820 146 1 2 87 86 65 0.866 5.21 7.04 Intr + 186673 187241 569 2 2 26 94 410 0.927 26.97 7.05 Intr + 190353 190533 181 2 1 2 56 145 0.450 1.22 7.06 Term + 192245 192468 224 0 2 54 43 120 0.258 0.20 7.07 PlyA + 192722 192727 6 1.05 8.00 Prom + 193199 193238 40 -5.95 8.01 Init + 194253 194598 346 2 1 68 64 179 0.601 10.92 8.02 Intr + 196695 196768 74 1 2 85 82 59 0.551 3.21 8.03 Intr + 199382 199487 106 1 1 100 69 56 0.142 3.77 8.04 Term + 199704 200170 467 1 2 2 47 216 0.189 3.09 8.05 PlyA + 200208 200213 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 50483 51220 738 1 0 83 43 669 0.891 54.37 S.002 Init - 125900 125539 362 2 2 31 88 241 0.807 15.00 S.003 Intr - 128051 127908 144 2 0 17 111 122 0.858 6.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_1|1087_aa MHKLQVEEKVSVIEDQINEMTQVGKLGEKRVKRNERSLQEIRDCVKTPNIRTIGVPESDG ENGTKLENTLQDIIQETFPNLASQANIQVQETQRTPQRYSSRRATPRHIIVRFTNVEMEE KMLRAAREKDHSAIKLELRIKKPTQNRTTTWKLNNLLLNDYWVHNEMKAEIKMFFETNEN KDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQE ITKCKYDFCWICLEEWKKHSSSTGGYYRCTRYEVIQHVEEQSKEMTVEAEKKHKRFQELD RFMHYYTRFKNHEHSYQLEQRLLKTAKEKMEQLSRALKETEGGCPDTTFIEDAVHVLLKT RRILKCSYPYGFFLEPKSTKKEIFELMQTDLEMVTEDLAQKVNRPYLRTPRHKIIKAACL VQQKRQEFLASVARGVAPADSPEAPRRSFAGGTWDWEYLGFASPEEYAEFQYRRRHRQRR RGDVHSLLSNPPDPDEPSESTLDIPEGGSSSRRPGTSVVSSASMSVLHSSSLRDYTPASR SENQDSLQALSSLDEDDPNILLAIQLSLQESGLALDEETRDFLSNEASLGAIGTSLPSRL DSVPRNTDSPRAALSSSELLELGDSLMRLGAENDPFSTDTLSSHPLSEARSDFCPSSSDP DSAGQDPNINDNLLGNIMAWFHDMNPQSIALIPPATTEISADSQLPCIKDGSEGVKDVEL VLPEDSMFEDASVSEGRGTQIEENPLEENILAGEAASQAGDSGNEAANRGDGSDVSSQTP QTSSDWLEQETMFRGLATRGEIETNSANGALLLLQAGRGRDVFKVRSQQLLLQCKINGVI NKNLAFRLRALDRLKTEGYLIELQAVATLFKCRSEGNAMPKSAEQATADSASNEGKVLGA GVTEGAEDFGTQNKNEKVDDVQVDVQDSKNIPFRVQGIMAVPSQHELHIDHQQFVWHCVD LFESTIVFTLPANGKQKHQHTFAWKRCLQASVQQASTQKLELPIGQSPWKALRQEGPGET KTRDCDYCSHTFREKHTQIKGNVYQRLTLIRLLAADLEELSKNKIHNKSEACFPRHGGEA RNPEEVI >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_1|3264_bp atgcataagcttcaagtggaagaaaaggtatcagtgattgaagatcaaattaatgaaatg acgcaagtagggaagttgggagaaaaaagagtaaaaagaaatgaacgaagcctccaagaa ataagagactgtgtgaaaacaccaaatatacgtacaattggtgtacctgaaagtgacggg gagaatggaaccaagctggaaaacactcttcaggatattatccaggagaccttccccaac ctagcaagtcaagctaacattcaagttcaggaaacacagagaacaccacaaagatactcc tcaagaagagcaaccccaaggcacataattgtcagattcaccaatgttgaaatggaggaa aaaatgctaagggcagccagagagaaagaccacagtgcaatcaaattagaactcaggatt aagaaacccactcaaaaccgcacaactacatggaaactgaacaacctgctcctgaatgac tactgggtacataatgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaac aaagacacaacataccagaatctctgggacacatttaaagcagtgtgcagagggaaattt atagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatca caattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaa ataactaagtgcaagtatgacttttgctggatttgccttgaagagtggaaaaaacatagt tcgtccactggaggttattacagatgtactcgctatgaagtcattcaacacgtggaggag caatccaaggaaatgactgtggaggctgagaaaaaacacaaacgatttcaggaacttgac agatttatgcactattatacaagatttaaaaaccatgagcatagttatcagctagaacaa cgccttcttaaaacagccaaagaaaagatggagcaattgagcagagctctcaaagaaact gaaggaggctgtccagataccactttcattgaagatgcagttcatgtgctcttaaaaact cggcgcattctcaagtgttcttatccatatggatttttcttggaacctaaaagcacaaag aaagaaatttttgaactaatgcaaacagacctagaaatggtcactgaagaccttgcccag aaagtcaataggccttaccttcgcacaccccgccacaagatcatcaaagcagcatgcctt gtacagcagaagaggcaagaattcctggcatctgtggctcggggagtagctcctgcagac tcaccagaagctccaaggcgcagctttgctggtggaacatgggattgggaatatttagga tttgcatcaccagaggaatatgctgaatttcagtatcggaggaggcacagacaacgtcgt cgaggagatgttcacagtctactcagtaatcctccagaccctgatgagccaagtgaaagc actttagatattccagaaggcggcagcagcagccgcaggcctggcacatccgtggtaagt tctgcatctatgagtgtgctgcacagctcttccctgcgtgactacacccctgccagtcgc tctgaaaaccaggactctcttcaggctctgagttccttggatgaagacgatcccaatata cttcttgcaatacagttatcactgcaagagtctgggctggccctcgatgaagaaactaga gacttcctcagtaatgaagcatccttaggtgcgataggcacttctttaccttccaggctg gactctgtccccagaaatacagatagccctcgggctgcattgagcagctctgagcttttg gaacttggtgacagcctcatgagactaggagcagagaatgacccattttcaactgacacc ctgagctcacaccctctcagtgaggcaagaagtgatttctgtccctcatctagtgatcct gactcagctggccaggaccccaacatcaatgacaatcttctcggcaacatcatggcttgg tttcatgacatgaaccctcagagtattgccctgattcctccagcaactacagaaatcagt gcagattcccagctcccctgtatcaaagatgggtcagaaggtgtgaaggatgtggaactg gtgctgccagaagattcaatgtttgaagatgccagtgtcagtgaaggtagaggaacccag atagaagaaaatcctttggaagaaaatattctggcgggggaagcagcatctcaagctggt gacagtggtaacgaggcagccaacagaggagatggttcagatgtttcaagtcaaacacct caaacctcaagtgactggcttgaacaagagacaatgtttagaggcttagcaacaagaggt gagattgaaacaaacagtgcaaacggtgctttgttactcctgcaggcaggcagaggaagg gatgttttcaaggtaaggagtcagcaactcctgttacaatgtaaaataaatggtgttatc aataagaaccttgcttttcgattgagagccttggatagactgaaaacagaaggttatttg attgagctgcaagcagttgcaacactgttcaaatgccgaagtgaaggcaatgccatgcct aaaagtgcagagcaggccacagctgattcagccagtaatgagggcaaggtgctgggtgca ggagtgacagaaggtgccgaagacttcgggacgcagaataaaaatgaaaaggtcgatgat gtccaagtagatgttcaggatagcaaaaacatacccttcagggtccagggaataatggcg gtgccctcccagcatgagctgcacatcgatcaccaacaatttgtttggcattgcgtagac ctgtttgagtctacaatcgtttttactctgcctgcaaatgggaaacagaaacatcaacat acttttgcatggaaacgttgtttgcaagcatcagtacagcaagcatcaacacaaaagctg gaactccccattggccaaagcccatggaaagccctaaggcaagagggccctggagaaact aaaaccagagactgtgattattgttcacatacattccgagaaaagcacacacaaatcaaa ggaaatgtgtatcaaagactcacactgatacggctcctagctgcagaccttgaagagctg tctaaaaacaaaattcacaataaatcagaagcctgcttcccgaggcatgggggcgaggcc cggaatccagaagaggtgatttga >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_2|376_aa MALNFTIREKEECRIPQGIVSACMQSNLHDLNVTRKLDLEVSEPIDFGSGGHQHLIQEAA DKSIQSQSGERYTPSSSAGPQETRKKPRPKDTEQLNTCVIEISDDVTPGDNNATLGNPFS DASVRRVRKPGILDITKHLAVSEINIRPTQGVVQGQCYCVSSLQQPYKYRSCIAILQKPQ DRASSQQPLLLAILGQREFKAFSRRPRTRMGLHFYPWRLEESFLALPVDQWVSPVSGFAS GRAALELGMPSALRPLLCCPHALPDCLLALSPCPYPTFPRTPGSIWVMTVLLSPGLHLQT QLHTLLCQYLPSSPISGTGKTLFLIPCLRLEGCPSGALESICTQNVMKLRYRISCPDETL IQCGWPPLMNDNMVYK >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_2|1131_bp atggccttaaatttcacaataagggaaaaagaggaatgccgaattccccaggggattgtc agtgcctgcatgcagtctaatttgcatgatttaaatgttacaaggaaattggacttggag gtcagcgaaccaatagactttggttctggaggtcatcaacatctgattcaggaagcagca gataaaagcatccagtcccaatctggtgagcgctacacaccttcatcttcagctgggcca caagaaactcggaagaagccacggccaaaggacaccgagcagttgaatacctgtgtgata gaaatttcagatgacgtcaccccaggtgataacaatgcaactttaggcaatcctttctca gacgcatctgtccgcagagtgaggaagccagggatattggatatcactaagcacctggct gtctctgagatcaatatcaggcctacacagggtgttgtgcaagggcaatgctattgcgtt tcgtccttacaacaaccctacaaatacagaagttgtattgccattttacagaaaccccag gatcgtgccagctctcagcagcctctgctcctggccatccttggccagcgggagttcaaa gccttctctcgcaggcctcgtactaggatgggccttcatttttacccctggagactggag gagtctttcctggctctgcccgtggaccagtgggtcagcccagtctctggatttgcctct ggcagagcagccttggagctgggcatgccttcagccctgcggccactgctctgttgtcct catgccttgcctgactgcctgctggctctcagcccctgcccctatcccacctttccccgc acccctggctctatttgggtgatgacagtcctcctaagcccaggactgcacctgcagacg cagctccacactctgctctgccagtatctgccctccagccccatctccggcactgggaag accttgtttctcatcccatgtctgaggttggaaggctgtccctcaggtgccttggaaagc atctgtacccagaatgttatgaagcttagataccgcatatcctgtcctgatgaaacattg atacagtgtggatggccccctttgatgaatgacaatatggtctacaaatga >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_3|65_aa MGQLLWLLLQNLSTGLYAAAMSYCQQEGKDGIIFVTKDDHKTPSNVELVADDPNNPYTAW IDAAK >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_3|198_bp atggggcagctgctgtggctgcttctgcaaaatctaagcactggtctatatgctgcagcc atgtcctactgccagcaggaagggaaggatggaatcatatttgtgaccaaagatgaccac aaaactccaagcaatgtagagctggtggctgatgaccccaacaatccatacacagcatgg attgatgctgccaaatga >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_4|227_aa MPLGLKPTCSVCKTTSSSMWKKGAQGEILCHHCTGRGGAGSGGAGSGAAGGTGGSGGGGF GAATFASTSATPPQSNGGGGGKQPIKAPESVSTIITAESIFYKGVYYQIGDVVSVIDEQD GKPYYAQIRGFIQDQYCEKSAALTWLIPTLSSPRDQFDPASYIIGPEEDLPRKMEYLEFV CHAPSEYFKSRSSPFPTVPTRPEKGYIWTHVGPTPAITIKESVANHL >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_4|684_bp atgccgctgggcctgaagcccacctgcagcgtatgcaagaccacgtcgtcctccatgtgg aagaagggagcgcagggggagatcctctgccatcattgcactggccggggcggcgcgggc agcgggggcgcaggctcgggggcggctggagggactgggggcagcggcggcggcggcttc ggcgcggcgaccttcgccagcacctccgccacccctccgcagagcaacgggggcgggggc ggcaagcagcccatcaaagctcctgagtcagtttccactataatcactgcagaatcaatc ttctacaagggagtatattaccaaattggtgatgttgtttctgtgattgatgaacaagat ggaaagccctactatgctcaaatcagaggttttatccaggaccagtattgcgagaagagt gcagcactgacgtggctcattcctaccctctctagccccagagaccaatttgatcccgcc tcctatatcatagggccagaggaagatcttccaaggaagatggaatacttggaatttgtt tgtcatgcaccttctgagtatttcaagtcacggtcatcaccatttcccacagttcccacc agaccagagaagggctacatatggactcatgttgggcctactcctgcaataacaattaag gaatcagttgccaaccatttgtag >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_5|538_aa MALPYHIFLFTVLLPSFTLTAPPPCRCMTSSSPYQEFLWRMQRPGNIDAPSYRSLSKGTP TFTAHTHMPRNCYHSATLCMHANTHYWTGKMINPSCPGGLGVTVCWTYFTQTGMSDGGGV QDQAREKHVKEVISQLTRVHGTSSPYKGLDLSKLHETLRTHTRLVSLFNTTLTGLHEVSA QNPTNCWICLPLNFRPYVSIPVPEQWNNFSTEINTTSVLVGPLVSNLEITHTSNLTCVKF SNTTYTTNSQCIRWVTPPTQIVCLPSGIFFVCGTSAYRCLNGSSESMCFLSFLVPPMTIY TEQDLYSYVISKPRNKRVPILPFVIGAGVLGALGTGIGGITTSTQFYYKLSQELNGDMER VADSLVTLQDQLNSLAAVVLQNRRALDLLTAERGGTCLFLGEECCYYVNQSGIVTEKVKE IRDRIQRRAEELRNTGPWGLLSQWMPWILPFLGPLAAIILLLLFGPCIFNLLVNFVSSRI EAVKLQMEPKMQSKTKIYRRPLDRPASPRSDVNDIKGTPPEEISAAQPLLRPNSAGSS >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_5|1617_bp atggccctcccttatcatatttttctctttactgttcttttaccctctttcactctcact gcaccccctccatgccgctgtatgaccagtagctccccttaccaagagtttctatggaga atgcagcgtcccggaaatattgatgccccatcgtataggagtctttctaagggaaccccc accttcactgcccacacccatatgccccgcaactgctatcactctgccactctttgcatg catgcaaatactcattattggacaggaaaaatgattaatcctagttgtcctggaggactt ggagtcactgtctgttggacttacttcacccaaactggtatgtctgatgggggtggagtt caagatcaggcaagagaaaaacatgtaaaagaagtaatctcccaactcacccgggtacat ggcacctctagcccctacaaaggactagatctctcaaaactacatgaaaccctccgtacc catactcgcctggtaagcctatttaataccaccctcactgggctccatgaggtctcggcc caaaaccctactaactgttggatatgcctccccctgaacttcaggccatatgtttcaatc cctgtacctgaacaatggaacaacttcagcacagaaataaacaccacttccgttttagta ggacctcttgtttccaatctggaaataacccatacctcaaacctcacctgtgtaaaattt agcaatactacatacacaaccaactcccaatgcatcaggtgggtaactcctcccacacaa atagtctgcctaccctcaggaatattttttgtctgtggtacctcagcctatcgttgtttg aatggctcttcagaatctatgtgcttcctctcattcttagtgccccctatgaccatctac actgaacaagatttatacagttatgtcatatctaagccccgcaacaaaagagtacccatt cttccttttgttataggagcaggagtgctaggtgcactaggtactggcattggcggtatc acaacctctactcagttctactacaaactatctcaagaactaaatggggacatggaacgg gtcgccgactccctggtcaccttgcaagatcaacttaactccctagcagcagtagtcctt caaaatcgaagagctttagacttgctaaccgctgaaagagggggaacctgtttattttta ggggaagaatgctgttattatgttaatcaatccggaatcgtcactgagaaagttaaagaa attcgagatcgaatacaacgtagagcagaggagcttcgaaacactggaccctggggcctc ctcagccaatggatgccctggattctccccttcttaggacctctagcagctataatattg ctactcctctttggaccctgtatctttaacctccttgttaactttgtctcttccagaatc gaagctgtaaaactacaaatggagcccaagatgcagtccaagactaagatctaccgcaga cccctggaccggcctgctagcccacgatctgatgttaatgacatcaaaggcacccctcct gaggaaatctcagctgcacaacctctactacgccccaattcagcaggaagcagttag >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_6|2168_aa MWGSDRLAGAGGGGAAVTVAFTNARDCFLHLPRRLVAQLHLLQNQAIEVVWSHQPAFLSW VEGRHFSDQGENVAEINRQVGQKLGLSNGGQELHAVSLEQHLLDQIRIVFPKAIFPVWVD QQTYIFIQIVALIPAASYGRLETDTKLLIQPKTRRAKENTFSKADAEYKKLHSYGRDQKG MMKELQTKQLQSNTVGITESNENESEIPVDSSSVASLWTMIGSIFSFQSEKKQETSWGLT EINAFKNMQSKVVPLDNIFRVCKSQPPSIYNASATSVFHKHCAIHVFPWDQEYFDVEPSF TVTYGKLVKLLSPKQQQSKTKQNVLSPEKEKQMSEPLDQKKIRSDHNEEDEKACVLQVVW NGLEELNNAIKYTKNVEVLHLGKVWPKDISEEDIKTVFYSWLQQSTTTMLPLVISEEEFI KLETKDGPSRSYGKRRKQGVNSLGVSSLEHITHSLLGRPLSRQLMSLVAGLRNGALLLTG GKGSGKSTLAKAICKEAFDKLDAHVERVDCKALRGKRLENIQKTLEVAFSEAVWMQPSVV LLDDLDLIAGLPAVPEHEHSPDAVQSQRLAHALNDMIKEFISMGSLVALIATSQSQQSLH PLLVSAQGVHIFQCVQHIQPPNQEQRCEILCNVIKNKLDCDINKFTDLDLQHVAKETGGF VARDFTVLVDRAIHSRLSRQSISTREKLVLTTLDFQKALRGFLPASLRSVNLHKPRDLGW DKIGGLHEVRQILMDTIQLPAKYPELFANLPIRQRTGILLYGPPGTGKTLLAGVIARESR MNFISVKGPELLSKYIGASEQAVRDIFIRAQAAKPCILFFDEFESIAPRRGHDNTGVTDR VVNQLLTQLDGVEGLQGVYVLAATSRPDLIDPALLRPGRLDKCVYCPPPDQDGSSSSDSD LSLSSMVFLNHSSGSDDSAGDGECGLDQSLVSLEMSEILPDESKFNMYRLYFGSSYESEL GNGTSSDLEDESMNQPGPIKTRLAISQSHLMTALGHTRPSISEDDWKNFAELYESFQNPK RRKNQSGTMFRPGQKFFDEITELTYLPSFHHKAAPHQAEPGPNSSSASAPPPYNPFITSS PHTQSGLQFRSVTSPPPSAQQFPLKEVAGAKGIVKTALETAPTLALPVSSQPFSLHTAEV QGCAVGILTQGPGPCPVAFLSKQLDLTVLGSPSCLHAVASAALILLEALKITNYAQLTLY SSHNFQNLFSFSHLTHILSAPRLLQLYSLFVESPTITILPGPDFNLASHIILDTTPDPDD CMSLIYLTFTPFPHISFFSVPHVDHIWFTDGSSTRPDRHSPAKAGYAIESSTSIIEATAL PPSTTSQQAELIALTRAFTLAKGLHVNIYTDSKYAFHILHHHAVIWAERGFLTTQGSSII NASLIKTLLKAALLPKEAGVTHCKGHQKASDPITLGNAYADKGVRCAPDPARRPLPLPIG LKACHCSCTAKIGGKYRALVGQLKTISVATGLKTQDRTIDGSSQVIEEKNHNGYSVIDTG TLVEAELEKLPNNWSPQTCELFALSQALKYLQNQKTISILIQKEPSPALGLTPERKGNVG HAGKGPLESSSPDPFLCGQERREKGCRTATSVSITNPINRGPWVVTHPGKELTPEHKGNV GHAGRDILAKAGAIIHLNIGEGTPVCCPLLEEGINPEVWATEGQYGRAKNARPVQVKLKD STSFPYQRQYPLRPKAQQGLQKIVKDLKAQGLVKPCSNPCSTPILGVQKPNRQWRNLVPS SHPSALQFPRYLWLHGFQTKGSTLLTAGYLGLKLSKGTRALSEEHIQPILAYPHPKTLKQ LRGFLGVIGFCRKWIPRYGEIARSLNTLIKETQKANTHLVRWTTEVEVAFQALTQAPVLS LPTGQDFSSYVTEKTGIALGVLTQIRGMSLQPVAYLTKEIDVVAKVVAVAVLVSEAVKII QGRDLTVWTSHDVNGILTAKGDLWLSDNCLLKCQALLLEGPVLRLCTCATLNPATFLPDN EEKIKHNCQQVISQTYATRGDLLEVPLTDPDLNLYTDGSSFVEKGLRKVGYAVVSDNGIL ESNPLTPGTSAQLAELIALTWALELGEEKRANIYTDSKYAYLVLHAHAAIWKEREFLTSE RTPIKHQEAIRKLLLAVQKPKEVAVLHCRGHQKGKEREIEENCQADIEAKRAARQDPPLE MLIKQPLV >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_6|6507_bp atgtggggcagcgatcgcctggcgggtgctgggggaggcggggcggcagtgactgtggcc ttcaccaacgctcgcgactgcttcctccacctgccgcggcgtctcgtggcccagctgcat ctgctgcagaatcaagctatagaagtggtctggagtcaccagcctgcattcttgagctgg gtggaaggcaggcattttagtgatcaaggtgaaaatgtggctgaaattaacagacaagtt ggtcaaaaacttggactctcaaatgggggacaggagctgcatgctgtttcccttgaacaa catcttctagatcaaattcgaatagtttttccaaaagccatttttcctgtttgggttgat caacaaacgtacatatttatccaaattgttgcactaataccagctgcctcttatggaagg ctggaaactgacaccaaactccttattcagccaaagacacgccgagccaaagagaataca ttttcaaaagctgatgctgaatataaaaaacttcatagttatggaagagaccagaaagga atgatgaaagaacttcaaaccaagcaacttcagtcaaatactgtgggaatcactgaatct aatgaaaacgagtcagagattccagttgactcatcatcagtagcaagtttatggactatg ataggaagcattttttcctttcaatctgagaagaaacaagagacatcttggggtttaact gaaatcaatgcattcaaaaatatgcagtcaaaggttgttcctctagacaatattttcaga gtatgcaaatctcaacctcctagtatatataacgcgtcagcaacctctgtttttcataaa cactgtgccattcatgtatttccatgggaccaggaatattttgatgtagagcccagcttt actgtgacatatggaaagctagttaagctactttctccaaagcaacagcaaagtaaaaca aaacaaaatgtgttatcacctgaaaaagagaagcagatgtcagagccactagatcaaaaa aaaattaggtcagatcataatgaagaagatgagaaggcctgtgtgctacaagtagtctgg aatggacttgaagaattgaacaatgccatcaaatataccaaaaatgtagaagttctccat cttgggaaagtctggcctaaagacataagtgaagaagacataaaaactgtattttattca tggctacagcagtctactaccaccatgcttcctttggtaatatcagaggaagaatttatt aagctggaaactaaagatggtccttctagatcctatggtaaaagaagaaaacaaggagtg aattccttaggcgtatcctccttggagcacatcactcacagcctcctgggacgccctttg tctcggcagctgatgtctcttgttgcaggacttaggaatggagctcttttactcacagga ggaaagggaagtggaaaatcaactttagccaaagcaatctgtaaagaagcatttgacaaa ctggatgcccatgtggagagagttgactgtaaagctttacgaggaaaaaggcttgaaaac atacaaaaaaccctagaggtggctttctcagaggcagtgtggatgcagccatctgttgtc ctgctggatgaccttgacctcattgctggactgcctgctgtcccggaacatgagcacagt cctgatgcggtgcagagccagcggcttgctcatgctttgaatgatatgataaaagagttt atctccatgggaagtttggttgcactgattgccacaagtcagtctcagcaatctctacat cctttacttgtttctgctcaaggagttcacatatttcagtgcgtccaacacattcagcct cctaatcaggaacaaagatgtgaaattctgtgtaatgtaataaaaaataaattggactgt gatataaacaagttcaccgatcttgacctgcagcatgtagctaaagaaactggcgggttt gtggctagagattttacagtacttgtggatcgagccatacattctcgactctctcgtcag agtatatccaccagagaaaaattagttttaacaacattggacttccaaaaggctctccgc ggatttcttcctgcgtctttgcgaagtgtcaacctgcataaacctagagacctgggttgg gacaagattggtgggttacatgaagttaggcagatactcatggatactatccagttacct gccaagtatccagaattatttgcaaacttgcccatacgacaaagaacaggaatactgttg tatggtccgcctggaacaggaaaaaccttactagctggggtaattgcacgagagagtaga atgaattttataagtgtcaaggggccagagttactcagcaaatacattggagcaagtgaa caagctgttcgggatatttttattagagcacaggctgcaaagccctgcattcttttcttt gatgaatttgaatccattgctcctcggcggggtcatgataatacaggagttacagaccga gtagttaaccagttgctgactcagttggatggagtagaaggcttacagggtgtttatgta ttggctgctactagtcgccctgacttgattgaccctgccctgcttaggcctggtcgacta gataaatgtgtatactgtcctcctcctgatcaggatggaagttccagctctgatagtgac ctaagtctgtcttcaatggtctttcttaaccatagcagtggctctgacgattcagctgga gatggagaatgtggcttagatcagtcccttgtttctttagagatgtccgagatccttcca gatgaatcaaaattcaatatgtaccggctctactttggaagctcttatgaatcagaactt ggaaatggaacctcttctgatttggaggacgaatccatgaaccaaccaggaccaatcaaa accagactggctattagtcagtcacatttaatgactgcacttggtcacacaagaccatcc attagtgaagatgactggaagaattttgctgagctatatgaaagctttcaaaatccaaag aggagaaaaaatcaaagtggaacaatgtttcgacctggacagaaattctttgatgaaatc actgagctgacatacctgccctcttttcaccataaagctgctcctcaccaggccgagcca ggtcccaattcttcctcagcctctgctcccccaccctataatccttttatcacctcctct cctcacactcagtccggcttacagtttcgttctgtgactagccctcccccatctgcccaa caatttcctcttaaagaggtggctggagctaaaggcatagtcaagacagctttagagact gctcccacactagctctccctgtctcatcccaacccttttcattacacacagccgaagtg cagggctgtgcagtcggaattcttacacaaggaccgggaccatgccctgtagcctttttg tccaaacaacttgaccttactgttttaggctcgccatcatgtctccatgcggtagcttcc gctgccctaatacttttagaggccctcaaaatcacaaactatgctcaactcactctctac agctctcacaacttccaaaatctattttctttctcacacctgacgcatatactttctgct ccccggctccttcagctgtattcactctttgttgagtctcccacaattaccattcttcct ggcccagacttcaatctggcctcccacattattctggataccacacctgaccctgatgat tgtatgtctctgatctacctgacattcaccccatttccccatatttccttcttttctgtt cctcatgttgatcacatttggtttactgacggcagttccaccaggcctgatcgccactca ccagcaaaggcaggctatgctatagaatcttccacatccatcattgaggctactgctctg cccccctccactacctctcagcaagccgaactgattgccttaactcgggccttcactctt gcaaagggactacacgtcaatatttatactgactctaaatatgccttccatatcttgcac caccatgctgttatatgggctgaaagaggtttcctcactacgcaagggtcctccatcatt aatgcctctttaataaaaactcttctcaaggctgctttacttccaaaggaagctggagtc acacactgcaagggccaccaaaaggcgtcagatcccattactctaggaaatgcttatgct gataagggtgtccgctgtgctcctgatccagcgaggcgcccattgccgctcccaattggg ctaaaggcttgccattgttcctgcacggctaaaattggaggaaaataccgggcacttgtc ggccagttaaaaacgattagtgtggccaccggacttaagactcaggacaggacgatagat ggttcctcccaggtgattgaggaaaaaaaccacaatgggtattcagtaattgatacgggg actcttgtggaagcagagttagaaaaattgcctaataactggtctcctcaaacgtgtgag ctgtttgcactcagccaagccttaaagtacttacagaatcaaaagactatctcaatcctg attcaaaaggaacccagcccagctctaggactcacccctgagcgcaaaggcaatgttggg catgctggtaaaggaccactagaatccagcagcccagacccctttctttgtggtcaagaa aggcgggaaaaggggtgcaggactgctacatcggtaagcataactaatccgataaacaga ggtccatgggtggttacgcaccctggaaaggaactcacccctgagcacaaaggcaatgtt gggcacgctgggagagacattctagcaaaagcaggggccattatacacctgaacatagga gaaggaacacccgtttgttgtcccctgcttgaggaaggaattaatcctgaagtctgggca acagaaggacaatatggacgagcaaagaatgcccgtcctgttcaagttaaactaaaggat tccacctcctttccctaccaaaggcagtaccccctcagacccaaggcccaacaaggactc caaaagattgttaaggacctaaaagcccaaggcctagtaaaaccatgcagtaacccctgc agtactccaattttaggagtacagaaacccaacagacagtggagaaaccttgtgccatca agccacccaagcgctcttcaatttcctcgctacctgtggctacatggtttccaaaccaaa ggctcaactctgctcacagcaggttacttagggctaaaattatccaaaggcaccagggcc ctcagtgaggaacacatccagcctatactggcttatcctcatcccaaaaccctaaagcaa ctaaggggattccttggcgtaataggtttctgccgaaaatggattcccaggtatggcgaa atagccaggtcattaaatacactaattaaggaaactcagaaagccaatacccatttagta agatggacaactgaagtagaagtggctttccaggccctaacccaagccccagtgttaagt ttgccaacagggcaagacttttcttcatatgtcacagaaaaaacaggaatagctctagga gtccttacacagatccgagggatgagcttgcaacctgtggcatacctgactaaggaaatt gatgtagtggcaaaggtggtggcagtagcagtcttagtatctgaagcagttaaaataata cagggaagagatcttactgtgtggacatctcatgatgtgaatggcatactcactgctaaa ggagacttgtggctgtcagacaactgtttacttaaatgtcaggctctattacttgaaggg ccagtgctgcgactgtgcacttgtgcaactcttaacccagccacatttcttccagacaat gaagaaaagataaaacataactgtcaacaagtaatttctcaaacctatgccactcgaggg gaccttttagaggttcctttgactgatcccgacctcaacttgtatactgatggaagttcc tttgtagaaaaaggacttcgaaaagtggggtatgcagtggtcagtgataatggaatactt gaaagtaatcccctcactccaggaactagtgctcagctagcagaactaatagccctcact tgggcactagaattaggagaagaaaaaagggcaaatatatatacagactctaaatatgct tacctagtcctccatgcccatgcagcaatatggaaagaaagggaattcctaacttctgag agaacacctatcaaacatcaggaagccattaggaaattattattggctgtacagaaacct aaagaggtggcagtcttacactgccggggtcatcagaaaggaaaggaaagggaaatagaa gagaactgccaagcagatattgaagccaaaagagctgcaaggcaggaccctccattagaa atgcttataaaacaacccctagtatag >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_7|473_aa MASSGGELGSLFDHHVQRAVCDTRAKYREGRRPRAVKVYTINLESQYLLIQGVPAVGVMK ELVERFALYGAIEQYNALDEYPAEDFTEVYLIKFMNLQSARTAKRKMDEQSFFGGLLHVC YAPEFETVEETRKKLQMRKAYVVKTTENKDHYVTKKKLVTEHKDTEDFRQDFHSEMSGFC KAALNTSAGNSNPYLPYSCELPLCYFSSKCMCSSGGPVDRAPDSSKDGRNHHKTMGHYNH NDSLRKTQINSLKNSVACPGAQKAITSSEAVDRFMPRTTQLQERKRRREDDRKLGTFLQT NPTGNEIMIGPLLPDISKVDMHDDSLNTTANLIRHKLKELIRFNFSYDVLVAGRAADKTP QTPGYRKKRLYSARSVGRLVSQEPCSLKKKFLVLLRAYNSSQYTLDLVWEIHHTYRKQEL SIAKEVKSSMEVLNIVTEIIMNVFLGPKCQDGAELSMIALTSAVGCSPTEESV >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_7|1422_bp atggcgtcgagcggcggggagctagggagtttatttgatcaccacgtccagagggcggta tgcgacacacgggccaaatatcgagagggacgacggcctcgtgctgtgaaggtatataca atcaatttggaatctcagtacttattaatacaaggagttcctgctgtgggagtcatgaag gaattagttgagcgattcgctttatatggtgcaattgaacagtacaatgctctagatgaa tacccagcagaagactttactgaagtttatcttattaaatttatgaacttacaaagtgca aggacagccaagagaaaaatggatgaacagagtttcttcggtggattgcttcatgtgtgc tatgctccagaatttgaaacagttgaagaaactagaaaaaaactacaaatgcggaaggca tatgtagtaaaaactactgaaaataaagaccattacgtgacaaagaagaaattggttaca gagcataaagacacagaggattttagacaagacttccactcagagatgtctggattttgt aaagctgctttgaacacttctgcagggaactcaaatccttatcttccgtattcctgtgaa ttgcctttatgttatttctcctcaaaatgtatgtgttcatccgggggacctgtagacaga gcaccagactcctctaaggatggtagaaaccatcataaaacaatggggcattataaccac aatgactctttgcggaaaacacagataaactctttgaaaaactcagtggcctgccctggt gcacaaaaggctattacgtcttcagaggcagttgacagatttatgcctaggacaacacaa ctgcaggagcgcaaaagaagaagagaagatgatcgtaaacttggaacttttcttcaaaca aacccaactggtaatgagattatgattggacctctgttaccagacatctctaaagtggat atgcacgatgactcattgaatacaacggcgaatttaattcggcataaacttaaagagtta ataaggttcaatttcagctatgatgtacttgtagcaggacgagctgcagacaaaaccccg cagacaccaggttatagaaagaagaggctttattcggccaggagcgtcggcagacttgtg tctcaagaaccgtgctccctgaagaaaaagttcctggtccttttaagggcttacaactca agccaatataccctggatttggtttgggaaatacaccatacttacaggaaacaggaattg tctattgctaaagaggttaaaagctcaatggaagtcttgaatatagttactgaaataatc atgaatgtcttcttggggcccaaatgccaggatggggctgaactctcaatgattgccttg acttcagcagtgggctgcagcccaacagaggagagtgtctag >gi568815591f:92347730_92556559|GENSCAN_predicted_peptide_8|330_aa MGKLDIHMLKNETRPYFSPYTKIKSKWIKDINVKPQTMKLLQENIRETLQDIGLGEDFSQ AQATKAKVDKLDHVKFKNFCTAKETINKVKRKPKEWEKILIFADYPSDKGLITRILFIDH SWVFLGEGNLAGSSDNSGGKAWKHLLPLLASRCSGTHSNLRAKLRLSQGTVATWPVLKRE GEKSCGPSESPDLGAPQARDVTPCLGICSSWLLQASGHHHIPWCQLWKLFVVYLVQLQPH REVALVPAPGAAHPTTASVPVCAQWLDPMVIHSHTLCCSEPDSPLAGMGFRPVAQAKHNP ARPSGWNESSRPEQNLDKGATSHRGFWLAR >gi568815591f:92347730_92556559|GENSCAN_predicted_CDS_8|993_bp atgggaaaactggatatccatatgctgaagaatgaaactagaccctacttctcaccatat acaaaaatcaagtcaaaatggattaaagacataaatgtaaaacctcaaactatgaaacta ctacaagaaaacatcagagaaactctccaggacattggtctgggtgaagatttctcacaa gcacaggcaaccaaagcaaaagtggacaaattggatcacgtgaagtttaaaaacttctgc acagcaaaagaaacaatcaacaaagtgaagagaaaacccaaagaatgggagaaaatacta atatttgcagactatccatctgacaagggattaataaccagaatattatttattgatcat tcttgggtgtttctcggagaggggaatttggcagggtcatcggacaatagtggagggaag gcttggaagcacctgctgccacttctggcctctcgctgctccggcacccattccaatctt agagcaaagttgaggctgagccagggcactgtcgcgacctggccagttttgaagagagaa ggagagaagagctgcggcccttcagagagcccagacctaggagctccccaagccagggat gtgacaccctgtttgggaatctgtagttcctggcttctccaagcttctgggcaccaccac attccctggtgccaactgtggaagctgtttgtggtatacctggtccagctgcagcctcac agggaggtagcgcttgtgccagcacctggagctgcccaccccaccacagccagtgtgcct gtctgtgcgcagtggctggaccccatggtcattcactcacacaccctttgctgctctgag cctgactcacccttggcaggcatgggattcaggccagtagcacaagccaagcacaaccct gccaggccgagtggatggaatgagtccagtaggcctgagcaaaacttggacaaaggtgcc actagccacagaggtttctggctggcaaggtga