GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:42:33 Sequence gi568815579f:47253031_47482110 : 229080 bp : 52.53% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3493 3579 87 1 0 80 60 80 0.029 3.98 1.02 Intr + 4750 4879 130 1 1 -89 90 139 0.020 -2.40 1.03 Intr + 5529 5633 105 2 0 57 49 231 0.961 17.01 1.04 Intr + 7291 7392 102 0 0 91 97 157 0.999 17.77 1.05 Intr + 7558 7809 252 0 0 113 97 61 0.993 7.66 1.06 Intr + 11573 11656 84 1 0 99 60 195 0.999 18.31 1.07 Intr + 11743 11916 174 0 0 115 94 336 0.993 37.55 1.08 Intr + 13581 13762 182 2 2 47 76 329 0.995 26.78 1.09 Intr + 17377 17423 47 1 2 117 101 22 0.591 4.94 1.10 Intr + 17523 17658 136 1 1 88 75 81 0.998 6.93 1.11 Intr + 18052 18157 106 1 1 115 94 58 0.996 9.82 1.12 Intr + 18244 18619 376 0 1 74 76 399 0.997 32.45 1.13 Intr + 19039 19131 93 2 0 67 75 50 0.800 2.03 1.14 Term + 20327 20430 104 0 2 129 47 115 0.921 10.24 1.15 PlyA + 20757 20762 6 -6.37 2.00 Prom + 20776 20815 40 -6.30 2.01 Sngl + 21890 22318 429 1 0 74 35 246 0.396 13.26 2.02 PlyA + 22643 22648 6 1.05 3.03 PlyA - 23858 23853 6 1.05 3.02 Term - 27466 27334 133 0 1 112 41 82 0.658 3.77 3.01 Init - 28466 28453 14 2 2 86 55 -2 0.480 -4.07 3.00 Prom - 29311 29272 40 0.99 4.00 Prom + 30912 30951 40 -0.61 4.01 Init + 36734 36787 54 1 0 51 63 51 0.306 0.13 4.02 Term + 38948 38962 15 1 0 140 35 13 0.382 -0.28 4.03 PlyA + 42398 42403 6 1.05 5.02 PlyA - 44080 44075 6 1.05 5.01 Sngl - 47358 47029 330 0 0 48 47 140 0.830 2.24 5.00 Prom - 54826 54787 40 0.69 6.00 Prom + 56068 56107 40 -4.11 6.01 Init + 56866 56868 3 0 0 80 101 0 0.408 0.43 6.02 Intr + 57040 57168 129 0 0 105 66 17 0.426 2.60 6.03 Term + 66751 67800 1050 0 0 118 48 1560 0.990 147.65 6.04 PlyA + 69017 69022 6 1.05 7.00 Prom + 85249 85288 40 -3.61 7.01 Sngl + 87770 88783 1014 1 0 56 44 1062 0.999 96.10 7.02 PlyA + 89330 89335 6 1.05 8.00 Prom + 91147 91186 40 -6.01 8.01 Init + 100001 100705 705 1 0 69 109 842 0.928 79.44 8.02 Intr + 102009 102320 312 2 0 133 113 427 0.796 46.63 8.03 Intr + 104836 105090 255 0 0 77 64 407 0.946 35.37 8.04 Intr + 106938 107040 103 2 1 129 107 150 0.999 21.25 8.05 Intr + 109446 109663 218 1 2 86 94 293 0.999 28.45 8.06 Intr + 113951 114125 175 1 1 76 97 238 0.652 23.53 8.07 Intr + 119700 119893 194 1 2 89 94 336 0.846 34.03 8.08 Intr + 120569 120670 102 1 0 111 100 154 0.999 19.77 8.09 Intr + 122436 122717 282 2 0 110 86 392 0.723 39.36 8.10 Intr + 122894 123067 174 1 0 140 75 342 0.999 38.75 8.11 Intr + 123413 123530 118 1 1 118 77 255 0.996 27.94 8.12 Intr + 124070 124176 107 0 2 87 80 223 0.994 21.83 8.13 Intr + 126680 126955 276 1 0 119 75 447 0.828 44.75 8.14 Intr + 127786 127962 177 0 0 116 115 133 0.964 19.43 8.15 Intr + 128156 128294 139 1 1 39 115 194 0.989 17.74 8.16 Intr + 128950 129077 128 2 2 112 4 142 0.113 9.10 8.17 Intr + 132922 132982 61 0 1 117 77 72 0.137 7.80 8.18 Intr + 141845 142119 275 0 2 17 38 166 0.018 2.30 8.19 Intr + 142479 142565 87 2 0 43 115 64 0.944 5.26 8.20 Term + 143304 143498 195 2 0 46 33 118 0.641 -0.17 8.21 PlyA + 145664 145669 6 1.05 9.15 PlyA - 145679 145674 6 1.05 9.14 Term - 146204 146182 23 0 2 125 32 5 0.140 -2.64 9.13 Intr - 153941 153858 84 0 0 98 102 92 0.984 11.89 9.12 Intr - 154107 154049 59 2 2 80 92 88 0.999 7.42 9.11 Intr - 154398 154322 77 0 2 34 105 154 0.900 10.51 9.10 Intr - 156217 156069 149 2 2 63 89 176 0.855 15.66 9.09 Intr - 156517 156406 112 1 1 106 100 19 0.985 5.36 9.08 Intr - 161550 161365 186 0 0 14 45 152 0.620 3.90 9.07 Intr - 161836 161687 150 1 0 102 80 309 0.977 32.37 9.06 Intr - 162071 162021 51 2 0 106 105 105 0.999 13.59 9.05 Intr - 163672 163622 51 1 0 123 85 46 0.994 7.39 9.04 Intr - 163933 163774 160 0 1 137 80 172 0.922 21.80 9.03 Intr - 164320 164148 173 1 2 88 92 270 0.999 26.66 9.02 Intr - 166569 166402 168 0 0 78 82 17 0.512 0.76 9.01 Init - 170060 170058 3 2 0 76 81 0 0.302 -1.97 9.00 Prom - 170269 170230 40 -1.51 10.12 PlyA - 171272 171267 6 1.05 10.11 Term - 177435 177059 377 1 2 110 49 917 0.999 85.26 10.10 Intr - 179415 179137 279 1 0 55 52 550 0.410 46.09 10.09 Intr - 184531 184432 100 1 1 113 111 58 0.892 10.78 10.08 Intr - 184943 184819 125 0 2 67 55 179 0.689 13.31 10.07 Intr - 188156 188139 18 0 0 116 113 -13 0.649 1.06 10.06 Intr - 188410 188307 104 0 2 83 94 120 0.951 12.42 10.05 Intr - 195354 194779 576 2 0 66 58 1196 0.053 106.84 10.04 Intr - 196705 196642 64 2 1 107 71 67 0.597 5.17 10.03 Intr - 202050 202016 35 2 2 124 56 -16 0.740 -2.85 10.02 Intr - 204564 203900 665 0 2 101 94 1723 0.975 165.39 10.01 Init - 213373 212699 675 1 0 82 99 1323 0.719 126.06 10.00 Prom - 215281 215242 40 -3.51 11.09 PlyA - 215439 215434 6 1.05 11.08 Term - 222514 222386 129 1 0 122 47 36 0.496 1.39 11.07 Intr - 223684 223502 183 1 0 124 68 193 0.953 21.40 11.06 Intr - 223908 223773 136 2 1 42 75 222 0.945 17.38 11.05 Intr - 224751 224676 76 1 1 69 85 99 0.996 6.77 11.04 Intr - 226910 226833 78 0 0 98 84 100 0.998 10.62 11.03 Intr - 227377 227256 122 0 2 39 77 144 0.771 9.04 11.02 Intr - 227803 227730 74 1 2 50 63 100 0.928 2.40 11.01 Intr - 228003 227928 76 2 1 90 78 54 0.793 4.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_1|659_aa XQEVTRLARELRSQRPTEPESRTPRGLRDRAWAANKSGVRTFFERGIGNVSEPYLVELEY SPGYERHQDEHREAATLDLKSKEEKDAELDKRIEALRRKNEALIRRYQEIEEDRKKAELE GVAVTAPRKGRSVEKENVAVESEKNLGPSRRSPGTPRPPGASKGGRTPPQQGGRAGMGRA SRSWEGSPGEQPRGGGAGGRGRRGRGRGSPHLSGAGDTSISDRKSKEWEERRRQNIEKMN EEMEKIAEYERNQREGVLEPNPVRNFLDDPRRRSGPLEESERDRREESRRHGRNWGGPDF ERVRCGLEHERQGRRAGLGSAGDMTLSMTGRERSEYLRWKQEREKIDQERLQRHRKPTGQ WRREWDAEKTDGMFKDGPVPAHEPSHRYDDQAWARPPKPPTFGEFLSQHKAEASSRRRRK SSRPQAKAAPRAYSDHDDRWETKEGAASPAPETPQPTSPETSPKETPMQPPEIPAPAHRP PEDEGEENEGEEDEEWEDISEDEEEEEIEVEEGDEEEPAQDHQAPEAAPTGIPCSEQAHG VPFSPEEPLLEPQAPGTPSSPFSPPSGHQPVSDWGEEVELNSPRTTHLAGALSPGGGQSA PAFPESGPSLRGTQEAEEEGSEATPEAGPEGQETAEITDFQRVRFCKVVAAPPLPGAAR >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_1|1980_bp nngcaggaagtgacgcgcctggcccgggagctgcggtcgcagaggccgacggagccggag tcgcggacgccgcggggtctccgggacagagcttgggcagccaataagagtggggttcgg actttctttgagcgtgggattgggaatgtttctgaaccttacctggtggagctagaatat tcgcctggatatgagaggcaccaggatgagcacagggaggcagccacactcgatttgaaa tcaaaggaggagaaggatgctgagttggacaagaggatcgaggctcttcggcggaagaat gaggccctcatccggcgctaccaggagattgaggaagaccgtaagaaagctgaacttgag ggagtcgcagtcacagctccccgaaagggccgctcagtggagaaggagaacgtggcagtg gagtcggagaagaacctgggtccttcccggaggtctcctgggacccctcggcccccaggg gccagcaaggggggccggactcctccacagcagggaggccgggccggcatgggccgagca tcgcgcagctgggagggcagccccggggagcagcctcgaggaggaggagctgggggccgt ggccggaggggccggggccgaggttcacctcacctctctggagctggagacacctcaatc tctgaccgtaaatccaaggagtgggaggagcggcgcaggcagaacattgagaagatgaat gaggagatggagaagatcgccgagtatgagcgcaaccagcgggaaggggttcttgaaccc aacccagtgcggaacttcctggacgacccccggcgacgcagcgggcccctggaggagtct gagcgggaccgccgggaggagagccgccggcacggccgcaactgggggggccccgacttc gagcgggtgcgctgtggccttgagcacgagcggcagggccgccgagctggcctgggcagt gctggagacatgacgttgtccatgacgggccgggagcggtcggagtacctgcgctggaag caggagagggagaagatcgaccaggagcggctgcagaggcaccgcaagcccactggccag tggaggcgcgagtgggatgccgagaagaccgatgggatgttcaaggatggcccagtccct gcccatgaaccatcccaccgctatgatgaccaggcctgggcccggcccccgaagccccct acttttggggagttcctgtcccagcacaaagctgaggccagcagccgcagaaggagaaag agcagtcggccccaggccaaggcagcgcccagggcctacagtgaccatgatgaccgctgg gagacaaaagaaggggcagcatccccagcccctgagactccacagcctacttcccccgag acttcccccaaggagacacccatgcagccacccgagatcccagctcctgcccaccggcct cctgaagacgagggggaagagaatgagggggaagaggatgaagaatgggaggacataagt gaggatgaggaagaggaggagatcgaggtggaagaaggtgatgaggaggaaccagcccaa gaccaccaagccccagaggctgcccccaccgggatcccctgcagtgagcaggcccacgga gtccccttcagtccggaggagcccctgctggagccccaggcccctggcacgccttccagc cctttctcaccacccagcggccaccagcctgtgtccgattggggtgaagaggtggagctg aattctccccggaccactcacctggctggcgccctctccccgggaggtggccagtcagcc cctgccttcccggagagtgggcccagcctccgaggaacccaggaagctgaagaggaaggg tctgaggcaactccagaggcaggccccgaaggccaggagacggcggagatcaccgacttc cagagggtgcgtttctgcaaggtggtggcggcccctccgctgccgggggccgctcgctga >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_2|142_aa MRGTSCVGGGAESPGGAGLSEGPRGRWLRLAPVCAYFLCVSLAAVLLAVYYGLIWVPTRS PAAPAGPQPSAPSPPCAARPGVPPVPAPAAASLSCLLGVPGGPRPQLQLPLSRRRRYSDP DRRPSRQTPRETPEAAEGRRPG >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_2|429_bp atgcgggggaccagctgcgtgggcggcggcgccgagagccccggaggcgcggggctgagc gagggcccgcgggggcgctggctgcgcttggctccggtatgcgcctacttcctctgcgtc tcgctagctgccgtgctgctcgccgtgtactacggtctcatctgggtacccacgcggtct cccgcggcacccgccggcccacagcccagcgcgccgtcccctccgtgtgctgcccgcccg ggcgtgccgcctgtcccggcgcccgccgctgcctccctctcctgcctcctgggagtcccc ggcgggccgcgaccccagctccagctgccgctgagccgccgccgccgctacagcgaccct gaccgccgtccgagccgccagacacccagagagacgccagaggccgcggaggggcgaaga cccgggtaa >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_3|48_aa MVFNGQHSSTATMVSHLNQGGGVLIGLLASTIAPCSPFSTSQSEGLEI >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_3|147_bp atggtgtttaatggccaacactctagcacagccaccatggtctctcacctgaaccagggc ggtggtgtcctcattgggctcctggcttccaccatcgccccctgcagtccattctccaca tcgcagtcagaaggtcttgagatctga >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_4|22_aa MRVVISSHRVIAMESGAKVLLE >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_4|69_bp atgagggtggtaatttccagtcaccgggtcattgccatggaaagtggtgctaaggtgctt ctggaatag >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_5|109_aa MPTPIRKDGLSRRLQLPHPKEKQLCPATSLPAATGSPEPAKLGWLTTTYHPPLPWHEGSS AVATVAFEAGCGGLISGNACAHAVIRPWPDQLTRNSSTTGGLSKAQRPQ >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_5|330_bp atgccaacccccatcaggaaggatggactgagcaggcggctgcagctaccacatcccaag gagaagcagctgtgcccagccaccagcttgccagcggccacaggaagcccagaaccagcc aaactgggctggttgaccaccacttaccaccctccactcccttggcacgagggttcctca gctgtggccacagtggcttttgaggcaggatgtggtgggctcatcagcggcaatgcttgt gcccatgcagttatccgaccctggcctgaccaacttaccagaaactctagcaccacgggg ggtctgagtaaggcccagaggccgcaataa >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_6|393_aa MALTLQGWDVAMGIRLSRLWEQQQPRGAGAREDSPGEGTLELGLDSFNYTTPDYGHYDDK DTLDLNTPVDKTSNTLRVPDILALVIFAVVFLVGVLGNALVVWVTAFEAKRTINAIWFLN LAVADFLSCLALPILFTSIVQHHHWPFGGAACSILPSLILLNMYASILLLATISADRFLL VFKPIWCQNFRGAGLAWIACAVAWGLALLLTIPSFLYRVVREEYFPPKVLCGVDYSHDKR RERAVAIVRLVLGFLWPLLTLTICYTFILLRTWSRRATRSTKTLKVVVAVVASFFIFWLP YQVTGIMMSFLEPSSPTFLLLKKLDSLCVSFAYINCCINPIIYVVAGQGFQGRLRKSLPS LLRNVLTEESVVRESKSFTRSTVDTMAQKTQAV >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_6|1182_bp atggctctcacactccagggctgggatgtggccatgggaataagattgtcaagattgtgg gagcaacagcaacctcgtggggctggggcccgagaagattccccaggggaggggaccctt gagttgggtctagactccttcaattataccacccctgattatgggcactatgatgacaag gataccctggacctcaacacccctgtggataaaacttctaacacgctgcgtgttccagac atcctggccttggtcatctttgcagtcgtcttcctggtgggagtgctgggcaatgccctg gtggtctgggtgacggcattcgaggccaagcggaccatcaatgccatctggttcctcaac ttggcggtagccgacttcctctcctgcctggcgctgcccatcttgttcacgtccattgta cagcatcaccactggccctttggcggggccgcctgcagcatcctgccctccctcatcctg ctcaacatgtacgccagcatcctgctcctggccaccatcagcgccgaccgctttctgctg gtgtttaaacccatctggtgccagaacttccgaggggctggcttggcctggatcgcctgt gccgtggcttggggtttagccctgctgctgaccataccctccttcctgtaccgggtggtc cgggaggagtactttccaccaaaggtgttgtgtggcgtggactacagccacgacaaacgg cgggagcgagccgtggccatcgtccggctggtcctgggcttcctgtggcctctactcacg ctcacgatttgttacactttcatcctgctccggacgtggagccgcagggccacgcggtcc accaagacactcaaggtggtggtggcagtggtggccagtttctttatcttctggttgccc taccaggtgacggggataatgatgtccttcctggagccatcgtcacccaccttcctgctg ctgaagaagctggactccctgtgtgtctcctttgcctacatcaactgctgcatcaacccc atcatctacgtggtggccggccagggcttccagggccgactgcggaaatccctccccagc ctcctccggaacgtgttgactgaagagtccgtggttagggagagcaagtcattcacgcgc tccacagtggacactatggcccagaagacccaggcagtgtag >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_7|337_aa MGNDSVSYEYGDYSDLSDRPVDCLDGACLAIDPLRVAPLPLYAAIFLVGVPGNAMVAWVA GKVARRRVGATWLLHLAVADLLCCLSLPILAVPIARGGHWPYGAVGCRALPSIILLTMYA SVLLLAALSADLCFLALGPAWWSTVQRACGVQVACGAAWTLALLLTVPSAIYRRLHQEHF PARLQCVVDYGGSSSTENAVTAIRFLFGFLGPLVAVASCHSALLCWAARRCRPLGTAIVV GFFVCWAPYHLLGLVLTVAAPNSALLARALRAEPLIVGLALAHSCLNPMLFLYFGRAQLR RSLPAACHWALRESQGQDESVDSKKSTSHDLVSEMEV >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_7|1014_bp atggggaacgattctgtcagctacgagtatggggattacagcgacctctcggaccgccct gtggactgcctggatggcgcctgcctggccatcgacccgctgcgcgtggccccgctccca ctgtatgccgccatcttcctggtgggggtgccgggcaatgccatggtggcctgggtggct gggaaggtggcccgccggagggtgggtgccacctggttgctccacctggccgtggcggat ttgctgtgctgtttgtctctgcccatcctggcagtgcccattgcccgtggaggccactgg ccgtatggtgcagtgggctgtcgggcgctgccctccatcatcctgctgaccatgtatgcc agcgtcctgctcctggcagctctcagtgccgacctctgcttcctggctctcgggcctgcc tggtggtctacggttcagcgggcgtgcggggtgcaggtggcctgtggggcagcctggaca ctggccttgctgctcaccgtgccctccgccatctaccgccggctgcaccaggagcacttc ccagcccggctgcagtgtgtggtggactacggcggctcctccagcaccgagaatgcggtg actgccatccggtttctttttggcttcctggggcccctggtggccgtggccagctgccac agtgccctcctgtgctgggcagcccgacgctgccggccgctgggcacagccattgtggtg gggttttttgtctgctgggcaccctaccacctgctggggctggtgctcactgtggcggcc ccgaactccgcactcctggccagggccctgcgggctgaacccctcatcgtgggccttgcc ctcgctcacagctgcctcaatcccatgctcttcctgtattttgggagggctcaactccgc cggtcactgccagctgcctgtcactgggccctgagggagtcccagggccaggacgaaagt gtggacagcaagaaatccaccagccatgacctggtctcggagatggaggtgtag >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_8|1360_aa MPPPRTREGRDRRDHHRAPSEEEALEKWDWNCPETRRLLEDAFFREEDYIRQGSEECQKF WTFFERLQRFQNLKTSRKEEKDPGQPKHSIPALADLPRTYDPRYRINLSVLGPATRGSQG LGRHLPAERVAEFRRALLHYLDFGQKQAFGRLAKLQRERAALPIAQYGNRILQTLKEHQV VVVAGDTGCGKSTQVPQYLLAAGFSHVACTQPRRIACISLAKRVGFESLSQYGSQVGYQI RFESTRSAATKIVFLTVGLLLRQIQREPSLPQYEVLIVDEVHERHLHNDFLLGVLQRLLP TRPDLKVILMSATINISLFSSYFSNAPVVQVPGRLFPITVVYQPQEAEPTTSKSEKLDPR PFLRVLESIDHKYPPEERGDLLVFLSGMAEISAVLEAAQTYASHTQRWVVLPLHSALSVA DQDKVFDVAPPGVRKCILSTNIAETSVTIDGIRFVVDSGKVKEMSYDPQAKLQRLQEFWI SQASAEQRKGRAGRTGPGVCFRLYAESDYDAFAPYPVPEIRRVALDSLVLQMKSMSVGDP RTFPFIEPPPPASLETAILYLRDQGALDSSEALTPIGSLLAQLPVDVVIGKMLILGSMFS LVEPVLTIAAALSVQSPFTRSAQSSPECAAARRPLESDQGDPFTLFNVFNAWVQVKSERS RNSRKWCRRRGIEEHRLYEMANLRRQFKELLEDHGLLAGAQAAQVGDSYSRLQQRRERRA LHQLKRQHEEGAGRRRKVLRLQEEQDGGSSDEDRAGPAPPGASDGVDIQVGAMGCGVWGF TKDVKFKLRHDLAQLQAAASSAQDLSREQLALLKLVLGRGLYPQLAVPDAFNSSRKDSDQ IFHTQAKQGAVLHPTCVFAGSPEVLHAQELEASNCDGSRDDKDKMSSKHQLLSFVSLLET NKPYLVNCVRIPALQSLLLFSRSLDTNGDCSRLVADGWLELQLADSESAIRLLAASLRLR ARWESALDRQLAHQAQQQLEEEEEDTPVSPKEVATLSKELLQFTASKIPYSLRRLTGLEV QNMYVGPQTIPATPHLPGLFGSSTLSPHPTKGGYAVTDFLTYNCLTNDTDLYSDCLRTFW TCPHCGLHAPLTPLERIAHENTCPQAPQDGPPGAEEAALETLQKTSVLQRPYHCEACGKD FLFTPTEVLRHRKQHNKGPIRGDVLGVSVSFWEEEVEGLKKAAYKAVNYDKLKETTQGKE ENPAQFVAHLAATLRRYTALDPEGPEGRLILNMHFITQSTPDIRKKLQKLESGPQTPQQE LSNLAFKRLKTDAARSPRKPPRPSQTPSFMQLSQWKPLFAFTWTDPDTHQAQQTTWAVLP QGFTDSPHYFSQAQISSSSVTYLSIIIIKTQVLSLLIVSN >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_8|4083_bp atgcctcctcctagaacaagggagggcagggatcgccgagaccaccaccgggctcccagc gaggaagaggccttggagaaatgggactggaattgtccagagacgcgtcgcctcttggaa gatgccttcttccgtgaagaggattacatccgtcagggttctgaggaatgtcagaagttt tggaccttctttgaacgcctgcagagattccagaatctcaagacctccaggaaggaggag aaagaccctggacagcccaagcacagcatcccagcgctggccgacctacctcgcacttac gacccacgttaccgcatcaacctctctgttcttggccctgccacgcggggctctcaggga ctgggcaggcacttgcccgcggagagagtggctgagttccgccgagccctgttgcactac ctggactttggccagaagcaggcatttgggcgtctggccaagctgcagcgtgagcgggca gccctccccatcgcccagtatgggaaccgcatcctgcagacgctgaaggagcaccaggtg gtggtagtggccggtgacaccggctgtggcaagtccactcaggtgccccagtacctgctg gctgctggcttcagtcatgtggcgtgcacccagccccggcggatcgcctgcatctcactg gccaagcgtgtgggctttgagagcctcagtcagtatggctcacaggtcggctaccagatc cgctttgagagcacacgttcggcggccaccaagattgtattcctgacagtggggctgctc ctgcgacaaatccagcgggaacccagcctgccccagtatgaggtcctgattgtggatgaa gtccatgagcggcatctccacaacgatttcctcctgggcgtcctccagcgcctgttgccc acgcggcctgacctcaaggtcatcctcatgtcggccaccatcaacatctcgctcttctcc agctatttcagcaatgcccctgtggtacaggtgcctgggaggctgttccccatcacggtt gtgtaccagccgcaggaggcggagccgaccacgtccaagtcagagaagctggacccgcgg cctttcctgagggtgctggagtccattgaccacaagtacccgcctgaggagcggggtgac ctcctcgtcttcctcagcggcatggcggagatcagcgccgtgctggaggctgcccagacc tatgccagccacacccagcgctgggtggtactgccactgcacagcgccctgtctgtggcc gaccaggacaaggtatttgatgtggcaccccctggagtccggaaatgcatcctctccacc aacattgctgagacctcagtcaccattgacgggatccgcttcgtagtagattccggaaag gtgaaggagatgagctacgatccgcaggccaagctgcaacggctgcaggagttctggatt agtcaggccagcgcagagcagcggaagggccgggcgggccgcacgggccccggagtctgc ttccgcctctatgccgaatcggactatgatgccttcgccccctaccccgtcccagaaatt cggagggtggccctggactcgttggtgctgcagatgaagagcatgagtgtgggggacccc cgaaccttccccttcatcgagcccccaccaccagccagcctggaaaccgccatcctctac ctccgggaccagggggccctggacagctcagaggccctcacacccattgggtccctgcta gcccagctgcctgtggacgttgtgattgggaagatgctgatcctgggctccatgttcagc ctggtggagcctgtgctcaccatcgcagccgcacttagcgtccagtcgcccttcacccgc agcgcccagagcagcccagagtgcgcggcagcacggcggccgctggagagcgaccagggt gaccccttcacgctcttcaacgtcttcaacgcctgggtgcaggtgaaatctgaacggagc agaaactctcgcaagtggtgccgccgccggggcatagaggagcatcgactgtacgaaatg gccaaccttcggcgccagttcaaggagctgttggaggaccacgggctgctggctggggcc caggccgcgcaggtaggggacagctacagtcggttgcagcagcgccgggagcgccgggcc ctgcaccagctgaaacgccagcacgaggagggcgcggggcgcaggcgcaaggtgctgcgg ctgcaggaggagcaggacggcggctccagtgacgaggacagggctggcccagccccccca ggggccagtgatggcgtggacatccaggtgggcgccatgggctgtggggtgtgggggttt accaaggatgtgaagttcaagcttcggcatgacctggcgcagctgcaggccgctgccagc tcagcccaggacctgagccgcgagcagctggctctgctgaagctggtgctgggccggggc ctgtacccacagctggccgtccccgacgccttcaacagcagccgaaaggactcagaccag attttccacacgcaggccaagcagggcgccgtgctgcaccccacctgcgtcttcgctggc agccccgaggtgctgcacgcacaggagctggaggccagcaactgcgacggaagccgagac gacaaggacaagatgagcagcaaacaccagctcctcagcttcgtgtccctgctggagacc aacaagccgtacctggtgaactgcgtccgcatccctgccctccagtccctcctgcttttt agccggtctttggacaccaatggtgactgctcccgcctggtggccgatggctggctggag ctgcagctagcagacagtgaaagtgccatccgactcctggcggcttccctgcggctccgt gcccgctgggaaagtgccctggaccggcagctggcgcaccaggcccagcagcagctggag gaggaggaggaggatacgccagtcagccccaaggaggtggccaccctgagcaaggaactc ctgcaattcacggcatccaagattccttacagcctccggcggctcacagggctagaagtc cagaacatgtatgtgggaccccagaccatcccagccaccccccatcttcctggcctcttt ggcagctccaccctgtccccccaccccacaaaggggggctacgcagtcactgacttcctc acctacaactgcctcacgaatgacacagacctgtacagcgactgtctccgaaccttctgg acctgcccccactgtggcctgcatgcgcccctcacgcccctggagcgcatcgcccatgag aacacctgcccccaggccccacaggatgggcccccaggggctgaggaagctgccctcgaa accctccagaagacatctgtcctgcagaggccctaccactgcgaggcctgcgggaaggac ttcctctttacacccacagaggtgctgcgccaccggaagcagcacaacaaaggccccatc agaggggacgtgctgggtgtcagcgtcagcttctgggaggaggaagttgaagggcttaaa aaggcagcttacaaagctgttaattatgacaaacttaaagaaactacccaaggtaaagag gaaaacccagcccagttcgtggcccacttagcagcaacacttagacgctataccgcccta gacccagaagggccagaaggccgccttattcttaatatgcattttatcactcagtccact cctgacattaggaaaaaacttcaaaaattagaatctggccctcaaaccccacaacaggaa ttaagcaacctcgccttcaagcggctgaagactgatgctgcccgatcgcctcggaagccc cctagaccatcacagacgccaagcttcatgcaactctcacagtggaagcctctcttcgct ttcacttggactgaccctgacacccatcaggctcagcaaactacctgggctgtactgccg caaggcttcacagacagcccccattacttcagtcaagcccaaatttcatcctcatctgtt acctatctcagcataattatcataaaaacacaggtgctctccctgctgatcgtgtccaat taa >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_9|481_aa MLCRLTLAHPLSRSLVVLFNFHKTRERPGTGSAAASTPEPPPDPPSPRAAPRPLRSPYDE LPHYPGIVDGPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEIYGHPLFP LLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDSFNEDIAAFAKQVRSERPLFSSNP ELDNLMIQAIQVLRFHLLELEKVHDLCDNFCHRYITCLKGKMPIDLVIEDRDGGCREDFE DYPASCPSLPDQPHTHYTTATQTRMRLGNSAQTHAHQRLQMPPTDSATPTQVHIQPHQLS NSHFCSQPDSSVHRNNMWIRDHEDSGSVHLGTPGPSSGGLASQSGDNSSDQGDGLDTSVA SPSSGGEDEDLDQERRRNKKRGIFPKVATNIMRAWLFQHLSHPYPSEEQKKQLAQDTGLT ILQVNNWFINARRRIVQPMIDQSNRTGQGAAFSPEGQPIGGYTETQPHVAVRPPESGNAS H >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_9|1446_bp atgctctgcaggctcactctggctcatcccctatcccgctccttggtggttctgtttaat ttccacaagacaagggagagaccgggaacggggagcgcggctgccagcacccctgagccg ccgccggaccctccgtcgccccgggccgccccccgccccctgcggtccccgtatgatgag ctgccgcactacccaggcatcgtggatggccccgcagccctggctagcttcccagagaca gtgcccgcagtaccagggccctatggcccgcaccggcctccccagcccctgcccccaggc ttggacagcgacggcctgaagagggagaaggatgagatctatggacacccgctcttcccc ctcttggccctggtctttgagaaatgtgaactggctacatgctctccccgtgacggggcc ggagctgggctggggacaccccctggaggtgacgtctgctcctctgattccttcaacgag gacatcgctgcctttgccaagcaggttcgctctgagaggcccctcttctcctccaaccca gaactggacaatctgatgatccaggccatccaggtgctgcggttccacctgctggagctg gagaaggtccacgacctgtgcgacaacttctgtcaccgctacatcacctgcctcaaggga aagatgcccatcgacctggtcatcgaggatcgggacggcggctgcagggaggacttcgag gactacccagcctcctgccccagcctcccagaccagccacacacccactacacaacagct acgcagactcggatgcggctgggcaacagtgcacagacacacgcacaccagcggctgcag atgccacccacagattcagctacacccacacaggtccacatacagccacaccaactcagc aacagccatttctgctcacagccagacagtagcgtccaccggaataatatgtggattcga gaccatgaggatagtgggtctgtacatttggggaccccaggtccatccagtgggggcctg gcctcccagagtggggacaactccagtgaccaaggagacgggctggacaccagcgtggcc tctcccagttctggtggagaagatgaggacttggaccaggagcgacggcgaaacaagaag agggggatcttccccaaggtggccaccaacatcatgcgagcctggttgttccagcacctc tcgcacccgtacccctcggaggagcagaagaaacagctggcgcaggacacggggctcacc atcctgcaagtcaacaactggttcattaacgcccggagacgcatcgtgcaacctatgatc gatcaatccaaccgcacagggcagggtgcagccttcagcccagagggccagcccatcggg ggctataccgagacgcagccacacgtggccgtccggcctccggaatctggaaatgcctct cattaa >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_10|1005_aa MAPLALVGVTLLLAAPPCSGAATPTPSLPPPPANDSDTSTGGCQGSYRCQPGVLLPVWEP DDPSLGDKAARAVVYFVAMVYMFLGVSIIADRFMAAIEVITSKEKEITITKANGETSVGT VRIWNETVSNLTLMALGSSAPEILLSVIEVCGHNFQAGELGPGTIVGSAAFNMFVVIAVC IYVIPAGESRKIKHLRVFFVTASWSIFAYVWLYLILAVFSPGVVQVWEALLTLVFFPVCV VFAWMADKRLLFYKYVYKRYRTDPRSGIIIGAEGDPPKSIELDGTFVGAEAPGELGGLGP GPAEARELDASRREVIQILKDLKQKHPDKDLEQLVGIANYYALLHQQKSRAFYRIQATRL MTGAGNVLRRHAADASRRAAPAEGAGEDEDDGASRIFFEPSLYHCLENCGSVLLSVTCQG GEGNSTFYVDYRTEDGSAKAGSDYEYRVSHLLPRLEYSVPRRGDPSLSQILSPVRLSSPS ITCLMPVDPHTWEFFYGTYVSSGKHVQTPPVQHTADDRPSPRSSPSPPRRSEGTLVFKPG ETQKELRIGIIDDDIFEEDEHFFVRLLNLRVGDAQGMFEPDGGGRPKGRLVAPLLATVTI LDDDHAGIFSFQDRLLHVSECMGTVDVRVVRSSGARGTVRLPYRTVDGTARGGGVHYEDA CGELEFGDDETMKTLQVKIVDDEEYEKKDNFFIELGQPQWLKRGISALLLNQGDGDRKLT AEEEEARRIAEMGKPVLGENCRLEVIIEESYDFKNTVDKLIKKTNLALVIGTHSWREQFL EAITVSAGDEEEEEDGSREERLPSCFDYVMHFLTVFWKVLFACVPPTEYCHGWACFGVSI LVIGLLTALIGDLASHFGCTVGLKDSVNAVVFVALGTSIPDTFASKVAALQDQCADASIG NVTGSNAVNVFLGLGVAWSVAAVYWAVQGRPFEVRTGTLAFSVTLFTVFAFVGIAVLLYR RRPHIGGELGGPRGPKLATTALFLGLWLLYILFASLEAYCHIRGF >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_10|3018_bp atggctcccctggccttggtgggggtcacactcctcctggcggctcccccatgctccggg gcagccaccccaaccccctccctgccgcctcccccggccaatgacagcgacaccagcaca gggggctgccaggggtcctaccgctgccagccgggggtgctgctgcccgtgtgggagccc gacgacccgtcgctgggtgacaaggcggcacgggcagtggtgtactttgtggccatggtc tacatgtttctgggagtgtccatcatcgccgaccgtttcatggcggccatcgaggtcatc acgtcaaaagagaaggagatcaccatcaccaaggccaacggtgagaccagcgtgggcacc gttcgcatctggaatgagacggtgtccaacctcacgctcatggccctgggctcctccgca cctgagatcctgctgtcagtcatcgaagtctgcggccacaacttccaggcgggtgagctg ggcccaggcaccatcgtgggcagcgctgccttcaacatgtttgtggtcatcgccgtgtgc atctacgtcatcccagccggcgagagccgcaagatcaagcacctgagagtcttctttgtc actgcctcttggagcatcttcgcctatgtctggctttatctcatccttgctgttttttcc cccggtgtggtccaggtgtgggaggcgctgctgaccctggtcttcttcccggtgtgcgtg gtattcgcctggatggccgacaagcggctgctcttctacaagtacgtgtacaagcgctac cgcaccgacccacgcagcggcatcatcataggcgccgagggcgaccccccgaagagcatc gagctggacggcacgttcgtgggcgccgaggccccaggtgagctgggcggcctgggcccg ggccccgccgaggcgcgcgagctggacgccagccgccgcgaggtcatccagatcctcaag gacctcaagcagaagcacccggacaaggatctggagcagctggtgggcatcgccaactac tacgcgctgctgcaccagcagaagagccgcgccttctaccgcatccaggccacgcggctg atgaccggcgccgggaacgtgctgcgcagacacgcggcggacgcctcgcgcagggcggcg ccggccgagggcgcgggcgaggacgaagacgacggcgccagccgcatcttcttcgagcct agcctctaccactgcctggagaactgcggctccgtgctgctgtccgtcacgtgccagggc ggcgagggcaacagcaccttctacgtggactaccgcactgaggacggctctgccaaggcg ggctccgactacgagtacagagtttcgcacttgttgcccagactggagtacagtgttcct cgaagaggtgacccgtccctaagtcagatcctgtcccctgtccggctgtccagtcccagt attacttgtttaatgcctgtcgaccctcatacttgggagttcttctatggcacctacgtc tcctctgggaagcacgtccagactccccccgtacaacacacagccgatgaccgaccctca ccccgctcttccccctccccaccccgccgcagcgagggcacgctggtgttcaaaccaggc gagacgcagaaggagctgcgcatcggcatcatcgacgacgacatcttcgaggaggacgag catttcttcgtgcggctgctgaacctgcgcgtgggcgacgcgcagggcatgttcgagccg gacggcggcgggcggcccaaggggcggctggtggcgccgctgctggccaccgtcaccatc ctggacgacgaccacgcaggcatcttctccttccaggaccgcctgctgcacgtgagcgag tgcatgggcaccgtggacgtgcgcgtcgtgcgcagctcgggcgcgcgcggcaccgtgcgc cttccctaccgcacggtggacggcacggcgcgcggcggcggcgtgcactacgaggacgcg tgcggagagctggagtttggcgacgacgagaccatgaaaactcttcaggtgaagatagtt gatgacgaggaatatgagaaaaaggataatttcttcattgagctgggccagccccagtgg cttaagcgagggatttcagctctgctactcaatcaaggggatggggacaggaagctaaca gccgaggaggaggaggctcggaggatagcagagatgggcaagccagttcttggggagaac tgccggctggaggtcatcatcgaggagtcatatgattttaagaacacggtggataaactc atcaagaaaacgaacttggccttggtaattgggacccattcatggagggagcagttttta gaggcaattacggtgagcgcaggggacgaggaggaggaggaggacgggtcccgggaggag cggctgccgtcgtgctttgactacgtgatgcacttcctgacggtgttctggaaggtgctc ttcgcctgtgtgccccccaccgagtactgccacggctgggcctgctttggtgtctccatc ctggtcatcggcctgctcaccgccctcattggggacctcgcctcccacttcggctgcacc gttggcctcaaggactctgtcaatgctgttgtcttcgttgccctgggcacctccatccct gacacgttcgccagcaaggtggcggcgctgcaggaccagtgcgccgacgcgtccatcggc aacgtgaccggctccaacgcggtgaacgtgttccttggcctgggcgtcgcctggtctgtg gccgccgtgtactgggcggtgcagggccgccccttcgaggtgcgcactggcacgctggcc ttctccgtcacgctcttcaccgtcttcgccttcgtgggcattgccgtgctgctgtaccgg cgccggccgcacatcggcggcgagctgggcggcccgcgcggacccaagctcgccaccacc gcgctcttcctgggcctctggctcctgtacatcctcttcgccagcctggaggcgtactgc cacatccggggcttctag >gi568815579f:47253031_47482110|GENSCAN_predicted_peptide_11|291_aa XVQVGDQLETVFLLSGNDPAIHLYKENEGLHQFEEQPVENLFPELTNLTSSVLWLDVHNF PGTSRRLSALGCQSGYVRVAHVDQRSRGEGREVLQMWSVLQDGPISRVIVFSLSAAKETK DRPLQDEYSVLVASMLEPAVVYRDLLNRGLEDQLLLPGSDQFDSVLCSLVTDVDLDGRPE VLVATYGQELLCYKYRGPESGLPEAQHGFHLLWQRSFSSPLLAMAHVDLTGDGLQELAVV SLKGVHILQHSLIQASELVLTRLRHQVEQRRRRLQGLEDGAGAGPAENAAS >gi568815579f:47253031_47482110|GENSCAN_predicted_CDS_11|876_bp nnggtccaggtcggggatcaacttgagactgtgtttctcttgagtgggaacgacccggcc attcatctctacaaggagaacgaggggctgcatcagtttgaggaacagcccgtggaaaac ctcttcccagagctgacgaacctgaccagtagcgtcctctggctggacgtccacaacttc cccggcacgtcccggcgcctctcagctctgggctgtcagagtggttatgtccgtgtcgcc cacgtggaccagcggagtcgaggtgagggccgcgaggttctgcagatgtggtcggtcctg caggacggtcccatctcccgagtgattgtgttcagcctctcggccgccaaggagaccaag gacaggccactacaagatgagtacagcgtgctcgtggccagcatgttggagccagcagtg gtgtatcgggacctgctgaaccggggtcttgaagaccagcttctcctgcccggcagtgac cagtttgacagcgtcctctgcagcctggtcaccgatgtggatttggatgggcggccagaa gtcctggtggccacctatggacaggaactgctgtgttataagtaccggggcccagagtcg gggcttcctgaggcccagcacgggttccatctgctgtggcagcggagcttctccagtccc ctgctggccatggctcacgtggacctgaccggggatgggctgcaggagcttgccgtggtc tccctgaagggcgtgcacatcctgcagcacagcctgattcaggcctcagagctggtcttg acccggcttcgacatcaagtggagcagaggagacgtcggctacaggggttggaggacggg gcaggtgcagggcctgctgagaatgcagcctcttaa