GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:31:02

Sequence gi568815597f:10395271_10729984 : 334714 bp : 49.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3704   3855  152  1  2   69   81    95 0.585   6.53
 1.02 Intr +   4359   4434   76  0  1  114   91   190 0.995  21.42
 1.03 Intr +   5123   5302  180  1  0  102   73   203 0.987  20.26
 1.04 Intr +   7801   7866   66  0  0   77   98    79 0.987   6.90
 1.05 Intr +   8891   9009  119  1  2   76   89    46 0.753   2.76
 1.06 Intr +  12801  12840   40  0  1  123    0    59 0.695  -1.17
 1.07 Intr +  16148  16282  135  1  0  107   94   247 0.993  27.96
 1.08 Intr +  17792  17981  190  1  1   76   23   240 0.973  15.46
 1.09 Intr +  21717  21847  131  1  2   75   81   160 0.977  14.41
 1.10 Intr +  21998  22081   84  1  0   62   44    70 0.502   0.02
 1.11 Intr +  22106  22239  134  1  2   45  115   127 0.991  10.54
 1.12 Intr +  23556  23655  100  0  1   75   91   140 0.997  13.11
 1.13 Intr +  24147  24269  123  2  0   90  115    59 0.999   9.58
 1.14 Term +  24360  24479  120  2  0  117   43   115 0.999   8.37
 1.15 PlyA +  24853  24858    6                               1.05

 2.00 Prom +  27848  27887   40                              -3.16
 2.01 Init +  31081  31188  108  0  0   82   35   205 0.908  12.83
 2.02 Intr +  35233  35298   66  0  0   61   92    66 0.247   3.40
 2.03 Intr +  38572  38695  124  0  1   73   48   154 0.911  10.06
 2.04 Intr +  39387  39420   34  1  1  121   98    37 0.722   5.28
 2.05 Term +  44962  45100  139  1  1   58   41    64 0.138  -3.86
 2.06 PlyA +  45820  45825    6                               1.05

 3.00 Prom +  46965  47004   40                              -5.16
 3.01 Init +  54954  55052   99  2  0   65  105   209 0.987  18.57
 3.02 Term +  58715  58747   33  1  0   82   55    19 0.273  -4.41
 3.03 PlyA +  62633  62638    6                               1.05

 4.08 PlyA -  64675  64670    6                               1.05
 4.07 Term -  66432  66220  213  0  0   53   43   231 0.994  12.33
 4.06 Intr -  67939  67788  152  2  2  118   64   143 0.999  14.78
 4.05 Intr -  68350  68161  190  1  1   53   97   139 0.982  10.46
 4.04 Intr -  72062  71920  143  0  2   63  103   138 0.995  12.87
 4.03 Intr -  74068  73907  162  2  0   61  110   152 0.997  14.65
 4.02 Intr -  76654  76595   60  2  0   69   92    71 0.949   4.31
 4.01 Init -  77188  77008  181  1  1   69   76   203 0.905  16.55
 4.00 Prom -  77789  77750   40                              -5.66

 5.00 Prom +  78081  78120   40                              -6.36
 5.01 Init +  79697  79732   36  1  0   91  110    30 0.407   5.55
 5.02 Intr + 100004 100051   48  1  0  101  119    25 0.718   5.78
 5.03 Intr + 100419 100451   33  2  0   83  115     3 0.477   0.92
 5.04 Term + 100643 100654   12  1  0  129   41    17 0.579  -0.70
 5.05 PlyA + 101730 101735    6                               1.05

 6.05 PlyA - 101828 101823    6                               1.05
 6.04 Term - 112606 112268  339  1  0   57   37   125 0.106  -1.16
 6.03 Intr - 115832 115748   85  1  1   10   59   134 0.072   2.52
 6.02 Intr - 131644 131412  233  1  2    6  100   145 0.028   4.07
 6.01 Init - 138340 138248   93  1  0   88  103    30 0.485   4.88
 6.00 Prom - 144340 144301   40                              -3.26

 7.07 PlyA - 145045 145040    6                               1.05
 7.06 Term - 146079 146012   68  1  2  123   45    41 0.055   1.40
 7.05 Intr - 154307 154269   39  0  0  118  101     9 0.719   3.40
 7.04 Intr - 156236 156099  138  0  0    2  100   136 0.723   6.64
 7.03 Intr - 158620 158438  183  2  0   35   81    86 0.500   2.36
 7.02 Intr - 160596 160546   51  1  0   83   94    14 0.262   0.38
 7.01 Init - 165894 165855   40  0  1   82   72    61 0.388   4.25
 7.00 Prom - 172992 172953   40                              -2.96

 8.00 Prom + 178489 178528   40                              -2.86
 8.01 Init + 200747 200870  124  1  1   94   94     7 0.843   1.88
 8.02 Intr + 203968 204096  129  2  0  110   98   110 0.885  14.87
 8.03 Intr + 223062 223147   86  1  2  127  113   102 0.830  16.14
 8.04 Intr + 227749 227851  103  0  1  100  103    98 0.995  12.25
 8.05 Intr + 229070 229167   98  0  2   91   77   186 0.601  17.53
 8.06 Intr + 232002 232093   92  2  2   95   66    69 0.473   4.19
 8.07 Term + 234261 234717  457  0  1  132   49   845 0.985  79.60
 8.08 PlyA + 235463 235468    6                               1.05

 9.24 PlyA - 240118 240113    6                               1.05
 9.23 Term - 244789 243672 1118  2  2  115   44  2651 0.997 255.60
 9.22 Intr - 247052 246753  300  0  0  103   97    13 0.529   0.41
 9.21 Intr - 247730 247589  142  2  1  103   99   128 0.990  15.33
 9.20 Intr - 248041 247890  152  2  2   71   94   229 0.991  21.68
 9.19 Intr - 248812 248645  168  2  0   33   28   129 0.469   1.32
 9.18 Intr - 249818 249566  253  2  1  124   40   336 0.692  29.41
 9.17 Intr - 251056 250858  199  0  1   84   76   271 0.909  24.85
 9.16 Intr - 252869 252531  339  1  0  111   81   203 0.965  16.49
 9.15 Intr - 253922 253800  123  1  0   63   87   209 0.999  18.00
 9.14 Intr - 254167 254013  155  1  2  112   94   204 0.992  22.27
 9.13 Intr - 255485 255422   64  1  1   90  116    -9 0.967   0.82
 9.12 Intr - 255806 255671  136  0  1  138   94   134 0.999  18.63
 9.11 Intr - 256849 256700  150  2  0   43   51    95 0.684   1.43
 9.10 Intr - 258948 258107  842  2  2   73   99  1260 0.999 116.63
 9.09 Intr - 259321 259149  173  1  2  113  115   370 0.999  40.94
 9.08 Intr - 260543 260379  165  2  0   82   51   318 0.999  27.66
 9.07 Intr - 261466 261376   91  0  1  140   96   168 0.835  22.80
 9.06 Intr - 262107 261979  129  2  0   59   44   116 0.938   4.11
 9.05 Intr - 263306 263238   69  1  0   88   96    77 0.991   6.80
 9.04 Intr - 265266 264432  835  1  1  103   80  1363 0.975 127.45
 9.03 Intr - 270301 269813  489  2  0  121   94   751 0.995  71.98
 9.02 Intr - 273649 273645    5  0  2  124  110     0 0.482  -1.43
 9.01 Init - 274412 274381   32  2  2   76   99    85 0.650   5.62
 9.00 Prom - 274617 274578   40                              -6.16

10.00 Prom + 275887 275926   40                              -4.66
10.01 Init + 276460 276582  123  0  0   70   49    75 0.617   1.83
10.02 Term + 279057 279308  252  2  0   55   41   291 0.814  16.74
10.03 PlyA + 279929 279934    6                               1.05

11.00 Prom + 283008 283047   40                              -6.66
11.01 Init + 284989 285113  125  0  2   50   62   128 0.608   4.04
11.02 Intr + 288406 288602  197  1  2   89   75    26 0.185   0.46
11.03 Intr + 290249 290288   40  0  1  108   92    11 0.379   0.78
11.04 Intr + 293629 293740  112  1  1   47   59    93 0.271   2.68
11.05 Intr + 299245 299354  110  0  2   24   78   110 0.057   2.68
11.06 Intr + 302498 302619  122  2  2   59   41    97 0.012   2.24
11.07 Intr + 306927 307079  153  1  0  109   77     2 0.005   1.24
11.08 Intr + 308165 308339  175  0  1   88   -2   128 0.004   2.80
11.09 Intr + 314041 314224  184  1  1   94   35   115 0.028   6.59
11.10 Intr + 322568 322774  207  1  0   43   90    73 0.508   2.27
11.11 Intr + 324930 325079  150  2  0   86   48   169 0.953  13.06
11.12 Intr + 327536 327699  164  1  2   94   83   116 0.607  10.47
11.13 Intr + 330815 330891   77  2  2   53   11   116 0.607  -0.44
11.14 Intr + 331262 331371  110  0  2   81   41   100 0.454   4.60
11.15 Intr + 333060 333097   38  2  2  113   77    34 0.339   1.86
11.16 Intr + 334326 334435  110  0  2   25   88    60 0.063  -0.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len
Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 51875 51771 105 2 0 31 56 125 0.807 3.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_1|549_aa MAPPPSIRLAGAEKPGVSGRSFWREPLRVFPSLVLRASPLFGSALSAAMAQADIALIGLA VMGQNLILNMNDHGFVVCAFNRTVSKVDDFLANEAKGTKVVGAQSLKEMVSKLKKPRRII LLVKAGQAVDDFIEKLVPLLDTGDIIIDGGNSEYRDTTRRCRDLKAKGILFVGSGVSGGE EGARYGPSLMPGGNKEAWPHIKTIFQGIAAKVGDEGAGHFVKMVHNGIEYGDMQLICEAY HLMKDVLGMAQDEMAQAFEDWNKTELDSFLIEITANILKFQDTDGKHLLPKIRDSAGQKG TGKWTAISALEYGVPVTLIGEAVFARCLSSLKDERIQASKKLKGPQKFQFDGDKKSFLED IRKNIGPSGISTADENKTGRHKAVTLLMAILALYASKIISYAQGFMLLRQAATEFGWTLN YGGIALMWRGGCIIRSVFLGKIKDAFDRNPELQNLLLDDFFKSAVENCQDSWRRAVSTGV QAGIPMPCFTTALSFYDGYRHEMLPASLIQAQRDYFGAHTYELLAKPGQFIHTNWTGHGG TVSSSSYNA >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_1|1650_bp atggctccacccccttccattcgattggccggcgccgaaaagccgggcgtgagcggccgc agtttctggagggagccgctgcgggtctttccctcactcgtcctccgcgcgtcgccgctc ttcggttctgctctgtccgccgccatggcccaagctgacatcgcgctgatcggattggcc gtcatgggccagaacttaattctgaacatgaatgaccacggctttgtggtctgtgctttt aataggactgtctccaaagttgatgatttcttggccaatgaggcaaagggaaccaaagtg gtgggtgcccagtccctgaaagagatggtctccaagctgaagaagccccggcggatcatc ctcctggtgaaggctgggcaagctgtggatgatttcatcgagaaattggtaccattgttg gatactggtgacatcatcattgacggaggaaattctgaatatagggacaccacaagacgg tgccgagacctcaaggccaagggaattttatttgtggggagcggagtcagtggtggagag gaaggggcccggtatggcccatcgctcatgccaggagggaacaaagaagcgtggccccac atcaagaccatcttccaaggcattgctgcaaaagtgggagatgagggagcaggccacttc gtgaagatggtgcacaacgggatagagtatggggacatgcagctgatctgtgaggcatac cacctgatgaaagacgtgctgggcatggcgcaggacgagatggcccaggcctttgaggat tggaataagacagagctagactcattcctgattgaaatcacagccaatattctcaagttc caagacaccgatggcaaacacctgctgccaaagatcagggacagcgcggggcagaagggc acagggaagtggaccgccatctccgccctggaatacggcgtacccgtcaccctcattgga gaagctgtctttgctcggtgcttatcatctctgaaggatgagagaattcaagctagcaaa 
aagctgaagggtccccagaagttccagtttgatggtgataagaaatcattcctggaggac attcggaagaatattggcccttctgggatctccactgctgatgagaataagactggtaga cataaggcggtcactctcctaatggcaatcctagcactctacgcttccaagatcatctct tacgctcaaggctttatgctgctaaggcaggcagccaccgagtttggctggactctcaat tatggtggcatcgccctgatgtggagagggggctgcatcattagaagtgtattcctagga aagataaaggatgcatttgatcgaaacccggaacttcagaacctcctactggacgacttc tttaagtcagctgttgaaaactgccaggactcctggcggcgggcagtcagcactggggtc caggctggcattcccatgccctgttttaccactgccctctccttctatgacgggtacaga catgagatgcttccagccagcctcatccaggctcagcgggattacttcggggctcacacc tatgaactcttggccaaaccagggcagtttatccacaccaactggacaggccatggtggc accgtgtcatcctcgtcatacaatgcctga >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_2|156_aa MRPHSPALGWSMGLGAVEQGATLIGEARATQEPTEWGRPAVMEEEAETEEQQRFSYQQRL KAAVHYTVGCLCEEVALDKEMQFSKQTIAAISELTFRQCENFAKDLEMFARMGYLEIPAS LKSDRELLCFIKTLNNTKHFGDLLLPFLQTCEKNHN >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_2|471_bp atgcgcccacactccccagcccttgggtggtcgatgggactgggcgccgtggagcagggg gccacgctcatcggggaggctcgggccacacaggagcccacggagtggggtcggcccgca gtgatggaggaggaggcggagaccgaggagcagcagcgattctcttaccaacagaggcta aaggcagcagttcactatactgtgggttgtctttgcgaggaagttgcattggacaaagag atgcagttcagcaaacagaccattgcggccatttcggagctgactttccgacagtgtgaa aattttgccaaagaccttgaaatgtttgcaaggatgggctacttagagatccctgctagt ttgaagtctgaccgtgaacttctgtgtttcatcaaaactttaaataataccaaacatttt ggtgacttgcttctgccctttctgcagacatgcgaaaagaaccacaattaa >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_3|43_aa MPLSPGLLLLLLSGATATAALPLEGGPTGRDSEDLAPEQLPNC >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_3|132_bp atgccattgtcccccggcctcctgctgctgctgctctccggggccacggccaccgctgcc ctgcccctggagggtggccccaccggccgagacagcgaggatttggcaccagaacagctc cctaactgctga >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_4|366_aa MEVTGDAGVPESGEIRTLKPCLLRRNYSREQHGVAASCLEDLRSKGWLGTPGEGVGEPGP GTYAGKRELLLTISGGPDKEPCDILAIDKSLTPVTLVLAEDGTIVDDDDYFLCLPSNTKF VALASNEKWAYNNSDGGTAWISQESFDVDETDSGAGLKWKNVARQLKEDLSSIILLSEED 
LQMLVDAPCSDLAQELRQSCATVQRLQHTLQQVLDQREEVRQSKQLLQLYLQALEKEGSL LSKQEESKAAFGEEVDAVDTGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTK EDPKALAVALNWDIKKTETVQEACERELALRLQQTQSLHSLRSISASKASPPGDLQNPKR ARQDPT >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_4|1101_bp atggaggtgaccggggacgccggggtaccagaatctggcgagatccggactctaaagccg tgtctgctgcgccgcaactacagccgcgaacagcacggcgtggccgcctcctgcctcgaa gacctgaggagcaagggttggctcgggaccccgggcgagggtgtgggggagccagggccg ggaacctatgcaggaaagagggagctgctactgaccatttctggaggtccagacaaggaa ccctgtgacattctggccattgataagtccctgacaccagtcaccctggtcctggcagag gatggcaccatagtggatgatgacgattactttctgtgtctaccttccaatactaagttt gtggcattggctagtaatgagaaatgggcatacaacaattcagatggaggtacagcttgg atttcccaagagtcctttgatgtagatgaaacagacagcggggcagggttgaagtggaag aatgtggccaggcagctgaaagaagatctgtccagcatcatcctcctatcagaggaggac ctccagatgcttgttgacgctccctgctcagacctggctcaggaactacgtcagagttgt gccaccgtccagcggctgcagcacacactccaacaggtgcttgaccaaagagaggaagtg cgtcagtccaagcagctcctgcagctgtacctccaggctttggagaaagagggcagcctc ttgtcaaagcaggaagagtccaaagctgcctttggtgaggaggtggatgcagtagacacg ggtatcagcagagagacctcctcggacgttgcgctggcgagccacatccttactgcactg agggagaagcaggctccagagctgagcttatctagtcaggatttggagttggttaccaag gaagaccccaaagcactggctgttgccttgaactgggacataaagaagacggagactgtt caggaggcctgtgagcgggagctcgccctgcgcctgcagcagacgcagagcttgcattct ctccggagcatctcagcaagcaaggcctcaccacctggtgacctgcagaatcctaagcga gccagacaggatcccacatag >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_5|42_aa MASSEQAEQPSQPSSTPGSENVLPREPLGASASVVFKEQGLL >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_5|129_bp atggcgtcctcggagcaggcagagcagccgagccagccaagctctactccaggaagtgaa aatgtgctgcctcgagagccgctgggtgcgagcgcctcagtggtctttaaagaacagggc ctgctgtaa >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_6|249_aa MECSVISNSFKKNVGDIAKNSINERIQFKHQTHQKTEESQQSRSATNGMPPSPRSCYRMS IPNLKIQNPKCSKIQNFLSANMTVKGHAQRKCSLEHFGFGIFRFEMLNQQTSSSAACTFN DAYRCTTATEQDVENEEVNFAALTLTMRNLWMLIETQIKDLPEGAEKVEIKDPGYCIFFT YADSSDSLETCRHHFKSWSPEQIAQLSNLLPASKRQNNKRLLPPRSYQGHSTLLYRHHMK RSYYLLSIP 
>gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_6|750_bp atggagtgctctgtcattagtaattcatttaagaagaacgtgggggatatagcaaagaac agcataaacgagcgaatccaatttaaacatcagacccaccagaaaacagaagaaagtcaa caatcaagatctgctactaacgggatgccaccaagtcctagatcctgctacaggatgagc atccctaatctgaaaatccaaaatccaaaatgctccaaaatccagaactttttgagtgcc aacatgacagtcaaaggtcatgcacaaaggaaatgctcactggagcatttcggatttggg attttcagatttgagatgctcaaccaacaaaccagcagctctgctgcctgcacatttaac gatgcctatcggtgcaccacagctacggaacaggatgtggagaacgaggaggtgaacttt gctgccttaactctgacaatgaggaatttgtggatgttgatagaaacacagataaaggat ttacctgagggggcagagaaagtagaaataaaagatcctggatattgtattttctttact tatgcggattccagtgactccctggagacctgcaggcaccactttaagagctggagtcca gagcaaattgctcaattgtccaaccttctccccgccagcaagagacagaacaacaagcgg ctgctgccccctagaagctaccaggggcatagcaccttgctatatagacatcacatgaaa agatcctattatctcctctccatcccataa >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_7|172_aa MAVSEEVYAEVITEGQNEASSAVMMQRPATGVQGSPENCRGQISKQVDCHKQISGNTCRY NREGQTQGRADEETDDKGVKRSFLEAEFSVEESPSRIRAAGIKLPNAPASSALCNDIHAN GDELGFMEVRLLRMDLKGLPKGEPLIGKKKCSKTKPVLSPLVALTPEKGFYR >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_7|519_bp atggccgtatctgaggaggtatatgctgaagtaattacagagggccagaatgaagcaagt tctgctgttatgatgcaacgcccggccacaggtgtgcagggaagccctgaaaattgccga ggccagatcagcaaacaggttgactgtcacaagcagatcagtggaaacacatgcagatat aacagagaaggacaaactcagggaagagcagatgaggaaacagatgacaaaggagtgaag aggagctttctggaagctgagttttcagtggaagaatctccaagccgaatccgtgctgct gggataaagctgcctaatgcaccagcctccagcgcgctgtgtaatgacatccacgctaat ggagatgagctcggcttcatggaggttcgactcctgagaatggatcttaaaggactgcct aaaggagagcctttgattgggaaaaaaaaatgttccaaaaccaaacccgttttgtctccc ctggtagctctcactcctgagaaaggcttttaccgctaa >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_8|362_aa MGLGHRDVLCWLHGSTLTLKAGSSQHQEPMTGQITSCDLETGLTDEEIDMAFQQSGTAAD EPSSLGPATQVVPVQPPHLISQPYSPAGSRWRDYGALAIIMAGIAFGFHQLYKKYLLPLI LGGREDRKQLERMEAGLSELSGSVAQTVTQLQTTLASVQELLIQQQQKIQELAHELAAAK ATTSTNWILESQNINELKSEINSLKGLLLNRRQFPPSPSAPKIPSWQIPVKSPSPSSPAA 
VNHHSSSDISPVSNESTSSSPGKEGHSPEGSTVTYHLLGPQEEGEGVVDVKGQVRMEVQG EEEKREDKEDEEDEEDDDVSHVDEEDCLGVQREDRRGGDGQINEQVEKLRRPEGASNESE RD >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_8|1089_bp atggggctagggcaccgggatgtcctatgttggctccatggcagcactttgaccctgaag gcagggtcatcacagcaccaagagccaatgactgggcagataactagttgtgatctggag acagggctgacagatgaagagattgatatggccttccagcagtcgggcactgctgccgat gagccttcgtccttgggcccagccacacaggtggttcctgtccagccccctcacctcata tctcagccatacagtcccgcaggctcccgatggcgagattacggcgccctggccatcatc atggcaggcattgcatttggctttcaccagctctacaagaaatacctgctccccctcatc ctgggcggccgagaggacagaaagcagctggagaggatggaggccggtctctctgagctg agtggcagcgtggcccagacagtgactcagttacagacgaccctcgcctccgtccaggag ctgctgattcagcagcagcagaagatccaggagcttgcccacgagctggccgctgccaag gccaccacatccaccaactggatcctggagtcccagaatatcaacgaactcaagtccgaa attaactccttgaaagggcttcttttaaatcggaggcagttccctccatccccatcagcc ccgaagatcccctcctggcagatcccagtcaagtcaccgtcaccctccagccctgcggcc gtgaaccaccacagcagcagcgacatctcacctgtcagcaacgagtccacgtcgtcctcg cctgggaaggagggccacagccccgagggctccacggtcacctaccacttgctgggcccc caggaggaaggcgagggggtggtggacgtcaagggccaggtgcggatggaggtgcaaggc gaggaggagaagagggaggacaaggaggacgaggaggatgaggaggatgatgatgtgagc catgtggacgaggaggactgcctgggggtgcagagggaggaccgccggggcggggatggg cagatcaacgagcaggtggagaagctgcggcggcccgagggcgccagcaacgagagtgag cgggactag >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_9|2042_aa MPAGGGLALTRPAEGTRCTDPPAGKPAMAPKRKGGLKLNAICAKLSRQVVVEKRADAGSH TEGSPSQPRDQERSGPESGAARAPRSEEDKRRAVIEKWVNGEYSEEPAPTPVLGRIAREG LELPPEGVYMVQPQGCSDEEDHAEEPSKDGGALEEKDSDGAASKEDSGPSTRQASGEASS LRDYAASTMTEFLGMFGYDDQNTRDELARKISFEKLHAGSTPEAATSSMLPTSEDTLSKR ARFSKYEEYIRKLKAGEQLSWPAPSTKTEERVGKEVVGTLPGLRLPSSTAHLETKATILP LPSHSSVQMQNLVARASKYDFFIQKLKTGENLRPQNGSTYKKPSKYDLENVKYLHLFKPG EGSPDMGGAIAFKTGKVGRPSKYDVRGIQKPGPAKVPPTPSLAPAPLASVPSAPSAPGPG PEPPASLSFNTPEYLKSTFSKTDSITTGTVSTVKNGLPTDKPAVTEDVNIYQKYIARVAS PRPAVTEAVGATSQRPWGLEEDGFPGRQHRFLAVAAAAGEFSGSQHCGHIHCAYQYREHY HCLDPECNYQRFTSKQDVIRHYNMHKKRDNSLQHGFMRFSPLDDCSVYYHGCHLNGKSTH 
YHCMQVGCNKVYTSTSDVMTHENFHKKNTQLINDGFQRFRATEDCGTADCQFYGQKTTHF HCRRPGCTFTFKNKCDIEKHKSYHIKDDAYAKDGFKKFYKYEECKYEGCVYSKATNHFHC IRAGCGFTFTSTSQMTSHKRKHERRHIRSSGALGLPPSLLGAKDTEHEESSNDDLVDFSA LSSKNSSLSASPTSQQSSASLAAATAATEAGPSATKPPNSKISGLLPQGLPGSIPLALAL SNSGLPTPTPYFPILAGRGSTSLPVGTPSLLGAVSSGSAASATPDTPTLVASGAGDSAPV AAASVPAPPASIMERISASKGLISPMMARLAAAALKPSATFDPGDFGELLECEVPRSENS HRPNRTLMPGGRVEGPPPGPGMGGASLATPFQPGSGQQVTPARFPPAQVKPEPGESTGAP GPHEASQDRSLDLTVKEPSNESNGHAVPANSSLLSSLMNKMSQGNPGLGSLLNIKAEAEG SPAAEPSPFLGKAVKALVQEKLAEPWKVYLRRFGTKDFCDGQCDFLHKAHFHCVVEECGA LFSTLDGAIKHANFHFRTEGGAAKGNTEAAFPASAAETKPPMAPSSPPVPPVTTATVSSL EGPAPSPASVPSTPTLLAWKQLASTIPQMPQIPASVPHLPASPLATTSLENAKPQVKPGF LQFQENDPCLATDCKYANKFHFHCLFGNCKYVCKTSGKAESHCLDHINPNNNLVNVRDQF AYYSLQCLCPNQHCEFRMRGHYHCLRTGCYFVTNITTKLPWHIKKHEKAERRAANGFKYF TKREECGRLGTEGWGCGQGLAAGIKTMAPVAWAASPVKGDILKVNCQKKETNLAFKYEFK KEFSARERTGSHANKRQHLRPVLSLGSVYRCQGCKYNQVNSHFHCIREGCQFSFLLKHQM TSHARKHMRRMLGKNFDRVPPSQGPPGLMDAETDECMDYTGCSPGAMSSESSTMDRSCSS TPVGNESTAAGCPAPPPPPLPPPAAAGDEAARTAPQPPATPSFSPATLRPPLPPLPCLFS PSCLSYSLLSATLGAGRGLAHPTCSPPSFPPVTATPTPVKSDVPLVQDAAGNTISMPTAS GAKKRFWIIEDMSPFGKRRKTASSRKMLDEGMMLEGFRRFDLYEDCKDAACQFSLKVTHY HCTRENCGYKFCGRTHMYKHAQHHDRVDNLVLDDFKRFKASLSCHFADCPFSGTSTHFHC LRCRFRCTDSTKVTAHRKHHGKQDVISAAGFCQFSSSADCAVPDCKYKLKCSHFHCTFPG CRHTVVGMSQMDSHKRKHEKQERGEPAAEGPAPGPPISLDGSLSLGAEPGSLLFLQSAAA GLGLALGDAGDPGPPDAAAPGPREGAAAAAAAAGESSQEDEEEELELPEEEAEDDEDEDD DEDDDDEDDDEDDDDEDLRTDSEESLPEAAAEAAGAGARTPALAALAALGAPGPAPTAAS SP >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_9|6129_bp atgcctgcgggagggggcctggccctgaccaggccagctgagggcacccggtgcacggac ccgcctgcaggcaagcccgccatggcgcccaaacgcaagggtggcctgaagctgaacgcc atctgcgccaagctgagccgccaggtggtggtggagaagcgagctgacgccggctcccac acggagggcagcccatcgcagccccgggaccaagagcgcagtggccctgagtctggggca gcccgggccccccgcagcgaggaagacaagagacgggcagtgatcgagaagtgggtgaac ggggagtacagcgaggagccggcacccacacccgtgttggggcggattgcccgcgagggc ctggagctgcctcccgagggtgtctacatggtgcagccccaggggtgcagcgatgaggaa 
gaccacgcggaggagccctccaaggacggcggtgccctggaggagaaggattcggacggg gcagcctccaaggaggacagcggccccagcaccaggcaggcttcaggagaggcctcctcg ctgcgggactacgcggcctccaccatgaccgagttcctcggcatgtttggctatgatgac cagaacacgcgggacgagctggccaggaagatcagctttgagaagctgcacgcgggctcc accccggaggcagccacctcctccatgctgcccacctccgaggataccctcagcaagcgg gcgcggttctctaagtatgaggagtacatccgcaagctcaaggctggcgagcagctctcc tggccggcccccagcaccaagaccgaggagcgggtgggcaaggaggtggtgggcaccctg cccggcctgcggctgcccagcagcacggcccacctggagaccaaggccaccatcctgccc ctgccgtcgcacagcagtgtccagatgcagaacctggtagcccgggcctccaagtacgac ttcttcatccaaaaactgaagaccggcgagaatctgcggccccagaacgggagcacctac aagaagccatccaagtacgacctggagaatgtcaagtacctgcacctcttcaaacccggg gagggcagccccgacatgggcggggccatcgccttcaagacaggcaaggtggggcgccct tccaagtacgacgtccggggcatccagaagccaggccccgccaaggttccgcccaccccc agcctggctcccgcacccctcgccagcgtgcccagtgcccccagcgcccccgggccaggg ccagagcctcctgcctccctgtccttcaacactcccgagtacctgaagtcaaccttctcc aaaacagactccatcaccacggggaccgtctccactgtcaagaacggactgcccacagat aaaccagccgtcactgaagatgtaaacatttaccagaaatatattgccagagtggcctcc ccacggccggccgtcaccgaggctgtgggagccacgtcccagcggccctggggcctggag gaggacgggtttcctgggaggcagcacaggttcctggcggtggcggcggcggctggagag ttctcgggcagccagcactgtggccacatccactgtgcctaccagtaccgcgagcactac cactgccttgaccctgagtgtaactaccagaggttcacgagtaagcaggacgtgatccgc cactacaacatgcacaagaagcgcgacaactccctgcagcacggcttcatgcgtttcagc ccgctggacgactgcagcgtctactaccacggctgccacctcaatgggaagagcacccac tatcactgcatgcaggtgggctgtaacaaggtgtacacgagcacgtctgacgtgatgacc cacgagaacttccacaagaagaatacccagctcattaacgacggcttccagcgcttccga gccaccgaagactgtggcacagccgactgccagttctacggacagaagaccacgcacttc cactgcaggcgccccggctgcacattcactttcaagaacaagtgtgacatcgagaagcac aagagctaccacatcaaggacgatgcctacgccaaggacggcttcaagaagttctacaag tacgaggagtgcaagtacgagggctgcgtgtacagcaaggctaccaaccacttccactgc atccgcgccggctgcggcttcaccttcacctccaccagccagatgacctctcacaagcgc aagcatgagcgccggcacatccgctcctcgggcgcgctggggctgccgccctcgctgctg ggcgccaaggacacggagcacgaggagtccagcaacgacgaccttgttgacttctccgcc 
ctgagcagcaagaactccagcctgagcgcctcccctaccagccagcagtcctctgcgtcc ctggctgccgccactgccgccaccgaggctgggcccagtgccaccaaacctcccaacagc aagatctcggggctgctgccccagggcctgcctggctcaatccccctggccctggccctc tccaactcgggcctgcccacccccacgccctacttccccatactggctggccgtgggagc acctccctgcctgtgggcacccccagcctcctgggtgccgtgtcgtctgggtcagcagcc tcagccacccctgacacacccacgctggtcgcctcgggagctggagactcagcccccgtg gctgccgcctctgtcccggcaccacccgcctccatcatggagaggatctctgcaagcaag ggcctcatctcgcccatgatggccaggctggctgcagctgccctcaagccctctgccacc tttgacccaggtgactttggagagctgctggaatgtgaagtcccaaggtctgaaaactct cacaggcccaataggacgctcatgcccggtggccgcgtggagggaccgcctcctggtcca ggcatgggtggggcctcgctggccacgcccttccagccaggaagcgggcagcaggtcacc ccagccaggttccccccggcccaagtgaagccggaacccggtgagagcaccggcgcccca ggcccccacgaagcctcccaggaccgcagtctagacctgactgtgaaggagcccagcaac gaatcaaatggccacgcagtcccggcaaattcatctcttttatcctcgcttatgaataag atgtctcagggcaaccctggcctgggcagcctgctgaacatcaaggcggaagcggagggg agccccgctgcggagccctcgcccttcctaggcaaggccgtgaaggcgctggttcaggag aagttggcagagccctggaaggtgtacctgcgcaggtttggtacaaaggacttctgtgac ggccagtgtgacttcctccacaaggcccacttccactgcgtggtggaggaatgcggcgcg ctcttcagcaccttggacggggccatcaagcacgcaaacttccacttccggacagaggga ggagcagcaaaaggaaacacagaggctgcctttccggcctcggccgccgagaccaaacct cccatggccccctcgtcccctccggtccctcctgtcaccacggccacggtgtcctctctg gaggggcccgctcccagcccggcctccgtgccctccacccccaccctgctcgcctggaag cagctggcttccaccataccccagatgcctcagatcccagcgtcagtgcctcacctgccc gcctcgcccttggcaacgacttctctagagaacgccaagccccaggtcaaacccggattc ctccagttccaggagaacgatccttgcctcgccacggactgcaagtacgccaacaagttc cacttccactgtctctttgggaactgcaagtacgtctgcaaaacgtctggcaaggccgaa tcccactgcctggaccacatcaaccccaacaacaacctggtgaacgtgcgagaccagttt gcatactactctctgcagtgtctctgtcccaaccagcactgcgagttccgaatgcgtggg cactaccactgcctccgcaccggctgctattttgtgaccaacatcaccaccaagctcccc tggcacatcaagaagcatgagaaggcggagcggcgggcagccaatggcttcaaatacttc accaagcgcgaggagtgtggcaggctaggtaccgagggctggggctgcgggcagggactt gcagcaggcatcaagaccatggcccccgtggcctgggccgcctctcctgtgaaaggtgac 
atcctcaaagtcaactgccagaaaaaggaaacaaaccttgccttcaagtatgaattcaaa aaggagttctcggccagggagaggactggtagccatgccaataagcgtcaacacttgcgg cctgtgctgtccctgggctcagtttacagatgtcagggttgcaagtacaaccaggtgaac agccacttccactgcatccgggagggctgccagttctccttcctcctcaagcaccagatg acctcccacgcgcggaagcacatgcggaggatgctggggaagaacttcgaccgcgtgccc ccctcccagggccccccaggcctgatggatgctgagacagatgagtgcatggactacact ggctgcagcccaggcgccatgtcctctgagtcatccaccatggaccggagctgctccagc acccccgtgggtaacgagagcaccgcggcaggctgcccggctcctcctcctcctcctctt cctcctcctgcggccgctggtgatgaggctgcccgcacggccccgcagccgccagcgact ccatccttctcgccggccacgctccggccgcccctcccccccctcccctgcctcttctct ccgtcctgtctctcatactctctgctcagtgccactctcggagccggccggggcctggcc catcccacctgcagcccgcccagcttcccgcccgtcactgccactccaactccagtaaaa agtgacgtccccctagttcaggatgctgcagggaacaccatctctatgccgacagcctcg ggggccaaaaagcgcttctggatcatcgaggacatgtcgcccttcggcaagcggcggaag acggcgtcctcccggaagatgctggacgagggcatgatgctggagggcttccggcgcttc gacctttacgaggactgcaaggacgcagcttgtcagttctcgctcaaggtcacccactac cactgcacgcgcgagaactgcggctacaagttctgcgggcgcacgcacatgtacaagcac gcgcagcaccacgaccgcgtggacaacctggtgctggacgacttcaagcgcttcaaggcc tcactcagctgccacttcgccgactgccccttctcgggcaccagcacgcacttccactgc ctgcgctgccgcttccgctgcaccgacagcaccaaggtcacggcgcatcgcaagcaccac ggcaaacaggacgtgatcagcgccgcgggcttctgccagttcagctccagcgccgactgc gccgtgcccgactgcaagtacaagctcaagtgctcgcacttccactgcaccttcccgggc tgccgccacacggtggtgggcatgtcgcagatggactcgcacaagcgcaagcacgagaag caggagcgcggcgagcccgcggcagagggccccgcgcccgggcctcccatcagtctggac ggctccctgtcgctgggcgccgagccgggctcgctgctcttcctgcagtcggcggccgcc ggcctgggcctggcgctgggcgacgcgggcgaccccggcccgcccgacgccgccgccccc gggccgcgcgagggcgccgccgccgccgccgccgcagctggggagtcctcgcaggaggac gaggaggaggagctggagctgccggaggaggaggccgaagacgacgaggacgaggacgac gacgaggacgacgacgacgaggacgacgacgaggacgacgacgacgaggacctgcgcacc gactcggaggagtcgctgcccgaggcggcggcggaggcggcgggcgcgggcgcgcggacc ccggccttggcggcgctggcggccctgggcgcccccggccccgcgcccactgcagcctcc tcgccctag >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_10|124_aa 
MMSPRVLTRYARGWHTPDAALGSGKAASSKWVAKCHAVHWRAKRNLRDARPKPPPADADS PESSVGSEPSPAPLLREGVQPSEATVGHLGFGNFHIRQIDSRDLAPWKAHHFSVFPALGS GPQL >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_10|375_bp atgatgtcacctcgggtccttactcgctatgcccgaggctggcacaccccagatgcggca ctgggctcaggaaaggcagcatcatctaaatgggtggccaaatgccatgcagtgcactgg agggcaaagaggaacctgagagatgcccggcccaagcccccaccagccgacgcagacagt cctgaatccagcgtggggtccgagccttccccggcaccgctgctgcgggagggcgtccag ccctctgaggccaccgtggggcacctgggctttggcaacttccacataagacaaatcgac tccagagacctggccccttggaaagctcatcacttcagcgtcttccctgccctgggctcg ggtccccaactgtaa >gi568815597f:10395271_10729984|GENSCAN_predicted_peptide_11|692_aa MARGGAAGDGGGEGGARPALGVEEGAELGPPGPVFWTPGTHRDKSECVPPTCNPPLRTDG STRFCKDGHSGMPLEKEAFPQKLKMHLHQQLHFILHPCSPGMCLQTTEINFEINNQRHRP GYPRTAAACRGDPRAVITTVQLPSPSEDEEEKEEKEEEGGRGAASSARPAPAPAGERAER GPAAAAAAAAAAAARIPINWFHLRRLGQGTEALAKGLRVPANKHVRTCEGVRVFACQPWR GLHHPLALASADPLLSVFPMHLHPFKERAQLPEGPQRPPTGSEASEAAPDNSTFTASPRH PSSSLPQAWEADSSLQIEKLALRGYAASQGPDTEIEDPGLPVHDLPILTHSLSLASPFTG AGAMSARQCEEGGGIDNRGSVMETGTFPEKYSVWPPSCIEELSGARNNPVVGQQPSVGGR VRAGWWAFKAVQPLRARLVASATKAGAGTGRGGYGGPFNTGQTHSAKPDFTHPWEGAVVR VWGSDEGKALSKVSTHFYVENVVPVIIVIIVIIIAIAKHFSNQDMASRKSWRQGCMGQTL SPPKRPAPAPQKQAQQDAIRGSLEAFREEHCPGTSPAGRRAYTITTTTTTSTTTTAATTT SMTINCQAEGCMPIRPEETPGTGNGALASVFTRPWRCAAPAWTTTHHHFMDHGKRPLPET LSGGHKVPIVEHSFYVPYSTWGHYADSAKAAX >gi568815597f:10395271_10729984|GENSCAN_predicted_CDS_11|2076_bp atggcacggggcggggcggcgggggatggtggaggggaggggggtgcgaggccggctctg ggggtggaggaaggagctgaacttggacccccagggcctgtcttctggacaccagggacc cacagagacaagtcggagtgcgttccacccacctgcaaccctcccctgcggacagatggg agcactcgtttctgcaaagatgggcactccgggatgcctctagaaaaggaggctttcccc cagaagctcaaaatgcatctgcatcaacaactacattttatcttgcatccctgcagccca gggatgtgtttgcagactacagaaatcaactttgaaattaataatcagagacacagacct ggctaccccagaacagctgctgcttgccgtggggacccacgggcggtaattactacagtg caacttccttccccctctgaagatgaggaggagaaggaggaaaaggaggaagagggcggc cgcggcgccgcctcctcggcccggcccgcgccggccccggcaggtgaaagagcagagcgc 
ggccccgccgccgccgccgccgccgccgccgccgccgccgcccggatcccaataaactgg ttccatctgaggcgactgggccagggcacagaagccttggccaaaggactgcgtgtgcct gcgaacaagcatgttcgcacctgcgagggagtccgtgtgtttgcgtgccagccctggcgg ggtcttcatcaccctcttgccttggcttcagcagatcccctgctttctgtcttccctatg cacctgcaccctttcaaggagcgcgcccagcttccagaaggtccacagaggcctcccact ggctcagaggccagtgaagcagctcccgataactcaaccttcacagcttctcccaggcac cccagctcctccttgccacaagcctgggaagcagacagcagtttacaaatagagaaactg gcgctcagaggttatgctgcttcccagggccctgacacagaaatagaagacccagggctg ccagtccacgacctccccatcctcacgcacagcctctccctggccagccctttcacaggg gctggggccatgtcagcccggcagtgtgaggaggggggcggcatagacaacaggggcagc gtcatggaaacgggcactttcccggagaaatattctgtttggcctccttcctgtattgag gagctgtcaggagcccgaaacaatcctgtggtaggacaacagccatcagtgggaggaaga gtccgggcaggctggtgggcctttaaggctgtgcagccgctgcgagcacgcctggtcgcc tcagcgaccaaagcgggtgcagggacgggccgagggggctacggaggcccattcaacact gggcagacacactcagcaaagcctgacttcacacatccctgggagggggctgtggtgagg gtttggggaagcgatgagggtaaagctcttagcaaagtgtctacacatttctatgtggaa aatgtggtgccagtcatcattgtgatcattgtcatcatcatcgccatcgctaaacacttc tctaaccaggatatggcctctaggaagagctggcgccagggctgcatggggcagacactc tcccccccaaaaaggcctgcccccgcaccccagaaacaggcccaacaggatgccatccgg gggagcttggaagccttccgggaagagcactgccctggcaccagtcctgcgggaaggagg gcctatactatcaccaccaccactactaccagtactactactactgctgctactaccacc agcatgaccattaattgccaagctgaaggctgcatgccaatccggccagaggaaacaccg ggcacggggaatggagccctcgccagcgtctttactaggccctggcgctgtgcggcccct gcctggacaaccacgcaccatcattttatggaccacgggaaaaggcctcttcccgagacc ctctcaggaggccataaagtccccatagtggaacacagtttctacgtgccctattcaacg tggggacattacgctgacagcgcaaaggcagcggnn