GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:28:49 Sequence gi568815581r:28974773_29266858 : 292086 bp : 49.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 5041 4908 134 2 2 95 113 108 0.917 14.36 1.07 Intr - 7267 6599 669 2 0 84 76 464 0.988 36.47 1.06 Intr - 16917 16797 121 0 1 39 91 13 0.062 -3.23 1.05 Intr - 21089 20946 144 2 0 63 86 73 0.503 5.08 1.04 Intr - 22970 22776 195 2 0 99 72 50 0.742 4.11 1.03 Intr - 23953 23930 24 1 0 105 92 18 0.519 2.12 1.02 Intr - 29791 29583 209 2 2 33 -9 145 0.131 -1.90 1.01 Init - 31097 31043 55 2 1 90 103 229 0.998 24.05 1.00 Prom - 36074 36035 40 -6.86 2.02 PlyA - 37308 37303 6 1.05 2.01 Sngl - 46868 46548 321 2 0 89 53 256 0.985 18.19 2.00 Prom - 55414 55375 40 -3.76 3.05 PlyA - 55626 55621 6 1.05 3.04 Term - 60516 60408 109 2 1 -6 48 263 0.618 10.88 3.03 Intr - 61224 61124 101 0 2 88 60 -12 0.524 -5.09 3.02 Intr - 61418 61323 96 2 0 77 61 71 0.365 3.51 3.01 Init - 62022 61969 54 0 0 88 65 3 0.423 -0.55 3.00 Prom - 64703 64664 40 -5.56 4.00 Prom + 67144 67183 40 -4.06 4.01 Init + 68454 68567 114 2 0 112 84 86 0.968 11.13 4.02 Intr + 70087 70235 149 0 2 88 80 89 0.939 7.13 4.03 Intr + 78148 78361 214 1 1 73 95 84 0.930 6.32 4.04 Intr + 78641 78823 183 1 0 136 110 119 0.999 18.88 4.05 Intr + 79773 79919 147 2 0 81 105 123 0.775 13.73 4.06 Intr + 80291 80449 159 1 0 -2 103 237 0.995 16.38 4.07 Intr + 81041 81116 76 1 1 43 105 90 0.994 5.29 4.08 Term + 81403 81533 131 2 2 104 43 85 0.996 3.94 4.09 PlyA + 82407 82412 6 1.05 5.00 Prom + 83174 83213 40 -7.06 5.01 Init + 85962 86040 79 2 1 74 71 78 0.873 5.92 5.02 Intr + 86598 86815 218 1 2 89 82 98 0.394 7.52 5.03 Term + 92059 92127 69 0 0 112 47 20 0.075 -1.76 5.04 PlyA + 96608 96613 6 1.05 6.03 PlyA - 97998 97993 6 1.05 6.02 Term - 100142 99998 145 1 1 98 41 191 0.992 12.78 6.01 Init - 102356 102256 101 2 2 92 92 1 0.793 0.66 6.00 Prom - 102897 102858 40 -3.96 7.47 PlyA - 103260 103255 6 -0.45 7.46 Term - 106228 104959 1270 0 1 11 39 1377 0.990 116.16 7.45 Intr - 107666 107544 123 1 0 102 99 134 0.855 15.60 7.44 Intr - 111805 111666 140 1 2 94 94 254 0.999 25.86 7.43 Intr - 112349 112164 186 2 0 69 55 358 0.981 30.49 7.42 Intr - 115326 115189 138 0 0 99 90 159 0.992 17.86 7.41 Intr - 115843 115760 84 1 0 110 100 112 0.999 14.62 7.40 Intr - 116154 115945 210 0 0 125 49 334 0.926 32.31 7.39 Intr - 117729 117571 159 0 0 81 77 246 0.877 22.98 7.38 Intr - 118229 118083 147 2 0 59 68 321 0.999 27.63 7.37 Intr - 118655 118551 105 2 0 26 71 238 0.933 16.31 7.36 Intr - 119318 119208 111 2 0 50 116 160 0.940 15.58 7.35 Intr - 120078 119878 201 0 0 100 92 261 0.833 27.18 7.34 Intr - 120287 120164 124 1 1 73 90 194 0.894 18.69 7.33 Intr - 122143 121989 155 1 2 88 90 354 0.999 34.47 7.32 Intr - 122578 122451 128 2 2 103 76 296 0.986 30.30 7.31 Intr - 123127 123016 112 1 1 97 80 257 0.999 25.75 7.30 Intr - 123452 123333 120 2 0 62 94 194 0.999 18.09 7.29 Intr - 123673 123584 90 1 0 108 100 207 0.999 24.09 7.28 Intr - 124197 124054 144 0 0 86 52 251 0.595 21.78 7.27 Intr - 124990 124862 129 1 0 114 48 125 0.994 11.99 7.26 Intr - 128892 128827 66 0 0 80 100 109 0.997 10.40 7.25 Intr - 132417 132308 110 1 2 91 77 152 0.992 14.40 7.24 Intr - 134780 134667 114 0 0 75 77 31 0.375 1.12 7.23 Intr - 135329 135086 244 2 1 81 81 300 0.991 25.67 7.22 Intr - 135850 135664 187 0 1 87 89 305 0.974 30.19 7.21 Intr - 136811 136652 160 0 1 94 63 130 0.999 10.15 7.20 Intr - 137091 136950 142 0 1 82 100 139 0.961 14.43 7.19 Intr - 139325 139239 87 2 0 90 62 185 0.988 16.27 7.18 Intr - 140327 140135 193 1 1 71 55 272 0.998 21.69 7.17 Intr - 140669 140579 91 0 1 63 95 130 0.999 10.25 7.16 Intr - 141068 140892 177 0 0 69 80 232 0.955 20.39 7.15 Intr - 143417 143273 145 2 1 67 86 210 0.849 18.56 7.14 Intr - 143668 143605 64 0 1 85 100 43 0.999 3.92 7.13 Intr - 144663 144563 101 0 2 79 94 158 0.999 14.41 7.12 Intr - 145986 145844 143 1 2 75 94 148 0.916 14.17 7.11 Intr - 146439 146226 214 0 1 84 81 267 0.988 23.89 7.10 Intr - 146951 146775 177 2 0 117 56 238 0.999 23.62 7.09 Intr - 147185 147079 107 0 2 105 100 209 0.999 23.73 7.08 Intr - 147481 147394 88 1 1 115 92 79 0.999 10.54 7.07 Intr - 153772 153080 693 1 0 54 72 539 0.092 40.60 7.06 Intr - 156670 156611 60 1 0 139 87 46 0.121 8.53 7.05 Intr - 163715 163588 128 0 2 101 82 44 0.055 5.50 7.04 Intr - 165501 165408 94 0 1 83 91 6 0.060 -0.06 7.03 Intr - 175791 175579 213 0 0 78 78 47 0.012 1.61 7.02 Intr - 180698 180570 129 2 0 128 28 17 0.014 0.59 7.01 Init - 192168 191170 999 0 0 76 76 1425 0.413 133.50 7.00 Prom - 200593 200554 40 -7.36 8.00 Prom + 200985 201024 40 -4.56 8.01 Init + 202188 202248 61 2 1 96 55 36 0.798 2.45 8.02 Term + 205286 205827 542 0 2 113 54 165 0.756 10.02 8.03 PlyA + 208786 208791 6 1.05 9.05 PlyA - 208847 208842 6 1.05 9.04 Term - 217111 216902 210 1 0 32 48 97 0.137 -2.61 9.03 Intr - 222388 222350 39 1 0 88 75 46 0.312 1.72 9.02 Intr - 223027 222949 79 0 1 118 86 56 0.641 7.95 9.01 Init - 224871 224831 41 0 2 52 63 77 0.711 1.29 9.00 Prom - 226690 226651 40 -5.76 10.02 PlyA - 226807 226802 6 1.05 10.01 Sngl - 229702 228905 798 1 0 99 48 336 0.979 26.56 10.00 Prom - 243751 243712 40 -4.66 11.00 Prom + 245440 245479 40 -0.26 11.01 Init + 250596 250679 84 2 0 77 80 69 0.503 5.82 11.02 Intr + 265861 265887 27 0 0 100 94 -1 0.138 0.01 11.03 Intr + 266825 266913 89 1 2 108 58 62 0.218 3.97 11.04 Intr + 274370 274434 65 2 2 88 121 85 0.098 10.16 11.05 Intr + 275381 275528 148 0 1 87 77 82 0.338 6.29 11.06 Intr + 277292 277433 142 2 1 50 68 88 0.878 3.36 11.07 Intr + 278868 279003 136 2 1 23 9 207 0.689 6.44 11.08 Term + 279430 279581 152 2 2 66 38 171 0.862 7.97 11.09 PlyA + 279697 279702 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 153658 153080 579 1 0 87 72 572 0.877 48.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_1|517_aa MRPVALLLLPSLLALLAHGRKTVAPETQKPFSEPLAPLAHSPVSSPRYRAAHAGSSGSAQ NLKPCAHVLGRGEESCKTHPRDEDREGEVRCAPEEQNSMKIKGNRGGGDGDWEETTRPLE RSPSNSQNCPSPGEADLSGISGGAESYNPGVQSWTAEAAVGQGFRPLLQLNSAKRPPLEL IAFHTLEPSPWVPPPPAAAALETGTSWEKPQFSQTRKEGSRHPILTLFQEPHPGLFLTER CDYCKAEKSGLSLEAPTVGKGQAPGIEETDGELTAAPTPEQPERGVHFVTTAPTLKLLNH HPLLEEFLQEGLEKGDEELRPALPFQPDPPAPFTPSPLPRLANQDSRPVFTSPTPAMAAV PTQPQSKEGPWSPESESPMLRITAPLPPGPSMAVPTLGPGEIASTTPPSRAWTPTQEGPG DMGRPWVAEVVSQGAGIGIQGTITSSTASGDDEETTTTTTIITTTITTVQTPGPCSWNFS GPEGSLDSPTDLSSPTDVGLDCFFYISVYPGYGVEIK >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_1|1551_bp atgcgcccggtagccctgctgctcctgccctcgctgctggcgctcctggctcacggaaga aaaactgtggccccggagacgcagaagcctttttctgagcctctagctccacttgcacac agcccagtgtcctcaccccgctaccgggctgcccatgctggatcctccggcagcgcccag aacttgaaaccgtgtgctcacgttctggggagaggggaagagtcctgcaagacccacccc agggatgaagaccgggaaggagaagtcaggtgtgcccctgaagagcagaacagcatgaag atcaaaggaaatcggggtgggggggatggtgactgggaagaaaccacaagaccccttgag aggagcccctccaacagccagaactgtccatcaccaggagaggctgatttgtcaggcatc tctggaggggctgaatcatacaacccaggagtccagagctggaccgctgaggcagcagtg gggcagggcttcaggcccctactacagctcaactctgccaaaaggccaccattagagctc attgccttccacacccttgagccttcaccctgggtcccaccaccaccagctgcagctgct ctggaaactggcaccagctgggagaagccgcagttttcccagacccgaaaggaggggagc agacaccccatcttaactcttttccaggagccccaccctgggctcttcctcactgagagg tgtgactactgtaaggctgagaagtcaggactctctttagaggccccaaccgtggggaaa ggacaagccccaggcatcgaggagacagatggcgagctgacagcagcccccacacctgag cagccagaacgaggcgtccactttgtcacaacagcccccaccttgaagctgctcaaccac cacccgctgcttgaggaattcctacaagaggggctggaaaagggagatgaggagctgagg ccagcactgcccttccagcctgacccacctgcacccttcaccccaagtccccttccccgc ctggccaaccaggacagccgccctgtctttaccagccccactccagccatggctgcggta cccactcagccccagtccaaggagggaccctggagtccggagtcagagtcccctatgctt cgaatcacagctcccctacctccagggcccagcatggcagtgcccaccctaggcccaggg gagatagccagcactacaccccccagcagagcctggacaccaacccaagagggtcctgga gacatgggaaggccgtgggttgcagaggttgtgtcccagggcgcagggatcgggatccag gggaccatcacctcctccacagcttcaggagatgatgaggagaccaccactaccaccacc atcatcaccaccaccatcaccacagtccagacaccaggcccttgtagctggaatttctca ggcccagagggctctctggactcccctacagacctcagctcccccactgatgttggcctg gactgcttcttctacatctctgtctaccctggctatggcgtggaaatcaag >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_2|106_aa MPFLAIQKRFGLNIDQWWTIQSAEQPYKIAARCHAFEKEWIECAYGISVIRAEKECKIES DDFVECLLRQKTMRRAGTIRKQQDKLIKEGKYTPPPHHIGKEEPQP >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_2|321_bp atgcctttcttggccatccagaaaagattcggccttaacatagatcaatggtggacaatc cagagtgctgaacagccctacaagattgctgctcgatgccatgcttttgaaaaagaatgg atagaatgtgcatatggaatcagtgttatccgggcagagaaagagtgcaagatagaatct gatgatttcgtagagtgtttgcttcggcagaaaacgatgagacgtgcaggtaccatcagg aagcagcaggataagctgataaaggaagggaagtacacccctccacctcaccacattggc aaggaggagcctcagccctga >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_3|119_aa MPVLGWPSKVYQRLTAPSAFGEGPIKAQQAVGKMRNQSAQKWSCFSVEVRPGYHYTLKWG VGERVEIGSRKSKQLLHELDWLDRKKKKKKKKKKKKKKEKEKEEKEEKKEKKEGKRERE >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_3|360_bp atgcctgttctgggctggcccagcaaggtttatcaaaggctgacagcgccctctgctttt ggagaaggccccattaaagctcagcaggcagtgggaaaaatgagaaatcagagtgctcag aagtggtcctgtttctctgtggaggtgaggcctggatatcactataccctaaagtgggga gttggggaaagagtggaaattgggagcaggaaaagtaaacagcttttgcatgagctggat tggcttgataggaagaagaagaagaagaagaagaagaagaagaagaagaagaaggagaaa gagaaggaggagaaggaggagaagaaggagaagaaagaaggaaagagagagagagagtaa >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_4|390_aa MAAQKDLWDAIVIGAGIQGCFTAYHLAKHRKRILLLEQFFLPHSRGSSHGQSRIIRKAYL EDFYTRMMHECYQIWAQLEHEAGTQLHRQTGLLLLGMKENQELKTIQANLSRQRVEHQCL SSEELKQRFPNIRLPRGEVGLLDNSGGVIYAYKALRALQDAIRQLGGIVRDGEKVVEINP GLLVTVKTTSRSYQAKSLVITAGPWTNQLLRPLGIEMPLQTLRINVCYWREMVPGSYGVS QAFPCFLWLGLCPHHIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQILSS FVRDHLPDLKPEPAVIESCMYTNTPDEQFILDRHPKYDNIVIGAGFSGHGFKLAPVVGKI LYELSMKLTPSYDLAPFRISRFPSLGKAHL >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_4|1173_bp atggcggctcagaaagatctctgggacgccattgtgattggggcggggatccagggctgc ttcactgcataccacctggccaaacacaggaagaggatcctcctgctggagcagttcttt ctaccacactcccgaggaagctcccatggacaaagccggataatccgaaaggcgtacctg gaagacttttacacccggatgatgcatgagtgctatcagatatgggcccagctggagcac gaggctggaacccaattgcacaggcagactggattactgctgctgggaatgaaagagaat caagaattaaagacaatccaggccaatctgtcgaggcagagggtagaacaccagtgtctt tcatctgaggaactgaagcaacgtttcccaaatattcggttgcccaggggagaagtgggg ctcttggacaattccggaggagttatctatgcatataaggccctcagagccctgcaggat gcaattcgacagctaggaggcatagtgcgtgacggagagaaggtggtggagataaaccca gggctactggtcacggtgaaaaccacctccaggagctaccaagctaagagcttggtcatc acagcaggtccttggaccaaccagctcctccgtcccctgggcattgagatgcctctccag accctgcggatcaacgtgtgttactggcgagagatggttcctgggagctatggtgtgtcc caggcctttccgtgcttcctgtggctgggcttgtgtccccaccacatctacggactgccc acaggagagtacccagggctgatgaaggtcagctatcaccacggcaaccacgcagaccct gaggagcgggactgccccacagcacgcacagacatcggagacgtccagatcctgagcagc tttgtcagagatcacttacctgatctgaagcccgagcctgctgtcattgagagctgcatg tacacgaatacccctgatgagcagttcattctcgatcgccacccaaagtatgacaacatt gtcattggtgctggattctctgggcacgggttcaagctggcccctgtggtggggaagatc ctgtatgaattaagcatgaaattaacaccatcttatgacttggcaccttttcgaatcagc cgtttcccaagcctgggcaaagcccacctttga >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_5|121_aa MDRASEERRQTRSRKEETGNVQTTKAASPKDEEMVGKSGKTGKEGGESGQSNLSSKTQAI VGYQSGIVVGRGGGVYYPQYQFGPQQSLASQNGFNSLRNLPSQPCREKPAALRMAFPEQG I >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_5|366_bp atggatcgggccagcgaagaaaggagacagacaagaagcaggaaagaagagactggcaat gtgcagaccacgaaggcagcctcaccaaaagatgaggagatggtggggaaatcgggaaag actggcaaagagggaggagagtcagggcaaagtaatttgagttctaagacccaggcgatc gtggggtaccagtcagggatcgtggtggggagaggaggaggtgtatactatcctcagtac cagtttggaccccagcagagtttggcatcccagaatgggtttaacagcttaaggaatctt ccctctcagccatgccgagagaagcctgctgctttaagaatggctttccctgagcaaggg atctaa >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_6|81_aa MEFTTSPAAPATAQRLDETVWKHGCHLSRCLPSSPTSYWKSLAPDRSDDEHDPLDNTSRP RYSHSYLSDSDTEAKLTETNA >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_6|246_bp atggagtttaccacctccccggcagctcctgccactgcccagcgtcttgatgaaacagta tggaaacacggctgtcatttatccaggtgtctgcctagcagccccaccagctactggaag tcccttgcccctgatcggtcagatgatgagcacgaccctctcgacaacacctccagaccg cgatactcccacagttatctgagtgacagcgacacagaggccaagctgacggagactaac gcatag >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_7|2933_aa MFNLMKKDKDKDGGRKEKKEKKEKKERMSAAELRSLEEMSLRRGFFNLNRSSKRESKTRL EISNPIPIKVASGSDLHLTDIDSDSNRGSVILDSGHLSTASSSDDLKGEEGSFRGSVLQR AAKFGSLAKQNSQMIVKRFSFSQRSRDESASETSTPSEHSAAPSPQVEVRTLEGQLVQHP GPGIPRPGHRSRAPELVTKKFPVDLRLPPVVPLPPPTLRELELQRRPTGDFGFSLRRTTM LDRGPEGQACRRVVHFAEPGAGTKDLALGLVPGDRLVEINGHNVESKSRDEIVEMIRQSG DSVRLKVQPIPELSELSRSWLRSGEGPRREPSDHLSLPGPGASRVHTGRQVLPMELFSGS RTEAAPALGQKRAACVSPPGKAGDGGISPSSQEGRAVLSGNFYTSTTTSSGILMPQGQQV PGCVPPQLGQRFLSSPRMSSAQLCNTQERRDLFVISPDVAEVGGERGGAALGLLSELLDK LVLGWDWEEEVSWELPSVPLNGKEGGEENALPLGPQKEKPPVKEEDKTLPKPGSPGKEEG ALEGSSKEGSGPSRSPQPPTSPIPPETSQRARSPAPTLAMNGPGAATAEGLSEEAQGLSR KRVANAVRKVVSKVLPSEELGNAKETPGRGVKSPEHPTRSKRGEKAASSPKPPPPPPPPP APPKPEVKKEAAKDELSLGLRSLMSRGRGKEHKARGKQSSGKGEKPSSQEPGSPDWADSP EKAGSPAKPEAPKKQRSPAPPEELVTPGSTGPKSDLTGEQQSKSPVPGVKQEAKTEEQIA AEEAWNETEKVWLVHRDGFSLASQLKSEELNLPEGKVRVKLDHDGAILDVDEDDVEKANA PSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLGPRGAPAVYSEKVMHM FKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIILLGSSGSGKTTSCQHLVQYLATIAGI SGNKVFSVEKWQALYTLLEAFGNSPTIINGNATRFSQILSLDFDQAGQVASASIQTMLLE KLRVARRPASEATFNVFYYLLACGDGTLRTELHLNHLAENNVFGIVPLAKPEEKQKAAQQ FSKLQAAMKVLGISPDEQKACWFILAAIYHLGAAGATKAGRKQFARHEWAQKAAYLLGCS LEELSSAIFKHQHKGGTLQRSTSFRQGPEESGLGDGTGPKLSALECLEGMAAGLYSELFT LLVSLVNRALKSSQHSLCSMMIVDTPGFQNPEQGGSARGASFEELCHNYTQDRLQRLFHE RTFVQELERYKEENIELAFDDLEPPTDDSVAAVDQASHQSLVRSLARTDEARGLLWLLEE EALVPGASEDTLLERLFSYYGPQEGDKKGQSPLLHSSKPHHFLLGHSHGTNWVEYNVTGW LNYTKQNPATQNAPRLLQDSQKKIISNLFLGRAGSATVLSGSIAGLEGGSQLALRRATSM RKTFTTGMAAVKKKSLCIQMKLQVDALIDTIKKSKLHFVHCFLPVAEGWAGEPRSASSRR VSSSSELDLPSGDHCEAGLLQLDVPLLRTQLRGSRLLDAMRMYRQVWTVQLCPQQQTQCP FGAICPVRNEVLNLSEVLKIPKKGYPDHMVFSEFRRRFDVLAPHLTKKHGRNYIVVDERR AVEELLECLDLEKSSCCMGLSRVFFRAGTLARLEEQRDEQTSRNLTLFQAACRGYLARQH FKKRKIQDLAIRCVQKNIKKNKGVKDWPWWKLFTTVRPLIEVQLSEEQIRNKDEEIQQLR SKLEKAEKERNELRLNSDRLESRISELTSELTDERNTGESASQLLDAETAERLRAEKEMK ELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKK RLQQEFEDKLEVEQQNKRQLERRLGDLQADSEESQRALQQLKKKCQRLTAELQDTKLHLE GQQVRNHELEKKQRRFDSELSQAHEEAQREKLQREKLQREKDMLLAEAFSLKQQLEEKDM DIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGTIQM LEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKVLR EKRELEGKLATLSDQVNRRDFESEKRLRKDLKRTKALLADAQLMLDHLKNSAPSKREIAQ LKNQEWLTSLPGAGVVCVQLEESEFTCAAAVKARKAMEVEIEDLHLQIDDIAKAKTALEE QLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQVPLPGVPVTLCTVTILVVLLMGPP HNRLFWEASRDLAQINDLQAQLEEANKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQE AKIRELETRLEFERTQVKRLESLASRLKENMEKLTEERDQRIAAENREKEQNKRLQRQLR DTKEEMGELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEME SDENEDLINSEGDSDVDSELEDRVDGVKSWLSKNKGPSKAASDDGSLKSSSLSKEAPGVE ERPSSVVSSLSYRKRLTLKDSIGGTGDADSLFTSLSERAASPERPPRKAHVGPREEPCPG RKSEEPEECGSVRSGTGGRAGRGPQKRWGSDFSPASTVSAPVSRASSATRRGSGEDRAGS SLSFSLSGSPGSRRSTSRLDSLSRTLSPSLSRASGLGRESPDSRLSLGRSCLEEWDDGAS MALSEACSQYSHPSLARSLSVPPRPRSSASAVDEPPSSSVRSVSRHSYLDPDLEAAVNEV LSYKPVPFQRSSLEPDSEEDDRKSIQSARSAQLDPPERAASIRRSASAADVSRSRSGRKS RSRRRSGRSSSSSSSSSGSEASSEHKRRKKGRSRKSKKSKSRRKRTETESESSSSSSSGS TVSSHSCSSVKKGPAAESEETGQTHRPSRKEEKKRKKEVDSLMMRYLYRPESD >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_7|8802_bp atgtttaacctaatgaagaaagacaaggacaaagatggcgggcggaaggagaagaaggag aaaaaggagaaaaaggagcggatgtcagcggcagagcttcggagcctggaggagatgagc ctgcgacgtggcttcttcaacctgaaccgctcctccaagcgtgaatccaagacgcgcctg gaaatctccaaccccatccccatcaaggtggccagcggctctgacctgcacctgactgac attgactccgatagtaaccggggcagcgtcatcctggactcgggccacctaagtacagcc agctccagcgatgacctcaagggtgaggagggtagcttccgtggctcggtgctgcagcgg gcagccaagttcggctcactggccaagcagaactcacagatgattgtcaagcgcttttcc ttctcccagcgtagccgggatgagagcgcctcagaaacctcgacgccctcagagcactct gccgccccctcgccacaggtggaggtgaggactctagagggacagctggtgcagcatcct ggcccaggcatccctcgaccagggcaccgatcccgagcccctgagctagtgactaaaaag ttcccagtcgacctgcgcctgccccccgtggtgcccctgcccccacctaccctccgggag ctggagctgcaacgacggcccactggagactttggcttctccctgcggcgcacaaccatg ctggatcggggccccgagggccaggcctgtcggcgtgtggtccactttgctgagcctggt gcaggcaccaaggacctggccctggggctggtgccaggagatcgactggtggagattaat gggcacaatgtggagagcaagtccagggatgagattgtggagatgatccggcagtcaggg gacagcgtgcggctcaaggtgcagcccattccagagctcagcgagctcagcaggagctgg ctgcggagcggcgagggacctcgcagggagccatccgatcacctctcactgccaggccct ggggcaagcagggtccacacagggcggcaggttcttcccatggagctcttctctggcagc aggactgaggctgccccggccctggggcagaagagggctgcatgtgtgagtccccctggt aaggcaggggatggtggcatttccccttcctctcaagaaggaagagctgtcctctctggc aatttctacacctcaaccaccaccagctctggaatcctaatgccccagggccagcaggtc cctggctgtgtgccaccacaactggggcagcgcttcctttcctcgccaaggatgtcttct gcccagctttgtaacacacaggaaaggagggacctttttgtgatttccccagatgtggct gaagtaggtggggagaggggtggggcagccttggggctcctctccgagcttcttgataag ttggtgctaggttgggactgggaggaagaagtctcctgggagctgccctctgtgcctctg aatgggaaagagggaggagaggagaatgccctgcctcttggtcctcagaaggagaagcct ccggttaaagaggaggacaaaactctgccaaagcctggctctcctggcaaggaggaaggg gctctagagggcagctcaaaggaaggcagtggcccatctaggagtcctcagcctcccacc agcccgatacccccagagacttcccagagagccaggagccctgcacccacactcgccatg aacggcccgggagctgccacagcagaaggcctaagcgaagaggcccagggcctgtcccgg aagcgggtggcaaatgcagtgaggaaggtggtgagtaaggtgctgcccagcgaggagctt gggaatgccaaggagaccccaggcagaggagtcaagtcccccgagcacccgactcgaagc aagaggggagaaaaggcagcttctagtcccaagccgccaccccctcccccacctccccct gctccgcctaagcctgaagtgaagaaggaggcagccaaggatgagctctccctgggcctg cggagcctgatgtctcggggcaggggcaaggagcacaaggcccgcggcaagcagtcttct gggaagggggagaagccctccagccaggagccaggctccccagactgggcagactcccct gagaaagcaggatccccagccaagcctgaagccccaaagaagcagcgctccccagcccca ccggaagagctggtgaccccaggctccaccggcccgaagtcagacctcactggagagcag cagtccaagtcaccagtccctggtgtgaagcaggaggcgaaaacagaagaacagattgca gcagaagaggcctggaatgagacggagaaggtgtggctggtccatagggacggcttctca ctggccagtcaactcaaatctgaggagctcaacttgcctgaggggaaggtgcgtgtgaag ctggaccacgatggggccatcctggatgtggatgaggatgacgttgagaaggctaatgct ccctcctgcgaccgtctggaggatctggcctcactggtgtacctcaatgagtccagcgtc ctgcacaccttgcgccagcgctatggcgctagcctgctgcacacgtatgctggccccagc ctgctggttcttggcccccgtggggcccctgctgtgtactctgagaaggtgatgcacatg ttcaagggttgtcggcgggaggacatggcaccccacatctatgcagtggcccagaccgca tacagggcgatgctgatgagccgtcaggatcagtcaatcatcctcctgggcagtagtggc agtggcaagaccaccagctgccagcatctggtgcagtacctggccaccatcgcgggcatc agcgggaacaaggtgttttctgtggagaagtggcaggctctgtacaccctcctggaagcc tttgggaacagccccaccatcattaatggcaatgccacccgcttctcccagatcctctcc ctggactttgaccaagctggccaggtggcctcagcctccattcagacaatgcttctggag aagctgcgtgtggctcggcgcccagccagtgaagccacattcaacgtcttctactacctg ctggcctgtggggatggcaccctcaggacagagctccacctcaaccacttggcagagaac aatgtgtttgggattgtgccactggccaagcctgaggaaaagcagaaggcagctcagcag tttagtaagctgcaggcggccatgaaggtgctgggcatctcccccgatgaacagaaggcc tgctggttcattctggctgccatctaccacctgggggctgcgggagccaccaaagctggg cgcaagcagtttgcccgccatgagtgggcccagaaggctgcgtacctactgggctgcagc ctggaggagctgtcctcagccatcttcaagcaccagcacaagggtggcaccctgcagcgc tccacctccttccgccagggccccgaggagagtggcctgggagatgggacaggcccgaaa ctgagtgcactggagtgccttgagggcatggcggccggcctctacagcgagctcttcacc cttctcgtctccctggtgaatagggctctcaagtccagccagcactcactctgctccatg atgattgtcgacaccccgggcttccagaaccctgagcagggtgggtcagcccgcggagcc tcctttgaggagctgtgccacaactacacccaagaccggctgcagaggctcttccacgag cgcaccttcgtgcaggagttggaaagatacaaggaggagaacatcgagctggcgtttgac gacttggaacccccgacggatgactctgtggctgctgtggaccaggcctcccatcagtcc ctggtccgctcgctggcccgcacagacgaggcgaggggcctgctctggctattggaagag gaggctctggtgccaggggccagtgaggacaccctcctggagcgccttttctcctattat ggcccccaggaaggtgacaaaaaaggccaaagcccccttctgcacagcagcaaaccacac cactttctcctgggccacagccatggcaccaactgggtagagtacaatgtgactggctgg ctgaactacaccaagcagaacccagccacccagaatgccccccggctcctgcaggactcc cagaaaaaaatcatcagcaacctgtttctgggccgcgcaggcagtgccacggtgctctct ggctccatcgcgggcctggagggcggctcgcagctggcactgcgccgggccaccagcatg cggaaaacctttaccacaggcatggcggctgtcaaaaagaagtcactgtgcatccagatg aagctacaggtggacgccctcatcgacaccatcaagaagtcaaagctgcattttgtgcac tgcttcctgcctgtagctgagggctgggctggggagccccgttccgcctcctcccgccga gtcagcagcagcagtgagctggacctgccctcgggagaccactgcgaggctgggctcctg cagctcgacgtgcccctgctccgcacccagctccgcggctcccgcctgctcgatgccatg cgcatgtaccgccaagtttggacagttcagctttgcccacagcagcaaacccagtgccct ttcggagccatctgcccagtgagaaatgaggtgctaaatctttctgaagtacttaaaatt cccaagaaaggttaccctgaccacatggtgttttccgagttccgccgccgctttgatgtc ctggccccgcacctgaccaagaaacacgggcgtaactacatcgtggtggatgaaaggcgg gcagtggaggagctgctggagtgcttggatctggagaagagcagctgctgcatgggcctg agccgggtgttcttccgggcgggcaccttggcacggctggaggagcagcgggatgaacaa accagcaggaacctaaccctgttccaagcagcctgcaggggctacctggcccgccagcac ttcaagaagagaaagatccaggacctggccattcgctgtgtacagaagaacatcaagaag aacaaaggggtgaaggactggccctggtggaagctttttaccacagtgaggcccctcatc gaagtacagctgtcagaggagcagatccggaacaaagacgaggagatccagcagctgcgg agcaagctcgagaaggcggagaaggagaggaacgagctgcggctcaacagtgaccggctg gagagccggatctcagagctgacatcggagctgacagatgagcgtaacacaggagagtcc gcctcccagctgctggacgcggagacagcagagaggctccgggctgagaaggagatgaag gaactgcagacccagtacgatgcactgaagaagcagatggaggttatggaaatggaggtg atggaggcccgtctcatccgggcagcggagatcaacggggaagtggatgatgatgatgca ggtggcgagtggcggctgaagtatgagcgggctgtgcgggaggtggacttcaccaagaaa cggctccagcaggagtttgaggacaagctggaggtggagcagcagaacaagaggcagctg gaacggcggctcggggacctgcaggcagatagtgaggagagtcagcgggctctgcagcag ctcaagaagaagtgccagcgactgacggctgagctgcaagacaccaagctgcacctggag ggccagcaggtccgcaaccacgaactggagaagaagcagaggaggtttgacagtgagctc tcgcaggcgcatgaggaggcccagcgggagaagctgcagcgggagaagctgcagcgggag aaggacatgctcctcgctgaggctttcagcctgaagcagcaactagaggaaaaagacatg gacattgcagggttcacccagaaggttgtgtctctagaggcagagctccaggacatttct tcccaagagtccaaggatgaggcttctctggccaaggtcaagaaacagctccgggacctg gaggccaaagtcaaggatcaggaagaagagctggatgagcaggcagggaccatccagatg ctggaacaggccaagctgcgtctggagatggagatggagcggatgagacagacccattct aaggagatggagagtcgggatgaggaggtggaggaggcccggcagtcgtgtcagaagaag ttaaaacagatggaggtgcagctagaggaagagtatgaggacaagcagaaggttctgcga gagaagcgggagctggagggcaagctcgccaccctcagcgaccaggtgaaccggcgggac tttgagtcagagaagcggctgcggaaggacctgaagcgcaccaaggccctgctggcagat gcccagctcatgctggaccacctgaagaacagtgctcccagcaagcgagagattgcccag ctcaagaaccaggaatggctgacatctctccctggggcgggggtggtctgtgtccagctg gaggagtcagagttcacctgtgcggcagccgtgaaagcacggaaagcaatggaggtggag atcgaagacctgcacctgcagattgatgacatcgccaaagccaagacagcgctggaggag cagctgagccgccttcagcgtgagaagaatgagatccagaaccggctggaggaagatcag gaagacatgaacgaattgatgaagaagcacaaggctgccgtggctcaggtacccctgcca ggagtgcccgtcacactctgcaccgtcaccatccttgtggtcctcctcatgggtcccccg cacaaccgcttgttttgggaggcttcccgggacctggctcagataaatgatctccaagct cagctagaagaagccaacaaagagaagcaggagctgcaggagaagctacaagccctccag agccaggtggagttcctggagcagtccatggtggacaagtccctggtgagcaggcaggaa gctaagatacgggagctggagacacgcctggagtttgaaaggacgcaagtgaaacggctg gagagcctggctagccgtctcaaggaaaacatggagaagctgactgaggagcgggatcag cgcattgcagccgagaaccgggagaaggaacagaacaagcggctacagaggcagctccgg gacaccaaggaggagatgggcgagcttgccaggaaggaggccgaggcgagccgcaagaag cacgaactggagatggatctagaaagcctggaggctgctaaccagagcctgcaggctgac ctaaagttggcattcaagcgcatcggggacctgcaggctgccattgaggatgagatggag agtgatgagaatgaggacctcatcaacagtgagggagactctgatgtggactcggagctg gaggaccgtgttgacggggtcaagtcctggttgtcaaaaaacaagggaccttccaaggca gcttctgatgatggcagcttaaagagttccagcctctctaaggaggccccgggggtggag gagaggccgtcctcggtggtgagctccctgagctatcggaagcggctcaccctaaaggac tccatcgggggcaccggggacgcggattcgctcttcacctccctgagcgagcgggcggcc tcccctgagaggccccctaggaaggcccatgtgggccccagggaggagccatgcccaggc aggaagtccgaggagccggaggagtgcggctccgtccgctcggggaccggggggcgcgct ggccgggggccgcagaagcggtggggctccgacttcagcccggcctccaccgtctctgca ccggtcagccgggcctcctcggccacgcggcggggctctggcgaagacagggctggctct tccctgagtttctcgctgtcgggctcgcccggttcccgccgcagcacctcccggctcgac agcctctccaggacactcagcccttccctgagccgggcctctggcctaggccgggagagc cctgattctcgcctgtccctgggccggagctgcctagaggagtgggatgatggagccagt atggccctgagcgaggcctgctcgcagtacagccacccgtcgctggcccgcagtctgtcg gtgccaccccggccacgcagctctgcctcggcagtggatgagcctcccagctccagcgtc cgctccgtcagccgtcactcctacctggacccagacctggaggctgccgtcaacgaggtc ttgagctacaagcctgttccattccagcggagcagcctggagcccgactccgaggaggat gacaggaagagcatccaaagtgcccggagcgcccaactggaccccccggagcgagctgcc agcatccgccgctccgcctctgctgctgatgtgtcccggtcccgcagcggccggaagagc cggagccggcgaaggagcggaaggagcagctccagctccagctccagctccggctccgaa gcttcctcagagcacaagaggcggaagaaggggcgctctcggaagagcaagaagtccaag tcgagaagaaagagaacggagacggagtctgagtcctcctcgtcgtcatccagcggctcg accgtctccagccacagctgctccagtgtgaagaagggcccagctgcagaaagtgaagaa actgggcagacgcaccggccgtcgaggaaggaggaaaagaagcgcaagaaagaggtggac agcctgatgatgcggtacctgtaccggcccgagagcgactag >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_8|200_aa MEERERESSCGLARFPAALPGWLEGPVVPPSPPLPIGPHPRASSSTAPPGTGAGSPGRCF QGTGEEEEEPAGPAARPPSLPAGARPAAPPSRARHRAAAAADRHSPATGARPAGSRSGDA ASSSLGPVRDGAERSAAQLSAAPSDRAGRAERELPAGGCGDKRPACLAAPVRAARRGPGG PGAALAASVRCGLRSPSRCV >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_8|603_bp atggaggaacgcgagcgggagagctcctgcggcctcgccaggttccctgctgctcttcca ggttggctggaagggccggtggttcccccttccccgccgctcccgatcggcccccacccc cgggcgtcgagctccacggcgccgccagggacgggagccgggagtccgggccgctgcttc caggggacgggggaggaggaggaggagccggcgggccccgccgcccgcccgccctccctc ccggccggagcccgccccgccgccccaccaagccgggcccgccaccgagcggccgccgcc gccgaccggcactcaccggccaccggagcccgcccggcaggcagcagaagcggagacgcg gcatccagcagcctcggcccggtcagggatggagcagagcgcagcgcggcgcagctcagc gccgcccccagcgaccgcgcaggccgagccgaaagggagctccctgctggcggctgtggg gataaacgcccggcctgcctggcagcgccagtgcgcgccgccagacgtgggccaggcggg ccgggcgctgccctggcagcctccgtccgctgtggactccgaagcccctctcgctgtgtc tga >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_9|122_aa MPVAILVDCVLQDQNHKMSDWSFLGWLLTRVQNDSTVVGKSHEEENLPELHVQRKLFTIK SFTESVLTPELGHWEPLRNPKPRSDMIIFVLWKDNSGSHMEDQLEGMKRKQATTRRPLLL SS >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_9|369_bp atgcctgtggccatcctagtggactgcgtcctccaggaccaaaatcacaagatgagcgac tggtcattcctgggctggctcctgacccgagtgcagaacgattccaccgtggttggcaag agccatgaggaagaaaatcttcctgaacttcatgtgcagcgtaaactatttaccatcaag tcctttacagaaagtgtgctgacccctgagctgggccattgggaacctttgaggaatccc aagccaagaagtgacatgatcatatttgtattatggaaagacaactctggcagccacatg gaggaccagttagaggggatgaagcggaagcaggcgaccacgagaaggccgttgctattg tccagctaa >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_10|265_aa MSHQTGIQASEDVKEIFARARNGKYRLLKISIENEQLVIGSYSQPSDSWDKDYDSFVLPL LEDKQPCYILFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDE IFGTVKEDVSLHGYKKYLLSQSSPAPLTAAEEELRQIKINEVQTDVGVDTKHQTLQGVAF PISREAFQALEKLNNRQLNYVQLEIDIKNEIIILANTTNTELKDLPKRIPKDSARYHFFL YKHSHEGDYLESIVLFIQCLDTHAV >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_10|798_bp atgtcccaccagaccggcatccaagcaagtgaagatgttaaagagatctttgccagagcc agaaatggaaagtacagacttctgaaaatatctattgaaaatgagcaacttgtgattgga tcatatagtcagccttcagattcctgggataaggattatgattcctttgttttacccctg ttggaggacaaacaaccatgctatatattattcaggttagattctcagaatgcccaggga tatgaatggatattcattgcatggtctccagatcattctcatgttcgtcaaaaaatgttg tatgcagcaacaagagcaactctgaagaaggaatttggaggtggccacattaaagatgaa atatttggaacagtaaaggaagatgtatcattacatggatataaaaaatacttgctgtca caatcttcccctgccccactgactgcagctgaggaagaattacgacagattaaaatcaat gaggtacagactgacgtgggtgtggacactaagcatcaaacactacaaggagtagcattt cccatttctcgagaagcctttcaggctttggaaaaattgaataacagacagctcaactat gtgcagttggaaatagatataaaaaatgaaattataattttggccaacacaacaaataca gaactgaaagatttgccaaagaggattcccaaggattcagctcgttaccatttctttctg tataaacattcccatgaaggagactatttagagtccatagttttatttattcaatgcctg gatacacatgcagtataa >gi568815581r:28974773_29266858|GENSCAN_predicted_peptide_11|280_aa MVYSMRSSKTGIQKEKNHNSGGKAADKEDAVVGWSAWRQGQGVKAIHKIPLSIYAAWRVA LVSRSPRNPSNHQDGSDQPYAGVPGAMEALLRHSISFQITIYDQENFQGKRMEFTSSCPN VSERSFDNVRSLKVESGAWIGYEHTSFCGQQFILERGEYPRWDAWSGSNAYHIERLMSFR PICSANHKESKMTIFEKENFIGRQWEISDDYPSLQAMGWFNNEVGSMKIQTGFATNILDI VGISISWNVTIMEETINIGESGALMPRLRRSNRFAESNSS >gi568815581r:28974773_29266858|GENSCAN_predicted_CDS_11|843_bp atggtgtatagcatgagaagctccaaaacaggaattcagaaggagaaaaatcacaacagt ggagggaaggcagcagacaaagaggatgctgtggttggctggtcagcctggaggcaaggc cagggagtcaaagccatccacaaaattccactgagcatctatgctgcttggcgtgtagcc ctggtgtccagaagcccaagaaacccttccaaccaccaagatggctcagaccaaccctac gccggggtccctggggccatggaagctctcttgcgccattcaatctcatttcagataacc atctatgatcaggagaactttcagggcaagaggatggagttcaccagctcctgtccaaat gtctctgagcgcagttttgataatgtccggtccctgaaggtggaaagtggcgcctggatt ggttatgagcataccagcttctgtgggcaacagtttatcctggagagaggagaataccct cgctgggatgcctggagtgggagtaatgcctaccacattgagcgtctcatgtccttccgc cccatctgttcagctaatcataaggagtctaagatgaccatctttgagaaggaaaacttt attggacgccagtgggagatctctgacgactacccctccttgcaagccatgggctggttc aacaacgaagtcggctccatgaagatacaaactgggtttgctaccaatatcctggatatc gtgggtatcagtatatcttggaatgtgaccatcatggaggagactataaacattggagag agtggggctctcatgcccagacttcgcagatccaatcgattcgccgaatccaacagtagc tga