GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:58:09 Sequence gi568815587f:71860748_72096312 : 235565 bp : 44.38% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 25564 26097 534 0 0 89 -22 398 0.443 22.02 1.02 Intr + 28961 29260 300 1 0 37 67 191 0.309 8.83 1.03 Intr + 30592 30835 244 0 1 63 41 86 0.034 -1.53 1.04 Intr + 32772 33423 652 1 1 16 -1 332 0.004 8.07 1.05 Intr + 33529 33686 158 1 2 78 82 63 0.104 4.35 1.06 Intr + 42499 42587 89 2 2 69 -3 157 0.293 4.39 1.07 Intr + 48022 48555 534 0 0 76 -22 297 0.522 10.62 1.08 Intr + 49435 49657 223 0 1 43 38 151 0.456 3.30 1.09 Intr + 50682 50858 177 1 0 25 76 94 0.280 1.79 1.10 Intr + 51376 51525 150 2 0 32 83 98 0.785 3.83 1.11 Intr + 52054 52196 143 2 2 112 -5 95 0.721 2.67 1.12 Term + 52323 52406 84 2 0 120 49 70 0.794 3.95 1.13 PlyA + 53404 53409 6 1.05 2.06 PlyA - 54804 54799 6 1.05 2.05 Term - 55277 55266 12 2 0 120 48 2 0.653 -2.40 2.04 Intr - 55481 55327 155 0 2 37 95 55 0.300 0.89 2.03 Intr - 57029 56874 156 0 0 115 58 85 0.240 8.18 2.02 Intr - 58279 58138 142 1 1 108 92 82 0.997 10.53 2.01 Init - 62606 62562 45 2 0 111 76 10 0.897 2.86 2.00 Prom - 67860 67821 40 -4.56 3.00 Prom + 68039 68078 40 -11.63 3.01 Init + 68315 68377 63 1 0 96 99 78 0.897 10.86 3.02 Intr + 82670 82707 38 1 2 64 90 28 0.065 -2.34 3.03 Intr + 96431 96517 87 2 0 67 107 26 0.380 1.49 3.04 Intr + 100003 100144 142 1 1 107 75 185 0.860 19.46 3.05 Intr + 122014 122168 155 0 2 86 100 59 0.717 5.77 3.06 Intr + 126257 126364 108 2 0 128 89 45 0.994 8.00 3.07 Intr + 129850 129970 121 1 1 91 98 141 0.999 15.90 3.08 Intr + 133972 134158 187 0 1 89 30 70 0.611 0.56 3.09 Intr + 134693 134928 236 0 2 46 44 176 0.265 6.41 3.10 Intr + 138085 138272 188 0 2 75 91 57 0.890 3.19 3.11 Intr + 139180 139265 86 1 2 64 96 44 0.966 2.26 3.12 Intr + 140454 140577 124 1 1 77 94 93 0.993 8.44 3.13 Intr + 140658 140805 148 0 1 121 68 175 0.966 19.04 3.14 Term + 141037 141114 78 0 0 118 47 -10 0.832 -4.34 3.15 PlyA + 141760 141765 6 -3.94 4.27 PlyA - 142141 142136 6 1.05 4.26 Term - 142791 142780 12 0 0 101 43 1 0.553 -4.90 4.25 Intr - 143352 143140 213 0 0 65 75 268 0.978 22.11 4.24 Intr - 143594 143478 117 2 0 102 94 49 0.994 7.56 4.23 Intr - 144069 143893 177 0 0 83 80 240 0.999 22.82 4.22 Intr - 144622 144486 137 2 2 119 72 247 0.998 26.49 4.21 Intr - 145516 145288 229 1 1 66 94 148 0.932 10.64 4.20 Intr - 146688 146442 247 2 1 51 77 245 0.980 17.16 4.19 Intr - 148098 147941 158 0 2 125 71 159 0.996 16.71 4.18 Intr - 148438 148220 219 1 0 127 110 375 0.871 42.10 4.17 Intr - 148640 148521 120 2 0 86 110 112 0.991 13.89 4.16 Intr - 150107 150039 69 2 0 145 81 108 0.986 15.18 4.15 Intr - 151695 151654 42 0 0 99 95 -1 0.509 0.14 4.14 Intr - 155513 152148 3366 2 0 54 110 4307 0.927 416.89 4.13 Intr - 155783 155661 123 2 0 80 91 85 0.986 8.78 4.12 Intr - 155910 155830 81 0 0 115 11 82 0.890 3.03 4.11 Intr - 157080 156940 141 0 0 60 86 245 0.995 22.05 4.10 Intr - 157553 157436 118 1 1 121 98 156 0.924 20.37 4.09 Intr - 157766 157649 118 0 1 102 77 237 0.999 23.52 4.08 Intr - 158233 158076 158 0 2 94 73 258 0.995 24.55 4.07 Intr - 158870 158747 124 0 1 97 55 78 0.979 5.04 4.06 Intr - 160544 160457 88 2 1 74 115 122 0.987 13.04 4.05 Intr - 161672 161592 81 2 0 122 115 11 0.992 7.03 4.04 Intr - 162598 162318 281 2 2 90 77 96 0.395 5.90 4.03 Intr - 163606 163527 80 0 2 83 63 48 0.662 0.99 4.02 Intr - 168543 168458 86 0 2 89 82 130 0.978 11.12 4.01 Init - 175196 175155 42 2 0 80 106 55 0.987 4.92 4.00 Prom - 193251 193212 40 -3.66 5.00 Prom + 207946 207985 40 -4.26 5.01 Init + 228337 228418 82 0 1 54 82 71 0.894 4.23 5.02 Intr + 232749 232954 206 1 2 89 76 183 0.904 16.12 5.03 Intr + 234201 234349 149 2 2 105 97 124 0.948 14.03 5.04 Term + 234632 234773 142 2 1 71 54 222 0.997 14.50 5.05 PlyA + 234829 234834 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:71860748_72096312|GENSCAN_predicted_peptide_1|1095_aa RRKAESPQAATKWLEEHAPADYQNPQEYGRTQLPGTDPQLDPHEREDMQRLNRDREALLE GLMRGAQKATNVNKLSEDIQGKEESPTQFYERLWEAYRMYTPFDPDSPENQRMIPMALVR QSAEDMRRKLQKQAGLAGMNPSQLLEIASQVFVNRDAVSRKENGKENGGQARRYADLFHD MRQGLAVLPGIRASGAAPFEGLQVDFTEMSKCGGNKYVVVLGHTYSGWEEAYQTRTEKSR EVTPVLLRDLIPRFRRPLGIGSDNGPAFLVALVQKTAKEDFHRGVFTPCDIGSKIILSTQ EMTNKVMGMCTPPAILGVMSSSPNLDVSNQITKGLYTPCDIGCNMILSRPGYRERYHTAE DPELQPLLALLSLSLSMHLVMVLRNLLNILAVSSDSPLHTPTYFFLSNLCWADTGFTSAT VPNMIVDMQSHSRVISHADCLTQISFLLLFACIEGMLLTVMTYDCFVAICCPLHYPVIVN PHLCVFFVLVSFFLSLLDSQLHSWIVLQFTIIKNVEISNSVCDPSQLLKLACSDSVINSI FMHFHNTMFGFLPISGILVSYYKIVPSILRISSSDGKNGVMASVMYAVVTPMLNLFIYSL RNRDIQSALWRLLSRTVESHDLFHPFSCVEDPELQPVVAGLFLSMCLVTVLENLLIILAR RKAESPQAATKWLEEHAPADYQNAQEYGRAQLPGTDPQLDPHERGDMQRLKRDREALLEG LMRGAQKATNVNKLSEVIQGKEESPAQFYQRLCEVYRMYTPCDPDSPENQHMIHMALVRQ SPEDMRRKLQKQAGLAGMNPSQLLEIASQVFVNRDAVSRKENGKENGGQARRYADLFPRT KDYQPVQDLRLLHQAKLTLHPTVNNPSTLLGLLPAEDSWFTCLDLKDVFFPIRLAPERQK LFAFQWEDPESGDWELYVDGSSFFNPQGERGAGYAVITLDTVVEATSLPQATSGQKAELI AFIGALELSEALAKTVRQRCVTCRQHDARQGPAVPPGIRAYGAAPFEGLQVDFTEMPKCG DIRKKCHWGCEQPCDIESSIILSPLHIGNNITSGVYSPLHIGNNITSGGIQNNIIGGVYT LCDIESHIILFRSGY >gi568815587f:71860748_72096312|GENSCAN_predicted_CDS_1|3288_bp cgaagaaaggcggagagtcctcaagcagcaactaagtggctagaggaacatgcaccagct gattatcaaaacccccaagagtatggaaggacccagttgccaggaacagacccccagttg gacccacatgaaagagaggatatgcaaaggctaaaccgagacagggaagctctcttggaa ggattaatgaggggagctcagaaggccacaaacgttaacaagctctctgaggacattcag ggtaaagaagaaagtccaacacaattctacgagagactgtgggaggcctatcgtatgtat actccctttgatcctgatagccctgaaaatcagcgcatgattcccatggctttagtccgt caaagcgcagaagacatgagaagaaaactgcagaaacaggctgggcttgcagggatgaat ccatcccaattactagaaatagctagccaggtgtttgtaaacagggatgcagtaagccgt aaggaaaacggcaaagagaatggaggtcaggcccggcgatatgccgacctgtttcatgat atgaggcaaggtctagctgttctgcccggcataagagcttctggagcagccccctttgaa ggtctccaagtggacttcacagagatgtcaaaatgtggaggtaacaagtatgtagtagtt cttgggcatacctactctgggtgggaggaggcctatcaaacacgaacagagaaatctcgt gaagtaacccctgtgcttcttcgtgatctgattcctagatttcgacggcctttagggatc ggctcagacaacgggcctgcgtttttggttgccttggtacagaagacggcaaaggaagat tttcacaggggtgtgttcaccccctgcgatattgggagtaagatcatcctctccacccag gaaatgactaacaaggtcatggggatgtgtactccgcctgcgattttgggagtaatgtca tcctccccaaacctggatgttagcaaccagatcacaaaggggttgtacacaccctgcgac attggatgtaatatgatcctctcccgacctggatacagagaaagataccacaccgcggag gatccagaactgcagccgctcctcgctttgctgtccctgtccctgtccatgcatctggtc atggtgctgaggaacctgctcaacatcctggctgtcagctctgactcccccctccacacc cccacgtacttcttcctctccaacctgtgctgggctgacaccggtttcacctcggccacg gttcccaatatgattgtggacatgcagtcgcatagcagagtcatctctcatgcggactgc ctgacacagatttccttcttgctcctttttgcatgtatagaaggcatgctcctgactgtg atgacctatgactgctttgtagccatctgttgccctctgcactacccagtcatcgtgaat cctcacctctgtgtcttcttcgttttggtgtcctttttccttagcctgttggattcccag ctgcacagttggattgtgttacaattcaccatcatcaagaatgtggaaatctctaattct gtctgtgacccctctcaacttctcaaacttgcttgttctgacagcgtcatcaatagcata ttcatgcatttccataatactatgtttggttttcttcccatttcagggatccttgtgtct tactataaaatcgtcccctccattcttaggatttcatcgtcagatgggaagaatggtgtg atggcgtcagtgatgtacgctgtggtcacccccatgctgaaccttttcatctacagcctg agaaacagggacatacaaagtgccctgtggaggctgctcagcagaacagtcgaatctcat gatctgttccatcctttttcttgtgtggaggatccagaactgcagccggtcgtcgctggg ctgttcctgtccatgtgcctggtcacggtgctggagaacctgctcatcatcctggcccga agaaaggcggagagtcctcaagcagcaactaagtggctagaggaacatgcaccagctgat tatcaaaacgcccaagagtatggaagggcccagttgccaggaaccgaccctcagttggac ccacatgaaagaggggatatgcaaaggctaaagcgagacagagaagctctcctggaagga ttaatgaggggagctcagaaggccacaaacgttaacaagctctctgaggtcattcagggt aaagaagaaagtccagcacaattctaccagagactgtgtgaggtctatcgtatgtatact ccctgtgatcccgatagccctgaaaatcagcacatgattcacatggctttagtccgtcaa agcccagaagacatgagaagaaaactgcagaaacaggctgggcttgcagggatgaatcca tcccaattactagaaatagctagccaggtgtttgtaaacagggatgcagtaagccgtaag gaaaacggcaaagagaatggaggtcaggcccggcgatatgccgacctgtttccacggacc aaggactaccagccggttcaggatttgcgcttgcttcatcaagctaaactgactttacat ccaacagtaaataacccgtccacattgttggggttgctgccagctgaggacagctggttc acctgcttggacctgaaagacgttttctttcctatcagattagcccctgagaggcagaag ctgtttgcctttcagtgggaagatccggagtcaggagactgggaactatatgtggatggg agcagcttcttcaacccccaaggagagagaggtgcagggtatgcagtgataaccctggac actgttgttgaagccacatcgttgccccaggccacttcaggccagaaagctgaactcatt gctttcattggggccttagaactcagtgaggcccttgccaaaacggtgaggcagcggtgt gttacctgccgccagcatgatgcgaggcaaggtccagccgttccgcccggcatacgagct tatggagcagccccctttgaaggtctccaagtggacttcacagagatgccaaagtgtgga gatattaggaaaaaatgtcactggggatgtgaacagccctgcgatattgagagtagtatc atcctctcccccttgcatattgggaacaacatcacaagtggggtgtactcccccttgcat attgggaacaacatcacaagtgggggtattcagaacaatattataggaggggtgtacacc ctctgcgatattgagagtcatatcatcctctttcgctctggatattag >gi568815587f:71860748_72096312|GENSCAN_predicted_peptide_2|169_aa MAPVKISHVVSFSSQDPKYPVENLLNPDSPRRPWLGCPQDKSGQLKVELQLERAVPTGYI DVGNCGCAFLQIDVGHSSWPLDRPFITLLPATTLMSLTDSKQGKNRSGVRMFKDGELWDR LRLTCSRPFTRHQSFGLAFLRVCSSLDSLDDSVVGPSALLSSVLNKVSV >gi568815587f:71860748_72096312|GENSCAN_predicted_CDS_2|510_bp atggctcctgtgaagatcagccatgtggtatcattttcttctcaggatcccaagtatcct gtagagaacttgctaaacccagatagtccaaggagaccttggctcggctgccctcaggac aagagtgggcaattgaaagtagaactacagctggagagggcagtgcccactggctacatt gatgtgggtaactgtggctgtgcgttcctgcaaattgatgtgggccattcttcctggccc ctggacagacctttcataaccctgctccctgcaaccacgctaatgtctctaactgattca aagcaggggaagaaccgctccggggtccgcatgtttaaagatggagagttatgggatcga cttcgcctgacctgctcccgacccttcacgcgtcatcagtcctttggcctggcctttcta cgggtgtgttcttctctggactccttagatgactctgtggtgggtccctcagcccttctg agctctgtgctgaacaaggtttctgtttaa >gi568815587f:71860748_72096312|GENSCAN_predicted_peptide_3|586_aa MAAVVEVEVGGGAAGERELDELCHAAVICDKSIRDSFKTVTDFSGGVFLQVDMSDLSPEE QWRVEHARMHAKHRGHEAMHAEMVLILIATLVVAQLLLVQWKQRHPRSYNMVTLFQMWVV PLYFTVKLHWWRFLVIWILFSAVTAFVTFRATRKPLVQTTPRLVYKWFLLIYKISYATGI VGYMAVMFTLFGLNLLFKIKPEDAMDFGISLLFYGLYYGVLERDFAEMCADYMASTIGFY SESGMPTKHLSDSVCAVCGQQIFVDVSEEGIIENTYRLSCNHVYPASSSWATSLLQSAHC TVLSFHEFCIRGWCIVGKKQTCPYCKEKVDLKRMFSNPYPLLGSLLGVGCGKKVLASVTC IWVLLGPSLLPPTRTCQTMGWLSSLCASSWVGRESSTDKEEVLSVYFLWWVHTQLDTANL SLGAPRGKLLERKALQDLLGAREEDVVTDKEPGSPAPDACIMTMRHNWTPDGTLSLSCVA CSRFPNFSILYWLGNGSFIEHLPGRLWEGSTSRERGSTGTQLCKALVLEQLTPALHSTNF SCVLVDPEQVVQRHVVLAQLWAGLRATLPPTQEALPSSHSSPQQQG >gi568815587f:71860748_72096312|GENSCAN_predicted_CDS_3|1761_bp atggcggcagtggtggaggtggaggttggaggtggtgctgctggggaacgggagctggat gagctctgccatgcagctgttatctgtgacaagtcaataagggacagttttaagacagta acggatttttctggtggtgtctttctacaggttgatatgtcagatctctctccagaagag caatggagggtcgagcacgcacgcatgcatgccaagcaccgtggccatgaagctatgcat gctgaaatggtcctcatcctcatcgcaaccttggtggtggcccagctgctcctggtgcag tggaagcagaggcacccacgctcctacaatatggtgaccctctttcagatgtgggttgtt cccctctatttcacagtgaagctgcactggtggaggttcctagtgatctggatcttgttc tctgctgtcacagcctttgttaccttccgagccacccgaaaacctctagtacagacaacc ccaaggttggtttataagtggttcctgctaatctataaaatcagctatgccactggcatt gttggctacatggctgtcatgtttaccctctttggtcttaacttattattcaagatcaaa ccagaagatgccatggactttggcatctcccttctcttctatggcctctactatggagtt ctggaacgggactttgcagaaatgtgtgcagactacatggcatctaccatagggttctac agcgagtcgggcatgcctaccaaacatctttcagacagtgtgtgtgctgtgtgtgggcag cagatctttgtggacgtcagtgaagaggggatcattgagaacacgtataggctgtcctgc aatcatgtgtatcctgcctcgagctcctgggccacatctctcctgcaatctgcacactgt acggtgctcagcttccacgagttctgcatccgtggctggtgcatcgtgggaaagaagcaa acgtgtccctactgcaaagagaaggtagacctcaagaggatgttcagcaatccgtatcct ttattggggtcgttgttgggagtgggctgtgggaagaaagtactggccagtgtgacctgc atttgggtcctcctggggccctcacttctgcccccaaccagaacatgtcaaaccatgggt tggctctcgagcttgtgtgccagttcctgggttggccgtgagagttctacagacaaggag gaagtgctctcggtgtatttcctgtggtgggttcacacgcagctagacacagctaacttg agtcttggagctcctagagggaagcttctggaaaggaaggctcttcaggacctcttagga gccagagaagaggacgttgtcacagataaagagccaggctcaccagctcctgacgcatgc atcatgaccatgagacacaactggacaccagatggaacgctgagcttatcctgtgtggcc tgcagccgcttccccaacttcagcatcctctactggctgggcaatggttccttcattgag cacctcccaggccgactgtgggaggggagcaccagccgggaacgtgggagcacaggtacg cagctgtgcaaggccttggtgctggagcagctgacccctgccctgcacagcaccaacttc tcctgtgtgctcgtggaccctgaacaggttgtccagcgtcacgtcgtcctggcccagctc tgggctgggctgagggcaaccttgccccccacccaagaagccctgccctccagccacagc agtccacagcagcagggttaa >gi568815587f:71860748_72096312|GENSCAN_predicted_peptide_4|2208_aa MTLHATRGAALLSWVNSLHVADPVEAVLQLQDCSIFIKIIDRIHGTEEGQQILKQPVSER LDFVCSFLQTLGLTQDRTDFRWKIVRQGKEAIGVRRPTSHSPTRDSHSAPQISVFWDLRY GFQEVHPVFYVVVIAENRKHPSSPECLVSAQKVLEGSELELAKMTMLLLYHSTMSSKSPR DWEQFEYKIQAELAVILKFVLDHEDGLNLNEDLENFLQKAPVPSTCSSTFPEELSPPSHQ AKREIRFLELQKVASSSSGNNFLSGSPASPMGDILQTPQFQMRRLKKQLADERSNRDELE LELAENRKLLTEKDAQIAMMQQRIDRLALLNEKQAASPLEPKELEELRDKNESLTMRLHE TLKQCQDLKTEKSQMDRKINQLSEENGDLSFKLREFASHLQQLQDALNELTEEHSKATQE WLEKQAQLEKELSAALQDKARAKGDLGNKMMGPMFADVYEGLCGNLKCLEEKNEILQGKL SQLEEHLSQLQDNPPQEKGEVLGDVLQLETLKQEAATLAANNTQLQARVEMLETERGQQE AKLLAERGHFEEEKQQLSSLITDLQSSISNLSQAKEELEQASQAHGARLTAQVASLTSEL TTLNATIQQQDQELAGLKQQAKEKQAQLAQTLQQQEQASQGLRHQVEQLSSSLKQKEQQL KEVAEKQEATRQDHAQQLATAAEEREASLRERDAALKQLEALEKEKAAKLEILQQQLQVA NEARDSAQTSVTQAQREKAELSRKVEELQACVETARQEQHEAQAQVAELELQLRSEQQKA TEKERVAQEKDQLQEQLQALKESLKVTKGSLEEEKRRAADALEEQQRCISELKAETRSLV EQHKRERKELEEERAGRKGLEARLQQLGEAHQAETEVLRRELAEAMAAQHTAESECEQLV KEVAAWRERYEDSQQEEAQYGAMFQEQLMTLKEECEKARQELQEAKEKVAGIESHSELQI SRQQNELAELHANLARALQQVQEKEVRAQKLADDLSTLQEKMAATSKEVARLETLVRKAG EQQETASRELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMEREAEQMGNELERLRAA LMESQGQQQEERGQQEREVARLTQERGRAQADLALEKAARAELEMRLQNALNEQRVEFAT LQEALAHALTEKEGKDQELAKLRGLEAAQIKELEELRQTVKQLKEQLAKKEKEHASGSGA QSEAAGRTEPTGPKLEALRAEVSKLEQQCQKQQEQADSLERSLEAERASRAERDSALETL QGQLEEKAQELGHSQSALASAQRELAAFRTKVQDHSKAEDEWKAQVARGRQEAERKNSLI SSLEEEVSILNRQVLEKEGESKELKRLVMAESEKSQKLEERLRLLQAETASNSARAAERS SALREEVQSLREEAEKQRVASENLRQELTSQAERAEELGQELKAWQEKFFQKEQALSTLQ LEHTSTQALVSELLPAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRAQRELG ELIPLRQKVAEQERTAQQLRAEKASYAEQLSMLKKAHGLLAEENRGLGERANLGRQFLEV ELDQAREKYVQELAAVRADAETRLAEVQREAQSTARELEVMTAKYEGAKVKVLEERQRFQ EERQKLTAQVEQLEVFQREQTKQVEELSKKLADSDQASKVQQQKLKAVQAQGGESQQEAQ RLQAQLNELQAQLSQKEQAAEHYKLQMEKAKTHYDAKKQQNQELQEQLRSLEQLQKENKE LRAEAERLGHELQQAGLKTKEAEQTCRHLTAQVRSLEAQVAHADQQLRDLGKFQVATDAL KSREPQAKPQLDLSIDSLDLSCEEGTPLSITSKLPRTQPDGTSVPGEPASPISQRLPPKV ESLESLYFTPIPARSQAPLESSLDSLGDVFLDSGRKTRSARRRTTQIINITMTKKLDVEE PDSANSSFYSTRSAPASQASLRATSSTQSLARLGSPDYGNSALLSLPGYRPTTRSSARRS QAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNRIAELQQRNRVCPPHLKTCYPLESRPSLS LGTITDEEMKTGDPQETLRRASMQPIQIAEGTGITTRQQRKRVSLEPHQGPGTPESKKAT SCFPRPMTPRDRHEGRKQSTTEAQKKAAPASTKQADRRQSMAFSILNTPKKLGNSLLRRG ASKKALSKASPNTRSGTRRSPRIATTTASAATAAAIGATPRAKGKAKH >gi568815587f:71860748_72096312|GENSCAN_predicted_CDS_4|6627_bp atgacactccacgccacccggggggctgcactcctctcttgggtgaacagtctacacgtg gctgaccctgtggaggctgtgctgcagctccaggactgcagcatcttcatcaagatcatt gacagaatccatggcactgaagagggacagcaaatcttgaagcagccggtgtcagagaga ctggactttgtgtgcagttttctgcagacccttggattgacccaggatagaactgacttc agatggaaaattgtaaggcaggggaaagaggccataggagtaaggaggcctacatctcat agccccaccagagatagccattctgctccccagatttcagtgttctgggatctcaggtat ggtttccaggaagttcaccctgttttctacgttgttgtcattgcagaaaatcgaaaacat ccctcttccccagaatgcctggtatctgcacagaaggtgctagagggatcagagctggaa ctggcgaagatgaccatgctgctcttataccactctaccatgagctccaaaagtcccagg gactgggaacagtttgaatataaaattcaggctgagttggctgtcattcttaaatttgtg ctggaccatgaggacgggctaaaccttaatgaggacctagagaacttcctacagaaagct cctgtgccttctacctgttctagcacattccctgaagagctctccccacctagccaccag gccaagagggagattcgcttcctagagctacagaaggttgcctcctcttccagtgggaac aactttctctcaggttctccagcttctcccatgggtgatatcctgcagaccccacagttc cagatgagacggctgaagaagcagcttgctgatgagagaagtaatagggatgagctggag ctggagctagctgagaaccgcaagctcctcaccgagaaggatgcacagatagccatgatg cagcagcgcattgaccgcctagccctgctgaatgagaagcaggcggccagcccactggag cccaaggagcttgaggagctgcgtgacaagaatgagagccttaccatgcggctgcatgaa accctgaagcagtgccaggacctgaagacagagaagagccagatggatcgcaaaatcaac cagctttcggaggagaatggagacctttcctttaagctgcgggagtttgccagtcatctg cagcagctacaggatgccctcaatgagctgacggaggagcacagcaaggccactcaggag tggctagagaagcaggcccagctggagaaggagctcagcgcagccctgcaggacaaggcc agggcaaaaggagatcttggtaacaagatgatgggccccatgtttgctgatgtttatgag ggcttgtgtggtaatctgaaatgccttgaagagaagaacgaaatccttcagggaaaactt tcacagctggaagaacacttgtcccagctgcaggataacccaccccaggagaagggcgag gtgctgggtgatgtcttgcagctggaaaccttgaagcaagaggcagccactcttgctgca aacaacacacagctccaagccagggtagagatgctggagactgagcgaggccagcaggaa gccaagctgcttgctgagcggggccacttcgaagaagaaaagcagcagctgtctagcctg atcactgacctgcagagctccatctccaacctcagccaggccaaggaagagctggagcag gcctcccaggctcatggggcccggttgactgcccaggtggcctctctgacctctgagctc accacactcaatgccaccatccagcaacaggatcaagaactggctggcctgaagcagcag gccaaagagaagcaggcccagctagcacagaccctccaacagcaagaacaggcctcccag ggcctccgccaccaggtggagcagctaagcagtagcctgaagcagaaggagcagcagttg aaggaggtagcggagaagcaggaggcaactaggcaggaccatgcccagcaactggccact gctgcagaggagcgagaggcctccttaagggagcgggatgcggctctcaagcagctggag gcactggagaaggagaaggctgccaagctggagattctgcagcagcaacttcaggtggct aatgaagcccgggacagtgcccagacctcagtgacacaggcccagcgggagaaggcagag ctgagccggaaggtggaggaactccaggcctgtgttgagacagcccgccaggaacagcat gaggcccaggcccaggttgcagagctagagttgcagctgcggtctgagcagcaaaaagca actgagaaagaaagggtggcccaggagaaggaccagctccaggagcagctccaggccctc aaagagtccttgaaggtcaccaagggcagccttgaagaggagaagcgcagggctgcagat gccctggaagagcagcagcgttgtatctctgagctgaaggcagagacccgaagcctggtg gagcagcataagcgggaacgaaaggagctggaagaagagagggctgggcgcaaggggctg gaggctcgattacagcagcttggggaggcccatcaggctgagactgaagtcctgcggcgg gagctggcagaggccatggctgcccagcacacagctgagagtgagtgtgagcagctcgtc aaagaagtagctgcctggcgtgagcggtatgaggatagccagcaagaggaggcacagtat ggcgccatgttccaggaacagctgatgactttgaaggaggaatgtgagaaggcccgccag gagctgcaggaggcaaaggagaaggtggcaggcatagaatcccacagcgagctccagata agccggcagcagaacgaactagctgagctccatgccaacctggccagagcactccagcag gtccaagagaaggaagtcagggcccagaagcttgcagatgacctctccactctgcaggaa aagatggctgccaccagcaaagaggtggcccgcttggagaccttggtgcgcaaggcaggt gagcagcaggaaacagcctcccgggagttagtcaaggagcctgcgagggcaggagacaga cagcccgagtggctggaagagcaacagggacgccagttctgcagcacacaggcagcgctg caggctatggagcgggaggcagagcagatgggcaatgagctggaacggctgcgggccgcg ctgatggagagccaggggcagcagcaggaggagcgtgggcagcaggaaagggaggtggcg cggctgacccaggagcggggccgtgcccaggctgaccttgccctggagaaggcggccaga gcagagcttgagatgcggctgcagaacgccctcaacgagcagcgtgtggagttcgctacc ctgcaagaggcactggctcatgccctgacggaaaaggaaggcaaggaccaggagttggcc aagcttcgtggtctggaggcagcccagataaaagagctggaggaacttcggcaaaccgtg aagcaactgaaggaacagctggctaagaaagaaaaggagcacgcatctggctcaggagcc caatctgaggctgctggcaggacagagccaacaggccccaagctggaggcactgcgggca gaggtgagcaagctggaacagcaatgccagaagcagcaggagcaggctgacagcctggaa cgcagcctcgaggctgagcgggcctcccgggctgagcgggacagtgctctggagactctg cagggccagttagaggagaaggcccaggagctagggcacagtcagagtgccttagcctcg gcccaacgggagttggctgccttccgcaccaaggtacaagaccacagcaaggctgaagat gagtggaaggcccaggtggcccggggccggcaagaggctgagaggaaaaatagcctcatc agcagcttggaggaggaggtgtccatcctgaatcgccaggtcctggagaaggagggggag agcaaggagttgaagcggctggtgatggccgagtcagagaagagccagaagctggaggag aggctgcgcctgctgcaggcagagacagccagcaacagtgccagagctgcagaacgcagc tctgctctgcgggaggaggtgcagagcctccgggaggaggctgagaaacagcgggtggct tcagagaacctgcggcaggagctgacctcacaggctgagcgtgcggaggagctgggccaa gaattgaaggcgtggcaggagaagttcttccagaaagagcaggccctctccaccctgcag ctcgagcacaccagcacacaggccctggtgagtgagctgctgccagctaagcacctctgc cagcagctgcaggccgagcaggccgctgccgagaaacgccaccgtgaggagctggagcag agcaagcaggccgctgggggactgcgggcagagctgctgcgggcccagcgggagcttggg gagctgattcctctgcggcagaaggtggcagagcaggagcgaacagctcagcagctgcgg gcagagaaggccagctatgcagagcagctgagcatgctgaagaaggcgcatggcctgctg gcagaggagaaccgggggctgggtgagcgggccaaccttggccggcagtttctggaagtg gagttggaccaggcccgggagaagtatgtccaagagttggcagccgtacgtgctgatgct gagacccgtctggctgaggtgcagcgagaagcacagagcactgcccgggagctggaggtg atgactgccaagtatgagggtgccaaggtcaaggtcctggaggagaggcagcggttccag gaagagaggcagaaactcactgcccaggtggagcagctagaggtatttcagagagagcaa actaagcaggtggaagaactgagtaagaaactggctgactctgaccaagccagcaaggtg cagcagcagaagctgaaggctgtccaggctcagggaggcgagagccagcaggaggcccag cgcctccaggcccagctgaatgaactgcaagcccagttgagccagaaggagcaggcagct gagcactataagctgcagatggagaaagccaaaacacattatgatgccaagaagcagcag aaccaagagctgcaggagcagctgcggagcctggagcagctgcagaaggaaaacaaagag ctgcgagctgaagctgaacggctgggccatgagctacagcaggctgggctgaagaccaag gaggctgaacagacctgccgccaccttactgcccaggtgcgcagcctggaggcacaggtt gcccatgcagaccagcagcttcgagacctgggcaaattccaggtggcaactgatgcttta aagagccgtgagccccaggctaagccccagctggacttgagtattgacagcctggatctg agctgcgaggaggggaccccactcagtatcaccagcaagctgcctcgtacccagccagac ggcaccagcgtccctggagaaccagcctcacctatctcccagcgcctgccccccaaggta gaatccctggagagtctctacttcactcccatccctgctcggagtcaggcccccctggag agcagcctggactccctgggagacgtcttcctggactcgggtcgtaagacccgctccgct cgtcggcgcaccacgcagatcatcaacatcaccatgaccaagaagctagatgtggaagag ccagacagcgccaactcatcgttctacagcacgcggtctgctcctgcttcccaggctagc ctgcgagccacctcctctactcagtctctagctcgcctgggttctcccgattatggcaac tcagccctgctcagcttgcctggctaccgccccaccactcgcagttctgctcgtcgttcc caggccggggtgtccagtggggcccctccaggaaggaacagcttctacatgggcacttgc caggatgagcctgagcagctggatgactggaaccgcattgcagagctgcagcagcgcaat cgagtgtgccccccacatctgaagacctgctatcccctggagtccaggccttccctgagc ctgggcaccatcacagatgaggagatgaaaactggagacccccaagagaccctgcgccga gccagcatgcagccaatccagatagccgagggcactggcatcaccacccggcagcagcgc aaacgggtctccctagagccccaccagggccctggaactcctgagtctaagaaggccacc agctgtttcccacgccccatgactccccgagaccgacatgaagggcgcaaacagagcact actgaggcccagaagaaagcagctccagcttctactaaacaggctgaccggcgccagtcg atggccttcagcatcctcaacacacccaagaagctagggaacagccttctgcggcgggga gcctcaaagaaggccctgtccaaggcttcccccaacactcgcagtggaacccgccgttct ccgcgcattgccaccaccacagccagcgccgccactgctgccgccattggtgccacccct cgagccaagggcaaggcaaagcactaa >gi568815587f:71860748_72096312|GENSCAN_predicted_peptide_5|192_aa MNKRDYMNTSVQEPPLDYSFRSIHVIQDLVNEEPRTGLRPLKRSKSGKSLTQSLWLNNNV LNDLRDFNQVASQLLEHPENLAWIDLSFNDLTSIDPVLTTFFNLSVLYLHGNSIQRLGEV NKLAVLPRLRSLTLHGNPMEEEKGYRQYVLCTLSRITTFDFSGVTKADRTTAEVWKRMNI KPKKAWTKQNTL >gi568815587f:71860748_72096312|GENSCAN_predicted_CDS_5|579_bp atgaacaaacgggactatatgaacacttcggtacaggagccccctcttgactactccttc agaagcatccacgtcattcaagatctggtaaatgaggagccaaggacaggactacgacca ctgaagcgttcaaagtcggggaaatcactgacccagtccctgtggctgaataacaatgtt ctcaatgatctgagagacttcaaccaggtggcttcacagctgttggagcacccagagaac ctggcctggatcgacctgtcctttaatgacctgacttccattgaccctgtcctaacaact ttcttcaacctgagtgtcctctatcttcacggcaacagcatccagcgcctgggggaggtg aataagctggctgtccttcctcggctccgtagcctgacactccatgggaaccccatggag gaagagaaagggtataggcaatatgtgctgtgcaccctgtcccgtatcaccacgttcgac ttcagtggggtcaccaaagcagaccgcaccacagctgaagtctggaaacgcatgaacatc aagcccaagaaggcctggaccaagcagaatacactttga