GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:05:47 Sequence gi568815579f:2994652_3221176 : 226525 bp : 55.65% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 249 353 105 2 0 118 55 170 0.999 15.41 1.02 PlyA + 507 512 6 1.05 2.21 PlyA - 3012 3007 6 1.05 2.20 Term - 3304 3197 108 1 0 109 53 152 0.986 12.61 2.19 Intr - 6072 5996 77 1 2 109 78 110 0.558 11.83 2.18 Intr - 7852 7702 151 1 1 114 66 254 0.986 26.05 2.17 Intr - 10933 10786 148 0 1 93 53 271 0.999 24.85 2.16 Intr - 11317 11070 248 1 2 89 94 335 0.999 30.99 2.15 Intr - 12018 11769 250 2 1 98 81 438 0.794 42.07 2.14 Intr - 14294 14218 77 2 2 109 78 -44 0.760 -4.49 2.13 Intr - 15051 14891 161 1 2 79 89 121 0.855 11.52 2.12 Intr - 16452 16371 82 0 1 110 97 79 0.497 10.71 2.11 Intr - 19167 19018 150 0 0 72 95 50 0.824 4.97 2.10 Intr - 19963 19919 45 1 0 115 105 76 0.998 11.09 2.09 Intr - 21109 21002 108 1 0 130 73 115 0.999 15.18 2.08 Intr - 24815 24549 267 2 0 88 47 182 0.427 12.37 2.07 Intr - 25122 25048 75 0 0 60 84 140 0.998 11.01 2.06 Intr - 30431 30369 63 2 0 65 105 87 0.959 7.51 2.05 Intr - 33222 33178 45 0 0 93 93 84 0.995 8.49 2.04 Intr - 33731 33668 64 1 1 102 77 113 0.997 10.81 2.03 Intr - 34152 34055 98 0 2 121 94 121 0.970 15.31 2.02 Intr - 34346 34230 117 2 0 79 117 -11 0.713 2.07 2.01 Init - 36990 36928 63 0 0 79 59 78 0.731 3.10 2.00 Prom - 46350 46311 40 0.49 3.11 PlyA - 48339 48334 6 1.05 3.10 Term - 59389 59168 222 1 0 108 43 339 0.993 28.64 3.09 Intr - 59543 59469 75 2 0 102 38 93 0.934 5.91 3.08 Intr - 61075 61013 63 1 0 91 94 57 0.951 6.01 3.07 Intr - 61705 61661 45 1 0 86 101 109 0.995 11.09 3.06 Intr - 63091 63028 64 0 1 98 52 82 0.826 4.81 3.05 Intr - 63456 63363 94 1 1 122 76 33 0.967 5.02 3.04 Intr - 65321 65152 170 1 2 34 93 58 0.702 1.01 3.03 Intr - 66606 66509 98 0 2 105 99 122 0.887 14.31 3.02 Intr - 68320 68098 223 0 1 90 -1 114 0.308 1.36 3.01 Init - 72140 72082 59 2 2 70 98 15 0.536 1.43 3.00 Prom - 79543 79504 40 -3.21 4.00 Prom + 83810 83849 40 -3.11 4.01 Init + 85187 85218 32 1 2 88 81 -7 0.257 -2.05 4.02 Intr + 87051 87098 48 0 0 91 99 7 0.358 0.38 4.03 Term + 90379 90532 154 1 1 107 48 121 0.834 7.71 4.04 PlyA + 92444 92449 6 1.05 5.00 Prom + 95355 95394 40 -6.90 5.01 Init + 100001 100136 136 1 1 88 109 302 0.864 32.57 5.02 Intr + 101835 101918 84 1 0 72 77 63 0.813 3.89 5.03 Intr + 105120 105207 88 1 1 52 98 36 0.252 0.53 5.04 Intr + 110785 110972 188 1 2 34 64 145 0.025 6.55 5.05 Intr + 115498 115682 185 2 2 99 105 554 0.995 58.23 5.06 Intr + 118679 118833 155 1 2 124 86 366 0.998 39.38 5.07 Intr + 120293 120476 184 2 1 84 70 370 0.441 35.11 5.08 Intr + 123105 123192 88 2 1 82 43 37 0.332 -1.36 5.09 Intr + 123775 123967 193 2 1 66 19 116 0.533 1.57 5.10 Intr + 124273 124402 130 1 1 103 64 246 0.997 24.90 5.11 Intr + 124555 124708 154 0 1 107 62 374 0.999 36.86 5.12 Intr + 126338 126522 185 0 2 119 10 548 0.000 50.13 5.13 Intr + 136034 136165 132 1 0 -25 0 207 0.458 1.95 5.14 Term + 136569 136769 201 2 0 37 42 352 0.984 23.21 5.15 PlyA + 137243 137248 6 1.05 6.00 Prom + 141226 141265 40 -4.51 6.01 Init + 141800 141944 145 1 1 110 92 233 0.908 26.15 6.02 Intr + 153940 154124 185 2 2 80 105 513 0.995 52.23 6.03 Intr + 155480 155634 155 1 2 114 67 262 0.999 26.08 6.04 Intr + 157056 157184 129 0 0 72 89 322 0.975 31.01 6.05 Intr + 161172 161301 130 0 1 58 69 258 0.994 22.10 6.06 Intr + 163077 163230 154 2 1 72 110 105 0.999 11.26 6.07 Term + 168142 168368 227 2 2 102 46 519 0.999 46.37 6.08 PlyA + 169198 169203 6 -0.45 7.00 Prom + 170017 170056 40 -0.61 7.01 Init + 178007 181227 3221 1 2 55 23 976 0.222 76.93 7.02 Term + 183031 183706 676 1 1 48 48 412 0.315 26.82 7.03 PlyA + 184065 184070 6 -3.24 8.00 Prom + 184082 184121 40 -14.11 8.01 Init + 184142 185281 1140 1 0 84 9 2003 0.010 186.20 8.02 Intr + 186179 186265 87 1 0 45 34 104 0.015 1.36 8.03 Intr + 189757 189927 171 0 0 131 48 10 0.653 1.95 8.04 Intr + 191281 191563 283 0 1 60 86 534 0.445 47.93 8.05 Intr + 194032 194337 306 2 0 100 70 42 0.083 0.57 8.06 Intr + 197819 198009 191 0 2 100 88 432 0.711 44.23 8.07 Intr + 198522 198777 256 2 1 91 75 303 0.817 26.85 8.08 Intr + 199004 199159 156 0 0 91 65 89 0.988 7.39 8.09 Intr + 201532 201626 95 2 2 108 105 206 0.837 24.48 8.10 Intr + 202012 202225 214 0 1 33 93 139 0.639 7.71 8.11 Intr + 203297 203436 140 0 2 1 36 73 0.444 -5.91 8.12 Intr + 204166 204246 81 0 0 91 67 192 0.954 17.73 8.13 Intr + 204283 204405 123 0 0 85 59 43 0.698 2.39 8.14 Intr + 204581 204631 51 1 0 47 101 56 0.687 2.39 8.15 Intr + 206872 206975 104 0 2 99 77 181 0.999 17.57 8.16 Intr + 209105 209193 89 2 2 91 99 115 0.997 13.01 8.17 Intr + 209354 209493 140 0 2 78 76 239 0.997 22.39 8.18 Intr + 209922 210100 179 2 2 80 89 418 0.999 40.33 8.19 Intr + 211288 211375 88 1 1 5 105 109 0.996 4.77 8.20 Intr + 211501 211539 39 0 0 150 91 67 0.989 12.11 8.21 Intr + 211611 211774 164 2 2 90 77 383 0.941 36.69 8.22 Intr + 212547 212600 54 0 0 61 109 95 0.988 7.48 8.23 Intr + 212740 212818 79 1 1 94 99 172 0.744 18.95 8.24 Intr + 216970 217108 139 0 1 47 91 70 0.084 3.74 8.25 Intr + 220311 220479 169 1 1 48 94 117 0.148 7.82 8.26 Term + 222426 222492 67 0 1 50 55 64 0.434 -3.00 8.27 PlyA + 223193 223198 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 126338 126528 191 0 2 119 53 558 0.999 53.33 S.002 Init + 151234 151311 78 0 0 80 38 64 0.859 1.63 S.003 Sngl + 184142 185296 1155 1 0 84 45 2035 0.989 195.57 S.004 Term + 212978 213037 60 1 0 97 47 78 0.835 2.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_1|34_aa VPEMSPVTCCDVSSNNRLVVTGSGEHASVYQITY >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_1|105_bp gtgcctgagatgtctccagtcacgtgctgtgacgtctcttccaacaaccgcctcgttgtc acaggctccggggagcacgcctccgtgtaccagatcacctactga >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_2|798_aa MKLAGRVQWLMPVISVLWEAEPPHHPTSASPTLAASISSSSGGAWGAFRIGRMYPQGRHP TPLQSGQPFKFSILEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSY GLNIEMHKQAEIVKRLSGICAQIIPFLTQEHQQQVLQAVERAKQVTVGELNSLIGQQQLQ PLSHHAPPVPLTPRPAGLVGGSATGLLALSGALAAQAQLAAAVKEDRAGVEAEGSRGEWA ARGGIGWGDGAGEDDWQSRVANISSASPSPPESLVEEERPSGPGGGGKQRADEKEPSGPY ESDEDKSDYNLVVDEDQPSEPPSPATTPCGKVPICIPARRDLVDSPASLASSLGSPLPRA KELILDASTPGPSSASHLCQLAAKPAPSTDSVALRSPLTLSSPFTTSFSLGSHSTLNGDL SVPSSYVSLHLSPQVSSSVVYGRSPVMAFESHPHLRGSSVSSSLPSIPGGKPAYSFHVSA DGQMQPVPFPSDALVGAGIPRHARQLHTLAHGEVVCAVTISGSTQHVYTGGKGCVKVWDV GQPGAKTPVAQLDCLNRDNYIRSCKLLPDGRSLIVGGEASTLSIWDLAAPTPRIKAELTS SAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGTRLW TGGLDNTVRCWDLREGRQLQQHDFSSQIFSLGHCPNQDWLAVGMESSNVEILHVRKPEKY QLHLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSKESSSVLSCDISRNNK YIVTGSGDKKATVYEVVY >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_2|2397_bp atgaagctggctggccgggtgcagtggctcatgcctgtaatctcagtgctttgggaggcc gagcctccccaccaccccacttcagcctcccccactcttgccgcctccatatcatcaagc tctggtggcgcctggggggcttttcggatcggcaggatgtacccccagggaaggcacccg accccgctccagtccggccagcccttcaagttctcgatcttggagatctgcgaccgcatc aaagaagaattccagtttcttcaggctcaataccacagcctcaagctagaatgtgagaag ctggccagcgagaagacggaaatgcagcgacattatgtcatgtattatgagatgtcgtac gggctcaacattgaaatgcataagcaggcggagattgtgaagcgtctgagcggtatctgc gctcagattatccccttcctgacccaggagcatcagcagcaggtgctccaggccgtagaa cgcgccaagcaggtcaccgtgggggagctgaacagcctcatcgggcagcagcagctccag ccgctgtcccaccacgcaccccctgtgcccctcaccccccgcccagccgggctggtgggc ggcagtgctacggggctgcttgctctgtctggagccctggctgcccaggctcagctggcg gcggctgtcaaggaggaccgtgcgggcgtggaggccgaggggtccagaggtgagtgggca gcacggggtggcattggctggggagacggggctggggaggacgactggcagtccagggta gcaaacatcagcagtgcatctccctcgccccctgagagtctcgtggaggaggagcgaccg agtggccctggtggtggcgggaagcagagagcagatgagaaggagccatcaggaccttat gaaagcgacgaagacaagagtgattacaatctggtggtggacgaggaccaaccctcagag ccccccagcccggctaccaccccctgcggaaaggtacccatctgcattcctgcccgtcgg gacctggtggacagtccagcctccttggcctctagccttggctcaccgctgcctagagcc aaggagctcatcctggacgcttccacccccgggcccagctcggccagtcacctctgccag cttgctgccaagccagcaccttccacggacagcgtcgccctgaggagccccctgactctg tccagtcccttcaccacgtccttcagcctgggctcccacagcactctcaacggagacctc tccgtgcccagctcctacgtcagcctccacctgtccccccaggtcagcagctctgtggtg tacggacgctcccccgtgatggcatttgagtctcatccccatctccgagggtcatccgtc tcttcctccctacccagcatccctgggggaaagccggcctactccttccacgtgtctgcg gacgggcagatgcagccggttcccttcccctcggatgcactggtaggcgcgggcatcccg cggcacgcccggcagctgcacacgctggcccatggcgaggtggtctgcgcggtcaccatc agcggctccacacagcatgtgtacacgggcggcaagggctgtgtgaaggtgtgggacgtg ggccagcctggggccaagacgcccgtggcccagctcgactgcctgaaccgagacaactac attcgttcctgcaagttgctgccggatggccggagtctgatcgtgggcggtgaggccagc accttgtccatttgggacctggcggcgcccaccccccgtatcaaggccgagctgacttcc tcagccccagcctgctacgccctggccgtcagccccgacgccaaggtttgcttctcctgc tgcagcgatggcaacattgtggtctgggacctgcagaatcagactatggtcaggcagttc cagggccacacggacggcgccagctgcattgatatttccgattacggcactcggctctgg acagggggcctggacaacacggtgcgctgctgggacctgcgggagggccgccagctgcag cagcatgacttcagctcccagattttctccctgggccactgccctaaccaggactggctg gcggtcggaatggagagtagcaacgtggagatcctgcacgtccgcaagccggagaaatac cagctgcacctccacgagagctgcgtgctgtccctgaagtttgcctcctgcggacggtgg tttgtgagcaccgggaaggacaacctgctcaacgcctggaggacgccgtacggggccagc attttccagtccaaggagtcgtcctcagtcctgagttgtgacatctccagaaataacaaa tacatcgtgacaggctcgggggacaagaaggccaccgtgtatgaggtggtctactga >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_3|370_aa MALTEESCSKTSCGSLVPSRALEQRGLAHSRRSINVSQEWLPSGRRHYRCLSAEKETKAQ QEPPPSQSQGHRARPWHASRSCHSRAGGFDAGSRGSSHLPQQLKFTTSDSCDRIKDEFQL LQAQYHRPFVPPRASSFLSSDCRTHSQAGQTETLTVEGAGVSFPCRPQKLLCGFLDMKGW IPPASQRAQLLPTSLPLNSMSSAPMKVYGFIDQARLKLECDKLASEKSEMQRHYVMYYEM SYGLNIEMHKQAEIVKRLNGICAQVLPYLSQEHQQQVLGAIERAKQVTAPELNSIIRQQL QAHQLSQLQALALPLTPLPVGLQPPSLPAVSAGTGLLSLSALGSQAHLSKEDKNGHDGDT HQEDDGEKSD >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_3|1113_bp atggcgctaactgaagaatcttgctctaaaacctcctgtggctccctagtgccctccagg gccctggagcagcggggcctggcacacagtaggcgttctataaatgtgtcacaagaatgg cttccctcaggaaggcggcattaccgctgcctttctgcagaaaaggaaactaaggctcag caagaaccaccgcccagccagagccaaggtcacagagcacgtccgtggcacgcgtccagg tcgtgccacagcagggccggcggcttcgacgcgggcagccggggctcctcgcacctaccc cagcaactcaaattcaccacctcggactcctgcgaccgcatcaaagacgaatttcagcta ctgcaagctcagtaccacagaccgttcgtgcctcctcgtgcttcatccttcctgtcgtcc gactgtcgtacccattctcaagctgggcaaactgagacactgacagtcgaaggtgctgga gtcagctttccatgcaggccccagaagcttttgtgcgggttcctcgatatgaagggttgg ataccgcctgcatcacagagagctcaactcttaccaacttccctgcccctcaattcaatg tcctcagcccctatgaaagtttacggcttcattgaccaggcacgcctcaagctcgaatgt gacaagttggccagtgagaagtcagagatgcagcgtcactatgtgatgtactacgagatg tcctacggcttgaacatcgagatgcacaaacaggctgagatcgtcaaaaggctgaacggg atttgtgcccaggtcctgccctacctctcccaagagcaccagcagcaggtcttgggagcc attgagagggccaagcaggtcaccgctcccgagctgaactctatcatccgacagcagctc caagcccaccagctgtcccagctgcaggccctggccctgcccttgaccccactacccgtg gggctgcagccgccttcgctgccggcggtcagcgcaggcaccggcctcctctcgctgtcc gcgctgggttcccaggcccacctctccaaggaagacaagaacgggcacgatggtgacacc caccaggaggatgatggcgagaagtcggattag >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_4|77_aa MGRTWAFPEGSLTSQVLSALVPSMQGSSVPGNGLVLTPGPHKIQFLKVQEAKATRKGGGR GELRREGGAANAMKALL >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_4|234_bp atggggcggacttgggcttttcctgaaggaagcttgacaagccaagtcctgtctgccctg gtcccctcgatgcagggcagctctgtgccaggaaatggccttgttctcaccccgggccct cataaaatccagttcctcaaggtccaggaggccaaggccacaaggaagggaggaggccgt ggggagctgaggagggagggaggggccgccaacgccatgaaggcactgctgtga >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_5|700_aa MTLESMMACCLSDEVKESKRINAEIEKQLRRDKRDARRELKLLLLEPPPPSLHAPGLLCL HFGLEIDTGFKDEVCRQALSGQHWWVGLRATCRAWEQCGVRIRQRRCGFGVGGGCVSWFI DSGAFLLCPHADGIPPGTISALMKEPPKSSVAEEADPASQHHCLRGTGESGKSTFIKQMR IIHGAGYSEEDKRGFTKLVYQNIFTAMQAMIRAMETLKILYKYEQNKANALLIREVDVEK VTTFEHQYVSAIKTLWEDPGIQECYDRRREYQLSDSAKYYLTDVDRIATLGYLPTQQDVL RVRVPTTGIIEYPFDLENIIFRYRPGHSRRGGGTERLICPPPLSTTRWQRAGQVLWDAAG GLEGRWDRPRGYRQLCPLQPSLEGPCREQLWHQCLLLRVTLGEHLCPWPSEGKGFCAQGD IQMPDVGSLGTHRRMVDVGGQRSERRKWIHCFENVTSIMFLVALSEYDQVLVESDNENRM EESKALFRTIITYPWFQNSSVILFLNKKDLLEDKILYSHLVDYFPEFDGPQRDAQAAREF ILKMFVDLNPDSDKIIYSHFTCATDTENIRFVFAAVKDTILQLNLKEYNLRAGAAACAAT ARIKAQDLLGKKEEELLKQLDDLKVELSQLRVAKTQKENLGKFYKGKKYKHLHLRPKKTR AVRRPLSKHEENLTIKKQQQSEQLSPVRKCAVKARVARCQ >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_5|2103_bp atgactctggagtccatgatggcgtgttgcctgagcgatgaggtgaaggagtccaagcgg atcaacgccgagatcgagaagcagctgcggcgggacaagcgcgacgcccggcgcgagctc aagctgctgctgctcgaacccccgccgccatccctgcatgcacccggtcttctgtgcctg cactttggacttgagattgacacggggttcaaggacgaagtctgccgacaggcgctttca ggacagcactggtgggtgggtcttcgggccacctgcagggcctgggagcagtgtggtgtc cggatcagacagcggcgctgtggatttggtgtagggggtggttgtgtttcctggttcata gacagcggtgccttcttgctgtgtcctcatgcggacgggatccctcctggaaccatcagt gccctcatgaaagagccccccaagagctccgtggcggaagaagctgatcccgcctcccaa caccactgtcttaggggcacgggcgagagcgggaagagcacgttcatcaagcagatgcgc atcatccacggcgccggctactcggaggaggacaagcgcggcttcaccaagctcgtctac cagaacatcttcaccgccatgcaggccatgatccgggccatggagacgctcaagatcctc tacaagtacgagcagaacaaggccaatgcgctcctgatccgggaggtggacgtggagaag gtgaccaccttcgagcatcagtacgtcagtgccatcaagaccctgtgggaggacccgggc atccaggaatgctacgaccgcaggcgcgagtaccagctctccgactctgccaagtactac ctgaccgacgttgaccgcatcgccaccttgggctacctgcccacccagcaggacgtgctg cgggtccgcgtgcccaccaccggcatcatcgagtaccctttcgacctggagaacatcatc ttccggtaccgcccgggccacagcaggcggggagggggcactgagaggctcatttgcccg ccgcctctgtccaccacacgctggcagagggccggccaggtgctgtgggatgcagctgga ggccttgaaggccggtgggatcgtcccagaggataccggcaactctgccctctgcagccg tcactggagggtccctgcagggagcaactgtggcatcagtgtttactcctgcgagtgacg cttggtgaacatctgtgtccctggcccagtgaggggaaaggcttctgtgctcagggtgac atccagatgccagatgtagggtccctgggcacccatcgcaggatggtggatgtggggggc cagcggtcggagcggaggaagtggatccactgctttgagaacgtgacatccatcatgttt ctcgtcgccctcagcgaatacgaccaagtcctggtggagtcggacaacgagaaccggatg gaggagagcaaagccctgttccggaccatcatcacctacccctggttccagaactcctcc gtcatcctcttcctcaacaagaaggacctgctggaggacaagatcctgtactcgcacctg gtggactacttccccgagttcgatggtccccagcgggacgcccaggcggcgcgggagttc atcctgaagatgttcgtggacctgaaccccgacagcgacaagatcatctactcacacttc acgtgtgccaccgacacggagaacatccgcttcgtgttcgcggccgtgaaggacaccatc ctgcagctcaacctcaaggagtacaacctgcgagcgggggcggcggcctgtgcagcaacg gccaggatcaaggcccaagaccttctcgggaagaaggaggaggagctgctgaaacagctg gacgacctaaaggtggagctgtcccagctgcgcgtcgccaaaactcagaaagaaaacctc gggaaattctacaagggcaagaagtacaagcacctgcacctgcggcctaagaagacacgc gccgtgcgccgcccgctcagcaagcacgaggagaacctgacgatcaagaagcagcagcag agcgagcagctgtccccagtgcggaagtgcgcggtcaaggcccgagtggcgcgttgtcaa taa >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_6|374_aa MARSLTWRCCPWCLTEDEKAAARVDQEINRILLEQKKQDRGELKLLLLGPGESGKSTFIK QMRIIHGAGYSEEERKGFRPLVYQNIFVSMRAMIEAMERLQIPFSRPESKHHASLVMSQD PYKVTTFEKRYAAAMQWLWRDAGIRACYERRREFHLLDSAVYYLSHLERITEEGYVPTAQ DVLRSRMPTTGINEYCFSVQKTNLRIVDVGGQKSERKKWIHCFENVIALIYLASLSEYDQ CLEENNQENRMKESLALFGTILELPWFKSTSVILFLNKTDILEEKIPTSHLATYFPSFQG PKQDAEAAKRFILDMYTRMYTGCVDGPEGSKKGARSRRLFSHYTCATDTQNIRKVFKDVR DSVLARYLDEINLL >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_6|1125_bp atggcccgctcgctgacctggcgctgctgcccctggtgcctgacggaggatgagaaggcc gccgcccgggtggaccaggagatcaacaggatcctcttggagcagaagaagcaggaccgc ggggagctgaagctgctgcttttgggcccaggcgagagcgggaagagcaccttcatcaag cagatgcggatcatccacggcgccggctactcggaggaggagcgcaagggcttccggccc ctggtctaccagaacatcttcgtgtccatgcgggccatgatcgaggccatggagcggctg cagattccattcagcaggcccgagagcaagcaccacgctagcctggtcatgagccaggac ccctataaagtgaccacgtttgagaagcgctacgctgcggccatgcagtggctgtggagg gatgccggcatccgggcctgctatgagcgtcggcgggaattccacctgctcgattcagcc gtgtactacctgtcccacctggagcgcatcaccgaggagggctacgtccccacagctcag gacgtgctccgcagccgcatgcccaccactggcatcaacgagtactgcttctccgtgcag aaaaccaacctgcggatcgtggacgtcgggggccagaagtcagagcgtaagaaatggatc cattgtttcgagaacgtgatcgccctcatctacctggcctcactgagtgaatacgaccag tgcctggaggagaacaaccaggagaaccgcatgaaggagagcctcgcattgtttgggact atcctggaactaccctggttcaaaagcacatccgtcatcctctttctcaacaaaaccgac atcctggaggagaaaatccccacctcccacctggctacctatttccccagtttccagggc cctaagcaggatgctgaggcagccaagaggttcatcctggacatgtacacgaggatgtac accgggtgcgtggacggccccgagggcagcaagaagggcgcacgatcccgacgcctcttc agccactacacatgtgccacagacacacagaacatccgcaaggtcttcaaggacgtgcgg gactcggtgctcgcccgctacctggacgagatcaacctgctgtga >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_7|1298_aa MGRLYPSWDVCVLMGCLCPHGMAVFSWDACTLMGRLCPSWDACVLMGCLYPHGTPVPLMG HLCSHGTPVPLMGCLYPHGMPVPRMGCLHSHGTPVPSWDTCVLMGHLCSHGTPVPSWDAC ALMGQLCSHGTPVFSWDACALMGCLCSHGTPVPLMGCLCPHGTPVFSWDACAPHGMPVFS WDACAPHGMPVFSWDACAPHGMPVPSWDTCVLMGHLCPHGTPVFSWDACALMGCLCPHGT PVTSWDACVLMGRLCSHGTPVPSWDACVLMGRLCPHGMPVPSWDACALMGRLCSHGTPVP LMGRLCPHGTPVPSWDACVLMGRLCPSWDACALMGRLCPHGTPVFSWDACAPHGTPVFSW DSCALMGRLCSHGTPVPLMGRLCSHGTPVPSWDACVLMGRLCPSWDTCVLMGRLCPHGTP VFSWDACASHGTPVFSWDACALMGRLCPHGTPVFSWDACILMGRLCPHGTPVFSWDTCVL MGCLCPHGMPVPLMGRLCSHGMPVPSWDACTPHGTSVPSWDACAPHGMPVPSWDACVLMG CLYPHGTPVPLMGCLCSHGMPVPRMGRLNSHGTPVPSWDTCVLMGRLCSHGTPVPSWDAC VLMGRLCPHGTPVFSWDACALMGRLCPHGTPVFSWDACAPHGTPVPSWDACVLMGRLCPS WDACVLMGRLCPHGTPVPSWDACALMGRLCSHGTPVPSWDACALMGRLCSHGTPVPSWDA CALMGRLCSHGTPVFSWDACAPHGTPVFSWDACALMGRLCSHGTPVPLMGRLCSHGTPVP LMGHLCPHGTPVPSWDACVLMGRLCPSWDACALMGRLCSHGTPVPLMGHLCSHGTPVPSW DACVLMGRLCPSWDTCVLMGRLCPHGTPVFSWDACAPHGTPVFSRDACALMGRLCSHGTP VPLMGRLCSHGTPVPLMGHLCPHGTPVPSWDACVLMGRLHSHGTPVPSWDTCVLMGHLCS HGMPVPSWDACTPHGTPVFSWDACTLMGRLYPSWDICALMGCLCPSWDACALMGRLHSHG TPVPSWDACTLMGRLCPSWDACVLMGCLCPSWDACILMGHLCPHGMPVPFMGRLFPFCNA SGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSF PFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGS LGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFCNASGSLGLSFPFC NAPGSLGLSFLFCNASGPVSGIKSDESVLKGMQALDIY >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_7|3897_bp atgggacgcctgtacccctcatgggacgtctgtgttctcatgggatgcctgtgccctcat gggatggctgtgttctcatgggacgcctgtaccctcatgggacgcctgtgcccctcatgg gacgcctgtgttctcatgggatgcctgtaccctcatgggacgcctgtgcccctcatggga cacctgtgttctcatgggacacctgtgcccctcatgggatgcctgtaccctcatgggatg cctgtgccccgcatgggatgcctgcattctcatgggacacctgtgccctcatgggacacc tgtgttctcatgggacacctgtgttctcatgggacacctgtgccctcatgggacgcctgt gccctcatgggacaactgtgttctcatgggacgcctgtgttctcatgggacgcctgtgcc ctcatgggatgcctgtgttctcatgggacgcctgtgcccctcatgggatgcctgtgccct catgggacgcctgtgttctcatgggatgcctgtgcccctcatgggatgcctgtgttctca tgggacgcctgtgcccctcatgggatgcctgtgttctcatgggatgcctgtgcccctcat gggatgcctgtgccctcatgggacacctgtgttctcatgggacacctgtgccctcatggg acgcctgtgttctcatgggacgcctgtgccctcatgggatgcctgtgccctcatgggacg cctgtgacctcatgggatgcctgtgttctcatgggacgcctgtgttctcatgggacgcct gtgccctcatgggacgcctgtgttctcatgggacgcctgtgccctcatgggatgcctgtg ccctcatgggatgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgccc ctcatgggacgcctgtgccctcatgggacgcctgtgccctcatgggacgcctgtgttctc atgggacgcctgtgcccctcatgggacgcctgtgccctcatgggacgcctgtgccctcat gggacgcctgtgttctcatgggacgcctgtgcccctcatgggacgcctgtgttctcatgg gactcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgcccctcatggga cgcctgtgttctcatgggacgcctgtgccctcatgggacgcctgtgttctcatgggacgc ctgtgcccctcatgggacacctgtgttctcatgggacgcctgtgccctcatgggacgcct gtgttctcatgggacgcctgtgcctctcatgggacacctgtgttctcatgggacgcctgt gccctcatgggacgcctgtgccctcatgggacacctgtgttctcatgggacgcctgcatt ctcatgggacgcctgtgccctcatgggacacctgtgttctcatgggacacctgtgttctc atgggatgcctgtgccctcatgggatgcctgtacccctcatgggacgcctgtgttctcat gggatgcctgtaccctcatgggacgcctgtacccctcatgggacatctgtgccctcatgg gatgcctgtgcccctcatgggatgcctgtgccctcatgggacgcctgtgttctcatggga tgcctgtaccctcatgggacgcctgtgcccctcatgggatgcctgtgttctcatgggatg cccgtgccccgcatgggacgcctgaattctcatgggacacctgtgccctcatgggacacc tgtgttctcatgggacgcctgtgttctcatgggacacctgtgccctcatgggacgcctgt gttctcatgggacgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgcc ctcatgggacgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgcccct catgggacgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgcccctca tgggacgcctgtgttctcatgggacgcctgtgccctcatgggacgcctgtgccctcatgg gacgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgccctcatgggac gcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgccctcatgggacgcc tgtgccctcatgggacgcctgtgttctcatgggacgcctgtgttctcatgggacgcctgt gcccctcatgggacgcctgtgttctcatgggacgcctgtgccctcatgggacgcctgtgt tctcatgggacgcctgtgcccctcatgggacgcctgtgttctcatgggacgcctgtgccc ctcatgggacacctgtgccctcatgggacgcctgtgccctcatgggacgcctgtgttctc atgggacgcctgtgcccctcatgggacgcctgtgccctcatgggacgcctgtgttctcat gggacgcctgtgcccctcatgggacacctgtgttctcatgggacgcctgtgccctcatgg gacgcctgtgttctcatgggacgcctgtgcccctcatgggacacctgtgttctcatggga cgcctgtgccctcatgggacgcctgtgttctcatgggacgcctgtgcccctcatgggaca cctgtgttctcacgggacgcctgtgccctcatgggacgcctgtgttctcatgggacgcct gtgcccctcatgggacgcctgtgttctcatgggacgcctgtgcccctcatgggacacctg tgccctcatgggacgcctgtgccctcatgggacgcctgcgttctcatgggacgcctgcat tctcatgggacgcctgtgccctcatgggacacctgtgttctcatgggacacctgtgttct catgggatgcctgtgccctcatgggatgcctgtacccctcatgggacgcctgtgttctca tgggatgcctgtaccctcatgggacgcctgtacccctcatgggacatctgtgctctcatg ggatgcctgtgcccctcatgggatgcctgtgccctcatgggacgcctgcattctcatggg acacctgtgccctcatgggatgcctgtaccctcatgggacgcctgtgcccctcatgggat gcctgtgttctcatgggatgcctgtgcccctcatgggacgcctgcattctcatgggacac ctgtgccctcatgggatgcctgtacccttcatgggacgccttttccccttctgcaacgca tcgggctctctgggcctcagtttccccttctgcaacgcatcgggctctctgggcctcagt ttccccttctgcaacgcatcgggctctctgggcctcagtttccccttctgcaacgcctcg ggctctctgggcctcagtttccccttctgcaacgcatcgggctctctgggcctcagtttc cccttctgcaacgcctcgggctctctgggcctcagtttccccttctgcaacgcctcgggc tctctgggcctcagtttccccttctgcaacgcatcgggctctctgggcctcagtttcccc ttctgcaacgcatcgggctctctgggcctcagtttccccttctgcaacgcatcgggctct ctgggcctcagtttccccttctgcaacgcctcgggctctctgggcctcagtttccccttc tgcaacgcctcgggctctctgggcctcagtttccccttctgcaacgcatcgggctctctg ggcctcagtttccccttctgcaacgcatcgggctctctgggcctcagtttccccttctgc aacgcaccgggctctctgggcctcagtttcctcttctgtaacgcatcaggccctgtttcg gggatcaagtcggatgagtcagtgctcaagggcatgcaggcacttgacatttattag >gi568815579f:2994652_3221176|GENSCAN_predicted_peptide_8|1534_aa MNATGTPVAPESCQQLAAGGHSRLIVLHYNHSGRLAGRGGPEDGGLGALRGLSVAASCLV VLENLLVLAAITSHMRSRRWVYYCLVNITLSDLLTGAAYLANVLLSGARTFRLAPAQWFL REGLLFTALAASTFSLLFTAGERFATMVRPVAESGATKTSRVYGFIGLCWLLAALLGMLP LLGWNCLCAFDRCSSLLPLYSKRYILFCLVIFAGVLATIMGLYGAIFRLVQASGQKAPRP AARRKARRLLKTVLMILLAFLVCWGPLFGLLLADVFGSNLWAQEYLRGMDWILALAVLNS AVNPIIYSFRSREVCRAVLSFLCCGCLRLGMRGPGDCLARAVEAHSGASTTDSSLRPRDS FRGSRSLSFRMREPLSSISSWRRFSSTPFSMGDASPTGLRCKQITEFRSASSGLLPAAFV SSIPAPITPGHTIHLLLWTSSSVSVDSVALLGSRLCPQHPPHGWHTDPGGYPCRGESAGA AAAAVPSQLPPRAAPPPARMLEEAGEVLENMLKASCLPLGFIVFLPAVLLLVAPPLPAAD AAHEFTVYRMQQYDLQGQPYVRKLDAAFPVCGIRPAGPWAPGVSGGAHGAAALPQPPCSS SAPWEAQSSSRRVGEAQPAHPSVSLWPPLRDWGWFGPEPCRGQALEKPQRAAFEATLHHH LPGTRNAVLNTEARTMAAEVLSRRCVLMRLLDFSYEQYQKALRQSAGAVVIILPRAMAAV PQDVVRGQSLGPLLPPPGTVTGPPATLPPPPGQQPPKCVSAPQQFMEIEPEMLAMETAVP VYFAVEDEALLSIYKQTQAASASQGSASAAEAAGSVHTKDQNHGLRRSLSDPQGLGVYET HGHLFSSLPAKDILSITSRVGGVVLLRTATANGFQMVTSGVQSKAVSDWLIASVETLRPH RPFRAPRDISRVKRGWGAAGAEVASRIIRADPEGEACVAAAAALAGALKPEEAPTDSSSI SLRDGAAMLMHVQAVHGSFLVVTDTIWSPRLKISPLALYESLLAPGLEQNPTLGRLTGLG GEDLPTIVIVAHYDAFGVAPGLDSGSSPVPRQSACRGWRPGSAALPVTAFARHPGPREGL RRAVMEHVAESAKGKARAWLSLGADSNGSGVSVLLELARLFSRLYTYKRTHAAYNLLFFA SGGGKFNYQGTKRWLEDNLDHTDSSLLQDNVAFVLCLDTVGRGSSLHLHVSKPPREGTLQ HAFLRELETVAAHQFPEVRFSMVHKRINLAEDVLAWEHERFAIRRLPAFTLSHLESHRDG QRSSIMDVRSRVDSKTLTRNTRIIAEALTRVIYNLTEKGTPPDMPVFTEQMQIQQEQLDS VMDWLTNQPRAAQLVDKDSTFLSTLEHHLSRYLKDVKQHHVKADKRDPEFVFYDQLKQVM NAYRVKPAVFDLLLAVGIAAYLGMAYVAVQELSRAKSGQWLPTSVQAEGGFGPIVDPGKP HFRGKASPSSSLLTYTDGNMEAEHKGFRRGLDESEGHAISHILSGVGEKGPGLGHRDNQC PEMDGTYLRPQSSGILLSHEKEGNPDAGYSAEAS >gi568815579f:2994652_3221176|GENSCAN_predicted_CDS_8|4605_bp atgaacgccacggggaccccggtggcccccgagtcctgccaacagctggcggccggcggg cacagccggctcattgttctgcactacaaccactcgggccggctggccgggcgcgggggg ccggaggatggcggcctgggggccctgcgggggctgtcggtggccgccagctgcctggtg gtgctggagaacttgctggtgctggcggccatcaccagccacatgcggtcgcgacgctgg gtctactattgcctggtgaacatcacgctgagtgacctgctcacgggcgcggcctacctg gccaacgtgctgctgtcgggggcccgcaccttccgtctggcgcccgcccagtggttccta cgggagggcctgctcttcaccgccctggccgcctccaccttcagcctgctcttcactgca ggggagcgctttgccaccatggtgcggccggtggccgagagcggggccaccaagaccagc cgcgtctacggcttcatcggcctctgctggctgctggccgcgctgctggggatgctgcct ttgctgggctggaactgcctgtgcgcctttgaccgctgctccagccttctgcccctctac tccaagcgctacatcctcttctgcctggtgatcttcgccggcgtcctggccaccatcatg ggcctctatggggccatcttccgcctggtgcaggccagcgggcagaaggccccacgccca gcggcccgccgcaaggcccgccgcctgctgaagacggtgctgatgatcctgctggccttc ctggtgtgctggggcccactcttcgggctgctgctggccgacgtctttggctccaacctc tgggcccaggagtacctgcggggcatggactggatcctggccctggccgtcctcaactcg gcggtcaaccccatcatctactccttccgcagcagggaggtgtgcagagccgtgctcagc ttcctctgctgcgggtgtctccggctgggcatgcgagggcccggggactgcctggcccgg gccgtcgaggctcactccggagcttccaccaccgacagctctctgaggccaagggacagc tttcgcggctcccgctcgctcagctttcggatgcgggagcccctgtccagcatctccagc tggagacgtttcagctccacaccattcagtatgggagacgccagccccacggggctacgg tgcaagcagataactgaatttcgaagtgcctcctctgggcttctcccagctgcctttgtg tcctctatcccagctcccatcacccctggtcacacaatccacctcctcctttggacttcg agctccgtgtcggtggatagtgtggctttgctggggtcccggctgtgtccccagcaccca ccacacggctggcacacagaccccggcggctacccatgccgaggtgagtccgcgggagcc gccgccgccgccgtcccgtcccagctgccgccccgcgcggccccgccgccggccaggatg ctggaggaagcgggcgaggtgctggagaacatgctgaaggcgtcttgtctgccgctcggc ttcatcgtcttcctgcccgctgtgctgctgctggtggcgccgccgctgcctgccgccgac gccgcgcacgagttcaccgtgtaccgcatgcagcagtacgacctgcagggccagccctac gtgaggaagctggacgcagctttccctgtatgtggcattcgcccagctggtccgtgggct cccggggtgagcggaggagcccatggggcagccgccctgcctcagcctccatgctcctca tctgcgccctgggaagcacagtccagctctcgcagggtaggcgaggcccagccagcacac ccctctgtgtcactctggccgcccctgagagactgggggtggtttgggccagagccctgc cgaggacaggcccttgagaagccccagcgtgctgcctttgaggccaccctccaccaccat cttcccggcacacggaatgcagtgctgaacacggaggcgcgcacgatggcggcggaggtg ctgagccgccgctgcgtgctcatgcggctactggacttctcctacgagcagtaccagaag gccctgcggcagtcggcgggcgccgtggtcatcatcctgcccagggccatggccgccgtg ccccaggacgtcgtccggggacagtcactgggccccctgctacccccaccagggacagtc actgggccccctgctaccttgccacccccaccaggccagcagccccctaagtgtgtgtct gctccacagcaattcatggagatcgagccggagatgctggccatggagaccgccgtcccc gtgtactttgccgtggaggacgaggccctgctgtctatctacaagcagacccaggctgcc tccgcctcccagggctccgcctctgctgctgaagctgctggctctgtgcacacgaaagac cagaaccacggactcaggaggagcctttccgatcctcaggggctcggggtttatgaaaca cacgggcatctgttcagttctctacctgccaaggacattctcagcatcacatctcgagtg ggcggcgtggtactgctgcgcacggccactgccaacggcttccagatggtcaccagcggg gtacagagcaaggccgtgagtgactggctgattgccagcgtggagacgctgcgtcctcac agacccttccgagcaccccgggatatcagcagagtcaagagagggtggggagctgccggt gcggaagtagcctctcggatcatcagggcagatccagagggtgaagcctgtgtcgctgct gctgcagcactggcaggggccttaaaacccgaggaagcacccaccgactcctctagcatt tccctgcgggacggcgccgccatgctgatgcatgtgcaagccgtccatggcagtttctta gttgtaacagacaccatctggtccccaaggctgaaaatatcccctctggccctttatgaa agtctgctggcccccgggctagagcagaatccgactttggggcggctgacggggctgggc ggagaggaccttcccaccatcgtcatcgtggcccactacgacgcctttggagtggccccc ggcttggattctggctccagtccagtgccgaggcaaagcgcctgtaggggatggcggcca gggtcagcggctctccctgtgacggcctttgcccgtcatcctggtccccgcgagggcctc aggcgagcggtcatggagcacgtagcagagtccgcgaagggaaaggcgcgtgcgtggctg tcgctgggcgcggactccaacgggagcggcgtctctgtgctgctggagctggcacgcctc ttctcccggctctacacctacaagcgcacgcacgccgcctacaacctcctgttctttgcg tctggaggaggcaagtttaactaccagggaaccaagcgctggctggaagacaacctggac cacacagactccagcctgcttcaggacaatgtggccttcgtgctgtgcctggacaccgtg ggccggggcagcagcctgcacctgcacgtgtccaagccgcctcgggagggcaccctgcag cacgccttcctgcgggagctggagacggtggccgcgcaccagttccctgaggtacggttc tccatggtgcacaagcggatcaacctggcggaggacgtgctggcctgggagcacgagcgc ttcgccatccgccgactgcccgccttcacgctgtcccacctggagagccaccgtgacggc cagcgcagcagcatcatggacgtgcggtcccgggtggattctaagaccctgacccgtaac acgaggatcattgcagaggccctgactcgagtcatctacaacctgacagagaaggggaca cccccagacatgccggtgttcacagagcagatgcagatccagcaggagcagctggactcg gtgatggactggctcaccaaccagccgcgggccgcgcagctggtggacaaggacagcacc ttcctcagcacgctggagcaccacctgagccgctacctgaaggacgtgaagcagcaccac gtcaaggctgacaagcgggacccagagtttgtcttctacgaccagctgaagcaagtgatg aatgcgtacagagtcaagccggccgtctttgacctgctcctggctgttggcattgctgcc tacctcggcatggcctacgtggctgtccaggagcttagcagggccaagagtggacagtgg ctgcccacctctgtgcaggcagaaggtggctttggccccatcgtggacccaggcaaaccc cattttagaggaaaagccagcccatcatcttccctgctaacatacacagatggtaacatg gaggccgagcacaaggggttcagacgtggattggacgaatctgagggccacgccattagc cacatactgagtggagtgggcgaaaagggcccaggtttaggtcaccgggataaccagtgt ccagagatggacggcacttacctgaggccacagagcagtggaatattactcagccatgaa aaggaagggaatcctgacgcaggctacagtgcggaggcatcttga