GENSCAN 1.0 Date run: 5-Nov-116 Time: 07:32:25 Sequence gi568815587r:108058945_108322536 : 263592 bp : 39.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2162 2228 67 1 1 34 100 42 0.083 1.29 1.02 Intr + 13388 13518 131 0 2 110 93 100 0.678 12.19 1.03 Intr + 14446 14553 108 0 0 108 71 112 0.999 11.06 1.04 Intr + 19232 19296 65 1 2 79 98 46 0.906 1.30 1.05 Intr + 29583 29715 133 0 1 85 33 86 0.815 2.53 1.06 Intr + 35447 35570 124 1 1 79 67 50 0.935 1.24 1.07 Intr + 35868 36043 176 1 2 91 87 65 0.976 5.44 1.08 Intr + 36586 36747 162 0 0 70 3 118 0.591 0.75 1.09 Intr + 38692 38810 119 0 2 69 75 53 0.961 0.44 1.10 Term + 39462 39618 157 0 1 70 42 118 0.348 1.82 1.11 PlyA + 40460 40465 6 1.05 2.00 Prom + 44553 44592 40 -4.85 2.01 Init + 62321 63051 731 1 2 85 44 392 0.324 28.80 2.02 Intr + 73281 73417 137 0 2 85 50 85 0.395 3.69 2.03 Intr + 75277 75372 96 2 0 71 82 114 0.814 8.16 2.04 Intr + 76249 76298 50 2 2 36 110 29 0.438 -2.62 2.05 Intr + 79954 80097 144 0 0 76 54 62 0.629 1.16 2.06 Intr + 81121 81271 151 0 1 65 75 186 0.999 13.81 2.07 Intr + 82661 82756 96 0 0 57 49 146 0.989 6.66 2.08 Intr + 83493 83606 114 1 0 98 82 214 0.994 21.30 2.09 Intr + 85039 85103 65 2 2 84 81 98 0.834 6.22 2.10 Intr + 87258 87415 158 2 2 17 69 172 0.904 6.09 2.11 Term + 88326 88446 121 0 1 50 42 119 0.861 0.47 2.12 PlyA + 88618 88623 6 1.05 3.11 PlyA - 90382 90377 6 1.05 3.10 Term - 103070 101885 1186 1 1 57 40 721 0.420 54.43 3.09 Intr - 103236 103176 61 1 1 42 87 20 0.247 -5.83 3.08 Intr - 114907 113255 1653 2 0 75 88 1028 0.823 87.86 3.07 Intr - 117430 117302 129 2 0 103 99 52 0.994 7.55 3.06 Intr - 118146 118050 97 0 1 91 91 102 0.540 9.46 3.05 Intr - 123269 123046 224 0 2 12 38 173 0.383 1.62 3.04 Intr - 126375 126263 113 2 2 77 65 95 0.636 5.20 3.03 Intr - 129235 129154 82 2 1 42 94 79 0.731 1.68 3.02 Intr - 130257 130162 96 1 0 115 100 41 0.519 7.06 3.01 Init - 137181 137100 82 0 1 84 79 -6 0.138 -0.72 3.00 Prom - 137504 137465 40 -7.95 4.00 Prom + 139748 139787 40 -3.15 4.01 Init + 140341 140568 228 0 0 83 90 125 0.354 10.82 4.02 Intr + 142085 142152 68 1 2 91 70 33 0.878 -1.42 4.03 Intr + 142184 142712 529 2 1 43 55 264 0.170 10.61 4.04 Intr + 145583 145637 55 1 1 91 60 182 0.018 13.33 4.05 Intr + 148646 149002 357 0 0 54 32 206 0.001 5.90 4.06 Intr + 163307 163438 132 0 0 93 73 88 0.200 7.50 4.07 Intr + 163737 163878 142 1 1 49 72 115 0.199 4.49 4.08 Intr + 164551 164705 155 1 2 79 77 76 0.142 4.39 4.09 Intr + 165009 165154 146 1 2 51 67 81 0.175 1.38 4.10 Intr + 167220 167303 84 2 0 98 72 41 0.114 2.60 4.11 Intr + 168832 168944 113 0 2 98 34 159 0.995 9.76 4.12 Intr + 170234 170379 146 2 2 29 100 77 0.890 2.01 4.13 Intr + 185009 185174 166 0 1 97 74 106 0.934 8.20 4.14 Intr + 185844 186082 239 0 2 105 70 181 0.999 14.34 4.15 Intr + 188020 188188 169 2 1 67 98 74 0.945 4.38 4.16 Intr + 191757 192128 372 0 0 110 115 81 0.860 6.25 4.17 Intr + 194870 195095 226 2 1 13 78 165 0.944 5.06 4.18 Intr + 197271 197396 126 2 0 28 97 119 0.947 6.76 4.19 Term + 197725 197811 87 0 0 77 54 14 0.413 -6.42 4.20 PlyA + 198025 198030 6 1.05 5.00 Prom + 199914 199953 40 -6.95 5.01 Sngl + 202582 203598 1017 0 0 88 43 771 0.996 69.37 5.02 PlyA + 203826 203831 6 1.05 6.00 Prom + 203995 204034 40 -6.15 6.01 Init + 205440 207005 1566 2 0 34 -13 425 0.525 19.50 6.02 Intr + 208227 208398 172 2 1 51 67 207 0.791 13.49 6.03 Intr + 209466 209665 200 1 2 100 101 67 0.977 7.35 6.04 Intr + 217668 217793 126 2 0 71 83 80 0.001 5.76 6.05 Intr + 222051 222224 174 2 0 49 71 135 0.989 7.11 6.06 Intr + 223766 223935 170 1 2 90 97 54 0.999 4.32 6.07 Intr + 225283 225529 247 1 1 89 80 147 0.976 10.54 6.08 Intr + 228656 228771 116 1 2 100 75 55 0.989 3.73 6.09 Intr + 230033 230159 127 2 1 53 95 61 0.928 3.06 6.10 Intr + 230658 230857 200 2 2 78 94 41 0.893 0.93 6.11 Intr + 233675 233849 175 2 1 64 91 71 0.885 4.02 6.12 Intr + 234369 234533 165 2 0 63 75 66 0.771 2.04 6.13 Intr + 235983 236115 133 2 1 76 10 87 0.348 -1.00 6.14 Intr + 240770 240941 172 0 1 31 97 147 0.556 7.98 6.15 Intr + 242704 242845 142 1 1 47 15 64 0.578 -5.57 6.16 Intr + 243909 244085 177 2 0 83 63 98 0.748 5.99 6.17 Intr + 245731 245908 178 0 1 77 66 88 0.695 4.07 6.18 Intr + 248953 249040 88 2 1 32 80 38 0.261 -4.49 6.19 Intr + 251216 251371 156 2 0 79 90 126 0.519 10.10 6.20 Intr + 257067 257169 103 0 1 72 88 61 0.575 3.76 6.21 Intr + 258429 258577 149 2 2 50 110 26 0.331 -0.89 6.22 Intr + 262357 262476 120 1 0 66 95 91 0.434 6.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 144443 144597 155 2 2 122 -9 100 0.809 2.49 S.002 Term + 145583 145770 188 0 2 91 42 212 0.810 13.57 S.003 Init + 168681 168752 72 2 0 75 62 51 0.838 2.32 S.004 Sngl - 217543 217241 303 1 0 78 54 262 0.991 17.38 S.005 Intr + 220547 220664 118 2 1 70 46 113 0.990 4.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_1|413_aa MAYGKKFFSSRNLAKGDRKGIQELHLMFSLMDKVPNGIEPMLKDLEEHIISAGLADMVAA AETITTDSEKYVEQLLTLFNRFSKLVKEAFQDDPRFLTARDKAYKAVVNDATIFKLELPL KQKGVGLKTQPESKCPELLANYCDMLLRKTPLSKKLTSEEIEAKLKEVEVGMPADYVNKL ARMFQDIKVSEDLNQAFKEMHKNNKLALPADSVNIKILNAGAWSRSSEKVFVSLPTELED LIPEVEEFYKKNHSGRKLHWHHLMSNGIITFKNEVGQYDLEVTTFQLAVLFAWNQRPREK ISFENLKLATELPDAELRRTLWSLVAFPKLKRQVLLYEPQVNSPKDFTEGTLFSVNQEFS LIKNAKVQKRGKINLIGRLQLTTERMREEENEGIVQLRILRTQVCNVDRMSEV >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_1|1242_bp atggcatatggcaaaaagtttttttcttcacggaatctagcaaaaggagacagaaaggga atacaagaattacatttaatgttttcattgatggacaaagttcctaatggtatagagcca atgttgaaagacttggaggaacatatcattagtgctggcctggcagatatggtagcagct gctgaaactattactactgactctgagaaatacgttgagcagttacttacactatttaat agatttagtaaactcgtcaaagaagcttttcaagatgatccacgatttcttactgcaaga gataaggcgtataaagcagttgttaatgatgctaccatatttaaacttgaattacctttg aagcagaagggggtgggattaaaaactcagcctgaatcaaaatgccctgagctgcttgcc aattactgtgacatgttgctaagaaaaacaccattaagcaaaaaactaacctctgaagag attgaagcaaagcttaaagaagtggaagttggtatgccagcggattatgtaaacaagctt gctagaatgtttcaggacataaaagtatctgaagatttgaaccaagcttttaaggaaatg cacaaaaataataaattggcattaccagctgattcagttaatataaaaattctgaatgct ggcgcctggtcaagaagttctgagaaagtctttgtctcacttcctactgaactggaggac ttgataccggaagtagaagaattctacaaaaaaaatcatagtggtagaaaattacattgg catcatctcatgtcaaatggaattataacatttaagaatgaagttggtcaatatgatttg gaggtaaccacgtttcagctcgctgtattgtttgcatggaaccaaagacccagagagaaa atcagctttgaaaatcttaagcttgcaactgaactccctgatgctgaacttaggaggact ttatggtctttagtagctttcccaaaactcaaacggcaagttttgttgtatgaacctcaa gtcaactcacccaaagactttacagaaggtaccctcttctcagtgaaccaggagttcagt ttaataaaaaatgcaaaggttcagaaaaggggtaaaatcaacttgattggacgtttgcag ctcactacagaaaggatgagagaagaagagaatgaaggaatagttcaactacgaatacta agaacccaggtttgtaatgttgacagaatgtctgaagtttaa >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_2|620_aa MLSKSKNLGLHINHRPRQVRLGGHGSQQRVRCSLTPAPVSPSTSFTRQPFLAQAAVSQRQ APPRPWLPANRRRLRGDYWRKRDGRCPRRAARGAGLGRRPLVYACGADTQPSATMAVLAA LLRSGARSRSPLLRRLVQVSGVRPHSTQTRDARSPRLGSFQPAAVAAPAARARGLGPRTV TRRVLVQKPPKCSDNTQGLAYFYSAFLLPRSRRNHLNERYWCVKQGHVSGRAGPPGLALA RPERDSHCIVCGTSRDCIEDPRREGVDQLGIRQGGPLQHLLGPSEESLFWIPKEEVKEAY MGNVLQGGEGQAPTRQAVLGAGMKAIMMASQSLMCGHQDVMVAGGMESMSNVPYVMNRGS TPYGGVKLEDLIVKDGLTDVYNKIHMGSCAENTAKKLNIARNEQDAYAINSYTRSKAAWE AGKFGNEVIPVTVTVKGQPDVVVKEDEEYKRVDFSKVPKLKTVFQKENGTVTAANASTLN DGAAALVLMTADAAKRLNVTPLARIVAFADAAVEPIDFPIAPVYAASMVLKDVGLKKEDI AMWEVNEAFSLVVLANIKMLEIDPQKVNINGGAVSLGHPIGMSGARIVGHLTHALKQGEY GLASICNGGGGASAMLIQKL >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_2|1863_bp atgctcagtaaatcgaaaaatctcggattacacataaaccaccggccccggcaggttaga cttggtggacacgggagccagcagcgtgtcaggtgctcgttgaccccagcgccagtgtct ccatccacgtccttcactcgtcaacctttcctggcccaggctgcggtgtcccagcgccag gctccgccccgcccttggctgccggccaatcgccgccgactgagaggcgactattggagg aagcgggatgggcggtgcccgcgccgggccgctaggggtgcggggttggggaggaggccg ctagtctacgcctgtggagccgatactcagccctctgcgaccatggctgtgctggcggca cttctgcgcagcggcgcccgcagccgcagccccctgctccggaggctggtgcaggtgagc ggggttcgtccccacagcactcagacccgggatgcgaggagtccccgcctcggaagcttc cagcccgcggccgttgcggctcccgcggcccgggcgcgcggcttaggccccaggacagtc acgcgacgggttctggtccaaaaaccgcctaagtgctccgataacacccagggactcgcc tatttttacagcgcgtttctacttccaaggtctcggcgtaatcatcttaatgagcgttac tggtgtgtcaaacaagggcacgtgtctgggcgggcaggaccgccaggattggcgctggcc cggcctgagcgggactcgcattgtattgtgtgtggtacaagcagagactgtatagaagat cctagaagagagggagtggaccagttgggaattagacaaggagggccattacagcatctc ctaggaccttctgaagagtctctgttttggattccaaaagaagaagtgaaagaagcatac atgggtaatgttctacaaggaggtgaaggacaagctcctacaaggcaggcagtattgggt gcaggaatgaaagccatcatgatggcctctcaaagtcttatgtgtggacatcaggatgtg atggtggcaggtgggatggagagcatgtccaatgttccatatgtaatgaacagaggatca acaccatatggtggggtaaagcttgaagatttgattgtaaaagacgggctaactgatgtc tacaataaaattcatatgggcagctgtgctgagaatacagcaaagaagctgaatattgca cgaaatgaacaggacgcttatgctattaattcttataccagaagtaaagcagcatgggaa gctgggaaatttggaaatgaagttattcctgtcacagttacagtaaaaggtcaaccagat gtagtggtgaaagaagatgaagaatataaacgtgttgattttagcaaagttccaaagctg aagacagttttccagaaagaaaatggcacagtaacagctgccaatgccagtacactgaat gatggagcagctgctctggttctcatgacggcagatgcagcgaagaggctcaatgttaca ccactggcaagaatagtagcatttgctgacgctgctgtagaacctattgattttccaatt gctcctgtatatgctgcatctatggttcttaaagatgtgggattgaaaaaagaagatatt gcaatgtgggaagtaaatgaagcctttagtctggttgtactagcaaacattaaaatgttg gagattgatccccaaaaagtgaatatcaatggaggagctgtttctctgggacatccaatt gggatgtctggagccaggattgttggtcatttgactcatgccttgaagcaaggagaatac ggtcttgccagtatttgcaatggaggaggaggtgcttctgccatgctaattcagaagctg tag >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_3|1240_aa MDQNVKHRTVRLLNENIRKKLCKVGFDSTQVTRPSGQISDPSRSYFVVVNHSQSQDTVTT GEALNVIPGAQEKKAHASLMSPGRRKSDNNIAQVPKQTDNNPTEPETSIDEFLGLPVWVV GVCMGRCQGDKGKKHTGLVVLPFKNLAQPQPGDPKQQWQGLGQQDTALSMQCWNECLQSL IADGNLVKGPAYPGQTETQSEIHMSEEAIQDILEQTESDPAFQALFDLFDYGKTKNNKNI SQSISSQPMESNPSIVLADETNLAVKGSFETEESDGQSGQPAFCTSYQNDDPLNALKNSN NHDVLRQEDQENFSQISTSIQKKAFKTAVPTEQKCDIDITFESVPNLNDFNQRGNSNAEC NPHCAELYTNQMSTETEMAIGIEKNSLSSNVPSESQLQPDQPDIPITSFVSLGCEANNEN LILSGKSSQLLSQDTSLTGKPSKKSQFCENSNDTVKLKINFHGSKSSDSSEVHKSKIEIN VLEPVMSQLSNCQDNSCLQSEILPVSVESSHLNVSGQVEIHLGDSLSSTKQPSNDSASVE LNHTENEAQASKSENSQEPSSSVKEENTIFLSLGGNANCEKVALTPPEGTPVENSHSLPP ESVCSSVGDSHPESQNTDDKPSSNNSAEIDASNIVSLKVIISDDPFVSSDTELTSAVSSI NGENLPTIILSSPTKSPTKNAELVKCLSSEETVGAVVYAEVGDSASMEQSLLTFKSEDSA VNNTQNEDGIAFSANVTPCVSKDGGYIQLMPATSTAFGNSNNILIATCVTDPTALGTSVS QSNVVVLPGNSAPMTAQPLPPQLQTPPRSNSVFAVNQAVSPNFSQGKQVNNLVDSSGHSV GCHAQKTEVSDKSIATDLGKKSEETTVPFPEESIVPAAKPCHRRVLCFDSTTAPVANTQG PNHKMVSQNKERNAVSFPNLDSPNVSSTLKPPSNNAIKREKEKPPLPKILSKSESAISRH TTIRETQSEKKVSPTEIVLESFHKATANKENELCSDVERQKNPENSKLSIGQQNGGLRSE KSIASLQEMTKKQGTSSNNKNVLSVGTAVKDLKQEQTKSASSLITTEMLQDIQRHSSVSR LADSSDLPVPRTPGSGAGEKHKEEPIDIIKAPSSRRFSEDSSTSKVMVPPVTPDLPACSP ASETGSENSVNMAAHTLMILSRAAISRTTSATPLKDNTQQFRASSRSTTKKRKIEELDER ERNSRPSSKNLTNSSIPMKKKKIKASLKVLISKPHLFFPL >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_3|3723_bp atggatcagaatgtaaaacacaggaccgtaagacttctaaatgaaaacataagaaaaaaa ttgtgcaaagttggatttgacagtacacaggttactcgaccaagtggccaaatttcagat ccatcgaggtcatattttgtagtggtcaaccactcacagtcacaagatactgtaaccact ggagaagctttaaatgtcattcctggtgctcaggaaaagaaagcacatgccagtttaatg tctcccggtagacgcaaaagtgataacaatattgcccaagtacctaagcaaacagataac aaccctacggagccagagacttcaattgatgaattcctaggacttccggtatgggtggtt ggggtgtgcatgggaagatgccaaggtgacaaggggaaaaagcacactggcctggtagtg ctgccctttaagaatctggctcagcctcagcctggtgaccctaaacagcaatggcagggt ctgggtcagcaagacacagctctctctatgcaatgctggaatgagtgtctccagtctctt attgctgatgggaaccttgtaaaaggacctgcatatcctggacaaactgagacacagagt gaaattcacatgtctgaagaagctatacaggacatattggaacagacagaatcagaccca gcatttcaggcactctttgatctctttgactatggcaaaacaaagaataataaaaatata tcacaaagtatttccagtcaacctatggaatccaatcccagtatagtcttagcagatgaa actaatctagcagttaaaggttcttttgaaacagaagaatctgatggtcagtctggtcag cccgctttttgtacatcctatcagaatgatgacccattaaatgctttgaagaatagcaac aaccatgatgtgcttagacaagaagaccaggaaaatttttcccaaataagtaccagcata cagaaaaaggcctttaaaacagctgtacccactgaacagaagtgtgacattgacattacc tttgagtccgtgcctaatttgaatgactttaaccaaagagggaattctaatgctgaatgt aatccacattgtgctgaattatacaccaatcagatgtccactgaaactgaaatggctata gggattgaaaagaactctttgtcttcaaatgtaccgagtgaatctcagttacagcctgat cagcctgatataccaataacttcatttgtttcacttggttgtgaagctaacaatgaaaac ttaattctctctgggaagagttctcaacttttatcccaagatacttcattaactggaaag ccatctaaaaaaagtcaattttgtgaaaattctaatgatacagtaaaacttaaaattaat tttcatggttccaagtcatcagattctagtgaagttcacaagagtaaaatagaaattaat gtgttagaaccagttatgtcacagctatcaaattgccaagataattcttgtcttcaaagt gaaatactacctgtgtctgttgaaagttcacatttaaatgtatctggacaagtagaaatt catcttggagattcgctgtcttctactaaacaaccatctaatgattcagcatctgttgag ttaaatcatacagaaaatgaagctcaggcatccaagtctgagaattcacaggagccttca tcttctgtaaaagaagagaatactatttttctctctttaggtggaaatgctaactgtgag aaagttgcactgacgcctccagaaggcactcctgtagaaaacagtcactctcttcctcca gaatctgtgtgttcttcagtgggagattctcaccctgagtcccaaaatactgatgataaa ccttctagcaacaactcagcagagatagatgcatcaaatatcgtctctctcaaagttatc attagtgatgatccatttgtttcctcagatactgaacttaccagtgctgtttctagtatt aatggagaaaacctgccaactataatcttgtcttctcctactaaatcacctactaaaaat gcagaactagttaaatgcctatcttcagaagaaactgtaggtgctgttgtatatgccgaa gtaggggattcagcctcaatggaacagagtcttttaacattcaaatctgaagactctgca gtaaacaatactcagaatgaagatggcattgctttttcagctaatgttacaccatgtgtt tccaaggatggaggatatatacagttgatgccagccacaagcacagcttttggcaattca aataacattctgatagctacctgtgtgactgatccaacagcgttaggaacatctgtaagt cagtctaatgtagtggtgttgcctggaaattctgcacctatgactgctcaacctctacca cctcagttacagacaccaccaaggtcaaacagtgtatttgctgtcaaccaagctgtgtca ccaaacttttcacaaggaaaacaagtaaataatttggtggattcgtcaggtcattcagtt ggatgtcatgcacaaaaaactgaagtttctgacaaaagtattgccacagatcttgggaaa aaatcagaagaaaccacagttcccttcccagaagagagtatagttccagctgctaaacca tgccacagacgtgtactctgtttcgacagcactactgctcctgtggcaaatacgcagggg ccaaaccataagatggtgtcccaaaacaaagaaaggaatgcagtctcttttcctaatctt gactcacccaatgtgtcctccaccttaaaacccccttctaataatgctatcaaaagagag aaagagaagcctcctctgcctaagattttatctaaatcggaaagtgccattagccggcat accaccataagagaaactcaatcagaaaagaaagtttcaccaacagaaattgtgcttgaa tctttccataaagcaacagctaataaggagaatgaattatgcagcgatgtagaaagacag aaaaatccagaaaattcaaaactatctattgggcagcaaaatgggggtttgcgaagtgag aaatctatagcttcactgcaagaaatgaccaaaaaacaaggcacatcttcaaacaataaa aatgtactttcagtaggtacagctgtgaaggatctaaaacaagaacaaactaaatccgcc agttctttgattaccacagaaatgttacaggatatacagaggcacagctcagtaagtagg cttgctgatagtagtgatttacctgtgccccggacacctggctcaggggcaggggaaaaa cataaagaagaacctatagatattatcaaggccccctctagtaggcgtttcagtgaagac agtagtacatcaaaagtaatggtccctcctgtcaccccagacttgcctgcctgcagccct gccagtgaaacaggaagtgaaaacagtgtaaatatggctgcccacacattaatgattctc tccagggcagccatttctaggactacttcagcaactcctctgaaagataacacacaacag tttagagcatcttcaaggagcaccacaaaaaagcggaaaattgaggaattagatgaacgt gagcgaaactctcgtccttctagtaaaaatcttacaaattcatcaataccaatgaaaaag aagaaaattaaggcaagtttgaaagttttaatttcaaaaccacaccttttttttccgctt tag >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_4|1179_aa MAQLDQLRGLMQSCSGHGIRARSSLLGQVGREGPAMSLEASEAQAGAPQATEMLDSIAAL NGSCVISGGLSGICARVKVLQYVDDILLCVPIEEVSGGQGYKVSKSKAQLCQTSVKYLGL VLSEGTRELGEERIKSISSFPLPKTLKQLRGFLGITGLWIPGYGEIVCPLYHIIKETQAT KTHSLIWEPEAKRTFDQLKQALLEAPALSLPIGKTFNLYVSERKGMAQGVPTQAQGPAQK PVGYLSKELNLVAKGWPACLRAVTAVTLSVPEATKMRDKTLEPAEPAEPAEPAVCTHPAR SRQQQAQPQLYFEIGVGTGSRERPGSGSRRFQACRERGASLAPESAGMPAPQLQLHLGGQ GSCPSNLEGDRAPICSQLLLPCGAHSPGHTSRAAAGVLAPGAGLIQRHKQGESVPPRASP RASPRHAASEERSIYIQRGLNCPEPPNDEESPPVSTPNRFRQRKKGAEMKPASVRLRNCR HFRPQTWRGGDEEGGEDDEGEEGGTFSQRAFIDISTSASRLRAWALESYGWNWNSVSRVA ALYCLVKISEIILTCAAEVIGMDETKKTKLFLRYERKKRYQETFETVITEVNMIIANICC VSGTVLWVDVRVCYMGKLHVAEAWCTNDPVTQKEVEKFKRLIRDPETIKHLDRHSDSKQG KYLNWDAVFRFLQKYIQKETECLRIAKPNVSASTQASRQKKMQEISSLVKYFIKCANRKL FSVYFRLYLKPSQDVHRVLVARIIHAVTKGCCSQTDGLNSKFLDFFSKAIQCARQEKSSS GLNHILAALTIFLKTLAVNFRIRVCELGDEILPTLLYIWTQHRLNDSLKEVIIELFQLQI YIHHPKGAKTQEKGAYESTKWRSILYNLYDLLVNEISHIGSRGKYSSGFRNIAVKENLIE LMADICHQVQLQIATQLISKYPASLPNCELSPLLMILSQLLPQQRHGERTPYVLRCLTEV ALCQDKRSNLESSQKSDLLKLWNKIWCITFRGISSEQIQAENFGLLGAIIQGSLVEVDRE FWKLFTGSACRPSCEHHQKDKEELSFSEVEELFLQTTFDKMDFLTIVRECGIEKHQSSIG FSVHQNLKESLDRCLLGLSEQLLNNYSSEITNSETLVRCSRLLVGVLGCYCYMGVIAEEE AYKSELFQKAKPPTPCQALVYNVPLPVSMCSHCSAATYE >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_4|3540_bp atggcacaactggaccagctgcggggcttaatgcagagctgcagtgggcatggaatccgg gccagaagcagtctgctgggtcaagtgggcagagagggcccagcaatgagcctagaggcg agcgaggcccaggcaggcgcgccacaggccacagagatgctggacagcatagcggcactg aatggatcctgtgtcatttctggaggcttgtctgggatctgtgcaagggtgaaagtttta caatatgtagatgacattctcctttgcgttccaattgaggaagtctcaggaggacaagga tataaggtctcaaaatctaaggctcagctctgtcagacttcagtgaagtacctaggtcta gtcttgtcagaggggaccagggaactaggtgaagaaagaattaagtccatctcctctttt cctctccccaaaaccctcaagcaactgagaggattcttggggattacaggactatggata cctgggtatggtgaaatagtttgtcccttatatcacataataaaggagactcaggcaact aagactcactccctaatttgggaaccagaggctaaaaggacctttgaccaactgaaacaa gccttgcttgaggcaccagcccttagtcttcccatagggaagacattcaatctttatgta tcagaaaggaaaggaatggcccagggagttccaactcaggcccaaggtccagcccagaaa cctgtaggttacctaagcaaggagctgaatttggtagctaaaggatggccagcctgcctc cgggcagtcacagcagtaaccttgtcagtaccagaggccactaagatgcgggacaaaacc ctggaacctgccgaacctgccgaacctgcagaacctgccgtgtgcacacacccagccagg tcccgacagcagcaggctcagcctcaactttactttgaaatcggagtgggcaccgggagc agggagaggccaggcagcgggagtaggcgcttccaagcctgcagggaaaggggggcttcc ctggcccccgagagtgcagggatgcctgctccacagctgcagctgcacctgggagggcag ggctcctgcccatccaacttggaaggggacagggcacccatctgttcccagctcctgctg ccctgtggagcacacagccctggccacacctcccgtgctgcagccggcgtcttggcacca ggtgctggactgattcaaagacacaaacagggtgaaagtgttccaccccgagcttcccct cgggcttcccctcgccacgcagcctctgaagagaggagcatctacatacaaagaggctta aactgcccagaacctccgaatgacgaagaatcaccgccagtctcaactcccaatcgcttc cgccagagaaagaaaggcgccgaaatgaaacccgcctccgttcgccttcggaactgtcgt cacttccgtcctcagacttggaggggcggggatgaggagggcggggaggacgacgagggc gaagagggtggtacttttagtcagcgagcatttattgatatttcaacttcagcctcgcgg ttaagagcttgggctctggaatcatacggctggaattggaattctgtcagtcgtgtggcc gctctctactgtcttgtgaagataagtgagataatcttgacctgtgctgctgaagtcata ggaatggatgagaccaagaaaacaaagctgtttttgaggtatgagcggaagaagagatat caggagactttcgaaacagtcataacggaagttaatatgatcattgctaacatttgctgt gtttcaggcactgttctgtgggtagatgtgcgggtttgttacatgggtaaattgcatgtt gctgaggcttggtgtacgaatgatcctgtcactcagaaagaagttgagaaatttaagcgc ctgattcgagatcctgaaacaattaaacatctagatcggcattcagattccaaacaagga aaatatttgaattgggatgctgtttttagatttttacagaaatatattcagaaagaaaca gaatgtctgagaatagcaaaaccaaatgtatcagcctcaacacaagcctccaggcagaaa aagatgcaggaaatcagtagtttggtcaaatacttcatcaaatgtgcaaacagaaaattg ttctctgtgtacttcaggctctatctgaaaccttcacaagatgttcatagagttttagtg gctagaataattcatgctgttaccaaaggatgctgttctcagactgacggattaaattcc aaatttttggactttttttccaaggctattcagtgtgcgagacaagaaaagagctcttca ggtctaaatcatatcttagcagctcttactatcttcctcaagactttggctgtcaacttt cgaattcgagtgtgtgaattaggagatgaaattcttcccactttgctttatatttggact caacataggcttaatgattctttaaaagaagtcattattgaattatttcaactgcaaatt tatatccatcatccgaaaggagccaaaacccaagaaaaaggtgcttatgaatcaacaaaa tggagaagtattttatacaacttatatgatctgctagtgaatgagataagtcatatagga agtagaggaaagtattcttcaggatttcgtaatattgccgtcaaagaaaatttgattgaa ttgatggcagatatctgtcaccaggtacagctacagattgcaacccaattaatatcaaag tatcctgcaagtttacctaactgtgagctgtctccattactgatgatactatctcagctt ctaccccaacagcgacatggggaacgtacaccatatgtgttacgatgccttacggaagtt gcattgtgtcaagacaagaggtcaaacctagaaagctcacaaaagtcagatttattaaaa ctctggaataaaatttggtgtattacctttcgtggtataagttctgagcaaatacaagct gaaaactttggcttacttggagccataattcagggtagtttagttgaggttgacagagaa ttctggaagttatttactgggtcagcctgcagaccttcatgtgaacaccaccaaaaagat aaagaagaactttcattctcagaagtagaagaactatttcttcagacaacttttgacaag atggactttttaaccattgtgagagaatgtggtatagaaaagcaccagtccagtattggc ttctctgtccaccagaatctcaaggaatcactggatcgctgtcttctgggattatcagaa cagcttctgaataattactcatctgagattacaaattcagaaactcttgtccggtgttca cgtcttttggtgggtgtccttggctgctactgttacatgggtgtaatagctgaagaggaa gcatataagtcagaattattccagaaagccaagcccccgaccccctgtcaggccctggtg tataatgttcccctccctgtgtccatgtgttctcattgttcagctgccacttatgagtga >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_5|338_aa MGKKQNRKTGNSKKQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKEPMELKTNARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYGKRPNLRLIGAPESNGENGTKLENTLQD IIQENFRNLARQANIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKENMLRAAREKGRV TLRGKHIRLTADLSAETLQARREWGPIFNILKEKNFQPRISCPSKLSFISEGEIKYFTDK QMLRDFVTTRPALQELLKEALNMERNNRYQPLQNHAKM >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_5|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcgcctctcctcca ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaagggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagccgatggagctgaaaaccaacgctagagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagcg atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgggaaaagaccaaatctacgtctg attggtgcacctgaaagtaacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccgcaatctagcaaggcaggccaacattcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaacatgttaagggcagccagagagaaaggtcgggtt accctgagagggaagcacatcagactaacagcggatctctccgcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatgtccatccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccttacaagagctcctgaaggaagca ttaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815587r:108058945_108322536|GENSCAN_predicted_peptide_6|1619_aa MIISIDAEKALDKIQQPFMPKTLNKLGIDGTYLKIIRAIYDSPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPTVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNGIPIKLPM TFFTELEKTTLKFIWNQKRARIGKSILSQKNKAGGITLPDFKLYYKAAVTKTAWYWYQNR DIDQWNRTEPSEITLHIYNYLLFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PDRKINSRWIKDLNIRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKDRIDKWDLI KLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAK DMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPASFIKKPFDRGEVESMED DTNGNLMEVEDQSSMNLFNDYPDSSVSDANEPGESQSTIGAINPLAEEYLSKQDLLFLDM LKFLCLCVTTAQTNTVSFRAADIRRKLLMLIDSSTLEPTKSLHLHMSGPSAADLLEFAGG PLQTLFAWVSSMEAAEQQRLLPVPSSGKSHSAENPETLDEIYNRKSVLLTLIAVVLSCSP ICEKQALFALCKSVKENGLEPHLVKKVLEKVSETFGYRRLEDFMASHLDYLVLEWLNLQD TEYNLSSFPFILLNYTNIEDFYRSCYKVLIPHLVIRSHFDEVKSIANQIQEDWKSLLTDC FPKILVNILPYFAYEGTRDSGMAQQRETATKVYDMLKSENLLGKQIDHLFISNLPEIVVE LLMTLHEPANSSASQSTDLCDFSGDLDPAPNPPHFPSHVIKATFAYISNCHKTKLKSILE ILSKSPDSYQKILLAICEQAAETNNVYKKHRILKIYHLFVSLLLKDIKSGLGGAWAFVLR DVIYTLIHYINQRPSCIMDVSLRSFSLCCDLLSQVCQTAVTYCKDALENHLHVIVGTLIP LVYEQVEVQKQVLDLLKYLVIDNKDNENLYITIKLLDPFPDHVVFKDLRITQQKIKYSRG PFSLLEEINHFLSVSVYDALPLTRLEGLKDLRRQLELHKDQMVDIMRASQEAVGSCLGEV GPIDFSTIAIQHSKDASYTKALKLFEDKELQWTFIMLTYLNNTLVEDCVKVRSAAVTCLK NILATKTGHSFWEIYKMTTDPMLAYLQPFRTSRKKFLEVPRFDKENPFEGLDDINLWIPL SENHDIWIKTLTCAFLDSGGTKCEILQLLKPMCEVKTDFCQTVLPYLIHDILLQDTNESW RNLLSTHVQGFFTSCLRHFSQTSRSTTPANLDSESEHFFRCCLDKKSQRTMLAVVDYMRR QKRPSSGTIFNDAFWLDLNYLEVAKVAQSCAAHFTALLYAEIYADKKSMDDQEKRLRTYE HEAMWGKALVTYDLETAIPSSTRQAGIIQALQNLGLCHILSVYLKGLDYENKDWCPELEE LHYQAAWRNMQWDHCTSVRVKEVEEMCKRSLESVYSLYPTLSRLQAIGELESIGELFSS >gi568815587r:108058945_108322536|GENSCAN_predicted_CDS_6|4857_bp atgattatctcaatagatgcagaaaaggccttggacaaaattcaacaacccttcatgcca aaaactctgaataaattaggtattgatgggacgtatctcaaaataataagagctatctat gacagtcccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccactgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaa atggccatactgcccaaggtaatttacagattcaatggcatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcggcaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctgcagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgctgcatatctacaactat ctgctctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca cctgatagaaaaatcaattcaagatggattaaagacttaaacattagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagacagaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc acaccagcatccttcatcaaaaagccatttgaccgtggagaagtagaatcaatggaagat gatactaatggaaatctaatggaggtggaggatcagtcatccatgaatctatttaacgat taccctgatagtagtgttagtgatgcaaacgaacctggagagagccaaagtaccataggt gccattaatcctttagctgaagaatatctgtcaaagcaagatctacttttcttagacatg ctcaagttcttgtgtttgtgtgtaactactgctcagaccaatactgtgtcctttagggca gctgatattcggaggaaattgttaatgttaattgattctagcacgctagaacctaccaaa tccctccacctgcatatgtcaggcccctctgctgcagatctgctggagtttgctggaggt ccactccagaccctgtttgcctgggtatcatccatggaggctgcagaacagcaaagattg ctgcctgttccttcctctgggaagtcccatagtgctgagaaccctgaaactttggatgaa atttataatagaaaatctgttttactgacgttgatagctgtggttttatcctgtagccct atctgcgaaaaacaggctttgtttgccctgtgtaaatctgtgaaagagaatggattagaa cctcaccttgtgaaaaaggttttagagaaagtttctgaaacttttggatatagacgttta gaagactttatggcatctcatttagattatctggttttggaatggctaaatcttcaagat actgaatacaacttatcttcttttccttttattttattaaactacacaaatattgaggat ttctatagatcttgttataaggttttgattccacatctggtgattagaagtcattttgat gaggtgaagtccattgctaatcagattcaagaggactggaaaagtcttctaacagactgc tttccaaagattcttgtaaatattcttccttattttgcctatgagggtaccagagacagt gggatggcacagcaaagagagactgctaccaaggtctatgatatgcttaaaagtgaaaac ttattgggaaaacagattgatcacttattcattagtaatttaccagagattgtggtggag ttattgatgacgttacatgagccagcaaattctagtgccagtcagagcactgacctctgt gacttttcaggggatttggatcctgctcctaatccacctcattttccatcgcatgtgatt aaagcaacatttgcctatatcagcaattgtcataaaaccaagttaaaaagcattttagaa attctttccaaaagccctgattcctatcagaaaattcttcttgccatatgtgagcaagca gctgaaacaaataatgtttataagaagcacagaattcttaaaatatatcacctgtttgtt agtttattactgaaagatataaaaagtggcttaggaggagcttgggcctttgttcttcga gacgttatttatactttgattcactatatcaaccaaaggccttcttgtatcatggatgtg tcattacgtagcttctccctttgttgtgacttattaagtcaggtttgccagacagccgtg acttactgtaaggatgctctagaaaaccatcttcatgttattgttggtacacttataccc cttgtgtatgagcaggtggaggttcagaaacaggtattggacttgttgaaatacttagtg atagataacaaggataatgaaaacctctatatcacgattaagcttttagatccttttcct gaccatgttgtttttaaggatttgcgtattactcagcaaaaaatcaaatacagtagagga cccttttcactcttggaggaaattaaccattttctctcagtaagtgtttatgatgcactt ccattgacaagacttgaaggactaaaggatcttcgaagacaactggaactacataaagat cagatggtggacattatgagagcttctcaggaggctgttggaagctgcttgggagaagtg ggtcctatagatttctctaccatagctatacaacatagtaaagatgcatcttataccaag gcccttaagttatttgaagataaagaacttcagtggaccttcataatgctgacctacctg aataacacactggtagaagattgtgtcaaagttcgatcagcagctgttacctgtttgaaa aacattttagccacaaagactggacatagtttctgggagatttataagatgacaacagat ccaatgctggcctatctacagccttttagaacatcaagaaaaaagtttttagaagtaccc agatttgacaaagaaaacccttttgaaggcctggatgatataaatctgtggattcctcta agtgaaaatcatgacatttggataaagacactgacttgtgcttttttggacagtggaggc acaaaatgtgaaattcttcaattattaaagccaatgtgtgaagtgaaaactgacttttgt cagactgtacttccatacttgattcatgatattttactccaagatacaaatgaatcatgg agaaatctgctttctacacatgttcagggatttttcaccagctgtcttcgacacttctcg caaacgagccgatccacaacccctgcaaacttggattcagagtcagagcactttttccga tgctgtttggataaaaaatcacaaagaacaatgcttgctgttgtggactacatgagaaga caaaagagaccttcttcaggaacaatttttaatgatgctttctggctggatttaaattat ctagaagttgccaaggtagctcagtcttgtgctgctcactttacagctttactctatgca gaaatctatgcagataagaaaagtatggatgatcaagagaaaagactacgaacatatgaa cacgaagcaatgtggggcaaagccctagtaacatatgacctcgaaacagcaatcccctca tcaacacgccaggcaggaatcattcaggccttgcagaatttgggactctgccatattctt tccgtctatttaaaaggattggattatgaaaataaagactggtgtcctgaactagaagaa cttcattaccaagcagcatggaggaatatgcagtgggaccattgcacttccgtcagagta aaagaagtggaagagatgtgtaagcgcagccttgagtctgtgtattcgctctatcccaca cttagcaggttgcaggccattggagagctggaaagcattggggagcttttctcaagn