GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:02:09

Sequence gi568815596f:218859821_219060921 : 201101 bp : 51.15% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    177    297  121  2  1   84   89    54 0.243   5.67
 1.02 Intr +    537    628   92  1  2   47   51    55 0.099  -2.09
 1.03 Intr +   5890   6053  164  0  2  112   77    71 0.507   7.69
 1.04 Intr +  11207  11427  221  2  2  145   56   185 0.992  19.47
 1.05 Intr +  11665  11999  335  2  2  100   78   447 0.996  40.74
 1.06 Term +  13564  14025  462  0  0  110   46   781 0.996  71.54
 1.07 PlyA +  14392  14397    6                                 1.05

 2.00 Prom +  14638  14677   40                                -2.21
 2.01 Init +  21176  21288  113  1  2   99   99   171 0.992  16.96
 2.02 Intr +  22341  22603  263  0  2   97   55   313 0.913  26.57
 2.03 Intr +  30164  30543  380  0  2   77   94   490 0.820  43.75
 2.04 Intr +  32598  32720  123  2  0   84   75     5 0.304   0.09
 2.05 Term +  32954  33451  498  1  0   99   53   931 0.958  85.61
 2.06 PlyA +  34087  34092    6                                 1.05

 3.05 PlyA -  34471  34466    6                                 1.05
 3.04 Term -  34749  34608  142  2  1  113   43    38 0.433  -0.49
 3.03 Intr -  37443  37326  118  1  1  110   81    34 0.921   4.93
 3.02 Intr -  38980  38841  140  0  2   50   73    67 0.307   2.02
 3.01 Init -  41163  41096   68  0  2   89   80    31 0.376   2.89
 3.00 Prom -  41843  41804   40                                -4.41

 4.00 Prom +  45554  45593   40                                -1.81
 4.01 Init +  47047  47163  117  0  0   95  100    42 0.586   6.27
 4.02 Term +  49216  49302   87  0  0   27   55   155 0.502   4.06
 4.03 PlyA +  54439  54444    6                                 1.05

 5.04 PlyA -  54816  54811    6                                 1.05
 5.03 Term -  65491  65299  193  0  1   34   40   195 0.711   6.52
 5.02 Intr -  67434  67343   92  0  2   93   79    27 0.902   1.59
 5.01 Init -  69642  69568   75  0  0   67   64   106 0.762   5.14
 5.00 Prom -  81996  81957   40                                -0.11

 6.04 PlyA -  82168  82163    6                                 1.05
 6.03 Term -  84874  84663  212  2  2  116   42     4 0.033  -3.62
 6.02 Intr -  93897  93777  121  0  1  -63   35   233 0.771   3.47
 6.01 Init -  94133  93909  225  2  0   94   92   281 0.860  27.55
 6.00 Prom -  96613  96574   40                                -6.10

 7.00 Prom +  96757  96796   40                                -6.80
 7.01 Sngl + 100001 101104 1104  1  0   72   50  2252 0.999 215.21
 7.02 PlyA + 105757 105762    6                                 1.05

 8.05 PlyA - 105992 105987    6                                 1.05
 8.04 Term - 122436 121847  590  1  2  107   42  1050 0.911  97.40
 8.03 Intr - 122703 122631   73  0  1   32   91    94 0.884   3.57
 8.02 Intr - 124485 124376  110  1  2  126   73    80 0.533  10.80
 8.01 Init - 125680 125623   58  1  1   78   61   122 0.889   7.92
 8.00 Prom - 130068 130029   40                                -6.01

 9.47 PlyA - 130395 130390    6                                 1.05
 9.46 Term - 130579 130432  148  0  1   84   44   256 0.816  18.49
 9.45 Intr - 131174 131032  143  2  2   86   75   170 0.999  15.16
 9.44 Intr - 132423 132282  142  2  1   89   89   211 0.999  22.16
 9.43 Intr - 133360 133196  165  0  0   72   67   217 0.898  17.59
 9.42 Intr - 135955 135700  256  2  1   56   95    60 0.485   0.54
 9.41 Intr - 136401 136218  184  0  1   72    2   135 0.552   3.18
 9.40 Intr - 140303 140271   33  2  0   99   80    29 0.306   2.10
 9.39 Intr - 141210 141093  118  2  1  112   61    18 0.516   2.47
 9.38 Intr - 141432 141338   95  0  2   95   48    36 0.807  -0.44
 9.37 Intr - 141993 141733  261  0  0   16   43   211 0.528   7.62
 9.36 Intr - 143201 143159   43  1  1  114   63    87 0.968   7.53
 9.35 Intr - 143452 143315  138  0  0   93  110   279 0.999  30.69
 9.34 Intr - 144635 144132  504  1  0   91  105   658 0.997  60.48
 9.33 Intr - 145742 145614  129  1  0   63   81    86 0.728   5.71
 9.32 Intr - 146403 146201  203  0  2   69  119    69 0.970   6.71
 9.31 Intr - 146689 146645   45  1  0   84  100    15 0.712   1.39
 9.30 Intr - 149334 149227  108  0  0   79  100   241 0.996  25.38
 9.29 Intr - 149640 149527  114  0  0   44   97    46 0.727   2.25
 9.28 Intr - 150265 150122  144  1  0  104   78   141 0.952  15.69
 9.27 Intr - 150884 150726  159  2  0  102   75   109 0.971  11.70
 9.26 Intr - 151176 150985  192  0  0  101  101   159 0.920  18.71
 9.25 Intr - 153549 153439  111  0  0   48   82   106 0.961   7.08
 9.24 Intr - 153765 153699   67  2  1  104   81    63 0.965   6.60
 9.23 Intr - 154224 154048  177  2  0   67   35   102 0.531   2.35
 9.22 Intr - 155344 155319   26  1  2  107   89     1 0.209  -0.59
 9.21 Intr - 155549 155525   25  1  1  113   89    -1 0.579   1.01
 9.20 Intr - 159359 159231  129  1  0  121   60   127 0.984  13.51
 9.19 Intr - 159899 159686  214  0  1  100   99   261 0.999  26.70
 9.18 Intr - 160105 159944  162  2  0  -42   81   135 0.529   0.26
 9.17 Intr - 161460 161332  129  1  0   52   67   111 0.962   6.57
 9.16 Intr - 162110 161960  151  2  1   48    5   266 0.986  14.55
 9.15 Intr - 162509 162351  159  2  0   35   65   149 0.865   8.00
 9.14 Intr - 163611 163387  225  0  0   82   70   261 0.994  22.41
 9.13 Intr - 164440 164195  246  1  0  101   97   261 0.999  26.49
 9.12 Intr - 166339 166202  138  1  0   86  121   105 0.967  14.77
 9.11 Intr - 168189 167830  360  0  0  101   93   559 0.892  53.68
 9.10 Intr - 168581 168381  201  2  0   38   92   372 0.998  32.60
 9.09 Intr - 169848 169583  266  1  2   74  109   384 0.995  36.87
 9.08 Intr - 170388 170166  223  0  1   99   71   246 0.982  22.23
 9.07 Intr - 171014 170869  146  0  2   60   66   226 0.870  18.11
 9.06 Intr - 171485 171286  200  1  2   86   68   252 0.888  22.52
 9.05 Intr - 171838 171669  170  1  2   70   60   103 0.980   4.96
 9.04 Intr - 172752 172650  103  2  1  108   91   146 0.889  17.58
 9.03 Intr - 175844 175660  185  2  2   60   81   108 0.825   6.40
 9.02 Intr - 178758 178555  204  0  0  104  110   203 0.931  24.02
 9.01 Init - 179016 178954   63  0  0  101   96    -5 0.601   2.89
 9.00 Prom - 182699 182660   40                                -0.71

10.04 PlyA - 186920 186915    6                                 1.05
10.03 Term - 196045 195387  659  2  2  132   48   543 0.991  49.24
10.02 Intr - 197874 197613  262  0  1   71  105   443 0.999  41.90
10.01 Init - 200647 200333  315  1  0   76   99   616 0.817  56.84

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_1|464_aa PRPLPTRAAVGRSRCCRPYPPASGCCCCCSCARRTSADCGAHQGTGEWVGKIFREDTYPR KRMCEEGPGFRIGAPHPNMGEQRAQDSLLGAPLPAAALTNRKPFCTAKSVREGNASTSPK SDEERWAVGSPLVMDPTSICRKARRLAGRQAELCQAEPEVVAELARGARLGVRECQFQFR FRRWNCSSHSKAFGRILQQDIRETAFVFAITAAGASHAVTQACSMGELLQCGCQAPRGRA PPRPSGLPGTPGPPGPAGSPEGSAAWEWGGCGDDVDFGDEKSRLFMDARHKRGRGDIRAL VQLHNNEAGRLAVRSHTRTECKCHGLSGSCALRTCWQKLPPFREVGARLLERFHGASRVM GTNDGKALLPAVRTLKPPGRADLLYAADSPDFCAPNRRTGSPGTRGRACNSSAPDLSGCD LLCCGRGHRQESVQLEENCLCRFHWCCVVQCHRCRVRKELSLCL >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_1|1395_bp cctcgccccctgcccacccgggcggccgtagggcggtcacgatgctgccgcccttaccct cccgcctcgggctgctgctgctgctgctcctgtgcccggcgcacgtcggcggactgtggt gctcaccaaggaacaggcgagtgggtaggaaagatctttcgggaagacacatatccgagg aaaagaatgtgtgaggagggccctggattcaggataggtgctccccaccccaatatgggc gagcagagggcccaggacagtctcctgggggcacctttgcctgccgcagccctgaccaat aggaagcccttttgtacagcgaagtctgtcagggaagggaatgcctccacctctcccaag agtgatgaggagagatgggctgtgggcagccccttggttatggaccctaccagcatctgc aggaaggcacggcggctggccgggcggcaggccgagttgtgccaggctgagccggaagtg gtggcagagctagctcggggcgcccggctcggggtgcgagagtgccagttccagttccgc ttccgccgctggaattgctccagccacagcaaggcctttggacgcatcctgcaacaggac attcgggagacggccttcgtgttcgccatcactgcggccggcgccagccacgccgtcacg caggcctgttctatgggcgagctgctgcagtgcggctgccaggcgccccgcgggcgggcc cctccccggccctccggcctgcccggcacccccggaccccctggccccgcgggctccccg gaaggcagcgccgcctgggagtggggaggctgcggcgacgacgtggacttcggggacgag aagtcgaggctctttatggacgcgcggcacaagcggggacgcggagacatccgcgcgttg gtgcaactgcacaacaacgaggcgggcaggctggccgtgcggagccacacgcgcaccgag tgcaaatgccacgggctgtcgggatcatgcgcgctgcgcacctgctggcagaagctgcct ccatttcgcgaggtgggcgcgcggctgctggagcgcttccacggcgcctcacgcgtcatg ggcaccaacgacggcaaggccctgctgcccgccgtccgcacgctcaagccgccgggccga gcggacctcctctacgccgccgattcgcccgacttctgcgcccccaaccgacgcaccggc 
tcccccggcacgcgcggtcgcgcctgcaatagcagcgccccggacctcagcggctgcgac ctgctgtgctgcggccgcgggcaccgccaggagagcgtgcagctcgaagagaactgcctg tgccgcttccactggtgctgcgtagtacagtgccaccgctgccgtgtgcgcaaggagctc agcctctgcctgtga >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_2|458_aa MGSAHPRPWLRLRPQPQPRPALWVLLFFLLLLAAAMPRSAPNDILDLRLPPEPVLNANTV CLTLPGLSRRQMEVCVRHPDVAASAIQGIQIAIHECQHQFRDQRWNCSSLETRNKIPYES PIFSRGFRESAFAYAIAAAGVVHAVSNACALGKLKACGCDASRRGDEEAFRRKLHRLQLD ALQRGKGLSHGVPEHPALPTASPGLQDSWEWGGCSPDMGFGERFSKDFLDSREPHRDIHA RMRLHNNRVGRQGWQEREFPCLFGIPLQEARASSPKGPLQLLERRAGEEGSRQAVMENMR RKCKCHGTSGSCQLKTCWQVTPEFRTVGALLRSRFHRATLIRPHNRNGGQLEPGPAGAPS PAPGAPGPRRRASPADLVYFEKSPDFCEREPRLDSAGTVGRLCNKSSAGSDGCGSMCCGR GHNILRQTRSERCHCRFHWCCFVVCEECRITEWVSVCK >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_2|1377_bp atgggcagcgcccaccctcgcccctggctgcggctccgaccccagccccagccgcggcca gcgctctgggtgctcctgttcttcctactgctgctggctgctgccatgcccaggtcagca cccaatgacattctggacctccgcctccccccggagcccgtgctcaatgccaacacagtg tgcctaacattgccaggcctgagccggcggcagatggaggtgtgtgtgcgtcaccctgat gtggctgcctcagccatacagggcatccagatcgccatccacgaatgccaacaccaattc agggaccagcgctggaactgctcaagcctggagactcgcaacaagatcccctatgagagt cccatcttcagcagaggtttccgagagagcgcttttgcctacgccatcgcagcagctggc gtggtgcacgccgtgtccaatgcgtgtgccctgggcaaactgaaggcctgtggctgtgat gcgtcccggcgaggggacgaggaggccttccgtaggaagctgcaccgcttacaactggat gcactgcagcgtggtaagggcctgagccatggggtcccggaacacccagccctgcccaca gccagcccaggcctgcaggactcctgggagtggggcggctgcagccccgacatgggcttc ggggagcgcttttctaaggactttctggactcccgggagcctcacagagacatccacgcg agaatgaggcttcacaacaaccgagttgggaggcaggggtggcaggagagggaatttccc tgcctatttgggatcccccttcaggaagcaagagcctccagccccaagggccccctgcag ctgctggaaaggcgggcaggagaggaggggagccggcaggcagtgatggagaacatgcgg cggaagtgcaagtgccacggcacgtcaggcagctgccagctcaagacgtgctggcaggtg acgcccgagttccgcaccgtgggggcgctgctgcgcagccgcttccaccgcgccacgctc atccggccgcacaaccgcaacggcggccagctggagccgggcccagcgggggcaccctcg ccggctccgggcgctcccgggccgcgccgacgggccagccccgccgacctggtctacttc 
gaaaagtctcccgacttctgcgagcgcgagccgcgcctggactcggcgggcaccgtgggc cgcctgtgcaacaagagcagcgccggctcggatggctgcggcagcatgtgctgcggccgc ggccacaacatcctgcgccagacgcgcagcgagcgctgccactgccgcttccactggtgc tgtttcgtggtctgcgaagagtgccgcatcaccgagtgggtcagcgtctgcaagtga >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_3|155_aa MADSRAVAEKLQDKPGTSCCDRKTRRGIPLSKWAVTYVIMFLFKDFYVGRGEIVPQNLRD EAETTGKMGGRSHPGEECALLAHPCHGNGTPPPPRTAAARPGPAGGEKRQLLLGTATRPG LWQRAQAQGGRLELWHQSPSRHPHCLQPVQAPSLT >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_3|468_bp atggctgattccagagctgtggcagagaaattacaagataagcctggaacatcttgctgt gacagaaagacgcggcgaggaattccattgagcaaatgggcggtgacttatgtaatcatg ttcctatttaaggacttttacgttgggcgcggagagatagtgccgcagaacctgcgggac gaagctgagacaactggaaaaatgggagggcgctcccaccctggggaagaatgtgccctc cttgcgcacccttgccatggtaacggcacacccccacccccgcgcaccgcagcggcccgg cctggccccgcgggcggcgagaaaagacagttgctgctgggcacggccactcgtcctggc ttgtggcagagggctcaggcacaaggtgggcggttagagctctggcaccagagcccttct cgacatccccactgcttacaaccagtccaagccccatctctgacttag >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_4|67_aa MDQGAGREAASDGEMSGIGSSPDHVGSNPTRLLLRLSLKARMPRVSASGEKTRYREIGGT YAFTGRK >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_4|204_bp atggaccagggggcaggcagagaggcagcaagtgacggtgagatgagtggcattggcagc agccccgaccatgtgggcagcaaccccacccggcttcttctcagactctcactgaaggcg aggatgcccagggtgagcgccagcggcgagaagacgcgctatcgcgagatcggtgggacc tacgcgtttacaggccggaagtga >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_5|119_aa MGSRYVAQAGLELLTSGDLPAFACQVQTQRPPDLRVLRLDVFGYKTRKLLLNWLKQAQSR HLASIMIMIMIIIIIITADAKHLPSLLSGRLIPTALLADGKPLTGSEVWGITKKLVLTM >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_5|360_bp atgggatctcgctatgttgcccaggctggccttgaactcctgacctcaggtgatctgccc gcctttgcctgccaggtccaaacacaaaggcctcccgaccttagagttctgcgtctggat gtttttggctacaaaaccagaaaactccttctcaactggcttaaacaggctcagagtcgg catttagccagcatcatgatcatgatcatgatcatcatcatcatcatcactgcagatgcc aaacaccttccatctcttctttctgggaggctcattcccacagctcttctggctgatggt 
aaacctttaactggttctgaagtttggggcatcaccaagaagcttgtgttgaccatgtga >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_6|185_aa MGMTCVVQRQISEMNQNISRLQAETEGLKGQGASLEAAIADAEQWGELAIKDANTKLSEL EAAMQRAKQDMARSWKLALDIEIATYRKLLEGEESRLESGMQNVSIHKKTTSGYAGEIHP YVDDIQIYVSNLDLSFKHQTYLRSPLGCIIGISTWPKPTPDTPSPLAPAAAPISVNSNSS LLNLR >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_6|558_bp atggggatgacctgcgttgtgcaaagacagatctccgagatgaaccagaacatcagcagg ctccaggctgagactgagggcctcaaaggccagggggcttccctggaggccgccatcgca gatgccgagcagtggggggagctggccattaaagatgccaacaccaagctgtctgagctg gaggccgccatgcagcgggccaagcaggacatggcacgcagctggaagctggccctggac atcgagatcgccacctacaggaagctgctggagggcgaggagagccgtctggagtctggg atgcagaacgtgagtatccataagaagaccaccagtggctatgcaggtgaaattcaccca tatgttgatgacattcaaatttacgtctccaacctggatctctcttttaaacaccagacc tacttgagatctccacttggatgtattataggcatttcaacatggccaaaaccaactcct gatactccctccccacttgctcctgctgctgctcccatctccgttaacagtaattctagt ttactaaatctcagatga >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_7|367_aa MGTVLSLSPASSAKGRRPGGLPEEKKKAPPAGDEALGGYGAPPVGKGGKGESRLKRPSVL ISALTWKRLVAASAKKKKGSKKVTPKPASTGPDPLVQQRNRENLLRKGRDPPDGGGTAKP LAVPVPTVPAAAATCEPPSGGSAAAQPPGSGGGKPPPPPPPAPQVAPPVPGGSPRRVIVQ ASTGELLRCLGDFVCRRCYRLKELSPGELVGWFRGVDRSLLLQGWQDQAFITPANLVFVY LLCRESLRGDELASAAELQAAFLTCLYLAYSYMGNEISYPLKPFLVEPDKERFWQRCLRL IQRLSPQMLRLNADPHFFTQVFQDLKNEGEAAASGGGPPSGGAPAASSAARDSCAAGTKH WTMNLDR >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_7|1104_bp atgggcacagtgctgtctctttcgcctgcctcctcggccaagggccggaggcccggcggg ctgcccgaggagaagaagaaggcgccgcccgcgggggacgaggcgctggggggctacggg gcgccgccagtgggcaagggcggcaaaggcgagagccgactcaagcggccgtccgtgctc atctcggcgctcacctggaagcgcctggtggccgcgtccgccaagaagaagaaaggcagc aagaaggtgacacccaagccggcatccacgggccccgaccccctggtccagcaacgcaac cgcgagaaccttctccgcaagggccgggatccccccgacggcggcggcaccgccaagccc ctggcggtgccagtgcccaccgtgcccgcggctgccgccacctgcgagccaccgtcgggg ggcagcgcggccgctcagccgccgggctcgggcgggggaaagcctccgccgccgcctccc 
ccagccccgcaggtggcgccgccggtgcctggcggctcgccgcggcgggtcatcgtgcag gcgtccaccggcgagctgctgcgctgtctgggcgacttcgtgtgccgacgctgctatcgc ctcaaggagctgagcccgggcgagctggtgggctggttccgcggtgtggaccgctcgctg ctgctgcagggctggcaagaccaggccttcattacgcctgcaaacctggtgttcgtgtac ctgctgtgccgcgagtcgctgcgtggggacgagctggcgtcggccgccgagctgcaggcc gccttcctcacctgcctctacctcgcctactcctacatgggcaacgagatctcctaccca ctcaagcccttcctcgtggagcccgacaaggagcgcttctggcagcgctgcctgcgcctc atccagcggctcagcccgcagatgctgcggctcaacgccgacccccacttcttcacgcag gtctttcaagacctcaagaacgagggcgaggccgccgccagcggcgggggcccaccgagc gggggcgcgcccgccgcctcctcggccgccagggacagctgcgcggccggaaccaagcac tggactatgaacctggaccgctag >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_8|276_aa MARAHPPRVSPAAGRGGLPDPVGDGLFKDGKNPSWGPLSPAVQKGEMAGALGGGGEFHKT VHLESYVLFSGSRISIVQKRGSGQIQLWQFLLELLADRANAGCIAWEGGHGEFKLTDPDE VARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRYAYRFDFQGLAQACQPPPAHA HAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGLSKLNLMAASAGVAPAGFSYWPGPGPAA TAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_8|831_bp atggcccgggcccatcccccgcgcgtctccccggctgcggggcgcggggggctgccggat cccgtcggagacggtctcttcaaggacgggaagaacccgagctgggggccgctgagcccc gcggttcagaaaggtgagatggccggcgcgctgggcggcggaggcgagttccacaaaacc gtgcatctggaaagctacgtgctcttcagtggaagccgcatttccattgtgcaaaagcgc ggcagcggacagatccagctgtggcagtttctgctggagctgctggctgaccgcgcgaac gccggctgcatcgcgtgggagggcggtcacggcgagttcaagctcacggacccggacgag gtggcgcggcggtggggcgagcgcaagagcaagcccaacatgaactacgacaagctgagc cgcgccctgcgctactactacgacaagaacatcatgagcaaggtgcatggcaagcgctac gcctaccgcttcgacttccagggcctggcgcaggcctgccagccgccgcccgcgcacgct catgccgccgccgcagctgctgccgccgccgcggccgcccaggacggcgcgctctacaag ctgcccgccggcctcgccccgctgcccttccccggcctctccaaactcaacctcatggcc gcctcggccggggtcgcgcccgccggcttctcctactggccgggcccgggccccgccgcc accgctgccgccgccaccgccgcgctctaccccagtcccagcttgcagcccccgcccggg cccttcggggccgtggccgcagcctcgcacttggggggccattaccactag >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_9|2467_aa 
MEGPSGVGLVPKSLFGVPSLRLHTQSAPFGLCPKDMMLTQAPSSVVRSRNSRNHTVNSGG SCLSASTVAIPAINDSSAAMSACSTISAQPASSMDTQMHSPKKQERVNKRVIWGIEVAEE LHWKGWELGKETTRNLVLKNRSLKLQKMKYRPPKTKFFFTVIPQPIFLSPGITLTLPIVF RPLEAKEYMDQLWFEKAEGMFCVGLRATLPCHRLICRPPSLQLPMCAVGDTTEAFFCLDN VGDLPTFFTWEFSSPFQMLPATGLLEPGQASQIKVTFQPLTAVIYEVQATCWYGAGSRQR SSIQLQAVAKCAQLLVSIKHKCPEDQDAEGFQKLLYFGSVAVGCTSERQIRLHNPSAVNA PFRIEISPDELAEDQAFSCPTAHGIVLPGEKKCVSVFFHPKTLDTRTVDYCSIMPSGCAS KTLLKVVGFCRGPAVSLQHYCVNFSWVNLGERSEQPLWIENQSDCTAHFQFAIDCLESVF TIRPAFGTLVGKARMTLHCAFQPTHPIICFRRVACLIHHQDPLFLDLMGTCHSDSTKPAI LKPQHLTWYRTHLARGLTLYPPDILDAMLKEKKLAQDQNGALMIPIQDLEDMPAPQYPYI PPMTEFFFDGTSDITIFPPPISVEPVEVDFGACPGPEAPNPVPLCLMNHTKGKIMVVWTR RSDCPFWVTPESCDVPPLKSMAMRLHFQPPHPNCLYTVELEAFAIYKVLQSYSNIEEDCT MCPSWCLTVRARGHSYFAGFEHHIPQYSLDVPKLFPAVSSGEPTYRSLLLVNKDCKLLTF SLAPQRGSDVILRPTSGLVAPGAHQIILICTYPEGSSWKQHTFYLQCNASPQYLKEVSMY SREEPLQLKLDTHKSLYFKPTWVGCSSTSPFTFRNPSRLPLQFEWRVSEQHRKLLAVQPS RGLIQPNERLTLTWTFSPLEETKYLFQVGMWVWEAGLSPNANPAATTHYMLRLVGVGLTS SLSAKEKELAFGNVLVNSKQSRFLVLLNDGNCTLYYRLYLEQGSPEAVDNHPLALQLDRT EGSMPPRSQDTICLTACPKQRSQYSWTITYSLLSHRGTEDNAQYTESVRQGLAVVGNALE PERGPLLLMRVLGPTAQDSCTFGLHAPSREDNKAGEKQELCCVSLVAVYPLLSILDVSSM GSAEGITRKHLWRLFSLDLLNSYLERDPTPCELTYKVPTRHSMSQIPPVLTPLRLDFNFG AAPFKAPPSVVFLALKNSGVVSLDWCVLHTSRQVCVARILLRAFLLPSDQRIDVELWAEQ AELNSTELHQMRVQDNCLFSISPKAGSLSPGQEQMVELKYSHLFIGTDHLPVLFKVSHGR EILLNFIGVTVKPEQKYVHFTSTTHQFIPIPIGDTLPPRQIYELYNGGSVPVTYEVQTDV LSQVQEKNFDHPIFCCLNPKGEIQPGSTARVLWIFSPIEAKTYTVDVPIHILGWNSALIH FQGVGYNPHMMGDTAPFHNISSWDNSSIHSRLVVPGQNVFLSQSHISLGNIPVQSKCSRL LFLNNISKNEEIAFSWQPSPLDFGEVSVSPMIGVVAPEETVPFVVTLRASVHASFYSADL VCKLYSQQLMRQYHKELQEWKDEKVRQEVEFTITDMKVKKRTCCTACEPARKYKTLPPIK NQQSVSRPASWKLQTPKEEVSWPCPQPPSPGMLCLGLTARAHATDYFLANFFSEFPCHFL HRELPKRKAPREESETSEEKSPNKWGPVSKQKKQLLVDILTTIIRGLLEDKNFHEAVDQS LVEQVPYFRQFWNEQSTKFMDQKNSLYLMPILPVPSSSWEDGKGKQPKEDRPEHYPGLGK KEEGEEEKGEEEEEELEEEEEEEEETEEEELGKEEIEEKEEERDEKEEKVSWAGIGPTPQ PESQESMQWQWQQQLNVMVKEEQEQDEKEAIRRLPAFANLQEALLENMIQNILVEASRGE 
VVLTSRPRVIALPPFCVPRSLTPDTLLPTQQAEVRAEASGALCSTELAEDQDQEITEGDR QAPGPPLPPRDEPLAQTGPERFVRSARVRQGRPLSTSPGAGLIATQAPAAATATAISTVR GLGPPQPPVPVPGLLVMLGKMQALTELEPRLSLKTSETVRLTEKDKPPGTAVHHLPQTSE SHILASISWGGGGVSQCRAGLQLSGQRGLLLPSGGRIQNCRIEMGEGGKARSDSSPLTPA RAGDTEGKDWDTKGPELARAPLCAPALSNPELQRWESKLPRPPRQLPLENFLYTPFPVDI WPTPDHLVRSLVQEVLLSAYYASSTVLGLRITTENKTQSLPLQMELTLCGMSSAPAPGPA PASLTLWDEEDFQGRRCRLLSDCANVCERGGLPRVRSVKVENGVWVAFEYPDFQGQQFIL EKGDYPRWSAWSGSSSHNSNQLLSFRPVLCANHNDSRVTLFEGDNFQGCKFDLVDDYPSL PSMGWASKDVGSLKVSSGAWVAYQYPGYRGYQYVLERDRHSGEFCTYGELGTQAHTGQLQ SIRRVQH >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_9|7404_bp atggaggggcctagtggggtgggccttgtccccaagtccctctttggagtgccttccctg aggctccatactcagagtgctccctttggactgtgtcccaaggacatgatgctcacccag gctccaagctccgtcgtgaggtccaggaacagcaggaaccacaccgtgaactctggtgga tcctgcctgagtgccagcacagtggccatccctgccatcaacgacagcagtgcagccatg agtgcctgcagcaccatcagcgcccagcccgcaagctccatggacactcagatgcactcc ccaaagaagcaggagagagtgaacaagagggtcatctggggcattgaggtggctgaggag ctgcattggaaaggctgggagctaggaaaggagaccacaaggaatctggttctgaaaaat cgatccttgaaactccagaagatgaagtacaggccccccaagaccaagttcttcttcacg gtcatccctcagcccatcttcctgagcccaggcataaccctcacgctccccatcgtcttc cggcctctggaggcgaaggagtacatggaccagctgtggtttgagaaagcggaggggatg ttctgtgtcggcctacgggccaccctgccctgccacaggctgatctgccgcccaccatcc ctgcagctgcccatgtgtgctgtgggagatacgactgaggcctttttctgcctggataat gtgggggacctgcccaccttcttcacctgggagttctccagcccattccagatgctgccc gccacggggctcctggagccaggccaggcctctcagatcaaggtgacctttcagcccctt acagccgtcatctacgaggtgcaggccacgtgctggtacggggcgggcagccggcagagg agcagcatccagctgcaggctgtggccaagtgcgcccagctgctggtgagcataaagcac aagtgcccggaggaccaggatgccgagggcttccagaagctgttgtactttggctctgtt gctgtgggctgcacctcggagaggcagatcaggctacacaacccgtcggcggtaaatgcc cccttcaggattgaaatttccccggatgaactggccgaagaccaggccttctcatgcccc acggcccatggcatcgtgcttccgggagagaagaaatgtgtgtcggtgttcttccacccc aagactctggacaccagaactgtggactactgctccatcatgccttctggctgtgcctcc aagaccctgcttaaagtcgttggtttctgtagaggccctgctgtgtccctgcagcactac 
tgtgtcaacttcagctgggtcaaccttggggagcgctccgagcagcccctgtggattgag aaccaatcggactgcacggcccacttccagtttgccatcgactgcttggagagtgtcttt accatcaggcctgcctttgggacgctggtgggcaaggcccgtatgaccctgcactgtgcc ttccagcccactcaccccatcatctgctttcggcgtgtggcctgtctcatccaccaccag gacccactgttcctggacctgatggggacctgccactcggacagcaccaagccagccatc ctgaagcctcagcacctcacctggtaccgcacacacctggcccggggcctgacgctctac ccccctgacatcctggatgccatgctgaaggagaagaagctggcacaggaccagaacggg gctctcatgattcccatccaggatctggaggacatgccggccccgcagtacccttatatc ccccccatgaccgagttcttcttcgacggcaccagcgacataaccatcttccccccgccc atcagtgtagagcctgtcgaggtagacttcggtgcctgcccagggcctgaggcccccaac cctgtacccctgtgcctgatgaaccacaccaagggcaagatcatggtggtctggacgcga aggtctgactgccccttctgggtgactccagagagctgcgacgtgcccccactcaagtcc atggccatgcgcctgcacttccagccgcctcaccccaactgcctttacacggtggagctc gaagccttcgccatctataaggtcctgcagagctacagtaatattgaggaggactgcacc atgtgcccatcctggtgcctgacggtgcgggcacgaggccacagctatttcgctggcttt gagcaccacatcccccagtattccctagatgtccccaagctatttccagcagtgtcctcc ggtgagcccacctaccgcagcctgctcctggtcaacaaagactgcaagctgctgaccttc agcctggccccccagagaggctcagacgtcatccttcggcccacttcgggccttgtggca cccggggcccaccagatcatcctcatctgcacctaccctgagggcagctcctggaagcag cacactttctatctgcagtgcaatgcttccccccagtatctcaaggaggtgagcatgtac agccgggaggagccactgcagctgaagctggacacccacaaaagcctctacttcaagccc acctgggtgggctgctcctccaccagccccttcaccttccgcaacccctcgcgtctgccc ctgcagttcgagtggagggtctctgagcagcatcgaaagctgctggctgtccagccctcc agggggctaatccagcccaacgagagacttacgctgacgtggaccttcagccctttggag gagaccaagtacctgttccaagtggggatgtgggtctgggaagccggcctgtccccaaat gccaaccccgctgccaccacccactacatgctccggctggtgggcgttgggctcaccagc agcctctctgcaaaggaaaaggagctggcctttgggaatgtgctggtgaacagcaagcag tccaggttccttgtcctcctgaatgacggcaactgcaccctctattaccgcctctacctg gagcagggcagccctgaggccgttgacaaccaccccctcgctctgcagctggaccgaaca gaggggagcatgccaccccggtcccaggacaccatctgcctgactgcctgtcccaagcag cggtcccagtactcctggaccatcacctactctctcctttcccacagaggcactgaggac aatgcccagtacacagaaagtgtcagacaagggctggctgttgttggaaatgccctagag 
cccgagaggggtccactgctcctgatgagggtcctggggccaactgcccaggactcctgc acctttggcctgcatgccccctccagggaagataacaaggctggggagaagcaggagctg tgctgcgtctccctggtggccgtgtaccccttgctttccatcctggatgtcagctccatg ggcagtgctgagggtatcacccggaagcacctgtggcgcctcttctctctggacctgctt aacagttacttggagcgtgaccccaccccctgtgagctcacctacaaggtgcccacccgg cacagcatgagccagatcccccccgtcctcacccctttaaggcttgacttcaatttcggg gccgcaccattcaaggccccaccttccgtggtattcctggccctgaagaacagcggagtg gtgtccctggactggtgcgtgttgcacacttcccgtcaggtgtgtgttgcacgcattctc ctcagggccttcctccttccaagtgaccagcggattgacgtggagctctgggcagagcaa gcagagttgaattccactgagctccaccagatgcgcgtgcaggacaattgcctcttctcc atcagccccaaggctgggagcctgagtcctgggcaggagcagatggtggagttaaaatac agccacctgttcatcggtactgatcacctcccagtgctcttcaaggtgtcccatggccgg gagatcctgctaaatttcataggtgtgacagtgaagccggagcagaagtatgtgcacttc acctctactacccaccagttcatccccattcccattggtgacacgctacccccacggcag atttatgagctgtataatggtggctcagtgcccgtgacatatgaggtccagaccgatgtc ctgtcacaggttcaggaaaaaaattttgatcaccccatcttttgctgcctcaaccccaaa ggggagatccagccaggcagcactgcccgggtcttgtggatcttctcacctatcgaggcc aagacctacacggtggacgtgcccatacacatcctgggatggaactcggccctcatccac ttccagggagtgggctacaacccccatatgatgggggacacagccccattccacaacatc tcctcgtgggacaacagttccatacactctaggctggtggtgcctggacagaatgtcttc ctgtcccagtctcatatttccctgggaaacatacctgtgcagagcaagtgcagccgcctg ctcttcctcaacaacatctccaagaacgaggaaattgccttctcctggcagccaagtcct ctagattttggggaggtgtctgtgagtcccatgataggggtggtggctcctgaagagacg gtcccatttgtggtgaccttgagggcctctgtgcatgccagcttctacagtgcagacctg gtatgcaagctgtactcgcagcagctcatgaggcagtatcacaaggagctgcaggagtgg aaggacgagaaggtgcggcaggaagtggagttcaccatcaccgacatgaaagtgaagaag agaacatgctgcacagcctgtgaacctgcgaggaagtacaagacactgcctcccatcaag aaccagcagtctgtcagccggcctgccagctggaaactgcagaccccaaaggaggaggtg tcctggccctgcccccagccaccctcgccaggcatgctctgcctgggccttactgcccga gcccatgccaccgactactttctggctaacttcttctcagagtttccctgccactttttg caccgggagctgccaaagaggaaggcccccagggaagagtcagagacttctgaggaaaaa tcccctaacaagtggggccctgtttccaagcagaagaagcagctcctggttgacattctc 
accacaataatcaggggcctgctggaagacaagaacttccatgaggctgtggaccaaagc ctggtggagcaggtgccgtacttccgccaattctggaatgagcagtcaactaagttcatg gaccagaaaaacagcctgtacttaatgccaatcctgcctgtaccctccagcagctgggag gatgggaagggcaagcagccgaaggaagacagaccagagcactatccagggttgggaaag aaggaagagggggaggaggagaagggtgaagaggaagaagaagagttggaggaggaagag gaggaagaagaggagacagaagaggaggagttgggcaaggaggagatagaggagaaggag gaggagagggatgagaaggaagagaaagtgagctgggcgggcatcgggcccacaccacag cctgagtcccaggagtccatgcaatggcagtggcaacagcagctgaatgtcatggtgaag gaggagcaagaacaggacgagaaggaggccatcagaaggctcccggccttcgccaacctg caggaggcgctgctggagaacatgatccagaacatcctggtggaggcgagccgcggggag gtggtactcacctcgcggccacgcgtcatcgccctgccgccgttctgcgtgcccaggagt ctgaccccggacacgctgctgccgacgcagcaagcagaggtgagggcggaggctagcggg gcgctgtgcagcactgagctcgcggaagaccaggaccaggagatcaccgagggcgaccgc caggccccgggccctccgctcccgccccgcgacgagcccctcgcacaaaccggacctgag cgttttgttcgttcggctcgcgtgaggcaggggcggcctctcagcaccagcccgggggcc ggcctgatcgccacgcaggcacctgccgccgccaccgccaccgccatctcaaccgtacgg gggctaggccctccccagcctcctgtcccggttcctgggctcctggtcatgctggggaag atgcaggccctcacggagttggagccaaggctgagtctaaaaacgtctgagacagtcaga ctgactgaaaaggacaagcccccaggcacagcggttcaccaccttcctcaaacctcagaa tcccacatcctcgcttccatcagctggggtgggggcggggtctcccagtgccgggcaggc ctgcagctttcgggccagcgcggcctgctcctgccctctggtggccgaatccagaattgt cggatagagatgggggaaggagggaaggcgagatctgattcttcacccctcacccctgcc cgggctggtgacactgaaggcaaagactgggacaccaagggtccagaactggctcgtgcc ccactctgtgctcctgctctcagcaacccagaacttcagaggtgggagagcaagctgcca agacccccccgccaacttccattggagaattttctctacaccccttttcccgtggacatt tggcccacgcctgaccaccttgttcgttcactcgttcaagaagttttattgagtgcctac tatgcatccagcactgtgctaggtttgaggataacaactgagaacaaaacacagtccctg ccccttcaaatggagcttacactctgcggcatgagcagcgcccccgcgccgggcccggcg cccgccagcctcacgctctgggacgaggaggacttccagggccgtcgctgtcggctgcta agcgactgtgcgaacgtctgcgagcgcggaggcctgcccagggtgcgctcggtcaaggtg gaaaacggcgtttgggtggcctttgagtaccccgacttccagggacagcagttcattctg gagaagggagactatcctcgctggagcgcctggagtggcagcagcagccacaacagcaac 
cagctgctgtccttccggccagtgctctgcgcgaaccacaatgacagccgtgtgacactg tttgagggggacaacttccaaggctgcaagtttgacctcgttgatgactacccatccctg ccctccatgggctgggccagcaaggatgtgggttccctcaaagtcagctccggagcgtgg gtggcctaccagtacccaggctaccgaggctaccagtatgtgttggagcgggaccggcac agcggagagttctgtacttacggtgagctcggcacacaggcccacactgggcagctgcag tccatccggagagtccagcactag >gi568815596f:218859821_219060921|GENSCAN_predicted_peptide_10|411_aa MSPARLRPRLHFCLVLLLLLVVPAAWGCGPGRVVGSRRRPPRKLVPLAYKQFSPNVPEKT LGASGRYEGKIARSSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDRLNSLAISVMN QWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRNKYGLLARLAVEAGFDWVYY ESKAHVHCSVKSEHSAAAKTGGCFPAGAQVRLESGARVALSAVRPGDRVLAMGEDGSPTF SDVLIFLDREPHRLRAFQVIETQDPPRRLALTPAHLLFTADNHTEPAARFRATFASHVQP GQYVLVAGVPGLQPARVAAVSTHVALGAYAPLTKHGTLVVEDVVASCFAAVADHHLAQLA FWPLRLFHSLAWGSWTPGEGVHWYPQLLYRLGRLLLEEGSFHPLGMSGAGS >gi568815596f:218859821_219060921|GENSCAN_predicted_CDS_10|1236_bp atgtctcccgcccggctccggccccgactgcacttctgcctggtcctgttgctgctgctg gtggtgccggcggcatggggctgcgggccgggtcgggtggtgggcagccgccggcgaccg ccacgcaaactcgtgccgctcgcctacaagcagttcagccccaatgtgcccgagaagacc ctgggcgccagcggacgctatgaaggcaagatcgctcgcagctccgagcgcttcaaggag ctcacccccaattacaatccagacatcatcttcaaggacgaggagaacacaggcgccgac cgcctcatgacccagcgctgcaaggaccgcctgaactcgctggctatctcggtgatgaac cagtggcccggtgtgaagctgcgggtgaccgagggctgggacgaggacggccaccactca gaggagtccctgcattatgagggccgcgcggtggacatcaccacatcagaccgcgaccgc aataagtatggactgctggcgcgcttggcagtggaggccggctttgactgggtgtattac gagtcaaaggcccacgtgcattgctccgtcaagtccgagcactcggccgcagccaagacg ggcggctgcttccctgccggagcccaggtacgcctggagagtggggcgcgtgtggccttg tcagccgtgaggccgggagaccgtgtgctggccatgggggaggatgggagccccaccttc agcgatgtgctcattttcctggaccgcgagcctcacaggctgagagccttccaggtcatc gagactcaggaccccccacgccgcctggcactcacacccgctcacctgctctttacggct gacaatcacacggagccggcagcccgcttccgggccacatttgccagccacgtgcagcct ggccagtacgtgctggtggctggggtgccaggcctgcagcctgcccgcgtggcagctgtc tctacacacgtggccctcggggcctacgccccgctcacaaagcatgggacactggtggtg gaggatgtggtggcatcctgcttcgcggccgtggctgaccaccacctggctcagttggcc 
ttctggcccctgagactctttcacagcttggcatggggcagctggactccgggggagggt gtgcattggtacccccagctgctctaccgcctggggcgtctcctgctagaagagggcagc ttccacccactgggcatgtccggggcagggagctga