GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:48:39 Sequence gi568815576f:46098384_46320077 : 221694 bp : 51.38% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 828 625 204 0 0 81 84 123 0.422 9.75 1.00 Prom - 3029 2990 40 3.39 2.00 Prom + 13379 13418 40 -3.11 2.01 Init + 15541 15838 298 0 1 81 76 232 0.967 16.69 2.02 Intr + 18437 18571 135 0 0 26 72 181 0.847 11.35 2.03 Intr + 27531 27652 122 1 2 72 42 77 0.003 2.22 2.04 Intr + 28955 29040 86 1 2 119 53 3 0.730 -0.98 2.05 Intr + 30083 30174 92 2 2 91 80 107 0.992 10.34 2.06 Intr + 30974 31167 194 0 2 67 44 121 0.229 5.33 2.07 Intr + 34698 34822 125 2 2 97 50 54 0.284 2.39 2.08 Intr + 38477 38653 177 2 0 107 42 47 0.199 1.55 2.09 Term + 39058 39175 118 1 1 76 51 107 0.306 4.11 2.10 PlyA + 39778 39783 6 1.05 3.04 PlyA - 40683 40678 6 -4.04 3.03 Term - 41037 40874 164 1 2 11 50 169 0.226 3.81 3.02 Intr - 42010 41941 70 1 1 53 73 82 0.277 2.45 3.01 Init - 49169 49113 57 2 0 99 87 31 0.208 5.35 3.00 Prom - 63336 63297 40 -1.61 4.04 PlyA - 64198 64193 6 1.05 4.03 Term - 73324 73192 133 0 1 156 42 20 0.672 2.07 4.02 Intr - 73822 73719 104 1 2 100 91 36 0.757 4.57 4.01 Init - 74468 74262 207 2 0 64 0 140 0.514 1.60 4.00 Prom - 85827 85788 40 -0.41 5.00 Prom + 99889 99928 40 -1.51 5.01 Init + 100001 100208 208 1 1 94 114 187 0.976 20.86 5.02 Intr + 116790 116950 161 1 2 109 75 203 0.982 21.32 5.03 Intr + 117431 117617 187 1 1 62 53 98 0.741 3.48 5.04 Intr + 118262 118431 170 0 2 60 9 88 0.639 -1.72 5.05 Intr + 119880 120018 139 2 1 92 90 172 0.995 18.34 5.06 Intr + 121429 121631 203 2 2 77 50 287 0.091 23.33 5.07 Intr + 133409 133856 448 1 1 100 105 608 0.632 57.20 5.08 Term + 136774 136997 224 2 2 32 44 280 0.247 15.41 5.09 PlyA + 139326 139331 6 1.05 6.00 Prom + 139336 139375 40 -2.91 6.01 Init + 139436 139564 129 1 0 46 61 146 0.988 7.81 6.02 Intr + 141283 141408 126 0 0 91 66 85 0.945 7.98 6.03 Term + 142292 142543 252 1 0 83 44 112 0.769 2.27 6.04 PlyA + 142630 142635 6 1.05 7.09 PlyA - 143620 143615 6 1.05 7.08 Term - 146447 146376 72 2 0 65 48 39 0.407 -4.20 7.07 Intr - 146783 146713 71 0 2 46 86 94 0.686 4.39 7.06 Intr - 148191 147949 243 1 0 10 98 139 0.777 5.10 7.05 Intr - 148838 148723 116 1 2 91 94 45 0.203 5.99 7.04 Intr - 149901 149789 113 0 2 96 59 61 0.119 3.68 7.03 Intr - 151656 151510 147 0 0 71 25 85 0.049 1.44 7.02 Intr - 152116 151986 131 2 2 67 21 71 0.055 -0.78 7.01 Init - 153191 153080 112 2 1 52 24 97 0.118 0.03 7.00 Prom - 153804 153765 40 -3.81 8.02 PlyA - 153920 153915 6 1.05 8.01 Sngl - 164939 158178 6762 2 0 104 45 2652 0.986 254.90 8.00 Prom - 165568 165529 40 -8.48 9.00 Prom + 167614 167653 40 -3.81 9.01 Init + 169657 169689 33 0 0 106 86 44 0.410 5.90 9.02 Intr + 170131 170208 78 0 0 82 85 118 0.989 11.14 9.03 Intr + 173952 174033 82 2 1 78 97 79 0.941 7.51 9.04 Intr + 175515 175686 172 1 1 57 68 255 0.840 19.92 9.05 Intr + 176865 177038 174 0 0 86 91 102 0.838 9.87 9.06 Intr + 180203 180278 76 2 1 99 107 135 0.866 16.51 9.07 Intr + 181686 181803 118 2 1 121 81 12 0.455 4.34 9.08 Intr + 183238 183335 98 2 2 34 86 104 0.508 5.03 9.09 Intr + 185590 185649 60 0 0 131 116 -22 0.933 4.32 9.10 Intr + 186828 186896 69 2 0 84 113 93 0.614 11.27 9.11 Intr + 188690 188771 82 1 1 86 63 172 0.880 14.21 9.12 Intr + 190040 190254 215 0 2 96 72 253 0.824 23.46 9.13 Intr + 190701 190795 95 2 2 69 81 -12 0.614 -4.54 9.14 Intr + 191019 191178 160 0 1 47 99 287 0.669 26.20 9.15 Intr + 191443 191516 74 0 2 37 105 112 0.810 6.50 9.16 Intr + 194408 194497 90 2 0 96 47 185 0.410 14.81 9.17 Intr + 196497 196531 35 0 2 79 81 5 0.085 -2.75 9.18 Intr + 199010 199153 144 0 0 37 22 141 0.159 3.16 9.19 Intr + 209767 209824 58 2 1 80 82 39 0.841 0.93 9.20 Intr + 209936 210333 398 2 2 49 80 383 0.999 28.39 9.21 Intr + 210440 210560 121 0 1 74 94 54 0.970 4.66 9.22 Intr + 211852 211978 127 1 1 33 48 112 0.676 2.89 9.23 Intr + 213758 213922 165 1 0 13 113 48 0.115 0.47 9.24 Intr + 215458 215630 173 0 2 -26 100 176 0.095 6.66 9.25 Intr + 219373 219456 84 1 0 58 50 116 0.216 4.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 27479 27578 100 2 1 80 20 133 0.887 6.31 S.002 Term + 121429 121697 269 2 2 77 49 321 0.897 23.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_1|68_aa MAARMPQEARIIGDSLHSGAGSPGRGASGWPQPLSSSQHDKPAPGHGHQTAMTDSSRQGT SYLLSPQQ >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_1|204_bp atggccgcacggatgccccaggaggcaagaatcatcggcgacagtctgcactcgggggct ggctcgccagggcgaggggcctcaggctggccgcagcctctgagcagttcccagcatgac aagcccgcccccggccatggccaccagacagccatgacagacagcagcaggcaggggaca tcttacctgctgagcccccagcag >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_2|448_aa MNLELAPFILEAQGLGSGARLLRDSVEAPGHELFQVMRLVCSRLPVPPESSVGSVLYPGL RIPGPARSLPSRAAFEPGGLNTGTQARWRGPHGAATKAPATCRDRRPSSRASDMPGSLPP TYAPLTTKELSASLPPTASGKTVGGSPQEPPASGSPGPDLGNTCPILHSRTKALSAQEPR GSLVAPLPLLQEPPLDHYDTSCPFLPPLPLTSPWWATAHELEGFEPWAAALLVPAGIPSG SNRAACASPQCLHPEPVRLEGLSAQARAGGQPPEVTCSCRGVALVPLLLQEEEEEEEVAP LLYGAVIPVATASRPGATNPQFSPVLGQSALGCGGCWGWGSEGLRTPRKEMVCQACSECG RRQHQAREGVMYRWTAVLPTGLSGQSLSYMGQGHGRLQKQGHCPQHRHLQSAHPHAHTNT NLHAALTKSQALPQDLTDVTFAVLSATE >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_2|1347_bp atgaacttggaactggcccctttcattctggaagcccagggactgggctctggggcgcgt ctgctgcgggacagtgtggaggcccctggccacgagctgttccaggtcatgagacttgtc tgctcacgcctcccggtgcctccagagtcctcagttggctccgtgctgtaccctgggctc aggatcccaggcccagcccgttctctgccctctcgtgctgcctttgagccaggcgggctg aacaccgggacccaggcccggtggagaggtccccacggggcagccaccaaggccccagcc acgtgccgggaccgccggccatcgtcccgagccagcgacatgcctggcagcctcccgccc acctacgccccgctcaccaccaaggagctgtcagcatcactgcccccaacggcatctggc aaaactgttgggggcagcccgcaagagcctccagcctctggcagccctggaccagacctt ggtaacacctgccccattctacacagccgaaccaaggcactgagcgctcaggagcccaga ggaagcctggtggctcccctacccctcctccaggaaccccctttggaccactatgatact tcctgtccattcctgccgcccctgcccctgacctcaccctggtgggccactgcccacgag ttggaaggcttcgagccatgggcagctgctctcctagtccctgctggcatccccagtggc agcaacagggcagcctgtgccagcccccagtgtcttcacccagagcctgtgcggctggag gggctctcagcccaggcaagggctggcggccagccacccgaggtgacctgcagctgcaga ggggtggccttggtgcctctcttgctgcaggaggaggaggaggaagaggaagtggcccct ctgctttacggggctgtgattccagtggccacggcatccaggcctggggccacgaacccg cagttttctcccgtcctggggcagtcagccctgggctgtgggggctgctggggctggggg tccgaagggctgcggactcccaggaaggagatggtgtgccaggcgtgttctgagtgtggg cgcaggcagcaccaggctcgagaaggagtgatgtacagatggacagctgtcctgcctact ggcctaagcggtcagtccctcagctacatgggccagggccatgggaggctccagaaacaa ggccactgcccacagcaccggcatttgcaaagcgctcatccccatgcccacacgaacaca aacctccacgcagcactgaccaagagccaggccctaccccaggacctcacagatgtcacc ttcgcggtcctttcagcaaccgaatga >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_3|96_aa MGMSLVAILVLTAKPHGSKQQKVVTNQGFSSPLLKAVVDPKDTAGMMGPADKVKKVTRGG KLTDGTYFGKITKQRDLKAGFPSVLEALCQEEDPRA >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_3|291_bp atgggcatgtccttagttgccatcctggtactgacagcaaaaccccatgggtccaagcaa cagaaggtggtcactaatcagggattttcgagcccgttgcttaaggccgtcgttgaccca aaggatacagcaggaatgatgggccctgctgataaggtgaagaaagtgacccgagggggc aaacttacagacgggacatactttggcaaaataacaaaacagcgggacctgaaggctggc ttcccaagcgtcctcgaggccttgtgccaggaagaggaccctcgggcttag >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_4|147_aa MGPSNLSKQRGFLLNKPCGHPASWRLYLKPGNSCQHALNPTGPDVTSVFYLRTKLRLMAV WTVWFVDRLVLTHGCQVPHLKENPDHLLGMGSSRAWLPSTQIHRTLLESHFLAVPLMSPP CCPPCQDAPILMSAPPALTCFQPFLRC >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_4|444_bp atggggccaagtaacctgtcaaagcaacgaggcttcctactcaacaagccctgtggccac cctgccagctggaggctctatctgaagcctggaaacagctgtcaacatgccttgaatcca acaggtcccgatgtgacctcggtcttttacctccgcacaaagttacggcttatggctgtt tggacagtctggtttgtcgatcgtttggttctcacacacggctgccaggttcctcacctg aaggagaaccctgatcacctcctggggatgggctcttctcgagcatggcttccaagtacc cagattcacaggacactgctcgagtcacacttcctggctgttcctctcatgagcccaccc tgctgccctccctgtcaggatgctcccatcctgatgtctgctccacctgctctcacctgc ttccagcccttcctcagatgctag >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_5|579_aa MVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSFGFTEYQYLGSC PGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE GCKHRTPESVSPVVNCVFEESSLYAPYENLILDVPRWNTFILKLSLPTPICGKIVFHETG PWWQKGASPPDSSDRNPLCIFILLLAQPALNRCDLGQECMKAGPVVLHAASWTGGPSVFV SVGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSEK AKLKAEILTCEHDIEDSETADLKSLAKRIYEAYLKNFNMNKVKARVILSGKASNNPPFVI HDMETLCMAEKTLVAKLVANGIQNKEAEVRIFHCCQCTSVETVTELTEFAKAIPGFANLD LNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSLRKPFCDIMEPKFDF AMKFNALELDDSDISLFVAAIICCGGHIEKMQEGIVHVLRLHLQSNHPDDIFLFPKLLQK MADLRQLVTEHAQLVQIIKKTESDAALHPLLQEIYRDMY >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_5|1740_bp atggtggacacggaaagcccactctgccccctctccccactcgaggccggcgatctagag agcccgttatctgaagagttcctgcaagaaatgggaaacatccaagagatttcgcaatcc atcggcgaggatagttctggaagctttggctttacggaataccagtatttaggaagctgt cctggctcagatggctcggtcatcacggacacgctttcaccagcttcgagcccctcctcg gtgacttatcctgtggtccccggcagcgtggacgagtctcccagtggagcattgaacatc gaatgtagaatctgcggggacaaggcctcaggctatcattacggagtccacgcgtgtgaa ggctgcaagcacagaacccctgagagcgtaagccctgttgtgaactgcgtatttgaggaa tctagcttgtacgccccttatgagaatctaatacttgatgttccaaggtggaacactttc atcctgaaactatccctccccacccccatctgtggaaaaattgtcttccatgaaaccggt ccctggtggcaaaaaggggcttctccacctgattcaagcgacaggaaccccctgtgcatc ttcatcctcctgctggctcagcctgccctaaacagatgtgacctgggccaggagtgcatg aaggcaggccctgttgtcctgcatgctgccagctggactggtggcccttccgtgtttgtc agcgtgggcttctttcggcgaacgattcgactcaagctggtgtatgacaagtgcgaccgc agctgcaagatccagaaaaagaacagaaacaaatgccagtattgtcgatttcacaagtgc ctttctgtcgggatgtcacacaacgcgattcgttttggacgaatgccaagatctgagaaa gcaaaactgaaagcagaaattcttacctgtgaacatgacatagaagattctgaaactgca gatctcaaatctctggccaagagaatctacgaggcctacttgaagaacttcaacatgaac aaggtcaaagcccgggtcatcctctcaggaaaggccagtaacaatccaccttttgtcata catgatatggagacactgtgtatggctgagaagacgctggtggccaagctggtggccaat ggcatccagaacaaggaggcggaggtccgcatctttcactgctgccagtgcacgtcagtg gagaccgtcacggagctcacggaattcgccaaggccatcccaggcttcgcaaacttggac ctgaacgatcaagtgacattgctaaaatacggagtttatgaggccatattcgccatgctg tcttctgtgatgaacaaagacgggatgctggtagcgtatggaaatgggtttataactcgt gaattcctaaaaagcctaaggaaaccgttctgtgatatcatggaacccaagtttgatttt gccatgaagttcaatgcactggaactggatgacagtgatatctccctttttgtggctgct atcatttgctgtggaggacacattgaaaaaatgcaggagggtattgtacatgtgctcaga ctccacctgcagagcaaccacccggacgatatctttctcttcccaaaacttcttcaaaaa atggcagacctccggcagctggtgacggagcatgcgcagctggtgcagatcatcaagaag acggagtcggatgctgcgctgcacccgctactgcaggagatctacagggacatgtactga >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_6|168_aa MRSPCVNKIQGLEPNAKPVLSPATLQLLALMYPVPFKEMWSRKHPLSSSPLPAGHEEATS SERPYQMRMETDAPKQALMKPRLPKEVCLNTSHFYGIGIAKHGEVSALLMLKVAQLFLTQ SKLTCLAALEFLTNQIFASSKGGCVHLPNGSDDGCCHSSPSSDVTVWK >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_6|507_bp atgagaagcccctgtgtcaacaagatccaggggctggagcccaatgccaagcctgtgttg tccccagcgaccctgcagctgctcgctctgatgtaccctgtgccattcaaggagatgtgg tccaggaaacaccccctctcctcttcacctcttcctgctggccacgaggaagccacttcc tcagagagaccctaccagatgcggatggaaacagatgcaccaaagcaagccctgatgaaa ccgcgacttcctaaggaggtttgtctcaacacttcccatttttacggcattggcattgcc aagcatggggaagtatctgctcttctcatgttaaaagtggcccagcttttcttaactcag tccaagctgacttgtttagctgcactggaatttcttaccaaccaaatatttgcatcgagc aaagggggctgtgtgcacctccctaatggcagcgatgatggctgctgtcattcaagccca tcttcagacgtcacagtctggaagtga >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_7|334_aa MLQDVPSLDTARNKQTTQYVIFEMIQRIECRCSKYSGAHGQTRRPSARNPASRTTPKLRA TSRAEDRGDRKSKAEAENHRERWGERGQAEARGDHQHPRTLSAVPVHRHRPQEPRQGPRT PRRAEEKRDQMASHVECRPLGVFECELCTLTAPYSYVGQKPPNTQSMVLLEESYVMKDPF TSDKDRFLVLGSCCSLCSRLVCVGPVGRMEVEQHGFHSVPSVLRVTPESEKSLSICHRLT HTKGSCINSQGDEMGQGGRSLRSVAWEYLRLMEELVVQEKLGKEDCKGNSARLGEKESSI KEDPQPARFSDGPGLSECCSDVTERDDLLQQGSS >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_7|1005_bp atgcttcaagatgttccatctttagacaccgccagaaacaaacaaacaacccagtatgtc atctttgaaatgatccaacgaatcgaatgccgttgctctaagtacagtggagcgcacggc cagacgcgacgcccctcagcacggaacccagcctcgcggactaccccgaagctcagagcc acatcccgcgcagaggaccgcggcgaccggaaaagcaaggcagaggcggaaaaccaccgc gagcgctggggcgagagagggcaggcggaggctcgcggggaccaccagcatccccggacc ctctcagccgtccctgttcacaggcaccgcccccaggagcctcggcaggggcctaggacg ccccggcgtgcagaagagaagcgggatcagatggcgtcccatgtagagtgccgtcctctg ggagtgtttgagtgtgaactctgtaccttgacagctccgtacagctatgtgggacagaag ccccccaacacccagtcgatggtcctcctggaggaaagctatgtcatgaaggatcccttc acctccgacaaggacagattcctggtcctcggctcgtgctgcagtttgtgcagcaggctg gtgtgtgtgggcccggtgggacggatggaagtggagcagcatggctttcacagcgtgccc agtgtactcagagtgaccccagagtctgagaagtccctgagcatctgccacaggttgaca cacacaaaggggagctgtatcaatagccagggggacgagatgggccagggtgggaggagc ctccgcagtgtggcatgggagtatctgaggctcatggaggagctggtggtccaggagaag ctgggaaaggaagactgcaaaggaaattcggcaagacttggagaaaaggaaagctccatc aaagaggacccccagccagcccggttctcggacggcccaggcctgagtgagtgctgctca gatgtgactgagagggatgacctccttcagcagggcagctcctaa >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_8|2253_aa MRPGPALLLLGVGLSLSVGRLPLPPVPRGAQAAVSGAPGGLLRGAPGLGVRGGRALLSLR PSAVRAGGAVLSGRGSLCFPHGGTGRRWYCLDLRVLLSAQRLPWPAAPALALVDLQLSAR GGRLSLTWSVRLPRSPGRLAWAFRLRLLGPGAARPASPAARVSPRSAAPGPRPQQGFVAR TECPTDGPARVMLQAVNSSSHRAVESSVSCQINACVIQRVRINTDQKGAPVRLSMQAEAT INASVQLDCPAARAIAQYWQVFSVPAVGQAPDWTQPLDLPQLEIRNSPLFIHIPNNSLQW GVYVFNFTVSITTGNPKMPEVKDSDAVYVWIVRSSLQAVMLGDANITANFTEQLILDGST SSDPDADSPLQGLQFFWYCTTDPRNYGGDRIILGSKEVCHPEQANLKWPWASGPVLTLLP ETLKGDHVYFFRMVIRKDSRTAFSDKRVHVLQGPKAIAHITCIENCERNFIVSDRFSLFL NCTNCASRDFYKWSILSSSGGEMLFDWMGETVTGRNGAYLSIKAFAFRHFLEAEFSISLY LACWSGVTSVFRHSFIINHGPQIGECKINPAKGIALITKFVVQCSNFRDKHVPLTYKIIV SDLHSVGEISSVKENTLGTILYLGPQSTVPPSFLPVGMLASQYGLKIYAQVYDSLGAFSQ VTLHATAQAPTDKNSSKTVLNQLLSFTVGPSSLLSTLIQKKDFLPAGYLLYIVASVLNNM KTELPLRDDRVNLRKHLIDQSFLLPVSTLVEIGQVVMTITKLTQKPSEFTWDAQKRATMR VWQANQALQEYQQKDKRFRSEQIEIVSTGILMSLSNILKMTSPHQVVKDPFYVIESLSDT ILANKVPGNKTTSMRTPNFNMYVKKVEKWGINQLFRNEKHCRNCFYPTLNVSSVPGLSAN GPISTMFCDFTNDLFPWLNDQENTSVEVSGFRMTGVADNGSVLEITPDVAEVYLVRKNLT FAAFNLTVGPNSEVDGSLKKTTGGFSFQVDSTVLREVLVHIVTEVMVLFTVLVYTGSQIT PTALVATFLVPHDIPPFASQSALFDPACTVKKARVVCLPVSLLQLIAQHSHSPHCTVSIV LQAPRFVMKLNDKLVRISIFSVQCLDMYGIQSEWREGYCILGEKTSWYEVHCICKNVVRA RRQLGTIGLTGIHLHTHYVMAKVIVIPNPVDLRLNIIKSLHQNPVTLFTVLFIILLYVGL AFWALYRDEMDQHLRGHVIVLPDNDPYDNLCYLVTIFTGSRWGSGTRANVFVQLRGTVST SDVHCLSHPHFTTLYRGSINTFLLTTKSDLGDIHSIRVWHNNEGRSPSWYLSRIKVENLF SRHIWLFICQKWLSVDTTLDRTFHVTHPDERLTRKDFFFIDVSSNLRKNHMWFSIFASVV AKTFNRLQRLSCCLAMLLSSLLCNIMFFNLNRQEQTESRERKYMRSMMIGIESVLITIPV QLLITFLFTCSQRKPQADLKEVSPQKHPLMSEASEHWEEYLRKWHAYETAKVHPREVAKP ASKGKPRLPKASPKATSKPKHRHRKAQIKTPETLGPNTNSNNNIEDDQDVHSEQHPSQKD LQQLKKKPRIVLPWWCVYVAWFLVFATSSISSFFIVFYGLTYGYDKSIEWLFASFCSFCQ SVLLVQPSKIILLSGFRTNKPKYCKNLSWSTKYKYTEIRLDGMRMHPEEMQRIHDQIVRI RGTRMYQPLTEDEIRIFKRKKRIKRRALLFLSYILTHFIFLALLLILIVLLRHTDCFYYN QFIRDRFSMDLATVTKLEDIYRWLNSVLLPLLHNDLNPTFLPESSSKILGLPLMRQVRAK SSEKMCLPAEKFVQNSIRREIHCHPKYGIDPEDTKNYSGFWNEVDKQAIDESTNGFTYKP QGTQWLYYSYGLLHTYGSGGYALYFFPEQQRFNSTLRLKELQESNWLDEKTWAVVLELTT FNPDINLFCSISVIFEVSQLGVVNTSISLHSFSLADFDRKASAEIYLYVAILIFFLAYVV DEGCIIMQERASYVRSVYNLLNFALKCIFTVLIVLFLRKHFLATGIIRFYLSNPEDFIPF HAVSQVDHIMRIILGFLLFLTILKTLRYSRFFYDVRLAQRAIQAALPGICHMAFVVSVYF FVYMAFGYLVFGQHEWNYSNLIHSTQTVFSYCVSAFQNTEFSNNRILGVLFLSSFMLVMI CVLINLFQAVILSAYEEMKQPVYEEPSDEVEAMTYLCRKLRTMFSFLTSQSKAKDEPEFF IDMLYGQPEKNSHRYLGLKTRNINGKKMVYLVV >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_8|6762_bp atgaggcctgggcccgctctcctccttctgggcgtgggcctgagcctgagcgtcggccgc ctcccgctgccgccggttcctcgcggggcacaagccgccgtctccggggcgcccggtggc ctcctcaggggcgccccgggcctcggggtgcgcggcggccgcgccctcctcagtctgcgg cccagcgcggtgcgggcgggcggcgctgtcctgagcggccgcggcagcctctgcttcccc catggcgggaccgggcggcgctggtactgcctcgacttgcgcgtcctgctcagcgcccaa cgcctgccctggccggccgcgcccgcgctcgcgctcgtcgacctgcagctctccgcgcgc ggcggccgcctctccctgacgtggtccgtgcggctgccgcgctcgcccgggcgcctggcc tgggccttccgcctgcggctgctcggacccggcgccgcccgcccggcctcccccgcggcc cgcgtctccccgcgctccgccgcgccaggcccgcggccccagcagggcttcgtggcccgc accgagtgccccacagacggccccgcgcgcgtgatgttgcaggccgtcaactcgtccagc cacagagccgtcgagtcgtccgtgtcctgtcagataaacgcctgcgtcatccagcgcgtg aggatcaacacggaccagaagggcgcccccgtgcgcctgagcatgcaggcggaggccacc atcaacgcctcggtgcagctggactgcccggccgcgcgcgccatcgcccagtactggcag gtgttctccgtgcccgccgtgggtcaggcgcccgactggacgcagcccttggatctgccc cagctcgagatcaggaacagccccttgttcattcacatccccaataattcgttacagtgg ggagtgtatgtgtttaatttcacggtgtccatcaccacagggaaccccaagatgcccgag gtgaaagactcggacgccgtctatgtctggatcgtcaggagttccctgcaggcggtgatg cttggcgatgccaacataacagctaatttcacagagcagctgattctggacgggtccacg tcctcggacccagatgcggacagcccgttacagggactccagttcttttggtactgtacc acagatcccagaaactacggtggggatcgaataatcctggggagcaaggaagtctgtcac cccgagcaggccaatctgaaatggccctgggcctcgggccctgtactgacacttttgcca gaaacacttaaaggcgaccacgtgtatttcttcagaatggtgattcggaaggactctagg acagcgttttctgataagagggtccacgtgctccaaggaccaaaagccatagcacacatc acatgtatcgaaaattgtgagagaaacttcattgtctctgatagattttctttgttccta aattgcacaaattgtgcaagccgtgatttctataaatggtcaattttgtcttcttcaggt ggtgagatgctatttgattggatgggggaaactgtaacaggaaggaatggtgcttatctg tctataaaagcttttgcttttcggcattttttggaagctgagttttcgatttctctgtat ctagcatgttggagtggagtgacctcggtcttcaggcattcttttattattaaccatggc cctcagatcggagaatgcaaaattaatccagctaaaggaattgcacttattactaaattt gttgtccagtgtagtaattttagggataagcacgtccctcttacatataaaataattgtt tctgatttgcacagtgttggtgaaatcagttcagtaaaagagaacaccctggggaccatc ctgtacttggggcctcagtccacagtacccccttcctttctccctgttggtatgttggct agtcaatatggcttgaagatatatgcccaggtctatgattctctaggagctttttctcag gtgactttgcatgccaccgcacaggctcccactgacaaaaattcatcaaagacagtgttg aatcagttactcagtttcaccgtgggaccaagttcattgctgtctactttgattcaaaag aaggattttttacctgcaggttacttactgtatatagtagcttccgttttgaataacatg aaaactgaattacctctccgagatgacagagtcaatctccgaaaacacctcatcgatcag tctttccttcttcctgtaagcactttggtagaaattggccaggtagtcatgactattacc aaattaacccagaaaccctctgaattcacttgggatgctcagaaacgtgccaccatgagg gtatggcaagcaaatcaagccctacaagagtatcagcaaaaagataaacgctttcgatct gaacaaatagaaatcgtgagtactggaatactaatgagtttgtctaatattcttaaaatg acttctcctcaccaagtagttaaagatcctttctatgtaatagaatctctatcagacaca atactggctaataaagtgccagggaacaaaaccacctcaatgagaacccccaatttcaac atgtatgtcaagaaagttgaaaagtggggtatcaaccagctcttcagaaatgagaaacac tgcagaaattgtttttatccaacactcaatgtgagcagtgttcctggtctgtctgcaaat ggtcccatttctacaatgttttgtgatttcacaaatgacctctttccttggttaaatgat caggaaaacacttcggtggaggtgtctggattcagaatgacaggagttgcagataacggt agtgtgctagagatcacacctgatgtagcggaagtgtaccttgtcaggaaaaacttgacc tttgcagcttttaatctcacagtgggacccaacagcgaggttgatgggtccttgaagaag acgacaggtgggtttagctttcaagtggacagcacagtgcttagggaggttctggtccac atagtaacagaagtgatggtgctgttcacagtgttggtgtacacaggcagtcagatcact cccacagcgctggtcgccaccttcctggtgcctcatgacatccctccatttgccagccag agtgccctgtttgacccagcctgcacagtgaagaaggcccgtgtagtctgcctccctgtg tccctgctgcaactcatagctcagcacagccactctccccactgtactgtatccatagtt ctgcaggcacctcgttttgtcatgaagctcaatgacaagctagtgagaatttctattttc agcgtccagtgcttggacatgtatgggatccagagtgaatggagagagggttattgcatt cttggtgagaagaccagctggtatgaggtgcactgcatctgcaagaatgtagtaagggct aggcggcagctgggcacaatcggactcacgggcattcacctgcatacccactatgtgatg gccaaggtgattgtgatccctaatcctgtggatctacggttaaacatcatcaagagcctt caccaaaaccccgtgaccctcttcactgtgcttttcattattctcttatacgtgggccta gctttttgggctttatacagggatgaaatggaccagcatcttcgggggcatgtgatagtt ctaccagataatgatccttatgataatttatgctacttggtgactatttttacaggaagt cgttgggggtctgggaccagggccaatgtctttgtgcaacttagaggaactgtgagtacc agcgacgtgcattgtttaagccatccacatttcacaactctctaccgaggtagcatcaac actttcctcctaacgacaaaaagtgacttgggggacatccattccatccgtgtgtggcac aacaacgagggtcgatcgcctagctggtatttaagtagaatcaaagtggaaaatctgttt agcaggcacatttggctgttcatatgccagaaatggctttctgttgataccactttggac agaacatttcacgttacccatccagatgagcgtctgactagaaaagactttttctttata gatgtgagtagtaatctccggaaaaaccacatgtggttctctatttttgctagtgttgtt gctaaaacattcaataggctccagagattgtcctgttgtttagcaatgttgcttagctct cttctttgtaacattatgttctttaatctaaatagacaagaacaaactgagtcaagagag aggaaatacatgagatcaatgatgataggaattgaaagtgtcttaattacaatccctgtg caattattaataacttttttgttcacctgttcccagaggaaacctcaagcggatctaaag gaggtatctcctcaaaagcatcctctaatgtcagaagcaagtgagcactgggaagaatac ttgagaaagtggcatgcttacgaaactgctaaggtgcaccccagggaggttgcaaaacct gcatctaaaggaaagcccaggcttccaaaggcttctcctaaggcaacctccaaacccaag cacaggcataggaaagcacaaatcaagaccccggagaccctcgggccaaatacaaattcc aataacaacatagaagatgatcaggatgtccattccgaacagcacccttcccaaaaggat ctccagcagcttaagaaaaagccccggatcgtcctaccttggtggtgtgtttatgttgca tggtttttggtttttgctacttctagcatatcctcattcttcattgtattttatggactg acttacggctatgacaagtcaatagaatggctctttgcatctttttgttcattctgtcag tcagttcttctggtgcagccatctaaaattatactcctgtcaggcttcagaacgaataaa cccaagtattgcaaaaacctttcatggtcaaccaagtataaatatactgagatcaggttg gatggaatgcgtatgcatccagaagaaatgcagaggatacatgaccagatcgtccgaatc cgaggcacgaggatgtaccaaccccttacagaagatgaaatcagaatattcaaaagaaag aagaggatcaagagaagagcactcctgtttctgagttacattctaactcactttatcttt ctagcccttctgttgatccttatcgtcttactacgtcacactgactgcttttactataac cagtttattcgtgatcggttctctatggatcttgctactgtgactaagctggaagacatc tatagatggctaaacagcgtgctgttgcctttgttacacaatgacctgaatccaacattt cttcctgaaagctcgtctaaaatccttggccttccattgatgaggcaagtgagagcaaaa tctagtgaaaaaatgtgtctacctgccgaaaagtttgtgcaaaacagcatcagaagagaa attcattgtcaccccaaatatggcattgacccagaagacacaaaaaactattctggcttt tggaatgaagttgataagcaggctatagatgagagtaccaatggatttacttataagcct caaggaacgcaatggctatattattcctatggactactacacacctatggatctggagga tatgcactctatttttttccagaacagcagcggtttaattccacactgaggctcaaagaa cttcaagaaagcaattggctggatgagaagacatgggctgtggttttggaattaacaact tttaatccagatataaatctgttctgtagcatttcggtcatatttgaagtctctcagtta ggagttgtcaacacaagcatatctctgcactctttttcacttgctgattttgacagaaaa gcttcagcagaaatctacttgtatgtggccattctcatttttttcttagcctacgttgtt gatgagggttgtatcattatgcaagaaagagcctcctatgtgagaagtgtgtataatttg ctcaactttgctttaaagtgcatatttactgtgttgattgtgctctttctcaggaaacat ttcctggccactggcataattcggttttacttgtcgaacccagaagacttcattcccttt catgcagtttctcaggtagatcacattatgaggataattttgggtttcctgttatttctg acaattttgaagaccctcaggtattccagattcttctacgatgtgcgcctggctcagagg gccatccaggctgccctccctggcatctgccacatggcatttgttgtgtccgtgtatttc ttcgtatacatggcttttggttacctggtgtttggtcagcatgaatggaactacagtaac ttgattcattccactcagacagtattttcctattgtgtctcagctttccagaacactgaa ttttccaataacaggattctgggggtcctgttcctctcatctttcatgctggtgatgatc tgcgtcttgatcaacttatttcaggctgtaattctgtctgcatatgaggaaatgaagcag cccgtgtatgaggagccatcggatgaagtggaagcaatgacctatttgtgccgtaagctg agaaccatgttcagctttctgacctcgcaatctaaggccaaagatgagcctgagttcttt attgacatgctgtatgggcagccagagaagaacagccaccgttatctggggctgaagacc agaaacatcaacgggaagaaaatggtttaccttgttgtttga >gi568815576f:46098384_46320077|GENSCAN_predicted_peptide_9|994_aa MAAASPLRDCQAWKDARLPLSTTSNEACKLFDATLTQYVKWTNDKSLGGIEGCLSKLKAA DPTFVMGHAMATGLVLIGTGSSVKLDKELDLAVKTMVEISRTQPLTRREQLHVSAVETFA NGNFPKACELWEQILQDHPTDMLALKFSHDAYFYLGYQEQMRDSVARIYPFWTPDIPLSS YVKGIYSFGLMETNFYDQAEKLAKEAPTLCLQHQHPTDNYWAGKAGCDGARSGNTWALCL QPQADAWSVHTVAHIHEMKAEIKDGLEFMQHSETFWKDSDMLACHNYWHWALYLIEKGLI RRTLFFQGEYEAALTIYDTHILPSLQANDAMLDVVDSCSMLYRLQMEGVSVGQRWQDVLP VARKHSRDHILLFNDAHFLMASLGAHDPQTTQELLTTLRDASEYAEGPSRGGGPHPAERC QAFACIISNPDGSVRLALLCLLTDEQTEAGRSPGENCQHLLARDVGLPLCQALVEAEDGN PDRVLELLLPIRYRIVQLGGSNAQRDVFNQLLIHAALNCTSSVHKNVARSLLMERDALKP NSPLTERLIRKAATVHLMQKPSTRQPPLQAALSMEGGGGRDEPSACRAGDVNMDDPKKEG KSLLLRRCCCSGCSVEMEDILLLADEKFDFDLSLSSSSANEDDEVFFGPFGHKERCIAAS LELNNPVPEQPPLPTSESPFAWSPLAGEKFVEVYKEAHLLALHIESSSRNQAAQAAKPED PRSQGVERFIQESKLKINLFEKEKEMKKSPTSLKRETYYLSDSPLLGPPVGNHALLMLQV RQRLRGSPGPNCCCLERPLLEEEASLGLRRRELLGCLKEEEKENNRCYFEDEDVVEDEEK AEDVEKLEAHALLPKKEIPASPSRTKIPAEKESHRDVLPDKPAPGAVNVPAAGSHLGQGK RAIPVPNKVNNEIFADSVFSHFAPVGAEEDPVKSTRLYQQSRKEVLLGACLERGIQCVHI PSSGQRPLLHTRVSRKARETPALLLLQDDTFSAL >gi568815576f:46098384_46320077|GENSCAN_predicted_CDS_9|2982_bp atggccgcagcctcgcctctgcgcgactgccaggcctggaaggatgcgaggctcccgctc tccaccacaagcaacgaagcctgcaagctgttcgatgccacgctgacccagtatgtaaaa tggaccaatgacaagagtctcggtggcatcgagggctgcctgtcaaagctcaaagcagca gatccaacctttgtgatgggccacgccatggctactggccttgtgctgattggcactgga agctccgtgaagctggacaaagagctggacctggctgtgaagacaatggtggagatttca agaacccagccgctgacaaggcgggagcagctgcacgtgtctgcagtagagacatttgcc aatgggaactttccgaaagcctgtgaactatgggaacagattctccaggaccacccgaca gacatgttggccctgaaattttcccatgatgcttatttttacctgggctatcaggaacag atgagagattctgttgctcgaatttaccccttctggacacctgacatccccctaagcagc tatgtgaaaggcatctactcttttggcttgatggaaaccaacttctacgaccaggcagaa aaactcgccaaagaggcaccaactctttgtcttcaacaccagcaccccacagacaactac tgggcaggaaaagcaggctgtgatggggccaggagtggtaacacatgggctctgtgtctg cagccccaggctgacgcatggtcggtgcacaccgtcgctcacatccacgagatgaaagca gagatcaaggatgggttggaattcatgcagcactcagagaccttctggaaggactctgat atgttggcttgtcataactattggcactgggctttatatctgattgagaagggtttaata aggagaactttattcttccagggcgaatatgaggccgcgctgaccatctacgatacccac atccttcccagcctgcaggccaacgatgcaatgctggacgtggtggacagctgctccatg ctctaccgcctgcagatggaaggagtgtctgtgggccagcggtggcaggatgtcctgcct gtggcccggaagcacagccgagaccacatcctgctgttcaatgacgcacacttcctgatg gcatccctgggtgcacacgacccccagaccacacaggagctgctgaccaccctgcgggac gccagcgagtatgcagaggggccttctcggggtgggggtcctcaccctgccgagaggtgc caggcctttgcctgtattatcagcaatcctgacggttctgttagattggcactgttatgc ctgcttacagatgagcaaactgaggctggaagatccccaggggagaactgccagcacctc ctggcccgagacgtggggctgcccctgtgccaggccctggtggaggctgaggacgggaac cctgaccgcgtcctggagctgctcctgcccatccgctaccggatcgtccagctcggtggg agcaatgcccagagagacgtcttcaaccagctgctgattcacgcggccttaaactgcacc tccagcgtccataagaacgtagcccggagccttctgatggagcgtgatgccttgaagccc aactcgcccctgaccgagcggctcatccgcaaggcagctaccgtccacctcatgcagaag ccttctacccgccaacccccactgcaggctgctctctccatggaaggaggcggcggccgc gatgagccttcagcctgccgggcaggggacgtgaacatggatgaccctaagaaggaaggc aagtccttgctgctgcggcgctgttgttgttcaggatgttcagtagagatggaggacatt cttcttttggccgatgaaaaatttgacttcgatctttcattgtcttcttcgagtgcaaat gaagatgatgaagtcttcttcggaccctttggacataaagaaagatgtattgctgccagc ttggaattaaataatccggttcccgaacagcctccgttgcccacatctgagagtcccttt gcctggagccctctggccggggagaagttcgtggaggtgtacaaagaagctcacttactg gctttacacattgagagcagcagccggaaccaggcagcccaagctgccaagcctgaagac cctcggagccagggcgtggaaagattcatacaggagtcaaaattaaaaataaacctcttt gagaaagaaaaggaaatgaagaaaagccccacgtctcttaaaagggagacatactacctg tcagacagccccttgctggggccccctgtggggaatcatgcactgctcatgctgcaagtc aggcagcgactcagaggaagcccgggaccaaattgctgctgcctcgagcggcctctgtta gaggaagaagcatccctggggctgcggagaagagaactgctaggatgcctaaaggaagaa gaaaaagaaaataacaggtgttattttgaggatgaggatgttgtggaggatgaggagaag gctgaggatgtggagaagctggaagctcacgcattgctgcccaagaaagagattccagct agtccttccaggacaaaaatcccagctgagaaggaatcccaccgggatgttctccctgac aaacctgccccgggtgctgtcaatgtgccggccgccggaagccacttgggccagggcaag cgggcgatccctgttccaaacaaggtaaataacgagatctttgctgattctgttttttca cattttgccccagttggggctgaagaagaccctgttaaaagcacccggctctaccagcaa tctcgcaaggaagtcctcctcggggcctgtttggagcggggcatccagtgcgtgcacatc cccagcagtgggcaaaggcctctgctgcacacccgtgtgtcccgcaaggcccgcgagacc cctgcactcctgctgctgcaagatgacaccttctcggccctn