GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:31:58 Sequence gi568815579f:7953050_8157921 : 204872 bp : 51.91% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 2581 2451 131 0 2 91 23 83 0.010 2.94 1.04 Intr - 5320 5207 114 0 0 106 31 45 0.015 0.67 1.03 Intr - 7470 7309 162 2 0 96 55 40 0.387 1.11 1.02 Intr - 8563 8368 196 2 1 73 81 115 0.817 8.29 1.01 Init - 9882 9786 97 0 1 67 62 86 0.806 2.48 1.00 Prom - 10213 10174 40 -5.81 2.05 PlyA - 10401 10396 6 -1.75 2.04 Term - 10758 10434 325 2 1 107 35 531 0.821 44.29 2.03 Intr - 14741 14516 226 0 1 58 94 208 0.954 15.97 2.02 Intr - 20829 20676 154 0 1 63 81 260 0.984 22.96 2.01 Init - 20984 20970 15 2 0 110 89 38 0.964 6.02 2.00 Prom - 22766 22727 40 -3.91 3.11 PlyA - 23068 23063 6 -0.45 3.10 Term - 24769 24730 40 0 1 123 54 12 0.382 -1.75 3.09 Intr - 26790 26614 177 2 0 95 81 6 0.231 0.15 3.08 Intr - 27210 27128 83 0 2 81 77 30 0.662 0.13 3.07 Intr - 28176 28034 143 1 2 91 98 179 0.583 19.78 3.06 Intr - 28920 28891 30 1 0 124 94 -21 0.613 0.69 3.05 Intr - 34319 34128 192 0 0 87 49 62 0.317 2.18 3.04 Intr - 37992 37918 75 1 0 89 49 64 0.498 2.58 3.03 Intr - 38782 38595 188 0 2 96 103 240 0.879 26.15 3.02 Intr - 40486 40312 175 2 1 26 23 88 0.133 -4.39 3.01 Init - 45372 45297 76 0 1 47 72 109 0.709 6.40 3.00 Prom - 45909 45870 40 -2.91 4.00 Prom + 46019 46058 40 -6.40 4.01 Init + 52450 52682 233 0 2 40 -26 304 0.023 9.92 4.02 Intr + 59098 59182 85 1 1 73 91 42 0.368 3.32 4.03 Term + 62493 62558 66 2 0 117 42 119 0.960 8.43 4.04 PlyA + 62953 62958 6 1.05 5.03 PlyA - 63778 63773 6 1.05 5.02 Term - 93939 93591 349 2 1 81 35 566 0.901 44.91 5.01 Init - 97850 97843 8 2 2 67 100 11 0.556 -0.15 5.00 Prom - 98771 98732 40 -8.38 6.00 Prom + 99674 99713 40 -2.31 6.01 Init + 100001 100073 73 1 1 75 79 195 0.765 16.48 6.02 Intr + 103103 103220 118 0 1 47 76 111 0.255 5.83 6.03 Intr + 103317 103450 134 0 2 108 86 125 0.916 15.10 6.04 Intr + 104755 104871 117 2 0 91 110 31 0.465 6.54 6.05 Term + 109169 109176 8 0 2 110 52 0 0.217 -3.09 6.06 PlyA + 109575 109580 6 1.05 7.64 PlyA - 109626 109621 6 -0.45 7.63 Term - 110644 110564 81 1 0 48 44 42 0.134 -6.21 7.62 Intr - 113211 112969 243 0 0 122 77 345 0.649 35.03 7.61 Intr - 118293 118204 90 0 0 87 75 27 0.729 1.99 7.60 Intr - 119149 118999 151 0 1 71 61 248 0.970 21.08 7.59 Intr - 120248 120014 235 0 1 147 76 227 0.999 24.68 7.58 Intr - 122141 122022 120 0 0 122 92 111 0.999 15.87 7.57 Intr - 122362 122234 129 2 0 116 87 109 0.999 14.77 7.56 Intr - 128070 127954 117 1 0 100 109 158 0.974 20.04 7.55 Intr - 128431 128309 123 2 0 97 59 135 0.996 12.56 7.54 Intr - 130323 130198 126 1 0 92 80 93 0.982 10.06 7.53 Intr - 132520 132314 207 2 0 98 90 137 0.999 14.47 7.52 Intr - 133276 133151 126 2 0 112 113 165 0.999 22.56 7.51 Intr - 134162 134028 135 0 0 81 80 152 0.996 14.75 7.50 Intr - 134898 134776 123 1 0 70 80 204 0.963 18.86 7.49 Intr - 135130 134948 183 2 0 102 21 197 0.617 14.68 7.48 Intr - 136621 136496 126 2 0 64 92 240 0.999 23.16 7.47 Intr - 136910 136845 66 0 0 134 105 40 0.999 9.67 7.46 Intr - 137202 137050 153 1 0 32 101 72 0.860 3.46 7.45 Intr - 138541 138416 126 2 0 111 111 134 0.974 19.16 7.44 Intr - 141516 141397 120 1 0 97 97 104 0.988 13.17 7.43 Intr - 142454 142326 129 0 0 96 92 134 0.827 15.77 7.42 Intr - 143031 142915 117 1 0 75 92 183 0.997 18.34 7.41 Intr - 143520 143395 126 1 0 80 72 83 0.990 7.06 7.40 Intr - 143957 143832 126 0 0 105 80 143 0.991 16.36 7.39 Intr - 144365 144240 126 0 0 65 45 250 0.821 19.56 7.38 Intr - 147923 147852 72 0 0 91 97 104 0.946 11.47 7.37 Intr - 149824 149675 150 2 0 78 72 138 0.807 11.85 7.36 Intr - 150638 150513 126 0 0 137 86 178 0.989 23.66 7.35 Intr - 153184 153059 126 2 0 66 84 172 0.947 15.76 7.34 Intr - 155189 155121 69 0 0 96 113 59 0.997 8.85 7.33 Intr - 156339 156178 162 1 0 107 81 72 0.950 8.86 7.32 Intr - 156704 156582 123 0 0 102 92 194 0.981 22.26 7.31 Intr - 157918 157796 123 2 0 7 64 196 0.996 10.16 7.30 Intr - 158134 158009 126 2 0 60 86 300 0.919 28.16 7.29 Intr - 158721 158599 123 1 0 71 100 201 0.999 20.66 7.28 Intr - 159050 158928 123 0 0 94 113 4 0.928 4.56 7.27 Intr - 162591 162466 126 1 0 120 93 104 0.985 15.26 7.26 Intr - 163750 163625 126 2 0 62 73 161 0.979 13.16 7.25 Intr - 164242 164120 123 2 0 116 96 91 0.989 13.76 7.24 Intr - 164540 164415 126 0 0 92 72 147 0.960 14.66 7.23 Intr - 165973 165848 126 2 0 49 110 183 0.068 17.76 7.22 Intr - 168337 168209 129 2 0 126 15 221 0.076 19.77 7.21 Intr - 170540 170415 126 0 0 64 96 128 0.999 12.36 7.20 Intr - 170959 170735 225 2 0 68 86 274 0.993 23.68 7.19 Intr - 172968 172843 126 1 0 109 92 174 0.980 21.06 7.18 Intr - 173556 173248 309 1 0 79 48 175 0.468 9.43 7.17 Intr - 173783 173664 120 0 0 65 71 138 0.659 10.77 7.16 Intr - 176104 175979 126 2 0 114 79 164 0.985 19.26 7.15 Intr - 176316 176191 126 1 0 126 99 209 0.951 26.96 7.14 Intr - 178239 178186 54 1 0 116 107 47 0.986 8.84 7.13 Intr - 178780 178505 276 2 0 58 105 519 0.999 48.63 7.12 Intr - 180057 179737 321 1 0 92 55 312 0.521 24.68 7.11 Intr - 183037 182912 126 2 0 133 92 214 0.999 27.46 7.10 Intr - 183260 183141 120 0 0 79 55 151 0.996 11.87 7.09 Intr - 183482 183339 144 0 0 68 85 301 0.906 28.56 7.08 Intr - 185274 185092 183 1 0 93 73 124 0.978 11.68 7.07 Intr - 185515 185363 153 2 0 100 81 88 0.980 9.86 7.06 Intr - 188793 188668 126 1 0 118 98 105 0.998 15.66 7.05 Intr - 189088 188891 198 2 0 94 63 86 0.978 6.54 7.04 Intr - 191923 191828 96 2 0 119 66 140 0.589 15.38 7.03 Intr - 192889 192794 96 2 0 101 75 12 0.715 1.68 7.02 Intr - 193166 193078 89 1 2 43 83 49 0.661 0.01 7.01 Init - 194431 194055 377 1 2 85 83 146 0.592 8.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 52450 52686 237 0 0 40 37 309 0.910 14.39 S.002 Init - 166043 165848 196 2 1 94 110 204 0.898 21.57 S.003 Term - 168337 168201 137 2 2 126 42 224 0.924 20.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_1|234_aa MAAGPSSVGHHWGRGPRAACWAGPPEKDTNVFQSQILRKSQNRGPLPGFDTKEGPCSVAR EQSASWDQNAFLEKEAFQVSLGQHLLMEGGSQHPSDGSVICQAAVFCAGCASSRSSVTDS LPGAESPPRPPSCVNILKFMGFLGGPPAGGAGFTGQWAHRVAPGYGPPPSASHTSAPPTV TRGSCLFPRLPCGHPQPSKEIHPPSSSKEKRRLPIPASPQNSEPERPSPKPGPX >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_1|702_bp atggcggcgggcccgtccagcgtgggccaccactggggccgggggccacgggctgcatgc tgggcgggccctccagagaaggacacaaacgtgtttcaatcccagatcctcaggaaaagc caaaaccgaggcccattgcccggattcgacacaaaagagggtccctgctctgttgcccga gagcagtctgcatcctgggaccagaatgctttcctggaaaaagaagcctttcaggtttcc ctgggccagcatcttctgatggaaggtgggagccaacacccttctgatggaagtgtaatt tgccaagccgcagttttctgtgctggctgtgcctctagtcgcagctctgtgactgattcc ctcccgggtgctgagtcccctccccggccaccatcctgcgtgaatatcctgaaattcatg ggcttcctcgggggcccgcccgcaggtggtgctgggttcacaggtcagtgggcccaccga gtggcccctgggtatggcccccctccatctgcttcccacaccagtgctcccccaacagtg acccggggatcctgcctcttccctcgtctgccttgcggccacccccaaccaagcaaggaa atccaccccccgagctcctccaaggagaagagacggctcccgatccccgcctcaccccag aattcagagcccgagaggcccagccccaagccagggccctnn >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_2|239_aa MRLQAVSYARPSSEVIKDANLYISGLPRTMTQKDVEDMFSRFGRIINSRVLVDQTTGLSR GVAFIRFDKRSEAEEAITSFNGHKPPGSSEPITVKFAANPNQNKNVALLSQLYHSPARRF GGPVHHQAQRFRFSPMGVDHMSGLSGVNVPGNASSGWCIFIYNLGQDADEGILWQMFGPF GAVTNVKVIRDFNTNKCKGFGFVTMTNYEEAAMAIASLNGYRLGDKILQVSFKTNKSHK >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_2|720_bp atgcggctgcaggcggtgtcgtatgctcgcccgagctcagaggtgatcaaagacgccaac ttgtacatcagcgggctcccgcggaccatgacccagaaggacgtagaagacatgttctct cggtttgggcggatcatcaactcgcgggtcctcgtggatcagactacaggtttgtccaga ggggttgcgtttatccggtttgacaaacggtcggaggcagaagaggcaattaccagtttc aatggtcataaacccccaggttcctctgagcccatcacagtgaagtttgcagccaacccc aaccagaacaaaaacgtggcactcctctcgcagctgtaccactcgccagcgcgacggttc ggaggccccgttcaccaccaggcgcagagattcaggttctcccccatgggcgtcgatcac atgagcgggctctctggcgtcaacgtgccaggaaacgcctcctccggctggtgcattttc atctacaacctggggcaggatgccgacgaggggatcctctggcagatgtttgggccgttt ggtgccgtcaccaatgtgaaagtgatccgcgacttcaacaccaacaagtgcaaagggttt ggctttgtgaccatgacaaactatgaagaagccgcgatggccatagccagcctgaacggc taccgcctgggggacaaaatcttacaggtttccttcaaaaccaacaagtcccacaaataa >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_3|392_aa MPDREHHRVTAEPQWVVTTDREYQRGKTSATEKDISKPANCEDQVPLTGRETLTSFQLTH CVFGNKQVQRTIAYLPPQQQGVQGFLKNTMSNGYEDHMAEDCRGDIGRTNLIVNYLPQNM TQDELRSLFSSIGEVESAKLIRDKVAVGLLSLCVPIRNVEIVASTSWWSLPLAPYVPRGI RRSQPSTLMDDLSCRSHQAAAPFTASVLLYISGRVKAHTAAQGFNHIPEEGVSQEAGSGW SQAWKDGWLTNVFMFSCTGHSLGYGFVNYVTAKDAERAINTLNGLRLQSKTIKAGPCNWA SCSSHCTHPPSVPAFTRYPQTLPHDLGYSVAKPWLSSSSPRPAPCRMDVVCSGLPEGRRA TPASSCPVQVLRGTFPGGGWQGLLEPCAHPGL >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_3|1179_bp atgcctgaccgtgagcatcatcgtgtgaccgccgagcctcaatgggtggtgaccactgac agggaataccagaggggaaagacttcagcaactgagaaagacatttcaaagcctgcaaac tgtgaagaccaagttcctctgactggtagagaaaccctcacctctttccagctcactcac tgtgtctttgggaataaacaggtgcagaggaccattgcctacctgccccctcagcagcag ggtgtccagggatttttgaaaaatacaatgtctaatggttatgaagaccacatggccgaa gactgcaggggtgacatcgggagaacgaatttgatcgtcaactacctccctcagaacatg acccaggatgagttacgaagcctgttcagcagcattggtgaagttgaatctgcaaaactt attcgggataaagtagcagtgggacttctaagcctttgtgtccccatccggaatgtggag attgtggcttctacatcctggtggtccctacctctggccccatatgtccctcgggggatc agaaggagtcagccctctaccctcatggatgacctgtcatgccggagccaccaggctgca gccccattcacggcttctgtcttgctgtatatctctggtcgggttaaagcccacaccgct gcccaaggctttaaccacatcccggaggagggggtgtcccaagaggcggggtcgggctgg tctcaggcttggaaagacggttggcttaccaatgtcttcatgttctcttgcacaggacac agcttgggctatggctttgtgaactacgtgaccgcgaaggatgcagagagagcgatcaac acgctgaacggcttgaggctccagtcaaaaaccattaaggcaggcccctgcaactgggct tcctgcagcagccactgcacccacccgccctcagtgcccgccttcacaagatacccacag acactgccccatgatcttggttactcagtggccaaaccctggctcagcagcagctctccc agacctgccccatgccgcatggacgtggtgtgcagtgggctccctgagggcaggcgcgcc actcctgcctcctcgtgtccagtgcaggtcctcagaggcaccttccccggtgggggctgg cagggcctcctggagccttgtgctcatcctgggctctga >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_4|127_aa MRAGGPGAAPGRAGGGGGAALTAPLGLGSGGGGGGDGDGGSGSSSARTTRSGPRPGHAPP AREPIGRRPPRRAAHWPALFLAMTDSQISLVFDDLDSFEECRSGILPTPSEDDEDEDRYD PLPLNER >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_4|384_bp atgcgggcgggcgggcccggggctgctccggggcgggcgggaggcggcggcggcgcggcg ctaacggctccgctcggcctcggtagcggtggcggcggtggcggcgacggcgacggcggc agcggctcctcctcagcgcgcacgacccgctccggcccgcggcccggacacgcccccccc gcgcgcgagcccattggccgacgtccgccacggcgcgctgctcattggccggcgctcttc ttggctatgacagattctcagatttccctagtttttgatgatcttgacagttttgaggag tgccggtcaggtattttgcctactccgagtgaagatgatgaagatgaagaccgttatgat ccacttccacttaatgaaagatga >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_5|118_aa MAGCIVVAQQPEEEKEEEEEEEEEEEEEEEEEEKEEEKEEEKEDGEEKEEGEEEEGEKEE GEEKEEGEEKGGEEKEEWEEKEEEGEEEDGEEEEKEEEGRRVFPGCLASPWPDVREFG >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_5|357_bp atggccggatgcatagtggttgcccagcaaccggaggaggagaaggaggaggaggaggag gaagaggaggaggaagaggaggaggaggaggaggaggagaaggaggaggagaaggaggag gagaaggaggatggggaggagaaggaggagggggaagaggaggagggggagaaggaggag ggggaggagaaggaggagggggaggagaaggggggggaggagaaggaggagtgggaagag aaggaggaggagggggaggaggaggatggggaggaggaggagaaggaggaggaaggaaga agggtttttccaggatgtcttgcaagcccttggcctgatgttcgagaatttggttaa >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_6|149_aa MNLWLLACLVAGFLGAWAPAVHTQGVFEDCCLAYHYPIGWAVLRRAWTYRIQEVSGSCNL PAAIFYLPKRHRKVCGNPKSREVQRAMKLLDARNKVFAKLHHNTQTFQGPHAVKKLSSGN SKLSSSKFSNPISSSKRNVSLLISANSGL >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_6|450_bp atgaacctgtggctcctggcctgcctggtggccggcttcctgggagcctgggcccccgct gtccacacccaaggtgtctttgaggactgctgcctggcctaccactaccccattgggtgg gctgtgctccggcgcgcctggacttaccggatccaggaggtgagcgggagctgcaatctg cctgctgcgatattctacctccccaagagacacaggaaggtgtgtgggaaccccaaaagc agggaggtgcagagagccatgaagctcctggatgctcgaaataaggtttttgcaaagctc caccacaacacgcagaccttccaaggccctcatgctgtaaagaagttgagttctggaaac tccaagttatcatcgtccaagtttagcaatcccatcagcagcagtaagaggaatgtctcc ctcctgatatcagctaattcaggactgtga >gi568815579f:7953050_8157921|GENSCAN_predicted_peptide_7|2999_aa MTLEGLYLARGPLARLLLAWSALLCMAGGQGRWDGALEAAGPGRVRRRGSPGILQGCVVP GMLGDPFGVDWAVLGPAEYPGGCPHGQGLTRPISLSPKAECVRLPVPCLLLSRLEDIPWQ EPVCRTRACGEGFCSQPNLCTCADGTLAPSCGVSRGSGCSVSCMNGGTCRGASCLCQKGY TGTVCGQPICDRGCHNGGRCIGPNRCACVYGFMGPQCERDYRTGPCFGQVGPEGCQHQLT GLVCTKALCCATVGRAWGLPCELCPAQPHPCRRGFIPNIHTGACQDVDECQAVPGLCQGG SCVNMVGSFHCRCPVGHRLSDSSAACEDYRAGACFSVLFGGRCAGDLAGHYTRRQCCCDR GRCWAAGPVPELCPPRGSNEFQQLCAQRLPLLPGHPGLFPGLLGFGSNGMGPPLGPARLN PHGSDARGIPSLGPGNSNIGTATLNQTIDICRHFTNLCLNGRCLPTPSSYRCECNVGYTQ DVRGECIDVDECTSSPCHHGDCVNIPGTYHCRCYPGFQATPTRQACVDVDECIVSGGLCH LGRCVNTEGSFQCVCNAGFELSPDGKNCVDHNECATSTMCVNGVCLNEDGSFSCLCKPGF LLAPGGHYCMGEPGARLASGGEKGDGEAEGRRGRGEGPDGDGTSMREGEEKGQGEKEEEE GKGEKRRRRKGRKEEADIDECQTPGICVNGHCTNTEGSFRCQCLGGLAVGTDGRVCVDTH VRSTCYGAIEKGSCARPFPGTVTKSECCCANPDHGFGEPCQLCPAKDSAEFQALCSSGLG ITTDGRDINECALDPEVCANGVCENLRGSYRCVCNLGYEAGASGKDCTDVDECALNSLLC DNGWCQNSPGSYSCSCPPGFHFWQDTEICKDVDECLSSPCVSGVCRNLAGSYTCKCGPGS RLDPSGTFCLDSTKGTCWLKIQESRCEVNLQGASLRSECCATLGAAWGSPCERCEIGSIL LEASQAPMGKALHGAGPPLGWHEKMTPLFTLVLPVADPACARGFARMTGVTCDDVNECES FPGVCPNGRCVNTAGSFRCECPEGLMLDASGRLCVDVRLEPCFLRWDEDECGVTLPGKYR MDVCCCSIGAVWGVECEACPDPESLEFASLCPRGLGFASRDFLSGRPFYKDVNECKVFPG LCTHGTCRNTVGSFHCACAGGFALDAQERNCTDIDECRISPDLCGQGTCVNTPGSFECEC FPGYESGFMLMKNCMDVDECARDPLLCRGGTCTNTDGSYKCQCPPGHELTAKGTACEDID ECSLSDGLCPHGQCVNVIGAFQCSCHAGFQSTPDRQGCVDINECRVQNGGCDVHCINTEG SYRCSCGQGYSLMPDGRACADVDECEENPRVCDQGHCTNMPGGHRCLCYDGFMATPDMRT CVDVDECDLNPHICLHGDCENTKGSFVCHCQLGYMVRKGATGCSDVDECEVGGHNCDSHA SCLNIPGSFSCRCLPGWVGDGFECHDLDECVSQEHRCSPRGDCLNVPGSYRCTCRQGFAG DGFFCEDRDECAENVDLCDNGQCLNAPGGYRCECEMGFDPTEDHRACQDVDECAQGNLCA FGSCENLPGMFRCICNGGYELDRGGGNCTDINECADPVNCINGVCINTPGSYLCSCPQDF ELNPSGVGCVDTRAGNCFLETHDRGDSGISCSAEIGVGVTRASCCCSLGRAWGNPCELCP MANTTEYRTLCPGGEGFQPNRITVILEDIDECQELPGLCQGGDCVNTFGSFQCECPPGYH LSEHTRICEDIDECSTHSGICGPGTCYNTLGNYTCVCPAEYLQVNGGNNCMDMRKSVCFR HYNGTCQNELAFNVTRKMCCCSYNIGQAWNRPCEACPTPISPDYQILCGNQAPGFLTDIH TGKPLDIDECGEIPAICANGICINQIGSFRCECPAGFNYNSILLACEDVDECGSRESPCQ QNADCINIPGSYRCKCTRGYKLSPGGACVGRNECREIPNVCSHGDCMDTEGSYMCLCHRG FQASADQTLCMDIDECDRQPCGNGTCKNIIGSYNCLCFPGFVVTHNGDCVDFDECTTLVG QVCRFGHCLNTAGSFHCLCQDGFELTADGKNCVDTNECLSLAGTCLPGTCQNLEGSFRCI CPPGFQVQSDHCIDIDECSEEPNLCLFGTCTNSPGSFQCLCPPGFVLSDNGHRCFDTRQS FCFTRFEAGKCSVPKAFNTTKTRCCCSKRPGEGWGDPCELCPQEGSAAFQELCPFGHGAV PGPDDSREDVNECAENPGVCTNGVCVNTDGSFRCECPFGYSLDFTGINCVDTDECSVGHP CGQGTCTNVIGGFECACADGFEPGLMMTCEGTTLLAQDLGRVTEMPGGGSGDIDECSLNP LLCAFRCHNTEGSYLCTCPAGYTLREDGAMCRDVDECADGQQDCHARGMECKNLIGTFAC VCPPGMRPLPGSGEGCTDDNECHAQPDLCVNGRCVNTAGSFRCDCDEGFQPSPTLTECHD IRQGPCFAEVLQTMCRSLSSSSEAVTRAECCCGGGRGWGPRCELCPLPGTSAYRKLCPHG SGYTAEGRDVDECRMLAHLCAHGECINSLGSFRCHCQAGYTPDATATTCLDMDECSQVPK PCTFLCKNTKGSFLCSCPRGYLLEEDGRTCKDLDECTSRQHNCQFLCVNTVGAFTCRCPP GFTQHHQACFDNDECSAQPGPCGAHGHCHNTPGSFRCECHQGFTLVSSGHGCEDVNECDG PHRCQHGCQNQLGGYRCSCPQGFTQHSQWAQCVDENECALSPPTCGSASCRNTLGGFRCV CPSGFDFDQALGGCQEVDECAGRRGPCSYSCANTPGGFLCGCPQGYFRAGQGHCVSGLGF SPGPQDTPDKEELLSSEACYECKINGLSPRDRPRRSAHRDHQESQGSSTHHSVAVGQAYG SIPPSLSGHPIKVNLATLDSEALLTLGLNLSHLGRAERILELRPALEGLEGRIRYVIVRG NEQGFFRMHHLRGVSSLQLGRRRPGPGTYRLEVCLWSPTKRYEAGSIITPIFQGGKIEA >gi568815579f:7953050_8157921|GENSCAN_predicted_CDS_7|9000_bp atgactctggagggtctgtatttggcaaggggccccctggcccggctcctgctggcctgg tcggccctgttgtgcatggcaggtggccaaggccgctgggacggggccttggaggctgca ggtcctggacgtgtgcggaggcggggcagcccaggcatcttgcaggggtgcgtggtacct gggatgctgggagaccccttcggtgtggattgggctgtcctggggccagcggaatacccg gggggatgtccacacgggcaggggctcaccagacccatctcgctgtccccgaaggccgaa tgtgtgcggctcccggttccatgcctactgctgtccaggctggaggacattccctggcag gagccagtgtgtcgtacgcgcgcctgcggtgaaggcttctgctcccagcccaacctgtgc acctgtgcggatgggacgctggctcccagctgcggggtgagccgagggtcagggtgcagt gtgagctgtatgaatgggggcacctgccggggggcgtcctgtctgtgtcagaagggctac acaggcaccgtgtgtgggcagcccatctgtgaccgcggctgccacaatgggggtcgctgc attgggcccaaccgctgcgcctgtgtgtatggcttcatgggacctcaatgtgagagagat taccggacgggaccctgctttggccaagtaggccccgaggggtgccagcatcagctgacg ggcctcgtgtgcaccaaggcactttgctgtgccactgtgggccgtgcctggggccttcca tgtgaactttgccctgcacagccacacccctgccgccgcggcttcatccccaatatccac acgggggcctgccaagatgtggatgagtgccaggctgtgccaggcctgtgccagggaggc agctgcgtcaacatggtgggctccttccattgccgctgtccagttggacaccggctcagt gacagcagcgccgcatgtgaagactaccgggccggcgcctgcttctcagtgcttttcggg ggccgctgtgctggagacctcgccggccactacactcgcaggcagtgctgctgtgacagg ggcaggtgctgggcagctggcccggtccctgagctgtgtcctcctcggggctccaatgaa ttccagcaactgtgcgcccagcggctgccgctgctacccggccaccctggcctcttccct ggcctcctgggcttcggatccaatggcatgggtccccctcttgggccagcgcgactcaac ccccatggctctgatgcgcgtgggatccccagcctgggccctggcaactctaatattggc actgctaccctgaaccagaccattgacatctgccgacacttcaccaacctgtgtctgaat ggccgctgcctgcccacgccttccagctaccgctgcgagtgtaacgtgggctacacccag gacgtgcgcggcgagtgcattgatgtagacgaatgcaccagcagcccctgccaccacggt gactgcgtcaacatccccggcacctaccactgccggtgctacccgggcttccaggccacg cccaccaggcaggcatgcgtggatgtggacgagtgcattgtcagtggtggcctttgtcac ctgggccgctgtgtcaacacagagggcagcttccagtgtgtctgcaatgcaggcttcgag ctcagccctgacggcaagaactgtgtggaccacaacgagtgtgccaccagcaccatgtgc gtcaacggcgtgtgtctcaacgaggatggcagcttctcctgcctctgcaaacccggcttc ctgctggcgcctggcggccactactgcatgggtgagcctggagccagactggccagcgga ggggagaagggagatggggaagcagagggacggcgaggaagaggagagggaccggacggg gatgggacgagcatgagagaaggagaagagaaggggcagggagaaaaagaggaggaggag ggaaaaggggagaagcgaagaagaagaaaaggaagaaaagaagaggcagacattgacgag tgccagacgcccggcatctgcgtgaacggccactgtaccaacaccgagggctccttccgc tgccagtgcctgggggggctggcggtaggcacggatggccgcgtgtgcgtggacacccac gtgcgcagcacctgctatggggccatcgagaagggctcctgtgcccgccccttccctggc actgtcaccaagtccgagtgctgctgtgccaatccggaccacggttttggggagccctgc cagctttgtcctgccaaagactccgctgagttccaggcactgtgcagcagtgggcttggc attaccacggatggtcgagacatcaacgagtgtgctctggatcctgaggtttgtgccaat ggcgtgtgcgagaaccttcggggcagctaccgctgtgtctgcaacctgggttatgaggca ggtgcctcaggcaaggactgcacagacgtggatgagtgtgccctcaacagcctcctgtgt gacaacgggtggtgccagaatagccctggcagctacagctgctcctgcccccccggcttc cacttctggcaggacacggagatctgcaaagatgtcgacgaatgcctgtccagcccgtgt gtgagtggcgtctgtcggaacctggccggctcctacacctgcaaatgtggccctggcagc cggctggacccctctggtaccttctgtctagacagcaccaagggcacctgctggctgaag atccaggagagccgctgtgaggtgaaccttcagggagccagcctgcggtctgagtgctgc gccaccctcggggcagcctgggggagcccctgcgaacgctgcgagatcggtagtatcctc cttgaggcctcccaggcacccatgggaaaagccctccatggggcgggtccccccttgggc tggcatgagaaaatgactcctctcttcactcttgttctcccagttgcagaccctgcctgt gcccggggctttgcccggatgacgggtgtcacctgcgatgatgtgaacgagtgtgagtcc ttcccgggagtctgtcccaacgggcgttgcgtcaacactgctgggtctttccgctgtgag tgtccagagggcctgatgctggacgcctcaggccggctgtgcgtggatgtgagattggaa ccatgtttcctgcgatgggatgaggatgagtgtggggtcaccctgcctggcaagtaccgg atggacgtctgctgctgctccatcggggccgtgtggggagtcgagtgcgaggcctgcccg gatcccgagtctctggagttcgccagcctgtgcccgcgggggctgggcttcgccagccgg gacttcctgtctggccgaccattctataaagatgtgaatgaatgcaaggtgttccctggc ctctgcacgcacggtacctgcagaaacacggtgggcagcttccactgcgcctgtgcgggg ggcttcgccctggatgcccaggaacggaactgcacagatatcgacgagtgtcgcatctct cctgacctctgcggccagggcacctgtgtcaacacgccgggcagctttgagtgcgagtgt tttcccggctacgagagtggcttcatgctgatgaagaactgcatggacgtggacgagtgt gcaagggacccgctgctctgccggggaggcacttgcaccaacacggatgggagctacaag tgccagtgtccccctgggcatgagctgacggccaagggcactgcctgtgaggacatcgat gagtgctccctgagtgatggcctgtgtccccatggccagtgtgtcaatgtcatcggtgcc ttccagtgctcctgccatgccggcttccagagcacacctgaccgccagggctgcgtggac atcaacgaatgccgggtccagaatggtgggtgtgacgtgcactgtattaacactgagggc agctaccggtgcagctgtgggcagggctactcgctgatgcccgacggaagggcatgtgca gacgtggacgagtgtgaagagaacccccgcgtttgtgaccaaggccactgcaccaacatg ccagggggtcaccgctgcctgtgctatgatggcttcatggccacgccagacatgaggaca tgtgttgatgtggatgagtgtgacctgaaccctcacatctgcctccatggggactgcgag aacacgaagggttcctttgtctgccactgtcagctgggctacatggtcaggaagggggcc acaggctgctctgatgtggatgaatgcgaggttggaggacacaactgtgacagtcacgcc tcctgtctcaacatcccggggagtttcagctgtaggtgcctgccaggctgggtgggggat ggcttcgaatgtcacgacctggatgaatgcgtctcccaggagcaccggtgcagcccaaga ggtgactgtctcaatgtccctggctcctaccgctgcacctgccgccagggctttgccggg gatggcttcttctgcgaagacagggatgaatgtgccgagaacgtggacctctgtgacaac gggcagtgcctcaatgcgcccggcgggtaccgctgtgaatgtgagatgggctttgacccc accgaggaccaccgggcctgccaggatgtggacgagtgtgcgcaagggaacctctgtgca tttgggagctgtgagaacctgcctggaatgttccgctgcatctgcaatggtggctacgaa ctggaccgagggggtggcaactgcacagacatcaacgagtgtgcagacccagtaaactgc atcaacggcgtgtgcattaacacccccggcagctacctctgcagctgcccccaggatttt gagctgaaccccagcggagtgggctgcgtggacactcgggccgggaactgtttcctggag acgcatgaccgaggggacagtggcatttcctgcagtgccgagatcggagttggtgtcacc cgagcttcctgctgttgctccctgggccgggcttggggcaatccctgtgagctgtgccct atggccaacaccactgagtacagaaccctgtgcccgggtggtgagggcttccagcctaac cgcatcactgtcattctggaagacatcgacgagtgccaagagctgccagggctgtgtcag gggggtgactgcgtcaacacgtttggcagtttccagtgtgagtgcccacctggctaccac ctcagtgagcacacccgcatctgtgaggatattgacgaatgctccacacactccggcatc tgtggccctggcacctgctacaacaccctggggaactacacctgtgtctgccctgcagag tacctccaagtcaatggtggcaacaactgcatggatatgaggaagagtgtctgcttccgg cactataacggcacatgtcaaaatgagctggccttcaacgtgacccggaaaatgtgttgc tgctcctacaacattggccaggcctggaatagaccctgtgaggcctgccccactcccatc agtcctgactaccagatcctgtgtggaaatcaggccccgggattcctcactgacatccac acggggaagccccttgacattgatgagtgtggggagatccccgccatctgtgccaatggc atctgcataaaccagatcgggagtttccgctgcgagtgccccgcaggcttcaactacaac agcatcctgctggcttgtgaagatgtcgatgagtgtggcagcagggagagtccctgccag cagaatgctgactgcatcaacatccccggtagctaccgctgcaagtgcacccgagggtac aaactgtcgccaggcggggcttgtgtgggacggaatgagtgtcgggagatcccgaatgtc tgtagccatggtgactgcatggacacagaaggcagctacatgtgtctgtgtcaccgtgga ttccaggcctctgcagaccagaccctgtgcatggacattgacgagtgtgaccggcagcct tgtggaaatgggacctgcaagaacatcattggctcctacaactgcctctgcttccctggc tttgtggtgacacacaatggggattgtgtggattttgatgagtgtactaccctggtgggg caggtgtgccgatttggccattgcctcaacacagctggttccttccactgcctctgccag gatggctttgagctcacagctgatgggaagaactgtgtggacaccaatgagtgcctcagc cttgcaggaacctgcctacccggcacttgccagaacctcgagggctccttccgctgcatc tgtccccctggcttccaggtgcagagtgaccactgcattgatatcgacgagtgctcagag gagcccaacctctgcctctttggcacctgtaccaacagccctgggagcttccagtgcctc tgcccacctggctttgtcctctctgacaatgggcaccgttgctttgacacacggcagagt ttctgcttcacccgttttgaggctgggaagtgctcggtgcccaaagctttcaacaccacc aagacccgctgctgctgcagtaagaggcctggggagggctggggagacccctgcgaactg tgtccccaggagggcagcgctgcctttcaggagctctgcccctttggccacggggcagtc ccaggcccggatgactcccgagaagacgtgaatgagtgtgcagagaaccctggcgtctgc actaacggcgtctgtgtcaacaccgatggatccttccgctgtgagtgtccctttggctac agcctggacttcactggcatcaactgtgtggacacagacgagtgctctgtcggccacccc tgtgggcaagggacatgcaccaatgtcatcggaggcttcgaatgtgcctgtgctgacggc tttgagcctggcctcatgatgacctgcgagggtacaactctgcttgcccaggacctgggc cgtgtgacggagatgcctgggggagggagtggagacatcgacgaatgctccctgaacccg ctgctctgtgccttccgctgccacaataccgagggctcctacctgtgcacctgtccagcc ggctacaccctgcgggaggatggggccatgtgtcgagatgtggacgagtgtgcagatggt cagcaggactgccacgcccggggcatggagtgcaagaacctcatcggtaccttcgcgtgc gtctgtcccccaggcatgcggcccctgcctggctctggggagggctgcacagatgacaat gaatgccacgctcagcctgacctctgtgtcaacggccgctgtgtcaacaccgcgggcagc ttccggtgcgactgtgatgagggattccagcccagccccacccttaccgagtgccacgac atccggcaggggccctgctttgccgaggtgctgcagaccatgtgccggtctctgtccagc agcagtgaggctgtcaccagggccgagtgctgctgtgggggtggccggggctgggggccc cgctgcgagctctgtcccctgcccggcacctctgcctacaggaagctgtgcccccatggc tcaggctacactgctgagggccgagatgtagatgaatgccgtatgcttgctcacctgtgt gctcatggggagtgcatcaacagccttggctccttccgctgccactgtcaggccgggtac acaccggatgctactgctactacctgcctggatatggatgagtgcagccaggtccccaag ccatgtaccttcctctgcaaaaacacgaagggcagtttcctgtgcagctgtccccgaggc tacctgctggaggaggatggcaggacctgcaaagacctggacgaatgcacctcccggcag cacaactgtcagttcctctgtgtcaacactgtgggcgccttcacctgccgctgtccgccc ggcttcacccagcaccaccaggcctgcttcgacaatgatgagtgctcagcccagcctggc ccatgtggtgcccacgggcactgccacaacaccccgggcagcttccgctgtgaatgccac caaggcttcaccctggtcagctcaggccatggctgtgaagatgtgaatgaatgtgatggg ccccaccgctgccagcatggctgtcagaaccagctagggggctaccgctgcagctgcccc cagggtttcacccagcactcccagtgggcccagtgtgtggatgagaatgagtgtgccctg tcgccccccacctgcgggagcgcctcctgtcgcaacactcttggtggcttccgctgcgtc tgcccctctggctttgactttgatcaggccctcgggggctgccaggaggtggatgagtgc gccggacggcgtggcccctgtagctacagctgtgccaacacgcctggtggcttcctgtgc ggctgtcctcaaggctacttccgggctgggcaagggcactgtgtctccggcctgggcttc agccccggaccccaggacaccccggacaaagaggagctgctctcgtctgaagcctgctac gaatgcaagatcaatggcctctcccctcgggaccggccacgacgcagtgcccacagggac caccaggaatctcagggatcttcaacacaccactcagtagccgtagggcaggcctatggc tccatccctccttccctctcgggccaccccatcaaggtgaacctggccacccttgactcc gaggccctgctgaccttgggcctgaacctctcacacctgggccgggccgagcgcatcctg gagctccggccggccctggagggtctagagggccggatccgctacgtcatcgtccgcgga aacgagcaaggtttctttcgcatgcatcacctccgtggcgtcagctccctgcagctgggg cggaggcggccggggcctggaacctaccggctggaggtgtgtctttggtctccaacaaag cgctatgaggcaggcagtattatcacccccattttccaggggggcaaaattgaggcttag