GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:29:24

Sequence gi568815585f:113206484_113422418 : 215935 bp : 49.80% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3145   3292  148  0  1  108   90   295 0.999  31.95
 1.02 Intr +   3490   3605  116  2  2  109   94   259 0.864  28.77
 1.03 Intr +  22963  23036   74  0  2   67  105   106 0.011   8.40
 1.04 Intr +  26694  26856  163  0  1   65  109   180 0.854  17.78
 1.05 Intr +  29821  29923  103  0  1   -8   42    89 0.293  -5.55
 1.06 Intr +  32950  33068  119  2  2   63   79   189 0.987  15.68
 1.07 Intr +  36485  36677  193  1  1   68    0   374 0.989  25.67
 1.08 Intr +  37927  38031  105  2  0   41   93   148 0.999  10.79
 1.09 Intr +  38466  38576  111  1  0   72  115    73 0.997   8.75
 1.10 Intr +  38669  38754   86  0  2   82  109    83 0.999   9.34
 1.11 Intr +  39473  39580  108  1  0   71  100    48 0.944   4.78
 1.12 Intr +  46599  46712  114  2  0   87  100    30 0.929   4.74
 1.13 Intr +  48210  48315  106  2  1   43   68   165 0.993   9.79
 1.14 Intr +  48470  48642  173  0  2   55   40    94 0.981   0.96
 1.15 Intr +  54124  54276  153  0  0  117   94    45 0.993   8.27
 1.16 Term +  57004  57099   96  0  0   74   48   117 0.971   4.27
 1.17 PlyA +  57914  57919    6                               1.05

 2.00 Prom +  65530  65569   40                              -5.46
 2.01 Init +  69992  70226  235  1  1  104   81    25 0.127   1.70
 2.02 Intr +  72018  72165  148  1  1   16   67   141 0.281   4.09
 2.03 Intr +  82419  82597  179  0  2   77   46   126 0.672   6.86
 2.04 Intr +  86100  86189   90  1  0   59  103    50 0.812   3.57
 2.05 Term +  89041  89138   98  2  2  102   43    61 0.269   1.13
 2.06 PlyA +  89691  89696    6                              -1.75

 3.00 Prom +  89721  89760   40                              -5.96
 3.01 Init +  90952  91012   61  0  1  111   85   234 0.988  24.81
 3.02 Intr + 100002 100123  122  1  2   53   97   121 0.161   9.71
 3.03 Intr + 103160 103379  220  1  1   57  116   144 0.982  11.97
 3.04 Intr + 104226 104384  159  1  0   64   75   166 0.973  12.86
 3.05 Intr + 106908 108484 1577  1  2   34    2  1051 0.439  79.81
 3.06 Intr + 108511 108819  309  0  0  -29   70   212 0.402   4.21
 3.07 Intr + 110997 111024   28  2  1  100   39    27 0.306  -3.31
 3.08 Intr + 112986 113173  188  1  2   70   66   287 0.508  24.11
 3.09 Intr + 113862 113987  126  2  0   84   77   199 0.999  19.28
 3.10 Intr + 114921 114987   67  2  1   79   78    41 0.987   0.68
 3.11 Intr + 115074 115244  171  1  0   77  105   255 0.964  26.01
 3.12 Term + 115799 115938  140  0  2   87   38   276 0.999  20.63
 3.13 PlyA + 116923 116928    6                               1.05

 4.27 PlyA - 117736 117731    6                               1.05
 4.26 Term - 118094 118005   90  2  0   97   53    93 0.721   4.42
 4.25 Intr - 119363 119178  186  2  0   84   89   102 0.998   9.79
 4.24 Intr - 119608 119436  173  2  2   54   83   312 0.644  26.96
 4.23 Intr - 126243 126029  215  2  2  108   50    30 0.041  -0.44
 4.22 Intr - 130036 129896  141  0  0   64   72    68 0.131   2.27
 4.21 Intr - 132368 132116  253  0  1   -5   44   193 0.040   2.09
 4.20 Intr - 137375 137251  125  1  2   94   93    52 0.206   6.53
 4.19 Intr - 139968 139899   70  1  1   78   62    24 0.070  -2.96
 4.18 Intr - 142387 142345   43  1  1   63   94    42 0.193   0.11
 4.17 Intr - 144490 144366  125  2  2   77   99   194 0.961  19.70
 4.16 Intr - 148998 148840  159  1  0   83   94   167 0.966  16.76
 4.15 Intr - 157427 157279  149  1  2  108   94   326 0.864  35.08
 4.14 Intr - 157698 157537  162  2  0  108   89    -6 0.523   0.59
 4.13 Intr - 158417 158265  153  1  0   56   56    91 0.725   1.89
 4.12 Intr - 159518 159391  128  2  2   94   50    73 0.938   3.58
 4.11 Intr - 160737 160624  114  0  0   88   72    61 0.831   5.14
 4.10 Intr - 163604 163550   55  1  1   98   86    60 0.566   5.78
 4.09 Intr - 187496 187316  181  0  1   73   32    81 0.038  -0.07
 4.08 Intr - 187745 187560  186  0  0   66   35   112 0.331   3.36
 4.07 Intr - 194627 194547   81  0  0   85   83    39 0.270   2.71
 4.06 Intr - 195165 194976  190  0  1   57   33    69 0.059  -2.54
 4.05 Intr - 195756 195587  170  1  2   68   34   118 0.467   4.07
 4.04 Intr - 196025 195970   56  1  2   86   84    15 0.595  -0.48
 4.03 Intr - 201737 196899 4839  1  0   85   59  2902 0.970 274.45
 4.02 Intr - 201845 201825   21  1  0  117   81    25 0.266   1.36
 4.01 Init - 213933 213854   80  0  2   44   87    54 0.060   1.43

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   1802   1618  185  0  2   70   42   193 0.909  10.61
S.002 Init -   2577   2472  106  0  1   84   44   147 0.866   8.28
S.003 Init +  22966  23036   71  0  2   59  105   100 0.885   9.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:113206484_113422418|GENSCAN_predicted_peptide_1|655_aa
MADEAPRKGSFSALVGRTNGLTKPAALAAAPAKPGGAGGSKKLVIKNFRDRPRLPDNYTQ
DTWRKLHEAVRAVQSSTSIRYNLEELYQIMIRSIFLFLDRTYVLQNSTLPSIWDMGLELF
RTHIISDKMVQSKTIDGILLLIERERSGEAVDRSLLRSLLGMLSDLQKSSPRETGKQQPG
KSGGNSGESGSHSEAPQPGCAGLDHLLDENRVPDLAQMYQLFSRVRGGQQALLQHWSEYI
KTFGTAIVINPEKDKDMVQDLLDFKDKVDHVIEVCFQKNERFVNLMKESFETFINKRPNK
PAELIAKHVDSKLRAGNKEATDEELERTLDKIMILFRFIHGKDVFEAFYKKDLAKRLLVG
KSASVDAEKSMLSKLKHECGAAFTSKLEGMFKDMELSKDIMVHFKQHMQNQSDSGPIDLT
VNILTMGYWPTYTPMEVHLTPEMIKLQEVFKAFYLGKHSGRKLQWQTTLGHAVLKAEFKE
GKKEFQVSLFQTLVLLMFNEGDGFSFEEIKMATGIEDSELRRTLQSLACGKARVLIKSPK
GKEVEDGDKFIFNGEFKHKLFRIKINQIQMKETVEEQVSTTERVFQDRQYQIDAAIVRIM
KMRKTLGHNLLVSELYNQLKFPVKPGDLKKRIESLIDRDYMERDKDNPNQYHYVA

>gi568815585f:113206484_113422418|GENSCAN_predicted_CDS_1|1968_bp
atggcggacgaggccccgcggaagggcagcttctcggcgctcgtgggccgcaccaacggc
ctcaccaagcccgcggccctggccgccgcgcccgccaagccggggggcgcgggcggctcc
aagaagctggtcatcaagaacttccgagacagacctcggctgcccgacaactacacgcag
gacacgtggcggaagctgcacgaggcggtgcgggccgtgcagagcagcacctccatcagg
tacaacctcgaggagctctaccagatcatgatcagaagcatcttcctgttcttggaccgc
acctatgtgctgcagaactccacgctgccctccatctgggatatgggattagaactgttt
agaacccatattattagtgataaaatggttcagagtaaaaccattgatggaatcctactg
ctgatcgagcgcgagaggagcggcgaggccgtggaccggagcctgttgcggagcctcctg
ggcatgctgtctgacctgcagaagagcagcccccgggagactgggaagcagcagccaggg
aagtccgggggaaatagtggagagagcggcagccacagcgaagccccacaaccagggtgt
gcagggctcgaccacttactggatgagaacagagtgccggacctcgcacagatgtaccag
ctgttcagccgggtgaggggcgggcagcaggcgctgctgcagcactggagcgagtacatc
aagacttttggaacagcgatcgtaatcaatcctgagaaagacaaagacatggtccaagac
ctgttggacttcaaggacaaggtggaccacgtgatcgaggtctgcttccagaagaatgag
cggttcgtcaacctgatgaaggagtcctttgagacgttcatcaacaagagacccaacaag
cctgcagaactgatcgcaaagcatgtggattcaaagttaagagcaggcaacaaagaagcc
acagacgaggagctggagcggacgttggacaagatcatgatcctgttcaggtttatccac
ggtaaagatgtctttgaagcattttataaaaaagatttggcaaaaagactccttgttggg
aaaagtgcctcagtcgatgctgaaaagtctatgttgtcaaagctcaagcatgagtgcggt
gcagccttcaccagcaagctggaaggcatgttcaaggacatggagctttcgaaggacatc
atggttcatttcaagcagcatatgcagaatcagagtgactcaggccctatagacctcaca
gtgaacatactcacaatgggctactggccaacatacacgcccatggaagtgcacttaacc
ccagaaatgattaaacttcaggaagtatttaaggcattttatcttggaaagcacagtggt
cgaaaacttcagtggcaaactactttgggacatgctgttttaaaagcggagtttaaagaa
gggaagaaggaattccaggtgtccctcttccagacactggtgctcctcatgttcaacgag
ggagatggcttcagctttgaggagataaaaatggccacggggatagaggatagtgaattg
cgcagaacgctgcagtccctggcctgtggcaaagcacgtgtgctgattaaaagtcccaaa
ggaaaggaagtggaagatggagacaagttcatttttaatggagagttcaagcacaagttg
tttagaataaagatcaatcaaattcagatgaaggaaactgttgaggaacaggttagcacc
actgagagagtgtttcaggatagacaatatcagattgatgctgctatcgtcagaataatg
aagatgagaaagactcttggtcataatcttctagtttctgaattatataatcagctgaaa
tttccagtaaagcctggagatttgaaaaagagaattgaatctctgatagacagagactat
atggagagagacaaagacaatccgaatcagtaccactacgtggcctga

>gi568815585f:113206484_113422418|GENSCAN_predicted_peptide_2|249_aa
MPSPESHFNKDTDELKDEERPRDARSWHCLLCTTVTAVSRAYSLLGTRAFLKERLLAPAA
EGATGPKQADHCGRVGEAGVSHGTRLRPCLKKKEDEERHTMKTNAKDKSGYIQIEVDFRT
RNLTRDKSLGSTILQGAQIPLLEDSVERPGCGHSLCLLSLKLQLQLCQQLEPGEERRVRD
SRVHARAEIPNQWNRCGVCPWSHMRTVVILLELLSLRGWEYIDAISIIKATSYEELPRDN
SLQYYLLTW

>gi568815585f:113206484_113422418|GENSCAN_predicted_CDS_2|750_bp
atgcccagcccagagagtcattttaataaagacacagatgagttaaaggatgaagaaaga
ccacgagacgcacgcagctggcattgtcttctctgtacaactgttacagcagtttccaga
gcctactctctcctgggcacaagggcatttcttaaggaaagactccttgcaccagctgca
gaaggggcaacaggtcccaaacaagcagatcactgtgggcgtgttggagaagctggcgtg
agccacggcacccggctgagaccttgtctaaagaagaaagaagatgaagaaagacatacc
atgaagacgaatgcgaaagataagagtggctacattcagatcgaagtagacttcaggaca
aggaatctcaccagagataagagtcttggatcaaccattctccaaggagcccagattcct
ttactggaggacagtgttgagagaccaggatgtggccacagcctgtgcttgctgtcgctg
aagctgcagcttcagctctgtcagcagctggagccaggagaggagaggagggtccgtgac
tcacgtgtgcatgcacgtgcagagatccctaaccaatggaaccgctgcggcgtgtgcccc
tggtctcatatgagaactgtggtgatcctgctggagcttctgtctctgcgtgggtgggaa
tatattgatgctatttcaattataaaagccacgagctacgaggagttgcctcgagacaat
tcccttcagtactacttgttaacttggtag

>gi568815585f:113206484_113422418|GENSCAN_predicted_peptide_3|1055_aa
MAAPGSARRPLLLLLLLLLLGLMHCASAAMFMVKNGNGTACIMANFSAAFSVNYDTKSGP
KNMTFDLPSDATVVLNRSSCGKENTSDPSLVIAFGRGHTLTLNFTRNATRYSVQLMSFVY
NLSDTHLFPNASSKEIKTVESITDIRADIDKKYRCVSGTQVHMNNVTVTLHDATIQAYLS
NSSFSRGAFLCVQILDVGSFFNGEHACDSKWKKARQPSPGHLPLLEAASVEMPVCLGRGL
LEGVSVEMPVCLGRGLLEGTSVEMSVCLGCGLLEGVSVEMPVCLGHGLLEGTSVEMSVCL
GCGLLEGVSVEMPVCLGHGLLEGTSVEMSVCLGRGLLEGVSVEMPVCLGHGLLEGTSVEM
PVCLGRGLLEGVSVEMPVCLGRGLLEGTSVEMPVCLGRGLLEGVSVEMPVCLGRGLLEGT
SVEMPVCLGRGLPEGVSVEMPVCLGRGLLEGVSVEMPVCLGRGLLEGTSAEMSVCLGRGL
PEGVSVEMSVCLGRGLLEGASAEMPVCLGRGLLEGTSVEMPMCLGRGLLEGVSVEMPVCL
GRGLLEGTSVEMPVCLGRGLLEGVSVEMPVCLGRGLLEGTSVEMSVCLGRGLLEGVSVEM
PVCLGRGLLEGTSVEMSVCLGRGLLEGVSVEMPVCLGRGLLEGTSVEMSVCLGRGLLEGV
SVEMPMCLGRGLLEAASVEMPVCLGRGLLEGVSVEMPVCLGHGLLEGTSVEMPGVSVEMP
VCLGRGLLEGTSVEMSVCLGCGLLEGTSVEMSVCLGRGLLEGVSAEMPVCLGRGLPEGVS
VEMPVCLGRGLLEGASAEMPVCLGRGLLEGTSVEMPLLVTTLQKQETRCEQDRPSPTTAP
PAPPSPSPSPVPKSPSVDKYNVSGTNGTCLLASMGLQLNLTYERKDNTTVTRLLNINPNK
TSASGSCGAHLVTLELHSEGTTVLLFQFGMNASSSRFFLQGIQLNTILPDARDPAFKAAN
GSLRALQATVGNSYKCNAEEHVRVTKAFSVNIFKVWVQAFKVEGGQFGSVEECLLDENSM
LIPIAVGGALAGLVLIVLIAYLVGRKRSHAGYQTI

>gi568815585f:113206484_113422418|GENSCAN_predicted_CDS_3|3168_bp
atggcggcccccggcagcgcccggcgacccctgctgctgctactgctgttgctgctgctc
ggcctcatgcattgtgcgtcagcagcaatgtttatggtgaaaaatggcaacgggaccgcg
tgcataatggccaacttctctgctgccttctcagtgaactacgacaccaagagtggccct
aagaacatgacctttgacctgccatcagatgccacagtggtgctcaaccgcagctcctgt
ggaaaagagaacacttctgaccccagtctcgtgattgcttttggaagaggacatacactc
actctcaatttcacgagaaatgcaacacgttacagcgtccagctcatgagttttgtttat
aacttgtcagacacacaccttttccccaatgcgagctccaaagaaatcaagactgtggaa
tctataactgacatcagggcagatatagataaaaaatacagatgtgttagtggcacccag
gtccacatgaacaacgtgaccgtaacgctccatgatgccaccatccaggcgtacctttcc
aacagcagcttcagccggggagccttcctctgtgtccagatcctggacgttggttccttt
tttaatggagaacacgcatgtgattccaaatggaaaaaggcccgacagcccagccctggc
cacttgcccctcctggaggcagccagtgtggagatgccagtgtgcctggggcgtggcctc
ctagagggagtcagtgtggagatgccggtgtgcctggggcgtggcctcctggagggaacc
agtgtggagatgtcggtgtgcctggggtgtggcctcctggagggagtcagtgtggagatg
cccgtgtgcctggggcatggcctcctggagggaaccagtgtggagatgtcagtgtgcctg
gggtgtggcctcctggagggagtcagtgtggagatgcccgtgtgcctggggcatggcctc
ctggagggaaccagtgtggagatgtcagtgtgcctggggcgtggcctcctggagggagtc
agtgtggagatgcccgtgtgcctggggcatggcctcctggagggaaccagcgtggagatg
ccggtgtgcctggggcgtggcctcctagagggagtcagtgtggagatgcccgtgtgcctg
gggcgtggcctcctggagggaaccagtgtggagatgcccgtgtgcctggggcgtggcctc
ctagagggagtcagtgtggagatgcccgtgtgcctggggcgtggcctcctggagggaacc
agtgtggagatgcccgtgtgcctggggcgtggcctcccggagggagtcagtgtggagatg
cccgtgtgcctggggcgtggcctactagagggagtcagtgtggagatgcccgtgtgcctg
gggcgtggcctcctggagggaaccagtgcggagatgtcggtgtgcctggggcgtggcctc
ccggagggagtcagtgtggagatgtcggtgtgcctggggcgtggcctcctggagggagcc
agtgcggagatgcccgtgtgcctggggcggggcctcctggagggaaccagcgtggagatg
cccatgtgcctggggcgtggcctcctagagggagtcagtgtggagatgcccgtgtgcctg
gggcgtggcctcctggagggaaccagtgtggagatgccagtgtgcctggggcgtggcctc
ctagagggagtcagtgtggagatgcccgtgtgcctggggcgtggcctcctggagggaacc
agtgtggagatgtcggtgtgcctggggcgtggcctcctagagggagtcagtgtggagatg
cccgtgtgcctggggcgtggcctcctggagggaaccagtgtggagatgtcggtgtgcctg
gggcgtggcctcctagagggagtcagtgtggagatgcccgtgtgcctggggcgtggcctc
ctggagggaaccagtgtggagatgtcggtgtgcctggggcgtggcctcctagagggagtc
agtgtggagatgcccatgtgcctggggcgtggcctcctggaggcagccagtgtggagatg
ccagtgtgcctggggcgtggcctcctagagggagtcagtgtggagatgcccgtgtgcctg
gggcatggcctcctggagggaaccagtgtggagatgccaggagtcagtgtggagatgccc
gtgtgcctggggcgtggcctcctggagggaaccagtgtggagatgtcggtgtgtctgggg
tgtggcctcctagagggaaccagtgtggagatgtcggtgtgcctggggcgtggcctcctg
gagggagtcagtgcggagatgcccgtgtgcctggggcgtggcctcccggagggagtcagt
gtggagatgcccgtgtgcctggggcgtggcctcctggagggagccagtgcggagatgcca
gtgtgcctggggcggggcctcctggagggaaccagcgtggagatgccgcttcttgtgacc
actctacagaaacaggagacacgctgtgaacaagacaggccttccccaaccacagcgccc
cctgcgccacccagcccctcgccctcacccgtgcccaagagcccctctgtggacaagtac
aacgtgagcggcaccaacgggacctgcctgctggccagcatggggctgcagctgaacctc
acctatgagaggaaggacaacacgacggtgacaaggcttctcaacatcaaccccaacaag
acctcggccagcgggagctgcggcgcccacctggtgactctggagctgcacagcgagggc
accaccgtcctgctcttccagttcgggatgaatgcaagttctagccggtttttcctacaa
ggaatccagttgaatacaattcttcctgacgccagagaccctgcctttaaagctgccaac
ggctccctgcgagcgctgcaggccacagtcggcaattcctacaagtgcaacgcggaggag
cacgtccgtgtcacgaaggcgttttcagtcaatatattcaaagtgtgggtccaggctttc
aaggtggaaggtggccagtttggctctgtggaggagtgtctgctggacgagaacagcatg
ctgatccccatcgctgtgggtggtgccctggcggggctggtcctcatcgtcctcatcgcc
tacctcgtcggcaggaagaggagtcacgcaggctaccagactatctag

>gi568815585f:113206484_113422418|GENSCAN_predicted_peptide_4|2714_aa
MSRARLHSNRVDSVSIRKSNPSVRIRKLLGPLKSRKSSKTCSDVMSVDAQTLKKKMGKLA
CDPAAHSILSSLLLYVTGRADRPPGTEAGGSRPSHQPQTQEATQRPTRFQLLQAKFLGTG
RERYLKRTREVGRLISKDKQGPGGGLVGATINKLLEKTKEPAPKPCLSEKPRWGHPAGKS
TVKNILKIFLAAEEKEAKEKEAREKPPVERPKAARGLLPKIMGKSSVLSKLREKFEQNSC
LCSEASALRLHTQERKKRNLQRKRMHRPEVRVLHTATMASTCVKMPPARFLACTAEPLPA
LSIATVVCGPRSWLSHCTKISHSEARRPPRGEASVPPSARETGPSGNKAVGKGPLEEEPQ
RQPRPSKPVTPQVMAQRDGHAVPSLAFSCAPCTGGVLPGLVPASSPLGPASPWGTGSAGG
DGTADPTAESTAGGVQEVRGARLTWPPGPPGECAGEGPEITMTVCSSEDEREGAGFPDPG
RDPLFATQKYFPEQKVPEHIPPLNAPSVQAARRTQPATEPPRITVQIPVVHEMPAPPTRL
QNMSSGENKPCICGGENVVENAHTEFPTVTENRRGHRAPVELSKLSGMQAGLSASSPQGP
RAAPRLAAARGAVGADHSVPETLLTQRLQTDPAGGKENKGSFENSHGPRNPDDISGERTS
ELRDVKHPLPESNEISMQKKGSATNDPAASQNLLRGNTSHASSSQQVPSPTGRNPTGAPT
SLAASSKGRTGPEGVTPMGMTVPGALEECRRPLLIESSQPLKAAEEITSHDVRENPLSSL
NEPPKPGMKACGAMAAAGSVASRATPAPAPGSTQSPGDRTAGEPETLGQWGSRALSESHP
RGEALPRDPHSHGLLAPGGSLEPKSGAAGRSLLRGVALVQHPEDIATLARHPEDAAALAR
HPEAARLYISNTSAASRHTAAVGGRKDVAVEGNLLGFSTESGIPASDHPRPQARSVAESP
SYGPGLPPSPPENPQAKGREGVRFPRGAEPDHLLPAVPPAEVDMGWVGGTHQRGPPHLQA
HLPPTAGDTQAKLRASVPEPRTQAGESQERPLTQADLGRQQSHQAQEETPQPGDAGKRVA
PSGSKVVLNPAKEPQTWWAQDLAGDKGMAIGVGGACQRSDQGQQHLQGPWEERGRSTAWG
EGTRAARNPAVPPGEPEGPGSPAAQGQAQKQVQEWDRGQVQGHAQEQAQWQTQIEAQGQA
QEQAQGGTQGHSQGQAQKQFQNWAQGQAQGHAQEQAQWQTQIEAQGQAQEPAQGGAQGQV
QGQAQKWAQGQIQGQAQKQVQGEVQKWAQEEAQGQAQWQTQIKAQKWAQEQTQKGAQERV
QGQAQKGAQERAQEQAQEQTQIEAQGQAQKGAQERAREQAQKGAQERAREQAQKGAQERA
REQAQKGAQERAREQAQKGAQERAREQAQKGAQERAREQAQKGAQERAQEQGREQTHIEA
QGQAQKGAQEWARDRARDQGWEQTQIETQRQTQKGAQERAWEQGREQALTSGMAPRAWEQ
PISGIAEGVDAAGRSGGSRSPAPRDGGQSGGSGLGEPSAGYPPPGSRPLRGKSIATSPLG
LGKSPTEPKPEAGGCGTPQAPAQEGSPDHPGAERALQDRMEASEPERRGRSRHLAKYKAQ
SFRDQRAFDLSFRPMSVRASDTSELPNIEVFPALARDGPRLRPCSGHSVASETRTREATR
LPEWPRSSWIFSVNRCSLSPYSPAPLFLVVPSLWKVNSCFLKLPRQGPRGLLYKSGVRAP
SPLRGVSAWRVGSKCELLCVEAPEHRVPLARVGPGAAQLHTGWSGGTAARPSLFTYLELE
AMSGKNMMTDTGGTGLGLHRPAPAYPAPASPDASCLSPPGGAAGLPGGGCCKRKPRPASC
CHIRTPPGAAGSRREPGTQAYVRGPRRWPPGFWSPSLVRSGDGDLLVDSGRGSLGIRTWK
PERLQGFQPWPCHRRGLLGQNPYKRGPEVTILKAACGQLTGFSLQDSLELRGVWHMLWLH
EDAHVMVHHCSSPYIWPKNTLGGSPAITLRLGSWRPLVRAACHAAFHGSGGKGRAKFKIC
SSGKFPGPPGHEKQLKSQDNVIGCGPRRSRSEPELPRILPGLARRPAPLRSPPRPHAPAV
PPEGLKRPGAGRRTWDPLASLGARLPPRMQPAERSRVPRIDPYGFERPEDFDDAAYEKFF
SSYLVTLTRRAIKWSRLLQGGGVPRSRTVKRYVRKGVPLEHRARVWMVLSGAQAQMDQNP
GYYHQLLQGERNPRLEDAIRTDLNRTFPDNVKFRKTTDPCLQRTLYNVLLAYGHHNQGVG
YCQLWKLCIRGRGICGAEVLLGLFSATDVFPGLLSHRGPPSLAVVLGQGGCVLILQGRGD
VRGSRASPQCLEPGGVWLQLLPGNLQNRDRNGQQTCEPMAELPGEHEGGNGKTGVGVEPS
AATEASRPVTQNESSLPWGSDHGVRVRLGIENVMHIVVFFLGGGRDCLPAMVSAGSSGES
WPGAESKLAFPSARHTEGTLLSEEDLGLNLTPRGKLWRNVCHWAFVSVAPEACSWVDSSG
CVCGNCPYGVLEDRRVGSGNWEVVSVLTSVEVAILLFGEARPGVFDYYSPAMLGLKTDQE
VLGELVRAKLPAVGALMERLGVLWTLLVSRWFICLFVDILPVETVLRIWDCLFNEGSKII
FRVALTLIKQHQELILEATSVPDICDKFKQITKGSFVMECHTFMQKIFSEPGSLSMATVA
KLRESCRARLLAQG

>gi568815585f:113206484_113422418|GENSCAN_predicted_CDS_4|8145_bp
atgagcagggcgcggctgcattcaaacagggtggactcggtctccatccgcaaatcaaac
ccctccgtgagaatcagaaaacttttggggcccctgaaaagccggaagagcagcaagacc
tgcagtgacgtcatgtctgtggacgcccaaaccctgaagaagaagatgggcaagctggcc
tgtgacccggccgcccactccatcctcagcagcctgctgctctacgtcacgggccgcgca
gaccggcccccggggacagaggccggggggagcaggcccagccaccagccccagacccag
gaggccacccagcggcccacgcgcttccagctcctgcaggccaagttcctgggcactggc
cgggagcgctacctcaagaggaccagggaggtgggccggctgatctccaaggacaagcag
gggccgggtgggggcctcgtgggtgccaccatcaacaagctcctggagaagaccaaggag
ccggccccaaagccctgcctcagcgagaagccccgctggggccacccggccgggaagagc
accgtgaaaaacatcctgaagatattcttggccgccgaggagaaggaggcgaaggagaaa
gaagcacgcgagaagccccctgtggagcggcccaaagccgccaggggcctcttgccgaag
atcatgggcaagagctcggtgctgtccaagctgcgggagaagttcgagcagaacagctgc
ctgtgctccgaggccagtgcgctgaggctgcacacgcaggagcggaagaagaggaacctg
cagaggaagaggatgcaccggccggaggtgcgcgtgctgcacacggccaccatggccagc
acctgcgtcaagatgccccctgcccgctttctggcctgcacggctgagcccctgccggcc
ctcagcatcgccaccgtcgtctgtggccccaggagctggctgtcccactgcaccaaaatc
agccactcggaggcaaggcgtccacccagaggagaagccagcgtgccccctagtgccagg
gagacggggcccagtgggaacaaagcagtgggaaaggggcccctggaggaggagccccaa
agacagccaaggccctcgaagcccgtgacaccccaggtgatggctcagagggacggccac
gctgtcccctcgctggccttctcttgtgctccctgcacaggtggagtccttcctggcctt
gtgcctgcatcatccccactgggaccggccagcccttggggcaccggatcagccggaggt
gatgggactgcagaccccaccgcagagagcaccgcaggaggcgtccaggaggtgagggga
gccaggctcacgtggcctcctgggcccccaggcgagtgtgcaggggagggccctgaaatc
accatgactgtttgcagttcagaggatgaaagggaaggagcaggcttcccagacccaggg
agagaccccctctttgccacccagaagtatttcccagaacagaaggtgccggagcacatc
ccacccctgaacgctccatcggtccaggctgctcggagaacacagcctgccacggagcct
ccacggataactgtccaaattccagttgtccatgaaatgccagcccctcccaccaggctg
caaaatatgtcaagtggtgaaaacaaaccttgtatttgtggaggagagaatgtggttgaa
aatgcccacacagaatttcccaccgtgactgagaacagaaggggtcaccgagccccagtg
gaactgagcaagctttcagggatgcaggcgggactcagcgcctcttcgccgcaggggcca
cgggcagctccgcgtctggcagcagcaagaggggctgtgggcgccgaccacagtgtccca
gagacacttctgacacagagactccagacagatcctgctggcgggaaggaaaacaaaggc
agctttgagaattcacacggtcccaggaatcctgacgatatttcaggagagaggacctct
gagctcagagatgtgaagcatccgttgccggaatctaatgaaatatcaatgcagaagaag
ggctccgcaacaaatgatcctgcagcctcacaaaaccttctgaggggaaacaccagccat
gcctccagcagccagcaagtgccgtccccgactgggagaaacccaacaggagcccccacg
tcactggcggcatcatccaaggggcgtacggggcctgagggtgtcaccccgatgggcatg
acagtgcccggtgcgctggaggaatgcagaaggccgctgctaatagagtcttctcagccc
ctgaaggctgccgaggaaatcaccagccatgatgttcgcgagaacccactgtcctcatta
aatgaaccacccaaaccaggtatgaaagcctgtggggcgatggcagctgcagggagcgtt
gcgagccgcgccacgcctgcaccagcgccaggcagcacccagagccctggagaccgcaca
gcgggagagccagagacgctgggccagtggggaagcagagccttgtccgaaagccacccc
agaggagaggctctccctcgagaccctcacagccacggcctcctggcccctgggggatcc
ttggagcccaagagtggagcagcagggaggagcctcctgagaggcgtggccctcgtccag
cacccagaggacattgcgaccctcgcccgccacccggaggatgctgcggccctcgcccgg
cacccagaggcggcccgattatacatttcaaacacatcagcagccagcagacatacagca
gctgtaggaggcagaaaggatgtggctgtggaaggaaaccttttgggtttcagcacagag
tctggaattcctgcttctgatcatcctaggccacaggcaaggagcgtggcagagtccccc
agctatggccctggcctcccgccgtcgcctcctgagaacccacaggcaaagggcagggag
ggcgtcaggtttccccgtggggcggagcctgaccatctgcttcccgcagtgcctcccgcg
gaggtggacatggggtgggtaggtggcacccaccagcggggccctccccatctacaggca
cacctgccccctactgctggtgacacacaggcaaagctccgggccagtgtccccgagcct
aggacgcaggcaggtgaatcccaggagcgtcccctgacacaggcggacctggggaggcaa
cagagtcaccaggcacaggaggagaccccccagcccggggatgcggggaagagggtggcg
ccgtcgggctcgaaggttgtgctgaacccagcaaaagagcctcagacatggtgggcacag
gatctggccggggacaaagggatggccattggggttgggggcgcctgccagcgcagtgac
caaggtcagcagcatctgcagggaccctgggaggagcgggggcggagcacggcgtgggga
gagggcaccagggctgccaggaacccagctgtgcctccaggggagcccgaggggccggga
agcccggcagcccaaggacaggcccagaaacaggttcaggaatgggaccggggacaggtt
cagggacacgctcaagaacaggcccagtggcagacccagatagaggcccaggggcaggca
caggaacaggctcagggtgggacccagggacacagtcagggacaggcccagaaacagttt
cagaattgggcccagggacaggctcagggacacgctcaagaacaggcccagtggcagacc
cagatagaggcccaggggcaggcacaggaaccagctcagggtggggcccagggacaggtt
cagggacaggcccagaaatgggctcaggggcagattcagggacaggcccagaaacaggtt
caaggagaggttcagaaatgggcccaggaagaggctcagggacaggcccagtggcagacc
cagataaaggcccagaaatgggctcaggaacagacccagaaaggggctcaggaacgggtt
cagggtcaggcccagaaaggggctcaggaacgggcccaggaacaggctcaggagcagacc
cagatagaggctcagggtcaggcccagaagggggctcaggaacgggctcgggaacaggcc
cagaaaggggctcaggaacgggctcgggaacaggcccagaaaggggctcaggaacgggct
cgggaacaggcccagaaaggggctcaggagcgggctcgggaacaggcccagaaaggggct
caggaacgggctcgggaacaggcccagaaaggggctcaggaacgggctcgggaacaggcc
cagaaaggggctcaggaacgggctcaggaacagggtcgggagcagacccacatagaggct
cagggtcaggcccagaaaggggctcaggaatgggctcgggatcgggctcgggatcagggc
tgggagcagacccagatagagactcagaggcagacccagaaaggggctcaggaacgggct
tgggaacagggtcgggaacaggccttgacaagtgggatggcgcctcgggcctgggagcag
cccattagtggcatagctgagggagtggatgctgctggcaggagtggggggtccagaagc
ccagcccccagggatggtggacagtcagggggcagtggcctgggggagcccagcgccggg
tacccacccccaggaagccgccccctcaggggcaagagcattgccacctctcccctgggg
ctgggaaagagcccaactgagcccaagcctgaggctgggggctgcgggactccccaggcc
ccggcccaggaggggtccccagaccaccctggggctgagagggccctgcaggacaggatg
gaggcatcggagcccgagcgtcgcgggaggtccaggcacctggccaagtacaaagcccag
agcttccgtgaccagagggccttcgatttgtccttcagaccaatgagcgtcagggccagc
gacacgtctgagctcccaaacatagaagtgtttcctgccctggcccgagacggcccacga
ctccggccctgttcaggccactcggtggcctctgaaacccggacccgtgaggccacacgg
ctgcctgagtggccccggagcagctggattttcagcgtgaaccgatgcagcctgtcgccc
tacagccccgcacccttgtttttggtggtgccctccctgtggaaagttaattcctgcttt
ttgaagctcccccgacagggcccccggggcctcctctataaatcaggggtccgtgctccc
agcccgcttcgaggggtttctgcttggcgtgtgggttctaagtgcgagctgctgtgtgtg
gaggctcctgagcaccgggtgcccctcgcgcgtgtgggtcccggggcagcccagctgcac
acaggatggtccgggggcacagcggcacggccctcactctttacctatttggaattggag
gcaatgagtggcaagaacatgatgacagacacgggagggacaggcctcgggctgcaccgt
cccgcgcccgcctaccccgcgcccgcctcgccggacgcctcctgcctgagtccgccaggg
ggcgctgccggcctcccgggcggcggctgctgcaagaggaagccccggcccgcgagctgc
tgccacatccggacgccgccgggagccgccgggagccgccgggagccggggacacaggcg
tacgtgcggggacctcggcgctggcctccgggcttctggagcccctccctggtgcggagc
ggggacggtgacctcctcgtggattctgggaggggcagccttggcatcaggacctggaag
ccggagaggctgcagggcttccagccatggccttgccaccggagaggccttctcggacag
aacccctacaagagaggccctgaggtgaccatccttaaggcagcatgtggtcaattgact
ggcttcagcctccaggacagcctggagctccgtggggtttggcacatgctgtggctccac
gaagatgcccacgtcatggttcatcactgctcgtctccctacatctggcccaaaaataca
cttggtggcagtcccgccatcacactcaggctcgggagctggcggcccctggtccgtgct
gcctgccacgctgctttccacggcagtggtgggaaagggagagcaaaattcaaaatctgc
agttctggcaaatttccggggcctcccggacacgaaaagcagctgaaatctcaggacaat
gtcattggatgcggaccgagaagaagccgttcggaaccagagctgccccggattctccca
ggcctggctcgccgccccgcccctctccgctccccgccccgcccccacgcgcccgcggtc
ccgccggaaggacttaagcgccccggagccgggaggcgaacttgggacccgctggcctcg
ctcggtgcgcgcctccctccccgcatgcagcccgccgagcgctcgcgggtccccaggatc
gacccgtacggattcgagcggcctgaggacttcgacgacgccgcctacgagaagtttttc
tccagctacctggtcacgctcacccgcagggcgatcaaatggtcccggctgctgcagggc
gggggcgtccccaggagccggacagtgaagcgctatgtccggaaaggggtcccgctggag
caccgtgcccgcgtctggatggtgctgagtggggcccaggcgcagatggaccagaatccc
ggctactaccaccagcttctccagggagagagaaaccccaggctggaggacgccatcagg
acagacctgaaccggaccttccccgacaacgtgaagttccggaagaccacggacccctgc
ttacagaggaccctgtacaatgtgctgctggcatatgggcaccataaccagggagtgggc
tactgccagctctggaagctgtgcatcagaggtcgcggcatctgcggggcagaggtcctc
ctgggtctgttctcggccacagatgtcttcccaggtctgctcagccacagaggtcctccc
agcctggctgtggtgctgggtcaagggggctgcgtgctcatcctgcagggcagaggagac
gtgaggggcagccgggccagtccccagtgtttggagcctgggggtgtgtggctccagctc
ctgcctggcaatttacaaaatagagaccggaatggccagcagacctgtgaacccatggct
gagctccctggtgagcatgaaggtggaaatggaaagactggggttggagtggagccttct
gcagccaccgaggcctccaggcccgtgacacaaaacgaaagctccctgccgtggggcagt
gaccacggggtgcgagtccgtctgggtattgaaaatgtcatgcacattgtcgtctttttc
cttggaggaggccgagactgtctcccagcgatggtgtctgcaggcagcagtggagaatcc
tggcctggtgctgagtccaagttggcctttcctagtgctcggcacaccgagggcactttg
ctttcagaggaggacttgggactgaatcttacacctagagggaaactctggcgtaatgtg
tgtcactgggcctttgtcagtgtggctcctgaagcttgttcttgggtggattcctcggga
tgtgtttgtgggaactgtccttacggagtgcttgaagaccgtcgggtgggatctgggaac
tgggaggtggtgtcggtgttgactagtgtcgaggtggccatactgttgtttggggaagca
cgccctggagtatttgattactacagcccggccatgctgggcctgaagaccgaccaggag
gtcctcggggagctggtgcgggcgaagctgccggctgtgggggccctgatggagcgtctc
ggtgtgctgtggacgctgctggtgtcccgctggttcatctgcctgtttgtggacatcttg
cccgtggagacagtgcttcggatctgggactgtttgtttaacgaaggctcgaagattatc
ttccgggtggccctgaccttaattaagcagcaccaggagttgattttggaagccaccagc
gttccagacatttgcgataagtttaagcagataaccaaagggagtttcgtgatggagtgt
cacacgtttatgcagaaaatattttcagaacctggaagcttatccatggccaccgtcgcc
aagctccgcgagagctgcagggcccggctgctggcacaggggtga