GENSCAN 1.0    Date run: 5-Nov-116    Time: 14:00:59

Sequence gi568815586f:53195568_53399342 : 203775 bp : 50.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.21 Intr -    154     59   96  1  0  114   94    69 0.959  10.31
 1.20 Intr -    632    474  159  2  0   37   59   148 0.990   6.98
 1.19 Intr -   1253   1012  242  0  2  117   94   238 0.985  24.57
 1.18 Intr -   2096   1926  171  0  0   84   85   301 0.999  29.31
 1.17 Intr -   2384   2183  202  2  1   56  105   267 0.945  24.06
 1.16 Intr -   4879   4676  204  1  0   95   63    71 0.130   4.70
 1.15 Intr -   5690   5666   25  1  1   97   55    13 0.019  -3.07
 1.14 Intr -  11690  11635   56  2  2   94   76    65 0.058   3.68
 1.13 Intr -  16296  16154  143  1  2  129    6   170 0.067  12.97
 1.12 Intr -  17676  17518  159  1  0  106   96   254 0.999  27.96
 1.11 Intr -  18133  17929  205  1  1   97   64   359 0.999  33.17
 1.10 Intr -  18668  18492  177  2  0  119  119   215 0.999  27.82
 1.09 Intr -  19039  18879  161  2  2   46   89   219 0.824  17.51
 1.08 Intr -  19867  19726  142  1  1   96  102   205 0.999  22.63
 1.07 Intr -  20227  20079  149  2  2  123   93   160 0.989  19.95
 1.06 Intr -  24655  24383  273  2  0   31   51   138 0.245   1.91
 1.05 Intr -  32008  31795  214  1  1   70   56   148 0.785   8.09
 1.04 Intr -  32120  32083   38  0  2  151   63    -6 0.890   1.18
 1.03 Intr -  35668  35602   67  1  1   83   94    50 0.625   3.58
 1.02 Intr -  36513  36407  107  1  2   48  109    62 0.885   4.23
 1.01 Init -  40830  40701  130  0  1   54   71    26 0.292  -2.19
 1.00 Prom -  48174  48135   40                              -5.16

 2.00 Prom +  50010  50049   40                              -8.46
 2.01 Init +  57269  58555 1287  1  0   96   79   991 0.001  89.05
 2.02 Intr +  73188  73280   93  2  0   85   81    73 0.202   6.46
 2.03 Intr +  73457  74518 1062  1  0   84   59   625 0.874  49.71
 2.04 Intr +  74811  74915  105  2  0  115   84    72 0.994   9.91
 2.05 Intr +  75111  75231  121  2  1  145   65   137 0.999  17.17
 2.06 Intr +  77154  77290  137  1  2  118   82   124 0.989  15.09
 2.07 Intr +  79250  79443  194  1  2   63   87   109 0.951   6.59
 2.08 Intr +  81053  81292  240  2  0   70   95   355 0.995  31.06
 2.09 Intr +  81516  81660  145  0  1   86   81   170 0.844  16.38
 2.10 Intr +  81903  82041  139  2  1   89   81   219 0.997  21.34
 2.11 Intr +  82254  82393  140  1  2   55   90   104 0.991   7.48
 2.12 Intr +  84165  84299  135  2  0   27  109   130 0.944   9.76
 2.13 Intr +  85940  86059  120  1  0   87   63    97 0.995   7.79
 2.14 Intr +  86697  86868  172  2  1  148   62    97 0.999  12.62
 2.15 Intr +  87562  87690  129  2  0   81  101   127 0.999  13.97
 2.16 Intr +  87815  87971  157  0  1   84  100    93 0.999   9.17
 2.17 Intr +  88491  88600  110  0  2   68   99   173 0.999  16.33
 2.18 Intr +  90357  91345  989  1  2  105  100   645 0.975  57.99
 2.19 Intr +  92405  92774  370  1  1   83   91   216 0.999  16.08
 2.20 Intr +  92971  93132  162  2  0   84   28    91 0.569   2.65
 2.21 Intr +  93523  93736  214  2  1   77   94   174 0.999  14.67
 2.22 Intr +  93837  94027  191  0  2  113   56   142 0.997  12.73
 2.23 Intr +  94518  94645  128  1  2   64   97    82 0.915   7.10
 2.24 Intr +  94780  94902  123  0  0   11   85   148 0.903   7.58
 2.25 Intr +  95274  95429  156  2  0   90   87   131 0.999  13.41
 2.26 Intr +  96123  96293  171  2  0   90  116   141 0.879  17.24
 2.27 Intr +  96417  96521  105  2  0   61   59    57 0.584   0.51
 2.28 Intr +  96711  96826  116  2  2   38   71    87 0.936   1.25
 2.29 Intr +  97239  97403  165  0  0   81   97   201 0.937  19.38
 2.30 Term +  97706  97907  202  2  1  103   40   143 0.995   7.86
 2.31 PlyA +  98051  98056    6                               1.05

 3.00 Prom +  98061  98100   40                             -11.63
 3.01 Init + 100001 100072   72  1  0  115   75   109 0.972  13.37
 3.02 Intr + 100272 100381  110  2  2  103   49   176 0.677  14.28
 3.03 Intr + 100445 100586  142  2  1   51   15    81 0.629  -2.54
 3.04 Intr + 102283 102357   75  0  0   94  113   123 0.645  15.11
 3.05 Intr + 102478 102583  106  0  1   68  110   116 0.999  11.59
 3.06 Intr + 104073 104386  314  1  2   62   63   212 0.423  12.00
 3.07 Intr + 104583 104695  113  2  2  111   92   260 0.999  27.88
 3.08 Intr + 107467 107626  160  1  1  120   81   201 0.986  22.59
 3.09 Intr + 110341 110493  153  0  0   34   94   215 0.999  16.97
 3.10 Intr + 110631 110753  123  2  0   62   80    61 0.916   3.48
 3.11 Intr + 111113 111294  182  1  2  126   93    71 0.911  10.07
 3.12 Term + 111399 111582  184  0  1  110   34   154 0.992   9.22
 3.13 PlyA + 111586 111591    6                               1.05

 4.17 PlyA - 111920 111915    6                               1.05
 4.16 Term - 112146 111922  225  0  0  153   48    25 0.900   1.78
 4.15 Intr - 112362 112278   85  2  1   78   89    31 0.975   2.02
 4.14 Intr - 112566 112485   82  1  1   80  119    36 0.924   4.60
 4.13 Intr - 112782 112715   68  2  2   98  100     9 0.788   1.65
 4.12 Intr - 112961 112868   94  0  1  110   83    18 0.728   2.52
 4.11 Intr - 113248 113158   91  1  1   72   89    28 0.810   0.87
 4.10 Intr - 113453 113393   61  1  1   76  110    -7 0.851  -0.96
 4.09 Intr - 113714 113590  125  2  2   95  109    58 0.990   7.98
 4.08 Intr - 114154 114034  121  0  1  129  100    17 0.991   7.50
 4.07 Intr - 118874 118731  144  1  0  114  131    91 0.998  15.40
 4.06 Intr - 119282 119184   99  1  0   65   89    51 0.781   2.13
 4.05 Intr - 119573 119527   47  2  2   96   97    37 0.965   2.61
 4.04 Intr - 119859 119768   92  1  2   66   84    18 0.963  -1.09
 4.03 Intr - 120215 120160   56  1  2  101  110    60 0.997   8.12
 4.02 Intr - 125125 124998  128  1  2   64   73    79 0.952   3.48
 4.01 Init - 125898 125776  123  0  0   75   37    93 0.344   3.39
 4.00 Prom - 129631 129592   40                             -10.35

 5.03 PlyA - 131027 131022    6                               1.05
 5.02 Term - 133853 132579 1275  2  0  116   52   906 0.380  81.39
 5.01 Init - 145022 144879  144  2  0   74   81    70 0.114   3.02
 5.00 Prom - 160655 160616   40                              -2.26

 6.00 Prom + 179358 179397   40                              -5.86
 6.01 Init + 184922 185105  184  1  1   65   26   163 0.714   5.08
 6.02 Intr + 186092 186246  155  0  2   58  111   151 0.963  14.19
 6.03 Intr + 186543 188055 1513  2  1   92  110   893 0.537  81.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  16296  16109  188  1  2  129   47   209 0.924  18.55
S.002 Sngl +  57269  58621 1353  1  0   96   40  1075 0.998  97.65

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_1|1040_aa
MEILCKTDSYIHPHTGWQTPVALASHTLQATVEADVLAPGSGTGAPQYGEPRTLRRSIQE
TARRRDLGAPPPPFPLPLQQLRPSSLNLTQYVEASLCRRPAGLLEAQWAGQAGRTPGDSH
TAAAMATNKERLFAAGALGPGSGYPGAGFPFAFPGALRGSPPFEMLSPSFRGLGQPDLPK
EMASLSPYASPPPPPLLERGAAGGGGGIGCGSLVFPAPSFPSSRVAMYDCMETFAPGPRR
LYGAAGPGAGLLRRATGGSCFAGLESFAWPQPASLQSVETQSTSSEEMVPSSPSPPPPPR
VYKPCFVCNDKSSGYHYGVSSCEGCKGFFRRSIQKNMVYTCHRDKNCIINKVTRNRCQYC
RLQKCFEVGMSKEAVRNDRNKKKKEVKEEGSPDSYELSPQLEELITKVSKAHQETFPSLC
QLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKIVEFAKRLPGFTGLSIADQITLLKA
ACLDILMLRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGPLTDLVFAFAGQLLPLEMDD
TETGLLSAICLICGDRMDLEEPEKVDKLQEPLLEALRLYARRRRPSQPYMFPRMLMKITD
LRGISTKGAERAITLKMEIPGPMPPLIREMLENPEMFEDDSSQPGPHPNASSEDEISTQR
LLLPPEEGLLCTHLIQDCKSRQGMVALPMVLVLLLVLSRGESELDAKIPSTGDATEWRNP
HLSMLGSCQPAPSCQKCILSHPSCAWCKQLNFTASGEAEARRCARREELLARGCPLEELE
EPRGQQEVLQDQPLSQGARGEGATQLAPQRVRVTLRPGEPQQLQVRFLRAEGYPVDLYYL
MDLSYSMKDDLERVRQLGHALLVRLQEVTHSVRIGFGSFVDKTVLPFVSTVPSKLRHPCP
TRLERCQSPFSFHHVLSLTGDAQAFEREVGRQSVSGNLDSPEGGFDAILQAALCQEQIGW
RNVSRLLVFTSDDTFHTAGDGKLGGIFMPSDGHCHLDSNGLYSRSTEFDYPSVGQVAQAL
SAANIQPIFAVTSAALPVYQ

>gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_1|3120_bp
atggagattctgtgcaaaacagacagctacatccacccacacactggctggcagacacct
gtagcactcgcctcacacacactccaggcaactgtggaggcagacgtgctagctccaggg
agtgggacaggagccccccagtacggcgagccccggacattgcgacgctccatccaagag
actgcccgacgccgggacctcggggctccgccgcctcccttccccctcccactccagcag
ctacggcccagttccctcaacctgacccagtatgtagaagccagtctctgcaggcggcca
gcgggacttttggaggcccagtgggcaggccaggcagggcggaccccaggggactctcac
accgcagctgccatggccaccaataaggagcgactctttgcggctggtgccctggggcct
ggatctggctacccaggggcaggtttccccttcgccttcccaggggcactcagggggtct
ccgcctttcgagatgctgagccctagcttccggggcctgggccagcctgacctccccaag
gagatggcctctctgtcgccctatgctagccctccccctccccccctgctggagcggggc gccgccgggggaggagggggaatcggctgcgggtccttggtgtttccagcacccagtttc ccttcaagccgggtcgcgatgtacgactgtatggaaacgtttgccccgggtccgcgacgg ctgtacggggcggccgggcccggggccggcttgctgcgcagagccaccggcggctcctgt ttcgccggacttgaatcttttgcctggccgcaacccgccagcctgcaatcggtggagaca cagagcaccagctcagaggagatggtgcccagctcgccctcgccccctccgcctcctcgg gtctacaagccatgcttcgtgtgcaatgacaagtcctctggctaccactatggggtcagc tcttgtgaaggctgcaagggcttctttcgccgaagcatccagaagaacatggtgtacacg tgtcaccgcgacaaaaactgtatcatcaacaaggtgaccaggaatcgctgccagtactgc cggctacagaagtgcttcgaagtgggcatgtccaaggaagctgtgcgaaatgaccggaac aagaagaagaaagaggtgaaggaagaagggtcacctgacagctatgagctgagccctcag ttagaagagctcatcaccaaggtcagcaaagcccatcaggagactttcccctcgctctgc cagctgggcaagtataccacgaactccagtgcagaccaccgcgtgcagctggatctgggg ctgtgggacaagttcagtgagctggctaccaagtgcatcatcaagatcgtggagtttgcc aagcggttgcctggctttacagggctcagcattgctgaccagatcactctgctcaaagct gcctgcctagatatcctgatgctgcgtatctgcacaaggtacaccccagagcaggacacc atgaccttctccgacgggctgaccctgaaccggacccagatgcacaatgccggcttcggg cccctcacagaccttgtctttgcctttgctgggcagctcctgcccctggagatggatgac accgagacagggctgctcagcgccatctgcctcatctgcggagaccgcatggacctggag gagcccgaaaaagtggacaagctgcaggagccactgctggaagccctgaggctgtacgcc cggcgccggcggcccagccagccctacatgttcccaaggatgctaatgaaaatcaccgac ctccggggcatcagcactaagggagctgaaagggccattactctgaagatggagattcca ggcccgatgcctcccttaatccgagagatgctggagaaccctgaaatgtttgaggatgac tcctcgcagcctggtccccaccccaatgcctctagcgaggatgagatcagtacacaaagg ctgctgctgccgccagaggaaggactgctctgcacgcacctaatccaagattgtaaaagc cgccaaggcatggtggctttgccaatggtccttgttttgctgctggtcctgagcagaggt gagagtgaattggacgccaagatcccatccacaggggatgccacagaatggcggaatcct cacctgtccatgctggggtcctgccagccagccccctcctgccagaagtgcatcctctca caccccagctgtgcatggtgcaagcaactgaacttcaccgcgtcgggagaggcggaggcg cggcgctgcgcccgacgagaggagctgctggctcgaggctgcccgctggaggagctggag gagccccgcggccagcaggaggtgctgcaggaccagccgctcagccagggcgcccgcgga gagggtgccacccagctggcgccgcagcgggtccgggtcacgctgcggcctggggagccc 
cagcagctccaggtccgcttccttcgtgctgagggatacccggtggacctgtactacctt atggacctgagctactccatgaaggacgacctggaacgcgtgcgccagctcgggcacgct ctgctggtccggctgcaggaagtcacccattctgtgcgcattggttttggttcctttgtg gacaaaacggtgctgccctttgtgagcacagtaccctccaaactgcgccacccctgcccc acccggctggagcgctgccagtcaccattcagctttcaccatgtgctgtccctgacgggg gacgcacaagccttcgagcgggaggtggggcgccagagtgtgtccggcaatctggactcg cctgaaggtggcttcgatgccattctgcaggctgcactctgccaggagcagattggctgg agaaatgtgtcccggctgctggtgttcacttcagacgacacattccatacagctggggac gggaagttgggcggcattttcatgcccagtgatgggcactgccacttggacagcaatggc ctctacagtcgcagcacagagtttgactacccttctgtgggtcaggtagcccaggccctc tctgcagcaaatatccagcccatctttgctgtcaccagtgccgcactgcctgtctaccag >gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_2|2525_aa MLVTAYLAFVGLLASCLGLELSRCRAKPPGRACSNPSFLRFQLDFYQVYFLALAADWLQA PYLYKLYQHYYFLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTYSLCCL TKLSQDYFVLLVGRALGGLSTALLFSAFEAWYIHEHVERHDFPAEWIPATFARAAFWNHV LAVVAGVAAEAVASWIGLGPVAPFVAAIPLLALAGALALRNWGENYDRQRAFSRTCAGGL RCLLSDRRVLLLGTIQALFESVIFIFVFLWTPVLDPHGAPLGIIFSSFMAASLLGSSLYR IATSKRYHLQPMHLLSLAVLIVVFSLFMLTFSTSPGQESPVESFIAFLLIELACGLYFPS MSFLRRKVIPETEQAGVLNWFRVPLHSLACLGLLVLHDSDRKTGTRNMFSICSAVMVMAL LAVVGLFTVLSGVMRSFKRVNFGTLLSSQKEAEELLPALKEFLSNPPAGFPSSRSDAERR QACDAILRACNQQLTAKLACPRHLGSLLELAELACDGYLVSTPQRPPLYLERILFVLLRN AAAQGSPEATLRLAQPLHACLVQCSREAAPQDYEAVARGSFSLLWKGAEALLERRAAFAA RLKALSFLVLLEDESTPCEVPHFASPTACRAVAAHQLFDASGHGLNEADADFLDDLLSRH VIRALVGERGSSSGLLSPQRALCLLELTLEHCRRFCWSRHHDKAISAVEKAHSYLRNTNL APSLQLCQLGVKLLQVGEEGPQAVAKLLIKASAVLSKSMEAPSPPLRALYESCQFFLSGL ERGTKRRYRLDAILSLFAFLGGYCSLLQQLRDDGVYGGSSKQQQSFLQMYFQGLHLYTVV VYDFAQGCQIVDLADLTQLVDSCKSTVVWMLEALEGLSGQELTDHMGMTASYTSNLAYSF YSHKLYAEACAISEPLCQHLGLVKPGTYPEVPPEKLHRCFRLQVESLKKLGKQAQGCKMV ILWLAALQPCSPEHMAEPVTFWVRVKMDAARAGDKELQLKTLRDSLSGWDPETLALLLRE ELQAYKAVRADTGQERFNIICDLLELSPEETPAGAWARATHLVELAQVLCYHDFTQQTNC SALDAIREALQLLDSVRPEAQARDQLLDDKAQALLWLYICTLEAKMQEGIERDRRAQAPG NLEEFEVNDLNYEDKLQEDRFLYSNIAFNLAADAAQSKCLDQALALWKELLTKGQAPAVR 
CLQQTAASLQILAALYQLVAKPMQALEVLLLLRIVSERLKDHSKAAGSSCHITQLLLTLG CPSYAQLHLEEAASSLKHLDQTTDTYLLLSLTCDLLRSQLYWTHQKVTKGVSLLLSVLRD PALQKSSKAWYLLRVQVLQLVAAYLSLPSNNLSHSLWEQLCAQGWQTPEIALIDSHKLLR SIILLLMGSDILSTQKAAVETSFLDYGENLVQKWQVLSEVLSCSEKLVCHLGRLGSVSEA KAFCLEALKLTTKLQIPRQCALFLVLKGELELARNDIDLCQSDLQQVLFLLESCTEFGGV TQHLDSVKKVHLQKGKQQAQVPCPPQLPEEELFLRGPALELVATVAKEPGPIAPSTNSSP VLKTKPQPIPNFLSHSPTCDCSLCASPVLTAVCLRWVLVTAGVRLAMGHQAQGLDLLQVV LKGCPEAAERLTQALQASLNHKTPPSLVPSLLDEILAQAYTLLALEGLNQPSNESLQKVL QSGLKFVAARIPHLEPWRASLLLIWALTKLGGLSCCTTQLFASSWGWQPPLIKSVPGSEP SKTQGQKRSGRGRQKLASAPLRLNNTSQKGLEGRGLPCTPKPPDRIRQAGPHVPFTVFEE VCPTESKPEVPQAPRVQQRVQTRLKVNFSDDSDLEDPVSAEAWLAEEPKRRGTASRGRGR ARKGLSLKTDAVVAPGSAPGNPGLNGRSRRAKKVASRHCEERRPQRASDQARPGPEIMRT IPEEELTDNWRKMSFEILRGSDGEDSASGGKTPAPGPEAASGEWELLRLDSSKKKLPSPC PDKESDKDLGPRLRLPSAPVATGLSTLDSICDSLSVAFRGISHCPPSGLYAHLCRFLALC LGHRDPYATAFLVTESVSITCRHQLLTHLHRQLSKAQKHRGSLEIADQLQGLSLQEMPGD VPLARIQRLFSFRALESGHFPQPEKESFQERLALIPSGVTVCVLALATLQPGTVGNTLLL TRLEKDSPPVSVQIPTGQNKLHLRSVLNEFDAIQKAQKENSSCTDKREWWTGRLALDHRM EVLIASLEKSVLGCWKGLLLPSSEEPGPAQEASRLQELLQDCGWKYPDRTLLKIMLSGAG ALTPQDIQALAYGLCPTQPERAQELLNEAVGRLQGLTVPSNSHLVLVLDKDLQKLPWESM PSLQALPVTRLPSFRFLLSYSIIKEYGASPVLSQGVDPRSTFYVLNPHNNLSSTEEQFRA NFSSYAGHGAGARFLDGQAVLRLSCRAVALLFGCSSAALAVRGNLEGAGIVLKYIMAGCP LFLGNLWDVTDRDIDRYTEALLQGWLGAGPGAPLLYYVNQARQAPRLKYLIGAAPIAYGL PVSLR >gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_2|7578_bp atgctggtgactgcctaccttgcttttgtaggcctcctggcctcctgcctggggctggaa ctgtcaagatgccgggctaaaccccctggaagggcctgcagcaatccctccttccttcgg tttcaactggacttctatcaggtctacttcctggccctggcagctgattggcttcaggcc ccctacctctataaactctaccagcattactacttcctggaaggtcaaattgccatcctc tatgtctgtggccttgcctctacagtcctctttggcctagtggcctcctcccttgtggat tggctgggtcgcaagaattcttgtgtcctcttctccctgacttactcactatgctgctta accaaactctctcaagactactttgtgctgctagtggggcgagcacttggtgggctgtcc acagccctgctcttctcagccttcgaggcctggtatatccatgagcacgtggaacggcat gacttccctgctgagtggatcccagctacctttgctcgagctgccttctggaaccatgtg 
ctggctgtagtggcaggtgtggcagctgaggctgtagccagctggatagggctggggcct gtagcgccctttgtggctgccatccctctcctggctctggcaggggccttggcccttcga aactggggggagaactatgaccggcagcgtgccttctcaaggacctgtgctggaggcctg cgctgcctcctgtcggaccgccgcgtgctgctgttgggcaccatacaagctctatttgag agtgtcatcttcatctttgtcttcctctggacacctgtgctggacccacacggggcccct ctgggcattatcttctccagcttcatggcagccagcctgcttggctcttccctgtaccgt atcgccacctccaagaggtaccaccttcagcccatgcacctgctgtcccttgctgtgctc atcgtcgtcttctctctcttcatgttgactttctctaccagcccaggccaggagagtccg gtggagtccttcatagcctttctacttattgagttggcttgtggattatactttcccagc atgagcttcctacggagaaaggtgatccctgagacagagcaggctggtgtactcaactgg ttccgggtacctctgcactcactggcttgcctagggctccttgtcctccatgacagtgat cgaaaaacaggcactcggaatatgttcagcatttgctctgctgtcatggtgatggctctg ctggcagtggtgggactcttcaccgtgctctccggtgtcatgaggagcttcaaaagagtc aactttgggactctgctaagcagccagaaggaggctgaagagttgctgcccgccttgaag gagttcctgtccaaccctccagctggttttcccagcagccgatctgatgctgagaggaga caagcttgtgatgccatcctgagggcttgcaaccagcagctgactgctaagctagcttgc cctaggcatctggggagcctgctggagctggcagagctggcctgtgatggctacttagtg tctaccccacagcgtcctcccctctacctggaacgaattctctttgtcttactgcggaat gctgctgcacaaggaagcccagaggccacactccgccttgctcagcccctccatgcctgc ttggtgcagtgctctcgcgaggctgctccccaggactatgaggccgtggctcggggcagc ttttctctgctttggaagggggcagaagccctgttggaacggcgagctgcatttgcagct cggctgaaggccttgagcttcctagtactcttggaggatgaaagtaccccttgtgaggtt cctcactttgcttctccaacagcctgtcgagcggtagctgcccatcagctatttgatgcc agtggccatggtctaaatgaagcagatgctgatttcctagatgacctgctctccaggcac gtgatcagagccttggtgggtgagagagggagctcttctgggcttctttctccccagagg gccctctgcctcttggagctcaccttggaacactgccgtcgcttttgctggagccgccac catgacaaagccatcagcgcagtggagaaggctcacagttacctaaggaacaccaatcta gcccctagccttcagctatgtcagctgggggttaagctgctgcaggttggggaggaagga cctcaggcagtggccaagcttctgatcaaggcatcagctgtcctgagcaagagtatggag gcaccatcacccccacttcgggcattgtatgagagctgccagttcttcctttcaggcctg gaacgaggcaccaagaggcgctatagacttgatgccattctgagcctctttgcttttctt ggagggtactgctctcttctgcagcagctgcgggatgatggtgtgtatgggggctcctcc 
aagcaacagcagtcttttcttcagatgtactttcagggacttcacctctacactgtggtg gtttatgactttgcccaaggctgtcagatagttgatttggctgacctgacccaactagtg gacagttgtaaatctaccgttgtctggatgctggaggccttagagggcctgtcgggccaa gagctgacggaccacatggggatgaccgcttcttacaccagtaatttggcctacagcttc tatagtcacaagctctatgccgaggcctgtgccatctctgagccgctctgtcagcacctg ggtttggtgaagccaggcacttatcccgaggtgcctcctgagaagttgcacaggtgcttc cggctacaagtagagagtttgaagaaactgggtaaacaggcccagggctgcaagatggtg attttgtggctggcagccctgcaaccctgtagccctgaacacatggctgagccagtcact ttctgggttcgggtcaagatggatgcggccagggctggagacaaggagctacagctaaag actctgcgagacagcctcagtggctgggacccggagaccctggccctcctgctgagggag gagctgcaggcctacaaggcggtgcgggccgacactggacaggaacgcttcaacatcatc tgtgacctcctggagctgagccccgaggagacaccagccggggcctgggcacgagccacc cacctggtagaactggctcaggtgctctgctaccacgactttacgcagcagaccaactgc tctgctctggatgctatccgggaagccctgcagcttctggactctgtgaggcctgaggcc caggccagagatcagcttctggacgataaagcacaggccttgctgtggctttacatctgt actctggaagccaaaatgcaggaaggtatcgagcgggatcggagagcccaggcccctggt aacttggaggaatttgaagtcaatgacctgaactatgaagataaactccaggaagatcgt ttcctatacagtaacattgccttcaacctggctgcagatgctgctcagtccaaatgcctg gaccaagccctggccctgtggaaggagctgcttacaaaggggcaggccccagctgtacgg tgtctccagcagacagcagcctcactgcagatcctagcagccctctaccagctggtggca aagcccatgcaggctctggaggtcctcctgctgctacggattgtctctgagagactgaag gaccactcgaaggcagctggctcctcctgccacatcacccagctcctcctgaccctcggc tgtcccagctatgcccagttacacctggaagaggcagcatcgagcctgaagcatctcgat cagactactgacacatacctgctcctttccctgacctgtgatctgcttcgaagtcaactc tactggactcaccagaaggtgaccaagggtgtctctctgctgctgtctgtgcttcgggat cctgccctccagaagtcctccaaggcttggtacttgctgcgtgtccaggtcctgcagctg gtggcagcttaccttagcctcccgtcaaacaacctctcacactccctgtgggagcagctc tgtgcccaaggctggcagacacctgagatagctctcatagactcccataagctcctccga agcatcatcctcctgctgatgggcagtgacattctctcaactcagaaagcagctgtggag acatcgtttttggactatggtgaaaatctggtacaaaaatggcaggttctttcagaggtg ctgagctgctcagagaagctggtctgccacctgggccgcctgggtagtgtgagtgaagcc aaggccttttgcttggaggccctaaaacttacaacaaagctgcagataccacgccagtgt 
gccctgttcctggtgctgaagggcgagctggagctggcccgcaatgacattgatctctgt cagtcggacctgcagcaggttctgttcttgcttgagtcttgcacagagtttggtggggtg actcagcacctggactctgtgaagaaggtccacctgcagaaggggaagcagcaggcccag gtcccctgtcctccacagctcccagaggaggagctcttcctaagaggccctgctctagag ctggtggccactgtggccaaggagcctggccccatagcaccttctacaaactcctcccca gtcttgaaaaccaagccccagcccatacccaacttcctgtcccattcacccacctgtgac tgctcgctctgcgccagccctgtcctcacagcagtctgtctgcgctgggtattggtcacg gcaggggtgaggctggccatgggccaccaagcccagggtctggatctgctgcaggtcgtg ctgaagggctgtcctgaagccgctgagcgcctcacccaagctctccaagcttccctgaat cataaaacacccccctccttggttccaagcctcttggatgagatcttggctcaagcatac acactgttggcactggagggcctgaaccagccatcaaacgagagcctgcagaaggttcta cagtcagggctgaagtttgtagcagcacggataccccacctagagccctggcgagccagc ctgctcttgatttgggccctcacaaaactaggtggcctcagctgctgtactacccaactt tttgcaagctcctggggctggcagccaccattaataaaaagtgtccctggctcagagccc tctaagactcagggccaaaaacgttctggacgagggcgccaaaagttagcctctgctccc ctgcgcctcaataatacctctcagaaaggtctggaaggtagaggactgccctgcacacct aaacccccagaccggatcaggcaagctggccctcatgtccccttcacggtgtttgaggaa gtctgccctacagagagcaagcctgaagtaccccaggcccccagggtacaacagagagtc cagacgcgcctcaaggtgaacttcagtgatgacagtgacttggaagaccctgtctcagct gaggcctggctggcagaggagcctaagagacggggcactgcttcccggggccgggggcga gcaaggaagggcctgagcctaaagacggatgccgtggttgccccaggtagtgcccctggg aaccctggcctgaatggcaggagccggagggccaagaaggtggcatcaagacattgtgag gagcggcgtccccagagggccagtgaccaggccaggcctggccctgagatcatgaggacc atccctgaggaagaactgactgacaactggagaaaaatgagctttgagatcctcaggggc tctgacggggaagactcagcctcaggtgggaagactccagctccgggccctgaggcagct tctggagaatgggagctgctgaggctggattccagcaagaagaagctgcccagcccatgc ccagacaaggagagtgacaaggaccttggtcctcggctccggctcccctcagcccccgta gccactggtctttctaccctggactccatctgtgactccctgagtgttgctttccggggc attagtcactgtcctcctagtgggctctatgcccacctctgccgcttcctggccttgtgc ctgggccaccgggatccttatgccactgctttccttgtcaccgagtctgtctccatcacc tgtcgccaccagctgctcacccacctccacagacagctcagcaaggcccagaagcaccga ggatcacttgaaatagcagaccagctgcaggggctgagccttcaggagatgcctggagat 
gtccccctggcccgcatccagcgcctcttttccttcagggctttggaatctggccacttc ccccagcctgaaaaggagagtttccaggagcgcctggctctgatccccagtggggtgact gtgtgtgtgttggccctggccaccctccagcccggaaccgtgggcaacaccctcctgctg acccggctggaaaaggacagtcccccagtcagtgtgcagattcccactggccagaacaag cttcatctgcgttcagtcctgaatgagtttgatgccatccagaaggcacagaaagagaac agcagctgtactgacaagcgagaatggtggacagggcggctggcactggaccacaggatg gaggttctcatcgcttccctagagaagtctgtgctgggctgctggaaggggctgctgctg ccgtccagtgaggagcccggccctgcccaggaggcctcccgcctacaggagctgctacag gactgtggctggaaatatcctgaccgcactctgctgaaaatcatgctcagtggtgccggt gccctcacccctcaggacattcaggccctggcctacgggctgtgcccaacccagccagag cgagcccaggagctcctgaatgaggcagtaggacgtctacagggcctgacagtaccaagc aatagccaccttgtcttggtcctagacaaggacttgcagaagctgccgtgggaaagcatg cccagcctccaagcactgcctgtcacccggctgccctccttccgcttcctactcagctac tccatcatcaaagagtatggggcctcgccagtgctgagtcaaggggtggatccacgaagt accttctatgtcctgaaccctcacaataacctgtcaagcacagaggagcaatttcgagcc aatttcagcagctatgcagggcatggggctggtgcccgcttccttgatgggcaggctgtc ctgcggctgagctgtcgggcagtggccctgctgtttggctgtagcagtgcggccctggct gtgcgtggaaacctggagggggctggcatcgtgctcaagtacatcatggctggttgcccc ttgtttctgggtaatctctgggatgtgactgaccgcgacattgaccgctacacggaagct ctgctgcaaggctggcttggagcaggcccaggggccccccttctctactatgtaaaccag gcccgccaagctccccgactcaagtatcttattggggctgcacctatagcctatggcttg cctgtctctctgcggtaa >gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_3|577_aa MAQSINITELNLPQLEMLKNQLDQEVEFLSTSIAQLKVVQTKYVEAKDCLNVLNKSNEGM GFSYLKRENPLHIVYRFMNPLHVACLELKSTGHSSAPIAPVSVLFTLSMYVPGKLHDVEH VLIDVGTGYYVEKTAEDAKDFFKRKIDFLTKQMEKIQPALQEKHAMKQELVAKGTVGKRK WGCAGASSSGSALLPPCRELLMGHQFLRGLLTLLLPPPPLYTRHRMLGPESVPPPKRSRS KLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYD PRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGT LYDKMYENFVEEVDAVDNGISQWAEGEPRYALTTTLSARVARLNPTWNHPDQDTEAGFKR AMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGACPWKEHLYHL ESGLSPPVAIFFVIYTDQAGQWRIQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPG CIFVHASGFTGGHHTREGALSMARATLAQRSYLPQIS 
>gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_3|1734_bp atggcgcagtctattaacatcacggagctgaatctgccgcagctagaaatgctcaagaac cagctggaccaggaagtggagttcttgtccacgtccattgctcagctcaaagtggtacag accaagtatgtggaagccaaggactgtctgaacgtgctgaacaagagcaacgagggtatg ggtttttcttacctgaaacgagaaaatccattacatatcgtataccgcttcatgaaccct ttgcatgttgcctgcctagaattgaaaagtacaggacattcctctgctcctattgcccct gtttccgttcttttcacactgtctatgtatgtccctgggaagctgcatgatgtggaacac gtgctcatcgatgtgggaactgggtactatgtagagaagacagctgaggatgccaaggac ttcttcaagaggaagatagattttctaaccaagcagatggagaaaatccaaccagctctt caggagaagcacgccatgaaacaggagctggttgccaagggaacggttggcaagcggaag tggggctgcgctggcgcttcctcttccgggtcggcgctcctgcctccctgcagggagctg cttatgggacaccaattcctgcgcggcctcttaacgctgctgctgccgccgccacccctg tatacccggcaccgcatgctcggtccagagtccgtcccgcccccaaaacgatcccgcagc aaactcatggcaccgccccgaatcgggacgcacaatggcaccttccactgcgacgaggca ctggcatgcgcactgcttcgcctcctgccggagtaccgggatgcagagattgtgcggacc cgggatcccgaaaaactcgcttcctgtgacatcgtggtggacgtggggggcgagtacgac cctcggagacaccgatatgaccatcaccagaggtctttcacagagaccatgagctccctg tcccctgggaagccgtggcagaccaagctgagcagtgcgggactcatctatctgcacttc gggcacaagctgctggcccagttgctgggcactagtgaagaggacagcatggtgggcacc ctctatgacaagatgtatgagaactttgtggaggaggtggatgctgtggacaatgggatc tcccagtgggcagagggggagcctcgatatgcactgaccactaccctgagtgcacgagtt gctcgacttaatcctacctggaaccaccccgaccaagacactgaggcagggttcaagcgt gcaatggatctggttcaagaggagtttctgcagagattagatttctaccaacacagctgg ctgccagcccgggccttggtggaagaggcccttgcccagcgattccaggtggacccaagt ggagagattgtggaactggcgaaaggtgcatgtccctggaaggagcatctctaccacctg gaatctgggctgtcccctccagtggccatcttctttgttatctacactgaccaggctgga cagtggcgaatacagtgtgtgcccaaggagccccactcattccaaagccggctgcccctg ccagagccatggcggggtcttcgggacgaggccctggaccaggtcagtgggatccctggc tgcatcttcgtccatgcaagcggcttcactggcggtcaccacacccgagagggtgccttg agcatggcccgtgccaccttggcccagcgctcatacctcccacaaatctcctag >gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_4|546_aa MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR 
LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH SPHSHL >gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_4|1641_bp atgtgctctctggggttgttccctcctccaccgcctcggggtcaagtcaccctatatgag cacaataacgagctggtgacgggcagtagctatgagagcccgccccccgacttccggggc cagtggatcaatcttcctgtcctacaactgacaaaggatcccctaaagacccctggaagg ctggaccatggcacaagaactgccttcatccatcaccgggagcaagtgtggaagagatgc atcaacatttggcgtgatgtgggcctttttggggtgctaaatgaaattgcaaactcagaa gaagaggtgtttgagtgggtgaagacggcatccggctgggccctggcactctgtcgatgg gcctcttccctccatgggtccctgttcccccatctgtctctcaggagcgaagatctgatc gctgaatttgcccaagtcacaaattggtccagctgctgcttgcgtgtctttgcatggcac ccccacaccaacaagtttgcagtggccctgctagatgactcagtccgtgtgtataatgcc agcagcaccatagtcccctccctgaagcaccggctgcagcgaaatgtggcgtctctggcc tggaagccccttagtgcctctgtcttggctgtggcctgccagagctgcattcttatctgg accctggaccctacctccttgtctacccgaccctcttctggctgtgcccaagtgctgtct caccctgggcatacacctgttaccagcttggcctgggcccccagtggggggcggctgctc tcagcttcacccgtggatgctgctatccgggtatgggatgtctcaacagagacctgtgtc ccccttccctggttccgaggaggtggggtgaccaacctgctctggtccccagacggcagc aaaatcctggctaccactccttcagctgtctttcgagtctgggaggcccagatgtggact tgtgagaggtggcctactctatcagggcgctgtcagactggctgctggagcccagatggc agccgactgctgttcactgtattgggagagccactgatttactccctgtcttttccagaa cgttgtggtgagggaaaggggtgcgttggaggtgcaaagtcagcaacgattgtggcagat ctgtctgagacaacaatacagacaccagatggtgaggagaggcttgggggagaggctcac tccatggtctgggaccccagtggggaacgtctggctgtgcttatgaaaggaaagccaagg gtacaggatggtaaaccagtcatcctcctttttcgcactcgaaacagccctgtgtttgag ctccttccctgtggcattatccagggggagccaggagcccagccccagctcatcactttc 
catccttccttcaacaaaggggccctgctcagtgtgggctggtccacaggccgaattgcc cacatcccgctgtactttgtcaatgcccagtttccacgttttagcccagtgcttgggcgg gcccaggaaccccctgctgggggtggaggctctattcatgacctgcccctctttactgag acatccccaacctctgccccttgggaccctctcccagggccaccacctgttctgccccac tccccacattcccacctctaa >gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_5|472_aa MSQRGRRAAGAWVGVGMLGFPRTREFAGAIWVSELWCVCGVLDQKYLKEEVHYGSSPLAM LTAACSKFGGSSPLRDSTTLGKAGTKKPYSVGSDLSASKTMGDAYPAPFTSTNGLLSPAG SPPAPTSGYANDYPPFSHSFPGPTGTQDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWY KAGIHAGISPGPGNTPTPWWDMHPGGNWLGGGQGQGDGLQGTLPTGPAQPPLNPQLPTYP SDFAPLNPAPYPAPHLLQPGPQHVLPQDVYKPKAVGNSGQLEGSGGAKPPRGASTGGSGG YGGSGAGRSSCDCPNCQELERLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHT GERPFVCNWLFCGKRFTRSDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGEP GPGPPPSGPKELGEGRSTGEEEASQTPRPSASPATPEKAPGGSPEQSNLLEI >gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_5|1419_bp atgagtcagcggggccgcagagcagcgggggcctgggtgggggtagggatgctggggttc cccaggaccagggagttcgctggggccatctgggtctctgaactctggtgtgtgtgtgga gtattggatcaaaagtacctaaaggaggaagttcactatggctccagtcccctggccatg ctgacggcagcgtgcagcaaatttggtggctctagccctctgcgggactcaacaactctg ggcaaagcaggcacaaagaagccgtactctgtgggcagtgacctttcagcctccaaaacc atgggggatgcttatccagccccctttacaagcactaatgggctcctttcacctgcaggc agtcctccagcacccacctcaggctatgctaatgattaccctcccttttcccactcattc cctgggcccacaggcacccaggaccctgggctactagtgcccaaggggcacagctcttct gactgtctgcccagtgtctacacctctctggacatgacacacccctatggctcctggtac aaggcaggcatccatgcaggcatttcaccaggcccaggcaacactcctactccatggtgg gatatgcaccctggaggcaactggctaggtggtgggcagggccagggtgatgggctgcaa gggacactgcccacaggtccagctcagcctccactgaacccccagctgcccacctaccca tctgactttgctccccttaatccagccccctacccagctccccacctcttgcaaccaggg ccccagcatgtcttgccccaagatgtctataaacccaaggcagtgggaaatagtgggcag ctagaagggagtggtggagccaaacccccacggggtgcaagcactgggggtagtggtgga tatgggggcagtggggcagggcgctcctcctgcgactgccctaattgccaggagctagag cggctgggagcagcagcggctgggctgcggaagaagcccatccacagctgccacatccct ggctgcggcaaggtgtatggcaaggcttcgcacctgaaggcccacttgcgctggcacaca 
ggcgagaggcccttcgtctgcaactggctcttctgcggcaagaggttcactcgttcggat gagctggagcgtcatgtgcgcactcacacccgggagaagaagttcacctgcctgctctgc tccaagcgctttacccgaagcgaccacctgagcaaacaccagcgcacccatggagaacca ggcccgggtccccctcccagtggccccaaggagctgggggagggccgcagcacgggggaa gaggaggccagtcagacgccccgaccttctgcctcgccagcaaccccagagaaagcccct ggaggcagccctgagcagagcaacttgctggagatctga >gi568815586f:53195568_53399342|GENSCAN_predicted_peptide_6|618_aa MGRPPRGRGQRGLARPLPAPATGDGPYPPLLGRPPEAPPAGGWSRGGGPSSEGPARANRL PDQDHSMDEMTAVVKIEKGVGGNNGGNGNGGGAFSQARSSSTGSSSSTGGGGQESQPSPL ALLAATCSRIESPNENSNNSQGPSQSGGTGELDLTATQLSQGANGWQIISSSSGATPTSK EQSGSSTNGSNGSESSKNRTVSGGQYVVAAAPNLQNQQVLTGLPGVMPNIQYQVIPQFQT VDGQQLQFAATGAQVQQDGSGQIQIIPGANQQIITNRGSGGNIIAAMPNLLQQAVPLQGL ANNVLSGQTQYVTNVPVALNGNITLLPVNSVSAATLTPSSQAVTISSSGSQESGSQPVTS GTTISSASLVSSQASSSSFFTNANSYSTTTTTSNMGIMNFTTSGSSGTNSQGQTPQRVSG LQGSDALNIQQNQTSGGSLQAGQQKEGEQNQQTQQQQILIQPQLVQGGQALQALQAAPLS GQTFTTQAISQETLQNLQLQAVPNSGPIIIRTPTVGPNGQVSWQTLQLQNLQVQNPQAQT ITLAPMQGVSLGQTSSSNTTLTPIASAASIPAGTVTVNAAQLSSMPGLQTINLSALGTSG IQVHPIQGLPLAIANAPX >gi568815586f:53195568_53399342|GENSCAN_predicted_CDS_6|1854_bp atgggccgcccgccccgggggagggggcagcgtggcctcgcccgccccctgcccgccccg gccacgggggacgggccttaccccccactactcggccgcccgcctgaggctcctcccgcc gggggctggagccgcgggggcggcccgagcagcgaaggccccgcccgggccaaccgcctg cctgaccaagatcactccatggatgaaatgacagctgtggtgaaaattgaaaaaggagtt ggtggcaataatgggggcaatggtaatggtggtggtgccttttcacaggctcgaagtagc agcacaggcagtagcagcagcactggaggaggagggcaggagtcccagccatcccctttg gctctgctggcagcaacttgcagcagaattgagtcacccaatgagaacagcaacaactcc cagggcccgagtcagtcagggggaacaggtgagcttgacctcacagccacacaactttca cagggtgccaatggctggcagatcatctcttcctcctctggggctacccctacctcaaag gaacagagtggcagcagtaccaatggcagcaatggcagtgagtcttccaagaatcgcaca gtctctggtgggcagtatgttgtggctgccgctcccaacttacagaaccagcaagttctg acaggactacctggagtgatgcctaatattcagtatcaagtaatcccacagttccagacc gttgatgggcaacagctgcagtttgctgccactggggcccaagtgcagcaggatggttct ggtcaaatacagatcataccaggtgcaaaccaacagattatcacaaatcgaggaagtgga 
ggcaacatcattgctgctatgccaaacctactccagcaggctgtccccctccaaggcctg gctaataatgtactctcaggacagactcagtatgtgaccaatgtaccagtggccctgaat gggaacatcaccttgctacctgtcaacagcgtttctgcagctaccttgactcccagctct caggcagtcacgatcagcagctctgggtcccaggagagtggctcacagcctgtcacctca gggactaccatcagttctgccagcttggtatcatcacaagccagttccagctcctttttc accaatgccaatagctactcaactactactaccaccagcaacatgggaattatgaacttt actaccagtggatcatcagggaccaactctcaaggccagacaccccagagggtcagtggg ctacaggggtctgatgctctgaacatccagcaaaaccagacatctggaggctcattgcaa gcaggccagcaaaaagaaggagagcaaaaccagcagacacagcagcaacaaattcttatc cagcctcagctagttcaagggggacaggccctccaggccctccaagcagcaccattgtca gggcagacctttacaactcaagccatctcccaggaaaccctccagaacctccagcttcag gctgttccaaactctggtcccatcatcatccggacaccaacagtggggcccaatggacag gtcagttggcagactctacagctgcagaacctccaagttcagaacccacaagcccaaaca atcaccttagccccaatgcagggtgtttccttggggcagaccagcagcagcaacaccact ctcacacccattgcctcagctgcttccattcctgctggcacagtcactgtgaatgctgct caactctcctccatgccaggcctccagaccattaacctcagtgcattgggtacttcagga atccaggtgcacccaattcaaggcctgccgttggctatagcaaatgccccagnn