GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:20:23 Sequence gi568815586r:53207492_53421465 : 213974 bp : 50.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 PlyA - 129 124 6 1.05 1.13 Term - 4372 4185 188 2 2 129 47 209 0.998 18.55 1.12 Intr - 5752 5594 159 2 0 106 96 254 0.999 27.96 1.11 Intr - 6209 6005 205 2 1 97 64 359 0.999 33.17 1.10 Intr - 6744 6568 177 0 0 119 119 215 0.999 27.82 1.09 Intr - 7115 6955 161 0 2 46 89 219 0.824 17.51 1.08 Intr - 7943 7802 142 2 1 96 102 205 0.999 22.63 1.07 Intr - 8303 8155 149 0 2 123 93 160 0.989 19.95 1.06 Intr - 12731 12459 273 0 0 31 51 138 0.245 1.91 1.05 Intr - 20084 19871 214 2 1 70 56 148 0.785 8.09 1.04 Intr - 20196 20159 38 1 2 151 63 -6 0.890 1.18 1.03 Intr - 23744 23678 67 2 1 83 94 50 0.625 3.58 1.02 Intr - 24589 24483 107 2 2 48 109 62 0.885 4.23 1.01 Init - 28906 28777 130 1 1 54 71 26 0.292 -2.19 1.00 Prom - 36250 36211 40 -5.16 2.00 Prom + 38086 38125 40 -8.46 2.01 Init + 45345 46631 1287 2 0 96 79 991 0.001 89.05 2.02 Intr + 61264 61356 93 0 0 85 81 73 0.202 6.46 2.03 Intr + 61533 62594 1062 2 0 84 59 625 0.874 49.71 2.04 Intr + 62887 62991 105 0 0 115 84 72 0.994 9.91 2.05 Intr + 63187 63307 121 0 1 145 65 137 0.999 17.17 2.06 Intr + 65230 65366 137 2 2 118 82 124 0.989 15.09 2.07 Intr + 67326 67519 194 2 2 63 87 109 0.951 6.59 2.08 Intr + 69129 69368 240 0 0 70 95 355 0.995 31.06 2.09 Intr + 69592 69736 145 1 1 86 81 170 0.844 16.38 2.10 Intr + 69979 70117 139 0 1 89 81 219 0.997 21.34 2.11 Intr + 70330 70469 140 2 2 55 90 104 0.991 7.48 2.12 Intr + 72241 72375 135 0 0 27 109 130 0.944 9.76 2.13 Intr + 74016 74135 120 2 0 87 63 97 0.995 7.79 2.14 Intr + 74773 74944 172 0 1 148 62 97 0.999 12.62 2.15 Intr + 75638 75766 129 0 0 81 101 127 0.999 13.97 2.16 Intr + 75891 76047 157 1 1 84 100 93 0.999 9.17 2.17 Intr + 76567 76676 110 1 2 68 99 173 0.999 16.33 2.18 Intr + 78433 79421 989 2 2 105 100 645 0.975 57.99 2.19 Intr + 80481 80850 370 2 1 83 91 216 0.999 16.08 2.20 Intr + 81047 81208 162 0 0 84 28 91 0.569 2.65 2.21 Intr + 81599 81812 214 0 1 77 94 174 0.999 14.67 2.22 Intr + 81913 82103 191 1 2 113 56 142 0.997 12.73 2.23 Intr + 82594 82721 128 2 2 64 97 82 0.915 7.10 2.24 Intr + 82856 82978 123 1 0 11 85 148 0.903 7.58 2.25 Intr + 83350 83505 156 0 0 90 87 131 0.999 13.41 2.26 Intr + 84199 84369 171 0 0 90 116 141 0.879 17.24 2.27 Intr + 84493 84597 105 0 0 61 59 57 0.584 0.51 2.28 Intr + 84787 84902 116 0 2 38 71 87 0.936 1.25 2.29 Intr + 85315 85479 165 1 0 81 97 201 0.937 19.38 2.30 Term + 85782 85983 202 0 1 103 40 143 0.995 7.86 2.31 PlyA + 86127 86132 6 1.05 3.00 Prom + 86137 86176 40 -11.63 3.01 Init + 88077 88148 72 2 0 115 75 109 0.972 13.37 3.02 Intr + 88348 88457 110 0 2 103 49 176 0.677 14.28 3.03 Intr + 88521 88662 142 0 1 51 15 81 0.629 -2.54 3.04 Intr + 90359 90433 75 1 0 94 113 123 0.645 15.11 3.05 Intr + 90554 90659 106 1 1 68 110 116 0.999 11.59 3.06 Intr + 92149 92462 314 2 2 62 63 212 0.423 12.00 3.07 Intr + 92659 92771 113 0 2 111 92 260 0.999 27.88 3.08 Intr + 95543 95702 160 2 1 120 81 201 0.986 22.59 3.09 Intr + 98417 98569 153 1 0 34 94 215 0.999 16.97 3.10 Intr + 98707 98829 123 0 0 62 80 61 0.916 3.48 3.11 Intr + 99189 99370 182 2 2 126 93 71 0.911 10.07 3.12 Term + 99475 99658 184 1 1 110 34 154 0.992 9.22 3.13 PlyA + 99662 99667 6 1.05 4.17 PlyA - 99996 99991 6 1.05 4.16 Term - 100222 99998 225 1 0 153 48 25 0.900 1.78 4.15 Intr - 100438 100354 85 0 1 78 89 31 0.975 2.02 4.14 Intr - 100642 100561 82 2 1 80 119 36 0.924 4.60 4.13 Intr - 100858 100791 68 0 2 98 100 9 0.788 1.65 4.12 Intr - 101037 100944 94 1 1 110 83 18 0.728 2.52 4.11 Intr - 101324 101234 91 2 1 72 89 28 0.810 0.87 4.10 Intr - 101529 101469 61 2 1 76 110 -7 0.851 -0.96 4.09 Intr - 101790 101666 125 0 2 95 109 58 0.990 7.98 4.08 Intr - 102230 102110 121 1 1 129 100 17 0.991 7.50 4.07 Intr - 106950 106807 144 2 0 114 131 91 0.998 15.40 4.06 Intr - 107358 107260 99 2 0 65 89 51 0.781 2.13 4.05 Intr - 107649 107603 47 0 2 96 97 37 0.965 2.61 4.04 Intr - 107935 107844 92 2 2 66 84 18 0.963 -1.09 4.03 Intr - 108291 108236 56 2 2 101 110 60 0.997 8.12 4.02 Intr - 113201 113074 128 2 2 64 73 79 0.952 3.48 4.01 Init - 113974 113852 123 1 0 75 37 93 0.344 3.39 4.00 Prom - 117707 117668 40 -10.35 5.03 PlyA - 119103 119098 6 1.05 5.02 Term - 121929 120655 1275 0 0 116 52 906 0.380 81.39 5.01 Init - 133098 132955 144 0 0 74 81 70 0.114 3.02 5.00 Prom - 148731 148692 40 -2.26 6.00 Prom + 167434 167473 40 -5.86 6.01 Init + 172998 173181 184 2 1 65 26 163 0.714 5.08 6.02 Intr + 174168 174322 155 1 2 58 111 151 0.963 14.19 6.03 Intr + 174619 176131 1513 0 1 92 110 893 0.601 81.13 6.04 Intr + 199094 199262 169 0 1 124 113 65 0.044 11.60 6.05 Intr + 201871 202070 200 1 2 90 99 116 0.868 11.89 6.06 Term + 203436 203749 314 1 2 118 50 315 0.985 25.86 6.07 PlyA + 205531 205536 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 45345 46697 1353 2 0 96 40 1075 0.998 97.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_1|669_aa MEILCKTDSYIHPHTGWQTPVALASHTLQATVEADVLAPGSGTGAPQYGEPRTLRRSIQE TARRRDLGAPPPPFPLPLQQLRPSSLNLTQYVEASLCRRPAGLLEAQWAGQAGRTPGDSH TAAAMATNKERLFAAGALGPGSGYPGAGFPFAFPGALRGSPPFEMLSPSFRGLGQPDLPK EMASLSPYASPPPPPLLERGAAGGGGGIGCGSLVFPAPSFPSSRVAMYDCMETFAPGPRR LYGAAGPGAGLLRRATGGSCFAGLESFAWPQPASLQSVETQSTSSEEMVPSSPSPPPPPR VYKPCFVCNDKSSGYHYGVSSCEGCKGFFRRSIQKNMVYTCHRDKNCIINKVTRNRCQYC RLQKCFEVGMSKEAVRNDRNKKKKEVKEEGSPDSYELSPQLEELITKVSKAHQETFPSLC QLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKIVEFAKRLPGFTGLSIADQITLLKA ACLDILMLRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGPLTDLVFAFAGQLLPLEMDD TETGLLSAICLICGDRMDLEEPEKVDKLQEPLLEALRLYARRRRPSQPYMFPRMLMKITD LRGISTKGAERAITLKMEIPGPMPPLIREMLENPEMFEDDSSQPGPHPNASSEDEVPGGQ GKGGLKSPA >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_1|2010_bp atggagattctgtgcaaaacagacagctacatccacccacacactggctggcagacacct gtagcactcgcctcacacacactccaggcaactgtggaggcagacgtgctagctccaggg agtgggacaggagccccccagtacggcgagccccggacattgcgacgctccatccaagag actgcccgacgccgggacctcggggctccgccgcctcccttccccctcccactccagcag ctacggcccagttccctcaacctgacccagtatgtagaagccagtctctgcaggcggcca gcgggacttttggaggcccagtgggcaggccaggcagggcggaccccaggggactctcac accgcagctgccatggccaccaataaggagcgactctttgcggctggtgccctggggcct ggatctggctacccaggggcaggtttccccttcgccttcccaggggcactcagggggtct ccgcctttcgagatgctgagccctagcttccggggcctgggccagcctgacctccccaag gagatggcctctctgtcgccctatgctagccctccccctccccccctgctggagcggggc gccgccgggggaggagggggaatcggctgcgggtccttggtgtttccagcacccagtttc ccttcaagccgggtcgcgatgtacgactgtatggaaacgtttgccccgggtccgcgacgg ctgtacggggcggccgggcccggggccggcttgctgcgcagagccaccggcggctcctgt ttcgccggacttgaatcttttgcctggccgcaacccgccagcctgcaatcggtggagaca cagagcaccagctcagaggagatggtgcccagctcgccctcgccccctccgcctcctcgg gtctacaagccatgcttcgtgtgcaatgacaagtcctctggctaccactatggggtcagc tcttgtgaaggctgcaagggcttctttcgccgaagcatccagaagaacatggtgtacacg tgtcaccgcgacaaaaactgtatcatcaacaaggtgaccaggaatcgctgccagtactgc cggctacagaagtgcttcgaagtgggcatgtccaaggaagctgtgcgaaatgaccggaac aagaagaagaaagaggtgaaggaagaagggtcacctgacagctatgagctgagccctcag ttagaagagctcatcaccaaggtcagcaaagcccatcaggagactttcccctcgctctgc cagctgggcaagtataccacgaactccagtgcagaccaccgcgtgcagctggatctgggg ctgtgggacaagttcagtgagctggctaccaagtgcatcatcaagatcgtggagtttgcc aagcggttgcctggctttacagggctcagcattgctgaccagatcactctgctcaaagct gcctgcctagatatcctgatgctgcgtatctgcacaaggtacaccccagagcaggacacc atgaccttctccgacgggctgaccctgaaccggacccagatgcacaatgccggcttcggg cccctcacagaccttgtctttgcctttgctgggcagctcctgcccctggagatggatgac accgagacagggctgctcagcgccatctgcctcatctgcggagaccgcatggacctggag gagcccgaaaaagtggacaagctgcaggagccactgctggaagccctgaggctgtacgcc cggcgccggcggcccagccagccctacatgttcccaaggatgctaatgaaaatcaccgac ctccggggcatcagcactaagggagctgaaagggccattactctgaagatggagattcca ggcccgatgcctcccttaatccgagagatgctggagaaccctgaaatgtttgaggatgac tcctcgcagcctggtccccaccccaatgcctctagcgaggatgaggttcctgggggccag ggcaaagggggcctgaagtccccagcctga >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_2|2525_aa MLVTAYLAFVGLLASCLGLELSRCRAKPPGRACSNPSFLRFQLDFYQVYFLALAADWLQA PYLYKLYQHYYFLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTYSLCCL TKLSQDYFVLLVGRALGGLSTALLFSAFEAWYIHEHVERHDFPAEWIPATFARAAFWNHV LAVVAGVAAEAVASWIGLGPVAPFVAAIPLLALAGALALRNWGENYDRQRAFSRTCAGGL RCLLSDRRVLLLGTIQALFESVIFIFVFLWTPVLDPHGAPLGIIFSSFMAASLLGSSLYR IATSKRYHLQPMHLLSLAVLIVVFSLFMLTFSTSPGQESPVESFIAFLLIELACGLYFPS MSFLRRKVIPETEQAGVLNWFRVPLHSLACLGLLVLHDSDRKTGTRNMFSICSAVMVMAL LAVVGLFTVLSGVMRSFKRVNFGTLLSSQKEAEELLPALKEFLSNPPAGFPSSRSDAERR QACDAILRACNQQLTAKLACPRHLGSLLELAELACDGYLVSTPQRPPLYLERILFVLLRN AAAQGSPEATLRLAQPLHACLVQCSREAAPQDYEAVARGSFSLLWKGAEALLERRAAFAA RLKALSFLVLLEDESTPCEVPHFASPTACRAVAAHQLFDASGHGLNEADADFLDDLLSRH VIRALVGERGSSSGLLSPQRALCLLELTLEHCRRFCWSRHHDKAISAVEKAHSYLRNTNL APSLQLCQLGVKLLQVGEEGPQAVAKLLIKASAVLSKSMEAPSPPLRALYESCQFFLSGL ERGTKRRYRLDAILSLFAFLGGYCSLLQQLRDDGVYGGSSKQQQSFLQMYFQGLHLYTVV VYDFAQGCQIVDLADLTQLVDSCKSTVVWMLEALEGLSGQELTDHMGMTASYTSNLAYSF YSHKLYAEACAISEPLCQHLGLVKPGTYPEVPPEKLHRCFRLQVESLKKLGKQAQGCKMV ILWLAALQPCSPEHMAEPVTFWVRVKMDAARAGDKELQLKTLRDSLSGWDPETLALLLRE ELQAYKAVRADTGQERFNIICDLLELSPEETPAGAWARATHLVELAQVLCYHDFTQQTNC SALDAIREALQLLDSVRPEAQARDQLLDDKAQALLWLYICTLEAKMQEGIERDRRAQAPG NLEEFEVNDLNYEDKLQEDRFLYSNIAFNLAADAAQSKCLDQALALWKELLTKGQAPAVR CLQQTAASLQILAALYQLVAKPMQALEVLLLLRIVSERLKDHSKAAGSSCHITQLLLTLG CPSYAQLHLEEAASSLKHLDQTTDTYLLLSLTCDLLRSQLYWTHQKVTKGVSLLLSVLRD PALQKSSKAWYLLRVQVLQLVAAYLSLPSNNLSHSLWEQLCAQGWQTPEIALIDSHKLLR SIILLLMGSDILSTQKAAVETSFLDYGENLVQKWQVLSEVLSCSEKLVCHLGRLGSVSEA KAFCLEALKLTTKLQIPRQCALFLVLKGELELARNDIDLCQSDLQQVLFLLESCTEFGGV TQHLDSVKKVHLQKGKQQAQVPCPPQLPEEELFLRGPALELVATVAKEPGPIAPSTNSSP VLKTKPQPIPNFLSHSPTCDCSLCASPVLTAVCLRWVLVTAGVRLAMGHQAQGLDLLQVV LKGCPEAAERLTQALQASLNHKTPPSLVPSLLDEILAQAYTLLALEGLNQPSNESLQKVL QSGLKFVAARIPHLEPWRASLLLIWALTKLGGLSCCTTQLFASSWGWQPPLIKSVPGSEP SKTQGQKRSGRGRQKLASAPLRLNNTSQKGLEGRGLPCTPKPPDRIRQAGPHVPFTVFEE VCPTESKPEVPQAPRVQQRVQTRLKVNFSDDSDLEDPVSAEAWLAEEPKRRGTASRGRGR ARKGLSLKTDAVVAPGSAPGNPGLNGRSRRAKKVASRHCEERRPQRASDQARPGPEIMRT IPEEELTDNWRKMSFEILRGSDGEDSASGGKTPAPGPEAASGEWELLRLDSSKKKLPSPC PDKESDKDLGPRLRLPSAPVATGLSTLDSICDSLSVAFRGISHCPPSGLYAHLCRFLALC LGHRDPYATAFLVTESVSITCRHQLLTHLHRQLSKAQKHRGSLEIADQLQGLSLQEMPGD VPLARIQRLFSFRALESGHFPQPEKESFQERLALIPSGVTVCVLALATLQPGTVGNTLLL TRLEKDSPPVSVQIPTGQNKLHLRSVLNEFDAIQKAQKENSSCTDKREWWTGRLALDHRM EVLIASLEKSVLGCWKGLLLPSSEEPGPAQEASRLQELLQDCGWKYPDRTLLKIMLSGAG ALTPQDIQALAYGLCPTQPERAQELLNEAVGRLQGLTVPSNSHLVLVLDKDLQKLPWESM PSLQALPVTRLPSFRFLLSYSIIKEYGASPVLSQGVDPRSTFYVLNPHNNLSSTEEQFRA NFSSYAGHGAGARFLDGQAVLRLSCRAVALLFGCSSAALAVRGNLEGAGIVLKYIMAGCP LFLGNLWDVTDRDIDRYTEALLQGWLGAGPGAPLLYYVNQARQAPRLKYLIGAAPIAYGL PVSLR >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_2|7578_bp atgctggtgactgcctaccttgcttttgtaggcctcctggcctcctgcctggggctggaa ctgtcaagatgccgggctaaaccccctggaagggcctgcagcaatccctccttccttcgg tttcaactggacttctatcaggtctacttcctggccctggcagctgattggcttcaggcc ccctacctctataaactctaccagcattactacttcctggaaggtcaaattgccatcctc tatgtctgtggccttgcctctacagtcctctttggcctagtggcctcctcccttgtggat tggctgggtcgcaagaattcttgtgtcctcttctccctgacttactcactatgctgctta accaaactctctcaagactactttgtgctgctagtggggcgagcacttggtgggctgtcc acagccctgctcttctcagccttcgaggcctggtatatccatgagcacgtggaacggcat gacttccctgctgagtggatcccagctacctttgctcgagctgccttctggaaccatgtg ctggctgtagtggcaggtgtggcagctgaggctgtagccagctggatagggctggggcct gtagcgccctttgtggctgccatccctctcctggctctggcaggggccttggcccttcga aactggggggagaactatgaccggcagcgtgccttctcaaggacctgtgctggaggcctg cgctgcctcctgtcggaccgccgcgtgctgctgttgggcaccatacaagctctatttgag agtgtcatcttcatctttgtcttcctctggacacctgtgctggacccacacggggcccct ctgggcattatcttctccagcttcatggcagccagcctgcttggctcttccctgtaccgt atcgccacctccaagaggtaccaccttcagcccatgcacctgctgtcccttgctgtgctc atcgtcgtcttctctctcttcatgttgactttctctaccagcccaggccaggagagtccg gtggagtccttcatagcctttctacttattgagttggcttgtggattatactttcccagc atgagcttcctacggagaaaggtgatccctgagacagagcaggctggtgtactcaactgg ttccgggtacctctgcactcactggcttgcctagggctccttgtcctccatgacagtgat cgaaaaacaggcactcggaatatgttcagcatttgctctgctgtcatggtgatggctctg ctggcagtggtgggactcttcaccgtgctctccggtgtcatgaggagcttcaaaagagtc aactttgggactctgctaagcagccagaaggaggctgaagagttgctgcccgccttgaag gagttcctgtccaaccctccagctggttttcccagcagccgatctgatgctgagaggaga caagcttgtgatgccatcctgagggcttgcaaccagcagctgactgctaagctagcttgc cctaggcatctggggagcctgctggagctggcagagctggcctgtgatggctacttagtg tctaccccacagcgtcctcccctctacctggaacgaattctctttgtcttactgcggaat gctgctgcacaaggaagcccagaggccacactccgccttgctcagcccctccatgcctgc ttggtgcagtgctctcgcgaggctgctccccaggactatgaggccgtggctcggggcagc ttttctctgctttggaagggggcagaagccctgttggaacggcgagctgcatttgcagct cggctgaaggccttgagcttcctagtactcttggaggatgaaagtaccccttgtgaggtt cctcactttgcttctccaacagcctgtcgagcggtagctgcccatcagctatttgatgcc agtggccatggtctaaatgaagcagatgctgatttcctagatgacctgctctccaggcac gtgatcagagccttggtgggtgagagagggagctcttctgggcttctttctccccagagg gccctctgcctcttggagctcaccttggaacactgccgtcgcttttgctggagccgccac catgacaaagccatcagcgcagtggagaaggctcacagttacctaaggaacaccaatcta gcccctagccttcagctatgtcagctgggggttaagctgctgcaggttggggaggaagga cctcaggcagtggccaagcttctgatcaaggcatcagctgtcctgagcaagagtatggag gcaccatcacccccacttcgggcattgtatgagagctgccagttcttcctttcaggcctg gaacgaggcaccaagaggcgctatagacttgatgccattctgagcctctttgcttttctt ggagggtactgctctcttctgcagcagctgcgggatgatggtgtgtatgggggctcctcc aagcaacagcagtcttttcttcagatgtactttcagggacttcacctctacactgtggtg gtttatgactttgcccaaggctgtcagatagttgatttggctgacctgacccaactagtg gacagttgtaaatctaccgttgtctggatgctggaggccttagagggcctgtcgggccaa gagctgacggaccacatggggatgaccgcttcttacaccagtaatttggcctacagcttc tatagtcacaagctctatgccgaggcctgtgccatctctgagccgctctgtcagcacctg ggtttggtgaagccaggcacttatcccgaggtgcctcctgagaagttgcacaggtgcttc cggctacaagtagagagtttgaagaaactgggtaaacaggcccagggctgcaagatggtg attttgtggctggcagccctgcaaccctgtagccctgaacacatggctgagccagtcact ttctgggttcgggtcaagatggatgcggccagggctggagacaaggagctacagctaaag actctgcgagacagcctcagtggctgggacccggagaccctggccctcctgctgagggag gagctgcaggcctacaaggcggtgcgggccgacactggacaggaacgcttcaacatcatc tgtgacctcctggagctgagccccgaggagacaccagccggggcctgggcacgagccacc cacctggtagaactggctcaggtgctctgctaccacgactttacgcagcagaccaactgc tctgctctggatgctatccgggaagccctgcagcttctggactctgtgaggcctgaggcc caggccagagatcagcttctggacgataaagcacaggccttgctgtggctttacatctgt actctggaagccaaaatgcaggaaggtatcgagcgggatcggagagcccaggcccctggt aacttggaggaatttgaagtcaatgacctgaactatgaagataaactccaggaagatcgt ttcctatacagtaacattgccttcaacctggctgcagatgctgctcagtccaaatgcctg gaccaagccctggccctgtggaaggagctgcttacaaaggggcaggccccagctgtacgg tgtctccagcagacagcagcctcactgcagatcctagcagccctctaccagctggtggca aagcccatgcaggctctggaggtcctcctgctgctacggattgtctctgagagactgaag gaccactcgaaggcagctggctcctcctgccacatcacccagctcctcctgaccctcggc tgtcccagctatgcccagttacacctggaagaggcagcatcgagcctgaagcatctcgat cagactactgacacatacctgctcctttccctgacctgtgatctgcttcgaagtcaactc tactggactcaccagaaggtgaccaagggtgtctctctgctgctgtctgtgcttcgggat cctgccctccagaagtcctccaaggcttggtacttgctgcgtgtccaggtcctgcagctg gtggcagcttaccttagcctcccgtcaaacaacctctcacactccctgtgggagcagctc tgtgcccaaggctggcagacacctgagatagctctcatagactcccataagctcctccga agcatcatcctcctgctgatgggcagtgacattctctcaactcagaaagcagctgtggag acatcgtttttggactatggtgaaaatctggtacaaaaatggcaggttctttcagaggtg ctgagctgctcagagaagctggtctgccacctgggccgcctgggtagtgtgagtgaagcc aaggccttttgcttggaggccctaaaacttacaacaaagctgcagataccacgccagtgt gccctgttcctggtgctgaagggcgagctggagctggcccgcaatgacattgatctctgt cagtcggacctgcagcaggttctgttcttgcttgagtcttgcacagagtttggtggggtg actcagcacctggactctgtgaagaaggtccacctgcagaaggggaagcagcaggcccag gtcccctgtcctccacagctcccagaggaggagctcttcctaagaggccctgctctagag ctggtggccactgtggccaaggagcctggccccatagcaccttctacaaactcctcccca gtcttgaaaaccaagccccagcccatacccaacttcctgtcccattcacccacctgtgac tgctcgctctgcgccagccctgtcctcacagcagtctgtctgcgctgggtattggtcacg gcaggggtgaggctggccatgggccaccaagcccagggtctggatctgctgcaggtcgtg ctgaagggctgtcctgaagccgctgagcgcctcacccaagctctccaagcttccctgaat cataaaacacccccctccttggttccaagcctcttggatgagatcttggctcaagcatac acactgttggcactggagggcctgaaccagccatcaaacgagagcctgcagaaggttcta cagtcagggctgaagtttgtagcagcacggataccccacctagagccctggcgagccagc ctgctcttgatttgggccctcacaaaactaggtggcctcagctgctgtactacccaactt tttgcaagctcctggggctggcagccaccattaataaaaagtgtccctggctcagagccc tctaagactcagggccaaaaacgttctggacgagggcgccaaaagttagcctctgctccc ctgcgcctcaataatacctctcagaaaggtctggaaggtagaggactgccctgcacacct aaacccccagaccggatcaggcaagctggccctcatgtccccttcacggtgtttgaggaa gtctgccctacagagagcaagcctgaagtaccccaggcccccagggtacaacagagagtc cagacgcgcctcaaggtgaacttcagtgatgacagtgacttggaagaccctgtctcagct gaggcctggctggcagaggagcctaagagacggggcactgcttcccggggccgggggcga gcaaggaagggcctgagcctaaagacggatgccgtggttgccccaggtagtgcccctggg aaccctggcctgaatggcaggagccggagggccaagaaggtggcatcaagacattgtgag gagcggcgtccccagagggccagtgaccaggccaggcctggccctgagatcatgaggacc atccctgaggaagaactgactgacaactggagaaaaatgagctttgagatcctcaggggc tctgacggggaagactcagcctcaggtgggaagactccagctccgggccctgaggcagct tctggagaatgggagctgctgaggctggattccagcaagaagaagctgcccagcccatgc ccagacaaggagagtgacaaggaccttggtcctcggctccggctcccctcagcccccgta gccactggtctttctaccctggactccatctgtgactccctgagtgttgctttccggggc attagtcactgtcctcctagtgggctctatgcccacctctgccgcttcctggccttgtgc ctgggccaccgggatccttatgccactgctttccttgtcaccgagtctgtctccatcacc tgtcgccaccagctgctcacccacctccacagacagctcagcaaggcccagaagcaccga ggatcacttgaaatagcagaccagctgcaggggctgagccttcaggagatgcctggagat gtccccctggcccgcatccagcgcctcttttccttcagggctttggaatctggccacttc ccccagcctgaaaaggagagtttccaggagcgcctggctctgatccccagtggggtgact gtgtgtgtgttggccctggccaccctccagcccggaaccgtgggcaacaccctcctgctg acccggctggaaaaggacagtcccccagtcagtgtgcagattcccactggccagaacaag cttcatctgcgttcagtcctgaatgagtttgatgccatccagaaggcacagaaagagaac agcagctgtactgacaagcgagaatggtggacagggcggctggcactggaccacaggatg gaggttctcatcgcttccctagagaagtctgtgctgggctgctggaaggggctgctgctg ccgtccagtgaggagcccggccctgcccaggaggcctcccgcctacaggagctgctacag gactgtggctggaaatatcctgaccgcactctgctgaaaatcatgctcagtggtgccggt gccctcacccctcaggacattcaggccctggcctacgggctgtgcccaacccagccagag cgagcccaggagctcctgaatgaggcagtaggacgtctacagggcctgacagtaccaagc aatagccaccttgtcttggtcctagacaaggacttgcagaagctgccgtgggaaagcatg cccagcctccaagcactgcctgtcacccggctgccctccttccgcttcctactcagctac tccatcatcaaagagtatggggcctcgccagtgctgagtcaaggggtggatccacgaagt accttctatgtcctgaaccctcacaataacctgtcaagcacagaggagcaatttcgagcc aatttcagcagctatgcagggcatggggctggtgcccgcttccttgatgggcaggctgtc ctgcggctgagctgtcgggcagtggccctgctgtttggctgtagcagtgcggccctggct gtgcgtggaaacctggagggggctggcatcgtgctcaagtacatcatggctggttgcccc ttgtttctgggtaatctctgggatgtgactgaccgcgacattgaccgctacacggaagct ctgctgcaaggctggcttggagcaggcccaggggccccccttctctactatgtaaaccag gcccgccaagctccccgactcaagtatcttattggggctgcacctatagcctatggcttg cctgtctctctgcggtaa >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_3|577_aa MAQSINITELNLPQLEMLKNQLDQEVEFLSTSIAQLKVVQTKYVEAKDCLNVLNKSNEGM GFSYLKRENPLHIVYRFMNPLHVACLELKSTGHSSAPIAPVSVLFTLSMYVPGKLHDVEH VLIDVGTGYYVEKTAEDAKDFFKRKIDFLTKQMEKIQPALQEKHAMKQELVAKGTVGKRK WGCAGASSSGSALLPPCRELLMGHQFLRGLLTLLLPPPPLYTRHRMLGPESVPPPKRSRS KLMAPPRIGTHNGTFHCDEALACALLRLLPEYRDAEIVRTRDPEKLASCDIVVDVGGEYD PRRHRYDHHQRSFTETMSSLSPGKPWQTKLSSAGLIYLHFGHKLLAQLLGTSEEDSMVGT LYDKMYENFVEEVDAVDNGISQWAEGEPRYALTTTLSARVARLNPTWNHPDQDTEAGFKR AMDLVQEEFLQRLDFYQHSWLPARALVEEALAQRFQVDPSGEIVELAKGACPWKEHLYHL ESGLSPPVAIFFVIYTDQAGQWRIQCVPKEPHSFQSRLPLPEPWRGLRDEALDQVSGIPG CIFVHASGFTGGHHTREGALSMARATLAQRSYLPQIS >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_3|1734_bp atggcgcagtctattaacatcacggagctgaatctgccgcagctagaaatgctcaagaac cagctggaccaggaagtggagttcttgtccacgtccattgctcagctcaaagtggtacag accaagtatgtggaagccaaggactgtctgaacgtgctgaacaagagcaacgagggtatg ggtttttcttacctgaaacgagaaaatccattacatatcgtataccgcttcatgaaccct ttgcatgttgcctgcctagaattgaaaagtacaggacattcctctgctcctattgcccct gtttccgttcttttcacactgtctatgtatgtccctgggaagctgcatgatgtggaacac gtgctcatcgatgtgggaactgggtactatgtagagaagacagctgaggatgccaaggac ttcttcaagaggaagatagattttctaaccaagcagatggagaaaatccaaccagctctt caggagaagcacgccatgaaacaggagctggttgccaagggaacggttggcaagcggaag tggggctgcgctggcgcttcctcttccgggtcggcgctcctgcctccctgcagggagctg cttatgggacaccaattcctgcgcggcctcttaacgctgctgctgccgccgccacccctg tatacccggcaccgcatgctcggtccagagtccgtcccgcccccaaaacgatcccgcagc aaactcatggcaccgccccgaatcgggacgcacaatggcaccttccactgcgacgaggca ctggcatgcgcactgcttcgcctcctgccggagtaccgggatgcagagattgtgcggacc cgggatcccgaaaaactcgcttcctgtgacatcgtggtggacgtggggggcgagtacgac cctcggagacaccgatatgaccatcaccagaggtctttcacagagaccatgagctccctg tcccctgggaagccgtggcagaccaagctgagcagtgcgggactcatctatctgcacttc gggcacaagctgctggcccagttgctgggcactagtgaagaggacagcatggtgggcacc ctctatgacaagatgtatgagaactttgtggaggaggtggatgctgtggacaatgggatc tcccagtgggcagagggggagcctcgatatgcactgaccactaccctgagtgcacgagtt gctcgacttaatcctacctggaaccaccccgaccaagacactgaggcagggttcaagcgt gcaatggatctggttcaagaggagtttctgcagagattagatttctaccaacacagctgg ctgccagcccgggccttggtggaagaggcccttgcccagcgattccaggtggacccaagt ggagagattgtggaactggcgaaaggtgcatgtccctggaaggagcatctctaccacctg gaatctgggctgtcccctccagtggccatcttctttgttatctacactgaccaggctgga cagtggcgaatacagtgtgtgcccaaggagccccactcattccaaagccggctgcccctg ccagagccatggcggggtcttcgggacgaggccctggaccaggtcagtgggatccctggc tgcatcttcgtccatgcaagcggcttcactggcggtcaccacacccgagagggtgccttg agcatggcccgtgccaccttggcccagcgctcatacctcccacaaatctcctag >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_4|546_aa MCSLGLFPPPPPRGQVTLYEHNNELVTGSSYESPPPDFRGQWINLPVLQLTKDPLKTPGR LDHGTRTAFIHHREQVWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCRW ASSLHGSLFPHLSLRSEDLIAEFAQVTNWSSCCLRVFAWHPHTNKFAVALLDDSVRVYNA SSTIVPSLKHRLQRNVASLAWKPLSASVLAVACQSCILIWTLDPTSLSTRPSSGCAQVLS HPGHTPVTSLAWAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTNLLWSPDGS KILATTPSAVFRVWEAQMWTCERWPTLSGRCQTGCWSPDGSRLLFTVLGEPLIYSLSFPE RCGEGKGCVGGAKSATIVADLSETTIQTPDGEERLGGEAHSMVWDPSGERLAVLMKGKPR VQDGKPVILLFRTRNSPVFELLPCGIIQGEPGAQPQLITFHPSFNKGALLSVGWSTGRIA HIPLYFVNAQFPRFSPVLGRAQEPPAGGGGSIHDLPLFTETSPTSAPWDPLPGPPPVLPH SPHSHL >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_4|1641_bp atgtgctctctggggttgttccctcctccaccgcctcggggtcaagtcaccctatatgag cacaataacgagctggtgacgggcagtagctatgagagcccgccccccgacttccggggc cagtggatcaatcttcctgtcctacaactgacaaaggatcccctaaagacccctggaagg ctggaccatggcacaagaactgccttcatccatcaccgggagcaagtgtggaagagatgc atcaacatttggcgtgatgtgggcctttttggggtgctaaatgaaattgcaaactcagaa gaagaggtgtttgagtgggtgaagacggcatccggctgggccctggcactctgtcgatgg gcctcttccctccatgggtccctgttcccccatctgtctctcaggagcgaagatctgatc gctgaatttgcccaagtcacaaattggtccagctgctgcttgcgtgtctttgcatggcac ccccacaccaacaagtttgcagtggccctgctagatgactcagtccgtgtgtataatgcc agcagcaccatagtcccctccctgaagcaccggctgcagcgaaatgtggcgtctctggcc tggaagccccttagtgcctctgtcttggctgtggcctgccagagctgcattcttatctgg accctggaccctacctccttgtctacccgaccctcttctggctgtgcccaagtgctgtct caccctgggcatacacctgttaccagcttggcctgggcccccagtggggggcggctgctc tcagcttcacccgtggatgctgctatccgggtatgggatgtctcaacagagacctgtgtc ccccttccctggttccgaggaggtggggtgaccaacctgctctggtccccagacggcagc aaaatcctggctaccactccttcagctgtctttcgagtctgggaggcccagatgtggact tgtgagaggtggcctactctatcagggcgctgtcagactggctgctggagcccagatggc agccgactgctgttcactgtattgggagagccactgatttactccctgtcttttccagaa cgttgtggtgagggaaaggggtgcgttggaggtgcaaagtcagcaacgattgtggcagat ctgtctgagacaacaatacagacaccagatggtgaggagaggcttgggggagaggctcac tccatggtctgggaccccagtggggaacgtctggctgtgcttatgaaaggaaagccaagg gtacaggatggtaaaccagtcatcctcctttttcgcactcgaaacagccctgtgtttgag ctccttccctgtggcattatccagggggagccaggagcccagccccagctcatcactttc catccttccttcaacaaaggggccctgctcagtgtgggctggtccacaggccgaattgcc cacatcccgctgtactttgtcaatgcccagtttccacgttttagcccagtgcttgggcgg gcccaggaaccccctgctgggggtggaggctctattcatgacctgcccctctttactgag acatccccaacctctgccccttgggaccctctcccagggccaccacctgttctgccccac tccccacattcccacctctaa >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_5|472_aa MSQRGRRAAGAWVGVGMLGFPRTREFAGAIWVSELWCVCGVLDQKYLKEEVHYGSSPLAM LTAACSKFGGSSPLRDSTTLGKAGTKKPYSVGSDLSASKTMGDAYPAPFTSTNGLLSPAG SPPAPTSGYANDYPPFSHSFPGPTGTQDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWY KAGIHAGISPGPGNTPTPWWDMHPGGNWLGGGQGQGDGLQGTLPTGPAQPPLNPQLPTYP SDFAPLNPAPYPAPHLLQPGPQHVLPQDVYKPKAVGNSGQLEGSGGAKPPRGASTGGSGG YGGSGAGRSSCDCPNCQELERLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHT GERPFVCNWLFCGKRFTRSDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGEP GPGPPPSGPKELGEGRSTGEEEASQTPRPSASPATPEKAPGGSPEQSNLLEI >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_5|1419_bp atgagtcagcggggccgcagagcagcgggggcctgggtgggggtagggatgctggggttc cccaggaccagggagttcgctggggccatctgggtctctgaactctggtgtgtgtgtgga gtattggatcaaaagtacctaaaggaggaagttcactatggctccagtcccctggccatg ctgacggcagcgtgcagcaaatttggtggctctagccctctgcgggactcaacaactctg ggcaaagcaggcacaaagaagccgtactctgtgggcagtgacctttcagcctccaaaacc atgggggatgcttatccagccccctttacaagcactaatgggctcctttcacctgcaggc agtcctccagcacccacctcaggctatgctaatgattaccctcccttttcccactcattc cctgggcccacaggcacccaggaccctgggctactagtgcccaaggggcacagctcttct gactgtctgcccagtgtctacacctctctggacatgacacacccctatggctcctggtac aaggcaggcatccatgcaggcatttcaccaggcccaggcaacactcctactccatggtgg gatatgcaccctggaggcaactggctaggtggtgggcagggccagggtgatgggctgcaa gggacactgcccacaggtccagctcagcctccactgaacccccagctgcccacctaccca tctgactttgctccccttaatccagccccctacccagctccccacctcttgcaaccaggg ccccagcatgtcttgccccaagatgtctataaacccaaggcagtgggaaatagtgggcag ctagaagggagtggtggagccaaacccccacggggtgcaagcactgggggtagtggtgga tatgggggcagtggggcagggcgctcctcctgcgactgccctaattgccaggagctagag cggctgggagcagcagcggctgggctgcggaagaagcccatccacagctgccacatccct ggctgcggcaaggtgtatggcaaggcttcgcacctgaaggcccacttgcgctggcacaca ggcgagaggcccttcgtctgcaactggctcttctgcggcaagaggttcactcgttcggat gagctggagcgtcatgtgcgcactcacacccgggagaagaagttcacctgcctgctctgc tccaagcgctttacccgaagcgaccacctgagcaaacaccagcgcacccatggagaacca ggcccgggtccccctcccagtggccccaaggagctgggggagggccgcagcacgggggaa gaggaggccagtcagacgccccgaccttctgcctcgccagcaaccccagagaaagcccct ggaggcagccctgagcagagcaacttgctggagatctga >gi568815586r:53207492_53421465|GENSCAN_predicted_peptide_6|844_aa MGRPPRGRGQRGLARPLPAPATGDGPYPPLLGRPPEAPPAGGWSRGGGPSSEGPARANRL PDQDHSMDEMTAVVKIEKGVGGNNGGNGNGGGAFSQARSSSTGSSSSTGGGGQESQPSPL ALLAATCSRIESPNENSNNSQGPSQSGGTGELDLTATQLSQGANGWQIISSSSGATPTSK EQSGSSTNGSNGSESSKNRTVSGGQYVVAAAPNLQNQQVLTGLPGVMPNIQYQVIPQFQT VDGQQLQFAATGAQVQQDGSGQIQIIPGANQQIITNRGSGGNIIAAMPNLLQQAVPLQGL ANNVLSGQTQYVTNVPVALNGNITLLPVNSVSAATLTPSSQAVTISSSGSQESGSQPVTS GTTISSASLVSSQASSSSFFTNANSYSTTTTTSNMGIMNFTTSGSSGTNSQGQTPQRVSG LQGSDALNIQQNQTSGGSLQAGQQKEGEQNQQTQQQQILIQPQLVQGGQALQALQAAPLS GQTFTTQAISQETLQNLQLQAVPNSGPIIIRTPTVGPNGQVSWQTLQLQNLQVQNPQAQT ITLAPMQGVSLGQTSSSNTTLTPIASAASIPAGTVTVNAAQLSSMPGLQTINLSALGTSG IQVHPIQGLPLAIANAPGDHGAQLGLHGAGGDGIHDDTAGGEEGENSPDAQPQAGRRTRR EACTCPYCKDSEGRGSGDPGKKKQHICHIQGCGKVYGKTSHLRAHLRWHTGERPFMCTWS YCGKRFTRSDELQRHKRTHTGEKKFACPECPKRFMRSDHLSKHIKTHQNKKGGPGVALSV GTLPLDSGAGSEGSGTATPSALITTNMVAMEAICPEGIARLANSGINVMQVADLQSINIS GNGF >gi568815586r:53207492_53421465|GENSCAN_predicted_CDS_6|2535_bp atgggccgcccgccccgggggagggggcagcgtggcctcgcccgccccctgcccgccccg gccacgggggacgggccttaccccccactactcggccgcccgcctgaggctcctcccgcc gggggctggagccgcgggggcggcccgagcagcgaaggccccgcccgggccaaccgcctg cctgaccaagatcactccatggatgaaatgacagctgtggtgaaaattgaaaaaggagtt ggtggcaataatgggggcaatggtaatggtggtggtgccttttcacaggctcgaagtagc agcacaggcagtagcagcagcactggaggaggagggcaggagtcccagccatcccctttg gctctgctggcagcaacttgcagcagaattgagtcacccaatgagaacagcaacaactcc cagggcccgagtcagtcagggggaacaggtgagcttgacctcacagccacacaactttca cagggtgccaatggctggcagatcatctcttcctcctctggggctacccctacctcaaag gaacagagtggcagcagtaccaatggcagcaatggcagtgagtcttccaagaatcgcaca gtctctggtgggcagtatgttgtggctgccgctcccaacttacagaaccagcaagttctg acaggactacctggagtgatgcctaatattcagtatcaagtaatcccacagttccagacc gttgatgggcaacagctgcagtttgctgccactggggcccaagtgcagcaggatggttct ggtcaaatacagatcataccaggtgcaaaccaacagattatcacaaatcgaggaagtgga ggcaacatcattgctgctatgccaaacctactccagcaggctgtccccctccaaggcctg gctaataatgtactctcaggacagactcagtatgtgaccaatgtaccagtggccctgaat gggaacatcaccttgctacctgtcaacagcgtttctgcagctaccttgactcccagctct caggcagtcacgatcagcagctctgggtcccaggagagtggctcacagcctgtcacctca gggactaccatcagttctgccagcttggtatcatcacaagccagttccagctcctttttc accaatgccaatagctactcaactactactaccaccagcaacatgggaattatgaacttt actaccagtggatcatcagggaccaactctcaaggccagacaccccagagggtcagtggg ctacaggggtctgatgctctgaacatccagcaaaaccagacatctggaggctcattgcaa gcaggccagcaaaaagaaggagagcaaaaccagcagacacagcagcaacaaattcttatc cagcctcagctagttcaagggggacaggccctccaggccctccaagcagcaccattgtca gggcagacctttacaactcaagccatctcccaggaaaccctccagaacctccagcttcag gctgttccaaactctggtcccatcatcatccggacaccaacagtggggcccaatggacag gtcagttggcagactctacagctgcagaacctccaagttcagaacccacaagcccaaaca atcaccttagccccaatgcagggtgtttccttggggcagaccagcagcagcaacaccact ctcacacccattgcctcagctgcttccattcctgctggcacagtcactgtgaatgctgct caactctcctccatgccaggcctccagaccattaacctcagtgcattgggtacttcagga atccaggtgcacccaattcaaggcctgccgttggctatagcaaatgccccaggtgatcat ggagctcagcttggtctccatggggctggtggtgatggaatacatgatgacacagcaggt ggagaggaaggagaaaacagcccagatgcccaaccccaagccggtcggaggacccggcgg gaagcatgcacctgcccctactgtaaagacagtgaaggaaggggctcgggggatcctggc aaaaagaaacagcatatttgccacatccaaggctgtgggaaagtgtatggcaagacctct cacctgcgggcacacttgcgctggcatacaggcgagaggccatttatgtgtacctggtca tactgtgggaaacgcttcacacgttcggatgagctacagaggcacaaacgtacacacaca ggtgagaagaaatttgcctgccctgagtgtcctaagcgcttcatgaggagtgaccacctg tcaaaacatatcaagacccaccagaataagaagggaggcccaggtgtagctctgagtgtg ggcactttgcccctggacagtggggcaggttcagaaggcagtggcactgccactccttca gcccttattaccaccaatatggtagccatggaggccatctgtccagagggcattgcccgt cttgccaacagtggcatcaacgtcatgcaggtggcagatctgcagtccattaatatcagt ggcaatggcttctga