GENSCAN 1.0    Date run: 5-Nov-116    Time: 11:03:32

Sequence gi568815597f:151162160_151367340 : 205181 bp : 47.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3663   4044  382  0  1   62   80   171 0.682   8.61
 1.02 Intr +   4312   4382   71  0  2   84  107    82 0.998   7.68
 1.03 Intr +   4775   4863   89  2  2   80  100    28 0.996   2.81
 1.04 Intr +   4962   5059   98  1  2  133   74    49 0.995   7.73
 1.05 Intr +   5167   5255   89  0  2  120   80    22 0.688   3.37
 1.06 Intr +   5985   6179  195  0  0   94  113    64 0.677   8.03
 1.07 Term +   6827   6926  100  2  1   92   43   112 0.999   4.70
 1.08 PlyA +   6950   6955    6                               1.05

 2.10 PlyA -   7847   7842    6                               1.05
 2.09 Term -   7944   7922   23  1  2   93   41    23 0.642  -3.43
 2.08 Intr -   8504   8360  145  2  1  118   83   170 0.877  19.36
 2.07 Intr -   8904   8761  144  0  0   46   76   171 0.985  12.18
 2.06 Intr -   9381   9274  108  0  0   87   73    33 0.815   2.18
 2.05 Intr -   9604   9474  131  2  2   80   68   123 0.801   9.91
 2.04 Intr -  10198  10109   90  2  0  114   73    86 0.997   9.67
 2.03 Intr -  11456  11340  117  0  0   86   94   137 0.999  14.54
 2.02 Intr -  12388  12232  157  1  1   35   51   173 0.969   7.88
 2.01 Init -  12716  12594  123  2  0   91   75   163 0.999  15.47
 2.00 Prom -  13289  13250   40                              -7.96

 3.07 PlyA -  13901  13896    6                               1.05
 3.06 Term -  14872  14485  388  0  1  118   49   335 0.936  27.01
 3.05 Intr -  15986  15842  145  0  1   55   96   183 0.999  15.14
 3.04 Intr -  22334  22158  177  0  0   94  106   112 0.993  13.49
 3.03 Intr -  23461  23347  115  1  1   95   79   109 0.599  10.72
 3.02 Intr -  23791  23639  153  1  0  109   87   284 0.999  30.67
 3.01 Init -  26873  26871    3  2  0   98   53     0 0.503  -2.50
 3.00 Prom -  28925  28886   40                              -4.66

 4.00 Prom +  29657  29696   40                              -8.36
 4.01 Init +  37624  37676   53  0  2   81   75    94 0.823   8.03
 4.02 Intr +  55952  56079  128  2  2   41   84    66 0.722   1.82
 4.03 Intr +  62086  62120   35  2  2  121  105    31 0.960   6.04
 4.04 Intr +  65161  65241   81  0  0   96   93    27 0.931   3.83
 4.05 Intr +  69512  69642  131  1  2   89   92    84 0.999   8.39
 4.06 Intr +  70089  70206  118  0  1   65   99   172 0.999  16.47
 4.07 Intr +  70392  70544  153  2  0   78   91    90 0.994   8.57
 4.08 Intr +  72038  72337  300  1  0  103   97   255 0.994  24.83
 4.09 Intr +  74399  74604  206  1  2   83  100   126 0.988  11.30
 4.10 Intr +  76023  76106   84  0  0  100   94    36 0.951   4.34
 4.11 Intr +  76971  77019   49  0  1   83   83    24 0.890   0.08
 4.12 Intr +  77796  77880   85  2  1  125   95   107 0.994  14.49
 4.13 Intr +  80279  80408  130  0  1  106   81    30 0.960   3.85
 4.14 Intr + 100002 100175  174  0  0   41   80   107 0.514   4.25
 4.15 Intr + 101755 101869  115  1  1   76   84   203 0.995  19.15
 4.16 Intr + 102673 102759   87  0  0   64   76   146 0.996  11.17
 4.17 Intr + 103007 103075   69  1  0   85  105    55 0.984   6.28
 4.18 Intr + 103235 103450  216  1  0   77   85   248 0.998  22.10
 4.19 Intr + 103845 103953  109  2  1   86   62   126 0.820   9.66
 4.20 Intr + 104149 104280  132  2  0  104   99   210 0.998  24.32
 4.21 Intr + 104361 104428   68  1  2   69  100    54 0.999   3.32
 4.22 Intr + 105014 105180  167  1  2   93   44   294 0.014  24.26
 4.23 Intr + 121029 121159  131  0  2   57   86   101 0.723   7.14
 4.24 Intr + 124116 126247 2132  1  2  100   82  1463 0.983 134.24
 4.25 Intr + 126369 126547  179  2  2   74   51    91 0.998   2.72
 4.26 Intr + 126936 127112  177  0  0  111   61    95 0.992   8.13
 4.27 Intr + 127219 127381  163  1  1   99   87    94 0.995  10.38
 4.28 Intr + 127519 127848  330  0  0  108  113   210 0.995  21.33
 4.29 Intr + 127963 128075  113  0  2  100  111    17 0.999   4.38
 4.30 Intr + 128273 128414  142  2  1  121   96   125 0.999  16.96
 4.31 Term + 128556 129050  495  2  0   83   34   427 0.999  31.37
 4.32 PlyA + 129289 129294    6                              -1.75

 5.14 PlyA - 129662 129657    6                               1.05
 5.13 Term - 130874 130693  182  0  2  111   47   395 0.994  35.47
 5.12 Intr - 131303 131241   63  0  0  128   65    50 0.940   5.39
 5.11 Intr - 131979 131859  121  0  1  126   81   206 0.959  23.77
 5.10 Intr - 132382 132250  133  0  1  116   76   195 0.798  21.75
 5.09 Intr - 136914 136649  266  0  2   68   96   219 0.956  17.01
 5.08 Intr - 139808 139685  124  1  1  123  110   141 0.998  20.39
 5.07 Intr - 140139 140035  105  2  0   56  113    96 0.992   8.23
 5.06 Intr - 141491 141382  110  2  2   83   72   148 0.988  11.78
 5.05 Intr - 144204 143977  228  0  0   78   78   386 0.974  34.67
 5.04 Intr - 145642 145415  228  1  0   36   71   284 0.970  19.57
 5.03 Intr - 148096 148052   45  1  0   86   94    28 0.676   1.81
 5.02 Intr - 154350 153414  937  2  1  112   83   989 0.640  91.99
 5.01 Init - 154869 154859   11  0  2   92   31     5 0.274  -4.78
 5.00 Prom - 156646 156607   40                              -4.96

 6.13 PlyA - 159206 159201    6                               1.05
 6.12 Term - 164214 164017  198  0  0   60   50    94 0.815   0.20
 6.11 Intr - 165279 165112  168  0  0   14   99   166 0.384  10.44
 6.10 Intr - 181019 180126  894  2  0   92   59   160 0.118   5.01
 6.09 Intr - 181283 181183  101  0  2   88  110   112 0.518  13.23
 6.08 Intr - 181723 181522  202  1  1   52   70   139 0.544   7.36
 6.07 Intr - 182119 181996  124  0  1   14   98    -7 0.211  -6.51
 6.06 Intr - 182377 182258  120  0  0   93    7   141 0.543   6.11
 6.05 Intr - 182688 182569  120  2  0   85   72   134 0.620  11.11
 6.04 Intr - 183029 182947   83  2  2   97  100    72 0.997   7.74
 6.03 Intr - 183802 183769   34  0  1  115  119    30 0.997   7.03
 6.02 Intr - 184174 184046  129  0  0   86   88    45 0.300   4.11
 6.01 Init - 193256 193222   35  2  2  114   53    27 0.163  -0.54
 6.00 Prom - 194512 194473   40                              -4.26

 7.08 PlyA - 195321 195316    6                               1.05
 7.07 Term - 202546 202384  163  0  1  102   43   190 0.999  13.31
 7.06 Intr - 202885 202767  119  1  2   73   94   138 0.999  12.16
 7.05 Intr - 203122 203030   93  1  0   70  116    61 0.819   7.26
 7.04 Intr - 203525 203404  122  0  2   90   49   232 0.659  19.71
 7.03 Intr - 203687 203609   79  2  1   79  109   104 0.999  10.72
 7.02 Intr - 204294 204116  179  1  2   81   46   233 0.994  18.04
 7.01 Intr - 204745 204563  183  2  0  113   85   195 0.999  21.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 105014 105184  171  1  0   93   53   310 0.986  25.83

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_1|341_aa
XLLSLQTPQIRQDARAWIQLSGKVFEKRLHFPKGSASAESQGTIGNAVPSEHQRVDFRIA
QCMRGSASPEVRRPTGREGKKRRRCPALATVAWRSGETGTENLSLLGLTVVMSFKREGDD
WSQLNVLKKRRVGDLLASYIPEDEALMLRDGRFACAICPHRPVLDTLAMLTAHRAGKKHL
SSLQLFYGKKQPGKERKQNPKHQNELRREETKAEAPLLTQTRLITQSALHRAPHYNSCCR
RKYRPEAPGPSVSLSPMPPSEVKLQSGKISREPEPAAGPQAEESATVSAPAPMSPTRRRA
LDHYLTLRSSGWIPDGRGRWVKDENVEFDSDEEEPPDLPLD

>gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_1|1026_bp
nngttgttgtccctccaaacgcctcagatccgccaagatgcaagggcctggatccagttg
agcggcaaggtctttgagaaaagacttcatttcccaaaaggctccgcgtctgcggagtcg
cagggcaccataggaaatgcagttccctccgagcaccagagagtggatttccggatcgcg
caatgcatgcgtggaagtgcgtccccggaagtacggaggccgacaggaagagaaggaaaa
aagagaaggcgctgtcccgctcttgctacggtggcctggaggagtggcgaaaccggaaca
gagaatttatcacttctgggactcacagtcgtgatgtctttcaagagggaaggagacgat
tggagtcaactcaatgtgctcaaaaaaagaagagtcggggacctcctagccagttacatt
ccagaggatgaggcgctgatgcttcgggatggacgctttgcttgtgccatctgcccccat
cgaccggtactggacaccctggccatgctgactgcccaccgtgcaggcaagaaacatctg
tccagcttgcagcttttctatggcaagaagcagccgggaaaggaaagaaagcagaatcca
aaacatcagaatgaattgagaagggaagaaaccaaagctgaggctcctctgctaactcag
acacgacttatcacccagagtgctctgcacagagctccccactataacagttgctgccgc
cggaagtacagaccagaagcccctggtccctctgtctccctttcccctatgccaccctca
gaggtcaaactccaaagtgggaagatcagtagggaacctgaacctgcggctggcccacag
gccgaggagtcagcaactgtctcagcccctgcacccatgagccccacaagaagacgagcc
ctggaccattatctcacccttcgaagctctggatggatcccagatggacgaggtcgatgg
gtaaaagatgaaaatgttgagtttgactctgatgaggaggaaccacctgatctccccttg
gactga

>gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_2|345_aa
MSSYQKELEKYRDIDEDEILRTLSPEELEQLDCELQEMDPENMLLPAGLRQRDQTKKSPT
GPLDREALLQYLEQQALEVKERDDLVPFTGEKKGKPYIQPKREIPAEEQITLEPELEEAL
AHATDAEMCDIAAILDMYTLMSNKQYYDALCSGEICNTEGISSVVQPDKYKPVPDEPPNP
TNIEEILKRVRSNDKELEEVNLNNIQDIPIPMLSELCEAMKANTYVRSFSLVATRSGDPI ANAVADMLRENRSLQSLNIESNFISSTGLMAVLKAVRENATLTELRVDNQRQWPGDAVEM EMATVLEQCPSIVRFGYHFTQQGPRARAAQAMTRNNELRRQQKKR >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_2|1038_bp atgtcatcatatcagaaggaactggagaaatacagagacatagatgaagatgagatccta aggaccttgagccccgaggagctagagcagctggactgcgaactacaggagatggatcct gagaacatgctcctgccagctggactaagacaacgtgaccagacaaagaagagcccaacg gggccactggaccgagaggcccttttgcagtacttggagcaacaggcactagaagtcaaa gagcgtgatgacttggtgcccttcacaggcgagaagaaggggaaaccctatattcagccc aagagggaaatcccagcagaggagcagatcaccctggagcctgagctggaggaggcactg gcacatgccacagatgctgaaatgtgtgacattgcagcaattctggacatgtacacactg atgagtaacaagcaatactatgatgccctctgcagtggagaaatctgcaacactgaaggc attagcagtgtggtacagcctgacaagtataagccagtgccggatgaacccccaaatccc acaaacattgaggagatactaaagagggtccgaagcaatgacaaggagctggaggaggtg aacttgaataatatacaggacatcccaatacccatgctaagtgagctgtgtgaggcaatg aaggcaaatacctatgtgcggagcttcagtctggtagccacgaggagtggtgaccccatt gccaatgcagtggctgacatgttgcgtgagaatcgtagcctccagagcctaaacatcgaa tccaacttcattagcagcacaggactcatggctgtgctgaaggcagttcgggaaaatgcc acactcactgagctccgtgtagacaatcagcgccagtggcctggtgatgcagtggagatg gagatggccaccgtgctagagcagtgtccctctattgtccgctttggctaccactttaca cagcaggggccacgagctcgggcagcccaggccatgacccgaaacaatgaactacgtcgc cagcaaaagaagagataa >gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_3|326_aa MESGDDEYQGDQSDTEDEVDSDFDIDEGDEPSSDGEAEEPRRKRRVVTKAYKEPLKSLRP RKVNTPAGSSQKAREEKALLPLELQDDGSDSRKSMRQSTAEHTRQTFLRVQERQGQSRRR KGPHCERPLTQEELLREAKITEELNLRSLETYERLEADKKKQVHKKRKCPGPIITYHSVT VPLVGEPGPKEENVDIEGLDPAPSVSALTPHAGTGPVNPPARCSRTFITFSDDATFEEWF PQGRPPKVPVREVCPVTHRPALYRDPVTDIPYATARAFKIIREAYKKYITAHGLPPTASA LGPGPPPPEPLPGSGPRALRQKIVIK >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_3|981_bp atggaatccggagatgatgagtatcaaggggaccagtcagacacagaggacgaagtggac tctgactttgacattgatgaaggggatgaaccatccagtgatggagaagcagaagagcca agaaggaagcgccgagtagtcaccaaggcctataaggaacctctcaagagcttaaggcct cgaaaggtcaacaccccggctggtagctctcagaaggcgcgagaagagaaggcactactg 
ccattagaactacaagatgacggctctgacagtcggaagtctatgcgtcagtctacagct gagcatacacgacaaacgttccttcgggtacaggagaggcagggccagtcaagacggcga aaggggccccactgtgagcggccactaacccaggaggaactgctccgggaggccaagatc acagaagagcttaatttacggtcactggagacatatgagcggctcgaggctgataaaaag aagcaggttcataagaagcggaagtgccccgggcccataatcacctatcattcagtgaca gtgccacttgttggggagccaggccccaaggaagagaacgttgacatagaaggacttgat cctgctccctcggtgtctgcattgactcctcatgctgggactggacccgtcaacccccct gctcgctgctcacgtaccttcatcacttttagtgatgatgcaactttcgaggaatggttc ccccaagggcggcccccaaaagtccctgttcgtgaggtctgtccagtgacccatcgtcca gccctataccgggaccctgttacagacataccctatgccactgctcgagccttcaagatc attcgtgaggcttacaagaagtacattactgcccatggactgccgcccactgcctcagcc ctgggccccggcccgccacctcctgagcccctccctggctctgggccccgagccttgcgc cagaaaattgtcattaaatga >gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_4|2183_aa MGLKRYRKVIGRFRFLERQKGIFYASEDLNVCKTVKLRFQVSDLKVSAPLEYWHCLIQNK AASGIKRPMASEVPYASGMPIKKIGHRSVDSSGETTYKKTTSSALKGAIQLGITHTVGSL STKPERDVLMQDFYVVESIFFPSEGSNLTPAHHYNDFRFKTYAPVAFRYFRELFGIRPDD YLYSLCSEPLIELCSSGASGSLFYVSSDDEFIIKTVQHKEAEFLQKLLPGYYMNLNQNPR TLLPKFYGLYCVQAGGKNIRIVVMNNLLPRSVKMHIKYDLKGSTYKRRASQKEREKPLPT FKDLDFLQDIPDGLFLDADMYNALCKTLQRDCLVLQSFKIMDYSLLMSIHNIDHAQREPL SSETQYSVDTRRPAPQKALYSTAMESIQGEARRGGTMETDDHMGGIPARNSKGERLLLYI GIIDILQSYRFVKKLEHSWKALVHDGDTVSVHRPGFYAERFQRFMCNTVFKKIPCVHLGR PDVLPQTPPLEEISEGSPIPDPSFSPLVGETLQMLTTSVDNSEYMRNGDFLPTRLQAQQD AVNIVCHSKTRSNPENNVGLITLAKYGGQRRRGLSSDCEVLTTLTPDTGRILSKLHTVQP KGKITFCTGIRVAHLALKHRQGKNHKMRIIAFVGSPVEDNEKDLVKLAKRLKKEKVNVDI INFGEEEVNTEKLTAFVNTLNGKDGTGSHLVTVPPGPSLADALISSPILAGEGGAMLGLG ASDFEFGVDPSADPELALALRVSMEEQRQRQEEEARRAAAASAAEAGIATTGTEDSDDAL LKMTISQQEFGRTGLPDLSSMTEEEQIAYAMQMSLQGAEFGQAESADIDASSAMDTSEPA KEEDDYDVMQDPEFLQSVLENLPGVDPNNEAIRNAMGSLASQATKDGKKDKKEEDKKTCS PRQPSADGRDLIPSATQLGLNLIAASPSPSSALQALGPGKGLGSADMGDMKTPDFDDLLA AFDIPDIDANEAIHSGPEENEGPGGPGKPEPGVGSESEDTAAASAGDGPGVPAQASDHGL PPPDISVVSVIVKNTVCPEQSEALAGGSAGDGAQAAGVTKEGPVGPHRMQNGFGSPEPSL PGTPHSPAPPSGGTWKEKGMEGKTPLDLFAHFGPEPGDHSDPLPPSAPSPTREGALTPPP 
FPSSFELAQENGPGMQPPVSSPPLGALKQESCSPHHPQVLAQQGSGSSPKATDIPASASP PPVAGVPFFKQSPGHQSPLASPKVPVCQPLKEEDDDEGPVDKSSPGSPQSPSSGAEAADE DSNDSPASSSSRPLKVRIKTIKTSCGNITRTVTQVPSDPDPPAPLAEGAFLAEASLLKLS PATPTSEGPKVVSVQLGDGTRLKGTVLPVATIQNASTAMLMAASVARKAVVLPGGTATSP KMIAKNVLGLVPQALPKADGRAGLGTGGQKVNGASVVMVQPSKTATGPSTGGGTVISRTQ SSLVEAFNKILNSKNLLPAYRPNLSPPAEAGLALPPTGYRCLECGDAFSLEKSLARHYDR RSMRIEVTCNHCARRLVFFNKCSLLLHAREHKDKGLVMQCSHLVMRPVALDQMVGQPDIT PLLPVAVPPVSGPLALPALGKGEGAITSSAITTVAAEAPVLPLSTEPPAAPATSAYTCFR CLECKEQCRDKAGMAAHFQQLGPPAPGATSNVCPTCPMMLPNRCSFSAHQRMHKNRPPHV CPECGGNFLQANFQTHLREACLHVSRRVGYRCPSCSVVFGGVNSIKSHIQTSHCEVFHKC PICPMAFKSGPSAHAHLYSQHPSFQTQQAKLIYKCAMCDTVFTHKPLLSSHFDQHLLPQR VSVFKCPSCPLLFAQKRTMLEHLKNTHQSGRLEETAGKGAGGALLTPKTEPEELAVSQGG AAPATEESSSSSEEEEVPSSPEPPRPAKRPRRELGSKGLKGGGGGPGGWTCGLCHSWFPE RDEYVAHMKKEHGKSVKKFPCRLCERSFCSAPSLRRHVRVNHEGIKRVYPCRYCTEGKRT FSSRLILEKHVQVRHGLQLGAQSPGRGTTLARGSSARAQGPGRKRRQSSDSCSEEPDSTT PPAKSPRGGPGSGGHGPLRYRSSSSTEQSLMMGLRVEDGAQQCLDCGLCFASPGSLSRHR FISHKKRRGVGKASALGLGDGEEEAPPSRSDPDGGDSPLPASGGPLTCKVCGKSCDSPLN LKTHFRTHGMAFIRARQGAVGDN >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_4|6552_bp atgggacttaaaagatatcgaaaagttattggacgctttcgatttctggagagacaaaaa ggtattttctatgctagtgaggaccttaacgtctgtaaaactgtgaaactgcgctttcaa gtatcagacttgaaagtttctgctcctctagaatactggcattgcctaattcagaacaaa gcagcatctggaatcaagagacccatggcatctgaggtgccttatgcctctggcatgccc atcaagaaaataggccatagaagtgttgattcctcaggagagacaacatataaaaagaca acctcatcagccttgaaaggtgccatccagttaggcattacccacactgtggggagcctg agtaccaaaccagagcgtgatgtcctcatgcaagatttctacgtggttgagagtatcttc tttcccagtgaagggagcaacctgacccctgctcatcactacaatgactttcgtttcaag acctatgcacctgttgccttccgctacttccgggagctatttggtatccggcccgatgat tacttgtattccctctgcagtgagccgctgattgaactctgtagctctggagctagtggt tccctattctatgtgtccagcgacgatgagttcattattaagacagtccaacataaagag gcggaatttctgcagaagctgcttccaggatactacatgaacctcaaccagaaccctcgg actttgctgcctaaattctatggactgtactgtgtgcaggcaggtggcaagaacattcgg attgtggtgatgaacaatcttttaccaagatcggtaaaaatgcatatcaaatatgacctc 
aaaggctcaacctacaaacggcgggcttcccagaaagagcgagagaagcctcttcccaca tttaaagacctagacttcttacaagacatccctgatggtctttttttggatgctgacatg tacaacgctctctgtaagaccctgcagcgtgactgtttggtgctgcagagcttcaagata atggattacagcctcttgatgtcaatccataatatagatcatgcacaacgagagccctta agcagtgaaacacagtactcagttgatactcgaagaccggccccccaaaaggctctgtat tccacagccatggaatccatccagggagaggctcgacggggtggtaccatggagactgat gaccatatgggtggcatccctgcccggaatagtaaaggggaaaggcttctgctttatatt ggcatcattgacattctacagtcttacaggtttgttaagaagttggagcactcttggaaa gccctggtacatgacggagacactgtctcagtgcatcgcccaggcttctacgctgaacgg ttccagcgcttcatgtgcaacacagtatttaagaagattccctgcgttcaccttggtcgt cctgatgttttacctcagactccacctttggaggaaatcagtgagggctcgcctattcct gaccccagtttctcacctctagttggagagactttgcaaatgctaactacaagtgtggac aacagtgagtatatgcggaatggagacttcttacccaccaggctgcaggcccagcaggat gctgtcaacatagtttgtcattcaaagacccgcagcaaccctgagaacaacgtgggcctt atcacactggctaagtatgggggacagaggaggaggggactcagtagtgactgtgaagtg ctgaccacactcaccccagacactggccgtatcctgtccaagctacatactgtccaaccc aagggcaagatcaccttctgcacgggcatccgcgtggcccatctggctctgaagcaccga caaggcaagaatcacaagatgcgcatcattgcctttgtgggaagcccagtggaggacaat gagaaggatctggtgaaactggctaaacgcctcaagaaggagaaagtaaatgttgacatt atcaattttggggaagaggaggtgaacacagaaaagctgacagcctttgtaaacacgttg aatggcaaagatggaaccggttctcatctggtgacagtgcctcctgggcccagtttggct gatgctctcatcagttctccgattttggctggtgaaggtggtgccatgctgggtcttggt gccagtgactttgaatttggagtagatcccagtgctgatcctgagctggccttggccctt cgtgtatctatggaagagcagcggcagcggcaggaggaggaggcccggcgggcagctgca gcttctgctgctgaggccgggattgctacgactgggactgaagactcagacgatgccctg ctgaagatgaccatcagccagcaagagtttggccgcactgggcttcctgacctaagcagt atgactgaggaagagcagattgcttatgccatgcagatgtccctgcagggagcagagttt ggccaggcggaatcagcagacattgatgccagctcagctatggacacatctgagccagcc aaggaggaggatgattacgacgtgatgcaggaccccgagttccttcagagtgtcctagag aacctcccaggtgtggatcccaacaatgaagccattcgaaatgctatgggctccctggcc tcccaggccaccaaggacggcaagaaggacaagaaggaggaagacaagaagacttgctcc ccgcgccagccctcggcagatggcagggacttaattccgtctgctacccagcttggcctc 
aacctaatcgccgccagcccctcgccctcctctgcgctgcaggccttgggcccgggcaaa ggtctgggatctgccgatatgggggatatgaagacccctgattttgatgacctccttgct gcctttgacatccctgacattgatgcgaatgaagccatccattctgggccagaagaaaat gaggggcctggaggcccagggaagccagaaccaggtgtaggaagtgaatctgaagacaca gcagcagcctctgctggggatggccctggagttccagcccaggcctctgaccatggcctg ccaccgccagacatttctgtagtcagtgtcattgtcaagaacactgtgtgtcccgagcag tctgaggccctggctggaggctcagcaggagacggggcccaggctgctggggtaactaaa gaagggcctgtggggcctcatcgaatgcagaatggttttgggagccctgaaccttccctc ccaggaactccccactctcctgctcctcccagtgggggcacctggaaagaaaaaggcatg gaaggcaaaactcccttggacctgtttgctcattttggccctgagccaggggaccactca gatccgctgcctccctctgcaccctctcccactcgggagggggctctgaccccgcctcct ttcccctcttcctttgagctggcccaggagaatggcccaggcatgcagccacctgtttct tccccaccattgggggccttgaagcaggagagctgcagcccccatcatccccaggtccta gcccaacaaggctcaggctccagccctaaggccacggacatccctgccagtgcctcgcct cccccagttgctggggtgcccttcttcaagcagtctccagggcaccagagccctcttgcc tcccccaaagtgcccgtctgtcagcccttgaaggaagaagatgatgatgaggggccagtg gacaagtcttccccaggaagtccccagagtccctctagtggggccgaggctgcagatgag gacagcaatgactcccctgcctccagctcctctaggcctcttaaggtgcggatcaagacc attaaaacatcctgcgggaatatcacaaggactgtaactcaggtcccctcagatcctgat ccacctgcccccttggctgagggggccttcttggctgaggctagcctcttgaagctgtcc cctgcaacacctacttctgagggtccaaaggtggtgagcgtacagttgggtgatggtaca aggctgaaaggcactgtgctgcctgtggccaccatccagaacgccagtactgccatgctg atggcagccagtgtggctcgcaaggctgtggtgctgcctggggggactgccaccagccct aagatgattgctaagaacgtgctaggcctggtgccccaagccctgcctaaggctgacggg cgggcagggctggggactgggggacagaaggtgaatggtgcctcggtggtgatggtgcaa ccttcaaagacagctactgggccaagtacagggggcggcacagtgatatcacggacccag tccagcctggtggaggccttcaacaagatcctcaacagcaagaacctgctccctgcctat aggccaaacctgagcccaccagctgaggctgggctggccctgcctcccaccggctaccgc tgcctggagtgtggggatgccttctcattggagaagagcctggcacggcactatgaccgt cggagcatgcgcatcgaggtcacctgcaaccactgcgcccgccgcctggtcttcttcaac aagtgcagcctgctcctgcatgcacgtgaacacaaggacaaggggctcgtcatgcagtgc tcacatttggtcatgaggcctgtagcccttgaccagatggtggggcagccggacatcaca 
ccgctgctgcctgtagctgtcccacctgtctctggacctctggccttgcctgccttgggc aagggtgagggggccatcacctcctctgccattactacagttgctgctgaggcccctgtc ctgccgctctccacagagccgcctgctgccccggccacctctgcttacacatgctttcgc tgcctggagtgcaaggaacagtgccgggacaaggctggcatggcagctcacttccagcag ctcggcccccctgcccctggggccaccagcaatgtgtgcccaacctgccccatgatgctc cccaatcgctgcagcttcagcgcccaccagcgcatgcataagaatcgacccccccatgtc tgtcctgagtgtgggggcaacttcctgcaagccaattttcagacccatctccgggaggcc tgtctgcacgtctctcgccgtgtaggatacaggtgccccagctgttcagtggtgtttggg ggtgtgaactccatcaagtcccacatccagacgtcgcactgcgaggttttccacaagtgc cccatctgccccatggccttcaagtctgggccaagtgcccatgcccacctctactcccag catcccagcttccaaactcagcaggccaagctgatctacaagtgcgccatgtgcgacaca gtcttcactcacaaacccctcctctcctcacacttcgaccagcacttgctgccccagcgt gtcagtgtctttaagtgcccgtcttgtcctctgctctttgcccaaaaaaggaccatgctg gaacatctcaagaacacccatcagtctgggcgcttggaggagactgctgggaaaggggcc gggggtgccctgctgacccccaagactgagcctgaggagctggctgtttctcagggaggg gcagcccctgctactgaggagtcgtcttcatcttcagaagaggaggaagtacccagctcc cctgagcccccccgtccagccaaacggcctcggcgggaactagggagcaaaggcctcaag ggtgggggtggggggcctggaggctggacctgtggcctgtgtcactcctggttccctgag cgtgatgaatacgtggcccacatgaagaaggagcatggcaagtcagtgaaaaagttcccc tgtcgcctgtgtgagcgctccttctgctccgcccccagcctgaggcgccatgtcagagtt aatcacgagggcatcaagcgagtttacccctgcaggtattgcacagagggaaaacgcacc ttcagcagccgcctgatcctagagaaacatgtccaggtccggcacggcttgcagcttggg gcccagtcccctggccgggggaccaccttggctcggggttccagtgccagagcccagggg ccaggtcggaaacgccgccagtcttctgactcttgcagtgaggagcctgacagcacgaca ccgccagccaagtcccccaggggcggacctggatctggaggccatggccctctgcgctac cggagcagcagctccacagaacagagcctcatgatggggttgagggtggaggatggtgcc cagcagtgcctcgactgtggcttgtgctttgcctcccctggctccctgagccgacaccgt ttcatcagccacaagaagagacggggtgtgggtaaagccagtgccctggggctgggggat ggggaggaagaggcccctccatcaaggtctgaccccgatggtggagactcacccctgcct gcttctggaggcccactgacctgtaaggtctgtggcaagagctgcgacagccctctaaac ctcaagacccacttccgcacgcatggcatggcgttcatcagggctcggcagggggctgtt ggggacaactag >gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_5|850_aa 
MVKPLEARSLAVAMGDTVVEPAPLKPTSEPTSGPPGNNGGSLLSVITEGVGELSVIDPEV AQKACQEVLEKVKLLHGGVAVSSRGTPLELVNGDGVDSEIRCLDDPPAQIREEEDEMGAA VASGTAKGARRRRQNNSAKQSWLLRLFESKLFDISMAISYLYNSKEPGVQAYIGNRLFCF RNEDVDFYLPQLLNMYIHMDEDVGDAIKPYIVHRCRQSINFSLQCALLLGAYSSDMHIST QRHSRGTKLRKLILSDELKPAHRKRELPSLSPAPDTGLSPSKRTHQRSKSDATASISLSS NLKRTASNPKVENEDEELSSSTESIDNSFSSPVRLAPEREFIKSLMAIGKRLATLPTKEQ KTQRLISELSLLNHKLPARVWLPTAGFDHHVVRVPHTQAVVLNSKDKAPYLIYVEVLECE NFDTTSVPARIPENRIRSTRSVENLPECGITHEQRAGSFSTVPNYDNDDEAWSVDDIGEL QVELPEVHTNSCDNISQFSVDSITSQESKEPVFIAAGDIRRRLSEQLAHTPTAFKRDPED PSAVALKEPWQEKVRRIREGSPYGHLPNWRLLSVIVKCGDDLRQELLAFQVLKQLQSIWE QERVPLWIKPYKILVISADSGMIEPVVNAVSIHQVKKQSQLSLLDYFLQEHGSYTTEAFL SAQRNFVQSCAGYCLVCYLLQVKDRHNGNILLDAEGHIIHIDFGFILSSSPRNLGFETSA FKLTTEFVDVMGGLDGDMFNYYKMLMLQGLIAARKHMDKVVQIVEIMQQGCRRCSGSSPS GPMMTVAQVICSQLPCFHGSSTIRNLKERFHMSMTEEQLQLLVEQMVDGSMRSITTKLYD GFQYLTNGIM >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_5|2553_bp atggtgaaacccttggaagctcgaagtctggctgtggccatgggagatacagtagtggag cctgcccccttgaagccaacttctgagcccacttctggcccaccagggaataatgggggg tccctgctaagtgtcatcacggagggggtcggggaactatcagtgattgaccctgaggtg gcccagaaggcctgccaggaggtgttggagaaagtcaagcttttgcatggaggcgtggca gtctctagcagaggcaccccactggagttggtcaatggggatggtgtggacagtgagatc cgttgcctagatgatccacctgcccagatcagggaggaggaagatgagatgggggccgct gtggcctcaggcacagccaaaggagcaagaagacggcggcagaacaactcagctaaacag tcttggctgctgaggctgtttgagtcaaaactgtttgacatctccatggccatttcatac ctgtataactccaaggagcctggagtacaagcctacattggcaaccggctcttctgcttt cgcaacgaggacgtggacttctatctgccccagttgcttaacatgtacatccacatggat gaggacgtgggtgatgccattaagccctacatagtccaccgttgccgccagagcattaac ttttccctccagtgtgccctgttgcttggggcctattcttcagacatgcacatttccact caacgacactcccgtgggaccaagctacggaagctgatcctctcagatgagctaaagcca gctcacaggaagagggagctgccctccttgagcccggcccctgacacagggctgtctccc tccaaaaggactcaccagcgctctaagtcagatgccactgccagcataagtctcagcagc aacctgaaacgaacagccagcaaccctaaagtggagaatgaggatgaggagctctcctcc agcaccgagagtattgataattcattcagttcccctgttcgactggctcctgagagagaa 
ttcatcaagtccctgatggcgatcggcaagcggctggccacgctccccaccaaagagcag aaaacacagaggctgatctcagagctctccctgctcaaccataagctccctgcccgagtc tggctgcccactgctggctttgaccaccacgtggtccgtgtaccccacacacaggctgtt gtcctcaactccaaggacaaggctccctacctgatttatgtggaagtccttgaatgtgaa aactttgacaccaccagtgtccctgcccggatccccgagaaccgaattcggagtacgagg tccgtagaaaacttgcccgaatgtggtattacccatgagcagcgagctggcagcttcagc actgtgcccaactatgacaacgatgatgaggcctggtcggtggatgacataggcgagctg caagtggagctccccgaagtgcataccaacagctgtgacaacatctcccagttctctgtg gacagcatcaccagccaggagagcaaggagcctgtgttcattgcagcaggggacatccgc cggcgcctttcggaacagctggctcataccccgacagccttcaaacgagacccagaagat ccttctgcagttgctctcaaagagccctggcaggagaaagtacggcggatcagagagggc tccccctacggccatctccccaattggcggctcctgtcagtcattgtcaagtgtggggat gaccttcggcaagagcttctggcctttcaggtgttgaagcaactgcagtccatttgggaa caggagcgagtgcccctttggatcaagccatacaagattcttgtgatttcggctgatagt ggcatgattgaaccagtggtcaatgctgtgtccatccatcaggtgaagaaacagtcacag ctctccttgctcgattacttcctacaggagcacggcagttacaccactgaggcattcctc agtgcacagcgcaattttgtgcaaagttgtgctgggtactgcttggtctgctacctgctg caagtcaaggacagacacaatgggaatatccttttggacgcagaaggccacatcatccac atcgactttggcttcatcctctccagctcaccccgaaatctgggctttgagacgtcagcc tttaagctgaccacagagtttgtggatgtgatgggcggcctggatggcgacatgttcaac tactataagatgctgatgctgcaagggctgattgccgctcggaaacacatggacaaggtg gtgcagatcgtggagatcatgcagcaaggttgtcgccgttgctcaggatcatccccatct ggccccatgatgacggtggcccaggtcatctgttctcagcttccttgcttccatggctcc agcaccattcgaaacctcaaagagaggttccacatgagcatgactgaggagcagctgcag ctgctggtggagcagatggtggatggcagtatgcggtctatcaccaccaaactctatgac ggcttccagtacctcaccaacggcatcatgtga >gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_6|735_aa MPGRLHLLTGKFPHAGMAEDEPDAKSPKTGGRAPPGGAEAGEPTTLLQRLRGTISKAVQN KVEGILQDVQKFSDNDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEEHT DTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDIKARRLGGRGQSKYCYSGI RRKTLVSMPPLPGLDLKGSESVSTKSPSNSTLLSQPEMGPEVTPAPRDELVEAACALTCD WAERILKRSFSSIVEVARFLLQQHLISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGL ENPEGGAHKKPERLAQPPKDLEARTGAGPLARGERKKSVVESSAPGANNLQVNALVARLP 
LLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLPLSSRAGAPPAAVPIINMILPTVPA LPGPGPGPGRAPPGGLTQPRGTENREVGIGGDQGPHDKGVKRTAEVPVSEASGQAPPAKA AKQDIEDTASDAKRKRGRPRKKSGGSGERNSTPLKSAAAMESAQSSRLPWETWGSGGEGN SAGGAERPGPMGEAEKGAVLAQGQGDGTVSKGGRGPGSQHTKEAEDKIPLVPSKVSVIKG SRSQKEAFPLAKGEAAPRAAPQPGPGAASAAAVREAQAANGTRERSLLLRRQCRQSNRDC PHPLRGGPPEDQLNLELRRKMCCEQGEPQLPQAVQDRRVFLKAGASFEEVPIRLHLVDYS GAATKRDEQACVEIE >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_6|2208_bp atgcccggccgactccacctcttgacagggaagttccctcatgccgggatggcagaagat gagcctgatgctaagagccccaagactgggggaagggcccccccaggtggtgctgaggct ggggaacctaccacccttcttcagaggctccgaggtaccatttccaaggccgtgcagaac aaagtagaggggatcctgcaagatgtacagaaattttctgacaatgacaagctgtatctc taccttcagctcccctcaggacccaccactggagacaaaagctcagagccaagtacactg agcaatgaggagtacatgtatgcctataggtggatccgcaaccacctggaagagcacact gacacctgtctgccaaagcaaagtgtttatgatgcctatcggaagtactgtgagagtctt gcctgttgccgcccactcagcacagccaactttggcaagatcatcagagagatcttccct gacatcaaagctcgaaggcttggtggccggggccagtccaaatattgctacagtggcata aggaggaagaccttggtgtctatgccacccctgcctggacttgacctaaagggttctgag agtgtaagtaccaaatcaccttccaattccactcttctctcccagccagaaatgggccca gaagtaaccccagcacctcgagatgaactggtggaggcagcgtgtgccctgacctgtgac tgggcagagcggatcctgaaacggtccttcagttccatcgttgaggtcgcccgcttcctg ctacagcagcatctcatctctgcccgatctgcacatgcccatgtgcttaaggccatgggg ctcgctgaagaggacgaacatgcacctcgggaacggtcatctaaaccaaagaatggttta gagaacccagagggtggagcccacaagaagccagagagactggcccagcctcctaaggat ctggaagcccgaactggggccggtcctctcgcacgtggagagcggaagaagagtgtagtt gagagctcggccccaggagccaataacctgcaggttaatgccctagtggctcggctgcct ctgctccttccccgggcccctcgctcactaattccgccaatcccagtctctccacctatt ctggcccccaggctttcttcaggtgccctgaaagtggctacactgcctctgtctagtagg gccggggcacccccagcagctgtgcccatcattaacatgatcttaccaactgttcctgct ttgcctggacctggacctgggcctgggcgagctccacctgggggactcactcagccccgg ggcacagagaacagagaggtaggcataggtggtgaccaaggaccacatgacaagggtgtc aagaggacagctgaagtacctgtgagtgaggccagtgggcaggctccaccagctaaagca gcaaagcaggatatagaggatacagcaagtgatgccaaaaggaaacgggggcgccctcga 
aaaaagtcaggtggaagtggggaaaggaattctacccctctcaagtcagcagctgccatg gaatctgcccagtcctcaaggttaccatgggagacatggggctcaggaggggaaggcaac tcagctggaggggcagagaggccagggccaatgggagaggctgaaaagggggcagtactt gcccagggtcagggagatggtactgtttccaaaggaggaaggggccccggttcccagcat accaaagaagcagaagataaaattcccttggtcccctcaaaagtgagtgtcatcaagggc agcagaagccaaaaggaggcttttcctttggcaaagggagaggcggcgccgcgggcagcc ccgcagccggggcctggtgcagcctccgcggccgctgtcagggaagcgcaggcggccaat ggaacccgggagcggtcgctgctgctgaggcggcagtgtcggcagtccaaccgcgactgc ccgcaccccctccgcgggggtcccccagaggatcaactaaaccttgaactaagaagaaaa atgtgttgtgagcagggggagcctcagctgcctcaggccgttcaggacagaagggtgttt ctgaaggccggagcaagttttgaagaagtccctatcagattacacttggttgactactcc ggagcagccactaagagggatgaacaggcctgcgtggaaattgaatga >gi568815597f:151162160_151367340|GENSCAN_predicted_peptide_7|312_aa XGFVLLDGETFEVKGTWERPGGAAPLGYDFWYQPRHNVMISTEWAAPNVLRDGFNPADVE AGLYGSHLYVWDWQRHEIVQTLSLKDGLIPLEIRFLHNPDAAQGFVGCALSSTIQRFYKN EGGTWSVEKVIQVPPKKVKGWLLPEMPGLITDILLSLDDRFLYFSNWLHGDLRQYDISDP QRPRLTGQLFLGGSIVKGGPVQVLEDEELKSQPEPLVVKGKRVAGGPQMIQLSLDGKRLY ITTSLYSAWDKQFYPDLIREGSVMLQVDVDTVKGGLKLNPNFLVDFGKEPLGPALAHELR YPGGDCSSDIWI >gi568815597f:151162160_151367340|GENSCAN_predicted_CDS_7|939_bp nggggttttgtgctgctggatggggagacgttcgaggtgaaggggacatgggagagacct gggggtgctgcaccgttgggctatgacttctggtaccagcctcgacacaatgtcatgatc agcactgagtgggcagctcccaatgtcttacgagatggcttcaaccccgctgatgtggag gctggactgtacgggagccacttatatgtatgggactggcagcgccatgagattgtgcag accctgtctctaaaagatgggcttattcccttggagatccgcttcctgcacaacccagac gctgcccaaggctttgtgggctgcgcactcagctccaccatccagcgcttctacaagaac gagggaggtacatggtcagtggagaaggtgatccaggtgccccccaagaaagtgaagggc tggctgctgcccgaaatgccaggcctgatcaccgacatcctgctctccctggacgaccgc ttcctctacttcagcaactggctgcatggggacctgaggcagtatgacatctctgaccca cagagaccccgcctcacaggacagctcttcctcggaggcagcattgttaagggaggccct gtgcaagtgctggaggacgaggaactaaagtcccagccagagcccctagtggtcaaggga aaacgggtggctggaggccctcagatgatccagctcagcctggatgggaagcgcctctac atcaccacgtcgctgtacagtgcctgggacaagcagttttaccctgatctcatcagggaa 
ggctctgtgatgctgcaggttgatgtagacacagtaaaaggagggctgaagttgaacccc aacttcctggtggacttcgggaaggagccccttggcccagcccttgcccatgagctccgc taccctgggggcgattgtagctctgacatctggatttga