GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:18:27 Sequence gi568815578f:34877630_35102085 : 224456 bp : 46.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5165 5360 196 0 1 97 58 209 0.262 17.27 1.02 Intr + 35467 35558 92 1 2 90 96 69 0.913 7.54 1.03 Intr + 35764 35867 104 2 2 89 46 119 0.859 7.69 1.04 Intr + 36124 36196 73 0 1 104 62 38 0.893 1.78 1.05 Intr + 36467 36542 76 0 1 69 100 113 0.997 9.17 1.06 Intr + 36694 36808 115 1 1 78 109 25 0.989 4.05 1.07 Intr + 37589 37627 39 1 0 86 68 43 0.579 0.52 1.08 Intr + 41806 41949 144 0 0 105 12 141 0.657 8.68 1.09 Intr + 42910 43080 171 0 0 149 77 208 0.999 25.94 1.10 Intr + 43383 43510 128 2 2 -64 52 232 0.635 3.88 1.11 Intr + 43701 43833 133 0 1 95 79 98 0.985 10.25 1.12 Intr + 44157 44237 81 2 0 93 81 63 0.984 5.93 1.13 Intr + 45694 45802 109 0 1 118 89 73 0.999 10.26 1.14 Intr + 48069 48137 69 1 0 63 81 111 0.992 7.05 1.15 Intr + 48476 48652 177 0 0 45 61 242 0.994 17.09 1.16 Intr + 49248 49322 75 1 0 32 91 77 0.783 1.89 1.17 Term + 49458 49585 128 1 2 54 47 159 0.998 6.84 1.18 PlyA + 50311 50316 6 1.05 2.14 PlyA - 50826 50821 6 1.05 2.13 Term - 51322 51199 124 0 1 82 55 164 0.995 10.36 2.12 Intr - 51961 51772 190 2 1 28 99 235 0.999 17.24 2.11 Intr - 53788 53707 82 1 1 108 92 126 0.995 14.21 2.10 Intr - 54504 54310 195 0 0 103 76 122 0.987 12.11 2.09 Intr - 58013 57947 67 1 1 92 77 86 0.972 6.81 2.08 Intr - 59211 59134 78 2 0 79 49 93 0.897 3.17 2.07 Intr - 59394 59314 81 2 0 67 111 50 0.937 3.95 2.06 Intr - 64200 64084 117 2 0 67 80 72 0.913 3.88 2.05 Intr - 64998 64859 140 0 2 86 80 139 0.986 12.16 2.04 Intr - 65377 65302 76 0 1 89 91 105 0.999 10.42 2.03 Intr - 68469 68324 146 0 2 108 91 174 0.999 18.78 2.02 Intr - 74231 74095 137 0 2 135 99 133 0.989 19.39 2.01 Init - 91327 91228 100 1 1 36 110 76 0.341 5.02 2.00 Prom - 95180 95141 40 -8.76 3.00 Prom + 97090 97129 40 -7.26 3.01 Init + 100121 100183 63 1 0 65 24 90 0.547 1.45 3.02 Intr + 100338 100467 130 2 1 69 127 142 0.419 16.47 3.03 Intr + 101761 101867 107 2 2 68 20 200 0.991 11.13 3.04 Intr + 102032 102175 144 1 0 85 47 282 0.999 24.28 3.05 Intr + 102949 103105 157 0 1 84 64 189 0.985 15.68 3.06 Intr + 103404 103431 28 1 1 130 94 37 0.974 5.67 3.07 Intr + 104830 104926 97 1 1 44 60 118 0.973 4.61 3.08 Intr + 107063 107086 24 1 0 129 84 29 0.857 4.82 3.09 Intr + 107225 107317 93 1 0 82 59 148 0.999 11.46 3.10 Intr + 107437 107500 64 0 1 107 68 105 0.998 8.69 3.11 Intr + 108471 108593 123 1 0 58 27 173 0.774 8.76 3.12 Intr + 109257 109360 104 1 2 119 15 223 0.999 17.99 3.13 Intr + 109520 109658 139 1 1 109 42 237 0.999 21.24 3.14 Intr + 109928 110046 119 0 2 89 99 150 0.977 16.38 3.15 Intr + 110136 110285 150 2 0 84 78 261 0.999 25.06 3.16 Intr + 110463 110633 171 2 0 71 98 426 0.996 42.04 3.17 Intr + 112111 112290 180 0 0 77 66 359 0.999 32.66 3.18 Intr + 112385 112517 133 1 1 94 81 147 0.945 14.82 3.19 Intr + 112605 112681 77 1 2 76 116 88 0.999 9.63 3.20 Intr + 113109 113196 88 2 1 79 66 175 0.997 13.94 3.21 Intr + 113375 113492 118 0 1 126 64 222 0.997 23.12 3.22 Intr + 115473 115596 124 0 1 131 58 221 0.989 23.99 3.23 Intr + 115705 115841 137 0 2 127 94 184 0.989 22.27 3.24 Intr + 116517 116772 256 0 1 83 68 528 0.999 47.75 3.25 Intr + 117707 117949 243 1 0 115 70 577 0.987 56.29 3.26 Intr + 118717 118893 177 0 0 63 86 373 0.661 34.72 3.27 Intr + 118984 119129 146 0 2 110 50 355 0.983 32.98 3.28 Intr + 119454 119544 91 0 1 75 96 186 0.780 18.10 3.29 Intr + 119622 120011 390 2 0 78 32 687 0.927 57.02 3.30 Intr + 120666 120792 127 2 1 50 77 219 0.931 17.25 3.31 Intr + 120882 121000 119 1 2 126 86 126 0.999 16.38 3.32 Intr + 121090 121286 197 0 2 95 89 414 0.999 40.41 3.33 Intr + 121427 121776 350 2 2 48 53 698 0.998 57.30 3.34 Intr + 121942 122066 125 2 2 56 55 213 0.998 15.10 3.35 Intr + 122162 122277 116 1 2 119 85 262 0.999 28.25 3.36 Intr + 122664 123060 397 0 1 99 75 803 0.999 74.68 3.37 Intr + 123139 123264 126 0 0 108 48 145 0.999 13.38 3.38 Intr + 123359 123529 171 1 0 105 93 343 0.983 36.64 3.39 Intr + 123616 123720 105 0 0 79 80 208 0.804 19.51 3.40 Intr + 123802 123963 162 0 0 72 94 242 0.555 23.37 3.41 Intr + 124319 124456 138 1 0 42 100 290 0.987 26.26 3.42 Term + 124548 124559 12 2 0 97 49 10 0.769 -3.80 3.43 PlyA + 124682 124687 6 1.05 4.20 PlyA - 124823 124818 6 1.05 4.19 Term - 125654 125517 138 2 0 77 43 149 0.948 7.26 4.18 Intr - 125987 125781 207 2 0 102 92 505 0.999 51.57 4.17 Intr - 126941 126829 113 0 2 120 94 213 0.998 25.20 4.16 Intr - 128168 128066 103 2 1 66 83 201 0.633 17.15 4.15 Intr - 128946 128806 141 0 0 86 98 161 0.931 17.45 4.14 Intr - 130011 129921 91 2 1 45 64 119 0.961 5.20 4.13 Intr - 131118 131035 84 2 0 114 109 72 0.998 10.84 4.12 Intr - 132659 132558 102 1 0 103 109 198 0.768 22.69 4.11 Intr - 135156 134969 188 0 2 52 53 91 0.659 0.49 4.10 Intr - 138510 138379 132 0 0 103 119 72 0.999 12.64 4.09 Intr - 143703 143561 143 1 2 111 80 133 0.883 14.87 4.08 Intr - 157679 157494 186 0 0 82 86 186 0.856 17.46 4.07 Intr - 167083 166876 208 1 1 82 98 216 0.952 20.65 4.06 Intr - 172365 172237 129 0 0 89 105 87 0.990 11.39 4.05 Intr - 177402 177347 56 1 2 86 75 44 0.142 1.60 4.04 Intr - 179942 179885 58 2 1 52 106 35 0.047 0.16 4.03 Intr - 182340 182111 230 1 2 80 -37 425 0.057 26.79 4.02 Intr - 184522 184486 37 1 1 49 52 4 0.074 -9.26 4.01 Init - 185595 185542 54 0 0 83 70 91 0.754 8.08 4.00 Prom - 194140 194101 40 -2.46 5.05 PlyA - 194900 194895 6 1.05 5.04 Term - 198556 197733 824 2 2 -62 43 377 0.042 11.46 5.03 Intr - 198936 198774 163 0 1 77 71 36 0.054 0.35 5.02 Intr - 200545 200417 129 1 0 87 88 71 0.066 7.89 5.01 Init - 215152 214985 168 1 0 109 96 162 0.957 16.64 5.00 Prom - 219824 219785 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 182037 182240 204 2 0 33 49 438 0.820 31.77 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:34877630_35102085|GENSCAN_predicted_peptide_1|636_aa XFWGDIAKEFYWKTPCPGPFLRYNFDVTKGKIFIEWMKGATTNICYNVLDRNVHEKKLGD KVAFYWEGNEPGETTQITYHQLLVQVCQFSNVLRKQGIQKGDRVAIYMPMIPELVVAMLA CARIGALHSIVFAGFSSESLCERILDSSCSLLITTDAFYRGEKLVNLKELADEALQKCQE KGFPVRCCIVVKHLGRAELGMGDSTSQSPPIKRSCPDVQGKLKEKSKRVQPQISWNQGID LWWHELMQEAGDECEPEWCDAEDPLFILYTSGSTGKPKASGVVHTVGGYMLYVATTFKYV FDFHAEDVFWCTADIGWITGHSYVTYGPLANGATSVLGIPTYPDVNRLWSIVDKYKVTKF YTAPTAIRLLMKFGDEPVTKHSRASLQVLGTVGEPINPEAWLWYHRVVGAQRCPIVDTFW QTETTFPFFGVAPAILNESGEELEGEAEGYLVFKQPWPGIMRTVYGNHERFETTYFKKFP GYYVTGDGCQRDQDGYYWITGRIDDMLNVSGHLLSTAEVESALVEHEAVAEAAVVGHPHP VKGECLYCFVTLCDGHTFSPKLTEELKKQIREKIGPIATPDYIQNAPGLPKTRSGKIMRR VLRKIAQNDHDLGDMSTVADPSVISHLFSHRCLTIQ >gi568815578f:34877630_35102085|GENSCAN_predicted_CDS_1|1911_bp naattctggggagacattgccaaggaattttactggaagactccatgccctggcccattc cttcggtacaactttgatgtgactaaagggaaaatcttcattgagtggatgaaaggagca actaccaacatctgctacaatgtactggatcgaaatgtccatgagaaaaagcttggagat aaagttgctttttactgggagggcaatgagccaggggagaccactcagatcacataccat cagcttctggtccaagtgtgtcagttcagcaatgttctccgaaaacagggcattcagaag ggggaccgagtggccatctacatgcctatgatcccagagcttgtggtggccatgctggca tgtgcccgcattggggctttgcactccattgtgtttgcaggcttctcttcagagtctcta tgtgaacggatcttggattccagctgcagtcttctcatcactacagatgccttctacagg ggggaaaagcttgtgaacctgaaggagctggctgacgaggccctgcagaagtgtcaggag aagggtttcccagtaagatgctgcattgtggtcaagcacctggggcgggcagagctcggc atgggtgactccaccagccagtcccccccaattaagaggtcatgcccagatgtgcagggt aaactgaaagagaaatccaagcgtgttcagccccagatctcatggaaccaagggattgac ttgtggtggcatgagctcatgcaagaggcaggggatgagtgtgagcccgagtggtgtgat gccgaggacccactcttcatcctgtacaccagtggctccacaggcaaacccaaggcaagt ggtgtggttcacacagttgggggctacatgctctatgtagccacaaccttcaagtatgtg tttgacttccatgcagaggatgtgttctggtgcacggcagacattggttggatcactggt cattcctacgtcacctatgggccactggccaatggtgccaccagtgttttggggattccc acatatccggacgtgaaccgcctgtggagcattgtggacaaatacaaggtgaccaagttc tacacagcacccacagccatccgtctgctcatgaagtttggagatgagcctgtcaccaag catagccgggcatccttgcaggtgttaggcacagtgggtgaacccatcaaccctgaggcc tggctatggtaccaccgggtggtaggtgcccagcgctgccccatcgtggacaccttctgg caaacagagacaactttcccattctttggtgtagctcctgcaatcctgaatgagtccggg gaagagttggaaggtgaagctgaaggttatctggtgttcaagcagccctggccagggatc atgcgcacagtctatgggaaccacgaacgctttgagacaacctactttaagaagtttcct ggatactatgttacaggagatggctgccagcgggaccaggatggctattactggatcact ggcaggattgatgacatgctcaatgtatctggacacctgctgagtacagcagaggtggag tcagcacttgtggaacatgaggctgttgcagaggcagctgtggtgggccaccctcatcct gtgaagggtgaatgcctctactgctttgtcaccttgtgtgatggccacaccttcagcccc aagctcaccgaggagctcaagaagcagattagagaaaagattggccccattgccacacca gactacatccagaatgcacctggcttgcctaaaacccgctcagggaaaatcatgaggcga gtgcttcggaagattgctcagaatgaccatgacctcggggacatgtctactgtggctgac ccatctgtcatcagtcacctcttcagccaccgctgcctgaccatccagtga >gi568815578f:34877630_35102085|GENSCAN_predicted_peptide_2|510_aa MPAVAAAHIGFQNHSTPRSPVPPPAPCVLASLPGVGMATNWGSLLQDKQQLEELARQAVD RALAEGVLLRTSQEPTSSEVVSYAPFTLFPSLVPSALLEQAYAVQMDFNLLVDAVSQNAA FLEQTLSSTIKQDDFTARLFDIHKQVLKEGIAQTVFLGLNRSDYMFQRSADGSPALKQIE INTISASFGGLASRTPAVHRHVLSVLSKTKEAGKILSNNPSKGLALGIAKAWELYGSPNA LVLLIAQEKERNIFDQRAIENELLARNIHVIRRTFEDISEKGSLDQDRRLFVDGQEIAVV YFRDGYMPRQYSLQNWEARLLLERSHAAKCPDIATQLAGTKKVQQELSRPGMLEMLLPGQ PEAVARLRATFAGLYSLDVGEEGDQAIAEALAAPSRFVLKPQREGGGNNLYGEEMVQALK QLKDSEERASYILMEKIEPEPFENCLLRPGSPARVVQCISELGIFGVYVRQEKTLVMNKH VGHLLRTKAIEHADGGVAAGVAVLDNPYPV >gi568815578f:34877630_35102085|GENSCAN_predicted_CDS_2|1533_bp atgccagcagttgctgctgctcatatcggcttccagaaccactcaaccccaagaagccca gtcccgcccccagccccttgtgtgctggcctccctgccaggtgttgggatggccaccaac tgggggagcctcttgcaggataaacagcagctagaggagctggcacggcaggccgtggac cgggccctggctgagggagtattgctgaggacctcacaggagcccacttcctcggaggtg gtgagctatgccccattcacgctcttcccctcactggtccccagtgccctgctggagcaa gcctatgctgtgcagatggacttcaacctgctagtggatgctgtcagccagaacgctgcc ttcctggagcaaactctttccagcaccatcaaacaggatgactttaccgctcgtctcttt gacatccacaagcaagtcctaaaagagggcattgcccagactgtgttcctgggcctgaat cgctcagactacatgttccagcgcagcgcagatggctccccagccctgaaacagatcgaa atcaacaccatctctgccagctttgggggcctggcctcccggaccccagctgtgcaccga catgttctcagtgtcctgagtaagaccaaagaagctggcaagatcctctctaataatccc agcaagggactggccctgggaattgccaaagcctgggagctctacggctcacccaatgct ctggtgctactgattgctcaagagaaggaaagaaacatatttgaccagcgtgccatagag aatgagctactggccaggaacatccatgtgatccgacgaacatttgaagatatctctgaa aaggggtctctggaccaagaccgaaggctgtttgtggatggccaggaaattgctgtggtt tacttccgggatggctacatgcctcgtcagtacagtctacagaattgggaagcacgtcta ctgctggagaggtcacatgctgccaagtgcccagacattgccacccagctggctgggact aagaaggtgcagcaggagctaagcaggccgggcatgctggagatgttgctccctggccag cctgaggctgtggcccgcctccgcgccacctttgctggcctctactcactggatgtgggt gaagaaggggaccaggccatcgccgaggcccttgctgcccctagccggtttgtgctaaag ccccagagagagggtggaggtaacaacctatatggggaggaaatggtacaggccctgaaa cagctgaaggacagtgaggagagggcctcctacatcctcatggagaagatcgaacctgag ccttttgagaattgcctgctacggcctggcagccctgcccgagtggtccagtgcatttca gagctgggcatctttggggtctatgtcaggcaggaaaagacactcgtgatgaacaagcac gtggggcatctacttcgaaccaaagccatcgagcatgcagatggtggtgtggcagcggga gtggcagtcctggacaacccataccctgtgtga >gi568815578f:34877630_35102085|GENSCAN_predicted_peptide_3|2005_aa MGGRESSESGVEEALVFEGVHGFQLLLLHPSATAMMDVSELGESARYLRQGYQEMTKVHT IPWDGKKRVWVPDEQDAYVEAEVKSEATGGRVTVETKDQKVLMVREAELQPMNPPRFDLL EDMAMMTHLNEASVLHNLRQRYARWMIYTYSGLFCVTINPYKWLPVYTASVVAAYKGKRR SDSPPHIYAVADNAYNDMLRNRDNQSMLITGESGAGKTVNTKRVIQYFAIVAALGDGPGK KAQFLATKTGGTLEDQIIEANPAMEAFGNAKTLRNDNSSRFGKFIRIHFGPSGKLASADI DSYLLEKSRVIFQLPGERSYHVYYQILSGRKPELQGEGQYDEGDMLLLSMNPYDYHFCSQ GVITVDNMNDGEELIATDHAMDILGFSVDEKCACYKIVGALLHFGNMKFKQKQREEQAEA DGTESADKAAYLMGVSSGDLLKGLLHPRVRVGNEYVTKGQSVEQVVFAVGALAKATYDRL FRWLVSRINQTLDTKLPRQFFIGVLDIAGFEIFEFNSFEQLCINFTNEKLQQFFNQHMFV LEQEEYKREGIDWVFIDFGLDLQPCIDLIEKPLGILSILEEECMFPKASDASFRAKLYDN HAGKSPNFQQPRPDKKRKYQAHFEVVHYAGVVPYSIVGWLEKNKDPLNETVVPIFQKSQN RLLATLYENYAGSCSTEPPKSGVKEKRKKAASFQTVSQLHKENLNKLMTNLRATQPHFVR CIVPNENKTPGVMDAFLVLHQLRCNGVLEGIRICRQGFPNRLLYTDFRQRYRILNPSAIP DDTFMDSRKATEKLLGSLDLDHTQYQFGHTKVFFKAGLLGVLEELRDQRLAKVLTLLQAR SRGRLMRLEYQRLLGGRDALFTIQWNIRAFNAVKNWSWMKLFFKMKPLLRSAQAEEELAA LRAELRGLRGALAAAEAKRQELEETHVSITQEKNDLALQLQAEQDNLADAEERCHLLIKS KVQLEGKVKELSERLEDEEEVNADLAARRRKLEDECTELKKDIDDLELTLAKAEKEKQAT ENKVKNLTEEMAALDESVARLTKEKKALQEAHQQALGDLQAEEDRVSALTKAKLRLEQQV EDLECSLEQEKKLRMDTERAKRKLEGDLKLTQESVADAAQDKQQLEEKLKKKDSELSQLS LRVEDEQLLGAQMQKKIKELQARAEELEEELEAERAARARVEKQRAEAARELEELSERLE EAGGASAGQREGCRKREAELGRLRRELEEAALRHEATVAALRRKQAEGAAELGEQVDSLQ RVRQKLEKEKSELRMEVDDLAANVETLTRAKASAEKLCRTYEDQLSEAKIKVEELQRQLA DASTQRGRLQTESGELSRLLEEKECLISQLSRGKALAAQSLEELRRQLEEESKAKSALAH AVQALRHDCDLLREQHEEEAEAQAELQRLLSKANAEVAQWRSKYEADAIQRTEELEEAKK KLALRLQEAEEGVEAANAKCSSLEKAKLRLQTESEDVTLELERATSAAAALDKKQRHLER ALEERRRQEEEMQRELEAAQRESRGLGTELFRLRHGHEEALEALETLKRENKNLQEEISD LTDQVSLSGKSIQELEKTKKALEGEKSEIQAALEEAEGALELEETKTLRIQLELSQVKAE VDRKLAEKDEECANLRRNHQRAVESLQASLDAETRARNEALRLKKKMEGDLNDLELQLGH ATRQATEAQAATRLMQAQLKEEQAGRDEEQRLAAELHEQAQALERRASLLAAELEELRAA LEQGERSRRLAEQELLEATERLNLLHSQNTGLLNQKKKLEADLAQLSGEVEEAAQERREA EEKAKKAITDAAMMAEELKKEQDTSAHLERMKKTLEQTVRELQARLEEAEQAALRGGKKQ VQKLEAKVRELEAELDAEQKKHAEALKGVRKHERRVKELAYQAEEDRKNLARMQDLVDKL QSKVKSYKRQFEEAVSALGPGHLDRAPQLCPRVCGQEQQANTNLAKYRKAQHELDDAEER ADMAETQANKLRARTRDALGPKHKE >gi568815578f:34877630_35102085|GENSCAN_predicted_CDS_3|6018_bp atgggtggccgcgagtcttctgaatctggagtggaggaagccctggtgtttgagggtgtg catggtttccagctcctcctccttcaccccagtgccactgccatgatggatgtgagtgaa cttggggagtctgcccgctacctccgccagggctaccaggagatgacgaaggtgcacact atcccatgggacgggaagaagcgagtctgggtgcctgatgaacaggacgcctacgtggag gccgaggtcaagtcggaggctaccgggggcagagtcaccgtggagaccaaagaccagaag gtgctgatggtgcgtgaagccgagctgcagcccatgaacccgcctcgcttcgacttactg gaggacatggccatgatgacgcacctgaacgaggcctctgtgctgcacaacctgcgccag cgctatgcccgctggatgatctatacctactcaggcctcttctgtgtcaccatcaacccc tacaaatggctcccagtctatacggcctccgtagtggctgcttacaagggaaagcgccgc tcagattccccgccccatatatatgcggtggcggacaacgcctacaacgacatgctgcgc aaccgagacaaccagtccatgctgatcaccggagagtcgggggccggtaagacggttaac accaagcgggtcattcagtactttgccatcgtcgctgccctgggagacgggccgggcaag aaggcccaatttctggcaacaaagacggggggcacccttgaggatcaaatcatcgaggcc aaccctgccatggaggcctttggcaacgccaagaccctgaggaatgataactcctcccgc tttggcaagttcatccgcattcactttggtccctctgggaagctggcatccgcggatatt gacagctatctcctggagaagtcgcgggtgatcttccagttgcctggtgagcgcagctac catgtctactaccagatcctctcagggaggaagccagagctgcagggtgaggggcagtac gatgaaggggacatgctgcttctgtctatgaacccctatgactaccacttctgcagccag ggcgtcatcaccgtggacaacatgaatgatggggaggagctcatcgccaccgaccatgcc atggacatcctaggcttcagcgtggatgagaaatgtgcctgctataagatcgtgggcgcc ctcctgcactttggcaacatgaagttcaagcagaagcagcgggaggagcaggcggaggcc gatggcactgagagtgctgacaaggctgcctacctgatgggggtcagcagtggggacctc ctcaaaggccttttgcacccccgggtgcgtgtagggaacgagtacgtgaccaagggccag agtgtggagcaggtggtgtttgctgtgggggctctggccaaggccacctatgaccggctg ttcaggtggctggtgtctcggatcaaccagaccctggacacaaagctgccccggcagttc ttcatcggggttctggacatcgctgggtttgagatctttgagttcaacagcttcgaacag ctgtgcatcaacttcaccaatgagaaattgcagcagttcttcaaccagcacatgtttgtg ctggagcaggaggagtacaagcgggagggcatcgactgggtcttcatcgacttcggcctt gacctgcagccttgcatcgacctcatcgagaagccactgggcatcctgtccatcctggag gaggaatgcatgttccccaaggcctcagacgccagcttccgggccaagctctacgacaac cacgcggggaagtcacccaatttccagcagcctcggcctgacaagaagcgcaagtaccag gcccacttcgaggtggtccactacgcaggcgtggtgccttacagcattgtgggctggctg gagaaaaacaaggatcccctgaatgagaccgtggtccccatcttccagaagtcacagaat aggctcctggcgactctctatgagaattatgcgggctcctgctccactgagccccccaag tctggggtgaaagagaagcgtaagaaggcagcatcgttccagacggtgtcccagctgcac aaggagaacctcaacaagctgatgaccaacctgcgggccacacagccccacttcgtccgc tgcattgtccccaacgagaacaaaaccccaggggtcatggatgccttcttggtgctacac cagctgcgctgcaatggggtcctggaggggatccggatctgccgccaagggttccccaac aggttgctctacaccgacttccggcagcggtaccgtatcctgaaccccagtgccatcccg gatgacaccttcatggacagcaggaaggccacagagaaactgctgggctcgctggacttg gatcacacccagtaccagtttggccacaccaaggtgttcttcaaggctgggcttctaggc gtcctggaagagctccgtgaccagcgcctggccaaggtgctgacgctgctgcaggcgcgg agccgtggccgcctcatgcgccttgagtaccagcgcctgctgggaggcagggatgcgctg ttcaccatccagtggaacatccgtgccttcaatgccgtcaagaactggtcatggatgaag ctctttttcaagatgaagccgctgctgcgctcggcgcaggctgaggaggagctggcggcc ctgcgggcagagctgcgggggttgcgaggggcgctggctgcggccgaggccaagcgccag gaactggaggagacgcacgtcagcatcacccaggagaagaatgacctggccctgcagctg caggctgagcaggacaacctggcagatgccgaggagcgctgccacttgctgatcaagtcc aaggtgcagctggaggggaaggtgaaggagctgagtgagcggctggaggatgaggaggag gtgaacgctgacctggccgcccgccggcgcaagctggaggacgagtgcacggagctcaag aaggacattgatgacctggagctgacactggccaaagctgagaaggagaagcaagccact gagaacaaggtgaagaacctgacggaagagatggctgcgctggacgagtcagtggcccgg ctgaccaaggagaagaaggcgttgcaggaggcccaccaacaggccctgggtgacctgcag gccgaggaggaccgtgtgagcgcgctgaccaaggccaagctccggctggagcaacaggtg gaggacctggaatgctccctggagcaggagaagaagctgcgcatggacacggagcgggcc aagcgcaagctggagggtgacctgaagctgacgcaggagtcggtggctgatgctgctcaa gacaagcagcagctggaggagaagctcaagaagaaggactccgagctgagccagctgagc ctgcgggtggaagacgagcagctcttgggggcccagatgcagaagaagatcaaggagctg caggctcgggcggaggagctggaagaggagctggaggcagagcgggcagcccgggcccgc gtggagaagcagcgtgcagaggcggcgcgggagctggaggagctgagcgagcggctggag gaggcaggcggcgcatccgcggggcagcgcgagggctgccgcaagcgggaggcggagctg gggaggctgcggcgggagctggaggaggcggcgctgcggcacgaggccacagtggcggca ctgcggcgcaagcaggcggagggcgcggcggagctgggggagcaggtggacagcctgcag cgggtgcggcagaagctggagaaggagaagagtgagctgcgcatggaggtggacgacctg gctgccaacgtggagactctgacccgcgccaaggccagtgcagagaagctgtgccggacc tatgaggatcagctaagcgaggccaagatcaaggtggaggagctgcagcggcagctggcg gacgcaagcacgcagcgtgggcgactacagacggaaagcggggagctgagtcgcctgcta gaggagaaggagtgtctgatcagtcagctgagccgtggaaaggccctggccgcccaaagc ctggaagagttgcggcgccagctagaggaggaaagcaaggccaagagtgccctggcccac gccgtgcaggctctgcggcacgactgtgacctcctgcgggagcaacacgaggaggaggct gaggcccaggctgagctgcagcggctgctgtccaaggccaatgccgaggtggcccagtgg aggagcaagtacgaagcagatgccatccagaggaccgaggagctggaggaggccaaaaaa aagctggcactgcggctgcaggaggcagaggagggcgtggaggctgccaacgccaagtgc tcatcgttggagaaggccaagctgcggctacagacagagtcagaggatgtaaccctggag ctggagcgggcgacctcagcagctgctgcgctggacaagaagcagcggcacttggaacgg gcactggaggaacggcggcggcaggaggaggagatgcagcgggagctggaggcggcacag agggagtcccgtggcctgggcaccgagctcttccggctgcggcacggccacgaggaggca cttgaagccctggagacgctcaagcgggagaacaagaacctgcaggaggagatcagcgac ctcacagaccaggtgagtctcagtgggaagagcatccaggaactggagaaaaccaagaag gcgctggaaggcgagaagagtgagatccaggctgcactggaggaggcagagggggccctg gagctggaggagaccaagacgctgcggatccagctggagctctcccaggtcaaagcagaa gtggaccggaagctggcagagaaagacgaggagtgcgctaacctgaggcgcaaccaccag cgagctgtggagtccctgcaggcctccctggatgcagagacacgggcccgcaatgaggcg ctgcggctcaagaagaagatggagggtgacctcaacgacctggagctgcagctgggccat gccacccgtcaggccacagaggcccaggctgccacgcggctgatgcaggcacagctcaag gaggagcaggcagggcgggacgaggagcagcggctggcagctgagctccacgagcaggcg caggctctggagcgccgggcctcgctgctggctgcggagctggaggagctgcgggctgcc ctggagcagggcgagcgcagccggcgactggcagagcaggagcttttggaggccaccgag cgcctcaaccttctgcattcgcagaacacaggcctcctaaaccagaagaagaagctggag gcggacttggcccagctgagcggggaggtggaggaggctgcacaggagaggcgggaggct gaggagaaggccaaaaaggccatcactgatgcggccatgatggccgaggagctgaagaag gagcaggacacaagtgcacacctggaacggatgaagaagacgctggagcagacggtgcgc gagctccaggcccgccttgaggaggcagaacaggccgccctccgtggcgggaagaagcag gtgcagaagctggaggccaaggtacgggagctggaggctgagcttgatgcagagcagaag aagcacgccgaggcccttaagggcgtgcgcaagcatgagcgccgtgtcaaggagctcgca taccaggccgaggaggacaggaagaacctggctcgcatgcaggacctggtggacaagctg cagagcaaggtcaagagctacaagcgccagtttgaggaggcggtgagtgcgctggggcct ggacacctggaccgggcaccccagctctgccccagggtctgtggccaggagcagcaggcc aacaccaacctggccaagtatcgcaaggcccagcacgagctggatgatgcggaggagcgg gcagacatggcggaaacccaggccaacaagctgcgggcacggacccgggacgccctgggc cccaagcacaaggagtga >gi568815578f:34877630_35102085|GENSCAN_predicted_peptide_4|799_aa MAEGKGEAGTFFTGQQDRLLFVFFIITIVVGLFRCSVFSSSSSSSSLSSSSSSSSSSSLS SSFLHLFFIFIFVFFVFIFFVFIFVFVFAFFFIFFVFVVFFVFFFFVLLVEILMRPTISI RGQKLKISDEMSKDCLSILYNTCVCTEGVTKRLAEKNDFVIFLFTLMTSKKTFLQTATLI EDILGVKKEMIRLDEVPNLSSLVSNFDQQQLANFCRILAVTISEMDTGNDDKHTLLAKNA QQKKSLSLGPSAAEINQAALLSIPGFVERLCKLATRKVSESTGTASFLQELEEWYTWLDN ALVLDALMRVANEESEHNQGASEENGLPHTSARTQLPQSMKIMHEIMYKLEVLYVLCVLL MGRQRNQVHRMIAEFKLIPGLNNLFDKLIWRKHSASALVLHGHNQNCDCSPPPQELSMEV VGPLLPLESMDFSTYCPSKARVDSVAIPRGRQELQTGKADCLLSSWRILPSTRRNKYLLL NNQELNELSAISLKANIPEVEAVLNTDRSLVCDGKRGLLTRLLQVMKKEPAESSFRFWQA RAVESFLRGTTSYADQMFLLKRGLLEHILYCIVDSECKSRDVLQSYFDLLGELMKFNVDA FKRFNKYINTDAKVFLKQINSSLVDSNMLVRCVTLSLDRFENQVDMKVAEVLSECRLLAY ISQVPTQMSFLFRLINIIHVQTLTQENVSCLNTSLVILMLARRKERLPLYLRLLQRMEHS KKYPGFLLNNFHNLLRFWQQHYLHKDKDSTCLENSSCISFSYWKETVSILLNPDRQSPSA LVSYIEEPYMDIDRDFTEE >gi568815578f:34877630_35102085|GENSCAN_predicted_CDS_4|2400_bp atggcagaaggcaaaggagaagcaggcaccttcttcacagggcagcaggacagattacta tttgtctttttcattataaccatcgtagtagggctgttcagatgttctgttttttcttct tcttcgtcttcttcgtctttgtcttcttcgtcttcgtcttcttcatcttcatctttgtct tcgtcttttcttcatcttttcttcatcttcatcttcgtcttctttgtcttcatcttcttc gtcttcatctttgtcttcgtcttcgccttcttcttcatcttctttgtcttcgtcgtcttc ttcgtcttcttcttcttcgtccttcttgtagaaattcttatgaggcctacgatctctatc cggggacagaaactgaaaataagtgatgaaatgtccaaggactgcttgagtatcctgtat aatacctgtgtctgtacagagggagttacaaagcgtttggcagaaaagaatgactttgtg atcttcctgtttacattgatgacaagtaagaagacattcttacaaacagcaaccctcatt gaagatattttgggtgttaaaaaggaaatgatccgactagatgaagtccccaatctgagt tccttagtatccaatttcgatcagcagcagctcgctaatttctgccggattctggctgtc accatttcagagatggatacagggaatgatgacaagcacacgcttcttgccaaaaatgct caacagaagaagagcttgagtttggggccttctgcagctgaaatcaatcaagcggccctt ctcagcattcctggctttgttgagcggctttgcaaactggcgactcgaaaggtgtcagag tcaacgggcacagccagcttccttcaggagttggaagagtggtacacatggctagacaat gctttggtgctagatgccctgatgcgagtggccaatgaggagtcagagcacaatcaaggg gcttctgaggagaatggcctgcctcacacgtcagccagaacccagctgccccagtcaatg aagattatgcatgagatcatgtacaaactggaagtgctctatgtcctctgcgtgctgctg atggggcgtcagcgaaaccaggttcacagaatgattgcagagttcaagctgatccctgga cttaataatttgtttgacaaactgatttggaggaagcattcagcatctgcccttgtcctc catggtcacaaccagaactgtgactgtagcccgccaccccaggaactgagtatggaagtc gttggccccttactacctttggaatcaatggatttctcaacctactgcccctctaaagca agagtggacagcgtggctattcccaggggccggcaggaacttcagacaggcaaagccgac tgcttactgagcagctggaggattcttccctccacaaggaggaacaagtacttgttactc aacaaccaggagctgaatgaactcagtgccatctctctcaaggccaacatccctgaggtg gaagctgtcctcaacaccgacaggagtttggtgtgtgatgggaagaggggcttattaact cgtctgctgcaggtcatgaagaaggagccagcagagtcgtctttcaggttttggcaagct cgggctgtggagagtttcctccgagggaccacctcctatgcagaccagatgttcctgctg aagcgaggcctcttggagcacatcctttactgcattgtggacagcgagtgtaagtcaagg gatgtgctccagagttactttgacctcctgggggagctgatgaagttcaacgttgatgca ttcaagagattcaataaatatatcaacaccgatgcaaaggtattcctgaagcagatcaac agctccctggtggactccaacatgctggtgcgctgtgtcactctgtccctggaccgattt gaaaaccaggtggatatgaaagttgccgaggtactgtctgaatgccgcctgctcgcctac atatcccaggtgcccacgcagatgtccttcctcttccgcctcatcaacatcatccacgtg cagacgctgacccaggagaacgtcagctgcctcaacaccagcctggtgatcctgatgctg gcccgacggaaagagcggctgcccctgtacctgcggctgctgcagcggatggagcacagc aagaagtaccccggcttcctgctcaacaacttccacaacctgctgcgcttctggcagcag cactacctgcacaaggacaaggacagcacctgcctagagaacagctcctgcatcagcttc tcatactggaaggagacagtgtccatcctgttgaacccggaccggcagtcaccctctgct ctcgttagctacattgaggagccctacatggacatagacagggacttcactgaggagtga >gi568815578f:34877630_35102085|GENSCAN_predicted_peptide_5|427_aa MAAAPVAAGSGAGRGRRSAATVAAWGGWGGRPRPGNILLQLRQGQLTGRGLVRAVQFTET FLTERDKQSKWSGIPQLLLKLHTTSHLHSDFVECQNILKPLLVIPRQTGSGVDLQQTLTD LQLRVLTVRRKTNKQKGHPHQNPICMSPSSKTKEAKNLDKRLDEWLTRINSIEKTLNDLM ELNTMARKLRDACTSFSSQFDQVEERVSVIEDQMNEMKREEKFREKRIKRNKQSLQEIWD CVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQVNIQIQEIQRTPQRYSSRR GTPRHIIVRFTKVEIKEKILRAAREKGRVTHKGKPIRLTAVLLAETLQARREWGPIFNIL KEKNFQPRVSYPAKLSFISEGEVKSFTDKQMLRDFVTTRPALQELLKEALNMERNNRYQL LQKHAKL >gi568815578f:34877630_35102085|GENSCAN_predicted_CDS_5|1284_bp atggcggcggcgccggtagcggctgggtctggagccggccgagggagacggtcggcagcc acagtggcggcttggggcggatggggcggccggccgcggcctggtaacattctgctgcag ctgcggcagggccagctgaccggccggggcctggtccgggcggtgcagttcactgagact tttttgacggagagggacaaacaatccaagtggagtggaattcctcagctgctcctcaag ctgcacaccaccagccacctccacagtgactttgttgagtgtcaaaacatcctcaagcct ctgctggtgatacccagacaaacagggtctggagtggacctccagcaaactctgacagac ctgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaaggacatccacac caaaaccccatctgtatgtcaccatcatcaaagaccaaagaagctaaaaaccttgacaaa agattagacgaatggctaactagaataaacagcatagaaaagaccttaaatgacctgatg gagctgaacaccatggctcgaaaactacgtgacgcatgcacaagcttcagtagccaattc gatcaagtggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaa gagaagtttagagaaaaaagaataaaaagaaacaaacaaagcctccaagaaatatgggac tgtgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacggggagaatgga accaagttggaaaacactcttcaggatattatccaggagaacttccccaatctagcaagg caggtcaacattcaaattcaggaaatacagagaacgccacaaagatactcctcgaggaga ggaactccaagacacataattgtcagattcaccaaagttgaaattaaggaaaaaatatta agggcagccagagagaaaggtcgggttacccacaaagggaagcccatcagactaacagcg gttctcttggcagaaactctacaagccagaagagagtgggggccaatattcaacattctt aaagaaaagaattttcaacccagagtttcatatccagccaaactaagcttcataagtgaa ggagaagtaaaatcctttacagacaagcaaatgctgagagattttgtcaccaccaggcct gccttacaagagctcctgaaggaagcgctaaacatggaaaggaacaaccgataccaacta ctgcaaaaacatgccaaattgtaa