GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:01:32 Sequence gi568815578r:34828831_35051852 : 223022 bp : 47.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 721 716 6 1.05 1.01 Sngl - 5101 4745 357 1 0 67 36 408 0.514 29.56 1.00 Prom - 11908 11869 40 -2.96 2.15 PlyA - 12065 12060 6 1.05 2.14 Term - 16661 16498 164 0 2 40 43 222 0.999 11.00 2.13 Intr - 21230 21131 100 2 1 110 79 119 0.996 12.88 2.12 Intr - 22538 22401 138 2 0 60 94 73 0.981 5.76 2.11 Intr - 23442 23325 118 2 1 76 62 149 0.987 11.57 2.10 Intr - 23708 23559 150 1 0 110 57 181 0.963 16.48 2.09 Intr - 25789 25701 89 1 2 94 94 98 0.990 9.77 2.08 Intr - 26093 25966 128 0 2 127 94 118 0.993 16.70 2.07 Intr - 28063 27976 88 1 1 105 45 93 0.973 6.24 2.06 Intr - 30809 30613 197 0 2 102 107 248 0.995 27.23 2.05 Intr - 31247 31139 109 2 1 117 68 91 0.930 9.86 2.04 Intr - 32732 32603 130 1 1 93 85 92 0.607 10.10 2.03 Intr - 34135 33984 152 1 2 125 90 134 0.999 16.26 2.02 Intr - 34718 34483 236 0 2 79 94 473 0.998 44.41 2.01 Init - 43985 43817 169 2 1 96 93 350 0.827 36.00 2.00 Prom - 45952 45913 40 -10.25 3.00 Prom + 46961 47000 40 -8.36 3.01 Init + 47816 47993 178 1 1 82 60 239 0.968 19.93 3.02 Intr + 53964 54159 196 1 1 97 58 209 0.277 17.27 3.03 Intr + 84266 84357 92 2 2 90 96 69 0.913 7.54 3.04 Intr + 84563 84666 104 0 2 89 46 119 0.859 7.69 3.05 Intr + 84923 84995 73 1 1 104 62 38 0.893 1.78 3.06 Intr + 85266 85341 76 1 1 69 100 113 0.997 9.17 3.07 Intr + 85493 85607 115 2 1 78 109 25 0.989 4.05 3.08 Intr + 86388 86426 39 2 0 86 68 43 0.579 0.52 3.09 Intr + 90605 90748 144 1 0 105 12 141 0.657 8.68 3.10 Intr + 91709 91879 171 1 0 149 77 208 0.999 25.94 3.11 Intr + 92182 92309 128 0 2 -64 52 232 0.635 3.88 3.12 Intr + 92500 92632 133 1 1 95 79 98 0.985 10.25 3.13 Intr + 92956 93036 81 0 0 93 81 63 0.984 5.93 3.14 Intr + 94493 94601 109 1 1 118 89 73 0.999 10.26 3.15 Intr + 96868 96936 69 2 0 63 81 111 0.992 7.05 3.16 Intr + 97275 97451 177 1 0 45 61 242 0.994 17.09 3.17 Intr + 98047 98121 75 2 0 32 91 77 0.783 1.89 3.18 Term + 98257 98384 128 2 2 54 47 159 0.998 6.84 3.19 PlyA + 99110 99115 6 1.05 4.14 PlyA - 99625 99620 6 1.05 4.13 Term - 100121 99998 124 1 1 82 55 164 0.995 10.36 4.12 Intr - 100760 100571 190 0 1 28 99 235 0.999 17.24 4.11 Intr - 102587 102506 82 2 1 108 92 126 0.995 14.21 4.10 Intr - 103303 103109 195 1 0 103 76 122 0.987 12.11 4.09 Intr - 106812 106746 67 2 1 92 77 86 0.972 6.81 4.08 Intr - 108010 107933 78 0 0 79 49 93 0.897 3.17 4.07 Intr - 108193 108113 81 0 0 67 111 50 0.937 3.95 4.06 Intr - 112999 112883 117 0 0 67 80 72 0.913 3.88 4.05 Intr - 113797 113658 140 1 2 86 80 139 0.986 12.16 4.04 Intr - 114176 114101 76 1 1 89 91 105 0.999 10.42 4.03 Intr - 117268 117123 146 1 2 108 91 174 0.999 18.78 4.02 Intr - 123030 122894 137 1 2 135 99 133 0.989 19.39 4.01 Init - 140126 140027 100 2 1 36 110 76 0.341 5.02 4.00 Prom - 143979 143940 40 -8.76 5.00 Prom + 145889 145928 40 -7.26 5.01 Init + 148920 148982 63 2 0 65 24 90 0.547 1.45 5.02 Intr + 149137 149266 130 0 1 69 127 142 0.419 16.47 5.03 Intr + 150560 150666 107 0 2 68 20 200 0.991 11.13 5.04 Intr + 150831 150974 144 2 0 85 47 282 0.999 24.28 5.05 Intr + 151748 151904 157 1 1 84 64 189 0.985 15.68 5.06 Intr + 152203 152230 28 2 1 130 94 37 0.974 5.67 5.07 Intr + 153629 153725 97 2 1 44 60 118 0.973 4.61 5.08 Intr + 155862 155885 24 2 0 129 84 29 0.857 4.82 5.09 Intr + 156024 156116 93 2 0 82 59 148 0.999 11.46 5.10 Intr + 156236 156299 64 1 1 107 68 105 0.998 8.69 5.11 Intr + 157270 157392 123 2 0 58 27 173 0.774 8.76 5.12 Intr + 158056 158159 104 2 2 119 15 223 0.999 17.99 5.13 Intr + 158319 158457 139 2 1 109 42 237 0.999 21.24 5.14 Intr + 158727 158845 119 1 2 89 99 150 0.977 16.38 5.15 Intr + 158935 159084 150 0 0 84 78 261 0.999 25.06 5.16 Intr + 159262 159432 171 0 0 71 98 426 0.996 42.04 5.17 Intr + 160910 161089 180 1 0 77 66 359 0.999 32.66 5.18 Intr + 161184 161316 133 2 1 94 81 147 0.945 14.82 5.19 Intr + 161404 161480 77 2 2 76 116 88 0.999 9.63 5.20 Intr + 161908 161995 88 0 1 79 66 175 0.997 13.94 5.21 Intr + 162174 162291 118 1 1 126 64 222 0.997 23.12 5.22 Intr + 164272 164395 124 1 1 131 58 221 0.989 23.99 5.23 Intr + 164504 164640 137 1 2 127 94 184 0.989 22.27 5.24 Intr + 165316 165571 256 1 1 83 68 528 0.999 47.75 5.25 Intr + 166506 166748 243 2 0 115 70 577 0.987 56.29 5.26 Intr + 167516 167692 177 1 0 63 86 373 0.661 34.72 5.27 Intr + 167783 167928 146 1 2 110 50 355 0.983 32.98 5.28 Intr + 168253 168343 91 1 1 75 96 186 0.780 18.10 5.29 Intr + 168421 168810 390 0 0 78 32 687 0.927 57.02 5.30 Intr + 169465 169591 127 0 1 50 77 219 0.931 17.25 5.31 Intr + 169681 169799 119 2 2 126 86 126 0.999 16.38 5.32 Intr + 169889 170085 197 1 2 95 89 414 0.999 40.41 5.33 Intr + 170226 170575 350 0 2 48 53 698 0.998 57.30 5.34 Intr + 170741 170865 125 0 2 56 55 213 0.998 15.10 5.35 Intr + 170961 171076 116 2 2 119 85 262 0.999 28.25 5.36 Intr + 171463 171859 397 1 1 99 75 803 0.999 74.68 5.37 Intr + 171938 172063 126 1 0 108 48 145 0.999 13.38 5.38 Intr + 172158 172328 171 2 0 105 93 343 0.983 36.64 5.39 Intr + 172415 172519 105 1 0 79 80 208 0.804 19.51 5.40 Intr + 172601 172762 162 1 0 72 94 242 0.555 23.37 5.41 Intr + 173118 173255 138 2 0 42 100 290 0.987 26.26 5.42 Term + 173347 173358 12 0 0 97 49 10 0.769 -3.80 5.43 PlyA + 173481 173486 6 1.05 6.15 PlyA - 173622 173617 6 1.05 6.14 Term - 174453 174316 138 0 0 77 43 149 0.948 7.26 6.13 Intr - 174786 174580 207 0 0 102 92 505 0.999 51.57 6.12 Intr - 175740 175628 113 1 2 120 94 213 0.998 25.20 6.11 Intr - 176967 176865 103 0 1 66 83 201 0.633 17.15 6.10 Intr - 177745 177605 141 1 0 86 98 161 0.931 17.45 6.09 Intr - 178810 178720 91 0 1 45 64 119 0.961 5.20 6.08 Intr - 179917 179834 84 0 0 114 109 72 0.998 10.84 6.07 Intr - 181458 181357 102 2 0 103 109 198 0.768 22.69 6.06 Intr - 183955 183768 188 1 2 52 53 91 0.659 0.49 6.05 Intr - 187309 187178 132 1 0 103 119 72 0.999 12.64 6.04 Intr - 192502 192360 143 2 2 111 80 133 0.883 14.87 6.03 Intr - 206478 206293 186 1 0 82 86 186 0.856 17.46 6.02 Intr - 215882 215675 208 2 1 82 98 216 0.952 20.65 6.01 Intr - 221164 221036 129 1 0 89 105 87 0.995 11.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_1|118_aa MKDYGPAKGGKNDPNAPKRPLSGFFLFCSEFCPKIKSTNPGISIGDVAKKLGEMWNNLND SEKQPYVTKVAKLKKYEKDVADYKSKGKLDGTKGPAKVAWEKMEEEDEEDGEEEKEDE >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_1|357_bp atgaaggattatggaccagctaagggaggcaagaacgatcctaatgcccccaaaaggcca ctgtctggattcttcctgttctgttcagaattctgccccaagatcaaatccacaaaccct ggcatctctattggagacgtggcaaaaaagctgggtgagatgtggaataacttaaatgac agtgaaaagcagccttatgtcactaaggtggcaaagctgaagaagtatgagaaggatgtt gctgactataagtcgaaaggaaagttggacggcacaaaaggtcctgctaaagttgcctgg gaaaagatggaagaagaagatgaagaagatggggaggaagagaaggaggatgaataa >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_2|655_aa MAAENEASQESALGAYSPVDYMSITSFPRLPEDEPAPAAPLRGRKDEDAFLGDPDTDPDS FLKSARLQRLPSSSSEMGSQDGSPLRETRKDPFSAAAAECSCRQDGLTVIVTACLTFATG VTVALVMQIYFGDPQIFQQGAVVTDAARCTSLGIEVLSKQGSSVDAAVAAALCLGIVAPH SSGLGGGGVMLVHDIRRNESHLIDFRESAPGALREETLQRSWETKVGTLDLLHPLLSGPP RLPWSQVLAFAAAVAQDGFNVTHDLARALAEQLPPNMSERFRETFLPSGRPPLPGSLLHR PDLAEVLDVLGTSGPAAFYAGGNLTLEMVAEAQHAGGVITEEDFSNYSALVEKPVCGVYR GHLVLSPPPPHTGPALISALNILEGFNLTSLVSREQALHWVAETLKIALALASRLGDPVY DSTITESMDDMLSKVEAAYLRGHINDSQAAPAPLLPVYELDGAPTAAQVLIMGPDDFIVA MVSSLNQPFGSGLITPSGILLNSQMLDFSWPNRTANHSAPSLENSVQPGKRPLSFLLPTV VRPAEGLCGTYLALGANGAARGLSGLTQVLLNVLTLNRNLSDSLARGRLHPDLQSNLLQV DSEFTEEEIEFLEARGHHVEKVDVLSWVHGSRRTNNFIIAVKDPRSPDAAGATIL >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_2|1968_bp atggcggcggagaacgaggccagccaggagagcgccctgggcgcctactcgccagtggac tacatgagcatcaccagcttcccgcggctgcccgaggacgagccggcgcccgcggccccg ctgaggggccgcaaggacgaggacgcctttctgggagaccccgacaccgacccggactcc ttcctgaagtctgcacggctgcagcggctgccatcgtcgtcgtcggagatgggcagccaa gacgggtcgccgctacgcgagacgcgcaaagacccgttctccgccgcagcggccgagtgc tcctgccgccaggatgggctcacggtcatcgtcacggcctgtctcaccttcgctaccggt gtcaccgtggcgctggtcatgcagatctacttcggggacccccagatcttccagcagggt gccgtggtgaccgatgctgcccgctgcacttcactgggcatcgaggtgctcagtaaacag ggatcttctgtggacgcagcggtggcagcagccttgtgtttgggtatcgtggctccacac agttctggcctgggcggtgggggcgtgatgctggtacatgacatccgacgaaatgagagc cacctaattgatttccgggagtccgcaccaggggccctcagggaagagaccctgcaaaga tcctgggagaccaaggtggggaccctggacctcctccaccccctgctctccggccccccc aggctgccatggtcccaagtcctggcctttgcagcagctgtggcccaagatggcttcaac gtgactcatgatctagcccgtgccctggctgaacagctgccacccaacatgtccgagcgc ttccgggagacgttcctgccatcgggccgcccgccactacctggctcgttgctgcatcgg cccgacctggctgaggtgctggatgtacttggcacctccggcccggctgccttctacgca ggtggcaacctcacactggagatggtggccgaggctcagcacgcagggggtgtcataacc gaagaggacttcagcaattacagcgcccttgtggagaagcctgtgtgtggcgtgtacaga ggccacctggttcttagtcccccacctccgcacacgggccctgccctcatcagtgctctc aacatcctggagggcttcaatctcaccagcctggtatcccgagaacaggctcttcactgg gtggcagagaccctgaagattgcattagccctggccagcagactgggagatcccgtctat gattctaccatcactgagagcatggatgacatgctcagcaaggtggaggccgcctacctc cggggccatatcaatgactcccaggcagcccctgccccactcctgcctgtctatgaacta gacggagctcccacggctgcccaggtgctgatcatgggacctgatgacttcattgtggcc atggttagctccctgaaccagccctttggcagcggccttatcaccccctcggggatcctg ctcaacagccagatgctggacttctcctggcccaaccggacagctaaccactctgcaccc agcctggagaattcagtgcagccagggaagcggccactctctttcctgctgcccacagtg gtccgacccgcggaggggctctgtggaacctacctcgctctgggggccaatggagctgcg cggggcctcagcggcctgacacaggttctgctgaatgtcctgaccttgaaccggaacctg agtgacagcctggcccgcggccgcctacacccggacctgcagtccaacctcctgcaggtg gacagtgagttcacagaggaagagattgagttcctggaagccaggggtcaccacgtggag aaagtagatgtcttatcctgggtccatggcagccgaaggaccaacaacttcatcatcgct gttaaggaccctcggagcccagatgcagctggagccaccatcctgtag >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_3|695_aa MGLPEERVRSGSGSRGQEEAGAGGRARSWSPPPEVSRSAHVPSLQRYRELHRRSVEEPRE FWGDIAKEFYWKTPCPGPFLRYNFDVTKGKIFIEWMKGATTNICYNVLDRNVHEKKLGDK VAFYWEGNEPGETTQITYHQLLVQVCQFSNVLRKQGIQKGDRVAIYMPMIPELVVAMLAC ARIGALHSIVFAGFSSESLCERILDSSCSLLITTDAFYRGEKLVNLKELADEALQKCQEK GFPVRCCIVVKHLGRAELGMGDSTSQSPPIKRSCPDVQGKLKEKSKRVQPQISWNQGIDL WWHELMQEAGDECEPEWCDAEDPLFILYTSGSTGKPKASGVVHTVGGYMLYVATTFKYVF DFHAEDVFWCTADIGWITGHSYVTYGPLANGATSVLGIPTYPDVNRLWSIVDKYKVTKFY TAPTAIRLLMKFGDEPVTKHSRASLQVLGTVGEPINPEAWLWYHRVVGAQRCPIVDTFWQ TETTFPFFGVAPAILNESGEELEGEAEGYLVFKQPWPGIMRTVYGNHERFETTYFKKFPG YYVTGDGCQRDQDGYYWITGRIDDMLNVSGHLLSTAEVESALVEHEAVAEAAVVGHPHPV KGECLYCFVTLCDGHTFSPKLTEELKKQIREKIGPIATPDYIQNAPGLPKTRSGKIMRRV LRKIAQNDHDLGDMSTVADPSVISHLFSHRCLTIQ >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_3|2088_bp atggggcttcctgaggagcgggtccggagcggcagcgggagccggggccaggaggaagct ggagccggaggccgggcgcggagttggtctccgccgcccgaggtcagccgctccgcgcac gtcccctcgctgcagcgctaccgcgagctgcaccggcgctccgtggaggagccgcgggaa ttctggggagacattgccaaggaattttactggaagactccatgccctggcccattcctt cggtacaactttgatgtgactaaagggaaaatcttcattgagtggatgaaaggagcaact accaacatctgctacaatgtactggatcgaaatgtccatgagaaaaagcttggagataaa gttgctttttactgggagggcaatgagccaggggagaccactcagatcacataccatcag cttctggtccaagtgtgtcagttcagcaatgttctccgaaaacagggcattcagaagggg gaccgagtggccatctacatgcctatgatcccagagcttgtggtggccatgctggcatgt gcccgcattggggctttgcactccattgtgtttgcaggcttctcttcagagtctctatgt gaacggatcttggattccagctgcagtcttctcatcactacagatgccttctacaggggg gaaaagcttgtgaacctgaaggagctggctgacgaggccctgcagaagtgtcaggagaag ggtttcccagtaagatgctgcattgtggtcaagcacctggggcgggcagagctcggcatg ggtgactccaccagccagtcccccccaattaagaggtcatgcccagatgtgcagggtaaa ctgaaagagaaatccaagcgtgttcagccccagatctcatggaaccaagggattgacttg tggtggcatgagctcatgcaagaggcaggggatgagtgtgagcccgagtggtgtgatgcc gaggacccactcttcatcctgtacaccagtggctccacaggcaaacccaaggcaagtggt gtggttcacacagttgggggctacatgctctatgtagccacaaccttcaagtatgtgttt gacttccatgcagaggatgtgttctggtgcacggcagacattggttggatcactggtcat tcctacgtcacctatgggccactggccaatggtgccaccagtgttttggggattcccaca tatccggacgtgaaccgcctgtggagcattgtggacaaatacaaggtgaccaagttctac acagcacccacagccatccgtctgctcatgaagtttggagatgagcctgtcaccaagcat agccgggcatccttgcaggtgttaggcacagtgggtgaacccatcaaccctgaggcctgg ctatggtaccaccgggtggtaggtgcccagcgctgccccatcgtggacaccttctggcaa acagagacaactttcccattctttggtgtagctcctgcaatcctgaatgagtccggggaa gagttggaaggtgaagctgaaggttatctggtgttcaagcagccctggccagggatcatg cgcacagtctatgggaaccacgaacgctttgagacaacctactttaagaagtttcctgga tactatgttacaggagatggctgccagcgggaccaggatggctattactggatcactggc aggattgatgacatgctcaatgtatctggacacctgctgagtacagcagaggtggagtca gcacttgtggaacatgaggctgttgcagaggcagctgtggtgggccaccctcatcctgtg aagggtgaatgcctctactgctttgtcaccttgtgtgatggccacaccttcagccccaag ctcaccgaggagctcaagaagcagattagagaaaagattggccccattgccacaccagac tacatccagaatgcacctggcttgcctaaaacccgctcagggaaaatcatgaggcgagtg cttcggaagattgctcagaatgaccatgacctcggggacatgtctactgtggctgaccca tctgtcatcagtcacctcttcagccaccgctgcctgaccatccagtga >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_4|510_aa MPAVAAAHIGFQNHSTPRSPVPPPAPCVLASLPGVGMATNWGSLLQDKQQLEELARQAVD RALAEGVLLRTSQEPTSSEVVSYAPFTLFPSLVPSALLEQAYAVQMDFNLLVDAVSQNAA FLEQTLSSTIKQDDFTARLFDIHKQVLKEGIAQTVFLGLNRSDYMFQRSADGSPALKQIE INTISASFGGLASRTPAVHRHVLSVLSKTKEAGKILSNNPSKGLALGIAKAWELYGSPNA LVLLIAQEKERNIFDQRAIENELLARNIHVIRRTFEDISEKGSLDQDRRLFVDGQEIAVV YFRDGYMPRQYSLQNWEARLLLERSHAAKCPDIATQLAGTKKVQQELSRPGMLEMLLPGQ PEAVARLRATFAGLYSLDVGEEGDQAIAEALAAPSRFVLKPQREGGGNNLYGEEMVQALK QLKDSEERASYILMEKIEPEPFENCLLRPGSPARVVQCISELGIFGVYVRQEKTLVMNKH VGHLLRTKAIEHADGGVAAGVAVLDNPYPV >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_4|1533_bp atgccagcagttgctgctgctcatatcggcttccagaaccactcaaccccaagaagccca gtcccgcccccagccccttgtgtgctggcctccctgccaggtgttgggatggccaccaac tgggggagcctcttgcaggataaacagcagctagaggagctggcacggcaggccgtggac cgggccctggctgagggagtattgctgaggacctcacaggagcccacttcctcggaggtg gtgagctatgccccattcacgctcttcccctcactggtccccagtgccctgctggagcaa gcctatgctgtgcagatggacttcaacctgctagtggatgctgtcagccagaacgctgcc ttcctggagcaaactctttccagcaccatcaaacaggatgactttaccgctcgtctcttt gacatccacaagcaagtcctaaaagagggcattgcccagactgtgttcctgggcctgaat cgctcagactacatgttccagcgcagcgcagatggctccccagccctgaaacagatcgaa atcaacaccatctctgccagctttgggggcctggcctcccggaccccagctgtgcaccga catgttctcagtgtcctgagtaagaccaaagaagctggcaagatcctctctaataatccc agcaagggactggccctgggaattgccaaagcctgggagctctacggctcacccaatgct ctggtgctactgattgctcaagagaaggaaagaaacatatttgaccagcgtgccatagag aatgagctactggccaggaacatccatgtgatccgacgaacatttgaagatatctctgaa aaggggtctctggaccaagaccgaaggctgtttgtggatggccaggaaattgctgtggtt tacttccgggatggctacatgcctcgtcagtacagtctacagaattgggaagcacgtcta ctgctggagaggtcacatgctgccaagtgcccagacattgccacccagctggctgggact aagaaggtgcagcaggagctaagcaggccgggcatgctggagatgttgctccctggccag cctgaggctgtggcccgcctccgcgccacctttgctggcctctactcactggatgtgggt gaagaaggggaccaggccatcgccgaggcccttgctgcccctagccggtttgtgctaaag ccccagagagagggtggaggtaacaacctatatggggaggaaatggtacaggccctgaaa cagctgaaggacagtgaggagagggcctcctacatcctcatggagaagatcgaacctgag ccttttgagaattgcctgctacggcctggcagccctgcccgagtggtccagtgcatttca gagctgggcatctttggggtctatgtcaggcaggaaaagacactcgtgatgaacaagcac gtggggcatctacttcgaaccaaagccatcgagcatgcagatggtggtgtggcagcggga gtggcagtcctggacaacccataccctgtgtga >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_5|2005_aa MGGRESSESGVEEALVFEGVHGFQLLLLHPSATAMMDVSELGESARYLRQGYQEMTKVHT IPWDGKKRVWVPDEQDAYVEAEVKSEATGGRVTVETKDQKVLMVREAELQPMNPPRFDLL EDMAMMTHLNEASVLHNLRQRYARWMIYTYSGLFCVTINPYKWLPVYTASVVAAYKGKRR SDSPPHIYAVADNAYNDMLRNRDNQSMLITGESGAGKTVNTKRVIQYFAIVAALGDGPGK KAQFLATKTGGTLEDQIIEANPAMEAFGNAKTLRNDNSSRFGKFIRIHFGPSGKLASADI DSYLLEKSRVIFQLPGERSYHVYYQILSGRKPELQGEGQYDEGDMLLLSMNPYDYHFCSQ GVITVDNMNDGEELIATDHAMDILGFSVDEKCACYKIVGALLHFGNMKFKQKQREEQAEA DGTESADKAAYLMGVSSGDLLKGLLHPRVRVGNEYVTKGQSVEQVVFAVGALAKATYDRL FRWLVSRINQTLDTKLPRQFFIGVLDIAGFEIFEFNSFEQLCINFTNEKLQQFFNQHMFV LEQEEYKREGIDWVFIDFGLDLQPCIDLIEKPLGILSILEEECMFPKASDASFRAKLYDN HAGKSPNFQQPRPDKKRKYQAHFEVVHYAGVVPYSIVGWLEKNKDPLNETVVPIFQKSQN RLLATLYENYAGSCSTEPPKSGVKEKRKKAASFQTVSQLHKENLNKLMTNLRATQPHFVR CIVPNENKTPGVMDAFLVLHQLRCNGVLEGIRICRQGFPNRLLYTDFRQRYRILNPSAIP DDTFMDSRKATEKLLGSLDLDHTQYQFGHTKVFFKAGLLGVLEELRDQRLAKVLTLLQAR SRGRLMRLEYQRLLGGRDALFTIQWNIRAFNAVKNWSWMKLFFKMKPLLRSAQAEEELAA LRAELRGLRGALAAAEAKRQELEETHVSITQEKNDLALQLQAEQDNLADAEERCHLLIKS KVQLEGKVKELSERLEDEEEVNADLAARRRKLEDECTELKKDIDDLELTLAKAEKEKQAT ENKVKNLTEEMAALDESVARLTKEKKALQEAHQQALGDLQAEEDRVSALTKAKLRLEQQV EDLECSLEQEKKLRMDTERAKRKLEGDLKLTQESVADAAQDKQQLEEKLKKKDSELSQLS LRVEDEQLLGAQMQKKIKELQARAEELEEELEAERAARARVEKQRAEAARELEELSERLE EAGGASAGQREGCRKREAELGRLRRELEEAALRHEATVAALRRKQAEGAAELGEQVDSLQ RVRQKLEKEKSELRMEVDDLAANVETLTRAKASAEKLCRTYEDQLSEAKIKVEELQRQLA DASTQRGRLQTESGELSRLLEEKECLISQLSRGKALAAQSLEELRRQLEEESKAKSALAH AVQALRHDCDLLREQHEEEAEAQAELQRLLSKANAEVAQWRSKYEADAIQRTEELEEAKK KLALRLQEAEEGVEAANAKCSSLEKAKLRLQTESEDVTLELERATSAAAALDKKQRHLER ALEERRRQEEEMQRELEAAQRESRGLGTELFRLRHGHEEALEALETLKRENKNLQEEISD LTDQVSLSGKSIQELEKTKKALEGEKSEIQAALEEAEGALELEETKTLRIQLELSQVKAE VDRKLAEKDEECANLRRNHQRAVESLQASLDAETRARNEALRLKKKMEGDLNDLELQLGH ATRQATEAQAATRLMQAQLKEEQAGRDEEQRLAAELHEQAQALERRASLLAAELEELRAA LEQGERSRRLAEQELLEATERLNLLHSQNTGLLNQKKKLEADLAQLSGEVEEAAQERREA EEKAKKAITDAAMMAEELKKEQDTSAHLERMKKTLEQTVRELQARLEEAEQAALRGGKKQ VQKLEAKVRELEAELDAEQKKHAEALKGVRKHERRVKELAYQAEEDRKNLARMQDLVDKL QSKVKSYKRQFEEAVSALGPGHLDRAPQLCPRVCGQEQQANTNLAKYRKAQHELDDAEER ADMAETQANKLRARTRDALGPKHKE >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_5|6018_bp atgggtggccgcgagtcttctgaatctggagtggaggaagccctggtgtttgagggtgtg catggtttccagctcctcctccttcaccccagtgccactgccatgatggatgtgagtgaa cttggggagtctgcccgctacctccgccagggctaccaggagatgacgaaggtgcacact atcccatgggacgggaagaagcgagtctgggtgcctgatgaacaggacgcctacgtggag gccgaggtcaagtcggaggctaccgggggcagagtcaccgtggagaccaaagaccagaag gtgctgatggtgcgtgaagccgagctgcagcccatgaacccgcctcgcttcgacttactg gaggacatggccatgatgacgcacctgaacgaggcctctgtgctgcacaacctgcgccag cgctatgcccgctggatgatctatacctactcaggcctcttctgtgtcaccatcaacccc tacaaatggctcccagtctatacggcctccgtagtggctgcttacaagggaaagcgccgc tcagattccccgccccatatatatgcggtggcggacaacgcctacaacgacatgctgcgc aaccgagacaaccagtccatgctgatcaccggagagtcgggggccggtaagacggttaac accaagcgggtcattcagtactttgccatcgtcgctgccctgggagacgggccgggcaag aaggcccaatttctggcaacaaagacggggggcacccttgaggatcaaatcatcgaggcc aaccctgccatggaggcctttggcaacgccaagaccctgaggaatgataactcctcccgc tttggcaagttcatccgcattcactttggtccctctgggaagctggcatccgcggatatt gacagctatctcctggagaagtcgcgggtgatcttccagttgcctggtgagcgcagctac catgtctactaccagatcctctcagggaggaagccagagctgcagggtgaggggcagtac gatgaaggggacatgctgcttctgtctatgaacccctatgactaccacttctgcagccag ggcgtcatcaccgtggacaacatgaatgatggggaggagctcatcgccaccgaccatgcc atggacatcctaggcttcagcgtggatgagaaatgtgcctgctataagatcgtgggcgcc ctcctgcactttggcaacatgaagttcaagcagaagcagcgggaggagcaggcggaggcc gatggcactgagagtgctgacaaggctgcctacctgatgggggtcagcagtggggacctc ctcaaaggccttttgcacccccgggtgcgtgtagggaacgagtacgtgaccaagggccag agtgtggagcaggtggtgtttgctgtgggggctctggccaaggccacctatgaccggctg ttcaggtggctggtgtctcggatcaaccagaccctggacacaaagctgccccggcagttc ttcatcggggttctggacatcgctgggtttgagatctttgagttcaacagcttcgaacag ctgtgcatcaacttcaccaatgagaaattgcagcagttcttcaaccagcacatgtttgtg ctggagcaggaggagtacaagcgggagggcatcgactgggtcttcatcgacttcggcctt gacctgcagccttgcatcgacctcatcgagaagccactgggcatcctgtccatcctggag gaggaatgcatgttccccaaggcctcagacgccagcttccgggccaagctctacgacaac cacgcggggaagtcacccaatttccagcagcctcggcctgacaagaagcgcaagtaccag gcccacttcgaggtggtccactacgcaggcgtggtgccttacagcattgtgggctggctg gagaaaaacaaggatcccctgaatgagaccgtggtccccatcttccagaagtcacagaat aggctcctggcgactctctatgagaattatgcgggctcctgctccactgagccccccaag tctggggtgaaagagaagcgtaagaaggcagcatcgttccagacggtgtcccagctgcac aaggagaacctcaacaagctgatgaccaacctgcgggccacacagccccacttcgtccgc tgcattgtccccaacgagaacaaaaccccaggggtcatggatgccttcttggtgctacac cagctgcgctgcaatggggtcctggaggggatccggatctgccgccaagggttccccaac aggttgctctacaccgacttccggcagcggtaccgtatcctgaaccccagtgccatcccg gatgacaccttcatggacagcaggaaggccacagagaaactgctgggctcgctggacttg gatcacacccagtaccagtttggccacaccaaggtgttcttcaaggctgggcttctaggc gtcctggaagagctccgtgaccagcgcctggccaaggtgctgacgctgctgcaggcgcgg agccgtggccgcctcatgcgccttgagtaccagcgcctgctgggaggcagggatgcgctg ttcaccatccagtggaacatccgtgccttcaatgccgtcaagaactggtcatggatgaag ctctttttcaagatgaagccgctgctgcgctcggcgcaggctgaggaggagctggcggcc ctgcgggcagagctgcgggggttgcgaggggcgctggctgcggccgaggccaagcgccag gaactggaggagacgcacgtcagcatcacccaggagaagaatgacctggccctgcagctg caggctgagcaggacaacctggcagatgccgaggagcgctgccacttgctgatcaagtcc aaggtgcagctggaggggaaggtgaaggagctgagtgagcggctggaggatgaggaggag gtgaacgctgacctggccgcccgccggcgcaagctggaggacgagtgcacggagctcaag aaggacattgatgacctggagctgacactggccaaagctgagaaggagaagcaagccact gagaacaaggtgaagaacctgacggaagagatggctgcgctggacgagtcagtggcccgg ctgaccaaggagaagaaggcgttgcaggaggcccaccaacaggccctgggtgacctgcag gccgaggaggaccgtgtgagcgcgctgaccaaggccaagctccggctggagcaacaggtg gaggacctggaatgctccctggagcaggagaagaagctgcgcatggacacggagcgggcc aagcgcaagctggagggtgacctgaagctgacgcaggagtcggtggctgatgctgctcaa gacaagcagcagctggaggagaagctcaagaagaaggactccgagctgagccagctgagc ctgcgggtggaagacgagcagctcttgggggcccagatgcagaagaagatcaaggagctg caggctcgggcggaggagctggaagaggagctggaggcagagcgggcagcccgggcccgc gtggagaagcagcgtgcagaggcggcgcgggagctggaggagctgagcgagcggctggag gaggcaggcggcgcatccgcggggcagcgcgagggctgccgcaagcgggaggcggagctg gggaggctgcggcgggagctggaggaggcggcgctgcggcacgaggccacagtggcggca ctgcggcgcaagcaggcggagggcgcggcggagctgggggagcaggtggacagcctgcag cgggtgcggcagaagctggagaaggagaagagtgagctgcgcatggaggtggacgacctg gctgccaacgtggagactctgacccgcgccaaggccagtgcagagaagctgtgccggacc tatgaggatcagctaagcgaggccaagatcaaggtggaggagctgcagcggcagctggcg gacgcaagcacgcagcgtgggcgactacagacggaaagcggggagctgagtcgcctgcta gaggagaaggagtgtctgatcagtcagctgagccgtggaaaggccctggccgcccaaagc ctggaagagttgcggcgccagctagaggaggaaagcaaggccaagagtgccctggcccac gccgtgcaggctctgcggcacgactgtgacctcctgcgggagcaacacgaggaggaggct gaggcccaggctgagctgcagcggctgctgtccaaggccaatgccgaggtggcccagtgg aggagcaagtacgaagcagatgccatccagaggaccgaggagctggaggaggccaaaaaa aagctggcactgcggctgcaggaggcagaggagggcgtggaggctgccaacgccaagtgc tcatcgttggagaaggccaagctgcggctacagacagagtcagaggatgtaaccctggag ctggagcgggcgacctcagcagctgctgcgctggacaagaagcagcggcacttggaacgg gcactggaggaacggcggcggcaggaggaggagatgcagcgggagctggaggcggcacag agggagtcccgtggcctgggcaccgagctcttccggctgcggcacggccacgaggaggca cttgaagccctggagacgctcaagcgggagaacaagaacctgcaggaggagatcagcgac ctcacagaccaggtgagtctcagtgggaagagcatccaggaactggagaaaaccaagaag gcgctggaaggcgagaagagtgagatccaggctgcactggaggaggcagagggggccctg gagctggaggagaccaagacgctgcggatccagctggagctctcccaggtcaaagcagaa gtggaccggaagctggcagagaaagacgaggagtgcgctaacctgaggcgcaaccaccag cgagctgtggagtccctgcaggcctccctggatgcagagacacgggcccgcaatgaggcg ctgcggctcaagaagaagatggagggtgacctcaacgacctggagctgcagctgggccat gccacccgtcaggccacagaggcccaggctgccacgcggctgatgcaggcacagctcaag gaggagcaggcagggcgggacgaggagcagcggctggcagctgagctccacgagcaggcg caggctctggagcgccgggcctcgctgctggctgcggagctggaggagctgcgggctgcc ctggagcagggcgagcgcagccggcgactggcagagcaggagcttttggaggccaccgag cgcctcaaccttctgcattcgcagaacacaggcctcctaaaccagaagaagaagctggag gcggacttggcccagctgagcggggaggtggaggaggctgcacaggagaggcgggaggct gaggagaaggccaaaaaggccatcactgatgcggccatgatggccgaggagctgaagaag gagcaggacacaagtgcacacctggaacggatgaagaagacgctggagcagacggtgcgc gagctccaggcccgccttgaggaggcagaacaggccgccctccgtggcgggaagaagcag gtgcagaagctggaggccaaggtacgggagctggaggctgagcttgatgcagagcagaag aagcacgccgaggcccttaagggcgtgcgcaagcatgagcgccgtgtcaaggagctcgca taccaggccgaggaggacaggaagaacctggctcgcatgcaggacctggtggacaagctg cagagcaaggtcaagagctacaagcgccagtttgaggaggcggtgagtgcgctggggcct ggacacctggaccgggcaccccagctctgccccagggtctgtggccaggagcagcaggcc aacaccaacctggccaagtatcgcaaggcccagcacgagctggatgatgcggaggagcgg gcagacatggcggaaacccaggccaacaagctgcgggcacggacccgggacgccctgggc cccaagcacaaggagtga >gi568815578r:34828831_35051852|GENSCAN_predicted_peptide_6|654_aa TEGVTKRLAEKNDFVIFLFTLMTSKKTFLQTATLIEDILGVKKEMIRLDEVPNLSSLVSN FDQQQLANFCRILAVTISEMDTGNDDKHTLLAKNAQQKKSLSLGPSAAEINQAALLSIPG FVERLCKLATRKVSESTGTASFLQELEEWYTWLDNALVLDALMRVANEESEHNQGASEEN GLPHTSARTQLPQSMKIMHEIMYKLEVLYVLCVLLMGRQRNQVHRMIAEFKLIPGLNNLF DKLIWRKHSASALVLHGHNQNCDCSPPPQELSMEVVGPLLPLESMDFSTYCPSKARVDSV AIPRGRQELQTGKADCLLSSWRILPSTRRNKYLLLNNQELNELSAISLKANIPEVEAVLN TDRSLVCDGKRGLLTRLLQVMKKEPAESSFRFWQARAVESFLRGTTSYADQMFLLKRGLL EHILYCIVDSECKSRDVLQSYFDLLGELMKFNVDAFKRFNKYINTDAKVFLKQINSSLVD SNMLVRCVTLSLDRFENQVDMKVAEVLSECRLLAYISQVPTQMSFLFRLINIIHVQTLTQ ENVSCLNTSLVILMLARRKERLPLYLRLLQRMEHSKKYPGFLLNNFHNLLRFWQQHYLHK DKDSTCLENSSCISFSYWKETVSILLNPDRQSPSALVSYIEEPYMDIDRDFTEE >gi568815578r:34828831_35051852|GENSCAN_predicted_CDS_6|1965_bp acagagggagttacaaagcgtttggcagaaaagaatgactttgtgatcttcctgtttaca ttgatgacaagtaagaagacattcttacaaacagcaaccctcattgaagatattttgggt gttaaaaaggaaatgatccgactagatgaagtccccaatctgagttccttagtatccaat ttcgatcagcagcagctcgctaatttctgccggattctggctgtcaccatttcagagatg gatacagggaatgatgacaagcacacgcttcttgccaaaaatgctcaacagaagaagagc ttgagtttggggccttctgcagctgaaatcaatcaagcggcccttctcagcattcctggc tttgttgagcggctttgcaaactggcgactcgaaaggtgtcagagtcaacgggcacagcc agcttccttcaggagttggaagagtggtacacatggctagacaatgctttggtgctagat gccctgatgcgagtggccaatgaggagtcagagcacaatcaaggggcttctgaggagaat ggcctgcctcacacgtcagccagaacccagctgccccagtcaatgaagattatgcatgag atcatgtacaaactggaagtgctctatgtcctctgcgtgctgctgatggggcgtcagcga aaccaggttcacagaatgattgcagagttcaagctgatccctggacttaataatttgttt gacaaactgatttggaggaagcattcagcatctgcccttgtcctccatggtcacaaccag aactgtgactgtagcccgccaccccaggaactgagtatggaagtcgttggccccttacta cctttggaatcaatggatttctcaacctactgcccctctaaagcaagagtggacagcgtg gctattcccaggggccggcaggaacttcagacaggcaaagccgactgcttactgagcagc tggaggattcttccctccacaaggaggaacaagtacttgttactcaacaaccaggagctg aatgaactcagtgccatctctctcaaggccaacatccctgaggtggaagctgtcctcaac accgacaggagtttggtgtgtgatgggaagaggggcttattaactcgtctgctgcaggtc atgaagaaggagccagcagagtcgtctttcaggttttggcaagctcgggctgtggagagt ttcctccgagggaccacctcctatgcagaccagatgttcctgctgaagcgaggcctcttg gagcacatcctttactgcattgtggacagcgagtgtaagtcaagggatgtgctccagagt tactttgacctcctgggggagctgatgaagttcaacgttgatgcattcaagagattcaat aaatatatcaacaccgatgcaaaggtattcctgaagcagatcaacagctccctggtggac tccaacatgctggtgcgctgtgtcactctgtccctggaccgatttgaaaaccaggtggat atgaaagttgccgaggtactgtctgaatgccgcctgctcgcctacatatcccaggtgccc acgcagatgtccttcctcttccgcctcatcaacatcatccacgtgcagacgctgacccag gagaacgtcagctgcctcaacaccagcctggtgatcctgatgctggcccgacggaaagag cggctgcccctgtacctgcggctgctgcagcggatggagcacagcaagaagtaccccggc ttcctgctcaacaacttccacaacctgctgcgcttctggcagcagcactacctgcacaag gacaaggacagcacctgcctagagaacagctcctgcatcagcttctcatactggaaggag acagtgtccatcctgttgaacccggaccggcagtcaccctctgctctcgttagctacatt gaggagccctacatggacatagacagggacttcactgaggagtga