GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:02:58 Sequence gi568815590r:22121356_22328357 : 207002 bp : 50.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 1253 1138 116 0 2 70 110 107 0.963 11.27 1.08 Intr - 1524 1435 90 1 0 128 84 70 0.986 10.57 1.07 Intr - 2458 2294 165 2 0 86 91 90 0.518 9.03 1.06 Intr - 4149 3956 194 2 2 135 61 174 0.794 18.54 1.05 Intr - 4871 4553 319 0 1 64 -6 175 0.708 0.72 1.04 Intr - 6474 5682 793 0 1 95 116 365 0.628 31.04 1.03 Intr - 7855 7204 652 0 1 44 113 345 0.520 24.51 1.02 Intr - 9248 9073 176 2 2 65 115 209 0.996 20.04 1.01 Init - 10494 10225 270 0 0 56 94 147 0.682 7.24 1.00 Prom - 14680 14641 40 -9.16 2.10 PlyA - 16683 16678 6 1.05 2.09 Term - 17197 17132 66 1 0 128 46 117 0.999 9.44 2.08 Intr - 17438 17284 155 0 2 90 78 216 0.848 20.59 2.07 Intr - 17706 17571 136 0 1 107 85 148 0.976 16.54 2.06 Intr - 18174 17875 300 0 0 90 68 309 0.631 26.03 2.05 Intr - 18728 18608 121 1 1 116 75 217 0.999 23.70 2.04 Intr - 18893 18817 77 2 2 118 60 18 0.993 0.31 2.03 Intr - 19191 19081 111 0 0 90 68 38 0.823 2.58 2.02 Intr - 19342 19252 91 0 1 93 57 71 0.662 4.50 2.01 Init - 20127 20096 32 0 2 98 59 51 0.823 0.58 2.00 Prom - 23553 23514 40 -4.96 3.09 PlyA - 25497 25492 6 1.05 3.08 Term - 27622 26805 818 2 2 91 50 1370 0.753 126.80 3.07 Intr - 30298 30134 165 2 0 100 82 136 0.974 14.13 3.06 Intr - 30645 30476 170 2 2 77 91 236 0.999 22.39 3.05 Intr - 32684 32613 72 1 0 90 61 137 0.998 9.72 3.04 Intr - 32858 32787 72 1 0 53 76 92 0.890 3.02 3.03 Intr - 33276 33205 72 2 0 110 12 116 0.958 4.72 3.02 Intr - 34108 34037 72 0 0 95 8 108 0.801 2.02 3.01 Init - 35187 34982 206 0 2 110 76 321 0.999 29.42 3.00 Prom - 36445 36406 40 -8.86 4.00 Prom + 38196 38235 40 -4.26 4.01 Init + 40474 40515 42 0 0 85 117 55 0.298 8.61 4.02 Intr + 41219 41377 159 1 0 71 81 259 0.552 23.68 4.03 Intr + 41725 41847 123 0 0 57 94 172 0.999 15.48 4.04 Intr + 42081 42191 111 2 0 38 89 174 0.985 13.08 4.05 Term + 42528 42686 159 2 0 48 48 111 0.689 1.04 4.06 PlyA + 43098 43103 6 1.05 5.00 Prom + 43105 43144 40 -16.60 5.01 Init + 44051 44198 148 1 1 100 88 459 0.999 45.35 5.02 Intr + 52247 52360 114 0 0 114 115 151 0.984 20.82 5.03 Intr + 54794 54958 165 0 0 80 93 66 0.581 6.23 5.04 Intr + 55178 55295 118 0 1 99 88 169 0.999 17.52 5.05 Intr + 55606 55784 179 1 2 82 96 362 0.900 35.96 5.06 Intr + 56497 56602 106 2 1 114 75 160 0.986 16.57 5.07 Intr + 58350 58474 125 0 2 121 71 154 0.937 17.23 5.08 Intr + 59013 59128 116 1 2 115 102 83 0.925 12.57 5.09 Intr + 63354 63429 76 2 1 50 67 25 0.225 -4.31 5.10 Intr + 67481 67596 116 0 2 99 59 81 0.650 6.47 5.11 Intr + 70694 70796 103 1 1 114 82 196 0.994 21.35 5.12 Intr + 72703 72819 117 2 0 84 79 165 0.999 15.64 5.13 Intr + 73090 73235 146 2 2 94 92 224 0.994 23.40 5.14 Intr + 73369 73564 196 0 1 33 46 441 0.998 33.39 5.15 Intr + 74107 74232 126 2 0 112 96 226 0.990 26.45 5.16 Intr + 75325 75485 161 2 2 121 95 301 0.991 33.81 5.17 Intr + 75885 76065 181 2 1 76 68 392 0.991 35.44 5.18 Intr + 80448 80573 126 1 0 110 102 320 0.953 36.25 5.19 Intr + 85499 85626 128 0 2 93 95 176 0.987 19.20 5.20 Intr + 85948 86161 214 0 1 118 84 395 0.999 40.39 5.21 Intr + 88090 88340 251 2 2 97 60 566 0.999 51.86 5.22 Term + 90239 90373 135 1 0 92 47 212 0.934 15.52 5.23 PlyA + 90947 90952 6 1.05 6.06 PlyA - 98389 98384 6 1.05 6.05 Term - 100532 99998 535 1 1 33 50 985 0.905 82.92 6.04 Intr - 103027 102871 157 2 1 37 80 283 0.805 21.47 6.03 Intr - 105670 105496 175 1 1 96 60 282 0.999 25.71 6.02 Intr - 107031 106838 194 1 2 88 100 328 0.681 33.21 6.01 Init - 110147 110069 79 2 1 74 55 115 0.608 6.03 6.00 Prom - 120142 120103 40 -5.26 7.00 Prom + 120830 120869 40 -4.96 7.01 Init + 124095 124259 165 2 0 97 75 108 0.294 8.14 7.02 Intr + 125866 125909 44 0 2 97 89 83 0.997 6.34 7.03 Intr + 126502 126653 152 1 2 80 70 108 0.999 8.01 7.04 Intr + 126799 126923 125 2 2 68 26 170 0.649 9.10 7.05 Intr + 127126 127294 169 0 1 7 73 247 0.400 14.62 7.06 Intr + 127689 127954 266 1 2 32 27 328 0.552 18.33 7.07 Intr + 128720 128872 153 1 0 118 110 138 0.993 19.27 7.08 Intr + 129041 129159 119 1 2 75 55 63 0.119 0.96 7.09 Intr + 150175 150224 50 1 2 87 105 50 0.454 4.92 7.10 Intr + 153899 154043 145 0 1 27 109 68 0.322 2.14 7.11 Intr + 158086 158229 144 1 0 57 90 38 0.068 0.30 7.12 Intr + 159740 159852 113 2 2 23 91 40 0.116 -2.18 7.13 Intr + 160022 160160 139 0 1 79 80 46 0.846 2.42 7.14 Intr + 161679 161885 207 0 0 105 106 63 0.946 7.99 7.15 Intr + 166173 166290 118 0 1 49 52 115 0.934 4.47 7.16 Intr + 167187 167311 125 2 2 72 103 79 0.998 7.18 7.17 Intr + 168878 168991 114 2 0 63 84 112 0.882 7.86 7.18 Intr + 182666 182854 189 2 0 51 69 200 0.983 13.10 7.19 Intr + 183429 183513 85 0 1 87 101 40 0.986 5.02 7.20 Intr + 184572 184661 90 2 0 80 53 105 0.755 6.39 7.21 Intr + 186578 186718 141 1 0 67 92 42 0.655 3.05 7.22 Intr + 189793 189945 153 0 0 37 83 135 0.607 8.17 7.23 Intr + 192973 193074 102 0 0 69 109 102 0.913 10.77 7.24 Intr + 193674 193790 117 2 0 111 105 63 0.997 10.96 7.25 Intr + 194890 194978 89 0 2 61 84 110 0.990 6.67 7.26 Intr + 196815 196920 106 0 1 66 78 110 0.990 8.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_1|925_aa MGASPAPGLEAPGTLGMGQTLTARGLEPVSELIWAHDLSRHLAKSRPLDQGQTQAQALPE VLGRNPLCPWALEEVWEGFGVEDGKEQLGQSAPAAPGEGERERAEQTERENASSPAGREA PELAHGEQAPGAGHDDRHRPRRDRPVKPRDPPLGEPHEGRRVMESTPSFLKGTPTWEKTA PENGIVRQEPGSPPRDGLHHGPLCLGEPAPFWRGVLSTPDSWLPPGFPQGPKDMLPLVEG EGPQNGERKVNWLGSKEGLRWKEAMLTHPLAFCGPACPPRCGPLMPEHSGGHLKSDPVAF RPWHCPFLLETKILERAPFWVPTCLPPYLVSGLPPEHPCDWPLTPHPWVYSGGQPKVPSA FSLGSKGFYYKDPSIPRLAKEPLAAAEPGLFGLNSGGHLQRAGEAERPSLHQRDGEMGAG RQQNPCPLFLGQPDTVPWTSWPACPPGLVHTLGNVWAGPGDGNLGYQLGPPATPRCPSPE PPVTQRGCCSSYPPTKGGGLGPCGKCQEGLEGGASGASEPSEEVNKASGPRACPPSHHTK LKKTWLTRHSEQFECPRGCPEVEERPVARLRALKRAGSPEVQGAMGSPAPKRPPDPFPGT AEQGAGGWQEVRDTSIGNKDVDSGQHDEQKGLCPGWEFVGPFAFLGYCTCGLCRNSLALL TFRRPPSTLGLSPHVTQPVVSLSPQWSYPKSTLAGSCTCVRTIAPAAQSLGLNTPDNLFA RLPVHVTASSDSLLSLASPLGGELQQEEDTATNSSSEEGPGSGPDSRLSTGLAKHLLSGL GDRLCRLLRREREALAWAQREGQGPAVTEDSPGIPRCCSRCHHGLFNTHWRCPRCSHRLC VACGRVAGTGRAREKAGFQEQSAEECTQEAGHAACSLMLTQFVSSQALAELSTAMHQVWV KFDIRGHCPCQADARVWAPGDAGQQ >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_1|2775_bp atgggcgcctctccagcccctggcctggaagcaccaggaaccctggggatggggcagacc ctcacagcccggggtctggagccggtgtcggagctcatctgggcccatgacctctccaga catttggcaaaatcaaggcccttagaccagggacagacccaagcccaggccctcccagag gtcctaggacgcaaccctttgtgcccttgggctctggaagaggtttgggaagggtttggg gtggaagatggcaaagagcagcttggccagagcgcccccgccgccccgggggaaggagag cgcgagcgcgctgagcagacagagcgggagaacgcgtcctcgcccgccggccgggaggcc ccggagctggcccatggggagcaggcgcccggtgccggccacgacgaccgccaccgcccg cgccgcgaccggccggtgaagcccagggacccccctctgggagagccccatgagggcagg agagtgatggagagtacgcccagcttcctgaagggcaccccaacctgggagaagacggcc ccagagaacggcatcgtgagacaggagcccggcagcccgcctcgagatggactgcaccat gggccgctgtgcctgggagagcctgctcccttttggaggggcgtcctgagcaccccagac tcctggcttccccctggcttcccccagggccccaaggacatgctcccacttgtggagggc gagggcccccagaatggggagaggaaggtcaactggctgggcagcaaagagggactgcgc tggaaggaggccatgcttacccatccgctggcattctgcgggccagcgtgcccacctcgc tgtggccccctgatgcctgagcatagtggtggccatctcaagagtgaccctgtggccttc cggccctggcactgccctttccttctggagaccaagatcctggagcgagctcccttctgg gtgcccacctgcttgccaccctacctagtgtctggcctgcccccagagcatccatgtgac tggcccctgaccccgcacccctgggtatactccgggggccagcccaaagtgccctctgcc ttcagcttaggcagcaagggcttttactacaaggatccgagcattcccaggttggcaaag gagcccttggcagctgcggaacctgggttgtttggcttaaactctggtgggcacctgcag agagccggggaggccgaacgcccttcactgcaccagagggatggagagatgggagctggc cggcagcagaatccttgcccgctcttcctggggcagccagacactgtgccctggacctcc tggcccgcttgtcccccaggccttgttcatactcttggcaacgtctgggctgggccaggc gatgggaaccttgggtaccagctggggccaccagcaacaccaaggtgcccctctcctgag ccgcctgtcacccagcggggctgctgttcatcctacccacccactaaaggtgggggtctt ggcccttgtgggaagtgccaggagggcctggaggggggtgccagtggagccagcgaaccc agcgaggaagtgaacaaggcctctggccccagggcctgtccccccagccaccacaccaag ctgaagaagacatggctcacacggcactcggagcagtttgaatgtccacgcggctgccct gaggtcgaggagaggccggttgctcggctccgggccctcaaaagggcaggcagccccgag gtccagggagcaatgggcagtccagcccccaagcggccaccggacccttttccaggcact gcagaacagggggctgggggttggcaggaggtgcgggacacatcgatagggaacaaggat gtggactcgggacagcatgatgagcagaaaggcctctgccctggctgggaatttgttggc cccttcgccttcttgggttactgcacctgtggcctgtgtcgaaactctctggcccttctc accttccggcgccctccgtccacgttgggattgtctcctcatgtcacgcagccagtggta tcactaagccctcagtggagctatccaaagtctacactggctggatcctgcacttgtgtc aggacgatcgccccagctgcccaaagccttgggctcaacacaccagacaacctcttcgct cggctgccggtccatgtcacggcatcctcagactccctgctcagcctggcatcgcctctg ggaggggagctgcagcaggaggaagacacagccaccaactccagctctgaggaaggccca gggtccggccctgacagccggctcagcacaggcctcgccaagcacctgctcagtggtttg ggggaccgactgtgccgcctgctgcggagggagcgggaggccctggcttgggcccagcgg gaaggccaagggccagccgtgacagaggacagcccaggcattccacgctgctgcagccgt tgccaccatggactcttcaacacccactggcgatgtccccgctgcagccaccggctgtgt gtggcctgtggtcgtgtggcaggcactgggcgggccagggagaaagcaggctttcaggag cagtccgcggaggagtgcacgcaggaggccgggcacgctgcctgttccctgatgctgacc cagtttgtctccagccaggctttggcagagctgagcactgcaatgcaccaggtctgggtc aagtttgatatccgggggcactgcccctgccaagctgatgcccgggtatgggcccccggg gatgcaggccagcag >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_2|362_aa MVSWMICRLVVLVFGMLCPAYASYKAVKTKNIREYVSVGVWGQGCVAHPEWSGEGERMGP VQEGVDRGQAIGVPVEDLVRWMMYWIVFALFMAAEIVTDIFISWFPFYYEIKMAFVLWLL SPYTKGASLLYRKFVHPSLSRHEKEIDAYIVQAKERSYETVLSFGKRGLNIAASAAVQAA TKVLWAPSPPAAPIPPLRPFGLIQLLCQGAQRWQPGSRGEKVTGGYWQKRGAGVRPSFLA LALSSQGALAGRLRSFSMQDLRSISDAPAPAYHDPLYLEDQVSHRRPPIGYRAGGLQDSD TEDECWSDTEAVPRAPARPREKPLIRSQSLRVVKRKPPVREGTSRSLKVRTRKKTVPSDV DS >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_2|1089_bp atggtgtcctggatgatctgtcgcctggtggtgctggtgtttgggatgctgtgtccagct tatgcttcctataaggctgtgaagaccaagaacattcgtgaatatgtgagcgtgggggtt tgggggcaaggctgtgtggcacatcctgagtggagtggggagggagagagaatgggccct gttcaagagggtgtagatagagggcaggccatcggggtgcctgtggaagatctggtgcgg tggatgatgtactggattgtttttgcactcttcatggcagcagagatcgttacagacatt tttatctcctggttccctttctactatgagatcaagatggccttcgtgctgtggctgctc tcaccctacaccaagggcgccagcctgctttaccgcaagtttgtccacccgtccctgtcc cgccatgagaaggagatcgacgcgtacatcgtgcaggccaaggagcgcagctacgagacc gtgctcagcttcgggaagcggggcctcaacattgccgcctccgctgctgtgcaggctgcc accaaggtgctctgggcccccagccctccagcagcccccatcccacccctaaggcccttt gggctcattcagctcctctgccagggagcccagagatggcagccggggagcaggggtgag aaggtgacgggggggtattggcagaagcgtggagctggagtcagaccttccttcctggct ctggcactgagcagtcagggggcgctggccggcaggctgcggagcttctccatgcaggac ctgcgctccatctctgacgcacctgcccctgcctaccatgaccccctctacctggaggac caggtgtcccaccggaggccacccattgggtaccgggccgggggcctgcaggacagcgac accgaggatgagtgttggtcagatactgaggcagtcccccgggcgccagcccggccccga gagaagcccctaatccgcagccagagcctgcgtgtggtcaagaggaagccaccggtgcgg gagggcacctcgcgctccctgaaggttcggacgaggaaaaagactgtgccctcagacgtg gacagctag >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_3|548_aa MAGLRARGGPGPGLLALSALGFCLMLQVSAKRPPKTPPCPPSCSCTRDTAFCVDSKAVPR NLPSEVISLTLVNAAFSEIQDGAFSHLPLLQFLLLNSNKFTLIGDNAFTGLSHLQYLFIE NNDIWALSKFTFRGLKSLTHLSLANNNLQTLPRDIFRPLDILNDLDLRGNSLNCDCKVKW LVEWLAHTNTTVAPIYCASPPRFQEHKVQDLPLREFDCITTDFVLYQTLAFPAVSAEPFL YSSDLYLALAQPGVSACTILKWDYVERQLRDYDRIPAPSAVHCKPMVVDSQLYVVVAQLF GGSYIYHWDPNTTRFTRLQDIDPQRVRKPNDLEAFRIDGDWYFAVADSSKAGATSLYRWH QNGFYSHQALHPWHRDTDLEFVDGEGKPRLIVSSSSQAPVIYQWSRTQKQFVAQGEVTQV PDAQAVKHFRAGRDSYLCLSRYIGDSKILRWEGTRFSEVQALPSRGSLALQPFLVGGRRY LALGSDFSFTQIYQWDEGRQKFVRFQELAVQAPRAFCYMPAGDAQLLLAPSFKGQTLVYR HIVVDLSA >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_3|1647_bp atggcggggctgcgggccagggggggcccggggccggggctgctggcgctctccgcgctc ggcttctgcctcatgctgcaagtcagcgctaagaggccccccaagacgcccccctgcccg cccagctgctcttgcaccagggacaccgccttctgcgtggactcaaaggcggtgcccagg aacctgccctcggaggtcatctccctgaccctggtgaatgccgccttctcagagatccag gatggagcgttctcccacctcccgctgctgcagttcttgttactcaactccaacaagttt acactgattggagacaacgccttcacaggactgtcgcacctgcagtatctcttcattgag aacaatgacatctgggcactatccaagttcaccttccgaggactcaagtccttgactcac ctctcgctggccaacaataacctgcagacactgcccagagacatcttccggcccctggac atcctgaatgacttggacctgcggggcaactcactcaactgtgactgcaaggtgaagtgg ttggtggagtggctggcacacaccaacaccacggtggcacccatctactgcgccagcccg ccccgcttccaggagcacaaggtgcaggacctgccgctgcgggagttcgattgcatcacc acagattttgtgttgtaccagaccctggccttcccagcagtgtcggctgagcccttcctc tactccagtgacctctatttggctctggcccagccaggagtcagtgcctgcaccatcctg aagtgggactatgttgagcggcagcttcgagactatgatagaatcccagccccctctgca gtgcactgcaagccgatggtggtggacagccagctgtacgtggtcgtggcccagctgttt ggcggctcttacatttaccactgggatcccaacaccacgcgcttcaccaggctgcaagac attgacccgcagcgcgtgcgcaagcctaacgacctagaagccttccgcatcgacggtgac tggtactttgccgtggctgacagctccaaggcaggcgccaccagcctctaccgctggcac cagaatggcttctactcccaccaggcactgcacccctggcaccgtgacaccgacctggag tttgtggacggcgagggcaagccacggctgattgtgtccagcagctcccaggcacccgtc atctatcagtggagtcgcacccagaagcagtttgtggcccagggtgaggtgacccaggtg cctgatgcccaagctgtgaaacactttcgtgccggccgcgacagctacctgtgcctcagc cgctacattggcgactccaagatcctgcgctgggagggtacccgcttctcggaggtgcag gccctgccctcccggggctcgctggccctgcagcccttccttgtgggtggccgccgctac ctggcactgggcagtgatttctccttcacccagatctaccagtgggatgagggacgacag aagtttgtacggttccaggagctggctgtgcaggctcctcgggccttctgctacatgcct gctggggacgcccagctactcctggcccccagcttcaagggacagacgctggtgtataga cacattgtggtggatctcagtgcctag >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_4|197_aa MDVGSKEVLMESPPDYSAAPRGRFGIPCCPVHLKRLLIVVVVVVLIVVVIVGALLMGLHM SQKHTEMVLEMSIGAPEAQQRLALSEHLVTTATFSIGSTGLVVYDYQQLLIAYKPAPGTC CYIMKIAPESIPSLEALTRKVHNFQMECSLQAKPAVPTSKLGQAEGRDAGSAPSGGDPAF LGMAVSTLCGEVPLYYI >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_4|594_bp atggatgtgggcagcaaagaggtcctgatggagagcccgccggactactccgcagctccc cggggccgatttggcattccctgctgcccagtgcacctgaaacgccttcttatcgtggtg gtggtggtggtcctcatcgtcgtggtgattgtgggagccctgctcatgggtctccacatg agccagaaacacacggagatggttctggagatgagcattggggcgccggaagcccagcaa cgcctggccctgagtgagcacctggttaccactgccaccttctccatcggctccactggc ctcgtggtgtatgactaccagcagctgctgatcgcctacaagccagcccctggcacctgc tgctacatcatgaagatagctccagagagcatccccagtcttgaggctctcactagaaaa gtccacaacttccagatggaatgctctctgcaggccaagcccgcagtgcctacgtctaag ctgggccaggcagaggggcgagatgcaggctcagcaccctccggaggggacccggccttc ctgggcatggccgtgagcaccctgtgtggcgaggtgccgctctactacatctag >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_5|1048_aa MPGVARLPLLLGLLLLPRPGRPLDLADYTYDLAEEDDSEPLNYKDPCKAAAFLGDIALDE EDLRAFQVQQAVDLRRHTARKSSIKAAGNTSTPSCQSTNGQPQRGACGRWRGRSRSRRAA TSRPERVWPDGVIPFVIGGNFTGSQRAVFRQAMRHWEKHTCVTFLERTDEDSYIVFTYRP CGCCSYVGRRGGGPQAISIGKNCDKFGIVVHELGHVVGFWHEHTRPDRDRHVSIVRENIQ PGQEYNFLKMEPQEVESLGETYDFDSIMHYARNTFSRGIFLDTIVPKYEVNGVKPPIGQR TRLSKGDIAQARKLYKCPACGETLQDSTGNFSSPEYPNGYSAHMHCVWRISVTPGEKVPL SLLYSQENGGFKRLSKSTQVYTDRAKAASKDEGKRALVLSTAAEQNCLAGPGRPRGGVVE GIILNFTSLDLYRSRLCWYDYVEVRDGFWRKAPLRGRFCGSKLPEPIVSTDSRLWVEFRS SSNWVGKGFFAVYEAICGGDVKKDYGHIQSPNYPDDYRPSKVCIWRIQVSEGFHVGLTFQ SFEIERHDSCAYDYLEVRDGHSESSTLIGRYCGYEKPDDIKSTSSRLWLKFVSDGSINKA GFAVNFFKEVDECSRPNRGGCEQRCLNTLGSYKCSCDPGYELAPDKRRCEAACGGFLTKL NGSITSPGWPKEYPPNKNCIWQLVAPTQYRISLQFDFFETEGNDVCKYDFVEVRSGLTAD SKLHGKFCGSEKPEVITSQYNNMRVEFKSDNTVSKKGFKAHFFSDKDECSKDNGGCQQDC VNTFGSYECQCRSGFVLHDNKHDCKEAGCDHKVTSTSGTITSPNWPDKYPSKKECTWAIS STPGHRVKLTFMEMDIESQPECAYDHLEVFDGRDAKAPVLGRFCGSKKPEPVLATGSRMF LRFYSDNSVQRKGFQASHATECGGQVRADVKTKDLYSHAQFGDNNYPGGVDCEWVIVAEE GYGVELVFQTFEVEEETDCGYDYMELFDGYDSTAPRLGRYCGSGPPEEVYSAGDSVLVKF HSDDTITKKGFHLRYTSTKFQDTLHSRK >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_5|3147_bp atgcccggcgtggcccgcctgccgctgctgctcgggctgctgctgctcccgcgtcccggc cggccgctggacttggccgactacacctatgacctggcggaggaggacgactcggagccc ctcaactacaaagacccctgcaaggcggctgcctttcttggggacattgccctggacgaa gaggacctgagggccttccaggtacagcaggctgtggatctcagacggcacacagctcgt aagtcctccatcaaagctgcaggaaacacttctacccccagctgccagagcaccaacggg cagcctcagaggggagcctgtgggagatggagaggtagatcccgtagccggcgggcggcg acgtcccgaccagagcgtgtgtggcccgatggggtcatcccctttgtcattgggggaaac ttcactggtagccagagggcagtcttccggcaggccatgaggcactgggagaagcacacc tgtgtcaccttcctggagcgcactgacgaggacagctatattgtgttcacctatcgacct tgcgggtgctgctcctacgtgggtcgccgcggcgggggcccccaggccatctccatcggc aagaactgtgacaagttcggcattgtggtccacgagctgggccacgtcgtcggcttctgg cacgaacacactcggccagaccgggaccgccacgtttccatcgttcgtgagaacatccag ccagggcaggagtataacttcctgaagatggagcctcaggaggtggagtccctgggggag acctatgacttcgacagcatcatgcattacgctcggaacacattctccaggggcatcttc ctggataccattgtccccaagtatgaggtgaacggggtgaaacctcccattggccaaagg acacggctcagcaagggggacattgcccaagcccgcaagctttacaagtgcccagcctgt ggagagaccctgcaagacagcacaggcaacttctcctcccctgaataccccaatggctac tctgctcacatgcactgcgtgtggcgcatctctgtcacacccggggagaaggtaccatta tccctactgtacagccaagagaatggaggcttcaagcgcttgagtaaatctacccaagtt tacacagacagagccaaggctgcttctaaggatgaaggaaagcgggcgcttgtcctcagc acagctgctgagcagaactgcctggcaggcccaggaaggccacgtggaggagtggtggag ggtatcatcctgaacttcacgtccctggacctgtaccgcagccgcctgtgctggtacgac tatgtggaggtccgagatggcttctggaggaaggcgcccctccgaggccgcttctgcggg tccaaactccctgagcctatcgtctccactgacagccgcctctgggttgaattccgcagc agcagcaattgggttggaaagggcttctttgcagtctacgaagccatctgcgggggtgat gtgaaaaaggactatggccacattcaatcgcccaactacccagacgattaccggcccagc aaagtctgcatctggcggatccaggtgtctgagggcttccacgtgggcctcacattccag tcctttgagattgagcgccacgacagctgtgcctacgactatctggaggtgcgcgacggg cacagtgagagcagcaccctcatcgggcgctactgtggctatgagaagcctgatgacatc aagagcacgtccagccgcctctggctcaagttcgtctctgacgggtccattaacaaagcg ggctttgccgtcaactttttcaaagaggtggacgagtgctctcggcccaaccgcgggggc tgtgagcagcggtgcctcaacaccctgggcagctacaagtgcagctgtgaccccgggtac gagctggccccagacaagcgccgctgtgaggctgcttgtggcggattcctcaccaagctc aacggctccatcaccagcccgggctggcccaaggagtacccccccaacaagaactgcatc tggcagctggtggcccccacccagtaccgcatctccctgcagtttgacttctttgagaca gagggcaatgatgtgtgcaagtacgacttcgtggaggtgcgcagtggactcacagctgac tccaagctgcatggcaagttctgtggttctgagaagcccgaggtcatcacctcccagtac aacaacatgcgcgtggagttcaagtccgacaacaccgtgtccaaaaagggcttcaaggcc cacttcttctcagacaaggacgagtgctccaaggataacggcggctgccagcaggactgc gtcaacacgttcggcagttatgagtgccaatgccgcagtggcttcgtcctccatgacaac aagcacgactgcaaagaagccggctgtgaccacaaggtgacatccaccagtggtaccatc accagccccaactggcctgacaagtatcccagcaagaaggagtgcacgtgggccatctcc agcacccccgggcaccgggtcaagctgaccttcatggagatggacatcgagtcccagcct gagtgtgcctacgaccacctagaggtgttcgacgggcgagacgccaaggcccccgtcctc ggccgcttctgtgggagcaagaagcccgagcccgtcctggccacaggcagccgcatgttc ctgcgcttctactcagataactcggtccagcgaaagggcttccaggcctcccacgccaca gagtgcgggggccaggtacgggcagacgtgaagaccaaggacctttactcccacgcccag tttggcgacaacaactaccctgggggtgtggactgtgagtgggtcattgtggccgaggaa ggctacggcgtggagctcgtgttccagacctttgaggtggaggaggagaccgactgcggc tatgactacatggagctcttcgacggctacgacagcacagcccccaggctggggcgctac tgtggctcagggcctcctgaggaggtgtactcggcgggagattctgtcctggtgaagttc cactcggatgacaccatcaccaaaaaaggtttccacctgcgatacaccagcaccaagttc caggacacactccacagcaggaagtga >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_6|379_aa MGKGHRCARRVCVRARELAGGLLAQIVPTGDNPDGSMELLSTPHSIEINNITCDSFRISW AMEDSDLERVTHYFIDLNKKENKNSNKFKHRDVPTKLVAKAVPLPMTVRGHWFLSPRTEY SVAVQTAVKQSDGEYLVSGWSETVEFCTGAGPRLTYRIWDLADYAKEHLAQLQEKAEQIA GRMLRFSVFYRNHHKEYFQHARTHCGNMLQPYLKDNSGSHGSPTSGMLHGVFFSCNTEFN TGQPPQDSPYGRWRFQIPAQRLFNPSTNLYFADFYCMYTAYHYAILVLAPKGSLGDRFCR DRLPLLDIACNKFLTCSVEDGELVFRHAQDLILEIIYTEPVDLSLGTLGEISGHQLMSLS TADAKKDPSCKTCNISVGR >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_6|1140_bp atggggaaggggcaccggtgtgcgcggcgtgtgtgtgtgcgcgctcgcgagttggctgga ggcttgctggcgcagatagtccccacaggagacaaccctgacgggagcatggagctgctg tccacgccccacagcattgagatcaacaacatcacctgcgactccttccgcatctcctgg gccatggaggacagtgacctggagagggtcacccattacttcattgaccttaacaagaag gagaataagaattccaacaagttcaagcaccgggacgtccccaccaagctcgtggccaag gcagtgccgctgcccatgacggtgagaggccactggttcctgagcccccgcacggagtac agtgtggccgtgcagacggcagtgaagcagagcgatggggagtacctggtgtccggctgg agcgagacggtggagttctgcactggggctggccctcggctcacctaccgcatctgggat cttgcagattatgccaaggagcacctggctcagcttcaggagaaagctgagcagatcgca ggccgcatgctccgcttctccgtcttctaccgcaaccatcacaaggagtacttccagcat gccaggacccactgcgggaacatgctgcagccttacctgaaggacaacagtggcagccac ggctcccccaccagcggtatgctccacggggtcttcttcagctgcaacacggagttcaac acgggccagcccccgcaggactccccctacggccgctggcgcttccagatcccagctcag cgcctcttcaatcccagcaccaacctctactttgcggacttctactgcatgtacacggcc taccactacgccatcctggtgctggcgcccaaaggctccctgggggaccgcttctgccgc gaccgcctgcccctcctggacattgcttgcaacaagttcctgacctgcagcgtggaggat ggggagctggtcttccgccacgcccaggacctcatcctggagatcatctacactgagccc gtcgacctgtccctgggcaccctgggggagatcagtgggcaccagctcatgagtctgtct actgccgatgccaagaaggaccccagctgcaagacctgcaacatcagcgtgggccgctag >gi568815590r:22121356_22328357|GENSCAN_predicted_peptide_7|1140_aa MSEGNAAGEPSTPGGPRPLLTGARGLIGRRPAPPLTPGRLPSIRSRDLTLGGVKKKTFTP NIISRKIKEEPKEEVTVKKEKRERDRDRQREGHGRGRGRPEVIQSHSIFEQGPAEMMKKK GNWDKTVDVSDMGPSHIINIKKEKRETDEETKQILRMLEKDDFLDDPGLRNDTRNMPVQL PLAHSGWLFKEENDEPDVKPWLAGPKEEDMEVDIPAVKVKEEPRDEEEEAKMKAPPKAAR KTPGLPKDVSVAELLRELSLTKEEELLFLQLPDTLPGQPPTQDIKPIKTEVQGEDGQVVL IKQEKDREAKLAENACTLADLTEGQVGKLLIRKSGRVQLLLGKVTLDVTMGTACSFLQEL VSVGLGDSRTGEMTVLGHVKHKLVCSPDFESLLDHKHRSGDYDSMIKVLANLISGLVANG LIPGGLEERKLALEHVGPGAWVELGLPLRPRGAGRLGRGLAHSARLYGCQAVGHKLLNLW TQLWAGEHLQAEAMYLESQRNQAHRGGQHKGTLFFDFPQESVGLVSMFRGLGIETVSKTP LKREMLPSGRGILGRGLSANLVRKDREELSPTFWDPKVLAAGDSKMAETSVGWSRTLGRG SSDASLLPLGRAAGGISREVDKPPCTFSTPSRGPPQLSSPPALPQSPLHSPDRPLVLTVE HKENPNVECKSMRFGMLKDHQAVTGNVTAFDGSILYLPVKLQQVLELKSQRKTDSAEISI KIQMTKILEPCSDLCIPFYNVVFRRLQIWPGYAASIRRTDGGLFLLADVSHKVIRNDCVL DVMHAIYQQNKEHFQDECTKLLVGNIVITRYNNRTYRIDDVDWNKTPKDSFTMSDGKEIT FLEYYSKNYGITVKEEDQPLLIHRPSERQDNHGMLLKGEILLLPELSFMTGIPEKMKKDF RAMKDLAQQINLSPKQHHSALECLLQRIAKNEAATNELMRWGLRLQKDVHKRAMDQAREL VNMLEKIAGPIGMRMSPPAWVELKDDRIETYVRTIQSTLGAEGKIQMVVCIIMGPRDDLY GAIKKLCCVQSPVPSQVVNVRTIGQPTRLRSVAQKILLQINCKLGGELWGVDIPLKQLMV IGMDVYHDPSRGMRSVVGFVASINLTLTKWYSRVVFQMPHQEIVDSLKLCLVGSLKKFYE >gi568815590r:22121356_22328357|GENSCAN_predicted_CDS_7|3420_bp atgtcggaaggaaacgccgccggcgagcccagcacgccgggagggccccgacctctcctg actggggcccgggggctcatcgggcggcggccggcgcctcccctcacccccggccgcctt ccctccatccgttccagggacctcaccctcgggggagtcaagaagaaaaccttcacccca aatatcatcagtcggaagatcaaggaagagcccaaggaagaagtaactgtcaagaaggag aagcgtgaaagggacagagaccgacaacgagaggggcatggacgagggcgaggccgtcca gaagtgatccagtctcactccatctttgagcagggcccagctgaaatgatgaagaaaaaa gggaactgggataagacagtggatgtgtcagacatgggaccttctcatatcatcaacatc aaaaaagagaagagagagacagacgaagaaactaaacagatcttgcgtatgctggagaag gacgatttcctcgatgaccccggcctgaggaacgacactcgaaatatgcctgtgcagctg ccgctggctcactcaggatggctttttaaggaagaaaatgacgaaccagatgttaaacct tggctggctggccccaaggaagaggacatggaggtggacatacctgctgtgaaagtgaaa gaggagccacgagatgaggaggaagaggccaagatgaaggctcctcccaaagcagccagg aagactccaggcctcccgaaggatgtatctgtggcagagctgctgagggagctgagcctc accaaggaagaggaactgctgtttctgcagctgccagacaccctccctggccagccaccc acccaggacatcaagcctatcaagacagaggtgcagggcgaggacggacaggtggtgctc atcaagcaggagaaagaccgagaagccaaattggcagagaatgcttgtaccctggctgac ctgacagagggtcaggttggcaagctactcatccgcaagtctggaagggtgcaactcctc ttgggcaaggtgactctggacgtgaccatgggaactgcctgctccttcctgcaggagctg gtgtccgtgggccttggagacagtaggacaggggagatgacagtcctgggacacgtgaag cacaaacttgtatgttcccctgattttgaatccctcttggatcacaaacaccgctctgga gactatgattctatgatcaaggtgctggcaaatttgatttctggtcttgtggccaatggg ctcatccccggaggcctcgaagagcggaaattggccctggagcatgtgggccccggggcg tgggttgagctcggtcttcccctgaggccgcgcggagctgggcgactggggcgaggactc gcgcacagtgccaggctgtacggatgccaggctgttggccacaagcttctaaacctttgg acccagctctgggcaggggagcacctgcaggcagaggccatgtatttggaaagccagagg aaccaagcacacagagggggccagcacaaaggaactttattctttgactttccacaggag tctgtgggtttggtctccatgttccgaggcctgggcattgaaacagtttctaagacccct ctgaaacgggaaatgcttccatcaggtagaggcattttaggtcgaggcttgtctgctaat ctggtacgcaaggacagggaggaactctctcccactttttgggatccaaaagtgttggcg gctggggacagcaagatggcagagacctccgttggttggagtaggacgcttggaagaggg agttcagatgcgtctttattaccactgggaagagcagcaggtggtatcagcagagaagtg gacaagcctccctgtaccttcagcacaccgtcccggggtcccccgcagctgtcatcacca ccagctctgccccagtctcccctgcactctccagatcgccctctggtcctgactgtggaa cacaaggaaaaccccaatgtggagtgcaaaagcatgaggttcggcatgttgaaggaccat caagctgtcaccggcaacgtcactgcgtttgatggatctattctctatctgcctgttaag cttcaacaagttcttgagttaaaaagtcaaaggaaaacagacagtgctgaaatcagcatt aagattcagatgacaaagatcctggagccctgctctgacctgtgcattcccttctacaat gttgttttccgtcgattgcagatctggccaggctatgcagctagcatccgaaggacagat ggagggctcttcctgctagctgatgtctcccataaggtcattcggaatgactgtgtgctg gatgtcatgcatgccatttatcagcagaataaagaacacttccaggatgagtgtactaag cttctggttggcaatattgttatcacccgatataacaatcgtacctatcgtattgatgat gtggattggaataagactccaaaggatagcttcacgatgtctgatgggaaagagatcaca ttcttggaatactacagcaaaaattatgggatcacagttaaggaagaggaccagccattg ctgattcacaggcccagtgagagacaggataatcatgggatgctgctaaaaggggaaatc ctgctgctgcctgagctttcttttatgaccggaatcccagagaagatgaagaaggacttc agagccatgaaggatttggctcagcaaatcaatctgagccccaagcaacaccatagtgct ttggaatgcttgctgcaaagaattgcaaagaacgaggcagccaccaatgaactgatgcgt tgggggctccgtctgcaaaaggatgtacataagagagcaatggaccaggctcgagaactg gtcaacatgttggagaagatagccggccccattggcatgcgtatgagcccaccggcctgg gttgaactaaaggatgaccgaatagagacttatgtcagaaccattcaatccacgttagga gctgaggggaagatacagatggttgtttgcatcatcatgggcccacgtgatgatctctat ggggccatcaagaagctgtgctgtgtgcagtccccagtgccctcccaggttgtcaatgtt cgaaccattggtcagcccaccaggcttcggagtgtggcccagaagattttacttcagatt aactgtaaattgggtggtgagctctggggagtggatattcctctgaaacagttaatggtg atcgggatggatgtttaccatgaccccagtagaggcatgcgctccgtggttggcttcgtg gcaagcatcaatctcaccctcacaaaatggtattcccgggtggtgttccagatgccgcat caggagattgtggacagcctgaagctatgcctcgtgggctccttaaaaaagttttatgag