GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:56:42 Sequence gi568815579f:19116954_19349908 : 232955 bp : 53.15% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.23 PlyA - 238 233 6 1.05 1.22 Term - 3230 2977 254 0 2 99 37 224 0.803 14.53 1.21 Intr - 3908 3812 97 2 1 110 72 208 0.977 21.48 1.20 Intr - 4213 4039 175 0 1 72 51 308 0.999 25.96 1.19 Intr - 4468 4355 114 0 0 111 38 108 0.713 8.17 1.18 Intr - 4715 4551 165 1 0 77 31 193 0.976 12.09 1.17 Intr - 4866 4806 61 1 1 113 69 35 0.589 2.38 1.16 Intr - 13337 13203 135 0 0 25 119 256 0.514 23.45 1.15 Intr - 15555 15364 192 1 0 4 43 283 0.890 15.38 1.14 Intr - 15801 15704 98 2 2 97 25 103 0.999 5.05 1.13 Intr - 16257 16177 81 2 0 78 101 51 0.959 4.65 1.12 Intr - 17934 17831 104 0 2 98 99 82 0.900 9.77 1.11 Intr - 21597 21504 94 2 1 28 92 61 0.082 0.97 1.10 Intr - 29069 28848 222 1 0 125 74 201 0.177 20.17 1.09 Intr - 29431 29320 112 2 1 66 111 23 0.553 2.34 1.08 Intr - 29695 29602 94 1 1 106 88 26 0.950 4.44 1.07 Intr - 29922 29789 134 1 2 131 75 18 0.917 5.67 1.06 Intr - 30230 30083 148 2 1 127 99 49 0.995 10.22 1.05 Intr - 30879 30745 135 0 0 103 92 106 0.999 13.77 1.04 Intr - 32476 32273 204 1 0 70 65 553 0.893 51.12 1.03 Intr - 33811 33729 83 2 2 143 109 20 0.260 9.45 1.02 Intr - 44263 44038 226 1 1 46 80 128 0.095 5.89 1.01 Init - 53458 53252 207 1 0 56 96 75 0.007 3.98 1.00 Prom - 58424 58385 40 -3.01 2.06 PlyA - 58431 58426 6 1.05 2.05 Term - 61300 61180 121 0 1 53 39 124 0.723 2.25 2.04 Intr - 65730 65620 111 2 0 95 70 273 0.775 26.20 2.03 Intr - 69145 69081 65 1 2 90 99 90 0.999 8.31 2.02 Intr - 70052 69940 113 0 2 43 105 243 0.999 22.10 2.01 Init - 72361 72334 28 1 1 75 66 42 0.304 -0.91 2.00 Prom - 72499 72460 40 -5.41 3.00 Prom + 73858 73897 40 -6.10 3.01 Init + 76994 77180 187 1 1 68 53 136 0.900 7.32 3.02 Intr + 80013 80093 81 1 0 82 74 99 0.654 8.01 3.03 Intr + 80233 80298 66 2 0 65 100 86 0.985 6.87 3.04 Intr + 80568 80668 101 1 2 81 65 176 0.999 14.93 3.05 Intr + 81154 81279 126 0 0 139 68 282 0.999 32.68 3.06 Intr + 81704 81770 67 1 1 66 92 44 0.999 1.57 3.07 Intr + 82201 82281 81 2 0 114 82 140 0.993 16.11 3.08 Term + 84696 84766 71 1 2 79 49 148 0.992 8.40 3.09 PlyA + 84894 84899 6 1.05 4.07 PlyA - 84927 84922 6 -9.56 4.06 Term - 85072 84972 101 2 2 92 53 193 0.999 14.79 4.05 Intr - 85622 85517 106 2 1 104 74 84 0.980 8.89 4.04 Intr - 85928 85838 91 1 1 104 94 70 0.997 9.70 4.03 Intr - 86296 86228 69 0 0 89 94 12 0.033 0.69 4.02 Intr - 90230 90160 71 2 2 66 72 -11 0.013 -6.43 4.01 Init - 90556 90389 168 1 0 87 41 117 0.408 6.41 4.00 Prom - 90799 90760 40 -1.51 5.00 Prom + 93537 93576 40 -6.01 5.01 Init + 100001 100073 73 1 1 81 96 140 0.993 13.29 5.02 Intr + 101962 102363 402 2 0 92 82 480 0.940 42.77 5.03 Intr + 107068 107242 175 2 1 109 85 249 0.999 26.21 5.04 Intr + 107353 107480 128 1 2 53 98 100 0.993 8.33 5.05 Intr + 108024 108317 294 1 0 153 92 555 0.950 59.93 5.06 Intr + 109533 110120 588 1 0 133 105 365 0.999 35.91 5.07 Intr + 110328 111686 1359 1 0 98 96 508 0.379 41.92 5.08 Intr + 116836 116952 117 2 0 92 97 129 0.994 15.14 5.09 Intr + 118030 118143 114 2 0 103 113 73 0.998 12.22 5.10 Intr + 121300 121494 195 2 0 94 47 199 0.687 16.31 5.11 Intr + 123650 123732 83 0 2 66 105 136 0.999 12.95 5.12 Intr + 128360 128504 145 1 1 95 85 213 0.999 22.07 5.13 Intr + 131771 131929 159 0 0 84 111 195 0.866 21.87 5.14 Term + 132813 132958 146 1 2 103 49 149 0.999 10.88 5.15 PlyA + 135259 135264 6 1.05 6.06 PlyA - 137823 137818 6 1.05 6.05 Term - 141255 140864 392 1 2 127 44 378 0.996 32.82 6.04 Intr - 141902 141570 333 0 0 71 73 625 0.963 55.40 6.03 Intr - 144222 143860 363 1 0 101 53 618 0.974 55.32 6.02 Intr - 144610 144493 118 1 1 93 92 98 0.965 11.24 6.01 Init - 145779 145777 3 0 0 83 115 0 0.930 2.13 6.00 Prom - 145939 145900 40 -11.63 7.24 PlyA - 147180 147175 6 1.05 7.23 Term - 147920 147711 210 2 0 90 54 227 0.999 17.12 7.22 Intr - 149656 149537 120 1 0 114 94 91 0.976 13.49 7.21 Intr - 150760 150668 93 1 0 86 94 122 0.961 13.26 7.20 Intr - 151134 151033 102 0 0 89 105 52 0.996 7.87 7.19 Intr - 151801 151677 125 2 2 19 101 153 0.994 10.51 7.18 Intr - 152816 152734 83 1 2 84 87 106 0.955 9.88 7.17 Intr - 153323 153220 104 2 2 98 56 137 0.999 10.97 7.16 Intr - 153489 153392 98 1 2 85 65 198 0.999 17.43 7.15 Intr - 154172 154069 104 1 2 125 105 122 0.999 17.92 7.14 Intr - 156346 156168 179 1 2 34 35 361 0.837 24.63 7.13 Intr - 160123 159994 130 0 1 116 80 382 0.981 41.40 7.12 Intr - 160926 160781 146 0 2 87 89 263 0.999 25.89 7.11 Intr - 161843 161737 107 0 2 74 105 191 0.992 19.83 7.10 Intr - 162437 162260 178 2 1 53 63 393 0.979 33.31 7.09 Intr - 163338 163232 107 1 2 96 94 182 0.873 20.03 7.08 Intr - 180391 180036 356 0 2 122 80 385 0.773 36.59 7.07 Intr - 185423 185312 112 0 1 60 110 83 0.989 7.64 7.06 Intr - 186495 186395 101 2 2 108 100 47 0.988 8.15 7.05 Intr - 186959 186771 189 1 0 101 66 126 0.453 10.92 7.04 Intr - 189123 188844 280 1 1 53 75 228 0.351 15.08 7.03 Intr - 193247 193144 104 1 2 118 109 88 0.977 14.22 7.02 Intr - 199640 199469 172 0 1 120 91 219 0.907 24.92 7.01 Init - 199922 199811 112 2 1 83 86 -44 0.126 -6.07 7.00 Prom - 200524 200485 40 -6.10 8.00 Prom + 201965 202004 40 -5.61 8.01 Init + 203907 204182 276 2 0 109 89 580 0.739 55.18 8.02 Intr + 220217 220312 96 1 0 76 68 164 0.897 13.91 8.03 Intr + 221892 221986 95 2 2 83 78 93 0.993 7.06 8.04 Intr + 223893 223920 28 0 1 153 115 7 0.998 8.50 8.05 Intr + 224299 224454 156 0 0 106 89 282 0.996 30.82 8.06 Intr + 225582 225728 147 2 0 38 78 205 0.999 15.44 8.07 Intr + 225823 225913 91 0 1 118 89 221 0.956 25.27 8.08 Intr + 226884 226987 104 1 2 85 94 166 0.985 17.29 8.09 Intr + 227896 227973 78 0 0 79 105 63 0.851 7.34 8.10 Intr + 228047 228191 145 1 1 -13 81 64 0.365 -4.03 8.11 Intr + 228316 228416 101 2 2 56 90 169 0.674 14.23 8.12 Intr + 230327 230410 84 1 0 107 52 95 0.506 8.31 8.13 Intr + 231936 231985 50 2 2 103 99 38 0.776 4.37 8.14 Intr + 232202 232279 78 2 0 79 108 118 0.999 12.06 8.15 Intr + 232372 232483 112 1 1 79 96 161 0.999 16.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 29069 28844 226 1 1 125 49 210 0.823 17.38 S.002 Init - 86107 86070 38 1 2 98 110 -26 0.942 0.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_1|1044_aa MNSRVGATPKCTPIAASRVTPCMQISPSRPISSRGQDRPLPRHLRPRAPDLRAAAAGPCA QRPRAQAEQDGEVTPRPSERHQDPSTTNPPRICKAPYCEAPPPSSARWPLLPPDFLEEET EFQRSSATGESLPEPTVSPESLRADKDHSTQPGTMGRKKIQISRILDQRNRQVTFTKRKF GLMKKAYELSVLCDCEIALIIFNSANRLFQYASTDMDRVLLKYTEYSEPHESRTNTDILE TLKRRGIGLDGPELEPDEGPEEPGEKFRRLAGEGGDPALPRPRLYPAAPAMPSPDVVYGA LPPPGCDPSGLGEALPAQSRPSPFRPAAPKAGPPGLVHPLFSPSHLTSKTPPPLYLPTEG RRSDLPGGLAGPRGGLNTSRSLYSGLQNPCSTATPGPPLGSFPFLPGGPPEYGLGDPPPP PGLLQPPTLAPWQPSRGDGPPAVSSQPSGGRSLGEEGPPTRGASPPTPPVSIKSERLSPA PGGPGDFPKTFPYPLLLARSLAEPLRPGPALRRLPLADGWPRHTDWSDGMLVRPESGSNW SSFRRVPGALGRWAVLGVQLVVTLLTATLMHRLAPHCSFARWLLCNGSLFRYKHPSEEEL RALAGKPRPRGRKERWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVD FAVYSGGVYLFTEAYYYMLGPAKETNIAVFWCLLTVTFSMYPSSLLPASGAVTRLYFSAE EGGERSVCLTFAFLFLLLAMLVQVVREETLELGLEPGLASMTQNLEPLLKKQGWDWALPV AKLAIRVGLAVVGSVLGAFLTFPGLRLAQTHRDALTMSEDRPMLQLSGSLGGFLLHTSFL SPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSAFDSGRLWLLVVLCLLRLAVTRPHLQ AYLCLAKARVEQLRREAGRIEAREIQQRVVRVYCYVTVVSLQYLTPLILTLNCTLLLKTL GGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARIAGALGGLLTPLFLRGVLAYLI WWTAACQLLASLFGLYFHQHLAGS >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_1|3135_bp atgaactcccgggtcggggccacgcccaagtgcacgcccatcgccgcctccagggtgacg ccctgtatgcaaataagcccgtcccgcccaatcagctcccgggggcaggaccgcccgctc ccaaggcacctgcggccgcgggcccctgatcttcgtgcagccgccgcgggtccgtgcgcc cagcgtcccagggcccaggccgagcaggatggagaggtgaccccaaggccttctgagagg catcaggacccatcgaccacaaatcctccccgcatctgtaaagcaccttactgtgaagcc cctccccccagttcagcgaggtggcccctgctgccccctgactttctggaggaggaaact gagttccagagaagctcagcaacaggggagtcactgcccgagcccaccgtctcccctgag tccctgagggcagacaaagatcattccactcagcctgggacgatggggaggaaaaaaatc cagatctcccgcatcctggaccaaaggaatcggcaggtgacgttcaccaagcggaagttc gggctgatgaagaaggcctatgagctgagcgtgctctgtgactgtgagatagccctcatc atcttcaacagcgccaaccgcctcttccagtatgccagcacggacatggaccgtgtgctg ctgaagtacacagagtacagcgagccccacgagagccgcaccaacactgacatcctcgag acgctgaagcggaggggcattggcctcgatgggccagagctggagccggatgaagggcct gaggagccaggagagaagtttcggaggctggcaggcgaagggggtgatccggccttgccc cgaccccggctgtatcctgcagctcctgctatgcccagcccagatgtggtatacggggcc ttaccgccaccaggctgtgaccccagtgggcttggggaagcactgcccgcccagagccgc ccatctcccttccgaccagcagcccccaaagccgggcccccaggcctggtgcaccctctc ttctcaccaagccacctcaccagcaagacaccacccccactgtacctgccgacggaaggg cggaggtcagacctgcctggtggcctggctgggccccgagggggactaaacacctccaga agcctctacagtggcctgcagaacccctgctccactgcaactcccggacccccactgggg agcttccccttcctccccggaggccccccagaatatggcctgggagaccctccaccgccc cctggcttgttgcagccccccaccctggccccctggcagccctcgaggggtgatgggccc cccgccgtgtcctcccagcccagtgggggccgaagcctgggcgaggagggtcccccaacc cgcggcgcctccccgccgacccccccagtcagcatcaagtctgagcgcctctctccggcc cccgggggccccggcgactttcctaagaccttcccctatcccttgctcctcgcccggtcc ctggcagagcctctgcggcctgggcccgccctgcgccggctgcccttggccgacggctgg ccccgacacactgattggtcggacgggatgctcgtcagaccagagtcgggctctaattgg tcgagtttccgcagagtgcccggagccctaggccggtgggcggtcctcggagtacagctg gtggtgaccctgctcactgccaccctcatgcacaggctggcgccacactgctccttcgcg cgctggctgctctgtaacggcagtttgttccgatacaagcacccgtctgaggaggagctt cgggccctggcggggaagccgaggcccagaggcaggaaagagcggtgggccaatggcctt agtgaggagaagccactgtctgtgccccgagatgccccgttccagctggagacctgcccc ctcacgaccgtggatgccctggtcctgcgcttcttcctggagtaccagtggtttgtggac tttgctgtgtactcgggcggcgtgtacctcttcacagaggcctactactacatgctggga ccagccaaggagactaacattgctgtgttctggtgcctgctcacggtgaccttctccatg tatccttcctctctgctccctgccagtggggcggtgacacggctgtacttcagcgccgag gaggggggtgagcgctctgtctgcctcacctttgccttcctcttcctgctgctggccatg ctggtgcaagtggtgcgggaggagaccctcgagctgggcctggagcctggtctggccagc atgacccagaacttagagccacttctgaagaagcagggctgggactgggcgcttcctgtg gccaagctggctatccgcgtgggactggcagtggtgggctctgtgctgggtgccttcctc accttcccaggcctgcggctggcccagacccaccgggacgcactgaccatgtcggaggac agacccatgctgcagttaagtgggtcgcttggtgggttcctcctgcacaccagcttcctg tctcccctgttcatcctgtggctctggacaaagcccattgcacgggacttcctgcaccag ccgccgtttggggagacgcgtttctccctgctgtccgattctgccttcgactctgggcgc ctctggttgctggtggtgctgtgcctgctgcggctggcggtgacccggccccacctgcag gcctacctgtgcctggccaaggcccgggtggagcagctgcgaagggaggctggccgcatc gaagcccgtgaaatccagcagagggtggtccgagtctactgctatgtgaccgtggtgagc ttgcagtacctgacgccgctcatcctcaccctcaactgcacacttctgctcaagacgctg ggaggctattcctggggcctgggcccagctcctctactatcccccgacccatcctcagcc agcgctgcccccatcggctctggggaggacgaagtccagcagactgcagcgcggattgcc ggggctctgggtggcctgcttactcccctcttcctccgtggcgtcctggcctacctcatc tggtggacggctgcctgccagctgctcgccagccttttcggcctctacttccaccagcac ttggcaggctcctag >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_2|145_aa MAVPRCAWTVTDKFTESVYVLANEPSVALYRLQEHVRRSLPELAQHKADMQRWEEQSQGA IYTVEYACSAVKNLVDSSVYFRSVEGLLKQAISIRDHMNASAQGHRSPEVALLLLDSHWP PIDPWTNNCHQNNPDYAPIKSLALG >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_2|438_bp atggccgtgcctcgctgtgcctggaccgtcacggacaagttcactgagagcgtctacgtc ctggccaacgagccatccgtggccctgtaccggctgcaggagcatgtgcgtcgctccctc cccgagctggcccagcacaaggcagacatgcagcgttgggaggagcagagccagggagcc atctacactgtggagtacgcctgcagcgccgtgaagaacctggtggacagcagcgtctac ttccgcagcgtggagggtctgctcaaacaggccatcagcatccgggaccatatgaatgcc agtgcccagggccacagaagtccagaagtggctttgctgctcctggactctcactggccc cccatcgacccttggaccaataactgccatcaaaataatcctgattatgcacctatcaag tccctggccctgggctaa >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_3|259_aa MELTQPAEDLIQTQQTPASELGDPEDPGEEAADGSDTVVLSLFPCTPEPVNPEPDASVSS PQGSSLKHSTTLTNRQRGNEVSALPATLDSLSIHQLAAQGELDQLKEHLRKGDNLVNKPD ERGFTPLIWASAFGEIETVRFLLEWGADPHILAKERESALSLASTGGYTDIVGLLLERDV DINIYDWNGGTPLLYAVRGNHVKCVEALLARGADLTTEADSGYTPMDLAVALGYRKVQQV IENHILKLFQSNLVPADPE >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_3|780_bp atggagcttacccagcctgcagaagacctcatccagacccagcagacccctgcctcagaa cttggggaccctgaagaccccggagaggaggctgcagatggctcagacactgtggtcctc agtctctttccctgcacccctgagcctgtgaatcctgaaccggatgccagtgtttcctct ccacagggcagctccctgaagcactccaccactctcaccaaccggcagcgagggaacgag gtgtcagctctgccggccaccctagactccctgtccatccaccagctcgcagcacagggg gagctggaccagctgaaggagcatttgcggaaaggtgacaacctcgtcaacaagccagac gagcgcggcttcacccccctcatctgggcctccgcctttggagagattgagaccgttcgc ttcctgctggagtggggtgccgacccccacatcctggcaaaagagcgagagagcgccctg tcgctggccagcacaggcggctacacagacattgtggggctgctgctggagcgtgacgtg gacatcaacatctatgattggaatggagggacgccactgctgtacgctgtgcgcgggaac cacgtgaaatgcgttgaggccttgctggcccgaggcgctgacctcaccaccgaagccgac tctggctacaccccgatggaccttgccgtggccctgggataccggaaagtgcaacaggtg atcgagaaccacatcctcaagctcttccagagcaacctggtgcccgctgaccctgagtga >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_4|201_aa MHNHTQPYTTTLSHVQRHPAMHNYTQPYTTTPSHAQPHPAMHSHNGPHTITYTINLQQRA YVGPTLLTPNQVSSHRHTHRGLIEATFRTGVGEGVAVLVPSYRVSSVLNRNTRQFGKKHL FDQDEETCWNSDQGPSQWVTLEFPQLIRVSQLQIQFQGGFSSRRGCLEAAEVDRLKVTFE DATDFFGRVVIYHLRVLGEKV >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_4|606_bp atgcacaaccacacccagccatacacaaccacactcagccatgtacaacgacatccagcc atgcacaactacacccagccgtacacaaccacacccagccatgcacaaccacacccagcc atgcacagccacaatgggccacacacaatcacatacacaatcaacctgcagcaaagagcc tacgttggccccacattgctcacacccaaccaggtttctagtcacagacacacccacagg ggattgatcgaggcgacatttaggactggggtgggcgagggagtggcggttctcgtccca agttacagggtgagttcagtgctgaatcgcaacactcggcagtttggaaaaaaacatctt ttcgaccaggatgaggagacatgttggaactcagaccagggcccctcccagtgggtgacg ctggagtttccccagctcatccgtgtctcccagctgcagatccagtttcagggtggcttc tccagtcgccggggctgcctggaagctgctgaagtggaccggctgaaggtgacgtttgag gatgccactgacttttttggccgtgtggtcatctaccacctgcgggtgcttggggagaag gtgtga >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_5|1325_aa MGAPFVWALGLLMLQMLLFVAGEQGTQDITDASERGLHMQKLGSGSVQAALAELVALPCL FTLQPRPSAARDAPRIKWTKVRTASGQRQDLPILVAKDNVVRVAKSWQGRVSLPSYPRRR ANATLLLGPLRASDSGLYRCQVVRGIEDEQDLVPLEVTGVVFHYRSARDRYALTFAEAQE ACRLSSAIIAAPRHLQAAFEDGFDNCDAGWLSDRTVRYPITQSRPGCYGDRSSLPGVRSY GRRNPQELYDVYCFARELGGEVFYVGPARRLTLAGARAQCRRQGAALASVGQLHLAWHEG LDQCDPGWLADGSVRYPIQTPRRRCGGPAPGVRTVYRFANRTGFPSPAERFDAYCFRAHH PTSQHGDLETPSSGDEGEILSAEGPPVRELEPTLEEEEVVTPDFQEPLVSSGEEETLILE EKQESQQTLSPTPGDPMLASWPTGEVWLSTVAPSPSDMGAGTAASSHTEVAPTDPMPRRR GRFKGLNGRYFQQQEPEPGLQGGMEASAQPPTSEAAVNQMEPPLAMAVTEMLGSGQSRSP WADLTNEVDMPGAGSAGGKSSPEPWLWPPTMVPPSISGHSRAPVLELEKAEGPSARPATP DLFWSPLEATVSAPSPAPWEAFPVATSPDLPMMAMLRGPKEWMLPHPTPISTEANRVEAH GEATATAPPSPAAETKVYSLPLSLTPTGQGGEAMPTTPESPRADFRETGETSPAQVNKAE HSSSSPWPSVNRNVAVGFVPTETATEPTGLRGIPGSESGVFDTAESPTSGLQATVDEVQD PWPSVYSKGLDASSPSAPLGSPGVFLVPKVTPNLEPWVATDEGPTVNPMDSTVTPAPSDA SGIWEPGSQVFEEAESTTLSPQVALDTSIVTPLTTLEQGDKVGVPAMSTLGSSSSQPHPE PEDQVETQGTSGASVPPHQSSPLGKPAVPPGTPTAASVGESASVSSGEPTVPWDPSSTLL PVTLGIEDFELEVLAGSPGVESFWEEVASGEEPALPGTPMNAGAEEVHSDPCENNPCLHG GTCNANGTMYGCSCDQGFAGENCEIDIDDCLCSPCENGGTCIDEVNGFVCLCLPSYGGSF CEKDTEGCDRGWHKFQGHCYRYFAHRRAWEDAEKDCRRRSGHLTSVHSPEEHSFINSRGS GEGATCLEGFGHENTWIGLNDRIVERDFQWTDNTGLQFENWRENQPDNFFAGGEDCVVMV AHESGRWNDVPCNYNLPYVCKKGTVENASLIGARKAKYNVHATVRYQCNEGFAQHHVATI RCRSNGKWDRPQIVCTKPRRSHRMRRHHHHHQHHHQHHHHKSRKERRKHKKHPTEDWEKD EGNFC >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_5|3978_bp atgggggccccgtttgtctgggccttgggccttttgatgctgcagatgctgctctttgtg gctggggaacagggcacacaggatatcaccgatgccagcgaaagggggctccacatgcag aagctggggtctgggtcagtgcaggctgcgctggcggagctggtggccctgccctgtctc tttaccctgcagccacggccaagcgcagcccgagatgcccctcggataaagtggaccaag gtgcggactgcgtcgggccagcgacaggacttgcccatcctggtggccaaggacaatgtc gtgagggtggccaaaagctggcagggacgagtgtcactgccttcctacccccggcgccga gccaacgccacgctacttctggggccactgagggccagtgactctgggctgtaccgctgc caggtggtgaggggcatcgaggatgagcaggacctggtgcccttggaggtgacaggtgtt gtgttccactaccgatcagcccgggaccgctatgcactgaccttcgctgaggcccaggag gcctgccgtctcagctcagccatcattgcagcccctcggcatctacaggctgcctttgag gatggctttgacaactgtgatgctggctggctctctgaccgcactgttcggtatcctatc acccagtcccgtcctggttgctatggcgaccgtagcagccttccaggggttcggagctat gggaggcgcaacccacaggaactctacgatgtgtattgctttgcccgggagctggggggc gaggtcttctacgtgggcccggcccgccgcctgacactggccggcgcgcgtgcacagtgc cgccgccagggtgccgcgctggcctcggtgggacagctgcacctggcctggcatgagggc ctggaccagtgcgacccgggctggctggccgacggcagcgtgcgctacccgatccagacg ccgcgccggcgctgcgggggcccagccccgggcgtgcgcaccgtctaccgcttcgctaac cggaccggcttcccctcacccgccgagcgcttcgacgcctactgcttccgagctcatcac cccacgtcacaacatggagacctagagaccccatcctctggggatgagggggagattctg tcagcagaggggcccccagttagagaactggagcccaccctggaggaggaagaggtggtc acccctgacttccaggagcctctggtgtccagtggggaagaagaaaccctgattttggag gagaagcaggagtctcaacagaccctcagccctacccctggggaccccatgctggcctca tggcccactggggaagtgtggctaagcacggtggcccccagccctagcgacatgggggca ggcactgcagcaagttcacacacggaggtggccccaactgaccctatgcctaggagaagg gggcgcttcaaagggttgaatgggcgctacttccagcagcaggaaccggagccggggctg caaggggggatggaggccagcgcccagccccccacctcagaggctgcagtgaaccaaatg gagcctccgttggccatggcagtcacagagatgttgggcagtggccagagccggagcccc tgggctgatctgaccaatgaggtggatatgcctggagctggttctgctggtggcaagagc tccccagagccctggctgtggccccctaccatggtcccacccagcatctcaggccacagc agggcccctgtcctggagctagagaaagccgagggccccagtgccaggccagccacccca gacctgttttggtcccccttggaggccactgtctcagctcccagccctgccccctgggag gcattccctgtggccacctccccagatctccctatgatggccatgctgcgtggtcccaaa gagtggatgctaccacaccccacccccatctccaccgaggccaatagagttgaggcacat ggtgaggccaccgccacggctccaccctcccctgctgcagagaccaaggtgtattccctg cctctctctttgaccccaacaggacagggtggagaggccatgcccacaacacctgagtcc cccagggcagacttcagagaaactggggagaccagccctgctcaggtcaacaaagctgag cactccagctccagcccatggccttctgtaaacaggaatgtggctgtaggttttgtcccc actgagactgccactgagccaacgggcctcaggggtatcccggggtctgagtctggggtc ttcgacacagcagaaagccccacttctggcttgcaggccactgtagatgaggtgcaggac ccctggccctcagtgtacagcaaagggctggatgcaagttccccatctgcccccctgggg agccctggagtcttcttggtacccaaagtcaccccaaatttggagccttgggttgctaca gatgaaggacccactgtgaatcccatggattccacagtcacgccggcccccagtgatgct agtggaatttgggaacctggatcccaggtgtttgaagaagccgaaagcaccaccttgagc cctcaggtggccctggatacaagcattgtgacgcccctcacgaccctggagcagggggac aaggttggagttccagccatgtctacactgggctcctcaagctcccaaccccacccagag ccagaggatcaggtggagacccagggaacatcaggagcttcagtgcctccgcatcagagc agtcccctagggaaaccggctgttcctcctgggacaccgactgcagccagtgtgggcgag tctgcctcagtttcctcaggggagcctacggtaccgtgggacccctccagcaccctgctg cctgtcaccctgggcatagaggacttcgaactggaggtcctggcagggagcccgggtgta gagagcttctgggaggaggtggcaagtggagaggagccagccctgccagggacccctatg aatgcaggtgcggaggaggtgcactcagatccctgtgagaacaacccttgtcttcatgga gggacatgtaatgccaatggcaccatgtatggctgtagctgtgatcagggcttcgccggg gagaactgtgagattgacattgatgactgcctctgcagcccctgtgagaatggaggcacc tgtattgatgaggtcaatggctttgtctgcctttgcctccccagctatgggggcagcttt tgtgagaaagacaccgagggctgtgaccgcggctggcataagttccagggccactgttac cgctattttgcccaccggagggcatgggaagatgccgagaaggactgccgccgccgctcc ggccacctgaccagcgtccactcaccggaggaacacagcttcattaatagtaggggctct ggggagggggccacctgcctggagggctttgggcatgaaaacacgtggatcggcctgaac gacaggatcgtggagagagatttccagtggacggacaacaccgggctgcaatttgagaac tggcgagagaaccagccggacaatttcttcgcgggtggcgaggactgtgtggtgatggtg gcgcatgaaagcgggcgctggaacgatgtcccctgcaactacaacctaccctatgtctgc aagaagggcacagtggagaatgcctcactcatcggtgcccgcaaggccaagtacaatgtc catgccactgtaaggtaccagtgcaatgaaggatttgcccagcaccatgtggccaccatt cgatgccggagcaatggcaagtgggacaggccccaaattgtctgcaccaaacccagacgt tcacatcggatgcggcgacaccaccaccaccaccaacaccaccaccagcatcaccaccac aaatcccgcaaggagcgcagaaaacacaagaaacacccaacggaggactgggagaaggac gaagggaatttctgctga >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_6|402_aa MVCARAALGPGALWAAAWGVLLLTAPAGAQRGRKKVVHVLEGESGSVVVQTAPGQVVSHR GGTIVLPCRYHYEAAAHGHDGVRLKWTKVVDPLAFTDVFVALGPQHRAFGSYRGRAELQG DGPGDASLVLRNVTLQDYGRYECEVTNELEDDAGMVKLDLEGVVFPYHPRGGRYKLTFAE AQRACAEQDGILASAEQLHAAWRDGLDWCNAGWLRDGSVQYPVNRPREPCGGLGGTGSAG GGGDANGGLRNYGYRHNAEERYDAFCFTSNLPGRVFFLKPLRPVPFSGAARACAARGAAV AKVGQLFAAWKLQLLDRCTAGWLADGSARYPIVNPRARCGGRRPGVRSLGFPDATRRLFG VYCYRAPGAPDPAPGGWGWGWAGGGGWAGGARDPAAWTPLHV >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_6|1209_bp atggtgtgcgctcgggcggccctcggtcccggcgcgctctgggccgcggcctggggcgtc ctgctgctcacagcccctgcgggggcgcagcgtggccggaagaaggtcgtgcacgtgctg gagggtgagtcgggctcggtagtggtacagacagcgcctgggcaggtggtaagccaccgt ggtggcaccatcgtcttgccctgccgctaccactatgaggcagccgcccacggtcacgac ggcgtccggctcaagtggacaaaggtggtggacccgctggccttcaccgacgtcttcgtg gcactaggcccccagcaccgggcattcggcagctaccgtgggcgggctgagctgcagggc gacgggcctggggatgcctccctggtcctccgcaacgtcacgctgcaagactacgggcgc tatgagtgcgaagtcaccaatgagctggaagatgacgctggcatggtcaagctggacctg gaaggcgtggtctttccctaccacccccgtggaggccgatacaagctgaccttcgcggag gcgcagcgcgcgtgcgccgagcaggacggcatcctggcatctgcagaacagctgcacgcg gcctggcgcgacggcctggactggtgcaacgcgggctggttgcgcgacggctcagtgcaa taccccgtgaaccggccccgggagccctgcggcggcctgggggggaccgggagtgcaggg ggcggcggtgatgccaacgggggcctgcgcaactacgggtatcgccataacgccgaggaa cgctacgacgccttctgcttcacgtccaacctgccggggcgcgtgttcttcctgaagccg ctgcgacctgtacccttctccggagctgcgcgcgcgtgtgctgcgcgtggcgcggccgtg gccaaggtggggcagctgttcgccgcgtggaagctgcagctgctagaccgctgcaccgcg ggttggctggccgatggcagtgcgcgctaccccatcgtgaacccgcgagcgcgctgcgga ggccgcaggcctggtgtgcgcagcctcggcttcccggacgccacccgacggctcttcggc gtctactgctaccgcgctccaggagcaccggacccggcacctggcggctggggctggggc tgggcgggcggcggcggctgggcagggggcgcgcgcgatcctgctgcctggacccctctg cacgtctag >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_7|1103_aa MSLCTSPKVTFLLWVYCLPSLKRVPRGQRPVFSCIARRKANRWFGVAPPKSGKMNMNILH QEELIAQKKREIEAKMEQKAKQNQVASPQPPHPGEITNAHNSSCISNKFANDGSFLQQFL KLQKAQTSTDAPTSAPSAPPSTPTPSAGKRSLLISRRTGLGLASLPGPVKSYSHAKQLPV AHRPSVFQSPDEDEEEDYEQWLEIKGKLWELLLPHHQCPCSSRESVPIDCGVCELNPSTD SLLKVSPPEGAETRKVIEKLARFVAEGGPELEKVAMEDYKDNPAFAFLHDKNSREFLYYR KKVAEIRKEAQKSQAASQKEDEEVKNLAEKLARFIADGGPEVETIALQNNRENQAFSFLY EPNSQGYKYYRQKLEEFRKAKASSTGSFTAPDPGLKRKSPPEALSGSLPPATTCPASSTP APTIIPAPAAPGKPASAATVKRKRKSRWGPEEDKVELPPAELVQRDVDASPSPLSVQDLK GLGYEKGKPVGLVGVTELSDAQKKQLKEQQEMQQMYDMIMQHKRAMQDMQLLWEKAVQQH QHGYDSDEEVDSELGTWEHQLRRMEMDKTREWAEQLTKMGRGKHFIGDFLPPDELEKFME TFKALKEGREPDYSEYKEFKLTVENIGYQMLMKMGWKEGEGLGSEGQGIKNPVNKGTTTV DGAGFGIDRPAELSKEDDEYEAFRKRMMLAYRFRPNPLPRPGLAANEEPSALENASGLSR GPAAAAMDIPPLAGKIAALSLSALPVSYALNHVSALSHPLWVALMSALILGLLFVAVYSL SHGEVSYDPLYAVFAVFAFTSVVDLIIALQEDSYVVGFMEFYTKEGEPYLRTAHGVFICY WDGTVHYLLYLAMAGAICRRKRYRNFGLYWLGSFAMSILVFLTGNILGKYSSEIRPAFFL TIPYLLVPCWAGMKVFSQPRALTRCTANMVQEEQRKGLLQRPADLALVIYLILAGFFTLF RGLVVLDCPTDACFVYIYQYEPYLRDPVAYPKVQMLMYMFYVLPFCGLAAYALTFPGCSW LPDWALVFAGGIGQAQFSHMGASMHLRTPFTYRVPEDTWGCFFVCNLLYALGPHLLAYRC LQWPAFFHQPPPSDPLALHKKQH >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_7|3312_bp atgagcctctgcaccagcccgaaagttacttttttattatgggtctattgtcttcccagc ctgaagcgagtcccacgtgggcagagacctgtgttctcttgcatagcacgtagaaaggct aaccggtggtttggggttgctccccctaaatctggaaaaatgaacatgaacatccttcac caggaagagctcatcgctcagaagaaacgggaaattgaagccaaaatggaacagaaagcc aagcagaatcaggtggccagccctcagcccccacatcctggcgaaatcacaaatgcacac aactcttcctgcatttccaacaagtttgccaacgatggtagcttcttgcagcagtttctg aagttgcagaaggcacagaccagcacagacgccccgaccagtgcgcccagcgcccctccc agcacacccacccccagcgctgggaagaggtccctgctcatcagcaggcggacaggcctg gggctggccagcctgccgggccctgtgaagagctactcccacgccaagcagctgcccgtg gcgcaccgcccgagtgtcttccagtcccctgacgaggacgaggaggaggactatgagcag tggctggagatcaaaggtaagttgtgggagctgctgctcccccaccaccagtgcccgtgc tctagcagagagagtgtgcctattgactgtggggtgtgtgagttgaaccccagtactgac agcctccttaaagtttcacccccagagggagccgagactcggaaagtgatagagaaattg gcccgctttgtggcagaaggaggccccgagttagaaaaagtagctatggaggactacaag gataacccagcatttgcatttttgcacgataagaatagcagggaattcctctactacagg aagaaggtggctgagataagaaaggaagcacagaagtcgcaggcagcctctcagaaagag gacgaagaggtcaagaaccttgcagaaaagttggccaggttcatagcggacgggggtccc gaggtggaaaccattgccctccagaacaaccgtgagaaccaggcattcagctttctgtat gagcccaatagccaagggtacaagtactaccgacagaagctggaggagttccggaaagcc aaggccagctccacaggcagcttcacagcacctgatcccggcctgaagcgcaagtcccct cctgaggccctgtcagggtccttacccccagccaccacctgccccgcctcgtccacgcct gcgcccactatcatccctgctccagctgcccccgggaagccagcctccgcagccaccgtg aagaggaagcggaagagccggtgggggcctgaagaggataaggtagagctcccacctgct gaactggtgcagagggacgtggatgcctctccctcgcctctgtcagttcaggacctcaag gggctcggctatgagaaggggaagcctgtgggtctagtgggcgtcacagagctttcagac gcccagaagaagcagctgaaggagcagcaggagatgcagcagatgtacgacatgatcatg cagcacaagcgggccatgcaggacatgcagctgctgtgggagaaggcagtccaacagcac cagcacggctatgacagtgatgaggaggtggacagcgagctgggcacctgggagcaccag ctgcggcgcatggagatggataagaccagggaatgggccgagcagctgacaaagatgggc cggggcaagcacttcatcggagacttcctgcctccagacgagctggaaaagtttatggag accttcaaggccctgaaggagggccgtgagcctgactactcagagtacaaggagttcaag ctgactgtggagaacatcggctaccagatgctgatgaagatgggctggaaggagggcgag gggctgggctcagagggccagggcatcaagaacccagtgaacaagggcaccaccacagtg gacggcgctggcttcggcattgaccggccggcggagctctccaaggaggacgacgagtat gaggcgttccgcaagaggatgatgctggcctaccgcttccggcccaaccccctgcctcgg cccggactcgccgccaacgaggagccgagcgccctggagaacgcgtcggggctgagccgg ggtccagcagccgccgctatggacatcccgccgctggccggcaagatcgcggcgctgtcg ctgagcgccctcccggtgtcctacgcgctcaaccacgtctcggcgctctcgcaccccctg tgggtggcattgatgagcgccctaatcctgggtctgcttttcgtggcggtctacagcttg tcccatggcgaggtctcctatgacccactctatgctgtcttcgctgtcttcgccttcacc tcggttgtggacctcatcatcgctcttcaggaagacagctatgtggtgggcttcatggag ttctacaccaaggagggagagccatacctgcgcacagcgcacggagtcttcatctgctac tgggatggcactgttcactacctcctctacctggccatggccggcgccatctgcagaagg aagagataccggaattttggactctactggctgggttccttcgccatgagcatcctggtg ttccttacaggaaacattcttggcaaatacagctccgagatcaggcctgccttcttcctc accatcccctacctgctggtgccatgctgggctggcatgaaggtcttcagccagccccgg gcgctaacccgctgcaccgccaacatggtgcaagaggaacaaagaaagggactcctgcag cgtccggctgacctggcccttgtcatatatctcatccttgctggcttcttcactctgttc cggggcctggtggtgcttgattgccccacagatgcctgctttgtctatatctaccagtat gagccatacctgcgggaccctgtggcctaccctaaggtgcagatgctgatgtacatgttt tatgtcctgcctttctgcggcctggctgcctatgctctcaccttccctggttgctcctgg cttccagactgggccttggtgtttgctggaggcatcggccaggcacagttctcgcacatg ggggcttccatgcacctgcgcacacccttcacctaccgtgtgcctgaggacacctggggc tgcttcttcgtgtgcaatctgctgtatgcgctgggcccccacctgctggcctaccgttgc cttcagtggcccgcattcttccaccagccaccaccctccgaccccctagccctccacaag aagcagcattga >gi568815579f:19116954_19349908|GENSCAN_predicted_peptide_8|547_aa MAAQAAAAAQAAAAQAAQAEAADSWYLALLGFAEHFRTSSPPKIRLCVHCLQAVFPFKPP QRIEARTHLQLGSVLYHHTKNSEQARSHLEKANSVDAAKPLLRKAIQISQQTPYWHCRLL FQLAQLHTLEKDLVSACDLLGVGAEYARVVGSEYTRALFLLSKGMLLLMERKLQEVHPLL TLCGQIVENWQGNPIQKESLRVFFLVLQVTHYLDAGQVKSVKPCLKQLQQCIQTISTLHD DEILPSNPADLFHWLPKEHMCVLVYLVTVMHSMQAGYLEKAQKYTDKALMQLEKLKMLDC SPILSSFQVILLEHIIMCRLVTGHKATALQEISQVCQLCQQSPRLFSNHAAQLHTLLNIP SEGPPDSTLIRIRNVPIGNHITFPGTIRNAPSEHLVPPHIGLGLKGAGPDDNTTFFQGLY CVSVNCMDNAEAQFTTALRLTNHQELWAFIVTNLASVYIREGNRHQELYSLLERINPDHS FPVSSHCLRAAAFYVRGLFSFFQGRYNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHIF YVLGNHR >gi568815579f:19116954_19349908|GENSCAN_predicted_CDS_8|1641_bp atggcggctcaggcggcggcagcggcccaggcggcggcggcccaggctgcgcaggccgag gcggccgactcgtggtacctggcgcttctgggcttcgctgagcacttccgcacttccagc ccgcccaaaatccgcctgtgcgtgcactgcctgcaggccgtgttccccttcaagccgccg cagcgcatcgaggcccgtacacacctgcagctgggctccgttctctatcaccacaccaag aacagcgagcaggcgcgcagccacctggagaaggcgaattccgttgatgcagcaaagccg ctgctgcggaaggcgatccagatctcacagcagaccccatattggcactgccgcctgctc ttccagctcgctcaactgcacacgcttgagaaggacctggtgtcggcctgtgacctcctg ggtgtaggggccgagtacgcccgggtggtgggatctgaatacacacgggcgctgttcctc ctcagcaaggggatgctgctgctgatggagcgaaagctgcaggaggtgcacccgctgctg accctctgcgggcagatcgtggagaactggcaggggaaccccatccagaaggagtcgctg cgtgtcttcttcctggtgctccaggtcacccactatctggatgccgggcaggtgaagagc gtgaagccgtgtctgaagcagctgcagcagtgcatccagaccatctccacactgcacgat gatgagatcctgcccagcaaccccgctgacctcttccactggctgcccaaggagcacatg tgtgtgcttgtctacctggtgactgtgatgcactccatgcaggccggctacctggagaag gcgcagaagtacacggacaaggccctcatgcagctggagaagctcaagatgctggactgc agccccatcctgtcatccttccaagtgatcctgctggagcacatcatcatgtgccgcctt gtcacgggtcacaaggccacggcgctgcaggagatctcccaggtctgccagctgtgccag cagtccccccggctcttctccaaccatgcagcacagctgcacacattgctgaacattccc agtgaaggaccccctgacagcaccctaatcagaatccggaatgttcccataggtaatcat ataaccttcccaggcacgatccggaatgctcccagtgagcatcttgtcccaccccacatt ggcttgggtctgaaaggggccggccctgatgacaacaccaccttcttccagggcctgtac tgtgtctctgtcaactgcatggacaacgcggaagcccagttcaccacggccctgcggctc accaaccaccaggagctgtgggccttcatcgtcaccaacctggcgagtgtgtatatacgg gaaggaaatagacaccaagagctctacagtctgctggagaggatcaacccggaccacagc ttccctgtcagctcgcactgcctccgagcagccgccttctatgtgcgtgggctcttctcc ttcttccagggacgctacaacgaggccaagcgatttctgcgggaaactctgaagatgtcc aatgctgaggacctgaaccggctcacagcctgctccctcgtgcttctgggccacatcttc tatgtgctgggaaaccacagg