GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:37:30 Sequence gi568815586r:122132249_122366408 : 234160 bp : 46.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 60 135 76 0 1 102 80 21 0.583 2.22 1.02 Intr + 1100 1739 640 1 1 119 99 300 0.994 25.83 1.03 Intr + 2976 3097 122 1 2 102 72 115 0.871 11.51 1.04 Intr + 3241 3418 178 0 1 100 81 103 0.983 10.29 1.05 Intr + 5221 5342 122 2 2 61 81 69 0.954 3.71 1.06 Intr + 5838 6047 210 2 0 92 105 197 0.928 20.91 1.07 Intr + 6176 6303 128 1 2 83 58 274 0.989 23.38 1.08 Intr + 6513 6651 139 0 1 91 -13 153 0.749 5.97 1.09 Intr + 8706 8835 130 2 1 35 72 184 0.922 11.77 1.10 Intr + 9034 9102 69 2 0 79 89 21 0.572 0.45 1.11 Term + 9443 9564 122 0 2 123 37 170 0.997 14.04 1.12 PlyA + 12340 12345 6 1.05 2.07 PlyA - 12822 12817 6 -4.04 2.06 Term - 12963 12909 55 2 1 89 38 96 0.540 1.73 2.05 Intr - 13396 13261 136 2 1 61 90 62 0.505 3.33 2.04 Intr - 13992 13806 187 0 1 48 94 42 0.506 0.06 2.03 Intr - 14906 14794 113 0 2 103 4 92 0.566 2.40 2.02 Intr - 17170 17033 138 2 0 87 44 65 0.333 2.44 2.01 Init - 19276 19228 49 1 1 86 99 24 0.713 2.43 2.00 Prom - 29497 29458 40 -0.86 3.05 PlyA - 30318 30313 6 1.05 3.04 Term - 40493 40164 330 2 0 126 48 310 0.998 25.46 3.03 Intr - 41744 41596 149 0 2 105 76 24 0.898 2.85 3.02 Intr - 41989 41909 81 2 0 97 69 27 0.312 1.31 3.01 Init - 45413 45371 43 2 1 82 78 27 0.621 1.71 3.00 Prom - 48650 48611 40 -10.05 4.00 Prom + 49835 49874 40 -8.66 4.01 Init + 50897 51046 150 1 0 96 41 237 0.796 19.84 4.02 Intr + 52271 52531 261 1 0 60 86 342 0.818 28.88 4.03 Intr + 53942 54052 111 1 0 142 67 176 0.963 21.48 4.04 Intr + 55453 55592 140 0 2 87 53 107 0.970 6.36 4.05 Intr + 57882 58120 239 0 2 76 77 479 0.235 42.86 4.06 Intr + 59132 59319 188 0 2 116 53 255 0.966 24.21 4.07 Intr + 60497 60756 260 1 2 97 86 294 0.988 26.36 4.08 Intr + 67941 68082 142 0 1 96 86 218 0.920 22.76 4.09 Intr + 68284 68412 129 0 0 118 81 66 0.998 9.79 4.10 Intr + 68498 68686 189 1 0 114 54 327 0.934 31.68 4.11 Intr + 69048 69081 34 2 1 87 113 46 0.998 4.80 4.12 Term + 71067 71194 128 1 2 70 49 262 0.926 18.94 4.13 PlyA + 72632 72637 6 1.05 5.00 Prom + 72725 72764 40 -10.45 5.01 Sngl + 74079 75140 1062 2 0 70 49 871 0.978 76.56 5.02 PlyA + 75270 75275 6 -1.95 6.21 PlyA - 75606 75601 6 -5.12 6.20 Term - 76329 76133 197 1 2 89 55 233 0.993 17.67 6.19 Intr - 84336 84240 97 0 1 44 82 50 0.896 -0.42 6.18 Intr - 84621 84511 111 0 0 91 99 27 0.390 4.68 6.17 Intr - 86059 86018 42 1 0 85 95 41 0.242 2.94 6.16 Intr - 94003 93578 426 1 0 89 16 169 0.133 3.79 6.15 Intr - 94320 94194 127 2 1 64 13 373 0.246 28.08 6.14 Intr - 98683 98611 73 2 1 87 28 63 0.203 -1.34 6.13 Intr - 100179 100078 102 1 0 75 77 62 0.357 3.95 6.12 Intr - 100720 100552 169 1 1 72 111 176 0.980 17.82 6.11 Intr - 103675 103538 138 1 0 74 82 124 0.999 11.06 6.10 Intr - 106476 106339 138 0 0 84 109 8 0.877 3.16 6.09 Intr - 107697 107630 68 1 2 49 95 51 0.987 0.52 6.08 Intr - 110260 110134 127 1 1 94 75 85 0.994 8.05 6.07 Intr - 112514 112321 194 0 2 62 78 314 0.864 27.01 6.06 Intr - 117797 117623 175 2 1 86 98 110 0.997 11.31 6.05 Intr - 118851 118735 117 0 0 101 80 136 0.810 14.76 6.04 Intr - 120779 120720 60 2 0 132 50 9 0.319 0.43 6.03 Intr - 129199 129013 187 0 1 93 106 52 0.866 7.19 6.02 Intr - 131451 131324 128 0 2 80 70 42 0.941 1.08 6.01 Init - 134160 134059 102 0 0 99 100 136 0.917 16.36 6.00 Prom - 140314 140275 40 -8.96 7.23 PlyA - 140445 140440 6 1.05 7.22 Term - 140852 140627 226 1 1 41 45 205 0.962 7.75 7.21 Intr - 141914 141790 125 2 2 34 68 154 0.710 7.38 7.20 Intr - 145955 145906 50 0 2 46 95 52 0.522 0.10 7.19 Intr - 146694 146544 151 0 1 65 103 136 0.954 12.54 7.18 Intr - 146897 146780 118 1 1 48 65 151 0.998 9.27 7.17 Intr - 156293 156241 53 2 2 94 121 36 0.879 5.21 7.16 Intr - 177634 177514 121 0 1 102 86 90 0.809 10.70 7.15 Intr - 184607 184501 107 2 2 56 101 47 0.846 1.81 7.14 Intr - 187100 186984 117 2 0 88 13 120 0.854 5.16 7.13 Intr - 192133 189866 2268 1 0 67 55 2052 0.941 187.14 7.12 Intr - 195914 195699 216 2 0 100 68 304 0.999 28.30 7.11 Intr - 196178 196013 166 1 1 65 95 186 0.997 16.96 7.10 Intr - 200895 200739 157 1 1 71 71 173 0.998 12.97 7.09 Intr - 201862 201779 84 2 0 52 110 32 0.750 1.59 7.08 Intr - 202457 202400 58 2 1 80 95 29 0.844 1.26 7.07 Intr - 209449 208505 945 1 0 67 76 901 0.947 78.14 7.06 Intr - 215231 215127 105 2 0 92 57 117 0.985 9.41 7.05 Intr - 220538 220478 61 1 1 99 83 65 0.656 5.84 7.04 Intr - 222308 222205 104 2 2 73 55 182 0.999 12.37 7.03 Intr - 223064 222867 198 2 0 92 72 299 0.982 28.25 7.02 Intr - 228933 228711 223 2 1 82 66 164 0.822 11.73 7.01 Intr - 231859 231735 125 1 2 129 76 95 0.988 11.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_1|645_aa XRSIFGSMLPASASAPVPDPNNPPAQESILPTTALPTVSLPDSLIAPPTAPSLAHMDEQG CEHTSRTEDPFIQPTDFGPSEPPLSVPQPFLPVFTMPLLSPSPAPPPISPVLPLVPPPAT ALNPPAPPTFHQPQKFAGVNKAPSVITHTASATLTHDAPATTFSQSQGLVITTHHPAPSA APCGLALSPVTRPPQPRLTFVHPKPVSLTGGRPKQPHKIVPAPKPEPVSLVLKNARIAPA AFSGQPQAVIMTSGPLKREGMLASTVSQSNVVIAPAAIARAPGVPEFHSSILVTDLGHGT SSPPAPVSRLFPSTAQDPLGKGEQVPLHGGSPQVTVTGPSRDCPNSGQASPCASEQSPSP QSPQNNCSGKSDPKNVAALKLSSPGPWAPRISVGGEEGWTVCLQCACLSDQRVLQQNRQM KHISAEQKRRFNIKMCFDMLNSLISNNSKLTSHAITLQKTVEYITKLQQERGQMQEEARR LREEIEELNATIMVLIGHWLLHSSHGSCHFSSCQQLLPATGVPVTRRQFDHMKDMFDEYF SIIIKPLFESFKGMVSTSSLEELHRTALSWLDQHCSLPILRPTCSDPACMDLVPTICQAL YQVPSVVLSTLRQLSTSTSILTDPAQLPEQASKAVTRIGKRLGES >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_1|1938_bp nnccgctccatttttggctccatgctacctgcatctgcctcagcacctgtaccagatccc aacaacccacctgcacaggagagcatcctgccgaccacagccctccccactgtgagcctt cctgacagcctcatcgcgccccctaccgccccatccctggctcacatggatgagcagggc tgtgaacacacctcccggactgaggacccgtttatccagcccacggacttcggtccctca gagccgccactgagtgtcccgcagcccttcctccctgtcttcaccatgcccctgctgtct cccagccccgccccaccgcccatctcccccgtgttaccattagttcctcctcctgccact gccctgaaccccccggctccacccaccttccatcagccacagaagtttgctggagtcaac aaagcgccgtctgtcatcacccacacggcctctgccaccctcacccacgatgcccccgcc accacctttagccagagtcagggccttgtgatcaccacccatcaccctgccccgtcagcg gccccttgtgggctggcactgtctcctgtcacccggcctccccagccacggttaactttt gtgcaccccaaacctgtatccttgactgggggcaggcctaagcagccccacaaaatagtg cctgctcccaaaccagagcccgtgtccttggtgttgaagaatgcccgtatcgccccagct gccttttcaggccaaccacaagcggtgatcatgacgtcagggcctctgaagagagaaggg atgttggcctccaccgtgtcccagtccaacgtggtcattgcgcctgctgccatcgccagg gctcctggggtcccggagttccacagcagcatcctggtgacagatctcggccatggcacg agcagcccgcctgcccccgtctcccggctcttcccaagcacagcgcaagaccccctgggg aagggcgagcaggtcccgctgcatgggggcagcccccaggtcactgtcacagggcccagt cgggactgcccaaactcagggcaggcctctccgtgtgcatcggagcagagccccagtcct caatctccccagaacaactgctcagggaaatccgaccccaaaaatgtggctgcactaaag ctctctagccctggcccttgggcacccaggatcagcgtagggggtgaggaagggtggaca gtgtgcctgcaatgtgcctgtcttagtgaccagcgggttctccagcagaaccggcagatg aagcacatctcagctgagcagaaaaggcgcttcaacatcaagatgtgcttcgacatgctc aacagcctcatctccaacaattccaagctgaccagtcacgccatcacactgcagaagact gtggagtacatcaccaagctgcagcaggagagaggccagatgcaggaggaggcccggcgg ctgcgggaggagatcgaggagctcaatgccaccatcatggtccttattggccactggctg ctccacagctcccacggctcctgtcatttcagctcctgccagcagctgctccctgccacg ggagtccccgttacccggcgccagtttgatcacatgaaagacatgtttgacgaatacttc agcatcatcatcaagccgctgtttgagtcgttcaagggcatggtgtccaccagcagcctg gaggagctgcaccggacggcgctctcctggctggaccagcactgctccctgcccatcctc aggccgacttgttcagacccagcttgcatggatctagtgccaactatttgccaggccctg tatcaggtgccttcagtggtattgagcacgctgcggcagctgagcacctccacctccatc ctcacagacccggcacagctgccagagcaggcgtccaaggctgtcaccaggattggcaag agattgggagagtcctag >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_2|225_aa MGFLHVGQAGLELLTSVPGCGQRQLEQDGLQESSSTGCLDICFLPGLCEVRHQALVLVGS ESDHEVEPSLQRSPATITKAHTRKNRHLHNKLNLKKERNMCADKVKIPQSRDVEGPEPHT TSPCIFGTLAGHCCTRCLPGSVLSSPPFPSPRKRWAKLSSKTGSPGGQLGPWRKGPRQFS AAAPLYERDPYLHPSCRQGQSWLCTEMRPNAVISTNGQRCYCFLR >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_2|678_bp atggggtttctccatgttggtcaagctggtcttgaactcctgacctcagtcccagggtgt gggcagaggcagctggagcaggatgggctccaggagagttcatccacaggctgtctcgat atctgcttcctgccaggtctgtgcgaggtacggcaccaggccctcgtcctggtgggctca gaatctgaccacgaggtggaaccatctcttcaaaggagccctgccaccatcaccaaagct cacaccagaaaaaaccgccatttacacaacaaactgaacctgaaaaaggaacggaacatg tgtgcagacaaggtaaaaatcccccaaagcagagatgtcgagggcccggagccacacacc acatccccgtgcatctttggcactctggctggccactgctgcacacgctgcctgcccgga tcagtgctgtccagccctcctttcccttccccaaggaaacgctgggccaagctgagctcc aaaacaggttcacccgggggccagctagggccttggaggaaggggcctcgccagttctca gctgcagctcccctctatgagagagacccttacctgcaccccagctgccggcagggccag tcatggctgtgcacggagatgagacccaatgctgtcatcagcaccaatggccagcgctgc tattgcttcctgcggtaa >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_3|200_aa MVVQAYSPNYLGGWGSHGEHIWLQKPPLKLALLSLAMASHSGPSTSVLFLFCCLGGWLAS HTLPVRLLRPSDDVQKIVEELQSLSKMLLKDVEEEKGVLVSQNYTLPCLSPDAQPPNNIH SPAIRAYLKTIRQLDNKSVIDEIIEHLDKLIFQDAPETNISVPTDTHECKRFILTISQQF SECMDLALKSLTSGAQQATT >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_3|603_bp atggtggtgcaggcctacagtcccaactacttgggaggctggggcagccatggcgaacac atctggctccagaagcccccactgaagctggccttgctctctctcgccatggcctctcac tcaggcccctcgacgtctgtgctctttctgttctgctgcctgggaggctggctggcctcc cacacgttgcccgtccgtttactacgaccaagtgatgatgtacagaaaatagtcgaggaa ttacagtccctctcgaagatgcttttgaaagatgtggaggaagagaagggcgtgctcgtg tcccagaattacacgctgccgtgtctcagccctgacgcccagccgccaaacaacatccac agcccagccatccgggcatatctcaagacaatcagacagctagacaacaaatctgttatt gatgagatcatagagcacctcgacaaactcatatttcaagatgcaccagaaacaaacatt tctgtgccaacagacacccatgaatgtaaacgcttcatcctgactatttctcaacagttt tcagagtgcatggacctcgcactaaaatcattgacctctggagcccaacaggccaccact taa >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_4|656_aa MEASYESESESESEAGPGTQRPGTGTVSAAVREHLRKLCLREFPCGAGSWNKSRFLPQTW RTWRELVPREEDVVSPGEETVEALLGLVRSRHSPWALLNNSNAEDSFLRELAIRNPLTIT DTFFYSYFRSLRVIDKKVTLVDKDLLKFLKLEELVLSANRIKEVDATNLPPTLKVLELYG NEISSMECLCAHPPAGLQHLGLGHNKLLGPLESLYVTANHWPNLVSLDLGFNDLTDLQSM VTSLRTLRHLRLLVLQGNPLALVPYYRGLTIDSLAQLCVLDDITVSPNEKHLFRGLSLNG DLLAQEAQFVVTIGNIRGVLDTSVLDPEPRPEGPFITYNYYVTYDFVKDEEGEMNESAGV LAEIVKPSPSLELLVEESPEEVVEDVIEDIVEEVTEEVEGSLESEVEESGESELSVISGP STILQMPRASAEELAKLRLRIDPRLCPSPGTVLFSTAHKPWAEVIPCSYEMQHSLRDLVP LKAFLLAGTTVTIVEEKILSWPVVLPAVDSPLSAKKGKGEKDKKGKEKDRTGKGEKEPAK EWKVLKKKKEPPKELRQDPPILQVLGRGLVILEPLLAGEPLVSTVCNFGVVRTLTSDRLT LARDSKKIKKVAKKEKPKAVIPIYEGDYHPEPLTVEVQIQLNQCRSAEEALRMFAV >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_4|1971_bp atggaggcgtcgtacgagtccgagtccgagtccgagtctgaggccgggcctgggactcag cggcccgggaccgggaccgtgagcgcggccgtgcgcgagcacttgcggaagctgtgtctg cgcgagttcccgtgcggtgccggcagctggaataagtcgcgctttcttcctcaaacttgg cgaacttggagggagcttgtccccagagaggaggatgtggtgagccccggagaggagacg gtggaggccctgctgggcctggtccgcagccgccactccccctgggctctgctgaacaac tcgaatgcagaagacagtttcctgagagaattggccatccggaacccgctgacgatcaca gacaccttcttctactcctacttccggtccctgcgggtaatagacaagaaggtcaccctg gtggataaagacctcctgaaatttctaaagctggaggagttggtactgagcgccaatcga atcaaggaggtggatgccaccaatctgccccccacactcaaggtgctggagctctacggc aatgagatcagcagcatggagtgtctgtgtgcccacccacccgccggcctgcagcacttg gggttaggccacaacaaacttctaggccccttggaaagtctctacgtcaccgctaatcac tggcccaacctcgtctccctggacctgggcttcaacgacctgacagacctgcagagcatg gtcaccagcctgaggaccctccggcacctgcgactcctggtgctgcagggaaacccactg gccttggtgccctactaccgcggcctcaccatcgacagcctggcccagctctgcgtgctg gacgacatcaccgtgtctcccaatgagaagcatctcttccgggggctcagcctcaatggc gatctcttggcacaggaggcgcagtttgtggtgaccatcggaaacatcagaggagtcctg gacacctctgtcttagacccggaacccaggcccgaaggccctttcatcacttacaactat tacgtgacctatgattttgtgaaagatgaagaaggcgaaatgaatgagtccgcgggcgtc ctggccgagatcgtcaagccctctcccagcttagaattattagttgaggaatctcctgaa gaggtcgtggaagacgtcatcgaagacattgttgaagaggttactgaagaggtcgaaggg tctctggagtctgaggtggaggagtcaggagagtcggagctgtctgtcatctcggggcct tcgaccatcttgcagatgccgagggcctctgcagaagagctggccaagttgaggctgcgt atagatccccggctctgcccgtccccagggactgtcctcttcagcactgcccacaagccc tgggctgaggtcatcccctgcagttacgagatgcagcactctctcagggacctggtccca ctgaaggccttcctgctggcggggaccaccgtgaccatcgtggaggagaagattctctcc tggcctgtggtgctacctgctgttgacagtcccctgtctgccaagaaaggaaagggggag aaagacaagaaagggaaggagaaagacaggacggggaaaggagagaaagagccggccaag gagtggaaggtgctgaagaagaagaaagagccgcccaaggagctccggcaggaccccccc atcctccaggtgctgggccggggcctggtgatcctggagcccctgctcgccggggagccc ctggtgtccaccgtgtgcaacttcggcgtggtccgcacattgacatctgacaggctgacg ttggccagggattcaaagaagattaagaaagttgccaaaaaagaaaagccgaaagccgtg attccgatctacgaaggcgattaccaccctgagcccctgaccgtagaggtgcagatccag ctgaaccagtgccgctcggcggaggaggctctgcgcatgttcgccgtgtag >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_5|353_aa MLCRLCWLVSYSLAVLLLGCLLFLRKAAKPAGDPTAHQPFWAPPTPRHSRCPPNHTVSSA SLSLPSRHRLFLTYRHCRNFSILLEPSGCSKDTFLLLAIKSQPGHVERRAAIRSTWGRVG GWARGRQLKLVFLLGVAGSAPPAQLLAYESREFDDILQWDFTEDFFNLTLKELHLQRWVV AACPQAHFMLKGDDDVFVHVPNVLEFLDGWDPAQDLLVGDVIRQALPNRNTKVKYFIPPS MYRATHYPPYAGGGGYVMSRATVRRLQAIMEDAELFPIDDVFVGMCLRRLGLSPMHHAGF KTFGIRRPLDPLDPCLYRGLLLVHRLSPLEMWTMWALVTDEGLKCAAGPIPQR >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_5|1062_bp atgctctgcaggctgtgctggctggtctcgtacagcttggctgtgctgttgctcggctgc ctgctcttcctgaggaaggcggccaagcccgcaggagaccccacggcccaccagcctttc tgggctcccccaacaccccgtcacagccggtgtccacccaaccacacagtgtctagcgcc tctctgtccctgcctagccgtcaccgtctcttcttgacctatcgtcactgccgaaatttc tctatcttgctggagccttcaggctgttccaaggataccttcttgctcctggccatcaag tcacagcctggtcacgtggagcgacgtgcggctatccgcagcacgtggggcagggtgggg ggatgggctaggggccggcagctgaagctggtgttcctcctaggggtggcaggatccgct cccccagcccagctgctggcctatgagagtagggagtttgatgacatcctccagtgggac ttcactgaggacttcttcaacctgacgctcaaggagctgcacctgcagcgctgggtggtg gctgcctgcccccaggcccatttcatgctaaagggagatgacgatgtctttgtccacgtc cccaacgtgttagagttcctggatggctgggacccagcccaggacctcctggtgggagat gtcatccgccaagccctgcccaacaggaacactaaggtcaaatacttcatcccaccctca atgtacagggccacccactacccaccctatgctggtgggggaggatatgtcatgtccaga gccacagtgcggcgcctccaggctatcatggaagatgctgaactcttccccattgatgat gtctttgtgggtatgtgcctgaggcggctggggctgagccctatgcaccatgctggcttc aagacatttggaatccggcggcccctggaccccttagacccctgcctgtatagggggctc ctgctggttcaccgcctcagccccctcgagatgtggaccatgtgggcactggtgacagat gaggggctcaagtgtgcagctggccccataccccagcgctga >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_6|925_aa MAAHLSYGRVNLNVLREAVRRELREFLDKCAGSKEHEVEKMFTLKGNRLPAADVKNIIFF VRPRLELMDIIAENVLSEDRRGPTRDFHILFVPRRSLLCEQRLKDLGVLGSFIHREEYSL DLIPFDGDLLSMESEGAFKALDPQGPLSSGCGQQGHVGLECYLEGDQTSLYHAAKGLMTL QALYGTIPQIFGKGECARQVANMMIRMKREFTGSQNSIFPVFDNLLLLDRNVDLLTPLAT QLTYEGLIDEIYGIQNSYVKLPPEKFAPKKQGDGGKDLPTEAKKLQLNSAEELYAEIRDK NFNAVGSVLSKKAKIISAAFEERHNAKTVGEIKQFVSQLPHMQAARGSLANHTSIAELIK DVTTSEDFFDKLTVEQEFMSGIDTDKVNNYIEDCIAQKHSLIKVLRLVCLQSVCNSGLKQ KVLDYYKREILQTYGYEHILTLHNLEKAGLLKPQTGGRNNYPTIRKTLRLWMDDVNEQNP TDISYVYSGYAPLSVRLAQLLSRPGWRSIEEVLRILPGPHFEERQPLPTGLQKKRQPGEN RVTLIFFLGGVTFAEIAALRFLSQLEDGGVSLDDLGDQQPQLLLLLQETVRLCAMPASST VHVLQLLRELLAFVLLSYTVLIGALLLAGWTTYFLALHMPRPARKFRRLVALRGRYSLCP KPRPLSPCLPNPDCFLGWGRGYGARRSQVEAVPRSALTKLRHFRRVRLASARCTMAALKS WLSRSVTSFFRYRCGRVEGTGPEGGTDADSAVARPTAPAFLTRSSLVRPAGSRRLAPTTY ALIEAITEYTKAVYTLTSLYRQYTSLLGKMNSEEEDEVWQVIIGARAEMTSKHQEYLKLE TTWMTAVGLSEMAAEAAYQTGADQASITARNHIQLVKLQVEEVHQLSRKAETKLAEAQIE ELRQKTQEEGEERAESEQEAYLRED >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_6|2778_bp atggcggctcatctgtcctacggccgagtgaacctaaacgtgttgcgcgaggcggtgcgt cgcgagctgcgcgagttcctggacaagtgcgcaggaagcaaggaacatgaagtggaaaaa atgttcacacttaaaggaaatcgtttgccggcagctgatgtgaagaatataatttttttt gtcagacccaggctagagttgatggatataatcgctgaaaacgtgctcagtgaagataga cgaggcccaacgagagattttcatattctgtttgtgccacgccgtagcctgttgtgcgaa cagcggttgaaggatctgggtgtcttgggatcctttattcacagggaggagtacagctta gatctcattccattcgatggggatctcttatccatggaatcagagggtgcattcaaagcc ctggaccctcagggccccctcagttctggctgtggacagcagggacatgttgggttggag tgctacctggagggtgaccagacgagcctgtaccacgcagccaaggggctgatgaccctg caagctctgtatggaacgatcccccagatctttgggaaaggagaatgcgctcggcaagtg gccaatatgatgatcaggatgaagagagagtttacaggaagccagaattcaatatttcct gtttttgataatctcttgttgcttgatcggaatgtggatttattaacacctcttgccact cagctgacatatgaaggactcattgatgaaatttatggcattcagaacagttatgtgaaa ttacctccagagaaatttgcacctaagaaacagggcgatggtggtaaggacctccccacg gaagcaaagaagctgcagctgaattctgcagaggagctctatgctgagatccgagataag aacttcaacgcagttggctctgtgctcagcaagaaagcaaagatcatctctgcagcattc gaggaaagacacaatgctaagaccgtgggggagatcaagcagtttgtttcccagttgccc cacatgcaggcagcaaggggctcgcttgcaaaccatacctcaattgcagaattgatcaaa gatgtcactacttctgaagacttttttgataaattaaccgtggaacaggagtttatgtct ggaatagacactgataaggtcaacaattacattgaggattgtatcgcccaaaagcactcg ttgatcaaggtgttaagactagtttgcctccaatccgtgtgtaatagtgggctcaaacaa aaagttttggattattacaaaagagagattctccagacatacggctatgagcacatattg accttacacaacctggagaaggccggcctgctgaaaccgcagacggggggcagaaacaat tacccaactatacggaaaacattacgcctctggatggatgatgttaatgagcaaaacccc acggacatatcgtatgtgtacagtgggtatgccccgctcagtgtgcggctggcccagctg ctttcccggcctggctggcggagcatcgaggaggtcctccgcatcctcccagggccccac tttgaggagcggcagccactgcccacaggactgcagaagaaacgtcaaccgggagaaaac cgagtgactctgatatttttccttgggggcgtaaccttcgctgaaattgctgccctgcga tttctctcccagttggaagatggaggtgtatctcttgatgatcttggagaccagcagcca cagctgctgctactcctgcaggagactgtcaggctgtgcgcgatgccggcctcgtccacc gtccacgtgctgcagctgctgcgggagctgctcgccttcgtgctcctcagctacacggtg ctcatcggggcgctgctgctggccggctggaccacttacttcctggccctgcatatgccc cgccccgcgcggaagttccggcggttggttgccttgcgcggccgttacagcctttgccct aagcctcgccccctttccccctgcctgcccaatcccgactgcttccttgggtgggggcgt ggctatggggcgaggcgctctcaggtggaggccgtgccccgctccgcgctcacgaagctg cgtcacttccggcgtgtgcgtctggcgtccgcgcgctgcacaatggcggctctgaagagt tggctgtcgcgcagcgtaacttcattcttcaggtaccgctgcggccgcgtagaggggaca ggaccagagggagggaccgacgcggacagcgctgtggcccggcccacggcgcccgccttc cttacgcgctcatctctcgtccgcccagctgggtcgcggcgtctcgctccgaccacatat gcgttgattgaagctattactgaatatactaaggctgtttataccttaacttctctttac cgacaatatacaagtttacttgggaaaatgaattcagaggaggaagatgaagtgtggcag gtgatcataggagccagagctgagatgacttcaaaacaccaagagtacttgaagctggaa accacttggatgactgcagttggtctttcagagatggcagcagaagctgcatatcaaact ggcgcagatcaggcctctataaccgccaggaatcacattcagctggtgaaactgcaggtg gaagaggtgcaccagctctcccggaaagcagaaaccaagctggcagaagcacagatagaa gagctccgtcagaaaacacaggaggaaggggaggagcgggctgagtcggagcaggaggcc tacctgcgtgaggattga >gi568815586r:122132249_122366408|GENSCAN_predicted_peptide_7|1925_aa VGGTKAGVVRFLGETDFAKGEWCGVELDEPLGKNDGAVAGTRYFQCQPKYGLFAPVHKVT KIGFPSTTPAKAKANAVRRVMATTSASLKRSPSASSLSSMSSVASSVSSRPSRTGLLTET SSRYARKISGTTALQEALKEKQQHIEQLLAERDLERAEVAKATSHVGEIEQELALARDGH DQHVLELEAKMDQLRTMVEAADREKVELLNQLEEEKRKVEDLQFRVEEESITKGDLETQT KLEHARIKELEQSLLFEKTKADKLQRELEDTRVATVSEKSRIMELEKDLALRVQEVAELR RRLESNKPAGDVDMSLSLLQEISSLQEKLEVTRTDHQREITSLKEHFGAREETHQKEIKA LYTATEKLSKENESLKSKLEHANKENSDVIALWKSKLETAIASHQQAMEELKVSFSKGLG TETAEFAELKTQIEKMRLDYQHEIENLQNQQDSERAAHAKEMEALRAKLMKVIKEKENSL EAIRSKLDKAEDQHLVEMEDTLNKLQEAEIKVKELEVLQAKCNEQTKVIDNFTSQLKATE EKLLDLDALRKASSEGKSEMKKLRQQLEAAEKQIKHLEIEKNAESSKKEKFAEASEEAVS VQRSMQETVNKLHQKEEQFNMLSSDLEKLRENLADMEAKFREKDEREEQLIKAKEKLEND IAEIMKMSGDNSSQLTKMNDELRLKERDVEELQLKLTKANENASFLQKSIEDMTVKAEQS QQEAAKKHEEEKKELERKLSDLEKKMETSHNQCQELKARYERATSETKTKHEEILQNLQK TLLDTEDKLKGAREENSGLLQELEELRKQADKAKSLTYLLTSAKKEIELMSEELRGLKSE KQLLSQEGNDLKLENGSLLSKLVELEAKIALLQGDQQKLWSVNETLNLEKEKFLEEKQDA EKYYEQEHLNKEALAVEREKLLKEINVVQEELLKINVENDSLQASKVSMQALIEELQLSK DTLIAKTEKDQEEKDHLEDQIKKLITENFILAKDKDDIIQKLQRSYEELVKDQKALVQET EDLTAEKKSALEKLSNLDNTCIALKVERDNVLQNNRNLQLETDMLLQDQEKLNASLQAAL QVKQLLRSEASGLRAQLDDASKALRKAELETVQLEAANTSLTKLLEEIKARRAVTDSECI QLLHEKETLAASERRLLAEKEELLSENRIITEKLHKCLEEAAHTEMSLNEKITYLTSEKE MASQKMTELKKQQDSLLKEKSSLETQNGALLAERENSIKAIGDLKRQCDQESANRSLVVQ ENMKLLGNIDALKKELQERKKENQELVASKCDLSLMLKEAQNTKKNLEKEHTHILQAKES LDAQLNTCCSEKNILLRDGLNLQEECHKLSKEIQEMQQSLILEQEARAKESESSLYENNQ LHGRMVLLEQEVEELRVCIEELQSEKFVLLQEKSKSEQELAEIIEEKELLTAEAAQLAAH IKTLKSDFAALSKSKAELQELHSCLTKILDDLQRNHEVTLAEKAQVMQDNQNLLAEKSEM MLEKDELLKEKETLAESYFILQKEISQLAKTNSHISANLLESQNENRTLRKDKNKLTLKI RELETLQSFTAAQTAEDAMQIMEQMTKEKTETLASLEDTKQTNAKLQNELDTLKENNLKN VEELNKSKELLTVENQKMEEFRKEIETLKQAAAQKSQQLSALQEENVKLAEELGRSRDEV TSHQKLEEERSVLNNQLLEMKKRESKFIKDADEEKASLQKSISITSALLTEKDAELEKLR NEVTVLRGENASAKSLHSVVQTLESDKVKLELKVKNLELQLKENKRQLSSSSGNTDTQAD EDERAQESQIDFLNSVIVDLQRKNQDLKMKVEMMSEAALNGNGDDLNNYDSDDQEKQSKK KPRLFCDICDCFDLHDTEDCPTQAQMSEDPPHSTHHGSRGEERPYCEICEMFGHWATNCN DDETF >gi568815586r:122132249_122366408|GENSCAN_predicted_CDS_7|5778_bp gttggtggcactaaggctggtgtagtccggtttcttggggagaccgactttgccaagggg gagtggtgtggcgtggagttagatgagccacttgggaagaatgatggcgctgttgctgga acaaggtattttcagtgtcaacccaaatatggcttgttcgctcctgtccacaaagttacc aagattggcttcccttccactacaccagccaaagccaaggccaacgcagtgaggcgagtg atggcgaccacgtccgccagcctgaagcgcagcccttctgcctcttccctcagctccatg agctcagtggcctcctctgtgagcagcaggcccagtcggacaggactattgactgaaacc tcctcccgttacgccaggaagatctccggtaccactgccctccaggaggccctgaaggag aagcagcagcacattgagcagctgctggcggaacgggatctggagagggcggaggtggcc aaggccacgagccacgtgggggagatagagcaggagctagctctggcccgggacggacat gaccagcatgtcctggaattggaagccaaaatggaccagctgcgaacaatggtggaagct gctgacagggagaaggtggagcttctcaaccagcttgaagaggagaaaaggaaggttgag gaccttcagttccgggttgaagaagaatcaattaccaaaggtgatcttgagacgcagacc aaactggagcatgcccgcattaaggagcttgaacagagcctgctctttgaaaagaccaaa gctgacaaactccagagggagttagaagacactagggtggctacagtttcagaaaagtca cgtataatggaactggagaaagacctagcattgagagtacaggaagtagctgagctccga agaaggctagagtccaataagcctgctggggatgtggacatgtcactttcccttttgcaa gagataagctctttgcaagaaaagttagaagtcacccgtactgaccaccagagagaaata acttctctgaaggagcattttggagcccgggaagaaactcatcagaaggagataaaggct ctgtataccgccacggaaaagctttccaaagagaacgagtcattgaaaagcaagctggag catgccaacaaagagaactcagatgtgatagctctatggaagtccaaactggagactgcc atcgcatcccaccagcaggcgatggaagaactgaaggtatctttcagcaaagggcttgga acagagacggcagaatttgctgaactaaaaacacaaatagagaaaatgagactagattac caacacgaaatagaaaatttgcagaatcaacaagactctgaacgggctgcccatgctaaa gagatggaagccttgagggctaaactgatgaaagttattaaagaaaaggaaaacagtctg gaagccatcaggtcgaaactggacaaagcagaagaccagcatctcgtagaaatggaagac acgttaaacaaattacaggaagctgaaataaaggtaaaggagctagaggtactgcaagcc aaatgcaatgaacaaaccaaggttattgataattttacatcacagctcaaggctactgaa gaaaagctcttggatcttgatgcacttcggaaagccagttccgaaggtaaatcggaaatg aagaaacttagacagcagcttgaggcagctgagaaacagattaaacatttagagattgaa aagaatgctgaaagtagcaagaaagaaaagtttgctgaagcttcagaggaggcagtctct gttcagagaagtatgcaagaaactgtaaataagttacaccaaaaggaggaacagtttaac atgctgtcttctgacttggagaagctgagagaaaacttagcagatatggaggcaaaattt agagagaaagatgagagagaagagcagctgataaaggcaaaggaaaaactggaaaatgac attgcagaaataatgaagatgtcaggagataactcttctcagctgacaaaaatgaacgat gaattacgtctgaaagaaagagatgtagaagaattacagctaaaacttacaaaggctaat gaaaatgcaagttttctgcaaaaaagtattgaggacatgactgtcaaagctgaacagagc cagcaagaagcagctaaaaagcatgaggaagaaaagaaagaattggagaggaaattgtcg gacctggaaaagaaaatggaaacaagccacaaccagtgtcaggagctgaaagccaggtat gagagagccacttctgagacaaaaaccaagcatgaagaaatcctacagaacctccagaag acgctgctggacacagaggacaagctgaagggcgcacgggaggagaacagtggcttgctg caggagctggaggagctgagaaagcaagccgacaaagccaaatcgctaacttatttgtta acatcagccaaaaaagaaattgaactaatgtcagaagagctgaggggtctgaaatcagag aagcagcttctttctcaggagggaaatgatttaaagttagaaaacggttcacttttatcc aagcttgtagaattggaggccaaaatagctttacttcagggagaccagcagaaactgtgg tcagtgaatgaaactcttaatttagaaaaggagaaattcttagaagaaaagcaagatgcc gaaaagtattatgagcaggaacatctcaataaagaagctttggctgttgagagagagaaa ttgcttaaagaaatcaatgttgtacaggaagaactcttgaagataaatgtggaaaatgac tctttgcaagcttccaaggtgagcatgcaggcactcattgaagagctccagctcagcaaa gatactttgattgctaagactgagaaggaccaggaagaaaaagatcacctggaggaccag atcaagaaacttattaccgaaaacttcatcttggccaaagataaggatgacatcattcag aagcttcaaaggtcttatgaggagctagtcaaagatcagaaagctttggtacaggagact gaagatctcacagctgagaaaaagtcagctttagagaaattgtccaatctcgataacacg tgcatagccttaaaggtagaacgagataatgttcttcagaacaacagaaatctgcagtta gagacggacatgctgcttcaagatcaggaaaagctgaatgccagcctccaggccgctctc caggtcaaacagctgctccgctcagaagccagtgggctccgcgcacagctggatgatgcc agcaaggccctgaggaaggcagagctggagaccgtgcaactcgaggccgcaaacacaagc ctcacgaagctcttggaggaaattaaggccaggcgggcggtcacggactccgagtgcatc cagcttctgcatgagaaagaaaccttggctgcctccgagagaaggctcttggctgagaaa gaggaacttttaagtgaaaatagaataatcactgaaaaactccacaaatgcttagaagag gctgcccatactgagatgagcctgaatgagaagatcacttacctgacttccgagaaggag atggcttctcagaaaatgactgaacttaaaaagcagcaggatagtctcttgaaagaaaaa tcctcactggaaacgcaaaatggagctttacttgcagagagagagaattccatcaaagcc ataggagacctcaaaaggcaatgtgatcaagagtctgcaaacagaagtttagttgtgcaa gagaatatgaaactcctcggtaatattgatgctctgaagaaggagcttcaagagagaaaa aaggaaaaccaagaactagtggccagcaagtgcgacctctctttgatgctgaaagaggct caaaataccaaaaagaatctggaaaaagaacacactcacatattgcaagcaaaggagagt ttggatgctcaacttaacacgtgttgttccgagaagaacattttgctgagagatggcttg aacctgcaagaagagtgtcacaaattaagcaaggagatccaggaaatgcagcagtcctta atcctggaacaggaagccagagcaaaggagagcgagtcatccttgtacgaaaacaatcaa cttcacgggaggatggtgctcctggagcaggaggtggaggagttaagagtgtgtatcgag gagctgcagtccgagaagtttgtgctacttcaagagaagagcaaatcagagcaagaactg gcagagataatcgaggagaaggaactgttgactgcagaagcagctcaacttgctgcccat ataaagactctgaaaagtgattttgctgccttgtccaaatccaaggcagagctgcaggaa ctgcacagctgcctcaccaagattctggatgaccttcagcggaaccatgaggtgaccctg gccgaaaaagcccaggtgatgcaagacaaccagaacctcctggctgagaagagcgaaatg atgctggaaaaggatgagctcctgaaggagaaggaaaccctggcagaaagctacttcatc cttcagaaagagatcagccagttggccaaaaccaacagccatatttcagccaatctccta gaatctcaaaatgaaaaccgtactttgagaaaagacaagaacaagcttactcttaaaatt agagagctcgagactcttcagtcatttacggctgctcaaacagcggaagatgccatgcag ataatggaacagatgaccaaagagaagactgagactctggcctccttggaggacaccaag caaacaaatgcaaaactacagaatgaattggacacacttaaagaaaacaacttgaaaaat gtggaagagctgaacaaatcaaaagaactcctgactgtagagaatcaaaaaatggaagaa tttaggaaagaaatagaaaccctaaagcaggcagcagctcagaagtcccagcagctttca gcgttgcaagaagagaacgttaaacttgctgaggagctggggagaagcagggacgaagtc acaagtcatcaaaagctggaagaagaaagatctgtgctcaataatcagttgttagaaatg aaaaaaagagaatccaagttcataaaagacgcagatgaagagaaagcttccttgcagaaa tccatcagtataactagtgccttactcacagaaaaggatgccgagctggagaaactgaga aatgaggtcacagtgctcaggggagaaaacgcctctgccaagtccttgcattcagttgtt cagactctagagtctgataaggtgaagctcgagctcaaggtaaagaacttggagcttcaa ctcaaagaaaacaagaggcagctcagcagctcctcaggtaatacagacactcaggcagac gaggatgaaagagcccaggagagtcagattgatttcctaaattcagtaatagtggacctt caaaggaagaatcaagacctcaagatgaaggtggagatgatgtcagaagcagccctgaat gggaacggggatgacctaaacaattatgacagtgatgatcaggagaaacagtccaagaag aaacctcgcctcttctgtgacatttgtgactgctttgatctccacgacacagaggattgt cctacccaggcacagatgtcagaggaccctccccattccacacaccatggcagtcggggt gaggaacgcccatactgtgaaatctgtgagatgtttggacactgggccaccaactgcaat gacgacgaaaccttctga