GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:20:44 Sequence gi568815583r:39701135_40020026 : 318892 bp : 40.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 PlyA - 555 550 6 1.05 1.12 Term - 11081 11031 51 2 0 105 39 61 0.856 -0.65 1.11 Intr - 12447 12310 138 0 0 66 103 118 0.847 10.84 1.10 Intr - 25613 25455 159 2 0 105 68 113 0.408 10.26 1.09 Intr - 32541 32458 84 0 0 38 76 76 0.023 0.50 1.08 Intr - 37067 36957 111 2 0 25 94 106 0.188 4.56 1.07 Intr - 38655 38531 125 1 2 37 87 65 0.431 0.68 1.06 Intr - 40766 40671 96 0 0 47 71 116 0.497 4.86 1.05 Intr - 62780 62687 94 2 1 55 97 21 0.138 -1.68 1.04 Intr - 64612 64458 155 2 2 6 116 146 0.792 8.07 1.03 Intr - 69476 69293 184 2 1 64 91 185 0.927 14.84 1.02 Intr - 72665 72641 25 1 1 118 45 -19 0.372 -6.09 1.01 Init - 74637 74441 197 0 2 93 31 161 0.440 9.35 1.00 Prom - 78009 77970 40 -4.25 2.00 Prom + 79050 79089 40 -7.45 2.01 Sngl + 81345 81860 516 2 0 53 39 430 0.394 28.39 2.02 PlyA + 83233 83238 6 1.05 3.06 PlyA - 83844 83839 6 1.05 3.05 Term - 84831 84666 166 2 1 23 42 148 0.023 0.11 3.04 Intr - 94764 94640 125 0 2 42 81 117 0.234 4.86 3.03 Intr - 101120 100067 1054 1 1 101 102 979 0.913 90.05 3.02 Intr - 106124 105872 253 0 1 61 110 125 0.226 7.57 3.01 Init - 122164 122074 91 1 1 70 88 44 0.030 3.30 3.00 Prom - 124555 124516 40 -4.25 4.00 Prom + 135805 135844 40 -3.45 4.01 Sngl + 137451 137894 444 2 0 42 49 232 0.983 10.69 4.02 PlyA + 138259 138264 6 1.05 5.00 Prom + 140322 140361 40 -5.35 5.01 Sngl + 143332 143697 366 0 0 32 51 203 0.476 6.84 5.02 PlyA + 145408 145413 6 1.05 6.08 PlyA - 148300 148295 6 1.05 6.07 Term - 155509 155339 171 1 0 48 54 111 0.689 0.54 6.06 Intr - 159685 159534 152 2 2 83 98 135 0.877 12.96 6.05 Intr - 161026 160943 84 2 0 79 97 42 0.708 3.07 6.04 Intr - 186797 186578 220 2 1 104 48 54 0.042 -0.05 6.03 Intr - 191474 191376 99 2 0 77 72 108 0.482 7.49 6.02 Intr - 215001 214943 59 1 2 116 100 8 0.160 2.48 6.01 Init - 218892 218721 172 0 1 83 111 310 0.695 32.45 6.00 Prom - 222659 222620 40 -9.25 7.00 Prom + 225949 225988 40 -7.75 7.01 Init + 233062 233205 144 0 0 97 74 156 0.845 15.35 7.02 Intr + 238371 238483 113 2 2 114 56 80 0.263 5.66 7.03 Intr + 242249 242351 103 2 1 104 94 58 0.237 7.26 7.04 Intr + 247982 248134 153 1 0 50 78 134 0.732 7.95 7.05 Intr + 252770 252850 81 1 0 38 80 77 0.635 0.82 7.06 Intr + 254486 254634 149 1 2 106 97 89 0.938 9.71 7.07 Intr + 260650 260765 116 1 2 47 97 52 0.818 1.17 7.08 Intr + 264552 264709 158 1 2 75 115 114 0.991 11.61 7.09 Term + 266210 266749 540 1 0 61 48 570 0.907 43.57 7.10 PlyA + 269359 269364 6 1.05 8.00 Prom + 269541 269580 40 -4.45 8.01 Init + 271853 271880 28 1 1 72 113 31 0.943 3.81 8.02 Intr + 272458 272615 158 2 2 88 70 186 0.904 15.61 8.03 Intr + 275280 275710 431 2 2 115 69 603 0.915 52.69 8.04 Intr + 276944 277013 70 2 1 113 76 73 0.965 6.87 8.05 Intr + 284671 284754 84 0 0 57 94 52 0.712 1.80 8.06 Intr + 286849 286971 123 0 0 71 31 158 0.895 8.26 8.07 Intr + 289139 289243 105 1 0 71 84 45 0.764 1.89 8.08 Intr + 291041 291095 55 1 1 57 97 52 0.776 0.63 8.09 Intr + 291635 291714 80 0 2 103 110 51 0.675 7.15 8.10 Intr + 295830 295931 102 2 0 103 42 81 0.953 4.45 8.11 Intr + 297597 297650 54 2 0 49 91 93 0.930 3.96 8.12 Intr + 299854 300090 237 0 0 37 86 354 0.989 26.99 8.13 Intr + 301579 301654 76 0 1 96 80 77 0.746 5.87 8.14 Intr + 302059 302180 122 2 2 69 76 106 0.747 6.79 8.15 Intr + 305882 305931 50 1 2 120 107 -26 0.687 -0.94 8.16 Intr + 306893 307187 295 2 1 83 27 120 0.334 1.49 8.17 Intr + 315368 315538 171 1 0 119 36 176 0.989 14.72 8.18 Intr + 315974 316108 135 1 0 94 81 179 0.999 17.64 8.19 Intr + 317959 318066 108 0 0 129 84 131 0.994 16.36 8.20 Intr + 318269 318359 91 1 1 98 83 130 0.948 12.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 35566 35753 188 2 2 97 33 155 0.871 7.57 S.002 Init + 117448 117458 11 0 2 75 95 -6 0.875 -1.04 S.003 Init + 246458 246547 90 1 0 93 84 68 0.920 7.44 S.004 Term + 309633 309818 186 2 0 60 50 146 0.841 4.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_1|472_aa MAEGEGEAGTSYMVGAGERESEGGGATHFQTTKFHENTVRRTAMIQSPTTRPLLRHWELQ FDMRFGSLYKWNHTVDTASNLNSGKEDHSESSNTENRRTSNDDKQESCSEKIKLAEEGSD EDLDLVQHQIISECSDEPKLKELDSQLQDAIQKMKKLDKILAKKQRREKEIKKQGLEMRI KLWEEIKSAKYSEAWQSKEEMENTKKFLSLTAVSEETVGPSHEEEDTFSSVFHTQIPPEE YEMQMQKLNKDFTCDVERNESLIKSGKKPFSNTEKIELRGKHNQDFIKRNIELAKESRNP VVMVDREKKRLVELLKDLDEKDSGLSSSEDFYLHLSGQAVSRSHLFSMLLQVRRGIQGDQ SGWVVPVKGYELAVTQHQQLAEIDIKLQELSAASPTISSFSPRLENRNNQKPDRDGERNM EVTPGEKILRNTKEQRDLHNRLREIDEKLKMMKENVAHCSSTDTEPTGAQAP >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_1|1419_bp atggcggaaggtgaaggggaagctggcacatcctacatggttggagcaggagagagagag agtgaaggaggaggtgctacacactttcaaacaaccaaatttcatgagaacactgtcaga agaacagcaatgattcagtcacctaccaccaggcccctcctccgacactgggaattacaa ttcgacatgagatttggaagtttgtataaatggaatcatacagtcgatactgcaagcaac ttgaactctggtaaagaggaccactccgaaagcagtaatacagagaacagaagaactagt aatgatgataagcaggaaagctgctctgagaaaataaaattggctgaagagggatcagat gaagatctggatttggttcaacatcagataatctctgagtgttcagatgaacccaaatta aaagaattagattctcaacttcaagatgctattcagaagatgaaaaaacttgataaaata ttggcaaagaaacaacgcagagaaaaagaaattaagaagcaaggtctagaaatgagaata aagctgtgggaagaaattaagtctgcaaaatatagtgaagcttggcaaagtaaagaggag atggaaaatacaaaaaaatttttatctttgactgctgtttctgaagaaactgttggtcct tctcatgaggaggaagacaccttttcctcagtgtttcatactcaaatccctccagaagaa tatgaaatgcagatgcagaaactcaataaagattttacctgtgatgtggaaagaaatgag tcattgatcaaatcaggaaagaaacctttctcgaatacagaaaagattgagctcaggggt aaacacaaccaggattttattaagagaaacattgagttggccaaggaatcaagaaaccca gtggttatggttgacagagagaagaaaaggctggttgagcttttgaaggacttggatgag aaagattccgggctctccagttctgaggacttctacctgcatctcagcggccaagctgta tcgcgtagccatttgttcagtatgttgctgcaagtgagaagggggattcagggtgatcag tctggctgggtggtcccagtaaaaggatatgaacttgcagtcacccagcatcagcagctt gctgaaattgatataaaactccaagaactctctgcagcctcccctacaatttccagtttt tctccaagacttgaaaatcggaataatcagaaacctgaccgtgatggtgaaagaaatatg gaagtaactccaggagaaaagatacttaggaacaccaaagagcaacgcgatctgcataat cggctgagagagattgatgaaaagctgaaaatgatgaaggaaaatgtggcccactgcagc agcactgacacagagcctacaggagctcaggccccctaa >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_2|171_aa MSPPPPSHWPCPLSLTVHLHAKRPDAAPPRPLLGAPPRDRAPPSSARPLTFPPGLLEELA AVSQGADGRRSAPQTLAAPATYPQRRGVSPATSASLVVEEAAGPDPSCLWDRCFPGNQMR DASGLAIAERAVPSGNGTGAGGCWSPASWDGRRVSFRLRCRRWGLGAIFRL >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_2|516_bp atgagcccgcccccgccctctcattggccctgtccgctctccctgacagtccatctacat gccaagcggcccgacgcagctccgccccgcccactcctcggcgcaccgccccgcgaccga gctccgccctcatctgcgcggcccctcaccttcccgccgggcctcctcgaggaactggcc gccgtcagtcagggcgcagatggccgcagaagcgcgccgcagacactcgccgcgccagcc acttacccccagaggagaggggtctctcccgccacctcagcctctctggtagtggaagaa gccgccggtccagatccctcctgcctgtgggaccgctgtttccctggcaaccagatgcgc gatgcttccggtctcgcgatagcggaaagggctgtaccatccggaaatgggaccggggcc ggcggctgttggagccctgcgagctgggacggccggcgtgtgtccttccgtcttcgctgc cgacggtgggggttgggagcgatttttcgtctttag >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_3|562_aa MIVRVQSKEVKWAYGYLVGLWIFGSIAQIRGNFMVLWSTCRTTVFKSVTNRFIKNLACSG ICASLVCVPFDIILSTSPHCCWWIYTMLFCKVVKFLHKVFCSVTILSFPAIALDRYYSVL YPLERKISDAKSRELVMYIWAHAVVASVPVFAVTNVADIYATSTCTEVWSNSLGHLVYVL VYNITTVIVPVVVVFLFLILIRRALSASQKKKVIIAALRTPQNTISIPYASQREAELHAT LLSMVMVFILCSVPYATLVVYQTVLNVPDTSVFLLLTAVWLPKVSLLANPVLFLTVNKSV RKCLIGTLVQLHHRYSRRNVVSTGSGMAEASLEPSIRSGSQLLEMFHIGQQQIFKPTEDE EESEAKYIGSADFQAKEIFSTCLEGEQGPQFAPSAPPLSTVDSVSQVAPAAPVEPETFPD KYSLQFGFGPFELPPQWLSETRNSKKRLLPPLGNTPEELIQTKVPKLDICFRDGLADRPS EFGGVRKELERDDRVNREGILHTHHVAWMCLRFLKMELQKTQYRHLVLSDLYPRAVLRVS PAIADTEVSGEIKNKYTTNAIY >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_3|1689_bp atgattgtgagagtccaaagtaaagaagtcaagtgggcatatggatatctggttggctta tggatatttgggtctatagctcagataagaggaaacttcatggtgttatggtcaacttgc cgcacaaccgtgttcaaatctgtcaccaacaggttcattaaaaacctggcctgctcgggg atttgtgccagcctggtctgtgtgcccttcgacatcatcctcagcaccagtcctcactgt tgctggtggatctacaccatgctcttctgcaaggtcgtcaaatttttgcacaaagtattc tgctctgtgaccatcctcagcttccctgctattgctttggacaggtactactcagtcctc tatccactggagaggaaaatatctgatgccaagtcccgtgaactggtgatgtacatctgg gcccatgcagtggtggccagtgtccctgtgtttgcagtaaccaatgtggctgacatctat gccacgtccacctgcacggaagtctggagcaactccttgggccacctggtgtacgttctg gtgtataacatcaccacggtcattgtgcctgtggtggtggtgttcctcttcttgatactg atccgacgggccctgagtgccagccagaagaagaaggtcatcatagcagcgctccggacc ccacagaacaccatctctattccctatgcctcccagcgggaggccgagctgcacgccacc ctgctctccatggtgatggtcttcatcttgtgtagcgtgccctatgccaccctggtcgtc taccagactgtgctcaatgtccctgacacttccgtcttcttgctgctcactgctgtttgg ctgcccaaagtctccctgctggcaaaccctgttctctttcttactgtgaacaaatctgtc cgcaagtgcttgatagggaccctggtgcaactacaccaccggtacagtcgccgtaatgtg gtcagtacagggagtggcatggctgaggccagcctggaacccagcatacgctcgggtagc cagctcctggagatgttccacattgggcagcagcagatctttaagcccacagaggatgag gaagagagtgaggccaagtacattggctcagctgacttccaggccaaggagatatttagc acctgcctggagggagagcaggggccacagtttgcgccctctgccccacccctgagcaca gtggactctgtatcccaggtggcaccggcagcccctgtggaacctgaaacattccctgat aagtattccctgcagtttggctttgggccttttgagttgcctcctcagtggctctcagag acccgaaacagcaagaagcggctgcttccccccttgggcaacaccccagaagagctgatc cagacaaaggtgcccaagctggatatctgcttcagggatgggcttgctgatcggccctct gaatttggaggtgtgagaaaggaattggagagagatgacagagtaaacagagagggcatc ctgcacacccatcatgtggcatggatgtgtttgcgatttttgaaaatggaactgcagaaa acacagtatagacatttggtgctctcggacttatatccccgtgctgttttgagggtgtca cctgcaatagctgacacagaagtatctggggaaattaaaaacaaatacacaacaaatgca atttattaa >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_4|147_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTASIKLNGQKLEAFPSK TGTKRGCPLSPLLFNIVLEVLARAIRQEKEIKGILSGKEEVKLSLFADDMIVYLENPNIS AQNLLKLISNFSKVSGYKINMQKSQAL >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_4|444_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcta aaaactctgaataaactaggtattgatggaacatatctcaaaataataagagctatttat gacaaacccacagccagtatcaaactgaatgggcaaaaactggaagcattcccttcaaaa actggcacaaaacggggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaggagaaagaaataaagggtattctatcaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccaacatctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaac atgcaaaaatcacaagcattgtga >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_5|121_aa MVLTTVTGSAKRTPEKRSRNLLSICSEDLMGAPNPVCPHRLLKKKNRSGENLGQEPIGYI DPGPISAHTFPGFATLQSTRSNKASTVQGFEESNLGSPTLIVNHLNVPSTSSLLTSNQKR Y >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_5|366_bp atggttctcaccacagtcacaggatcagccaagaggacccctgaaaagagatcaagaaat cttctttcaatctgttccgaggacctaatgggagcccccaaccccgtctgcccccaccgc ctactgaaaaaaaaaaatagatcaggtgagaatttggggcaagaacctataggctacata gatccaggccctataagtgcccatacttttcctgggtttgctaccttacagagcacacgt tctaacaaggcctcaacagtacaaggctttgaagaatcaaacctgggcagcccaacactg atcgtgaaccacttaaatgtcccctcaacctcttccctcctgacatccaaccagaaaagg tactga >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_6|318_aa MGHNGSWISPNASEPHNASGAEAAGVNRSALGEFGEAQLYRQFTTTVQVVIFIGSLLDSQ SKYQENWAPLWCFDLKKEGLTLMQCDLILITTSAKIVSKKGRIRSWPGERTPLQIGGWQR ASFISASSQLPSTPNNPYAKVAYFGMAYSATLQHTPLEEDLGFSVKNFESFRLKIIVPGL LIGDCTSKNLSLNNQTLRVEPGEVNSHSKGKGRYQYRESCKNQNVSTHRRAKSIRLFMTR SSVSLKSSSASRSKLASPIHTRCLWEKSAAMCSDSPVKRFTWQELRHLANSHVGEQSILQ PVNLQMTAAMPHSLTIVS >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_6|957_bp atgggacataacgggagctggatctctccaaatgccagcgagccgcacaacgcgtccggc gccgaggctgcgggtgtgaaccgcagcgcgctcggggagttcggcgaggcgcagctgtac cgccagttcaccaccaccgtgcaggtcgtcatcttcataggctcgctgctcgattcacag tctaagtaccaagagaactgggctcctctctggtgctttgacttgaaaaaggaagggctc accctaatgcaatgcgacctcatcttgattactacatctgcaaagattgtttctaaaaaa ggtcgtattcgcagctggccaggggagaggactcctttgcagataggaggatggcagaga gcttcttttatatctgcatcttcccagttgccttcaactccaaataatccttatgccaaa gtggcatattttgggatggcatattcagctacccttcaacatactcctctggaagaggat ttaggattctcagtaaagaactttgaatctttccgcttgaaaatcattgtgccaggctta ctaataggtgattgtaccagtaaaaatttaagtttaaacaaccaaacactgagagtggaa ccaggagaagtgaattctcactccaagggcaaaggtaggtatcagtatcgggaaagttgt aaaaaccagaatgtaagtactcacagaagagcaaagtcaatcaggctcttcatgactcgg agtagcgtcagcctaaaatcctcatcggcaagcaggtctaagttagcttctcctatccat actaggtgtctctgggagaagtcagctgccatgtgttcagacagccctgtgaaaaggttc acctggcaggaactgaggcatcttgccaacagccatgtgggtgaacagagcatcctccag cccgttaaccttcagatgactgcagccatgccccatagcttgaccatagtctcctga >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_7|518_aa MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPVKEPPEINLVLY PQGLTGEEVYVKVDLRVKCPPTYPDVVPEIELKNAKGLSNESVNLLKSRLEELAKKHCGE VMIFELAYHVQSFLSEHNKPPPKSFHEEMLERRAQEEQQRLLEAKRKEEQEQREILHEIQ RRKEEIKEEKKRKEMAKQERLEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHR ANSSGRSRRERQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGKLVYNAL ETATGGFVLLYEWVLQWQKKMGPFLTSQEKEKIDKCKKQIQGTETEFNSLVKLSHPNVVR YLAMNLKEQDDSIVVDILVEHISGVSLAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNS VVHKVLSASNVLVDAEGTVKITDYSISKRLADICKEDVFEQTRVRFSDNALPYKTGKKGD VWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKK >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_7|1557_bp atggctgggggccgtggggcccccgggcgcggccgggacgagcctccggagagctacccg caacgacaggaccacgagctacaggccctggaggccatttacggcgcggacttccaagac ctgcggccggacgcttgcggaccggtcaaagagccccctgaaatcaatttagttttgtac cctcaaggcctaactggtgaagaagtatatgtaaaagtggatttgagggttaaatgccca cctacctatccagatgtagttcctgaaatagagttaaaaaatgccaaaggtctatcaaat gaaagtgtcaatttgttaaaatctcgcctagaagaactggccaagaaacactgtggggag gtgatgatctttgaactggcttaccacgtgcagtcatttctcagcgagcataacaagccc cctcccaagtcttttcatgaagaaatgctggaaaggcgggctcaggaggagcagcagagg ctgttggaggccaagcggaaagaagagcaggagcaacgtgaaatcctgcatgagattcag agaaggaaagaagagataaaagaagagaaaaaaaggaaagaaatggctaagcaggaacgt ttggaaattgctagtttgtcaaaccaagatcatacctctaagaaggacccaggaggacac agaacggctgccattctacatggaggctctcctgactttgtaggaaatggtaaacatcgg gcaaactcctcaggaaggtctaggcgagaacgtcagtattctgtatgtaatagtgaagat tctcctggctcttgtgaaattctgtatttcaatatggggagtcctgatcagctcatggtg cacaaagggaaatgtattggcagtgatgaacaacttggaaaattagtctacaatgctttg gaaacagccactggtggctttgtcttgttgtatgagtgggtccttcagtggcagaaaaaa atgggtccattccttaccagtcaagaaaaagagaagattgataagtgcaaaaagcagatt caaggaacagaaacagaattcaactcactggtaaaattgagccatccaaatgtagtacgc taccttgcaatgaatctcaaagagcaagacgactccatcgtggtggacattttagtggag cacattagtggggtctctcttgctgcacacctgagccactcaggccccatccctgtgcat cagcttcgcaggtacacagctcagctcctgtcaggccttgattatctgcacagcaattct gtggtgcataaggtcctgagtgcatctaatgtcttggtggatgcagaaggcaccgtcaag attacggactatagcatttctaagcgcctcgcagacatttgcaaggaggatgtgtttgag caaacccgagttcgttttagtgacaatgctctgccttataaaacggggaagaaaggagat gtttggcgtcttggccttctgctgctgtccctcagccaaggacaggaatgtggagagtac cctgtgaccatccctagtgacttaccagctgactttcaagattttctaaagaagtga >gi568815583r:39701135_40020026|GENSCAN_predicted_peptide_8|859_aa MPLVEQSPEDSEGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGAFGAV IKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHERPAG PGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSASARFP ATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDIIFDNEDENSKSQNQDEDCNEKNGCH ESEPSVTTEAVHYLYIQMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIHEKGM IHRDLKPVNIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTGMVGT ALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSPKFPE DFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESELHEVLHHTLTNVDG KAYRTMMAQIFSQRISPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIRIFKRHGAVQLC TPLLLPRNRQIYEHNEAALFMDHSGMLVMLPFDLRIPFARYVARNNILNLKRYCIERVFR PRKLDRFHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQVPFHILTFQDSP LYFFTKCGGENHSYRPASQSVLQMNHIFTKLCRLYKFIEQKGDLQDLMPTINSLIKQKTG IAQLVKYGLKDLEEVVGLLKKLGIKLQVLINLGLVYKVQQHNGIIFQFVAFIKRRQRAVP EILAAGGRYDLLIPQFRGPQALGPVPTAIGVSIAIDKISAAVLNMEESLDDLRPGVPNPQ AVDWYPSRACYEPGRTAGX >gi568815583r:39701135_40020026|GENSCAN_predicted_CDS_8|2577_bp atgcctctagtggaacaaagtcctgaagattctgaaggacaagattatgttgagactgtt attcctagcaaccggctacccagtgctgccttctttagtgagacacagagacagttttcc cgatacttcattgagtttgaagaattacaacttcttggtaaaggagcttttggagctgtc atcaaggtgcagaacaagttggacggctgctgctacgcagtgaagcgcatccccatcaac ccggccagccggcagttccgcaggatcaagggcgaagtgacactgctgtcacggctgcac catgagaacattgtgcgctactacaacgcctggatcgagcggcacgagcggccggcggga ccggggacgccgcccccggactccgggcccctggccaaggatgaccgagctgcacgcggg cagccggcgagcgacacagacggcctggacagcgtagaggccgccgcgccgccacccatc ctcagcagctcggtggagtggagcacttcgggcgagcgctcggccagtgcccgtttcccc gccaccggcccgggctccagcgatgacgaggacgacgacgaggacgagcacggtggcgtc ttctcccagtccttcctgcctgcttcagattctgaaagtgatattatctttgacaatgaa gatgagaacagtaaaagtcagaatcaggatgaagattgcaatgaaaagaatggctgccat gaaagtgagccatcagtgacgactgaggctgtgcactacctatacatccagatggagtac tgtgagaagagcactttacgagacaccattgaccagggactgtatcgagacaccgtcaga ctctggaggctttttcgagagattctggatggattagcttatatccatgagaaaggaatg attcaccgggatttgaagcctgtcaacatttttttggattctgatgaccatgtgaaaata ggtgattttggtttggcgacagaccatctagccttttctgctgacagcaaacaagacgat cagacaggagacttgattaagtcagacccttcaggtcacttaactgggatggttggcact gctctctatgtaagcccagaggtccaaggaagcaccaaatctgcatacaaccagaaagtg gatctcttcagcctgggaattatcttctttgagatgtcctatcaccccatggtcacggct tcagaaaggatctttgttctcaaccaactcagagatcccacttcgcctaagtttccagaa gactttgacgatggagagcatgcaaagcagaaatcagtcatctcctggctgttgaaccac gatccagcaaaacggcccacagccacagaactgctcaagagtgagctgctgcccccaccc cagatggaggagtcagagctgcatgaagtgctgcaccacacgctgaccaacgtggatggg aaggcctaccgcaccatgatggcccagatcttctcgcagcgcatctcccctgccatcgat tacacctatgacagcgacatactgaagggcaacttctcaatccgtacagccaagatgcag cagcatgtgtgtgaaaccatcatccgcatctttaaaagacatggagctgttcagttgtgt actccactactgcttccccgaaacagacaaatatatgagcacaacgaagctgccctattc atggaccacagcgggatgctggtgatgcttccttttgacctgcggatcccttttgcaaga tatgtggcaagaaataatatattgaatttaaaacgatactgcatagaacgtgtgttcagg ccgcgcaagttagatcgatttcatcccaaagaacttctggagtgtgcatttgatattgtc acttctaccaccaacagctttctgcccactgctgaaattatctacactatctatgaaatc atccaagagtttccagcacttcaggttcctttccatattttaacattccaagattcccca ctatatttcttcaccaagtgtggtggggaaaaccacagctacagacctgcttctcagtct gtcctgcagatgaatcacatctttactaaactgtgtcgactctacaagtttattgaacag aagggagatttgcaagatcttatgccaacaataaattcattaataaaacagaaaacaggt attgcacagttggtgaagtatggcttaaaagacctagaggaggttgttggactgttgaag aaactcggcatcaagttacaggtcttgatcaatttgggcttggtttacaaggtgcagcag cacaatggaatcatcttccagtttgtggctttcatcaaacgaaggcaaagggctgtacct gaaatcctcgcagctggaggcagatatgacctgctgattccccagtttagagggccacaa gctctggggccagttcccactgccattggggtcagcatagctatagacaagatatctgct gctgtcctcaacatggaggaatctctggacgaccttagaccaggggtccctaacccccag gcagtggactggtacccatccagggcctgttacgaaccgggccgcacagcaggagnn