GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:24:25 Sequence gi568815581r:29515735_29717175 : 201441 bp : 47.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1719 1922 204 2 0 77 107 138 0.998 14.00 1.02 Intr + 6603 6785 183 2 0 21 86 158 0.647 8.88 1.03 Intr + 14673 14885 213 2 0 88 78 125 0.998 10.41 1.04 Term + 18384 18572 189 2 0 108 42 171 0.490 11.95 1.05 PlyA + 19578 19583 6 1.05 2.05 PlyA - 20202 20197 6 1.05 2.04 Term - 27202 27023 180 1 0 104 46 170 0.978 12.01 2.03 Intr - 34130 34016 115 1 1 -26 111 76 0.034 -1.05 2.02 Intr - 47352 46831 522 2 0 54 62 516 0.142 37.66 2.01 Init - 51232 50352 881 1 2 89 94 1303 0.975 122.95 2.00 Prom - 52296 52257 40 -12.78 3.00 Prom + 52508 52547 40 -16.15 3.01 Init + 53025 53096 72 2 0 55 94 167 0.994 13.07 3.02 Intr + 53584 53625 42 0 0 101 109 -3 0.722 1.54 3.03 Intr + 55857 55985 129 2 0 117 79 66 0.998 9.49 3.04 Intr + 56408 56963 556 1 1 116 81 307 0.650 25.22 3.05 Term + 57078 57190 113 1 2 84 49 107 0.999 5.12 3.06 PlyA + 57403 57408 6 -1.75 4.20 PlyA - 57759 57754 6 1.05 4.19 Term - 59180 58968 213 2 0 65 43 538 0.999 44.23 4.18 Intr - 59408 59345 64 1 1 138 87 50 0.974 8.62 4.17 Intr - 59736 59554 183 2 0 31 84 141 0.988 6.90 4.16 Intr - 59969 59896 74 2 2 98 60 74 0.895 3.80 4.15 Intr - 60397 60344 54 1 0 62 102 86 0.989 6.58 4.14 Intr - 60716 60486 231 2 0 60 74 221 0.999 15.87 4.13 Intr - 60940 60788 153 1 0 88 78 342 0.609 33.47 4.12 Intr - 61262 61129 134 0 2 71 5 285 0.999 18.86 4.11 Intr - 61513 61402 112 1 1 119 75 186 0.996 20.35 4.10 Intr - 62008 61911 98 2 2 99 94 57 0.997 7.13 4.09 Intr - 62637 62565 73 0 1 93 109 178 0.999 19.38 4.08 Intr - 63045 62997 49 2 1 102 115 60 0.998 8.78 4.07 Intr - 65646 65604 43 1 1 65 109 57 0.997 2.80 4.06 Intr - 66102 66008 95 2 2 77 88 93 0.999 7.81 4.05 Intr - 66410 66193 218 2 2 64 94 247 0.999 20.20 4.04 Intr - 67069 66964 106 0 1 75 42 170 0.999 11.32 4.03 Intr - 67303 67191 113 1 2 126 81 107 0.999 12.98 4.02 Intr - 67882 67749 134 2 2 87 92 97 0.993 10.36 4.01 Init - 73644 73593 52 0 1 49 102 100 0.935 6.83 4.00 Prom - 75197 75158 40 -5.96 5.00 Prom + 75884 75923 40 -6.66 5.01 Init + 77462 78001 540 1 0 58 91 252 0.443 15.39 5.02 Intr + 92008 92143 136 0 1 124 80 142 0.999 17.14 5.03 Intr + 92252 92376 125 0 2 116 100 322 0.922 36.50 5.04 Intr + 92461 92506 46 0 1 119 86 60 0.911 6.88 5.05 Intr + 93117 93260 144 1 0 60 77 175 0.997 13.85 5.06 Intr + 93352 93541 190 2 1 90 90 267 0.963 25.74 5.07 Intr + 93621 93687 67 0 1 126 100 69 0.944 10.81 5.08 Intr + 94951 95032 82 0 1 106 90 103 0.749 11.51 5.09 Intr + 95845 95909 65 2 2 59 84 8 0.355 -4.06 5.10 Intr + 96142 96272 131 0 2 46 53 164 0.580 8.19 5.11 Intr + 96382 96539 158 1 2 33 77 234 0.863 16.45 5.12 Intr + 96668 96868 201 0 0 58 35 324 0.651 23.36 5.13 Intr + 96918 97081 164 1 2 83 96 241 0.993 24.09 5.14 Intr + 97153 97229 77 0 2 80 82 81 0.998 5.01 5.15 Term + 97620 97848 229 0 1 85 43 369 0.995 28.20 5.16 PlyA + 99005 99010 6 1.05 6.23 PlyA - 99329 99324 6 -3.74 6.22 Term - 100441 99998 444 1 0 53 41 721 0.766 59.14 6.21 Intr - 100602 100545 58 2 1 77 100 75 0.999 6.49 6.20 Intr - 101308 100968 341 1 2 62 65 394 0.915 28.77 6.19 Intr - 101885 101766 120 2 0 115 54 178 0.945 17.79 6.18 Intr - 102433 102314 120 1 0 2 82 114 0.743 2.89 6.17 Intr - 103237 103056 182 2 2 73 101 257 0.999 25.09 6.16 Intr - 103455 103326 130 0 1 117 41 131 0.999 11.57 6.15 Intr - 104039 103917 123 2 0 67 58 180 0.983 13.68 6.14 Intr - 105689 105490 200 0 2 99 100 308 0.999 32.17 6.13 Intr - 106209 106059 151 0 1 -26 28 153 0.247 -2.36 6.12 Intr - 107123 106954 170 0 2 7 80 186 0.120 9.37 6.11 Intr - 114669 114575 95 2 2 86 25 89 0.024 2.01 6.10 Intr - 117197 115237 1961 2 2 100 -6 827 0.054 61.83 6.09 Intr - 121068 120234 835 2 1 67 93 493 0.726 39.10 6.08 Intr - 132610 132410 201 0 0 27 113 188 0.387 13.60 6.07 Intr - 135066 134920 147 2 0 85 87 133 0.541 12.25 6.06 Intr - 139873 139827 47 1 2 116 98 70 0.986 8.01 6.05 Intr - 151261 151133 129 1 0 60 119 76 0.997 8.79 6.04 Intr - 151489 151396 94 0 1 85 94 76 0.997 7.87 6.03 Intr - 156395 156201 195 1 0 68 103 64 0.670 4.43 6.02 Intr - 168950 168829 122 2 2 87 80 81 0.823 6.49 6.01 Init - 200870 200562 309 2 0 66 77 124 0.560 6.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 47352 46827 526 2 1 54 48 506 0.816 36.94 S.002 Term - 117197 115107 2091 2 0 100 37 840 0.916 65.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_1|262_aa ELNENQSTPKKEKQEWLSKQKENIQHFQAEEEANLLRRQRQYLELECRRFKRRMLLGRHN LEQDLVREHESMQELEFRHLNTIQKMRCELIRLQHQTELTNQLEYNKRRERELRRKHVME VRQQPKSLKSKELQIKKQFQDTCKIQTRQYKALRNHLLETTPKSEHKAVLKRLKEEQTRK LAILAEQYDHSINEMLSTQALRLDEAQEAECQVLKMQLQQELELLNAYQSKIKMQAEAQH DRELRELEQRVSLRRALLEQKV >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_1|789_bp gagctaaatgaaaaccagagtacccccaaaaaagaaaaacaggagtggctttcaaagcag aaggagaatatacagcatttccaagcagaagaagaagctaaccttcttcgacgtcaaaga caatacctagagctggaatgccgtcgcttcaagagaagaatgttacttgggcgtcataac ttagagcaggaccttgtcagggagcatgaatctatgcaagaactggagttccgccacctc aacacaattcagaagatgcgctgtgagttgatcagattacagcatcaaactgagctcact aaccagctggaatataataagcgaagagaacgagaactaagacgaaagcatgtcatggaa gttcgacaacagcctaagagtttgaagtctaaagaactccaaataaaaaagcagtttcag gatacctgcaaaatccaaaccagacagtacaaagcattaagaaatcacctgctggagact acaccaaagagtgagcacaaagctgttctgaaacggctcaaggaggaacagacccggaaa ttagctatcttggctgagcagtatgatcacagcattaatgaaatgctctccacacaagcc ctgcgtttggatgaagcacaggaagcagagtgccaggttttgaagatgcagctgcagcag gaactggagctgttgaatgcgtatcagagcaaaatcaagatgcaagctgaggcacaacat gatcgagagcttcgcgagcttgaacagagggtctccctccggagggcactcttagaacaa aaggtataa >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_2|565_aa MPPWGAALALILAVLALLGLLGPRLRGPWGRAVGERTLPGAQDRDDGEEADGGGPADQFS DGREPLPGGCSLVCKPSALAQCLLRALRRSEALEAGPRSWFSGPHLQTLCHFVLPVAPGP ELAREYLQLADDGLVALDWVVGPCVRGRRITSAGGLPAVLLVIPNAWGRLTRNVLGLCLL ALERGYYPVIFHRRGHHGCPLVSPRLQPFGDPSDLKEAVTYIRFRHPAAPLFAVSEGSGS ALLLSYLGECGSSSYVTGAACISPVLRCREWFEAGLPWPYERGFLLHQKIALSRYATALE DTVDTSRLFRSRSLREFEEALFCHTKSFPISWDTYWDRNDPLRDVDEAAVPVLCICSADD PVCGPPDHTLTTELFHSNPYFFLLLSRHGGHCGFLRQEPLPAWSHEVILESFRALTEFFR TEERIKGLSRHRASFLGGRRRGGALQRREVSSSSNLEEIFNWKRSYTRKKGGGFFDIVKC KEGQQHIWNGNQKRAVWQYTRTANILKLSASEPGGYCGLPYCYLEVPLALALKGDPMAGV HLALDGPKLVVGHPWDDPSEVLDPQ >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_2|1698_bp atgccgccgtggggcgccgccctcgcgctcatcttggccgtgctcgcccttctcggcctg ctcggcccgcggctccggggaccctgggggcgcgccgtcggagagaggaccctgccgggg gcccaagaccgagacgacggggaggaggcggacggcggaggcccggcggaccagttcagc gacgggcgcgagccactgccgggagggtgcagccttgtttgcaagccgtcggccctggcc cagtgcctgctgcgcgccctgcggcgctcagaggcgctggaggccggcccgcgctcctgg ttctccgggccccacctgcagaccctctgccacttcgtcctgcccgtagcgcctgggcct gagctggcccgggagtacctgcagttggcggacgatgggctagtggccctggactgggtg gtaggaccttgtgttcggggccgccggatcaccagcgccgggggccttcctgcggtgctt ctggtgatccccaatgcgtggggtcgcctcacccgcaacgtgctcggcctttgcttgctc gccctggagcgcggctactacccggtcatcttccatcgccgcggccaccacggttgccca ctggtcagcccccggctgcagcctttcggggacccgtccgacctcaaggaggcggtcaca tacatccgcttccgacacccggcggcgccgctgttcgcggtgagcgaaggctcgggctcg gcgctgctcctgtcctacctgggcgagtgcggctcctccagctacgtgacaggcgccgcc tgcatctcgcccgtgctgcgctgccgagagtggttcgaggccggcctgccctggccctac gagcggggctttctgctccaccagaagatcgccctcagcaggtatgccacagccctggag gacactgtggacaccagcagactgttcaggagccgttcccttcgagagtttgaggaggct ctcttctgccacaccaaaagcttccccatcagctgggatacctactgggaccgcaacgac ccgctccgggatgtcgatgaggcagccgtgcctgtgctgtgtatctgcagtgctgacgac cccgtgtgtggacccccagaccacactctgacaactgaactcttccacagcaacccctac ttcttcctcctgctcagtcgccacggaggccactgtggcttcctgcgccaggagcccttg ccagcctggagccatgaggtcatcttggagtccttccgggccttgactgagttcttccga acggaggagaggattaaagggctgagcaggcacagagcttccttccttgggggccgtcgt cgtgggggagccttgcagaggcgggaagtctcttcctcttccaacctggaggagatcttt aactggaagcgatcatacacaaggaagaagggtggtggtttctttgatattgtgaaatgc aaagaaggacagcagcacatttggaatgggaaccaaaagagagctgtctggcaatacacc aggacagccaacatcttgaagctgtccgcctcagagcctgggggctattgcggactccca tactgctacctcgaggtaccccttgcattggccctgaagggtgaccccatggctggggtc caccttgcattggatggccccaagcttgtggtgggccacccatgggatgaccccagtgag gtcctggacccccagtag >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_3|303_aa MAPPPPSPQLLLLAALARLLGPSEVSPRVTYTRVSPGQAEDVTFLYHPCAHPWLKLQLAL LAYACMANPSLTPDFSLTQDRALALAFALRSWRPPGTEVTSQGPRQPSSSGAKRRRLRAA LGPQPTRSALRFPSASPGSLKAKQSMAGIPGRESNAPSVPTVSLLPGAPGGNASSRTEAQ VPNGQGSPGGCVCSSQASPAPRAAAPPRAARGPTPRTEEAAWAAMALTFLLVLLTLATLC TRLHRNFRRGESIYWGPTADSQDTVAAVLKRRLLQPSRRVKRSRRRPLLPPTPDSGPEGE SSE >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_3|912_bp atggcgcctcctccgccttcgccccaactgcttctcctggcagccctcgcgaggctcctg ggtcccagcgaggtgtcaccaagagtgacctacacacgagtgagcccagggcaggctgag gatgtcaccttcctctaccacccctgtgcccatccctggctgaagctccagcttgccctc ctggcctatgcttgtatggctaacccttccctcacccctgacttcagcctcacgcaggat cgggccctggctctggcctttgctctgcggagctggcggccccctggcacagaggtgaca tctcaagggcccaggcagccctcttctagtggtgccaagaggcggaggctgcgggctgcc cttggtccccagcccactcgctcagccctgaggtttccctctgcttccccagggagcttg aaggccaagcagtccatggcgggaatccctggtagggagagtaatgccccatctgtgccc actgtctccctgctgccgggggcgcctggaggcaatgccagctccaggacagaggctcag gtgcccaacgggcaaggcagcccagggggctgtgtctgttcaagtcaggcttccccggcc cctcgcgcagcagcgcctccacgggcagcccggggccccaccccacgcactgaagaggcc gcctgggctgccatggccctgaccttcctgctggtgctgctcaccctggccacgctctgc acacggctgcacagaaacttccgacgcggggagagcatctactgggggcccacagcggac agccaggacacagtggctgctgtgctgaagcggaggctgctgcagccctcgcgccgggtc aagcgctcgcgccggagacccctcctcccgcccacgccggacagcggcccggaaggcgag agctcggagtga >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_4|732_aa MSRKGPRAEVCADCSAPDPGWASISRGVLVCDECCSVHRSLGRHISIVKHLRHSAWPPTL LQMVHTLASNGANSIWEHSLLDPAQVQSGRRKANPQDKVHPIKSEFIRAKYQMLAFVHKL PCRDDDGVTAKDLSKQLHSSVRTGNLETCLRLLSLGAQANFFHPEKGTTPLHVAAKAGQT LQAELLVVYGADPGSPDVNGRTPIDYARQAGHHELAERLVECQYELTDRLAFYLCGRKPD HKNGHYIIPQMADSLDLSELAKAAKKKLQALSNRLFEELAMDVYDEVDRRENDAVWLATQ NHSTLVTERSAVPFLPVNPEYSATRNQGRQKLARFNAREFATLIIDILSEAKRRQQGKSL SSPTDNLELSLRSQSDLDDQHDYDSVASDEDTDQEPLRSTGATRSNRARSMDSSDLSDGA VTLQEYLELKKALATSEAKVQQLMKVNSSLSDELRRLQREIHKLQAENLQLRQPPGPVPT PPLPSERAEHTPMAPGGSTHRRDRQAFSMYEPGSALKPFGGPPGDELTTRLQPFHSTELE DDAIYSVHVPAGLYRSKLSRHGSGADSDYENTQSGDPLLGLEGKRFLELGKEEDFHPELE SLDGDLDPGLPSTEDVILKTEQVTKNIQELLRAAQEFKHDSFVPCSEKIHLAVTEMASLF PKRPALEPVRSSLRLLNASAYRLQSECRKTVPPEPGAPVDFQLLTQQVIQCAYDIAKAAK QLVTITTREKKQ >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_4|2199_bp atgtcccgaaaggggccgcgagcggaggtgtgtgcggactgcagcgccccggaccctggc tgggcatccatcagcaggggtgtgctggtgtgtgacgagtgctgcagcgtgcaccggagc ctgggacgccacatctccattgtcaagcaccttcgccacagcgcctggcctcccacgctg ctgcagatggtgcacacgcttgccagcaacggggccaactccatctgggagcactccctg ctggaccccgcacaagtgcagagcggccggcgtaaagccaacccccaagacaaagtccac cccatcaagtcagagttcatcagggccaagtaccagatgctggcatttgtgcacaagctt ccctgccgggacgatgatggagtcaccgccaaagacctcagcaagcaactacactcgagc gtgcggacaggcaacctggagacatgtctgcgcctgctctccctgggtgcccaggccaac ttcttccacccagagaagggcaccacacctctgcacgtggctgccaaggcaggacagaca ctgcaggccgagctgcttgtagtgtatggggctgaccctggctcccctgatgttaatggc cgcacacccattgactatgccaggcaggcggggcaccatgagctggcggaaaggctggtt gagtgccaatatgagctcactgaccggctggccttctacctctgtggacgcaagccggat cacaagaatgggcattacatcatcccacagatggctgacagccttgacttatccgaattg gccaaagctgctaagaagaagctgcaggcgctcagcaaccggctttttgaggaactcgcc atggacgtgtatgacgaggtggatcgaagagaaaatgatgcagtgtggctggctacccaa aaccacagcactctggtgacagagcgcagtgccgtgcccttcctgcctgttaacccggaa tactcagccacgcggaatcaggggcgacaaaagctggcccgctttaatgcccgagagttt gccaccttgatcatcgacattctcagtgaggccaagcggagacagcagggcaagagcctg agcagccccacagacaacctcgagctgtctctgcggagccagagtgacctcgacgaccaa cacgactacgacagcgtggcctctgacgaggacacagaccaggagcccctgcgcagcacc ggcgccactcggagcaaccgggcccggagcatggactcctcggacttgtctgacggggct gtgacgctgcaggagtacctggagctgaagaaggccctggctacatcggaggcaaaggtg cagcagctcatgaaggtcaacagtagcctgagcgacgagctccggaggctgcagcgagag atccacaagctgcaggcggagaacctgcagctccggcagcctccagggccggtgcccaca cctccactccccagtgaacgggcggaacacacacccatggcgccaggcgggagcacacac cgcagggatcgccaggccttttccatgtatgaacctggctctgccctgaagccctttggg ggcccccctggggacgagctcactacgcggctgcagcctttccacagcactgagctagag gacgacgccatctattcagtgcacgtccctgctggcctttaccggagcaagctttcccgc cacggcagtggagccgacagtgactatgagaacacgcaaagtggggacccactgctgggg ctggaagggaagaggtttctagagctgggcaaagaggaagacttccacccagagctggaa agcctggatggagacctagatcctgggcttcccagcacagaggatgtcatcttgaagaca gagcaggtcaccaagaacattcaggaactgttgcgggcagcccaggagttcaagcatgac agcttcgtgccctgctcagagaagatccatttggctgtgaccgagatggcctccctcttc ccaaagaggccagccctggagccagtgcggagctcactgcggctgctcaacgccagcgcc taccggctgcagagtgagtgccggaagacagtgcccccagagcccggcgccccagtggac ttccagctgctgactcagcaggtgatccagtgcgcctatgacatcgccaaggctgccaag cagctggtcaccatcaccacccgagagaagaagcagtga >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_5|784_aa MQLWGPGGCGRASLRRRPRGPRPVRPRARCGRLLPPPSGVFVCGGVGGERESAQRRRVPA PSQGGWGPLRAARRRRLARTPVARAARTAPGAPPPPAAARTCPSRSPASRQRRPPAPRPR APAPAPSLLPGRAPRPRHEERAMIPANASARKGPEGKYPLHYLVWHNRHRELEKEVRAGQ VDIEQLDPRGRTPLHLATTLGHLECARVLLAHGADVGRENRSGWTVLQEAVSTRDLELVQ LVLRYRDYQRVVKRLAGIPVLLEKLRKAQDFYVEMKWEFTSWVPLVSKICPSDTYKVWKS GQNLRVDTTLLGFDHMTWQRGNRSFVFRGQDTSAVVMEIDHDRRVVYTETLALAGQDREL LLAAAQPTEEQVLSRLTAPVVTTQLDTKNISFERNKTGILGWRSEKTEMVNGYEAKVYGA SNVELITRTRTEHLSEQHKGKVKGCKTPLQSFLGIAEQHGGPQNGTLITQTLSQANPTAI TAEEYFNPNFELGNRDMGRPMELTTKTQKFKAKLWLCEEHPLSLCEQVAPIIDLMAVSNA LFAKLRDFITLRLPPGFPVKIEIPIFHILNARITFGNLNGCDEPVPSVRGSPSSETPSPG SDSSSVSSSSSTSEAPRENACPSALPGVASCRGCEISPALFEAPRGYSMMGGQREAATRD DDDDLLQFAIQQSLLEAGSEYDQVTIWEALTNSKPGTHPMSYEGRRQDRSAPPTPQRQPA PPASVPSPRPSSGPGSGGHVFRSYDEQLRLAMELSAQEQEERRRRARQEEEELERILRLS LTEQ >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_5|2355_bp atgcagctgtggggacccgggggctgcgggcgcgcgtccctgcggcggcgtccccggggc ccgcgtcccgtgcgcccccgcgcccgctgcgggcgcctgctccctccgccgagcggcgtc tttgtgtgcgggggtgtgggaggcgagcgcgagtccgcgcagcgccgccgagtgcccgct ccctcccagggcgggtgggggcctctccgcgccgcccgccgccgccgcctcgcgaggacg cccgtcgcccgcgccgcccgcaccgcgccgggcgcgccgcccccgcccgccgccgctcgc acatgcccgagccgcagccccgcgagcaggcagcgccggccccccgccccgcggccccgg gccccggctccggcgccgtccctcctccccggccgggcgccgcggccccggcatgaggag cgggcgatgatccccgccaacgcctccgccaggaaggggcccgagggcaagtatccgctg cactacctcgtgtggcacaaccgccaccgcgagctggagaaggaggtccgcgcgggccag gtggacatcgagcagctggatccccgcggccggactcccctgcacctggccaccacgctg gggcaccttgagtgtgcccgtgtgctcctggcgcacggcgcagacgtgggcagggagaat cgcagcggctggacagtgctccaggaggctgtgagtacccgggacctggagctggtgcag ctggtgcttcggtaccgggactaccagcgggtggtgaagcggctggcgggcatccccgtg ctcctggagaagctgcgcaaggcccaggacttctacgtggagatgaaatgggagttcact agctgggtgcccctggtgtccaagatctgccctagtgacacctacaaagtgtggaagagc ggccagaacctgagggtagacaccacactcctgggctttgaccacatgacctggcagcga gggaaccgcagctttgtcttcaggggccaagacacaagcgccgtggtcatggagattgac cacgaccgccgggtggtgtacacagagactctggcactggctgggcaggaccgggagctg ctgctggctgctgctcagcccactgaggaacaggtgctgagccggcttaccgcgcccgtc gtcaccactcagcttgacaccaagaatatctcctttgagaggaacaagactggcatcctg ggctggcgcagtgaaaagacggagatggtgaatgggtatgaagctaaggtgtatggggca tctaacgtggagctcatcacccgcacacggacagaacatctttcagaacagcacaagggc aaggtcaaaggctgtaagacacctttgcagtccttcctgggaatcgctgagcagcacggg ggcccccaaaatgggaccctgatcactcagactctgagccaagccaaccccactgccatc actgcagaagaatacttcaaccccaactttgagctgggcaaccgtgatatgggccgcccc atggaactgaccaccaagacacagaagttcaaggccaagctgtggctgtgtgaggagcat cccctgtccctgtgtgagcaggtggcccccatcattgacctcatggccgtcagcaatgcg ctttttgccaagctccgggacttcatcaccctgcgtctgcctcctggcttcccagttaag attgaaatcccgatcttccacatcctcaacgcccgcatcaccttcgggaacctcaacggc tgcgacgaaccggtgccatcggtgcgaggcagccccagcagcgagacgccttccccaggc agcgactcctccagcgtcagcagctccagctccacgagtgaggccccccgcgagaacgcc tgcccctcggctctccccggggtggcctcctgccgcggctgcgagatctccccagcgttg ttcgaggccccgcgcggctacagcatgatgggcggccagcgggaggcggcgacccgggac gacgacgacgacctgctgcaattcgccatccagcagagcctgcttgaggcgggcagtgag tatgaccaggtcaccatctgggaggcgctaaccaacagcaagccaggcacccaccccatg tcctacgagggtcgccgacaggacaggagcgccccgcccacgccgcagcgccagcctgcg cccccggcgtcagtgcccagccctcggcccagctcagggccaggttccggcggccacgtg ttccggagctacgacgagcagctgcggctggcgatggaactgtcggcgcaggagcaggag gagaggcggcggcgcgcgcgccaggaggaggaggagctggagcgcatcctgaggctctca ctgaccgagcagtag >gi568815581r:29515735_29717175|GENSCAN_predicted_peptide_6|2057_aa MHMRIYKKGDNVDIKGMGTVQKGMPHKCYHGRTEGVYNVNQHAAGIVVHKLRARFLPREL MCVLSTLSTLINVCIEHIKHSKSQDSFLKHVKENDQKKKPKRKAVRLESTYQNRTRYMVV VSTNGRQDTEESIVLGMDFSSNDRSALQSLHKACEVARAHNYYPGSLFLTWVSYYESHIN SDQSSVNEWNAMQDVQSHRPDSPALFTDIPTERERTERLIKTKLREIMMQKDLENITSKE IRTELEMQMVCNLREFKEFIDNEMIVILGQMDSPTQIFEHVFLGSEWNASNLEDLQNRGV RYILNVTREIDNFFPGVFEYHNIRVYDEEATDLLAYWNDTYKFISKAKKHGSKCLVHCKM GVSRSASTVIAYAMKEYGWNLDRAYDYVKERRTVTKPNPSFMRQLEEYQGILLASKQRHN KLWRSHSDSDLSDHHEPICKPGLELNKKDITTSADQIAEVKTMESHPPIPPVFVEHMVPQ DANQKGLCTKERMICLEFTSREFHAGQIEDELNLNDINGCSSGCCLNESKFPLDNCHASK ALIQPGHVPEMANKFPDLTVEDLETDALKADMNVHLLPMEELTSPLKDPPMSPDPESPSP QPSCQTEISDFSTDRIDFFSALEKFVELSQETRSRSFSHSRMEELGGGRNESCRLSVVEV APSKVTADDQRSSSLSNTPHASEESSMDEEQSKAISELVSPDIFMQSHSENAISVKEIVT EIESISQGVGQIQLKGDILPNPCHTPKKNSIHELLLERAQTPENKPGHMEQDEDSCTAQP ELAKDSGMCNPEGCLTTHSSIADLEEGEPAEGEQELQGSGMHPGAKWYPGSVRRATLEFE ERLRQEQEHHGAAPTCTSLSTRKNSKNDSSVADLAPKGKSDEAPPEHSFVLKEPEMSKGK GKYSGSEAGSLSHSEQNATVPAPRVLEFDHLPDPQEGPGSDTGTQQEGVLKDLRTVIPYQ ESETQAVPLPLPKRVEIIEYTHIVTSPNHTGPGSEIATSEKSGEQGLRKVNMEKSVTVLC TLDENLNRTLDPNQVSLHPQVLPLPHSSSPEHNRPTDHPTSILSSPEDRGSSLSTALETA APFVSHTTHLLSASLDYLHPQTMVHLEGFTEQSSTTDEPSAEQVSWEESQESPLSSGSEV PYKDSQLSSADLSLISKLGDNTGELQEKMDPLPVACRLPHSSSSENIKSLSHSPGVVKER AKEIESRVVFQAGLTKPSQMRRSASLAKLGYLDLCKDCLPEREPASCESPHLKLLQPFLR TDSGMHAMEDQESLENPGAPHNPEPTKSFVEQLTTTECIVQSKPVERPLVQYAKEFGSSQ QYLLPRAGLELTSSEGGLPVLQTQGLQPEASNGLKVSQKSEGSQPSSSRSPLEFLKEAES RRIGQSAELDTRVPDTTDTRRGGSFALKPSSAPYALGPLPQRIRSVSAPQPAQDQMRVRY PVVAAVLAPYLALSQDPMVKSSASGQGASGSYNHVREEMLIKAGGAMSRRVVRQSKFRHV FGQAAKADQAYEDIRVSKVTWDSSFCAVNPKFLAIIVEAGGGGAFIVLPLAKTGRVDKNY PLVTGHTAPVLDIDWCPHNDNVIASASDDTTIMVWQIPDYTPMRNITEPIITLEGHSKRV GILSWHPTARNVLLSAGGDNVIIIWNVGTGEVLLSLDDMHPDVIHSVCWNSNGSLLATTC KDKTLRIIDPRKGQVVAEQARPHEGARPLRAVFTADGKLLSTGFSRMSERQLALWDPERF AAHEGMRPMRAVFTRQGHIFTTGFTRMSQRELGLWDPNNFEEPVALQEMDTSNGVLLPFY DPDSSIVYLCGKVLTAGQGEQGTGWRCGGPCPGAPLNRLILQGDSSIRYFEITDEPPFVH YLNTFSSKEPQRGMGFMPKRGLDVSKCEIARFYKLHERKCEPIIMTVPRKSDLFQDDLYP DTPGPEPALEADEWLSGQDAEPVLISLRDGYVPPKHRELRVTKRNILDVRPPSGPRRSQS ASDAPLSVRSALLHSGPIYISPNRPYTTRPLCPPPQQQHTLETLLEEIKALRERVQAQEQ RITALENMLCELVDGTD >gi568815581r:29515735_29717175|GENSCAN_predicted_CDS_6|6174_bp atgcatatgcgaatctataagaaaggtgataatgtagacatcaagggaatgggtactgtt caaaaaggaatgccccacaagtgttaccatggcagaactgaaggagtctacaatgttaac cagcatgctgctggcattgttgtacacaagttgagggcaagattcttgccaagagaatta atgtgtgtattgagcacattaagcactctaattaatgtgtgtattgagcacataaagcac tctaagagccaagacagcttcctgaagcacgtgaaagaaaatgatcagaaaaagaagcca aagagaaaggctgtaagactggaaagtacttaccagaatcgaacacgctatatggtagtg gtttcaactaatggtagacaagacactgaagaaagcatcgtcctaggaatggatttctcc tctaatgacaggtctgcactacagagcttacacaaggcttgtgaagtcgccagagcgcat aactactacccaggcagcctatttctcacttgggtgagttattatgagagccatatcaac tcagatcaatcctcagtcaatgaatggaatgcaatgcaagatgtacagtcccaccggccc gactctccagctctcttcaccgacatacctactgaacgtgaacgaacagaaaggctaatt aaaaccaaattaagggagatcatgatgcagaaggatttggagaatattacatccaaagag ataagaacagagttggaaatgcaaatggtgtgcaacttgcgggaattcaaggaatttata gacaatgaaatgatagtgatccttggtcaaatggatagccctacacagatatttgagcat gtgttcctgggctcagaatggaatgcctccaacttagaggacttacagaaccgaggggta cggtatatcttgaatgtcactcgagagatagataacttcttcccaggagtctttgagtat cataacattcgggtatatgatgaagaggcaacggatctcctggcgtactggaatgacact tacaaattcatctctaaagcaaagaaacatggatctaaatgccttgtgcactgcaaaatg ggggtgagtcgctcagcctccaccgtgattgcctatgcaatgaaggaatatggctggaat ctggaccgagcctatgactatgtgaaagaaagacgaacggtaaccaagcccaacccaagc ttcatgagacaactggaagagtatcaggggatcttgctggcaagcaaacagcggcataac aaactatggagatctcattcagatagtgacctctcagaccaccacgaacccatctgcaaa cctgggctagaactcaacaagaaggatatcaccacctcagcagaccagattgctgaggtg aagaccatggagagtcacccacccatacctcctgtctttgtggaacatatggtcccacaa gatgcaaatcagaaaggcctgtgtaccaaagaaagaatgatctgcttggagtttacttct agggaatttcatgctggacagattgaggatgaattaaacttaaatgacatcaatggatgc tcatcagggtgttgtctgaatgaatcaaaatttcctcttgacaattgccatgcatccaaa gccttaattcagcctggacatgtcccagaaatggccaacaagtttccagacttaacagtg gaagatttggagacagatgcactgaaagcagacatgaatgtccacctactgcctatggaa gaattgacatctccactgaaagacccccccatgtcccctgatcctgagtcaccaagcccc caacccagttgccagactgaaatctcagatttcagtacagatcgcattgacttttttagt gccctagagaagtttgtggagctctcccaagaaacccggtcacgatctttttcccattca aggatggaggaactgggtggaggaaggaatgagagctgtcgactgtcagtggtagaagta gccccttccaaagtgacagctgatgaccagagaagcagctctttgagtaatactccccat gcatcagaagaatcttcaatggatgaggaacagtcaaaggcaatttcagaactggtcagc ccagacatcttcatgcagtctcactcggaaaatgcaatttcagtcaaagaaattgtcact gaaattgagtccatcagtcaaggagttgggcagattcaactgaaaggagacatcttaccc aacccatgccatacaccaaagaagaacagcatccatgagctgctccttgagagggcccag actccagagaacaaacctggacatatggagcaagatgaggactcctgcacagcccagcct gaactagccaaagactcagggatgtgcaacccagaaggctgcctaaccacacactcatct atagcagacttggaagaaggggaaccagctgagggggaacaagagctccagggctcaggg atgcacccaggtgccaagtggtaccctgggtctgtgaggcgagccaccttggagttcgaa gagcgcttacggcaggagcaagagcatcatggtgctgccccaacatgtacctcattgtcc actcgtaagaattcaaagaatgattcttctgtggcagacctagcaccaaaagggaaaagt gatgaagcccccccagaacattcatttgtcctcaaggaaccagaaatgagcaaaggcaaa gggaaatacagtgggtctgaggctggctcactgtcccattctgagcagaatgccactgtt ccagctcccagggtgctggagtttgaccacttgccagatcctcaggagggcccagggtca gatactggaacacagcaggaaggagtcctgaaggatctgaggactgtgattccataccag gagtctgaaacacaagcagtccctcttccccttcccaagagggtagaaatcattgaatat acccacatagttacatcacccaatcacactgggccagggagtgaaatagccaccagtgag aagagcggagagcaagggctgaggaaagtgaacatggaaaaatctgtcactgtgctctgc acactggatgaaaatctaaacaggactctggaccccaaccaggtttctctgcacccccaa gtgctacctctgcctcattcttcctcccctgagcacaacagacccactgaccatccaacc tccatcctgagtagccctgaagacagaggcagcagcctgtccacagccctggagacagca gcaccttttgtcagtcatacaacccatttactgtctgccagtttggattacctgcatccc cagactatggttcacctggagggcttcacagagcagagcagcactacagatgagccctct gcagaacaggttagctgggaagaaagtcaggagagccctctctccagtggcagtgaggtg ccatataaggactcccagctaagtagcgcagacctaagtttaattagcaaacttggtgac aacactggggagttacaggagaaaatggacccattgcctgtagcctgtcgactcccacat agctctagtagtgaaaacataaagagtctcagccacagccccggtgtggtgaaggagcgt gctaaagaaatcgagtctcgagtggttttccaggcagggctcaccaaaccatcccaaatg aggcgctcagcttctctcgccaaattaggttacttggacctctgtaaagactgcttacca gagagggagcctgcctcctgtgaatcccctcatctcaaactgcttcagcctttcctcaga acagactcaggcatgcacgcgatggaggaccaagagtccctagaaaacccaggtgccccc cacaacccagagcccaccaagtcttttgtagaacaactcacaacaacagagtgtattgtg cagagcaagccagtggagaggccccttgtgcagtatgccaaagaatttggttctagtcag cagtatttgctccccagggcaggacttgaattgactagttctgaaggaggccttcccgtg ctacagacccagggactgcaacctgaggcatcgaatgggctaaaggtcagccagaagtca gagggctctcagccttcatccagccgcagccctttagagttcctgaaggaggcagagagc cgcaggatcggccagagtgcggagctggacacccgggtcccagatactacagacacccgg agaggtggctccttcgccctgaagccttcctcggccccctacgcactcgggccccttccg cagaggattcgcagcgtgagcgccccgcagcccgctcaggaccagatgcgagttcggtat cctgtggtggctgcagtcttggccccatacctggctttaagccaagatccaatggtcaag tcttctgcttctggacagggtgcctctgggagctacaaccacgtccgtgaagagatgctc atcaaggctggcggtgctatgagcagacgtgtggttcggcaaagcaagttccgccatgtg tttgggcaggcagcaaaggccgaccaggcctacgaggacatccgtgtgtccaaggtcaca tgggacagctccttctgtgccgtcaaccccaaattcctggccattattgtggaggctgga ggcgggggtgccttcatcgtcctgcctctggccaagacagggcgagtggataagaactac ccactggtcactgggcacactgcccctgtgctggatattgactggtgtccacacaatgac aacgttatcgccagtgcctcagacgacaccaccatcatggtgtggcagattccagactat acccccatgcgcaacattacggaacctatcatcacacttgagggccactccaagcgtgtg ggcatcctctcctggcaccctactgccaggaatgtcctgctcagtgcaggtggtgacaat gtgatcatcatctggaatgtgggcaccggggaggtgctgctgagcctggatgatatgcac ccagacgtcatccacagtgtgtgctggaacagcaacggtagcctgctagccaccacctgc aaggacaagaccttgcgcatcattgaccccagaaaaggccaagtggtggcggagcaagcc cggcctcacgagggcgcccgcccgctgcgggctgtcttcaccgcagacgggaagctgctc agcaccggcttcagcaggatgagtgagcggcaactcgcgctctgggacccggagaggttt gcggcccacgaggggatgaggcccatgcgggccgtcttcacgcgccagggccatatcttc accacgggcttcacccgcatgagccagcgagagctgggcctgtgggacccgaacaacttc gaggagccagtggcactgcaggagatggacacaagcaacggggtcctattgcccttttac gatcccgactccagcatcgtctacctgtgtggcaaggtgctcacggccgggcagggagaa cagggcactggatggagatgtggagggccttgtccgggcgcgcccctgaaccgactgatt ttgcagggcgacagcagcattcggtactttgagattaccgacgagccgcctttcgtgcac tacctgaacacgttcagcagcaaagagccgcagcggggcatgggtttcatgcccaaaagg ggactggatgtcagcaagtgtgagatcgcccggttctacaagctacacgaaagaaagtgt gaacctatcatcatgactgtgccccgcaagtcagacctcttccaggacgatctgtacccg gatacgccaggcccggagccggccctagaagcggacgaatggctatccggccaggacgcc gaacccgtgctcatttcgctgagggacggctatgtgccccccaagcaccgcgagctccgg gtcacgaagcgcaacatcctggacgtgcgcccgccctccggcccccgccgcagccagtcg gccagcgacgcccccttgtcggtaagatcggccctgctgcactctggcccaatctacatc tcccccaaccgcccctatacgacacggcctctctgtcctccgccccagcagcagcacacc ctggagacgctgctggaagagatcaaggccctccgcgagcgggtgcaggcccaggagcag cgcatcacggctctggagaacatgctgtgcgagctggtggacggcacggactag