GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:21:54 Sequence gi568815595r:48651588_48947596 : 296009 bp : 46.92% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 Intr - 461 290 172 2 1 114 99 83 0.996 11.52 1.13 Intr - 966 850 117 0 0 128 75 74 0.855 10.76 1.12 Intr - 1600 1415 186 1 0 95 41 86 0.840 4.49 1.11 Intr - 2201 2032 170 0 2 105 89 67 0.984 8.17 1.10 Intr - 2416 2291 126 2 0 97 83 187 0.991 19.75 1.09 Intr - 2865 2702 164 2 2 68 96 280 0.917 26.42 1.08 Intr - 3614 3457 158 2 2 88 53 178 0.967 13.11 1.07 Intr - 3807 3719 89 1 2 125 81 74 0.887 10.09 1.06 Intr - 4264 4149 116 0 2 87 63 208 0.951 18.29 1.05 Intr - 4778 4553 226 0 1 81 49 418 0.998 34.24 1.04 Intr - 5761 5111 651 2 0 110 105 1201 0.999 115.88 1.03 Intr - 10958 7300 3659 1 2 122 94 4991 0.005 490.50 1.02 Intr - 11686 11513 174 0 0 108 90 12 0.002 2.45 1.01 Init - 20961 20837 125 0 2 66 55 101 0.017 4.24 1.00 Prom - 21174 21135 40 -5.26 2.38 PlyA - 22279 22274 6 1.05 2.37 Term - 22457 22323 135 2 0 65 48 100 0.508 1.72 2.36 Intr - 23160 22981 180 0 0 101 13 307 0.515 24.56 2.35 Intr - 27149 26977 173 0 2 81 46 276 0.869 22.36 2.34 Intr - 27382 27266 117 2 0 97 48 164 0.573 13.74 2.33 Intr - 27596 27468 129 0 0 73 75 196 0.999 17.47 2.32 Intr - 27870 27790 81 1 0 131 95 74 0.788 12.01 2.31 Intr - 28126 27988 139 1 1 31 72 220 0.968 14.74 2.30 Intr - 28300 28214 87 1 0 94 80 84 0.992 8.37 2.29 Intr - 28642 28472 171 1 0 119 95 242 0.978 28.14 2.28 Intr - 30193 29700 494 2 2 106 100 494 0.946 45.12 2.27 Intr - 30589 30458 132 2 0 37 83 89 0.878 3.92 2.26 Intr - 30965 30730 236 1 2 69 90 158 0.924 11.43 2.25 Intr - 31425 31316 110 0 2 136 81 202 0.997 23.38 2.24 Intr - 34258 34050 209 2 2 47 109 342 0.310 31.00 2.23 Intr - 37186 36706 481 1 1 154 50 846 0.293 80.01 2.22 Intr - 38126 37951 176 0 2 34 94 192 0.621 14.06 2.21 Intr - 38581 38536 46 1 1 90 2 24 0.319 -8.02 2.20 Intr - 39895 39685 211 0 1 105 90 89 0.797 9.72 2.19 Intr - 41592 41367 226 1 1 110 105 157 0.813 16.64 2.18 Intr - 41751 41683 69 1 0 76 86 37 0.725 1.45 2.17 Intr - 43834 43503 332 0 2 72 115 293 0.449 25.67 2.16 Intr - 71661 71588 74 0 2 83 34 49 0.150 -2.80 2.15 Intr - 72992 72885 108 2 0 71 93 36 0.433 2.88 2.14 Intr - 76805 76698 108 2 0 3 80 110 0.028 2.18 2.13 Intr - 87379 87295 85 0 1 21 78 75 0.219 -0.38 2.12 Intr - 88932 88828 105 2 0 127 85 -4 0.007 2.53 2.11 Intr - 100131 100002 130 1 1 65 42 187 0.062 11.55 2.10 Intr - 100730 100589 142 2 1 56 82 107 0.998 6.83 2.09 Intr - 104857 104792 66 1 0 147 96 23 0.958 8.10 2.08 Intr - 113491 113417 75 1 0 88 109 113 0.996 13.11 2.07 Intr - 121521 121368 154 2 1 109 105 91 0.998 12.97 2.06 Intr - 131505 131399 107 0 2 82 70 135 0.080 10.11 2.05 Intr - 142462 142410 53 2 2 74 90 69 0.158 4.33 2.04 Intr - 146083 146017 67 1 1 106 19 46 0.100 -2.02 2.03 Intr - 156097 156062 36 1 0 138 91 44 0.981 8.16 2.02 Intr - 190858 190410 449 2 2 61 86 95 0.170 -0.33 2.01 Init - 196009 195748 262 1 1 91 102 429 0.765 41.83 2.00 Prom - 199428 199389 40 -3.86 3.10 PlyA - 201289 201284 6 1.05 3.09 Term - 206185 206123 63 1 0 94 54 45 0.838 -0.41 3.08 Intr - 207044 206920 125 0 2 84 101 120 0.949 13.20 3.07 Intr - 207614 207505 110 1 2 70 105 114 0.988 11.23 3.06 Intr - 208040 207968 73 0 1 90 78 -13 0.777 -3.64 3.05 Intr - 211072 210955 118 1 1 139 76 133 0.790 17.24 3.04 Intr - 227861 227771 91 1 1 81 110 14 0.822 2.90 3.03 Intr - 232537 232410 128 1 2 103 109 142 0.948 17.28 3.02 Intr - 240485 240393 93 2 0 105 73 3 0.566 0.66 3.01 Init - 247207 247103 105 1 0 96 86 241 0.998 24.92 3.00 Prom - 261539 261500 40 -4.36 4.02 PlyA - 262076 262071 6 1.05 4.01 Sngl - 267562 266690 873 1 0 47 41 242 0.877 11.37 4.00 Prom - 270336 270297 40 -7.46 5.00 Prom + 271131 271170 40 -5.56 5.01 Init + 275972 276226 255 1 0 56 105 447 0.646 40.43 5.02 Intr + 293549 293594 46 1 1 89 119 25 0.011 3.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 11044 7300 3745 1 1 93 94 5072 0.927 493.90 S.002 Term - 100131 99998 134 1 2 65 43 191 0.936 10.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:48651588_48947596|GENSCAN_predicted_peptide_1|2045_aa MGPPSKRLSEVEAATFHRKPMGGKYGVLVGDALLTAYCQKARISHSTPSQSYENAHGHQN FRPKSGTRFRTSGQEGGTRDLEGQKKAGRRSWDSALEGGSQEELGGGGHQGWDPGLAATT GPRAHIGGGALALCPESSGVREDGGPGLGVREPIFVGLRGRRQSARNSRGPPEQPNEELG IEHGVQPLGSRERETGQGPGSVLYWRPEVSSCGRTGPLQRGSLSPGALSSGVPGSGNSSP LPSDFLIRHHGPKPVSSQRNAGTGSRKRVGTARCCGELWATGSKGQGERATTSGAERTAP RRNCLPGASGSGPELDSAPRTARTAPASGSAPRESRTAPEPAPKRMRSRGLFRCRFLPQR PGPRPPGLPARPEARKVTSANRARFRRAANRHPQFPQYNYQTLVPENEAAGTAVLRVVAQ DPDAGEAGRLVYSLAALMNSRSLELFSIDPQSGLIRTAAALDRESMERHYLRVTAQDHGS PRLSATTMVAVTVADRNDHSPVFEQAQYRETLRENVEEGYPILQLRATDGDAPPNANLRY RFVGPPAARAAAAAAFEIDPRSGLISTSGRVDREHMESYELVVEASDQGQEPGPRSATVR VHITVLDENDNAPQFSEKRYVAQVREDVRPHTVVLRVTATDRDKDANGLVHYNIISGNSR GHFAIDSLTGEIQVVAPLDFEAEREYALRIRAQDAGRPPLSNNTGLASIQVVDINDHIPI FVSTPFQVSVLENAPLGHSVIHIQAVDADHGENARLEYSLTGVAPDTPFVINSATGWVSV SGPLDRESVEHYFFGVEARDHGSPPLSASASVTVTVLDVNDNRPEFTMKEYHLRLNEDAA VGTSVVSVTAVDRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALPLDYKQERYFKLV LTASDRALHDHCYVHINITDANTHRPVFQSAHYSVSVNEDRPMGSTIVVISASDDDVGEN ARITYLLEDNLPQFRIDADSGAITLQAPLDYEDQVTYTLAITARDNGIPQKADTTYVEVM VNDVNDNAPQFVASHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDGDGDF TIEPTSGIVRTVRRLDREAVSVYELTAYAVDRGVPPLRTPVSIQVMVQDVNDNAPVFPAE EFEVRVKENSIVGSVVAQITAVDPDEGPNAHIMYQIVEGNIPELFQMDIFSGELTALIDL DYEARQEYVIVVQATSAPLVSRATVHVRLVDQNDNSPVLNNFQILFNNYVSNRSDTFPSG IIGRIPAYDPDVSDHLFYSFERGNELQLLVVNQTSGELRLSRKLDNNRPLVASMLVTVTD GLHSVTAQCVLRVVIITEELLANSLTVRLENMWQERFLSPLLGRFLEGVAAVLATPAEDV FIFNIQNDTDVGGTVLNVSFSALAPRGAGAGAAGPWFSSEELQEQLYVRRAALAARSLLD VLPFDDNVCLREPCENYMKCVSVLRFDSSAPFLASASTLFRPIQPIAGLRCRCPPGFTGD FCETELDLCYSNPCRNGGACARREGGYTCVCRPRFTGEDCELDTEAGRCVPGVCRNGGTC TDAPNGGFRCQCPAGGAFEGPRCEVAARSFPPSSFVMFRGLRQRFHLTLSLSFATVQQSG LLFYNGRLNEKHDFLALELVAGQVRLTYSTGESNTVVSPTVPGGLSDGQWHTVHLRYYNK PRTDALGGAQGPSKDKVAVLSVDDCDVAVALQFGAEIGNYSCAAAGVQTSSKKSLDLTGP LLLGGVPNLPENFPVSHKDFIGCMRDLHIDGRRVDMAAFVANNGTMAGCQAKLHFCDSGP CKNSGFCSERWGSFSCDCPVGFGGKDCQLTMAHPHHFRGNGTLSWNFGSDMAVSVPWYLG LAFRTRATQGVLMQVQAGPHSTLLCQLDRGLLSVTVTRGSGRASHLLLDQVTVSDGRWHD LRLELQEEPGGRRGHHVLMVSLDFSLFQDTMAVGSELQGLKVKQLHVGGLPPGSAEEAPQ GLVGCIQGVWLGSTPSGSPALLPPSHRVNAEPGCVVTNACASGPCPPHADCRDLWQTFSC TCQPX >gi568815595r:48651588_48947596|GENSCAN_predicted_CDS_1|6135_bp atggggccccccagcaagaggctctcagaagttgaagcagcaacattccataggaaaccc atgggaggaaagtatggggtcctggtgggggatgccctgctcaccgcttactgtcagaag gccaggatttcacacagtaccccctcccagagctatgagaacgctcatgggcaccaaaat ttcagaccaaaatccgggacacggttcagaactagtggtcaggaagggggaacaagagat ctggaaggacagaaaaaggctgggagacggagctgggacagcgccctagaaggagggagc caggaggagctggggggcggtgggcaccagggctgggacccaggcttagctgccactacg gggccaagggcgcatatcggtggcggagccttagctctttgtccggagtcttccggggtc cgggaggatggggggcctggcctgggggtcagggagcctatcttcgtggggctccgaggg agaaggcaaagcgcccggaatagtcgagggccccctgagcagccgaatgaggagctgggg attgaacacggcgtccagccattgggcagccgcgaacgagagacaggacagggaccaggg tctgtgttatactggcgcccagaggtctcctcttgcgggcggacaggacctttgcaaaga ggtagtctgtcaccaggggctctgtcctcaggggtcccgggctcggggaacagctcgccc ctcccttcagactttttgattcggcaccacggtcccaagccggtgtcctcccagcggaac gctgggacaggctcccgcaaaagagtgggcaccgcgcgctgctgtggggaattatgggca acagggagcaagggtcagggcgagagagccacgacatccggagcagaaaggacagccccc cggcggaactgtcttccaggggcctcgggatctggccccgagctggattcagcaccacgc acggcgaggacagctcctgcatcaggttcagcaccccgcgagtctcggacagctcccgag ccggcgcccaagcgcatgcgctcccggggtctcttccgctgccgcttcctcccgcagcgc cccgggccgcgtcccccgggactcccggcccgtcctgaagccaggaaagtaacctcggcg aaccgggcacgctttcgtcgcgccgcaaaccgccacccgcagtttccgcagtacaactac cagacgctggtgccggagaatgaggcagcaggcaccgcggtgctacgcgtggttgctcag gacccggacgccggcgaggccgggcgcctagtctactcgctggcggcactcatgaacagc cgctcgctggagctgttcagcatcgacccgcagagcggccttatccgtacggcggcagct ctggaccgcgagagcatggagcgtcactacctgcgtgtgaccgcgcaggaccacgggtcg ccgcgcctctcggccaccacgatggtggccgtgacagtagccgaccgcaacgaccactcg ccggtttttgagcaagcgcagtaccgggagacccttcgcgagaatgtggaggagggctac cctatcctgcagctgcgtgccactgacggcgacgcgccccccaacgccaacctgcgctac cgcttcgtggggccgccagctgcgcgcgctgcagctgccgccgccttcgagattgatcca cgctccggcctcatcagcaccagcggccgagtggaccgcgagcacatggaaagctatgag ctggtggtggaagccagcgaccagggccaggaacccgggccgcgctcggccactgtgcgc gtacacataactgtgctagacgagaacgacaatgctcctcagttcagcgagaagcgctac gtggcgcaggtgcgcgaggatgtgcgcccccacacagtcgtgctgcgcgtcacggccact gaccgggacaaggacgccaacggattggtgcactacaacatcatcagtggcaatagccgt ggacactttgccatcgacagcctcactggcgagatccaggtggtggcacctctggacttc gaggcagagagagagtatgccttgcgcatcagggcgcaggatgctggccggccaccgctg tccaacaacacgggcctggccagcatccaggtggtggacatcaatgaccacattcctatt tttgtcagcacgcccttccaagtttctgtcttggaaaatgctcccttgggtcactcagtc atccacattcaggcagtcgatgcagaccatggggagaatgccagattggagtactcccta actggtgtggcacctgatactccttttgtgataaacagcgccactggctgggtctctgtg agtggtcccctggaccgtgagtctgtggagcattacttctttggtgtggaggctcgagac catggctcacccccactctctgcctcagccagtgtcaccgtgactgtgctggacgttaat gacaatcggcctgagttcacaatgaaggagtaccacctacgactgaatgaggatgcagct gtgggcaccagtgtggtcagcgtgaccgcagtagaccgtgatgccaacagtgccatcagc taccagatcacaggcggcaacacccggaatcgctttgccatcagcacccaggggggtgtg ggtctggtgactctggctctgccactggactacaagcaggaacgctacttcaagctggta ctaactgcatctgaccgtgcccttcatgatcactgctatgtgcacatcaacatcacagat gccaacactcatcggccggtctttcaaagtgcccactactcagtgagtgtgaatgaagat cggccaatgggtagcaccatagtggtcatcagtgcctctgatgatgacgtgggtgagaat gctcgtatcacctatctcctggaggacaacctgccccagttccgcattgatgcagactca ggagccattacattacaggccccattagactatgaggaccaggtgacctacaccctggct atcacagctcgggacaatggcatcccacagaaggcagacactacttatgtggaggtgatg gtcaatgacgtgaatgacaatgctccacaatttgtggcctcccactatacagggctggtc tctgaggatgccccacctttcaccagtgtcctgcagatctcagccactgaccgggatgct catgccaatggccgggtccagtacactttccagaatggtgaagatggggatggagatttt accattgagcccacctctggaattgtccgtacagtaaggcggctagaccgggaggcagta tcagtgtatgagttgactgcctacgcagtggacagaggtgtgcccccactccggactcca gtcagtatccaggtgatggtgcaggatgtgaacgacaatgcacctgtcttcccagctgag gagtttgaggtgcgggtgaaagagaatagcattgtgggctcagtggtggcccagatcact gcagtggaccctgacgaaggccccaatgcccatataatgtaccagatcgtggaggggaac atccctgagctgttccaaatggacatcttctctggagaactgacggcactcattgaccta gactatgaggctcgccaagaatatgtgattgtggtgcaggccacatctgctcctttggtc agccgggccactgtgcacgtccgcctggttgaccagaatgacaacagccctgtgctcaac aacttccagatcctcttcaacaactatgtatccaaccgttcagacaccttcccgtcgggc attattgggcgcatcccagcttatgaccccgatgtctccgaccacctcttctactccttt gagcgtggcaatgagctgcagctgctggtagtcaaccagaccagtggggagctgcgactc agccgaaagctagacaataaccgcccactggtggcctccatgttggtgactgtcacagat ggcctgcacagcgtgacggcgcagtgtgtgctgcgcgtggtcatcatcacggaggagttg ctggccaacagcctgaccgtgcgccttgagaacatgtggcaggagcgcttcctgtcaccg ctgctgggccgcttcctcgagggcgtggctgcggtgctcgctacgcccgctgaggacgtc ttcatcttcaacatccagaacgacacagacgtagggggcaccgtgctcaatgtgagtttc tcggcgctagctccacgtggggccggggcgggcgctgcagggccctggttcagctccgag gagctgcaggagcagttgtacgtgcgccgggcggcgctggcggctcgctccctgctcgac gtactgcccttcgacgacaacgtgtgcctgcgagagccctgtgagaactacatgaaatgc gtgtccgtgctccgctttgactcgtccgcgcccttcctggcctcggcctccacgctgttc cgacccatccagcccatcgctggcctgcgctgccgctgcccgcccggattcacgggagac ttttgcgagaccgagctcgacctctgctactccaacccatgtcgcaacggcggagcctgc gcgcggcgcgagggaggctacacgtgcgtctgccgcccgcgcttcaccggagaggactgc gagctggacaccgaggccggccgctgcgtgccgggcgtctgccgcaacgggggcacctgc accgacgcgcccaacggcggctttcgctgccagtgcccggcaggcggcgccttcgagggc ccgcgctgcgaggtggctgcgcgctccttcccgcccagttcgttcgtcatgtttcgcggc ctgcggcagcgattccaccttacgctgtccctctcgttcgcgacagtgcagcagagcggg ctgctcttctacaacgggcgcctgaacgagaagcacgacttcctggccctggaactcgtg gctggccaagtgcggctcacatattccacgggtgaatccaacaccgtggtcagccccaca gttccagggggcttgagtgacgggcaatggcatacagtgcatctgagatactacaacaag ccccggacagatgccctagggggtgcacagggcccctccaaggacaaggtggctgtgcta agcgtggatgattgtgatgtggccgtggctctgcagtttggtgctgagattggcaactac tcatgcgcggctgctggtgtgcaaacaagctccaagaagtccctggacctgacgggccct cttcttctgggaggtgtccccaacctccccgagaacttccccgtatcccataaggacttc atcggctgtatgcgggacctgcacattgatggccgccgagtggacatggcggcttttgtc gcaaataatggcaccatggcaggctgccaagccaagctacacttttgtgactcaggcccc tgcaagaacagtggcttctgctcggagcgctggggcagcttcagctgcgactgccctgtg ggcttcggcggcaaagactgtcagcttactatggcccatccccaccatttccgtggcaac ggcacactgagctggaactttggaagtgacatggctgtgtctgtgccatggtacctgggg ctggcatttcggacacgggcaacgcagggggtcctgatgcaagtgcaggctgggccacac agcacgctcctttgccagctagatcgggggttactgtctgtgacagtgaccaggggctcg ggccgtgcttcccatctccttctggaccaggtgactgtcagtgatggccggtggcacgat ctgcggctggagttgcaggaggaaccaggtggccggcggggccaccatgtccttatggtc tcactggactttagcctcttccaggacaccatggcggtggggagtgagctgcagggcctg aaggtaaagcagctccacgtgggaggcctgccccccggcagtgcagaggaggctcctcag ggtctggttggctgcatccagggggtgtggctcggctccacaccctctggctccccggcc ctgctaccccccagccaccgagtgaatgcggagcctggctgtgttgtgaccaacgcctgt gcctctgggccctgcccacctcacgcagactgccgggacctctggcagaccttttcttgc acctgccagccagnn >gi568815595r:48651588_48947596|GENSCAN_predicted_peptide_2|1984_aa MSHIQIPPGLTELLQGYTVEVLRQQPPDLVEFAVEYFTRLREARAPASVLPAATPRQSLG HPPPEPGPDRVADAKGDSESEEDEDLEVLEVVARAIRQEKEIKAIQLGKEEVKLSLFADD MIVYLENPIVSAPNLLKLISNFSKVSGYKINAQKSQAFLYTNNRKTESQVMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILSKFQF LADLIDEYQDIGSLLGTWYRWHYCTSGMGHTICAETYNPDEEEEDTDPREQLSQVLDAMF ERIVKADEHVIDQGDDGDNFYVIERGTYDILVTKDNQTRSVGQYDNRGSFGELALMYNTP RAATIVATSEGSLWGLVSERMKIVDVIGEKIYKDGERIITQGEKADSFYIIESGEVSILI RSRTKSNKDGGNQEVEIARCHKGQYFGELALVTNKPRAASAYAVGDVKCLVMDVQAFERL LGPCMDIMKRNISHYEEQLVKMFGSSVDLGNLGQTALSQGTKVTVYSAISQDPAIPFLAR SSNSRKEGRLLPPPPLEVLFGDQRPISARGLKTYILEPSLMLTTAGVMSQTSKKAVEQFS KRIPNETIRQIPQSDNRLAFISHITQAVSQTLDIQQNLHTPYRPQSSGKFCRCYIAYFDD HSLPSDEFQRLFMFRASWPEQAKEPRREGHTDKQQTEDVLAAGLRCLPHLPAICARRMSP AFRAMDVEPRAKGVLLEPFVHQVGGHSCVLRFNETTLCKPLVPREHQFYETLPAEMRKFT PQYKAVLIFVKFADEFGASGNIETKELGVVSVRFEEDEDRNLCLIAYPLKGDHGIVDIVD NSDCEPKSKLLRWTTNKKHHVLETEKTPKDWVRQHRKEEKMKSHKLEEEFEWLKKSEVLY YTVEKKGNISSQLKHYNPWSMKCHQQQLQRMKENAKHRNQYSILCSRASLHLRTFLVHTA EKQTFKEQKFILLENLTSRYEVPCVLDLKMGTRQHGDDASEEKAANQIRKCQQSTSAVIG VRVCGMQVYQAGSGQLMFMNKYHGRKLSVQGFKEALFQFFHNGRYLRRELLGPVLKKLTE LKAVLERQESYRFYSSSLLVIYDGKERPEVVLDSDAEDLEDLSEESADESAGAYAYKPIG ASSVDVRMIDFAHTTCRLYGEDTVVHEGQDAGYIFGLQSLIDIVTEITLATFPAPALPGL MYRALYAFRSAEPNALAFAAGETFLVLERSSAHWWLAARARSGETGYVPPAYLRRLQGLE QDVLQAIDRAIEAVHNTAMRDGGKYSLEQRGVLQKLIHHRKETLSRRGPSASSVAVMTSS TSDHHLDAAAARQPNGVCRAGFERQHSLPSSEHLGADGGLYQVCGEDIVEKGDPTSIFQI PLPSSQIPPQPRRAAPTTPPPPVKRRDREALMASGSGGHNTMPSGGNSVSSGSSVSSTSL DTLYTSSSPSEPGSSCSPTPPPVPRRGTHTTVSQVQPPPSKASAPEPPAEEEVATGTTSA SDDLEALGTLSLGTTEEKAAAEAAVPRTIGAELMELVRRNTGLSHELCRVAIGIIVGHIQ ASVPASSPVMEQVLLSLVEGKDLSMALPSGQVCHDQQRLEVIFADLARRKDDAQQRSWAL YEDEGVIRCYLEELLHILTDADPEVCKKMCKRNEFESVLALVAYYQMEHRASLRLLLLKC FGAMCSLDAAIISTLVSSVLPVELARDMQTDTQDHQKLCYSALILAMVFSMGEAVPYAHY EHLGTPFAQFLLNIVEDGLPLDTTEQLPDLCVNLLLALNLHLPAADQNVIMAALSKHANV KIFSEKLLLLLNRGGEGPGICADDPVRIFKHEPQPPHSVLKFLQDVFGSPATAAIFYHTD MMALIDITVRHIADLSPGDKLRMEYLSLMHAIVRTTPYLQHRHRLPDLQAILRRILNEEE TSPQCQMDRMIVREMCKEFLRPWPSCCFESGILETGHPSWSLMEQDDGTEKANDRISFLC KNVD >gi568815595r:48651588_48947596|GENSCAN_predicted_CDS_2|5955_bp atgagccacatccagatcccgccggggctcacggagctgctgcagggctacacggtggag gtgctgcgacagcagccgcctgacctcgtcgaattcgcagtggagtacttcacccgcctg cgcgaggcccgcgccccagcctcagtcctgcccgccgccaccccacgccagagcctgggc caccccccgccagaacccggcccggaccgtgtcgccgacgccaaaggggacagcgagtcg gaggaggacgaggacttggaagtgttggaagttgtggccagggcaattaggcaggagaag gaaataaaggctattcagttaggaaaagaggaagtcaaattgtccctgtttgcagatgac atgattgtatatctagaaaaccccattgtctcagccccaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgcacaaaaatcacaagcattcttatat accaataacagaaaaacagagagccaagtcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatagtgaaaatggccatactgtccaagttccagttc ctagcagatttaatagacgagtatcaggatatagggtccctgttgggcacttggtatcga tggcattactgcacaagtggcatgggccacacaatctgtgctgagacctataaccctgat gaggaagaggaagatacagatccaagggaacagctttctcaagttctcgatgccatgttt gaaaggatagtcaaagctgatgagcatgtcattgaccaaggagatgatggagacaacttt tatgtcatagaacggggaacttatgacattttagtaacaaaagataatcaaacccgctct gttggtcaatatgacaaccgtggcagttttggagaactagctctgatgtacaacaccccg agagctgctaccattgttgctacctcagaaggctccctttggggactggtgtcagaacga atgaagattgtggatgtaataggagagaagatctataaggatggagaacgcataatcact cagggtgaaaaggctgatagcttttacatcatagagtctggcgaagtgagcatcttgatt agaagcaggactaaatcaaacaaggatggtgggaaccaggaggtcgagattgcccgctgc cataaggggcagtactttggagagcttgccctggtcaccaacaaacccagagctgcctca gcttatgcagttggagatgtcaaatgcttagttatggatgtacaagcattcgagaggctt ctggggccctgcatggacatcatgaagaggaacatctcacactatgaggaacagctggtg aagatgtttggctccagcgtggatctgggcaacctcgggcaaactgcactgagccagggg acaaaggtcactgtgtactctgctataagccaggatccagcaataccctttctggctagg tcctccaactcccggaaggaggggagactcttgccccctccacctttggaggtcctcttt ggggatcagaggcccatctctgccaggggcctgaagacctacatcctggagcccagtctc atgctgacaactgctggtgtcatgagccagacctccaagaaggcagtagagcagttctct aagcggatccccaatgaaactatcaggcagattccccagtccgataacagactggccttt attagtcatatcacccaagcagtttctcagactcttgatattcagcaaaaccttcacacc ccttaccgtcctcaatcttcaggaaagttttgtcgttgctatattgcttactttgatgac cattccttgccttctgatgagtttcaacgtcttttcatgttcagagcatcctggccagaa caagccaaggagccaagacgagagggacacacggacaaacaacagacagaagacgtactg gccgctggactccgctgcctcccccatctccccgccatctgcgcccggaggatgagccca gccttcagggccatggatgtggagccccgcgccaaaggcgtccttctggagccctttgtc caccaggtcggggggcactcatgcgtgctccgcttcaatgagacaaccctgtgcaagccc ctggtcccaagggaacatcagttctacgagaccctccctgctgagatgcgcaaattcact ccccagtacaaagctgttttgatatttgttaagtttgcagatgagtttggggcctctggc aacatagagactaaggaactgggtgtggtatctgtgcgctttgaagaagatgaagacagg aacttgtgtctaatagcatatccattgaaaggggaccatggaattgtggacattgtagat aattcagactgtgaaccaaaaagtaagctcctaaggtggacaacaaacaaaaaacatcat gtcttagaaacagaaaagacccctaaggactgggtgcgtcagcaccgtaaagaggagaaa atgaagagccataagttagaagaagaatttgagtggctaaagaaatctgaagtcttgtac tacactgtagagaagaaggggaatataagttcccagcttaaacactataacccttggagc atgaaatgtcaccagcaacagttacagagaatgaaggagaatgcaaagcatcggaaccag tacagtatcctttgttcacgggcatctctgcatttgaggacctttttagtgcacacagct gagaagcagacgtttaaagagcaaaaatttatcttactggaaaacctgacttcccgctac gaggtgccttgtgtccttgacctcaagatgggcacacgacaacatggtgatgatgcttca gaggagaaggcagccaaccagatccgaaaatgtcagcagagcacatctgcagtcattggt gtgcgtgtgtgtggcatgcaggtgtaccaagcaggcagtgggcagctcatgttcatgaac aagtaccatggacggaagctatcggtgcagggcttcaaggaggcacttttccagttcttc cacaatgggcggtacctgcgccgtgaactcctgggccctgtgctcaagaagctgactgag ctcaaggcagtgttggagcgacaggagtcctaccgcttctactcaagctccctgctggtc atttatgatggcaaggagcggcccgaagtggtcctggactcagatgctgaggatttggag gacctgtcagaggaatcagctgatgagtctgctggtgcctatgcctacaaacccatcggc gccagctctgtagatgtgcgcatgatcgactttgcacacaccacctgcaggctgtatggc gaggacaccgtggtgcatgagggccaggatgctggctatatcttcgggctccagagcctg atagacattgtcacagagataacccttgccaccttccctgcacctgccctgcccggcctc atgtaccgcgcgctgtacgcgttccgctcggcggagcccaacgcgctggcgttcgccgcg ggcgagaccttcctggtgctagagcgaagcagcgcgcactggtggctggccgcgcgggcg cgcagtggtgagacgggctacgtgccgccagcctacctgcgccgcctgcagggcctggag caggatgtcctccaggccattgaccgggccatcgaggctgtacacaacacagccatgcgg gatggtggcaagtacagcctggaacagcgtggagtcctccagaagctgatccaccaccgg aaagagaccctgtcacgcagaggcccttcagcctccagtgttgcagttatgacctcatca accagtgaccaccacttggatgctgctgcagccaggcagcccaatggggtgtgtcgagct gggttcgagcggcagcacagcctacccagttctgagcatcttggggcagatggaggcctc taccaggtgtgtggggaggacatcgtggagaagggagatcctacttccatcttccagatc ccacttccatcttcccagatcccaccacagcctcgccgagcagcacccaccacaccgccc ccaccagtgaagcgccgagaccgcgaggccctgatggcctctgggagtggtggccacaac accatgccctccgggggtaactctgtgtccagcggctcctcagtcagcagcacctccctg gacacgctctataccagctccagcccatctgaaccaggctccagctgctcacccacaccc ccacctgtgccccgccgaggcacccacaccaccgtgtcccaagtccagccccctccctcc aaggcatcagcacctgaaccccctgcagaagaagaagtggcaactggtacaacctcagcc tctgatgacctggaagccctgggtacactgagcctggggaccacagaggagaaggcagca gctgaggcggctgtgcccaggaccattggggccgagctgatggagctggtgcggagaaac actggcctgagccacgaattatgccgggtggccatcggcatcatagtgggtcacatccag gcctcggtgccggccagctcaccagtcatggagcaggtcctcctctcactcgtagagggc aaggacctcagcatggccctgccctcagggcaggtctgccacgaccagcagaggctggag gtgatctttgcagacctggctcgccggaaggacgacgcccagcagcgcagttgggcacta tatgaggatgagggtgtcatccgctgctacctagaggagctgctgcatattctgactgat gcagaccctgaagtttgcaagaaaatgtgcaagagaaacgagttcgagtctgtcctggcc ttggtggcctattaccaaatggaacaccgagcatcactgcggctgctgctcctcaagtgc tttggcgccatgtgcagcctggatgcagccatcatctccacgcttgtgtcatccgtgctg cctgtagagctggcgagggacatgcagacagacacgcaggaccaccagaaactctgttac tctgccctcatcctggccatggtcttctccatgggagaggcagtgccctatgcacactat gagcacctgggcacgcctttcgcccagttcctactgaacatcgtcgaggatgggctgccc ttggacaccacagagcagctgccggacctctgcgtgaacctgcttctggctctcaacctg cacctgccagctgctgaccagaatgtcatcatggctgccctgagcaaacacgccaatgtc aagatcttctccgagaagctgttgttgctcctgaacagagggggtgagggcccgggaatc tgcgcggatgaccctgtgcgcatcttcaaacatgagccacagccaccacactctgtcctc aagttcctgcaggacgtgtttggcagcccggccacagctgccatcttctaccacacagac atgatggctctcattgacatcactgtgcggcacatcgcagacctgtcaccaggagacaag ctgcgcatggagtacctctccctgatgcatgctatagtccgcaccacaccctacctgcag caccgccaccggctacccgacctgcaggccatactgcgacgcatcctgaatgaggaggag acctcaccccagtgccagatggaccgcatgattgtccgagagatgtgcaaggaattcctg aggccttggccaagctgctgctttgaaagcgggatcttggagacaggccatccatcctgg agcctcatggaacaggatgatggcactgagaaagccaatgaccgaatctcttttctctgt aaaaatgtagactga >gi568815595r:48651588_48947596|GENSCAN_predicted_peptide_3|301_aa MADQPKPISPLKNLLAGGFGGVCLVFVGHPLDTVKVRLQTQPPSLPGQPPMYSGTFDCFR KTLFREGITGLYRGMAAPIIGVTPMFAVCFFGFGLGKKLQQKHPEDVLSYPQLFAAGMLS GVFTTGIMTPGERIKCLLQIQASSGESKYTGTLDCAKKLYQEFGIRGIYKGTVLTLMRDV PASGMYFMTYEWLKNIFTPEGKRVSELSAPRILVAGGIAGIFNWAVAIPPDVLKSRFQTA PPGKYPNGFRDVLRELIRDEGVTSLYKGFNAVMIRAFPANAACFLGFEVAMKFLNWATPN L >gi568815595r:48651588_48947596|GENSCAN_predicted_CDS_3|906_bp atggccgaccagccaaaacccatcagcccgctcaagaacctgctggccggcggctttggc ggcgtgtgcctggtgttcgtcggtcaccctctggacacggtcaaggtccgactgcagaca cagccaccgagtttgcctggacaacctcccatgtactctgggacctttgactgtttccgg aagactctttttagagagggcatcacggggctatatcggggaatggctgcccctatcatc ggggtcactcccatgtttgccgtgtgcttctttgggtttggtttggggaagaaactacaa cagaaacacccagaagatgtgctcagctatccccagctttttgcagctgggatgttatct ggcgtattcaccacaggaatcatgactcctggagaacggatcaagtgcttattacagatt caggcttcttcaggagaaagcaagtacactggtaccttggactgtgcaaagaagctgtac caggagtttgggatccgaggcatctacaaagggactgtgcttacccttatgcgagatgtc ccagctagtggaatgtatttcatgacatatgaatggctgaaaaatatcttcactccggag ggaaagagggtcagtgagctcagtgcccctcggatcttggtggctgggggcattgcaggg atcttcaactgggctgtggcaatccccccagatgtgctcaagtctcgattccagactgca cctcctgggaaatatcctaatggtttcagagatgtgctgagggagctgatccgggatgaa ggagtcacatccttgtacaaagggttcaatgcagtgatgatccgagccttcccagccaat gcggcctgtttccttggctttgaagttgccatgaagttccttaattgggccacccccaac ttgtga >gi568815595r:48651588_48947596|GENSCAN_predicted_peptide_4|290_aa MLGQRAGDGERPGLPGDGEGGVPARPGRRAERPPQRPAKVNKAVTCAAHLPGAAASRPLS PNKPDRVRPGQRDRIGAKRQRRRRADAGQARAASSRRVVPTAPEVLGAVASLPDRGRPTV ARVATGSRLEGLFSAASLKLSALTQSLTRVRQAPTASGATIRLPASPVEMFLTSAFLTGF SFHCLYSGIGHGEDILASVEQITIVSRPLSGQRGAGPGNSAYTPRRSQGGPRAATTPGFR FPCRGLVRRAVLRLTVTVQDCILTALLAVSFHSIGVVIMTSSYLLGPVVK >gi568815595r:48651588_48947596|GENSCAN_predicted_CDS_4|873_bp atgctcggacaacgtgccggcgacggggagcgcccgggcctcccgggcgacggcgaaggc ggagtcccggcccggccagggaggcgcgcggagaggcccccccagcggccagccaaggta aacaaggccgtgacgtgcgccgcgcacttaccgggagctgcggcctcgcggccgctgagc ccgaacaagccagaccgggtcaggccgggccagagggaccggattggggcgaagcggcag cggaggcggcgggccgacgccggtcaagcccgcgctgcttcctccagaagagtcgtcccc acagctccggaagtgcttggcgccgttgcgtcacttccggatcggggtcgacccacggtc gctcgggtcgcgacaggctcccggctagagggcctgtttagcgccgcctccttgaaactt agcgctctgacccagagtctgaccagggtacggcaggcgccgaccgcgtctggagccact attcgcctaccagcgtctcccgtcgagatgtttttaaccagcgcgtttctcaccggcttc tcatttcactgtttgtactccgggatcgggcacggagaagacatcctggcgtcagtggag cagataaccattgtttctcggccgctatctggtcagaggggagctgggccgggaaattct gcctacaccccgaggcggtcgcagggtggccccagagccgcaaccacaccaggctttcgc ttcccctgccgaggcctcgttcgccgcgcagttctccgacttacggtcaccgtgcaagat tgcatcttaactgccttgcttgcagtttcttttcacagtataggagttgtcatcatgact tccagttaccttctgggaccggttgtcaaatga >gi568815595r:48651588_48947596|GENSCAN_predicted_peptide_5|101_aa MSVDMNSQGSDSNEEDYDPNCEEEEEEEEDDPGDIEDYYVGVASDVEQQGADAFDPEEYQ FTCLTYKESEGALNEHMTSLASVLKAVDAHCHQLHPPYGTX >gi568815595r:48651588_48947596|GENSCAN_predicted_CDS_5|303_bp atgtcagtggacatgaatagccaggggtctgacagcaatgaagaggactatgacccaaat tgtgaggaagaggaagaagaagaagaagacgaccctggggacatagaggactattacgtg ggagtagccagcgatgtggagcagcagggggctgatgcctttgatcccgaggagtaccag ttcacttgcttgacctacaaggaatctgagggtgccctcaatgagcacatgaccagctta gcttctgtcctaaaggccgttgatgctcactgccatcagctgcatcctccctatggcact gnn