GENSCAN 1.0    Date run: 5-Nov-116    Time: 02:04:16

Sequence gi568815581r:76125754_76326806 : 201053 bp : 48.63% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5386   5466   81  0  0   59   89   113 0.654   7.47
 1.02 Term +   6554   6613   60  1  0   76   48    37 0.271  -3.70
 1.03 PlyA +   8716   8721    6                               1.05

 2.03 PlyA -  10602  10597    6                               1.05
 2.02 Term -  12367  11600  768  1  0   88   48  1189 0.992 108.31
 2.01 Init -  14642  14145  498  2  0  114   80   838 0.999  78.66
 2.00 Prom -  15014  14975   40                             -10.45

 3.27 PlyA -  16742  16737    6                               1.05
 3.26 Term -  17334  17281   54  0  0   84   45    25 0.141  -4.64
 3.25 Intr -  17619  17555   65  1  2  132   97    13 0.368   5.04
 3.24 Intr -  19600  19508   93  2  0  123   23    98 0.384   6.74
 3.23 Intr -  26712  26602  111  1  0   96  105    60 0.506   8.85
 3.22 Intr -  28575  28527   49  0  1   90   43    49 0.686  -1.15
 3.21 Intr -  29564  29499   66  2  0   65   76    50 0.493   0.60
 3.20 Intr -  29981  29809  173  0  2   93   94   162 0.991  16.96
 3.19 Intr -  30568  30457  112  1  1   77   83    20 0.954   0.35
 3.18 Intr -  32748  32640  109  2  1   54   76    66 0.953   2.29
 3.17 Intr -  33820  33582  239  1  2   87   89   159 0.969  12.31
 3.16 Intr -  35894  35782  113  0  2  111  100    81 0.999  11.70
 3.15 Intr -  36249  36063  187  0  1   76   68   187 0.552  14.76
 3.14 Intr -  36870  36799   72  0  0  105  115    67 0.998  10.70
 3.13 Intr -  39042  38995   48  0  0   87   86    27 0.705   1.28
 3.12 Intr -  39792  39749   44  1  2  113  113     8 0.940   3.76
 3.11 Intr -  40774  40708   67  1  1  121   66    92 0.388   8.78
 3.10 Intr -  41373  41256  118  2  1   59   83   136 0.999  10.67
 3.09 Intr -  42044  41898  147  1  0   75  110   280 0.997  28.25
 3.08 Intr -  43470  43232  239  0  2  -31   58   187 0.972   0.21
 3.07 Intr -  44417  44304  114  2  0  113   76    63 0.988   8.24
 3.06 Intr -  48037  47949   89  2  2   65  108    37 0.275   3.09
 3.05 Intr -  50899  50795  105  2  0   35   66   119 0.030   4.59
 3.04 Intr -  77055  76926  130  0  1   30   85    87 0.082   2.87
 3.03 Intr -  86729  86611  119  0  2  125   44    78 0.381   7.28
 3.02 Intr -  89293  89265   29  0  2   89   71    20 0.473  -1.84
 3.01 Init -  90539  90472   68  2  2   89   84     8 0.480   1.31
 3.00 Prom -  90634  90595   40                              -7.36

 4.02 PlyA -  92073  92068    6                               1.05
 4.01 Sngl - 101053  99998 1056  1  0   97   33   889 0.936  81.46
 4.00 Prom - 101373 101334   40                             -11.14

 5.03 PlyA - 102180 102175    6                               1.05
 5.02 Term - 103877 103798   80  0  2  113   44   105 0.799   6.53
 5.01 Init - 114487 114400   88  1  1  109   93   211 0.996  24.50
 5.00 Prom - 122127 122088   40                              -4.26

 6.02 PlyA - 123167 123162    6                               1.05
 6.01 Sngl - 132187 131738  450  1  0   61   36   269 0.829  13.38
 6.00 Prom - 135181 135142   40                             -10.25

 7.00 Prom + 138333 138372   40                             -10.05
 7.01 Init + 139366 139872  507  0  0   37   64   429 0.426  28.36
 7.02 Intr + 140154 140216   63  2  0  127  103    68 0.991  11.11
 7.03 Term + 144441 144752  312  2  0  124   53   292 0.999  24.30
 7.04 PlyA + 146332 146337    6                               1.05

 8.21 PlyA - 148314 148309    6                               1.05
 8.20 Term - 148507 148326  182  2  2   75   53    60 0.176  -1.03
 8.19 Intr - 150194 150066  129  0  0  119   70    91 0.982  11.07
 8.18 Intr - 150629 150453  177  0  0   52   71    81 0.830   2.69
 8.17 Intr - 151083 150927  157  0  1   33   79   158 0.421   8.98
 8.16 Intr - 151557 151410  148  2  1  113   77   187 0.999  20.34
 8.15 Intr - 152436 152236  201  2  0   76   44   339 0.965  26.70
 8.14 Intr - 152809 152752   58  2  1  118   85    -8 0.654  -0.16
 8.13 Intr - 153389 153256  134  1  2  101    1   162 0.654   9.09
 8.12 Intr - 154401 154280  122  0  2   85   78   181 0.956  16.09
 8.11 Intr - 154698 154534  165  0  0   69  100   239 0.961  23.36
 8.10 Intr - 154975 154901   75  1  0   94   52   123 0.995   9.01
 8.09 Intr - 155200 155078  123  1  0   88   42   319 0.526  28.08
 8.08 Intr - 156362 156111  252  2  0   90   75   268 0.991  23.33
 8.07 Intr - 161748 161439  310  2  1   97   94   168 0.946  14.62
 8.06 Intr - 162144 162047   98  0  2   98   98    26 0.997   3.41
 8.05 Intr - 164324 164239   86  0  2   70   78   114 0.999   8.14
 8.04 Intr - 168268 165262 3007  1  1  111   55   629 0.964  49.27
 8.03 Intr - 178772 178629  144  2  0   63   31    84 0.400   0.68
 8.02 Intr - 179188 179033  156  1  0  111   20    81 0.627   3.81
 8.01 Init - 182245 181712  534  1  0  108   84   968 0.971  93.26
 8.00 Prom - 183914 183875   40                              -8.36

 9.04 PlyA - 184144 184139    6                               1.05
 9.03 Term - 185947 185789  159  1  0   47   33   129 0.904   1.24
 9.02 Intr - 187263 187117  147  0  0  111   55   231 0.999  22.53
 9.01 Init - 188133 188068   66  0  0   84   44    78 0.617   4.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 197703 197939  237  2  0   76   44   153 0.893   4.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_1|46_aa
MPASAAPTQAALRRSALWALPSGGGEQLRLAGFWGTMSFQLAPANF

>gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_1|141_bp
atgccggcttccgctgcccccactcaggctgccctgcggcgctcagcactgtgggcgctc
ccttcagggggtggggaacagctcaggctggctggcttctgggggacaatgtccttccag
ttggcccctgccaacttctga

>gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_2|421_aa
MAESWLRLSGAGPAEEAGPEGGLEEPDALDDSLTSLQWLQEFSILNAKAPALPPGGTDPH
GYHQVPGSAAPGSPLAADPACLGQPHTPGKPTSSCTSRSAPPGLQAPPPDDVDYATNPHV
KPPYSYATLICMAMQASKATKITLSAIYKWITDNFCYFRHADPTWQNSIRHNLSLNKCFI
KVPREKDEPGKGGFWRIDPQYAERLLSGAFKKRRLPPVHIHPAFARQAAQEPSAVPRAGP
LTVNTEAQQLLREFEEATGEAGWGAGEGRLGHKRKQPLPKRVAKVPRPPSTLLPTPEEQG
ELEPLKGNFDWEAIFDAGTLGGELGALEALELSPPLSPASHVDVDLTIHGRHIDCPATWG
PSVEQAADSLDFDETFLATSFLQHPWDESGSGCLPPEPLFEAGDATLASDLQDWASVGAF
L

>gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_2|1266_bp
atggcggagagctggctgcgcctctcgggagccgggccggcggaggaggccgggccggag
ggcggcctggaggagcccgacgccctggatgacagcctgaccagcctgcagtggctgcag
gaattctccattctcaacgccaaggcccccgccctgcccccggggggcaccgacccccac
ggctaccaccaggtgccaggttcagcggcgcccgggtcccccctggcggccgaccccgcc
tgcctggggcagccacacacgccgggcaagcccacgtcgtcgtgcacgtcgcggagcgcg
cccccggggctgcaggccccaccccccgacgacgtggactacgccaccaatccgcacgtg
aagcctccctactcgtatgccacgctcatctgcatggccatgcaggccagcaaggccacc
aagatcaccctgtcggccatctacaagtggatcacggacaacttctgctacttccgccac
gcagatcccacctggcagaattcaatccgccacaacctgtctctgaacaagtgcttcatc
aaagtgcctcgggagaaggacgaaccaggcaaggggggcttctggcgcattgacccccag
tacgcggagcggctactgagcggcgctttcaagaagcggcgactgccccctgtccacatc
cacccagcctttgcccgccaggccgcgcaggagcccagcgctgtcccccgggccgggccg ctgacggtgaataccgaggcccagcagctgctgcgggagttcgaggaggccaccggggag gcgggctggggtgcaggcgagggcaggctggggcataagcgcaaacagccgctgcccaag cgggtggccaaggtcccgcggccccccagcaccctgctgcccaccccggaggagcagggt gagctggaacccctcaaaggcaactttgactgggaggccatcttcgacgccggcactctg ggcggggagctgggtgcactggaggccctggagctgagcccgcctctgagccccgcctca cacgtggacgtggacctcaccatccacggccgccacatcgactgccctgccacctggggg ccttcggtggagcaggctgccgacagcctggacttcgatgagaccttcctggccacatcc ttcctgcagcacccctgggacgagagcggcagtggctgcctgcccccggagcccctcttt gaggctggggatgccaccctggcctccgacctgcaggactgggccagcgtgggggccttc ttgtaa >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_3|919_aa MATSKPIIHGFLLPQIPPLRSHRNGVYESVDKGSYFASHFIMGGEKFDSTHPEGYLFGEN SDLNFLGNRPVVNHGPIYNLAEKAPGLMIIVVTCLKPRLATFWYYAKAELLPPTPGRQAV EGLSVCSLRPPCSSRCDGSGCSGQPTTVINISLRRPTSPRTREDSEKPGQYPKGHTEARQ NAGSQAFVKREENCFLEIGPAWIGKPKRLELEVPQQIQKVEQKSREKENRRRKTQVIEHP FRRSDIQLTGSTENRECKRKEIISDIEFKKTSQNQRVQVSKWKGPPSAHTVAEKEDRLCA EEVKSPGEEASKAKVHYNVEFTFDTDARVAITIYYQATEEFQNGIASYIPKDNSLQSETV QYKRGVCQQFCLPSHTVDPSEWAEEELGFDLDREVYPLVVHAVVDEGDEYFGHCHVLLGT FEKHTDGTFCVKPLKQKQVVDGVSYLLQEIYGIENKYNTQDSKVAEDEVSDNSAECVVCL SDVRDTLILPCRHLCLCNTCADTLRYQANNCPICRLRKPQSGRGTAFRALLQIRAMRKKL GPLSPTSFNPIISSQTSDSEEHPSSENIPPGYEVVSLLEALNGPLTPSPAVPPLHVLGDG HLSGMLPSYGSDGHLPPVRTISPLDRLSDSSSQGLKLKKSLSKSTSQNSSVLHEEEDEHS CSESETQLSQRPSVQHLGEECGVTPESENLTLSSSGAIDQSSCTGTPLSSTISSPEGPAS SSLAQSVMSMASSQISTDTVSSMSGSYIAPGTEEEGEALSSPQPASRAPSEEGEGLPAES PDSNFAGLPAGEQDAEGNDVIEEEDGSPTQEGGQRTCAFLGMECDNNNDFDIASVKALDN KLCSEVCLPGAWQADDNAVSRNAQRRRLSSSSLEDSETRPCSIHSVVLPGRREEGAEMKE IQLSGPQKYSIQGAGPQSA >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_3|2760_bp atggccacctctaagccaatcatccatggatttttactgccccagatccctcccctgaga tctcacagaaatggggtatatgagagtgttgataaaggaagctattttgccagccacttc attatgggaggagagaagtttgactcaactcatcctgaaggttacctgtttggagagaac agcgatctgaactttctggggaacagaccagttgtgaaccatggaccaatttataacctt gcagagaaggccccagggctgatgatcattgttgtgacttgtttgaagcctcgattggcc 
actttttggtactatgccaaggctgagctgcttcctccaacccctgggcggcaagctgtg gaggggctgtctgtctgctccctgcgacccccctgcagcagccgttgtgatggcagcggc tgctctggacagcccaccactgtcatcaatatttccttacgccgccccacctccccaaga acccgtgaagactctgagaagcctggtcaatatccgaaaggacacactgaggctcgtcaa aatgctggcagccaggcctttgtcaagagagaggaaaactgttttctggaaattggacct gcctggataggaaaacctaagagactagagttagaggttcctcaacagatccagaaagta gagcaaaaatccagagagaaggaaaatagaagaagaaagacgcaagtaatagagcaccca tttagaaggtctgacatccaactaacgggttctacagagaacagagaatgtaaaaggaag gaaatcatcagtgacatagaatttaagaaaacttctcagaaccagagggtacaagtttct aaatggaaaggaccaccaagtgcccacacagtggctgagaaagaggaccgtctatgtgct gaggaagtgaagagccctggagaagaggccagtaaagctaaagtccactacaatgttgag ttcacctttgacacagatgctcgggtagccatcaccatctattaccaggccacggaagag ttccagaatggtattgccagctacattcccaaagacaacagcctccagtcggagactgtg cagtacaagcgaggagtgtgtcagcagttctgcctgccctcccacaccgtggatccctcc gagtgggccgaagaggagcttggctttgatttagaccgagaagtttaccctctagtggta catgccgtggtggatgaaggagacgagtattttggccattgccatgtactgctgggtact tttgagaagcacacagatggaactttctgtgtcaagcccctcaaacagaaacaagtagta gacggggtcagctacctccttcaggagatctatggaattgaaaacaagtacaacacacaa gattctaaggtggctgaagacgaagtgagtgataacagtgccgagtgtgtggtgtgtctc tcggatgtccgggacaccttgattctgccctgtcgccacctctgcctctgtaacacctgt gcagacacgctgcgctaccaggccaacaactgccccatctgccgactgcgtaagccccag agcggcagggggactgccttccgggcactgcttcagatccgagccatgaggaaaaaattg ggccccttgtccccaaccagctttaaccccatcatctcatcccagacatctgactctgaa gagcatccatcctcagagaatattccaccaggctatgaagtagtatctcttctggaggcc ctcaacgggcccctcaccccgtccccagcagttcctccacttcacgtgcttggagatggc cacctctcaggaatgctcccttcatatggcagtgatggccacctgccccccgtcaggacg atctcgcctcttgaccgcctgtctgacagcagcagtcagggactcaaactcaaaaagagt ctctccaaatccacttcccaaaactcttccgtgctgcatgaagaggaagatgagcattcc tgcagcgagtcggagacacagctctctcagagaccgtcggttcagcatctcggagaggaa tgtggtgtgactccagaaagtgagaatctcaccttgtcgtcatctggagctattgaccag tcgtcttgcacagggacgcctctgtcatccactatttcctccccagaaggccctgccagc agcagcttggcccagtctgtcatgtccatggcatcctcccagatcagcactgacaccgtc 
tcctccatgtctggctcctacatcgcccctggcactgaagaggagggagaggctctctct tccccccagcctgccagcagggccccctcagaagaaggagaggggctgccagcggagtct ccagacagcaactttgctggcctcccagctggagagcaggatgcagagggaaatgatgtt atagaggaagaggatggatcacccacgcaggaaggtggccagaggacgtgcgcatttcta ggtatggagtgtgacaataacaatgactttgacatcgcaagcgtgaaagcactggacaat aagctgtgctctgaggtctgcttacctggtgcctggcaggctgatgacaatgccgtcagt cggaatgcccagcgccggcgcttgtcatccagcagcctggaggactctgagacgaggccc tgtagtattcacagtgtggtgttacctgggaggagggaggaaggagctgagatgaaggag attcagctctctggcccacagaaatactccatccaaggagctggtcctcagtcagcctga >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_4|351_aa MTEMSFLSNEVLVGDLMSPFDQSGLGAEESIGLLDDYVEVAKHFKPHGFSSDKAKAGSSE WLTVDGLVSPSNNSKEDAFSGTHWMLEKMDLKEFDFGALLGIDDLETMPDDLLTTLDDTC NLFAPLVQETNKVPPQMVNPIGHLPESLTKPDQVAPFTFLQLLPLSPGVQSSTPDHSFSL ELGSEVDITEEDRKPDSTAYVAMIPQCIKEEDTPSDNDSGICMSPESYLGSPQHSPSTRG SPNRSLPSPGVLCGSAHPKPYDPPGEKMVAAKVKGEKLDKKLKKMEQNKTAATRYRQKKR AEQEALTGECKELEKKNEALKERADSLAKEIQYLKDLIEEVRKARGKKRVP >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_4|1056_bp atgaccgaaatgagcttcctgagcaacgaggtgttggtgggggacttgatgtcccccttc gaccagtcgggtttgggggctgaagaaagcataggtctcttagatgactacgtggaggtg gccaagcacttcaaacctcatgggttctccagcgacaaggctaaggcgggctcctccgaa tggctgactgtggatgggttggtcagtccctccaacaacagcaaggaggatgccttctct gggacacattggatgttggagaaaatggatttgaaggagttcgactttggtgccctgttg ggtatagatgacctggaaaccatgccagatgaccttttgaccacgttggatgacacttgt aatctctttgcccccctagtccaggagactaataaggtgcccccccagatggtgaaccca attggccatctcccagaaagtttaacaaaacccgaccaggttgcccccttcaccttcttg caacttcttcccctttccccaggggtccagtcctccactccagatcattcctttagttta gagctgggcagtgaagtggatatcactgaagaagataggaagccggactccactgcttac gttgccatgatccctcagtgcataaaggaggaagacaccccttcagataatgatagtggc atctgtatgagcccagagtcctatctggggtctcctcagcatagcccctctaccaggggc tctccaaataggagcctcccatctccaggtgttctctgtgggtctgcccaccccaaacct tacgatcctcctggagagaagatggtagcagcaaaagtaaagggtgagaaactggataag aagctgaaaaaaatggagcaaaacaagacagcagccactaggtaccgccagaagaagagg 
gcggagcaggaggctctcactggcgagtgcaaagagctggaaaagaagaacgaggctcta aaagagagggcagattccctggccaaggagatccagtacctgaaagatttgatagaagag gtccgcaaggcaagggggaagaaaagggtcccctag >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_5|55_aa MGALTSRQHAGVEEVDIPSNSVYRYPPKSEEKVIEFEAKMGYLDEKIWQVIEDLD >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_5|168_bp atgggggccctgacgagccggcagcacgcgggcgtggaggaggtggacatcccgtctaat tccgtgtaccgctacccgcccaagtccgaagaaaaggttattgagtttgaggcaaagatg ggctatctggatgaaaagatatggcaggtgattgaagatttggactga >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_6|149_aa MAGLIALPARGNEGLSTRASGCGGCTGSPSSASPPALRSISRRALAAFPRGRARDLQPAM PEPPTPSVGSCAAPASPMSAAPCSTAPSPIDHPRAEECGHTARDWQAAPPAAPVRDPLGE ASWAPESGGDVENLYVQLRDCKYTNQHPV >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_6|450_bp atggcagggctcattgccctgccggcccggggcaatgaggggcttagcacccgggccagc ggctgcggagggtgtactgggtcccccagcagtgccagcccaccggctctgcgctcaatt tctcgccgggccttagctgccttcccgcggggcagggctcgcgacctgcagcccgccatg cctgagcctcccaccccctccgtgggctcctgtgcagccccagcctccccgatgagcgcc gccccctgctccacagcccccagtcccatcgaccacccaagggctgaggagtgcgggcac acggcgcgggactggcaggcagctccacctgcagccccagtgcgggatccactgggtgaa gctagctgggctcctgagtctggtggggacgtggagaacctttatgtccagctcagggat tgtaaatacaccaatcagcatcctgtctag >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_7|293_aa MLGSPTAPSSANGRGRAKRETVRAPQAAGFESNVLTGAGRWRSGAPANQRQSPNVGPAQP RPALLFVVSAGPRESNHNIRRAGGGGAGGGAAAAAVSASPGAPGGRPPAPAQPRVCVRPE TPGPAPRRAMSVNMDELRHQVMINQFVLAAGCAADQAKQLLQAAHWQFETALSTFFQETN IPNSHHHHQMMCTPSNTPATPPNFPDALAMFSKLRASEGLQSSNSPMTAAACSPPANFSP FWASSPPSHQAPWIPPSSPTTFHHLHRPQPTWPPGAQQGGAQQKAMAAMDGQR >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_7|882_bp atgctggggagccccacggcgccgagctccgccaatgggcgcggccgggcgaaaagagaa acagtgagggccccacaggccgcaggattcgaatcgaacgtccttacgggagccgggcgc tggcggagcggcgccccggccaatcagcggcagtccccgaacgttgggccggcccagccc cgccccgcgctgttgtttgtggtgtcggccgggccgcgggagtcgaaccacaacattcgc cgggcgggcggcggcggagcgggcggcggagcggcggcagcagcggtgagcgcgagcccc 
ggagccccgggcggccggcctcccgcgcccgcgcagccccgcgtctgcgtccggccggag acgccgggccccgcgccgcgccgcgccatgtcggtgaacatggacgagctgcggcaccag gtcatgatcaaccagttcgtgctggccgcgggctgcgcggccgaccaggcgaagcagttg ctgcaggcggcccactggcagttcgagaccgcgctgagcacgttcttccaagaaaccaac attcccaacagccaccaccaccaccagatgatgtgcactcccagcaacacccctgccacg ccgcccaacttccccgatgcgctggccatgttctccaagctccgcgcctccgagggcctg cagagcagcaacagccccatgacagccgcagcctgctccccacctgcaaacttcagcccc ttctgggcctcgtccccgcccagccaccaggcgccctggatcccgccctcctcccccacc accttccaccacctccaccgcccacagcccacgtggcccccaggagcacagcaggggggc gcccagcagaaagccatggcggccatggacggccagagatga >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_8|2085_aa MPPATTVSLRELADLSIGTPEVGAVNFTALHTLIVAMLKNLDLQNTRIDFQPSSPEPSRS LQSVRSSFSIPHLPAPKEVPKGAPREKRRGVGQAPSSALESQVKDLGGQVEDLSKQLKRV DGQVQGIATHVQHFSQASGLDLAALEWPEEQEVGVRAFDRVRTGSIMKDAAEELSFARVL LQRVDELEKLFKDREQFLVIPASSMGKGSPGGWPSLSVALSPGLVHPGQAELVSRKLSLV PGAEEVTMVTWEELEQAITDGWRASQAVNQWGPGPWSWGSETLMGFSKHGGFTSLTSPEG TLSGDSTKQPSIEQALDSASGLGPDRTASGSGGTAHPSDGVSSREQSKVPSGTGRQQQPR ARDEAGVPRLHQSSTFQFKSDSDRHRSREKLTSTQPRRNARPGPVQQDLPLARDQPSSVP ASQSQVHLRPDRRGLEPTGMNQPGLVPASTYPHGVVPLSMGQLGVPPPEMDDRELIPFVV DEQRMLPPSVPGRDQQGLELPSTDQHGLVSVSAYQHGMTFPGTDQRSMEPLGMDQRGCVI SGMGQQGLVPPGIDQQGLTLPVVDQHGLVLPFTDQHGLVSPGLMPISADQQGFVQPSLEA TGFIQPGTEQHDLIQSGRFQRALVQRGAYQPGLVQPGADQRGLVRPGMDQSGLAQPGADQ RGLVWPGMDQSGLAQPGRDQHGLIQPGTGQHDLVQSGTGQGVLVQPGVDQPGMVQPGRFQ RALVQPGAYQPGLVQPGADQIDVVQPGADQHGLVQSGADQSDLAQPGAVQHGLVQPGVDQ RGLAQPRADHQRGLVPPGADQRGLVQPGADQHGLVQPGVDQHGLAQPGEVQRSLVQPGIV QRGLVQPGAVQRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGAD QRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQPGADQPGLVQPGAGQLGMVQPGIG QQGMVQPQADPHGLVQPGAYPLGLVQPGAYLHDLSQSGTYPRGLVQPGMDQYGLRQPGAY QPGLIAPGTKLRGSSTFQADSTGFISVRPYQHGMVPPGREQYGQVSPLLASQGLASPGID RRSLVPPETYQQGLMHPGTDQHSPIPLSTGLGSTHPDQQHVASPGPGEHDQVYPDAAQHG HAFSLFDSHDSMYPGYRGPGYLSADQHGQEGLDPNRTRASDRHGIPAQKAPGQDVTLFRS PDSVDRVLSEGSEVSSEVLSERRNSLRRMSSSFPTAVETFHLMGELSSLYVGLKESMKDL 
DEEQAGQTDLEKIQFLLAQMVKRTIPPELQEQLKTVKTLAKEVWQEKAKVERLQRILEGE GNQEAGKELKAGELRLQLGVLSAQSYRPEASQDRGIARDSLDKIHWRGVEGERPPAVSAS PCVPDGPGSAWVWPECGVWQSLIVHHRVTVADIEKELAELRESQDRGKAAMENSVSEASL YLQDQLDKLRMIIESMLTSSSTLLSMSMAPHKAHTLAPGQIDPEATCPACSLDVSHQVST LVRRYEQLQDMVNSLAVSRPSKKAKLQRQDEELLGRVQSAILQVQGDCEKLNITTSNLIE DHRQKQKDIAMLYQGLEKLEKEKANREHLEMEIDVKADKSALATKVSRVQFDATTEQLNH MMQELVAKMSGQEQDWQKMLDRLLTEMDNKLDRLELDPVKQLLEDRWKSLRQQLRERPPL YQADEAAAMRSAIPVTPAGPGLPGHHSIRPYTVFELEQVRQHSRKYAMGQHMGHMGWESI VQRTSGCSWSPVGSSLKLGSAFPRGDLAQMEQSVGRLRSMHSKMLMNIEKVQIHFGGSTK ASSQIIRELLHAQCLGSPCYKRVTDMADYTYSTVPRRCGGSHTLTYPYHRSRPQHLPRGL YPTEEIQIAMKSRVKGPGEGCGCYSGDGICSLLQHDEVDILGLDGHIYKGRMDTRLPGIL RKDRPPPPSKLSTVPSAEPHRFSPRMPASRGNARSLRFVTSHHAFSCVCAFLDASLPALE TPGSGTSKRKSQQPRPHVHRPPSLSSNGQLPSRPQSAQISAGNTSVSSRQQKDRPSSEGR LSQPNTAHPPSSAAVANRGLERHVDMPPGEGLEEPTRGPRSSTAQ >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_8|6258_bp atgccgcccgcgaccacggtctccctccgggagctggcggacctctccatcggcacgcca gaggtgggcgccgtcaacttcacggccctgcacacgctcatcgtggccatgctcaagaac ctcgacctccaaaatacccggatcgacttccagccctcgtcgcccgagcccagccgctcg ctgcagtccgtccggagctcgttcagcatcccgcacctgcccgcgcccaaggaggtgccc aagggggcgccccgggagaagcgcaggggcgtgggccaggcgccttcgtcagcgctggag agccaagtgaaggacctgggcggccaggtggaggacctgagcaagcagctcaagcgtgtg gacggccaggtgcagggcatcgccacgcacgtgcagcacttctcccaggccagcgggctt gacctggccgcgctagagtggccggaggagcaggaggtgggcgtgcgggcgttcgatagg gtgcggactgggagtatcatgaaggacgccgccgaggagctcagctttgccagggtactt ttacagcgggttgatgaactagagaagctattcaaagatcgggagcaattcctggtaata ccagccagttccatgggaaagggctctccagggggctggccgtctctcagtgtggccctc tctccagggcttgtccatccagggcaggcagaactagtcagccggaagctgagtttggtt cctggtgcagaagaagtcaccatggtcacctgggaagagctggagcaggcgattacggac ggctggagagcctcacaagcggtgaaccagtgggggcctgggccgtggtcttggggctca gaaacacttatgggattttctaagcacggagggttcacttccttaacatcacctgaaggg actctaagcggagactctaccaagcaaccaagtattgagcaggctctggattctgccagt ggtcttggcccggatcggactgcatcaggatctggtggcacagcacacccctctgatggg 
gtttccagtagggaacaaagcaaggtcccctctggtactgggagacagcagcagccgagg gcccgtgatgaagctggcgtgccacgactccatcagtcttctacattccaattcaaatca gactcagatcgtcacaggagtagagagaagcttacctcgacacaaccaagaagaaatgca cgtcctggtccagttcaacaggacttacccttggccagagaccagcccagtagtgtgccc gctagccagagtcaggtccatctaaggccagatcgtcgtgggttagaaccaactggcatg aatcagcctggattagtgcctgctagcacttacccacatggtgtggtacccctcagcatg ggtcagcttggtgtgccaccacctgaaatggatgatcgggaattgataccatttgtcgtg gatgagcaacgtatgttgccaccatcagtacctggcagagaccagcaaggattggaacta cctagcacagaccaacatggtctggtttcagtcagtgcatatcagcatggtatgacattt cctggcacagaccaacgcagtatggaaccacttggcatggatcagcgtggatgtgtaata tcaggcatgggtcagcaaggactagtaccccctggtatagaccagcaaggattgacattg cctgtcgtcgatcaacatggcctggttctaccttttacagaccagcatggtttggtatca cctggtttgatgccaattagtgcagatcagcaaggttttgtgcagcccagtttggaagca actggcttcatacaacctggcacagagcagcatgatttgatccagtctggcagatttcag cgtgctttggtgcagcgtggtgcatatcagcctggcttggtccaacctggtgcagatcag cgtggtttggtccggcctggaatggatcagtctggtttggcccaacctggtgcagatcag cgtggtttggtctggcctggaatggatcagtctggtttggcccaacctggtagagatcag catggtttgatccagcctggcacaggtcagcatgatttggtccaatctggcacaggtcag ggtgtcttggtacagcctggtgtagatcagcctggcatggtccaacctggcagatttcag cgtgctttggtgcagcctggtgcatatcagcctggcttggtccaacctggtgcagatcag attgatgtggtgcaacctggtgcagatcagcatggtttggtacaatctggtgcagatcag agtgatttggctcaacctggtgcagttcagcatggtttggtccaacctggagtagatcag cgtggtttggcacaacctcgtgcagatcatcagcgtggtttggtcccacctggtgcagat cagcgtggtttggtccaacctggtgcagatcagcatggtttggtccaacctggagtggat cagcatggtttggcacaacctggtgaagttcagcgtagtttggtgcaacctggtatagtt cagcgtggtttggtgcaacctggtgcagttcagcgtggtttggtgcaacctggtgcagtt cagcgtggtttggtccaacctggagtggatcagcgtggtttggttcaacctggtgcagtt cagcgtggtttggtccaacctggtgcagttcagcatggtttggtccaacctggtgcagat cagcgtggtttggtccaacctggagtggatcagcgtggtttggtgcaacctggagtggat cagcgtggtttggtccaacctggaatggaccagcgtggtttgatccaacctggtgcagat cagcctggtttggtccagcctggtgcaggtcagctgggtatggtgcagcctggaataggt cagcaaggtatggtgcaacctcaggcagatccacatggcctggtacaacctggtgcctat 
cctcttggtttggtacaacctggtgcatatttgcatgatttatctcaatctgggacatat ccacgtggtctggtgcagccaggaatggatcagtatggtttgagacaacctggtgcatat cagccaggcttgatagcaccaggcacaaagcttcgtggctcttcaacattccaggcagat tctacaggttttatatcagtacgtccatatcaacatggtatggtacctcctggcagagaa caatacggccaggtgtcaccactcctagccagtcaaggtttggcatcacctggtatagat cgaaggagtttggtaccaccagaaacttatcagcaaggtttgatgcatcctggcacagac cagcacagcccaataccactgagtacaggtttgggatctacacacccagatcaacagcat gtggcatcacctggcccaggtgagcatgaccaggtatacccagatgcagctcagcatggc catgctttctctctctttgacagtcatgattcaatgtatcctggttatcgtggcccaggg tatctaagtgctgatcagcatggccaggaaggtttggatccaaatagaacacgagcctcg gaccgacatggaattcctgcccagaaggccccaggccaagatgtcactcttttcaggagt ccagactccgtcgaccgagtcttatcagaagggagcgaagtctcgagtgaagtcctgagt gagcgacgcaattcactgcgtagaatgagttctagtttccccacggcagtggagacattt catctgatgggagagctcagtagcctctatgtggggctaaaggagagtatgaaggatctg gatgaggagcaggccggccaaaccgacttggagaagatccagttcctgctggcacagatg gtcaaaaggaccatacctcctgaactgcaggagcagctgaagaccgtaaagacgctagcc aaagaagtttggcaggagaaagcaaaagtggaaaggctgcagaggatcctggaaggggaa gggaatcaagaagcagggaaggaactgaaagctggagagctgagattgcagctgggtgtc ctcagtgcccagtcctaccgcccagaagcctcccaggacagaggcatagcccgggactcc ctagacaaaatccattggaggggggtggagggtgagagacctccagctgtgtctgcatcc ccctgcgtccctgatggcccaggctctgcatgggtctggcctgagtgtggagtctggcag tctctcatcgtgcaccacagagtcaccgtggctgacatagaaaaggagctggccgagttg agggagagccaagacaggggcaaggctgccatggaaaattctgtctctgaagcctccctt tacctgcaggaccagttggacaagctcaggatgatcattgagagcatgctgacctcctcc tccacgctcctgtccatgagcatggccccgcacaaggcccacaccttggctcctggccag atcgaccctgaggccacctgtccagcctgcagcctggatgtgagccatcaggtcagcacg ctggtgcggcgctatgagcaactccaagacatggtcaacagcctggccgtctcccgaccc tccaagaaggccaagctccagagacaggacgaggagctgctgggccgtgtgcagagtgcc atcctgcaggtgcagggtgactgcgagaagctcaacatcaccaccagcaacctcatcgag gaccatcggcagaaacagaaggacattgctatgctgtaccagggtctggagaagctcgaa aaggaaaaggccaacagggagcacctggagatggagatcgatgtgaaagccgacaagagt gctctggccaccaaagtgagccgtgtccagtttgatgccaccacggagcagctgaaccac 
atgatgcaggagctggtggccaagatgagcgggcaggagcaggactggcagaagatgctg gacaggctgctcacagagatggacaacaagctggaccgcctggagctggacccagtgaag cagttgctggaggatcggtggaaatcgctgcgacagcagctcagggagcgccccccactc taccaggcagacgaggcggctgccatgcggagtgccatccccgtgacccccgcgggtcca ggcctacctgggcaccattccatccgcccctacacggtgtttgaactggagcaggtccgg cagcatagccgcaagtatgctatgggacagcatatgggacatatgggatgggagtccata gttcagaggacttcaggctgctcttggagccctgtgggttccagcctcaagctgggcagc gccttccctcggggtgacctggcgcagatggagcagagcgtggggcgcctgcgctccatg cactccaagatgctgatgaacattgagaaggtgcagatccacttcgggggctccaccaag gccagcagccagataatccgcgagctgctgcacgcccagtgcctgggctccccctgctac aaacgggtgacagatatggctgattacacctactcaactgtgccccggcgctgcgggggc agccacaccctcacctacccctaccaccgcagccgcccgcagcaccttccccggggcctg tatcctactgaagagatccagattgccatgaagtcccgtgtgaaggggccaggggagggc tgtggctgctacagtggcgacggtatctgttcacttctacagcatgatgaggtggacatc ttgggcctggatggccacatttacaagggacggatggacacaaggctgccaggcatcctc cgaaaagacaggcctcctcctccttcaaaactcagcacagtgccctctgctgagccccac cgcttcagcccccgaatgcccgcctcccgtggcaatgcccgaagcctgcgctttgtcact tcccatcatgcatttagctgtgtatgcgcctttctagatgcctcccttcctgccctggaa accccaggctcagggacctcaaagcgcaagtcccagcagcccaggccccacgtgcacagg ccgccatccctcagcagcaatggccagctgccctctcggccacagagcgcccagatttcg gctggcaacacctcagtttcttctcgtcaacagaaagatagaccttcctccgagggccgt ctctcccagccgaacacagcccacccgcccagctccgccgcggtggcaaacagggggctg gagaggcacgtggacatgcctcctggggaggggctcgaggagcccacgcgggggccgcgg tccagcaccgctcagtga >gi568815581r:76125754_76326806|GENSCAN_predicted_peptide_9|123_aa MAKEKPPITVVGDVGGRIAIIVDDIIDDVESFVAAAEILKERGAYKIYVMATHGILSAEA PRLIEESSVDEVVVTNTVPHEVQKLQCPKIKTVDISLILSEAIRRIHNGESMAYLFRNIT VDD >gi568815581r:76125754_76326806|GENSCAN_predicted_CDS_9|372_bp atggccaaagagaagccaccgataactgtagttggagatgttggaggccgcatcgcaatc atcgtggatgacattattgacgatgtggagagttttgttgctgccgcggagatcctgaaa gagagaggcgcctataagatctatgttatggccacccacggcatcctgtctgcagaggcc cctcgcctgattgaggagtcctccgtagacgaggtggtggtgacgaatactgtccctcat gaggttcagaagctgcaatgtcccaagataaagactgtggatatcagtttgattctttct 
gaagccattcggagaatccacaatggagagtccatggcctaccttttccgaaacatcact gtggatgactag