GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:24:14 Sequence gi568815583f:74032755_74234989 : 202235 bp : 50.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 402 660 259 2 1 91 91 146 0.837 12.24 1.02 Intr + 1724 1776 53 0 2 100 109 97 0.991 11.63 1.03 Intr + 2418 3005 588 2 0 29 28 303 0.604 11.22 1.04 Intr + 3874 3937 64 0 1 94 60 54 0.800 1.49 1.05 Intr + 5078 5201 124 0 1 101 50 45 0.863 1.64 1.06 Intr + 7638 7719 82 0 1 68 70 53 0.764 1.14 1.07 Intr + 10235 10385 151 1 1 117 37 199 0.999 17.44 1.08 Intr + 11309 11377 69 0 0 86 101 12 0.727 1.45 1.09 Term + 11467 12254 788 2 2 111 54 1006 0.990 92.99 1.10 PlyA + 13996 14001 6 1.05 2.00 Prom + 15004 15043 40 -7.36 2.01 Init + 19851 19906 56 2 2 77 61 76 0.123 2.88 2.02 Intr + 23225 23321 97 2 1 67 37 113 0.101 4.11 2.03 Intr + 29586 29654 69 2 0 104 82 38 0.257 4.18 2.04 Intr + 30016 30198 183 0 0 96 4 306 0.168 22.98 2.05 Term + 30935 31036 102 1 0 85 47 158 0.658 9.68 2.06 PlyA + 33638 33643 6 1.05 3.33 PlyA - 34241 34236 6 1.05 3.32 Term - 38283 38156 128 1 2 59 49 145 0.995 6.14 3.31 Intr - 38523 38370 154 0 1 57 101 151 0.669 12.95 3.30 Intr - 38716 38613 104 2 2 87 97 141 0.999 14.79 3.29 Intr - 39086 38987 100 2 1 30 64 82 0.666 -0.32 3.28 Intr - 39555 39464 92 1 2 85 101 84 0.998 9.11 3.27 Intr - 40063 39988 76 1 1 117 87 54 0.998 7.29 3.26 Intr - 40303 40187 117 1 0 20 -19 197 0.797 2.86 3.25 Intr - 41462 41382 81 2 0 86 46 163 0.826 11.73 3.24 Intr - 42247 41739 509 2 2 76 68 560 0.567 45.49 3.23 Intr - 42736 42649 88 1 1 94 86 93 0.999 9.24 3.22 Intr - 43036 42929 108 1 0 126 78 31 0.988 6.38 3.21 Intr - 43282 43145 138 1 0 17 94 153 0.846 9.46 3.20 Intr - 44370 44264 107 1 2 86 96 142 0.997 14.73 3.19 Intr - 44548 44467 82 1 1 48 81 83 0.969 2.81 3.18 Intr - 44699 44649 51 2 0 111 105 17 0.930 4.80 3.17 Intr - 44999 44961 39 2 0 119 21 57 0.561 0.52 3.16 Intr - 45935 45855 81 2 0 78 111 76 0.832 8.73 3.15 Intr - 47981 47862 120 2 0 116 76 9 0.901 3.19 3.14 Intr - 55685 55585 101 0 2 -8 115 130 0.731 5.93 3.13 Intr - 56377 56276 102 2 0 123 31 82 0.571 6.15 3.12 Intr - 57452 57361 92 1 2 101 28 -9 0.284 -5.96 3.11 Intr - 58697 58577 121 0 1 114 82 166 0.988 18.15 3.10 Intr - 63025 62944 82 1 1 87 61 67 0.314 3.11 3.09 Intr - 64087 63918 170 2 2 90 50 33 0.358 -0.63 3.08 Intr - 64439 64359 81 0 0 71 92 57 0.617 4.01 3.07 Intr - 64719 64600 120 1 0 -25 102 151 0.675 5.67 3.06 Intr - 64906 64769 138 2 0 59 24 102 0.518 1.34 3.05 Intr - 65077 65038 40 1 1 81 85 74 0.348 4.20 3.04 Intr - 65660 65553 108 2 0 134 -7 85 0.560 4.08 3.03 Intr - 67651 67269 383 2 2 73 94 130 0.428 6.73 3.02 Intr - 75129 75007 123 1 0 80 43 48 0.375 0.06 3.01 Init - 85173 85134 40 0 1 73 100 35 0.357 3.59 3.00 Prom - 93914 93875 40 -4.36 4.00 Prom + 97702 97741 40 -4.76 4.01 Sngl + 100001 102238 2238 1 0 77 46 2829 0.998 269.20 4.02 PlyA + 104029 104034 6 1.05 5.00 Prom + 107558 107597 40 -3.86 5.01 Init + 110443 110491 49 0 1 82 58 45 0.133 0.01 5.02 Intr + 121028 121137 110 0 2 24 115 79 0.084 4.20 5.03 Term + 122578 122727 150 0 0 29 40 147 0.158 1.91 5.04 PlyA + 124926 124931 6 1.05 6.00 Prom + 128094 128133 40 -4.86 6.01 Sngl + 142105 143391 1287 0 0 107 41 1876 0.916 178.78 6.02 PlyA + 144096 144101 6 1.05 7.17 PlyA - 146254 146249 6 1.05 7.16 Term - 147489 147326 164 1 2 63 55 248 0.971 17.10 7.15 Intr - 148183 148028 156 2 0 69 91 163 0.948 14.68 7.14 Intr - 148704 148541 164 2 2 100 107 126 0.980 15.32 7.13 Intr - 149508 149407 102 2 0 69 93 87 0.990 6.59 7.12 Intr - 149706 149589 118 1 1 81 89 162 0.996 15.12 7.11 Intr - 151235 151102 134 1 2 102 85 28 0.992 4.19 7.10 Intr - 152301 152226 76 1 1 95 109 37 0.987 5.07 7.09 Intr - 156588 156361 228 1 0 78 100 295 0.310 27.54 7.08 Intr - 158489 158413 77 1 2 91 80 35 0.505 2.16 7.07 Intr - 158737 158670 68 1 2 108 75 -8 0.533 -2.40 7.06 Intr - 161168 161046 123 2 0 38 98 111 0.969 7.88 7.05 Intr - 162714 162548 167 1 2 95 86 69 0.921 7.08 7.04 Intr - 163393 163254 140 0 2 83 75 64 0.958 4.71 7.03 Intr - 164669 164584 86 2 2 109 72 54 0.838 4.52 7.02 Intr - 165064 164998 67 0 1 90 70 10 0.794 -1.69 7.01 Init - 169513 169401 113 1 2 43 98 174 0.444 13.58 7.00 Prom - 177716 177677 40 -5.26 8.00 Prom + 182095 182134 40 -8.06 8.01 Init + 184518 184827 310 2 1 92 86 206 0.990 16.53 8.02 Intr + 185743 186107 365 2 2 88 94 255 0.872 21.00 8.03 Intr + 187207 187237 31 0 1 108 41 36 0.351 -1.40 8.04 Term + 188496 188581 86 1 2 91 54 70 0.481 1.72 8.05 PlyA + 188781 188786 6 1.05 9.04 PlyA - 190204 190199 6 1.05 9.03 Term - 193383 193321 63 0 0 92 42 34 0.069 -2.91 9.02 Intr - 200749 200592 158 2 2 63 69 89 0.606 4.23 9.01 Intr - 201661 201565 97 1 1 106 72 38 0.752 3.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_1|725_aa DVSNTTTAQKRKCSQTQCPRKVIKMESEEGKEARLARSSPEQPRPSTSKAVSPPHLDGPP SPRSPVIGSEVFLPNSNHVASGAGEAEERVVVISSSEDSDAENSCMEPMETAEPQSSPAH SSPAHSSPAHSSPVQSLLRAQGASSLPCGTYHPPAWPPHQPAEQAATPDAEPHSEPPDHQ ERPAVHRGIRYLLYRAQRAIRLRHALRLHPQLHRAPIRTWSPHVVQASTPAITGPLNHPA NAQEHPAQLQRGISPPHRIRGAVRSRSRSLRGSSHLSQWLNNFFALPFSSMASQLDMSSV GGPHAPQLIPDTHLPTLIEGRRALSHAPDFSYCPGVADTQHSHLPARPAWPVTFQQLRMD AHSKSKSFPVTVPLYAPPTPSYPNLRNKAKSSRELDDSSSESSDLQLEGPSTLRVLDENL ADPQAEDRPLVFFDLKIDNERSELHLGADVWLLPADPKVRSLDAQKISQLAAVNRESKFR VVIQPEAFFSIYSKAVSLEVGLQHFLSFLSSMRRPILACYKLWGPGLPNFFRALEDINRL WEFQEAISGFLAALPLIRERVPGASSFKLKNLAQTYLARNMSERSAMAAVLAMRDLCRLL EVSPGPQLAQHVYPFSSLQCFASLQPLVQAAVLPRAEARLLALHNVSFMELLSAHRRDRQ GGLKKYSRYLSLQTTTLPPAQPAFNLQALGTYFEGLLEGPALARAEGVSTPLAGRGLAER ASQQS >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_1|2178_bp gatgtctccaatacaacgacagcccagaagaggaagtgcagccagacccagtgccccagg aaggtcatcaagatggagtctgaggaggggaaggaggcaaggttggctcggagctccccg gagcagcccaggcccagcacctccaaggcagtctcaccaccccacctggatggaccgcct agccccaggagccccgtcataggaagtgaggtcttcctgcccaacagcaaccacgtggcc agtggcgccggggaggcagaggaacgcgttgtggtgatcagcagctcggaagactcagat gccgaaaactcgtgcatggagcccatggagaccgccgagccacagtcctcgccagcccac tcctcgccagcccactcctcgccagcccactcctcgccagtccagtctctgctgagagca caaggagcctccagcctgccctgtggcacataccaccccccagcttggcctccccaccag cccgctgagcaggctgccacccccgatgctgagcctcacagcgagcctcctgatcaccag gagcgccctgccgtccaccgtgggatccgctacctgttgtacagagcacagagagccatc cgccttcgccatgccctccgcttgcaccctcaattgcatcgggcccctattcggacttgg tctccccatgtggtccaagccagcactcctgccatcacagggcccctcaaccatcctgcc aatgcccaggaacatcctgcccagctgcaaaggggcatcagcccaccccaccggatacga ggggctgtgcgatcccgcagccgctccctccggggctcctcccatttatcccagtggctc aacaacttttttgccctccccttctcctccatggcttcccagcttgacatgtcttccgtg ggtgggccccatgctccccagctcatcccagacacccacctgcccaccctcattgagggc agacgtgctctcagccacgcccctgacttcagttactgcccaggggtagctgatacccaa cactcgcacctgcctgccagaccagcatggccagtgaccttccaacagctccgcatggat gcccatagcaaatctaagtcttttccagtgactgtccctctctatgctcccccgacaccc agttatcctaatttgcggaataaggccaagtcctcccgagagctggatgacagcagcagt gagtccagtgacctccagctggaaggccccagcaccctcagggtcctggacgagaacctt gctgacccccaagcagaagacagacctctggttttctttgacctcaagattgacaatgaa agatcagagctgcatttgggggctgatgtgtggctactgccagcagaccccaaggtgagg tctctagatgcccagaagattagccagctggctgcggtgaaccgggaaagcaagttccgc gtggtcatccagcctgaagccttcttcagcatctactccaaggccgtgtccctggaggtg gggctgcagcacttcctcagctttctgagctccatgcgccgccctatcttggcctgctac aagctgtgggggcctggcctcccaaacttcttccgggccctggaggacattaacaggctg tgggaattccaggaggccatctcgggcttcctggctgccctgcctctcatccgggagcgt gtgcccggggccagcagcttcaaactcaagaacctggcccagacctacctggcgagaaac atgagcgagcgcagcgccatggctgccgtgctggccatgcgtgacctgtgccgcctcctc gaggtctccccgggcccccagctggcccagcatgtctaccccttcagtagcctgcagtgc tttgcctccctgcagcccctggtgcaggcagctgtgctgccccgggctgaggcccgcctc ctggccctacacaacgtgagcttcatggagctgctgagtgcacaccgccgtgaccggcag gggggcctgaagaagtacagccgctatctaagcctgcagaccaccacgttgccccctgcc cagcctgctttcaacctgcaggctctgggcacctactttgaaggcctgttggagggtccg gcgctggcacgggcagaaggagtctccaccccacttgctggccgtggcttggcagagagg gcctcccagcagagctga >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_2|168_aa MVRRPSTGSGQRAPPAPRKTLFKVPSWKQEQPLRDYKPAGPLILDFSDTKTVAVIPGGSG GGNVGVWASQDKGNASKAEENVSDSFMHSMDPQLEQQMETTQSLVDSYVTIVNKTVWDLM VGLTPKTIMHLMINNTKEFIFSELLANLYLHGDKNMLMEESAEQAQRS >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_2|507_bp atggtccgcaggcccagcactgggagcggccagcgggccccgccagccccccgcaagaca ctgttcaaggtgccatcttggaagcaggagcagcccttgcgagactacaagcctgctggc cccttgatcttggacttctcagacaccaaaactgtggctgtgatcccaggtggcagcggt ggcggcaatgttggggtgtgggcctcccaggacaaggggaatgccagcaaggctgaggag aatgtctccgacagcttcatgcactccatggacccacagctggagcagcaaatggagacc actcagagcctggtggactcctatgtgaccattgtcaacaagaccgtgtgggacctcatg gttggtctcacgcccaagaccatcatgcacctcatgatcaacaacaccaaggagtttatc ttctcggagctgctggccaacctgtacttgcatggggacaagaacatgctgatggaggag tcggcagagcaggcacagcggtcatga >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_3|1291_aa MRPGFVVSVGMVEGVPGEPDPYPVLCYTPWPGHDLPASRAALAQGRSWSLHRMIGQDSGR RRSRRQHFAPGTSSGLRSAPGLTRAGPAPPEAVSPSHVIVDSADLAGPEKEIPGPWLPRA MYEAPGVKRAWAAGAGMRGRQWLRKRVEVVCTGRSANTVCAGVRAAGLVEKSPPPSLSRM GRRFRFCGDLDCPDRVLAEISTLAKMVECTGSTLGGGGYKKILKLTADAKFVEEHEGKKD IDSPTFLSSASSWWSSSSAYGSRRPGGFELKLIGQQGESGDVKATVAVLSFILSGAAKHS VDGKSLASELQQLGLPKEHAASPCCCYEEKQSPLQKHLRVCSLRTWSLAARVAEGTAETV DPSAAPKTSVWSSRVCWGRLLATISPSVNETDTCGMEDRRLGVQALACTVPHDGLGWRHP EEGGTHSGGSPLEHLEPKKLSLTFAIPGTRPLIMMKVDPNPLRGLKEQRFLAEHDVLAHF LDYPQKVYRTTWGHHICLPAPSAVLQALAVAIQLGGHLADPLLQVDPLSSCGAEVQRNLV PAAVNPYVMIFLPALFRVLVLAAVFGQLKEYQQRKSPGIPAGAKTKKKKTDSSPETTTSG GCHSPGDSQYQELAVALESSSVTISQLNENIESLKQQKKQVEHQLEEAKKTNNEIHKAQM ERLETINILTLEKADLKTTLYHTKRAARHFEEESKDLAGRLQYSLQRIQELERALCAVST QQQEEDRGHCLSSPDQNFSLFTIQSSSCREAVLQRWLQQTIKERALLNAHVTQVTESLKQ VQLERDEYAKHIKGERARWQERMWKMSVEARTLKEEKKRDIHRIQELERSLSELKNQMAE PPSLAPPAVTSVVEQLQDEAKHLRQEVEGLEGKLQSQVENNQALSLLSKEQKQRLQEQEE MLREQEAQRVREQERLCEQNERLREQQKTLQEQGERLRKQEQRLRKQEERLRKEEERLQK QEKRLWDQEERLWKKEERLQKQEERLALSQNHKLDKQLAEPQCSFEDLNNEKKSALQLEQ QVKELQEKLDEVKEMQYMATYQQLTSEKEALHRQLLLQTQFVDQLQQQEAWGKAEHLEAA SHQNQQLETQLSLVALPGEGDGGQHLDSEEEEAPRPTPNIPEDLESREATEPEAEAPAPG SGGEFVCGESYRALKEAMVKLKGRESFTVYESQGAVPNTRHQEMEDVIRLAQKEEEMKVK LLELQELVLPLVGNHEGHGKFLIAAQNPADEPTPGAPAPQELGAAGEQDVFYEVSLDNNV EPAPGAAREGSPHDNPTVQQIVQLSPVMQDT >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_3|3876_bp atgagaccaggcttcgtggtgtcagtgggaatggtggaaggtgtacctggagagcctgat ccatacccagtgctctgctacaccccatggcctggccacgacctccctgcctccagagca gccctggcccagggaagaagctggtctctgcatcgaatgatagggcaggattctgggaga cggaggagcagacgccagcatttcgcccccggcacctccagcgggctgcggtccgctccc ggcctcactcgagcaggtcccgccccgccagaggccgttagtccaagtcacgtgatcgtc gactcagctgacctggcgggaccggaaaaagaaattcccgggccctggcttcctcgcgcg atgtatgaggcaccaggggtgaagcgggcatgggctgctggagcgggaatgagggggcgc cagtggctccggaaacgggttgaggttgtctgcactggccgctccgcaaacacagtgtgt gcgggcgtcagggcagcaggtctggtagagaaaagtccaccgccatccctctcccgaatg gggcggaggtttcggttctgtggtgatctggactgtcctgaccgggtcctggcagagatc agcacgctggccaagatggttgagtgcacagggtctacactgggtggaggagggtataag aagatcctgaaactcacagctgatgccaagtttgtggaggagcatgagggaaagaaagac attgacagtcccaccttcctgtcctctgccagctcctggtggagcagtagcagtgcctat ggctccaggaggcctgggggctttgagctgaagttaatagggcaacagggagagtcaggc gatgtgaaggccacagtggcagtgctgagtttcatcctctccggtgcggccaagcacagt gtcgatggcaaatccttggccagtgaactgcagcagctggggctgcccaaagagcacgcg gccagcccgtgctgctgttatgaggagaagcaaagccccttgcagaagcacttgcgggtc tgcagcctacgcacgtggtccctggcagcacgagtggcagaagggacagcagagactgtg gacccctcagctgcacctaagacctctgtgtggtctagccgtgtgtgctgggggaggctt ttggccaccatttccccctctgtaaatgagaccgatacctgtggcatggaggacaggagg ctgggtgtgcaggccctggcgtgcacagtgccacatgacggcctgggctggcgccaccct gaagaaggtggcacccactctggcgggtctcccttggagcacctggagcccaagaagctg tccctcacctttgcaatcccagggactcgccccctgatcatgatgaaggtggacccaaac ccgctgaggggcctgaaggagcaaaggttcttggctgagcatgatgttcttgcccatttc cttgattatcctcaaaaggtttatagaaccacttggggacatcacatctgtctaccagcc ccctctgctgtcctgcaggctctggcggtggccatccagcttggtggccatctggctgat ccactcctccaggtggaccctctgtcctcatgtggtgcagaagtacagaggaacctggtg ccggccgcagtcaacccctatgtgatgatctttctacccgccctctttcgtgtcttggtg ctggctgctgtctttggccagctaaaagaatatcagcaaaggaagagccctggtattcca gcaggagcaaagacaaaaaagaaaaaaactgacagtagccctgagacaaccacttccggt ggttgccactcacctggggatagccagtaccaagaactagcagtagccctggagtcaagc tcagtgacaatcagtcaactcaatgaaaacatagaatcattgaaacagcagaagaaacaa gtggaacatcagctggaagaagcaaagaaaacaaacaatgaaatacacaaagcacaaatg gagcggttagagacaatcaacatcctcacattggaaaaggcagacttgaagaccaccctt taccatactaaacgtgctgcccgacacttcgaagaagagtccaaggatctggctggccgc ctgcaatactccttacagcgtattcaagaattggagcgggctctctgtgctgtgtctaca cagcagcaggaagaggacagggggcactgcctgagctccccagatcaaaacttctcactc ttcaccatccagtcctcgagctgcagagaagcggtcctccagcggtggttacagcagacc ataaaggagcgggcgctgctgaacgcacacgtgacacaggtgacagagtcactaaaacaa gtccagctagagcgagacgaatatgctaaacacataaaaggagagagggcccggtggcag gagaggatgtggaaaatgtcggtggaggctcgaacattgaaggaagagaagaagcgtgac atacatcggatacaggagctggagaggagcttgtccgaactcaaaaaccagatggctgag cccccatccctggcgcccccagcagtgacctctgtggtggaacagctacaagatgaggcc aaacacctgaggcaggaggtggaaggtctggagggaaagctccagtcccaggtggaaaac aatcaggccttgagtctccttagcaaggaacaaaagcagagactccaggagcaggaggag atgctccgagagcaggaggcgcagagagtgcgggagcaggagagactgtgtgaacaaaac gagaggcttcgggagcagcagaagacgctacaggagcagggtgagaggctgcgaaagcag gagcagaggctacgcaaacaggaggagaggctgcgaaaggaggaggagaggctgcaaaag caggaaaagaggctgtgggaccaggaggagaggctgtggaagaaggaggagaggctacaa aagcaggaggagaggctcgcgctctcccagaaccacaagctcgacaagcagctggccgag ccacagtgcagcttcgaggatctgaacaacgagaaaaagagcgcactgcagttggagcag caagtaaaggagctgcaggagaagctagacgaggtgaaggagatgcagtacatggccacc tatcagcagctgacctctgagaaggaggcgctgcacaggcagttactgctgcagacccag ttcgtggaccagctgcagcagcaggaagcttggggcaaagcggagcacctagaagctgcc agccaccagaaccaacagctagagacccagctaagcctcgtggctctccctggagaagga gatggaggacaacatctggacagtgaggaggaggaggcgcctcggcccacgccaaacatc ccagaggacctggagagccgggaggccacggagccagaggcagaggccccagccccaggg agtgggggtgagtttgtgtgtggggagagctaccgggccctgaaggaggccatggtgaag ctgaaagggagagagtccttcaccgtatatgaaagccagggggcagtgccaaacacgcgg caccaggagatggaggatgtcatcaggctggcccagaaggaggaggagatgaaggtgaag ctgctggagctgcaagagttggtgttgccccttgtgggcaaccatgaggggcatggcaaa ttcctcatcgctgcccagaaccctgctgatgagcccactccaggggccccagccccccag gaacttggggctgccggtgagcaggatgttttttatgaagtgagcctggacaacaacgtg gagcctgcaccaggagcggccagggagggttctccccatgacaaccccactgtacagcag atcgtgcagctgtctcctgtcatgcaggacacctag >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_4|745_aa MFPLRALWLVWALLGVAGSCPEPCACVDKYAHQFADCAYKELREVPEGLPANVTTLSLSA NKITVLRRGAFADVTQVTSLWLAHNEVRTVEPGALAVLSQLKNLDLSHNFISSFPWSDLR NLSALQLLKMNHNRLGSLPRDALGALPDLRSLRINNNRLRTLAPGTFDALSALSHLQLYH NPFHCGCGLVWLQAWAASTRVSLPEPDSIACASPPALQGVPVYRLPALPCAPPSVHLSAE PPLEAPGTPLRAGLAFVLHCIADGHPTPRLQWQLQIPGGTVVLEPPVLSGEDDGVGAEEG EGEGDGDLLTQTQAQTPTPAPAWPAPPATPRFLALANGSLLVPLLSAKEAGVYTCRAHNE LGANSTSIRVAVAATGPPKHAPGAGGEPDGQAPTSERKSTAKGRGNSVLPSKPEGKIKGQ GLAKVSILGETETEPEEDTSEGEEAEDQILADPAEEQRCGNGDPSRYVSNHAFNQSAELK PHVFELGVIALDVAEREARVQLTPLAARWGPGPGGAGGAPRPGRRPLRLLYLCPAGGGAA VQWSRVEEGVNAYWFRGLRPGTNYSVCLALAGEACHVQVVFSTKKELPSLLVIVAVSVFL LVLATVPLLGAACCHLLAKHPGKPYRLILRPQAPDPMEKRIAADFDPRASYLESEKSYPA GGEAGGEEPEDVQGEGLDEDAEQGDPSGDLQREESLAACSLVESQSKANQEEFEAGSEYS DRLPLGAEAVNIAQEINGNYRQTAG >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_4|2238_bp atgttcccccttcgggccctgtggttggtctgggcgcttctaggagtggccggatcatgc ccggagccgtgcgcctgcgtggacaagtacgctcaccagttcgcggactgcgcttacaaa gagttgcgtgaggtgccggaaggactgcctgccaacgtgacgacgcttagtctgtccgcg aacaagatcactgtgctgcggcgcggggccttcgccgacgtcacacaggtcacgtcgctg tggctggcgcacaatgaggtgcgcaccgtggagccaggcgcactggccgtgctgagtcag ctcaagaacctcgatctgagccacaacttcatatccagctttccgtggagcgacctgcgc aacctgagcgcgctgcagctgctcaaaatgaaccacaaccgcctgggctctctgccccgg gacgcactcggtgcgctacccgacctgcgttccctgcgcatcaacaacaaccggctgcgt acgctggcgcctggcaccttcgacgcgcttagcgcgctgtcacacttgcaactctatcac aatcccttccactgcggctgcggccttgtgtggctgcaggcctgggccgcgagcacccgg gtgtccttacccgagcccgactccattgcttgtgcctcgcctcccgcgctgcagggggtg ccggtgtaccgcctgcccgccctgccctgtgcaccgcccagcgtgcatctgagtgccgag ccaccgcttgaagcacccggcaccccactgcgcgcaggactggcgttcgtgttacactgc atcgccgacggccaccctacgcctcgcctgcaatggcaacttcagatccccggtggcacc gtagtcttagagccaccggttctgagcggggaggacgacggggttggggcggaggaagga gagggagaaggagatggggatttgctgacgcagacccaagcccaaacgccgactccagca cccgcttggccggcgcccccagccacaccgcgcttcctggccctcgcaaatggctccctg ttggtgcccctcctgagtgccaaggaggcgggcgtctacacttgccgtgcacacaatgag ctgggcgccaactctacgtcaatacgcgtggcggtggcagcaaccgggcccccaaaacac gcgcctggcgccgggggagaacccgacggacaggccccgacctctgagcgcaagtccaca gccaagggccggggcaacagcgtcctgccttccaaacccgagggcaaaatcaaaggccaa ggcctggccaaggtcagcattctcggggagaccgagacggagccggaggaggacacaagt gagggagaggaggccgaagaccagatcctcgcggacccggcggaggagcagcgctgtggc aacggggacccctctcggtacgtttctaaccacgcgttcaaccagagcgcagagctcaag ccgcacgtcttcgagctgggcgtcatcgcgctggatgtggcggagcgcgaggcgcgggtg cagctgactccgctggctgcgcgctggggccctgggcccggcggggctggcggagccccg cgacccgggcggcgacccctgcgcctactctatctgtgtccagcggggggcggcgcggca gtgcagtggtcccgcgtagaggaaggcgtcaacgcctactggttccgcggcctgcggccg ggtaccaactactccgtgtgcctggcgctggcgggcgaagcctgccacgtgcaagtggtg ttttccaccaagaaggagctcccatcgctgctggtcatagtggcagtgagcgtattcctc ctggtgctggccacagtgccccttctgggcgccgcctgctgccatctgctggctaaacac ccgggcaagccctaccgtctgatcctgcggcctcaggcccctgaccctatggagaagcgc atcgccgcagacttcgacccgcgtgcttcgtacctcgagtccgagaaaagctacccggca ggcggcgaggcgggcggcgaggagccagaggacgtgcagggggagggccttgatgaagac gcggagcagggagacccaagtggggacctgcagagagaggagagcctggcggcctgctca ctggtggagtcccagtccaaggccaaccaagaggagttcgaggcgggctctgagtacagc gatcggctgcccctgggcgccgaggcggtcaacatcgcccaggagattaatggcaactac aggcagacggcaggctga >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_5|102_aa MGFRHVSQAGLKLLTSELNTHLGQESEQLKEDQEKLEEVQAQLLPHAQEVSQQTCTGDIL ESPALAELSADSNHMSAPNKISRRTTQIPAYRTMKNDQSSSS >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_5|309_bp atgggctttcgccatgttagccaggctggtctcaaactcctgacctcagagcttaacaca cacctgggccaggagtcagagcaactgaaagaggatcaggagaagttggaggaggtacaa gcccagctgctgccccatgcccaagaggtcagccagcagacatgcacgggggacatcctg gagtctccagctctggctgagctgtcagctgacagcaaccacatgagtgctcccaacaag atcagcagaagaaccacacagataccagcctacaggactatgaaaaatgatcaatcatct tcatcttaa >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_6|428_aa MQELHLLWWALLLGLAQACPEPCDCGEKYGFQIADCAYRDLESVPPGFPANVTTLSLSAN RLPGLPEGAFREVPLLQSLWLAHNEIRTVAAGALASLSHLKSLDLSHNLISDFAWSDLHN LSALQLLKMDSNELTFIPRDAFRSLRALRSLQLNHNRLHTLAEGTFTPLTALSHLQINEN PFDCTCGIVWLKTWALTTAVSIPEQDNIACTSPHVLKGTPLSRLPPLPCSAPSVQLSYQP SQDGAELRPGFVLALHCDVDGQPAPQLHWHIQIPSGIVEITSPNVGTDGRALPGTPVASS QPRFQAFANGSLLIPDFGKLEEGTYSCLATNELGSAESSVDVALATPGEGGEDTLGRRFH GKAVEGKGCYTVDNEVQPSGPEDNVVIIYLSRAGNPEAAVAEGVPGQLPPGLLLLGQSLL LFFFLTSF >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_6|1287_bp atgcaggagctgcatctgctctggtgggcgcttctcctgggcctggctcaggcctgccct gagccctgcgactgtggggaaaagtatggcttccagatcgccgactgtgcctaccgcgac ctagaatccgtgccgcctggcttcccggccaatgtgactacactgagcctgtcagccaac cggctgccaggcttgccggagggtgccttcagggaggtgcccctgctgcagtcgctgtgg ctggcacacaatgagatccgcacggtggccgccggagccctggcctctctgagccatctc aagagcctggacctcagccacaatctcatctctgactttgcctggagcgacctgcacaac ctcagtgccctccaattgctcaagatggacagcaacgagctgaccttcatcccccgcgac gccttccgcagcctccgtgctctgcgctcgctgcaactcaaccacaaccgcttgcacaca ttggccgagggcaccttcaccccgctcaccgcgctgtcccacctgcagatcaacgagaac cccttcgactgcacctgcggcatcgtgtggctcaagacatgggccctgaccacggccgtg tccatcccggagcaggacaacatcgcctgcacctcaccccatgtgctcaagggtacgccg ctgagccgcctgccgccactgccatgctcggcgccctcagtgcagctcagctaccaaccc agccaggatggtgccgagctgcggcctggttttgtgctggcactgcactgtgatgtggac gggcagccggcccctcagcttcactggcacatccagatacccagtggcattgtggagatc accagccccaacgtgggcactgatgggcgtgccctgcctggcacccctgtggccagctcc cagccgcgcttccaggcctttgccaatggcagcctgcttatccccgactttggcaagctg gaggaaggcacctacagctgcctggccaccaatgagctgggcagtgctgagagctcagtg gacgtggcactggccacgcccggtgagggtggtgaggacacactggggcgcaggttccat ggcaaagcggttgagggaaagggctgctatacggttgacaacgaggtgcagccatcaggg ccggaggacaatgtggtcatcatctacctcagccgtgctgggaaccctgaggctgcagtc gcagaaggggtccctgggcagctgcccccaggcctgctcctgctgggccaaagcctcctc ctcttcttcttcctcacctccttctag >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_7|660_aa MSSQPAGNQTSPGATEDYSYGSWYIDEPQGGEELQPEGEVPSCHTSIPPGLYHACLASLS ILVLLLLAMLVRRRQLWPDCVRGRPGLPSPVDFLAGDRPRAVPAAVFMVLLSSLCLLLPD EDALPFLTLASAPSQGAWKILGLFYYAALYYPLAACATAGHTAAHLLGSTLSWAHLGVQV WQRAECPQVPKIYKYYSLLASLPLLLGLGFLSLWYPVQLVRSFSRRTGAGSKGLQSSYSE EYLRNLLCRKKLGSSYHTSKHGFLSWARVCLRHCIYTPQPASPLLAQKTLSWDGALTVGL FQVALLLLVGVVPTIQKVRAGVTTDVSYLLAGFGIVLSEDKQEVVELVKHHLWALEVCYI SALVLSCLLTFLVLMRSLVTHRTNLRALHRGAALDLSPLHRSPHPSRQAIFCWMSFSAYQ TAFICLGLLVQQIIFFLGTTALAFLVLMPVLHGRNLLLFRSLESSWPFWLTLALAVILQN MAAHWVFLETHDGHPQLTNRRVLYAATFLLFPLNVLVGAMVATWRVLLSALYNAIHLGQM DLSLLPPRAATLDPGYYTYRNFLKIEVSQSHPAMTAFCSLLLQAQSLLPRTMAAPQDSLR PGEEDEGMQLLQTKDSMAKGARPGASRGRARWGLAYTLLHNPTLQVFRKTALLGANGAQP >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_7|1983_bp atgtcgtcccagccagcagggaaccagacctcccccggggccacagaggactactcctat ggcagctggtacatcgatgagccccaggggggcgaggagctccagccagagggggaagtg ccctcctgccacaccagcataccacccggcctgtaccacgcctgcctggcctcgctgtca atccttgtgctgctgctcctggccatgctggtgaggcgccgccagctctggcctgactgt gtgcgtggcaggcccggcctgcccagccctgtggatttcttggctggggacaggccccgg gcagtgcctgctgctgttttcatggtcctcctgagctccctgtgtttgctgctccccgac gaggacgcattgcccttcctgactctcgcctcagcacccagccaaggggcctggaagata ctgggactgttctattatgctgccctctactaccctctggctgcctgtgccacggctggc cacacagctgcacacctgctcggcagcacgctgtcctgggcccaccttggggtccaggtc tggcagagggcagagtgtccccaggtgcccaagatctacaagtactactccctgctggcc tccctgcctctcctgctgggcctcggattcctgagcctttggtaccctgtgcagctggtg agaagcttcagccgtaggacaggagcaggctccaaggggctgcagagcagctactctgag gaatatctgaggaacctcctttgcaggaagaagctgggaagcagctaccacacctccaag catggcttcctgtcctgggcccgcgtctgcttgagacactgcatctacactccacagcca gcgtctcccctgttagcacagaaaaccctttcctgggatggggccctcactgtggggctc ttccaggtggccctgctgctgctggtgggcgtggtacccactatccagaaggtgagggca ggggtcaccacggatgtctcctacctgctggccggctttggaatcgtgctctccgaggac aagcaggaggtggtggagctggtgaagcaccatctgtgggctctggaagtgtgctacatc tcagccttggtcttgtcctgcttactcaccttcctggtcctgatgcgctcactggtgaca cacaggaccaaccttcgagctctgcaccgaggagctgccctggacttgagtcccttgcat cggagtccccatccctcccgccaagccatattctgttggatgagcttcagtgcctaccag acagcctttatctgccttgggctcctggtgcagcagatcatcttcttcctgggaaccacg gccctggccttcctggtgctcatgcctgtgctccatggcaggaacctcctgctcttccgt tccctggagtcctcgtggcccttctggctgactttggccctggctgtgatcctgcagaac atggcagcccattgggtcttcctggagactcatgatggacacccacagctgaccaaccgg cgagtgctctatgcagccacctttcttctcttccccctcaatgtgctggtgggtgccatg gtggccacctggcgagtgctcctctctgccctctacaacgccatccaccttggccagatg gacctcagcctgctgccaccgagagccgccactctcgaccccggctactacacgtaccga aacttcttgaagattgaagtcagccagtcgcatccagccatgacagccttctgctccctg ctcctgcaagcgcagagcctcctacccaggaccatggcagccccccaggacagcctcaga ccaggggaggaagacgaagggatgcagctgctacagacaaaggactccatggccaaggga gctaggcccggggccagccgcggcagggctcgctggggtctggcctacacgctgctgcac aacccaaccctgcaggtcttccgcaagacggccctgttgggtgccaatggtgcccagccc tga >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_8|263_aa MAFRGPEPWVSASLLRQRLKAEEKTLDLEFEVLSVGFNEAGRYALRLSAENPLQVGSGAG VQLQVNDGDPFPACSAITDVIEQQEPGQSLTLTRSKFIFTLPKGFCKNDGQHDAQLHVEA LRLDEPLGRAAQRVGEAIFPIYPRPDQPRMNPKAQDHEDLYRYCGNLALLRASTDPTARH CGSLAYSVAFHVHRGPQPPVSDSPPRAGQPELMSPCPEPQLPILQCIGLVLKWSPELMMV MGGSGAGAGEQDEQECNNQDDPE >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_8|792_bp atggcattcagggggcctgaaccctgggtctctgcatccctgctgagacagaggctgaag gccgaggagaagacgctggatctggagttcgaagttttgagcgtggggtttaatgaggcg ggtagatacgccctgagactgtcagcagagaaccccctgcaggtgggctctggggctggg gtgcagttgcaagtgaatgatggggaccccttccctgcctgctctgctatcactgatgtc attgagcagcaggagcctggccagagcctcaccctcaccaggagcaagtttatctttact ttgcccaaaggtttctgcaagaatgacgggcagcatgatgctcagctgcatgtggaggca ctgaggctggacgaacccttgggacgggcagcccagcgggtgggtgaggccatcttcccc atctacccgaggccagaccaaccccgcatgaacccaaaggctcaggatcacgaggacctg taccgctactgtggcaacctggctctgctccgggctagcacggaccccacagcccgacac tgtgggagcctggcctacagtgtggccttccacgtccaccggggccctcagcctccagtc tcagacagccctcccagggctggccagccagaactgatgtcaccatgcccagagccccag ctccccatactgcagtgcattggtctggtgctgaagtggagcccagaactgatgatggtc atggggggcagtggagcaggggcaggagagcaggatgagcaggaatgcaataatcaagat gatccagaatga >gi568815583f:74032755_74234989|GENSCAN_predicted_peptide_9|105_aa QGELYIPSPEKWACSSNVQGPAPSCDDSGRFQETPEAEPHNLYEVRAALEGPGQQPVDSE RGALSEFSILQPTSEDLLSTPLGPRPGSSLKAGHTATQFTAWESE >gi568815583f:74032755_74234989|GENSCAN_predicted_CDS_9|318_bp caaggtgagctctatatccccagcccagagaagtgggcttgctcctcaaatgtccagggg ccagccccaagctgtgatgattcagggagattccaggaaactcctgaggctgagcctcat aacttatatgaggtcagagcagccctcgaaggtcctggccagcagccggtggattcagag cggggagccctctcagagttttccatccttcagccaacaagtgaagacctgctgtctaca ccactggggccgaggcctggcagctccctgaaggctggacatacagcaacccagtttact gcttgggaaagtgaatga