GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:21:40 Sequence gi568815579f:41788191_42006783 : 218593 bp : 53.40% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8488 8551 64 0 1 98 109 13 0.780 3.83 1.02 Intr + 9399 9758 360 1 0 100 84 287 0.985 25.25 1.03 Intr + 20623 20740 118 2 1 120 76 182 0.978 20.23 1.04 Intr + 21775 21827 53 1 2 73 105 75 0.902 6.74 1.05 Intr + 22133 22164 32 0 2 142 111 -16 0.995 4.43 1.06 Intr + 22326 22459 134 2 2 62 47 55 0.621 -1.25 1.07 Intr + 22642 22707 66 1 0 130 105 -16 0.581 2.81 1.08 Term + 23958 24057 100 0 1 118 39 56 0.600 1.60 1.09 PlyA + 25034 25039 6 1.05 2.00 Prom + 39932 39971 40 0.79 2.01 Sngl + 42194 42751 558 1 0 78 43 413 0.977 32.31 2.02 PlyA + 43062 43067 6 1.05 3.04 PlyA - 44827 44822 6 1.05 3.03 Term - 49155 48953 203 1 2 148 47 144 0.997 14.17 3.02 Intr - 50071 49745 327 2 0 116 94 103 0.990 9.92 3.01 Init - 50745 50691 55 0 1 88 72 68 0.945 6.59 3.00 Prom - 53794 53755 40 -2.71 4.00 Prom + 55130 55169 40 -2.81 4.01 Init + 56161 56299 139 0 1 64 119 10 0.195 1.97 4.02 Intr + 59282 59462 181 0 1 69 76 142 0.166 10.54 4.03 Intr + 59546 59691 146 2 2 93 76 57 0.983 5.44 4.04 Intr + 60262 60338 77 2 2 64 116 14 0.952 1.53 4.05 Intr + 60605 60785 181 1 1 110 99 94 0.995 12.66 4.06 Intr + 60940 61066 127 2 1 91 94 18 0.727 2.94 4.07 Intr + 62122 62182 61 1 1 111 34 8 0.426 -3.07 4.08 Intr + 62336 62510 175 1 1 145 80 100 0.954 14.93 4.09 Intr + 64711 64771 61 2 1 56 88 41 0.590 -0.82 4.10 Intr + 71944 72099 156 1 0 102 115 24 0.810 6.14 4.11 Intr + 72216 72255 40 0 1 72 65 40 0.386 -1.19 4.12 Intr + 72585 72655 71 2 2 104 87 85 0.951 8.47 4.13 Intr + 72922 73022 101 1 2 78 81 207 0.991 19.25 4.14 Intr + 80841 81024 184 1 1 69 94 177 0.521 15.76 4.15 Intr + 81509 81563 55 2 1 118 110 38 0.994 8.47 4.16 Term + 83161 83187 27 0 0 157 42 57 0.999 6.36 4.17 PlyA + 83206 83211 6 1.05 5.00 Prom + 84712 84751 40 -5.31 5.01 Init + 89115 89193 79 2 1 73 71 125 0.856 8.49 5.02 Intr + 90800 91099 300 0 0 142 78 384 0.999 40.05 5.03 Intr + 91345 91463 119 2 2 81 84 141 0.732 13.69 5.04 Intr + 92480 92548 69 1 0 79 81 128 0.999 11.07 5.05 Term + 92677 92790 114 0 0 104 47 224 0.999 18.77 5.06 PlyA + 93162 93167 6 1.05 6.02 PlyA - 93468 93463 6 -0.45 6.01 Sngl - 96462 96205 258 0 0 49 42 243 0.402 11.29 6.00 Prom - 97875 97836 40 -4.51 7.00 Prom + 99586 99625 40 -8.19 7.01 Init + 99893 99916 24 1 0 66 101 55 0.739 2.68 7.02 Intr + 100002 100088 87 2 0 99 79 118 0.999 12.66 7.03 Intr + 100562 100675 114 1 0 88 81 145 0.986 14.95 7.04 Intr + 103835 103933 99 1 0 45 81 149 0.195 10.71 7.05 Intr + 104141 104183 43 1 1 83 87 43 0.899 2.00 7.06 Intr + 104413 104659 247 2 1 84 73 465 0.996 41.65 7.07 Intr + 105084 105113 30 0 0 102 78 22 0.557 0.33 7.08 Intr + 106017 106116 100 0 1 104 44 181 0.999 15.91 7.09 Intr + 106261 106357 97 0 1 92 86 170 0.996 17.28 7.10 Intr + 106436 106471 36 0 0 117 50 28 0.542 0.62 7.11 Intr + 107159 107296 138 0 0 101 94 126 0.999 15.44 7.12 Intr + 108187 108292 106 2 1 108 34 101 0.987 6.47 7.13 Intr + 109756 109923 168 1 0 63 60 139 0.948 8.18 7.14 Intr + 110252 110397 146 2 2 44 101 87 0.572 6.04 7.15 Intr + 113697 113843 147 1 0 66 68 304 0.843 26.92 7.16 Intr + 114084 114166 83 1 2 84 80 192 0.983 17.85 7.17 Intr + 114343 114468 126 0 0 48 94 196 0.981 17.58 7.18 Intr + 114594 114708 115 2 1 55 86 211 0.992 18.12 7.19 Intr + 115117 115217 101 2 2 85 93 164 0.995 16.93 7.20 Intr + 115517 115594 78 1 0 84 70 131 0.997 11.14 7.21 Intr + 115845 115920 76 2 1 81 92 127 0.856 11.98 7.22 Intr + 116026 116193 168 2 0 48 117 419 0.996 41.23 7.23 Intr + 116759 116846 88 0 1 131 59 150 0.999 15.93 7.24 Intr + 116985 117071 87 0 0 96 94 24 0.912 3.38 7.25 Intr + 117570 117637 68 0 2 73 81 65 0.490 3.34 7.26 Intr + 117749 117835 87 0 0 82 63 86 0.959 5.94 7.27 Term + 118267 118460 194 2 2 88 48 166 0.968 10.51 7.28 PlyA + 119242 119247 6 -4.04 8.06 PlyA - 119535 119530 6 1.05 8.05 Term - 120486 120038 449 1 2 131 52 421 0.997 38.56 8.04 Intr - 120987 120870 118 0 1 67 99 -2 0.988 -0.66 8.03 Intr - 121281 121086 196 2 1 92 105 246 0.997 26.64 8.02 Intr - 121907 121673 235 0 1 119 77 368 0.999 36.08 8.01 Init - 124729 124663 67 1 1 100 50 99 0.588 6.51 8.00 Prom - 129357 129318 40 -5.31 9.00 Prom + 130242 130281 40 -3.31 9.01 Init + 132812 132886 75 1 0 77 59 94 0.929 4.50 9.02 Intr + 134902 135043 142 0 1 112 82 80 0.527 10.24 9.03 Intr + 140641 140796 156 2 0 83 61 89 0.514 6.19 9.04 Term + 145145 145260 116 0 2 70 54 105 0.440 4.34 9.05 PlyA + 155247 155252 6 1.05 10.07 PlyA - 157964 157959 6 1.05 10.06 Term - 166218 166023 196 2 1 84 38 104 0.372 2.20 10.05 Intr - 168744 168660 85 1 1 69 39 163 0.483 8.88 10.04 Intr - 168929 168828 102 0 0 64 92 58 0.950 4.45 10.03 Intr - 170193 170096 98 2 2 107 77 57 0.996 6.65 10.02 Intr - 170758 170546 213 0 0 116 56 406 0.998 38.56 10.01 Init - 171102 171047 56 0 2 99 68 65 0.988 6.35 10.00 Prom - 173940 173901 40 -5.81 11.24 PlyA - 176081 176076 6 -0.45 11.23 Term - 178775 178747 29 0 2 132 43 35 0.985 1.92 11.22 Intr - 179150 179059 92 1 2 129 60 189 0.988 20.34 11.21 Intr - 179573 179472 102 1 0 111 89 212 0.999 23.39 11.20 Intr - 180725 180595 131 2 2 79 53 214 0.999 17.00 11.19 Intr - 181390 181245 146 2 2 137 92 293 0.999 35.11 11.18 Intr - 182118 181995 124 0 1 102 73 195 0.695 20.06 11.17 Intr - 182352 182198 155 1 2 96 89 317 0.999 32.90 11.16 Intr - 187607 187439 169 2 1 119 96 384 0.998 42.33 11.15 Intr - 188376 188226 151 2 1 122 80 332 0.999 36.48 11.14 Intr - 189882 189746 137 0 2 35 105 286 0.726 24.87 11.13 Intr - 190136 189961 176 0 2 105 70 349 0.676 34.98 11.12 Intr - 190608 190416 193 0 1 72 50 449 0.926 39.09 11.11 Intr - 193446 193312 135 0 0 120 78 190 0.908 22.47 11.10 Intr - 193641 193532 110 1 2 72 98 36 0.977 3.50 11.09 Intr - 193916 193718 199 2 1 135 99 365 0.977 41.75 11.08 Intr - 196996 196728 269 2 2 98 65 534 0.975 49.89 11.07 Intr - 197233 197116 118 1 1 110 100 116 0.982 15.54 11.06 Intr - 197808 197674 135 0 0 52 86 282 0.999 25.67 11.05 Intr - 198039 197926 114 0 0 117 72 239 0.991 26.35 11.04 Intr - 199949 199746 204 2 0 96 92 292 0.999 30.32 11.03 Intr - 200187 200128 60 0 0 79 64 78 0.936 3.92 11.02 Intr - 200411 200286 126 2 0 84 103 273 0.694 29.78 11.01 Init - 205886 205881 6 2 0 107 61 0 0.357 0.18 11.00 Prom - 206773 206734 40 -8.68 12.08 PlyA - 210157 210152 6 1.05 12.07 Term - 211109 210681 429 2 0 72 47 500 0.987 39.98 12.06 Intr - 211516 211355 162 1 0 73 67 46 0.632 1.69 12.05 Intr - 213773 213657 117 2 0 129 20 24 0.635 0.87 12.04 Intr - 215263 215142 122 2 2 113 78 179 0.957 20.12 12.03 Intr - 215493 215365 129 1 0 67 86 248 0.964 23.67 12.02 Intr - 217758 217518 241 0 1 65 54 586 0.289 50.55 12.01 Init - 218550 218455 96 0 0 47 75 200 0.997 14.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 103856 103933 78 1 0 99 81 110 0.804 12.41 S.002 Init - 152760 152686 75 0 0 73 5 184 0.896 7.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_1|308_aa MGPPSASPHRECIPWQGLLLTASLLNFWNPPTTAKLTIESMPLSVAEGKEVLLLVHNLPQ HLFGYSWYKGERVDGNSLIVGYVIGTQQATPGAAYSGRETIYTNASLLIQNVTQNDIGFY TLQVIKSDLVNEEATGQFHVYQENAPGLPVGAVAGIVTGVLVGVALVAALVCFLLLAKTG RTSIQRDLKEQQPQALAPGRGPSHSSAFSGQDHPRPVLTPPRDPPSPEETPGVQQVQSGH WDHQSGAHINKGAGCPLSPLPRPPYPTPGQQLPSMRALHSGLTSSSTTYLGRQFSVPPEG PGECNDLM >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_1|927_bp atggggcccccctcagcctctccccacagagaatgcatcccctggcaggggcttctgctc acagcctcacttctaaacttctggaacccgcccaccactgccaagctcactattgaatcc atgccgctcagtgtcgcagaggggaaggaggtgcttctacttgtccacaatctgccccag catctttttggctacagctggtacaaaggggaaagagtggatggcaacagtctaattgta ggatatgtaataggaactcaacaagctaccccaggggccgcatacagcggtcgagagaca atatacaccaatgcatccctgctgatccagaatgtcacccagaatgacataggattctac accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta taccaagaaaatgccccaggccttcctgtgggggccgtcgccggcatcgtgaccggggtc ctggtcggagtggcgctggtggccgcgctggtgtgtttcctgctccttgccaaaactgga agaaccagcatccagcgtgacctcaaggagcagcagccccaagcccttgcccctggccgt ggtccctcccacagctctgccttctcgggtcaggaccacccacgccctgtgctgacccct ccccgggaccctccatcacctgaggagacccctggagtgcagcaggtgcagagtggacac tgggaccatcagtcaggggcccacattaacaagggtgcaggatgtcccctctctccactg cccaggcccccctacccaaccccaggacagcagcttccatctatgagggccctccactca ggcctcaccagcagctccaccacatacctgggccggcagttctctgtccctcctgagggc cctggggagtgtaatgatctcatgtag >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_2|185_aa MTDRGSGKKRGFAFLTFDDRDSVGKIIIQKYHTVNGHNCEIREALSKQEIPSASSSQRDQ SSSGNFGGGHGGGFGGNDNFGHGGNFSGHGGFGGSHVVGGYGGSGDGYNGFGNDGSNFGR SGSYNDFGNYNNQSSNFGPMKRGNFGGRSSGPYSGGGQYLPNHETKVAMAVPVAAVAMAV AEDFN >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_2|558_bp atgactgaccgaggcagtggcaagaaaaggggctttgcctttctaacctttgatgaccgt gactccgtgggtaagattatcattcagaaataccatactgtgaatggccacaactgtgaa attagggaagccctatcaaagcaagagatacctagtgcttcatccagccaaagagatcaa agcagttctggaaattttggtggtggtcatggaggtggttttggtgggaatgacaacttt ggtcatggaggaaacttcagtggtcatggtggctttggtggcagccatgttgttggtgga tatggtggcagtggggatggctataatggatttggtaatgatggaagcaactttggacgt agtggaagctacaatgattttggcaattacaacaatcagtcttcaaattttggacccatg aagagaggaaactttggaggcagaagctctggcccctatagtggtggaggccaatatttg ccaaaccacgaaaccaaggtggctatggcggttccagtagcagcagtagctatggcagtg gcagaagattttaattag >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_3|194_aa MVCKLQEGCEETLVFIETGTARGVVGFKGCSSSSSYPAQISYLVSPPGVSIASYSRVCRS YLCNNLTNLEPFVKLKASTPKSITSASCSCPTCVGEHMKDCLPNFVTTNSCPLAASTCYS STLKFQAGFLNTTFLLMGCAREHNQLLADFHHIGSIKVTEVLNILEKSQIVGAASSRQDP AWGVVLGLLFAFRD >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_3|585_bp atggtgtgtaagctgcaagagggctgcgaggagacgctagtgttcattgagacagggact gcaaggggagtcgtgggctttaaaggctgcagctcgtcttcgtcttaccctgcgcaaatc tcctaccttgtttccccacccggagtgtccattgcctcctacagtcgcgtctgccggtct tatctctgcaacaacctcaccaatttggagccttttgtgaaactcaaggccagcactcct aagtctatcacatctgcgtcctgtagctgcccgacctgtgtgggcgagcacatgaaggat tgcctcccaaattttgtcaccactaattcttgccccttggctgcttctacgtgttacagt tccaccttaaaatttcaggcagggtttctcaataccaccttcctcctcatggggtgtgct cgtgaacataaccagcttttagcagattttcatcatattgggagcatcaaagtgactgag gtcctcaacatcttagagaagtctcagattgttggtgcagcatcctccaggcaagatcct gcttggggtgtcgtcttaggcctcctgtttgccttcagggactga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_4|593_aa MKHGRKWVRRGRASERKGKSEIRKITTDTKTRRDTRKGVQSEEGGPDSAPWDETRDPQST ELIPRRAISRSPTCARCRNHGVTAHLKGHKRLCLFQACECHKCVLILERRRVMAAQVALR RQQEAQLKKHLMRRGEASPKAPNHFRKGTTQPQVPSGKENIAPQPQTPHGAVLLAPTPPG KNSCGPLLLSHPPEASPLSWTPVPPGPWVPGHWLPPGFSMPPPVVCRLLYQEPAVSLPPF PGFDPGTSLQLPTHGPFTTCPGSHPVLTAPLSGEPQGPPSQPRTHSTLILQPCGTPDPLQ LQPQASGASCLARTSGPSEWQLQQEAAEALVGLKDSSQAPRVTPSVPPNPAWISLLHPCG PPATLEVMQDGSHFTSEETSLKRRGRTFALREAEPQRLPSLSRAFGTLARPYARLVRPGN PVVPFPLAGSAEAARAAATCERQARTCPMPGVTVKDVNQQEFVRALAAFLKKSGKLKVPE WVDTVKLAKHKELAPYDENWFYTRAASTARHLYLRGGAGVGSMTKIYGGRQRNGVMPSHF SRGSKSVARRVLQALEGLKMVEKDQDGGRKLTPQGQRDLDRIAGQVAAANKKH >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_4|1782_bp atgaagcacgggagaaaatgggtccgaagagggagagcgtccgagagaaaaggaaaatca gagattaggaaaatcacaaccgacacaaagacaagaagagacacaaggaaaggggtgcaa tctgaggaaggagggcccgactctgccccctgggatgagaccagagacccccagagcaca gagctgatccccaggagagccatcagccgctctccaacctgcgcccgctgccgcaaccat ggtgtcaccgcccatctcaagggccacaagcgcctctgcctcttccaggcttgcgagtgt cacaaatgtgtcctcatcctggagcgccgcagggtcatggctgcccaggtggccttgcgt aggcagcaggaggcgcagctaaagaagcacctgatgaggagaggggaagcctctcccaaa gctcccaaccacttcagaaagggaaccactcagccacaggtcccctctggaaaggagaac atagcaccccagcctcagaccccccatggggcagtcctgctggcaccgacaccccccggg aagaactcctgtgggcctctgctgctcagccatcccccggaagcctcgcccttgtcctgg actccggtgcctcctggcccttgggtccctggacactggctgcctccaggcttctccatg ccaccaccagtggtgtgccgcctgctgtaccaagaacctgctgtctctctgcctcccttc cctggctttgaccctggcacctccctccagctgcccactcatgggcccttcaccacctgc ccaggatctcacccagtactgacagctcctctttctggagagccccaagggccccctagc cagccccgcacacactcaactctgatactccagccctgtggcaccccagaccctcttcag ctacagccacaggcctctggagcctcgtgcctggcccggacatctggcccctcagagtgg cagctgcagcaagaggcagctgaagccctcgtggggctgaaagattcatcccaggctcct cgtgtgaccccttctgtgccccccaaccctgcctggatctccctgcttcacccctgtggc ccaccagcaaccctggaagttatgcaggatggttcccattttacaagtgaagaaacaagc ttaaagaggcggggacgaactttcgccctgagagaggcggagcctcagcgtctaccctcg ctctcgcgagctttcggaactctcgcgagaccctacgcccgacttgtgcgcccgggaaac cccgtcgttccctttcccctggctggcagcgcggaggccgcacgagccgcggccacgtgc gagcggcaggcccggacatgcccgatgcctggagttactgtaaaagacgtgaaccagcag gagttcgtcagagctctggcagccttcctcaaaaagtccgggaagctgaaagtccccgaa tgggtggataccgtcaagctggccaagcacaaagagcttgctccctacgatgagaactgg ttctacacgcgagctgcttccacagcgcggcacctgtacctccggggtggcgctggggtt ggctccatgaccaagatctatgggggacgtcagagaaacggcgtcatgcccagccacttc agccgaggctccaagagtgtggcccgccgggtcctccaagccctggaggggctgaaaatg gtggaaaaggaccaagatggcggccgcaaactgacacctcagggacaaagagatctggac agaatcgccggacaggtggcagctgccaacaagaagcattag >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_5|226_aa MPGGPGVLQALPATIFLLFLLSAVYLGPGCQALWMHKVPASLMVSLGEDAHFQCPHNSSN NANVTWWRVLHGNYTWPPEFLGPGEDPNGTLIIQNVNKSHGGIYVCRVQEGNESYQQSCG TYLRVRQPPPRPFLDMGEGTKNRIITAEGIILLFCAVVPGTLLLFRKRWQNEKLGLDAGD EYEDENLYEGLNLDDCSMYEDISRGLQGTYQDVGSLNIGDVQLEKP >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_5|681_bp atgcctgggggtccaggagtcctccaagctctgcctgccaccatcttcctcctcttcctg ctgtctgctgtctacctgggccctgggtgccaggccctgtggatgcacaaggtcccagca tcattgatggtgagcctgggggaagacgcccacttccaatgcccgcacaatagcagcaac aacgccaacgtcacctggtggcgcgtcctccatggcaactacacgtggccccctgagttc ttgggcccgggcgaggaccccaatggtacgctgatcatccagaatgtgaacaagagccat gggggcatatacgtgtgccgggtccaggagggcaacgagtcataccagcagtcctgcggc acctacctccgcgtgcgccagccgccccccaggcccttcctggacatgggggagggcacc aagaaccgaatcatcacagccgaggggatcatcctcctgttctgcgcggtggtgcctggg acgctgctgctgttcaggaaacgatggcagaacgagaagctcgggttggatgccggggat gaatatgaagatgaaaacctttatgaaggcctgaacctggacgactgctccatgtatgag gacatctcccggggcctccagggcacctaccaggatgtgggcagcctcaacataggagat gtccagctggagaagccgtga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_6|85_aa MRRVYTRPKSYMDVAGLTRALNPTGDVAVCTARDAGEDVQSQGLTPAGDRVPHLLQVERE AIAATGTEASALGLRTLPTPAAASL >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_6|258_bp atgaggcgggtctacacgagacccaaatcttacatggacgtggcgggtcttacgcgggcc ctaaatcctacgggggacgtggcagtgtgcacggccagggatgccggggaggacgtgcag agtcagggtctgacacccgctggggaccgcgtccctcacctgctccaggtggaaagagaa gccatcgccgccaccggaaccgaagcctccgcgctcgggctccgcactctccccacccct gccgccgcgtcgctctag >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_7|950_aa MEDFARGAASPGPSRPGLVPVSIIGAEDEDFENELETNSEEQNSQFQSLEQVKRRPAHLM ALLQHVALQFEPGPLLCCLHADMLGSLGPKEAKKAFLDFYHSFLEKTAVLRVPVPPNVAF ELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLMGMTPWEQELAQLEAWVG RDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIGLYMRHLGVRTKSGDKKS GRNFFRKKVMGNRRSDEPAKTKKGLSSILDAARWNRGEPQVPDFRHLKAEVDAEKPGATD RKGGVGMPSRDRNIGAPGQDTPGVSLHPLSLDSPDREPGADAPLELGDSSPQGPMSLESL APPESTDEGAETERLSGRLGRSESLRVSDRRRPSRGSLGAKGRGGGRSRSDVDMDPSSAT AVLGPARRATPEPGDEGEPGRSGLELEPEEPPGWRELVPPDTLHSLPKSQVKRQEVISEL LVTEAAHVRMLRVLHDLFFQPMAECLFFPLEELQNIFPSLDELIEVHSLFLDRLMKRRQE SGYLIEEIGDVLLARFDGAEGSWFQKISSRFCSRQSFALEQLKAKQRKDPRFCAFVQEAE SRPRCRRLQLKDMIPTEMQRLTKYPLLLQSIGQNTEEPTEREKVELAAECCREILHHVNQ AVRDMEDLLRLKDYQRRLDLSHLRQSSDPMLSEFKNLDITKKKLVHEGPLTWRVTKDKAV EVHVLLLDDLLLLLQRQDERLLLKSHSRTLTPTPDGKTMLRPVLRLTSAMTREVATDHKA FYVLFTWDQEAQIYELVAQTVSERKNWCALITETAGSLKVPAPASRPKPRPSPSSTREPL LSSSENGNGGRETSPADARTERILSDLLPFCRPGPEGQLAATALRKVLSLKQLLFPAEED NGAGPPRDGDGVPGGGPLSPARTQEIQENLLSLEETMKQLEVGRDGPGVP >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_7|2853_bp atggaagacttcgcccgaggggcggcctccccaggcccctcccggcctggcctggttccc gtcagcatcatcggggctgaggatgaggattttgagaacgagctggagacaaactcagaa gagcaaaacagccagttccagagcctggagcaggtgaagcggcgcccagcccacctcatg gccctcctgcagcacgtggccctgcagtttgagccaggacccctgctttgctgtctgcat gccgacatgctgggctcactgggccccaaggaggccaagaaggccttcctggacttctac cacagcttcctggagaagacagcggttctccgggtgccggtccctcccaacgtcgccttt gaacttgaccgcactagggctgacctcatctccgaggatgtccagcggcggttcgtgcag gaggtggtgcaaagccagcaggtagccgtgggccggcagctggaggacttccgttccaag cggctcatgggcatgacgccctgggagcaggagctggcccagctggaggcttgggttggg cgggaccgagccagctacgaggcccgggagcggcacgtggcggagcggctgctcatgcac ctggaggagatgcaacataccatctctaccgacgaagaaaagagtgctgccgtggtcaac gccattggcctgtacatgcgccaccttggggtgcggaccaagagtggagacaagaagtcg gggaggaacttcttccggaaaaaggtgatggggaaccggcggtcggacgagcctgccaag accaagaaggggctgagcagcatcctggatgccgcccgctggaaccggggagagccccag gttccagattttcgacacctcaaagcagaggttgatgccgagaagccaggtgctacagac cggaagggaggcgtggggatgccctctcgggaccggaatatcggggctcctgggcaggac acccctggagtctctctgcaccctctgtccctggacagcccagaccgggaaccaggtgct gacgcccccctggagctgggggactcatccccgcagggcccaatgagcctggagtccttg gcgcccccagagagtaccgacgagggggccgaaaccgagaggctatcagggcgtctgggg cgctcagagagcctgcgggtgagtgaccgccgccggccttcccggggcagcctcggggct aagggccggggtgggggccgctcccggagcgacgtggacatggaccccagttccgccacg gcagtgcttggccctgcccgacgagccacccccgagcctggagatgagggggagccgggg cggtcgggactggagcttgaaccagaagagcctcccggctggcgggaactcgtcccccca gacaccctgcacagcctgcccaagagccaggtgaagcggcaggaggtcatcagcgagctg ctggtgacagaggcggcccacgtgcgcatgctgcgggtgctgcacgacctcttcttccag cccatggcagaatgcctgttcttccccttggaggagctgcagaacatcttccccagcctg gacgagctcatcgaggtgcattccctgttcctcgatcgcctgatgaagcggaggcaggag agtggctacctcatcgaggagatcggagacgtgctgctggcccggtttgatggtgctgag ggctcctggttccagaaaatctcctcccgcttctgcagccgccagtcatttgccttagag cagctcaaagccaagcaacgcaaggaccctcggttctgtgccttcgtgcaggaagctgag agccgcccgcggtgccgccgcctgcagctgaaggacatgatccccacggagatgcagcgg ctgaccaagtaccccctgctcctgcagagcatcgggcagaacacagaagagcccacagaa cgggagaaagtggagctggcagccgagtgctgccgggaaattctacaccacgtcaaccaa gccgtgcgtgacatggaggacctgctgaggctcaaggactatcagcggcgcctggacttg tcccaccttcggcagagcagcgaccctatgctgagcgagttcaagaacctggacatcacc aagaagaaattggtccacgagggcccactgacgtggcgggtgactaaggacaaggcagtg gaggtgcatgtgctgctgctggacgacctgctgctgctgctccagcgccaggacgagcgg ctgctgctcaagtcccatagccggacactgacgcccacgcccgatggcaagaccatgctg cggcccgtgctgcggctcacctccgccatgacccgcgaggtggccaccgatcacaaagcc ttctacgtcctttttacctgggaccaggaggcccagatatacgagctggtggcacagact gtgtcggagcggaaaaactggtgtgctctcatcactgagactgccggatccctgaaagtc cctgcccctgcctctcgccctaagccccggcccagcccgagcagcacccgagaacccctc ctcagcagctctgagaacggcaatggtggccgagagacgtctccagctgatgcccggacc gagagaatcctcagtgacctcctgcccttctgcagaccaggccccgagggccagctcgct gccacggcccttcggaaagtgctgtccctgaagcagcttctgtttccggcggaggaagac aatggggcggggcctcctcgagatggggatggggtcccagggggcggccccctgagccca gcacggacccaggaaatccaggagaacctgctcagcttggaggagaccatgaagcagctg gaggtggggcgggacgggccaggggtgccctga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_8|354_aa MDCSCVSDLLFAPPALPALWTPGFAFPDWAYKPESSPGSRQIQLWHFILELLQKEEYQGV IAWQGDYGEFVIKDPDEVARLWGIRKCKPHMNYDKLSRALRYYYNKRILHKTKGKRFTYK FNFSKVVLVNYPLLDMAAAATGSPLLLTPSPFGGAPGPDAPPLTPETLQTLFSAPRLGEP GARTPLFTSETDKLRLDSPFPFLGSGATSYSKPPGLLGPYGRAFPEYPWNFNPYLTGPFP KLPPSLYPPHFYPNPLASSLGHLPSSGAGGGPTATPLLASTGEGLGPERPSGLAAAPRLA LPGAGGPEAALGGKEDSDSELEITDVSGCSSDSEGDEGLPAPPKAKAGKGGTGS >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_8|1065_bp atggactgtagctgcgtctccgaccttctcttcgccccgcccgccctgccggctctctgg acccccgggtttgccttcccggattgggcctacaagccagagtcatcccctggctcgagg cagatccagctgtggcactttatcctggagctgctgcagaaggaggagtaccagggcgtc atagcctggcagggggactacggggaattcgtcatcaaagaccccgatgaggtggcccgg ctgtggggtattcgcaaatgcaagccccacatgaattacgacaagctgagccgggccctg cgttactactacaacaagcggattctccacaagaccaaagggaagaggttcacctacaag ttcaacttcagcaaagtcgtgcttgtcaattacccgctgctggacatggcggcagctgcc actggctccccactcttgctgacccccagtccctttgggggggccccagggccagatgct cctcccctcacccctgagaccctgcaaaccctgttctctgccccacgcctgggagagcca ggggcccggacacccctgttcacctccgagacagataaattgcgtctggacagccctttc ccattcctgggctctggtgccaccagctattccaagccccctggcctgctgggtccttat ggccgcgccttccctgagtacccctggaactttaacccgtacctcacgggccccttcccc aagctgcctccctctctctaccccccgcatttctaccccaaccctctggccagttccctg ggccacctgccctcgtcaggggcagggggaggccccacagccacgcccctgctggcctcg acaggggagggcctgggccccgagcgcccctcgggcctggcagcggcccctcgcctggcg ctgccaggggctgggggtccagaggctgcccttggtgggaaggaggacagcgactcggag ctggagatcaccgacgtcagcggctgcagctctgacagcgagggcgatgagggtctcccg gcaccccccaaggcaaaggcgggcaaaggggggaccggcagctga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_9|162_aa MSLGLRTAALRMVSVEGWRGSVPGEHPILTTTCEVGTAMMDAHFTDVETEVLTGKVTWPK SHRDEVSLDLNAVDARTNGQARGGGGDDASRTPAGHASLDRDWNTDPHTRSLDRRYTDRG TYGYKAVLREKNILFCKDIIIDFGGNLTSGVVNSTQNKIENT >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_9|489_bp atgtccctggggctgcggacagcagcgctccgaatggtgtcagttgaggggtggcgaggg agcgtccccggggagcaccccatcctcaccaccacctgtgaggtaggaactgccatgatg gatgcccatttcacagatgtggaaactgaggttctgacaggaaaagtaacttggccaaag tcccacagggatgaggtgagcctggacttgaatgcagtggacgcgcggacaaacggacag gcgcgaggtggaggtggcgacgatgcgtccaggacccctgctggacacgcaagcctggac cgggactggaacacggacccgcacacacgcagcctggataggagatacacggacagaggg acatatggatacaaggccgttctccgagagaagaatatcctcttttgtaaagatattatt atcgatttcggcggcaatttgacatcaggggtagttaattccactcagaataaaattgaa aacacttga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_10|249_aa MAAQKDQQKDAEAEGLSGTTLLPKLIPSGAGREWLERRRATIRPWSTFVDQQRFSRPRNL GELCQRLVRNVEYYQSNYVFVFLGLILYCVVTSPMLLVALAVFFGACYILYLRTLESKLV LFGREVSPAHQYALAGGISFPFFWLAGAGSAVFWVLGATLVVIGSHAAFHQIEAVDGEEL QMEPVRTHPLILDAAGAHPEPTDAHCSGILLSDMVASHMRLPQTWLVQTETCYECEMHTR LRRFSMKME >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_10|750_bp atggcagcgcagaaggaccagcagaaagatgccgaggcggaagggctgagcggcacgacc ctgctgccgaagctgattccctccggtgcaggccgggagtggctggagcggcgccgcgcg accatccggccctggagcaccttcgtggaccagcagcgcttctcacggccccgcaacctg ggagagctgtgccagcgcctcgtacgcaacgtggagtactaccagagcaactatgtgttc gtgttcctgggcctcatcctgtactgtgtggtgacgtcccctatgttgctggtggctctg gctgtctttttcggcgcctgttacattctctatctgcgcaccttggagtccaagcttgtg ctctttggccgagaggtgagcccagcgcatcagtatgctctggctggaggcatctccttc cccttcttctggctggctggtgcgggctcggccgtcttctgggtgctgggagccaccctg gtggtcatcggctcccacgctgccttccaccagattgaggctgtggacggggaggagctg cagatggaacccgtcagaactcacccactcatccttgatgcagcaggtgctcacccagag cccactgatgcccactgttctggaattctcctgtcagatatggtggccagccatatgcgg ctgcctcaaacatggctggtccaaactgagacatgctatgagtgtgaaatgcacaccaga cttcgaagatttagtatgaaaatggagtaa >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_11|1026_aa MGSGGSDSYRIATSQDKKDDKDSPKKNKGKERRDLDDLKKEVAMTEHKMSVEEVCRKYNT DCVQGLTHSKAQEILARDGPNALTPPPTTPEWVKFCRQLFGGFSILLWIGAILCFLAYGI QAGTEDDPSGDNLYLGIVLAAVVIITGCFSYYQEAKSSKIMESFKNMVPQQALVIREGEK MQVNAEEVVVGDLVEIKGGDRVPADLRIISAHGCKVDNSSLTGESEPQTRSPDCTHDNPL ETRNITFFSTNCVEGTARGVVVATGDRTVMGRIATLASGLEVGKTPIAIEIEHFIQLITG VAVFLGVSFFILSLILGYTWLEAVIFLIGIIVANVPEGLLATVTVCLTLTAKRMARKNCL VKNLEAVETLGSTSTICSDKTGTLTQNRMTVAHMWFDNQIHEADTTEDQSGTSFDKSSHT WVALSHIAGLCNRAVFKGGQDNIPVLKRDVAGDASESALLKCIELSSGSVKLMRERNKKV AEIPFNSTNKYQLSIHETEDPNDNRYLLVMKGAPERILDRCSTILLQGKEQPLDEEMKEA FQNAYLELGGLGERVLGFCHYYLPEEQFPKGFAFDCDDVNFTTDNLCFVGLMSMIDPPRA AVPDAVGKCRSAGIKVIMVTGDHPITAKAIAKGVGIISEGNETVEDIAARLNIPVSQVNP RDAKACVIHGTDLKDFTSEQIDEILQNHTEIVFARTSPQQKLIIVEGCQRQGAIVAVTGD GVNDSPALKKADIGVAMGIAGSDVSKQAADMILLDDNFASIVTGVEEGRLIFDNLKKSIA YTLTSNIPEITPFLLFIMANIPLPLGTITILCIDLGTDMVPAISLAYEAAESDIMKRQPR NPRTDKLVNERLISMAYGQIGMIQALGGFFSYFVILAENGFLPGNLVGIRLNWDDRTVND LEDSYGQQWTYEQRKVVEFTCHTAFFVSIVVVQWADLIICKTRRNSVFQQGMKNKILIFG LFEETALAAFLSYCPGMDVALRMYPLKPSWWFCAFPYSFLIFVYDEIRKLILRRNPGGWV EKETYY >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_11|3081_bp atggggtctggtggctctgacagctatcgtatcgccacctcgcaggacaagaaagatgac aaggactcacccaagaagaacaagggcaaggagcgccgggacctggatgacctcaagaag gaggtggctatgacagagcacaagatgtcagtggaagaggtctgccggaaatacaacaca gactgtgtgcagggtttgacccacagcaaagcccaggagatcctggcccgggatgggcct aacgcactcacgccaccgcctaccaccccagagtgggtcaagttttgccggcagctcttc gggggcttctccatcctgctgtggatcggggctatcctctgcttcctggcctacggtatc caggcgggcaccgaggacgacccctctggtgacaacctgtacctgggcatcgtgctggcg gccgtggtgatcatcactggctgcttctcctactaccaggaggccaagagctccaagatc atggagtccttcaagaacatggtgccccagcaagccctggtgatccgggaaggtgagaag atgcaggtgaacgctgaggaggtggtggtcggggacctggtggagatcaagggtggagac cgagtgccagctgacctgcggatcatctcagcccacggctgcaaggtggacaactcctcc ctgactggcgaatccgagccccagactcgctctcccgactgcactcacgacaaccccttg gagactcggaacatcaccttcttttccaccaactgtgtggaaggcacggctcggggcgtg gtggtggccacgggcgaccgcactgtcatgggccgtatcgccaccctggcatcagggctg gaggtgggcaagacgcccatcgccatcgagattgagcacttcatccagctcatcaccggc gtggctgtcttcctgggtgtctccttcttcatcctctccctcattctcggatacacctgg cttgaggctgtcatcttcctcatcggcatcatcgtggccaatgtcccagagggtctgctg gccactgtcactgtgtgtctgacgctgaccgccaagcgcatggcccggaagaactgcctg gtgaagaacctggaggctgtagaaaccctgggctccacgtccaccatctgctcagataag acagggaccctcactcagaaccgcatgacagtcgcccacatgtggtttgacaaccagatc cacgaggctgacaccactgaggaccagtcagggacctcatttgacaagagttcgcacacc tgggtggccctgtctcacatcgctgggctctgcaatcgcgctgtcttcaagggtggtcag gacaacatccctgtgctcaagagggatgtggctggggatgcgtctgagtctgccctgctc aagtgcatcgagctgtcctctggctccgtgaagctgatgcgtgaacgcaacaagaaagtg gctgagattcccttcaattccaccaacaaataccagctctccatccatgagaccgaggac cccaacgacaaccgatacctgctggtgatgaagggtgcccccgagcgcatcctggaccgc tgctccaccatcctgctacagggcaaggagcagcctctggacgaggaaatgaaggaggcc ttccagaatgcctaccttgagctcggtggcctgggcgagcgcgtgcttggtttctgccat tattacctgcccgaggagcagttccccaagggctttgccttcgactgtgatgacgtgaac ttcaccacggacaacctctgctttgtgggcctcatgtccatgatcgacccaccccgggca gccgtccctgacgcggtgggcaagtgtcgcagcgcaggcatcaaggtcatcatggtcacc ggcgatcaccccatcacggccaaggccattgccaagggtgtgggcatcatctctgagggc aacgagactgtggaggacatcgccgcccggctcaacattcccgtcagccaggttaacccc cgggatgccaaggcctgcgtgatccacggcaccgacctcaaggacttcacctccgagcaa atcgacgagatcctgcagaatcacaccgagatcgtcttcgcccgcacatccccccagcag aagctcatcattgtggagggctgtcagagacagggtgcaattgtggctgtgaccggggat ggtgtgaacgactcccccgctctgaagaaggccgacattggggtggccatgggcatcgct ggctctgacgtctccaagcaggcagctgacatgatcctgctggacgacaactttgcctcc atcgtcacaggggtggaggagggccgcctgatcttcgacaacctaaagaagtccattgcc tacaccctgaccagcaatatcccggagatcacgcccttcctgctgttcatcatggccaac atcccgctgcccctgggcaccatcaccatcctctgcatcgatctgggcactgacatggtc cctgccatctcactggcgtacgaggctgccgaaagcgacatcatgaagagacagcccagg aacccgcggacggacaaattggtcaatgagagactcatcagcatggcctacgggcagatt ggaatgatccaggctctcggtggcttcttctcttactttgtgatcctggcagaaaatggc ttcttgcccggcaacctggtgggcatccggctgaactgggatgaccgcaccgtcaatgac ctggaagacagttacgggcagcagtggacatacgagcagaggaaggtggtggagttcacc tgccacacggccttctttgtgagcatcgttgtcgtccagtgggccgatctgatcatctgc aagacccggaggaactcggtcttccagcagggcatgaagaacaagatcctgatcttcggg ctgtttgaggagacggccctggctgccttcctgtcctactgccccggcatggacgtggcc ctgcgcatgtaccctctcaagcccagctggtggttctgtgccttcccctacagtttcctc atcttcgtctacgacgaaatccgcaaactcatcctgcgcaggaacccagggggttgggtg gagaaggaaacctactactga >gi568815579f:41788191_42006783|GENSCAN_predicted_peptide_12|431_aa MEVPVESADDLADQTNIEYGTIHAGSTMTFFQNSRYQTYQRMWNYMQSKQPSVFVKSTEE GIARVLNSRYAFLLESTMNEYHRRLNCNLTQIGGLLDTKGYGIGMPLGTAQHGSPFRDEI TLAILQLQENNRLEILKRKWWEGGRCPKEEDHRAKGLGMENIGGIFIVLICGLIIAVFVA VMEFIWSTRRSAESEEALFVVGLQGSVLRLLLFSVFLHYLNDPTSQMALKTHYVLCSEQC RAHSRPQVNMPCNNERMDEGQKMGNQEGRTGENLETVKRPEEREELRWKVSVCQEMLQEL RHAVSCRKTSRSRRRRRPGGPSRALLSLRAVREMRLSNGKLYSAGAGGDAGSAHGGPQRL LDDPGPPSGARPAAPTPCTHVRVCQECRRIQALRASGAGAPPRGLGVPAEATSPPRPRPG PAGPRELAEHE >gi568815579f:41788191_42006783|GENSCAN_predicted_CDS_12|1296_bp atggaggtgcctgtggagtcggccgatgacctggcagatcagaccaacatcgagtatggc accatccacgccggctccaccatgaccttcttccagaattcacggtaccaaacgtaccag cgcatgtggaactacatgcagtcgaagcagcccagcgtgttcgtcaagagcacagaagag ggcattgcccgcgtcctcaactcccgctacgccttcctgctcgagtccaccatgaacgaa taccaccggcgcctcaactgcaacctcacccagatcgggggactcctcgacaccaagggc tacggcattggcatgccgctgggtacggcacagcacggctccccgttccgggatgagatc acactggccatcctgcagcttcaggagaacaaccggctggagatcctgaagcgcaagtgg tgggaggggggccggtgccccaaggaggaggaccatcgagctaaaggtttgggcatggag aacattggtggcatttttatcgtgctcatctgtggcctcatcattgctgtcttcgtggcg gtcatggaattcatatggtccacacggaggtcagctgagtccgaggaggctctgtttgtt gtggggctccagggctcagtccttagactgcttctcttctcggtctttctccactacttg aatgatcccaccagccaaatggctcttaagacccactatgtgctgtgctcagaacagtgc cgggcacacagcaggccacaggtaaatatgccttgcaataatgaacgcatggatgagggg cagaagatgggaaatcaggaaggaagaactggagaaaatttagagacagtgaaaagacca gaggaaagagaggagcttaggtggaaggtgtcggtgtgccaggagatgctgcaggagctg cgccacgccgtttcttgccgcaagacgtcgcgttcccgccggcgccgacgcccgggcggc ccgagccgggccctgctgtcactgcgcgcggtccgcgagatgcgcctcagcaacggcaag ctctactcggccggcgcgggcggggatgcgggcagcgcgcacgggggcccgcagcgcctc ctggacgacccggggccccccagcggagcccgacccgccgcccccaccccctgcacccac gtgcgcgtctgccaggagtgccggcgcatccaggcgctgcgggcctcgggggccggcgcg cctccgcgtggcctgggcgtccccgccgaagccaccagcccgccccggccgcggcctggc cccgccggcccccgggagctggcggagcacgagtga