GENSCAN 1.0    Date run: 8-Nov-116    Time: 17:20:19

Sequence gi568815579r:4437943_4638953 : 201011 bp : 55.15% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4303   4399   97  0  1   63   80   246 0.296  21.38
 1.02 Term +   4983   5083  101  1  2  120   54    19 0.982   0.29
 1.03 PlyA +   5286   5291    6                               1.05

 2.13 PlyA -   7086   7081    6                               1.05
 2.12 Term -   7681   7556  126  1  0   82   44   189 0.986  12.49
 2.11 Intr -   8255   8107  149  0  2   98  105   285 0.999  31.66
 2.10 Intr -   8471   8341  131  1  2   91   47   357 0.657  32.74
 2.09 Intr -   8804   8558  247  0  1   28   80   364 0.784  26.65
 2.08 Intr -   8978   8894   85  2  1   89  110    64 0.997   8.59
 2.07 Intr -   9683   9608   76  1  1  112   80   212 0.505  22.81
 2.06 Intr -  10078   9912  167  1  2   43   89   136 0.506   8.47
 2.05 Intr -  10473  10405   69  0  0  101   39   148 0.454  11.07
 2.04 Intr -  14550  14422  129  0  0   69   97   122 0.595  12.60
 2.03 Intr -  15580  15516   65  2  2   80   83    93 0.944   6.93
 2.02 Intr -  16151  15988  164  1  2  103   76   207 0.878  21.13
 2.01 Init -  19755  19673   83  0  2  114   57   113 0.977  11.19
 2.00 Prom -  25649  25610   40                              -1.81

 3.00 Prom +  29965  30004   40                              -2.41
 3.01 Init +  30368  30438   71  1  2   62   91   -20 0.442  -3.83
 3.02 Intr +  34357  34480  124  1  1   71   72   144 0.634  12.19
 3.03 Intr +  37333  37409   77  0  2   83  111   122 0.999  12.81
 3.04 Intr +  37503  37641  139  0  1   92  117   261 0.999  30.37
 3.05 Intr +  44841  44963  123  2  0  131   28    27 0.815   2.29
 3.06 Intr +  45048  45089   42  2  0  100  101    13 0.436   2.72
 3.07 Intr +  50734  50934  201  0  0   61   97   407 0.928  38.90
 3.08 Intr +  53624  53740  117  1  0  107   80   179 0.969  20.17
 3.09 Intr +  54073  54210  138  0  0   72   85    39 0.875   3.17
 3.10 Intr +  55761  55920  160  2  1  126   91    98 0.995  13.97
 3.11 Intr +  56040  56115   76  1  1   75   94    20 0.774   0.37
 3.12 Intr +  56224  56533  310  1  1   73   83   593 0.987  54.16
 3.13 Intr +  58360  58463  104  0  2   54   63   148 0.992   8.37
 3.14 Intr +  60016  60233  218  1  2   43   23   232 0.529  10.87
 3.15 Intr +  60364  60434   71  2  2   73  111   139 0.984  14.09
 3.16 Intr +  60872  60973  102  1  0   94   86   128 0.799  14.07
 3.17 Intr +  61549  61762  214  0  1   45   93   319 0.979  26.91
 3.18 Intr +  63249  63375  127  1  1   83   72   141 0.812  12.24
 3.19 Term +  63969  64068  100  0  1   65   52   124 0.789   4.40
 3.20 PlyA +  64260  64265    6                              -0.45

 4.09 PlyA -  64280  64275    6                               1.05
 4.08 Term -  66843  66517  327  0  0  126   37   328 0.770  26.56
 4.07 Intr -  67005  66919   87  0  0   75   90    90 0.907   8.56
 4.06 Intr -  71013  70826  188  1  2   91   99   114 0.950  12.73
 4.05 Intr -  75762  72504 3259  0  1   86  100  3779 0.634 367.50
 4.04 Intr -  78736  78675   62  2  2  116  100    42 0.980   7.14
 4.03 Intr -  79756  79612  145  1  1   88   96    73 0.793   8.47
 4.02 Intr -  80347  80280   68  2  2  103   98     8 0.380   2.42
 4.01 Init -  80947  80842  106  1  1   66   48    51 0.181  -0.75
 4.00 Prom -  84257  84218   40                              -6.50

 5.11 PlyA -  84614  84609    6                               1.05
 5.10 Term -  86143  85586  558  1  0   51   46   871 0.996  73.96
 5.09 Intr -  87134  87021  114  2  0  108   94    33 0.791   7.05
 5.08 Intr -  87890  87691  200  0  2   85   85   227 0.553  21.69
 5.07 Intr -  91311  91131  181  0  1   49   85   294 0.050  25.16
 5.06 Intr -  91924  91842   83  2  2  116   32    34 0.908   0.45
 5.05 Intr -  93880  93685  196  1  1  101   75   255 0.998  25.01
 5.04 Intr -  96153  96073   81  0  0  109   94    71 0.308  10.13
 5.03 Intr -  99537  99504   34  2  1   87   75   -11 0.029  -3.79
 5.02 Intr - 100979 100002  978  1  0   61   44  1295 0.037 113.15
 5.01 Init - 102071 102040   32  2  2   81  100     7 0.488   0.36
 5.00 Prom - 102133 102094   40                              -6.50

 6.17 PlyA - 104672 104667    6                               1.05
 6.16 Term - 106587 105659  929  1  2   98   47  1086 0.999  98.31
 6.15 Intr - 108332 108274   59  1  2  111  109    60 0.999   9.42
 6.14 Intr - 108527 108432   96  1  0   71   90    92 0.763   7.32
 6.13 Intr - 110231 110085  147  1  0  127   81   213 0.983  24.36
 6.12 Intr - 110503 110321  183  0  0  128  111   286 0.999  34.32
 6.11 Intr - 112330 112181  150  0  0  101   63   220 0.991  20.59
 6.10 Intr - 112988 112857  132  1  0   62   97   215 0.983  20.07
 6.09 Intr - 114697 114480  218  1  2   79   76   342 0.904  29.93
 6.08 Intr - 116534 116446   89  0  2  114  100   114 0.991  15.39
 6.07 Intr - 117153 117034  120  1  0   72  100   152 0.953  15.77
 6.06 Intr - 117622 117532   91  1  1   74  119   170 0.988  18.77
 6.05 Intr - 118147 118046  102  1  0   89   75   231 0.981  22.77
 6.04 Intr - 119071 119009   63  1  0  115  105    61 0.994   9.91
 6.03 Intr - 119281 119221   61  0  1  100   72   145 0.996  13.33
 6.02 Intr - 120207 120084  124  1  1   89   72   229 0.985  21.55
 6.01 Init - 120515 120395  121  2  1   89   74   238 0.978  20.81
 6.00 Prom - 132272 132233   40                              -5.41

 7.00 Prom + 132871 132910   40                              -2.71
 7.01 Init + 137478 137611  134  2  2   58  105    61 0.639   4.39
 7.02 Intr + 138415 138451   37  1  1   79   94    26 0.447   1.05
 7.03 Intr + 142254 142333   80  2  2   50   80    66 0.463   0.84
 7.04 Intr + 142667 142830  164  2  2   46   86    54 0.410   1.13
 7.05 Intr + 146375 146504  130  0  1   26   60   113 0.456   2.56
 7.06 Intr + 147749 147860  112  2  1  103   42    65 0.194   4.28
 7.07 Intr + 152906 152941   36  1  0  144   94    38 0.766   8.94
 7.08 Intr + 153836 153852   17  1  2   90  105     1 0.458  -3.18
 7.09 Intr + 154784 154955  172  2  1   72   58   100 0.279   5.86
 7.10 Intr + 181683 181798  116  2  2  137   84    -1 0.009   4.15
 7.11 Intr + 200427 200661  235  0  1   82   23   130 0.002   4.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  88122  88065   58  0  1   20   80    80 0.810   1.82
S.002 Term -  91311  91099  213  0  0   49   47   373 0.914  26.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_1|65_aa
IGAEDMDGFQADTEEEEEEEGDCMIVDVPDAAEVQAPCGAASGAGGGVGVDTGKATLTAS
PLGAS

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_1|198_bp
attggtgctgaagacatggacggcttccaggcagacacggaggaggaggaagaggaggag
ggcgactgtatgatcgtggatgtcccggatgctgcggaggtccaagccccgtgtggagcc
gcttccggagctgggggtggtgtgggggtggacaccggcaaggccaccctgaccgcgagc
ccactgggtgcatcctga

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_2|496_aa
MKKFFQEFKADIKFKSAGPGQKLKESVGEKAHKEKPNQPAPRPPRQGPTNEAQMAAAAAL
ARLEQKQSRAWGPTSQDTIRNQVRKELQAEATVSGSPEAPGTNVVSEPREEGSAHLAVPG
VYFTCPLTGATLRKDQRDACIKEAILLHFSTDPVAASIMKIYTFNKDQDRDRGQAWRSTL
PANAEGAPRRSPLLALRPFSSGSPLFRHCGFPTAVVNCALPGLVGRYLDNIHLHPEEEKY
RKIKLQNKVFQERINCLEGTHEFFEAIGFQKVLLPAQDQVTPMSCHAVEDPEEFYVLSET
TLAQPQSLERHKEQLLAAEPVRAKLDRQRRVFQPSPLASQFELPGDFFNLTAEEIKREQR
LRSEAVERLSVLRTKAMREKEEQRGLRKYNYTLLRVRLPDGCLLQGTFYARERLGAVYGF
VREALQSDWLPFELLASGGQKLSEDENLALNECGLVPSALLTFSWDMAVLEDIKAAGAEP
DSILKPELLSAIEKLL

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_2|1491_bp
atgaagaaattctttcaggagttcaaggccgacatcaagttcaagagcgcgggacccggt
cagaagctcaaagagtccgtgggggaaaaggcccacaaagagaagcccaaccagccagcc
cccaggccgccccgccagggacccaccaatgaggcacagatggcagccgctgccgcccta
gcccggctggagcagaagcagtcccgggcctggggccccacatcgcaggacaccatccga
aaccaggtgagaaaggaacttcaagccgaagccaccgtcagcgggagccccgaggcccca
gggaccaacgtggtatctgagcccagagaggaaggctctgcccacctggctgtgcctggc
gtgtacttcacctgtccgctcactggggccaccctgaggaaggaccagcgggacgcctgc
atcaaggaggccattctcttgcacttctccaccgacccagtggccgcctccatcatgaag
atctacacgttcaacaaagaccaggaccgggatcgggggcaggcctggaggtccacgctg
cccgccaacgccgagggagcaccccgcaggagccccctgctggcactgcggcccttcagc
tctggctctccactcttccggcactgtggcttccccaccgctgtggttaattgtgccctt
ccagggctggtgggacggtacctggacaacatccacctgcaccccgaggaggagaagtac
cggaagatcaagctgcagaacaaggtgtttcaggagcgcattaactgcctggaagggacc
cacgagttttttgaggccattgggttccagaaggtgttgcttcccgcccaggatcaggtg
acacccatgtcctgccacgctgtagaggaccccgaggagttctacgtgctgagcgagacc
accttggcccagccccagagcctggagaggcacaaggaacagctgctggctgcggagccc
gtgcgcgccaagctggacaggcagcgccgcgtcttccagccctcgcccctggcctcgcag
ttcgaactgcctggggacttcttcaacctcacagcagaggagatcaagcgggagcagagg
ctcaggtccgaggcggtggagcggctgagcgtgctgcggaccaaggccatgcgggagaag
gaggagcagcgggggctgcgcaagtacaactacacgctgctgcgcgtgcgcctccccgat
ggctgcctcctgcagggcactttctacgctcgggagcggctgggggcggtgtacgggttc
gtccgggaggccctgcagagcgactggctgccttttgagctgctggcctcgggagggcag
aagctgtccgaggacgagaacctggccttgaacgagtgcgggctggtgccctctgccctc
ctgaccttctcgtgggacatggctgtgctggaggacatcaaggccgcgggggccgagccg
gactccatcctgaaacccgagctcctgtcagccatcgagaagctcttgtga

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_3|837_aa
MVPWEVCASTYTFERTVQNSVPSSRYRRCSRFPRPGPLAVSMPHAFKPGDLVFAKMKGYP
HWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPKDLFPYDKCKDKYGKPNKRKGFNE
GLWEIQNNPHASYSAPPNQAGHAHRLHGALISPITAGGSWVRGTHQLLLWKILGHQGWRL
SPYQGCSKEKPQPVSSSDSEAPEANPADGSDADEDDEDRGVMAVTAVTATAASDRMESDS
DSDKSSDNSGLKRKTPALKMSVSKRARKASSDLDQASVSPSEEENSESSSESEKTSDQEP
TIWFGKGHSGMLASEGREAVLTRLHESERVRKQERERDTEERREKAPSASDSDSKADSDG
AKPEPVAMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPKPERPPSSSS
SDSDSDEVDRISEWKRRDEARRRELEARRRREQEEELRRLREQEKEEKERRRERADRGEA
ERGSGGSSGDELREDDEPVKKRGRKGRGRGPPSSSDSEPEAELEREAKKSAKKPQSSSTE
PARKPGQKEKRVRPEEKQQAKPVKVERTRKRSEGFSMDRKVEKKKGEAWLPSTAHTEFTE
GNGDSRGVAGLSRLSLERVLSTSSAGLAGGASREPSVEEKLQKLHSEIKFALKVDSPDVK
RCLNALEELGTLQVTSQILQKNTDVVATLKKIRRYKANKDVMEKAAEVYTRLKSRVLGPK
IEAVQKVNKAGMEKEKAEEKLAGEELAGEEAPQEKAEDKPSTDLSAPVNGEATSQKGESA
EDKEHEEGRDSEEGPRCGSSEDLHDSVREGPDLDRPGSDRQERERARGDSEALDEES

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_3|2514_bp
atggttccatgggaggtctgtgcttccacatacacatttgagaggacggtacaaaattca
gtccctagcagccgctaccgccgctgcagccgctttccgcggcctgggcctctcgccgtc
agcatgccacacgccttcaagcccggggacttggtgttcgctaagatgaagggctaccct
cactggcctgccaggatcgacgacatcgcggatggcgccgtgaagcccccacccaacaag
taccccatctttttctttggcacacacgaaacagccttcctgggacccaaggacctgttc
ccctacgacaaatgtaaagacaagtacgggaagcccaacaagaggaaaggcttcaatgaa
gggctgtgggagatccagaacaacccccacgccagctacagcgcccctccgaatcaggct
ggtcatgcccaccggttacatggggcattgataagtcctataactgccggtggctcctgg
gttcgtggcactcaccagctcctcctgtggaagatcctggggcatcagggttggcgcttg
tctccttaccagggatgctccaaagagaaaccccagccagtgagctcctccgacagcgag
gcccccgaggccaaccccgccgacggcagtgacgctgacgaggacgatgaggaccggggg
gtcatggccgtcacagcggtaaccgccacagctgccagcgacaggatggagagcgactca
gactcagacaagagtagcgacaacagtggcctgaagaggaagacgcctgcgctaaagatg
tcggtctcgaaacgagcccgaaaggcctccagcgacctggatcaggccagcgtgtcccca
tccgaagaggagaactcggaaagctcatctgagtcggagaagaccagcgaccaggagccc
accatctggtttggaaaggggcattctgggatgctggcatcagaagggagggaagcagtc
ctgacgcgcctgcatgagagtgagagagtgagaaagcaagagcgcgagagggacacagaa
gagaggagagagaaggcgccatcagcctccgactccgactccaaggccgattcggacggg
gccaagcctgagccggtggccatggcgcggtcggcgtcctcctcctcctcttcctcctcc
tcctccgactccgatgtgtctgtgaagaagcctccgaggggcaggaagccagcggagaag
cctctcccgaagccgcgagggcggaaaccgaagcctgaacggcctccgtccagctccagc
agtgacagtgacagcgacgaggtggaccgcatcagtgagtggaagcggcgggacgaggcg
cggaggcgcgagctggaggcccggcggcggcgagagcaggaggaggagctgcggcgcctg
cgggagcaggagaaggaggagaaggagcggaggcgcgagcgggccgaccgcggggaggct
gagcggggcagcggcggcagcagcggggacgagctcagggaggacgatgagcccgtcaag
aagcggggacgcaagggccggggccggggtcccccgtcctcctctgactccgagcccgag
gccgagctggagagagaggccaagaaatcagcgaagaagccgcagtcctcaagcacagag
cccgccaggaaacctggccagaaggagaagagagtgcggcccgaggagaagcaacaagcc
aagcccgtgaaggtggagcggacccggaagcggtccgagggcttctcgatggacaggaag
gtagagaagaagaaaggtgaggcctggctgcccagcactgcccacactgagttcactgag
gggaacggggacagccgtggcgtggctggcctgtcccgcctgtccctagagagggtgctc
agcacatcctcggccggcctggccggcggtgcctccagggagccctccgtggaggagaag
ctgcagaagctgcacagtgagatcaagtttgccctaaaggtcgacagcccggacgtgaag
aggtgcctgaatgccctagaggagctgggaaccctgcaggtgacctctcagatcctccag
aagaacacagacgtggtggccaccttgaagaagattcgccgttacaaagcgaacaaggac
gtaatggagaaggcagcagaagtctatacccggctcaagtcgcgggtcctcggcccaaag
atcgaggcggtgcagaaagtgaacaaggctgggatggagaaggagaaggccgaggagaag
ctggccggggaggagctggccggggaggaggccccccaggagaaggcggaggacaagccc
agcaccgatctctcagccccagtgaatggcgaggccacatcacagaagggggagagcgca
gaggacaaggagcacgaggagggtcgggactcggaggaggggccaaggtgtggctcctct
gaagacctgcacgacagcgtacgggagggtcccgacctggacaggcctgggagcgaccgg
caggagcgcgagagggcacggggggactcggaggccctggacgaggagagctga

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_4|1413_aa
MKQAKEEAGGDESREEAGCYGTWCGVRERNLKQKDASHVLTMSAPDEGRRDPPKPKGKTL
GSFFGSLPGFSSARNLVANAHSSARARPAADPTGAPAAEAAQPQAQVAAHPEQTAPWTEK
ELQPSEKQMVSGAKDLVCSKMSRAKDAVSSGVASVVDVAKGVVQGGLDTTRSALTGTKEV
VSSGVTGAMDMAKGAVQGGLDTSKAVLTGTKDTVSTGLTGAVNVAKGTVQAGVDTTKTVL
TGTKDTVTTGVMGAVNLAKGTVQTGVETSKAVLTGTKDAVSTGLTGAVNVARGSIQTGVD
TSKTVLTGTKDTVCSGVTGAMNVAKGTIQTGVDTSKTVLTGTKDTVCSGVTGAMNVAKGT
IQTGVDTSKTVLTGTKDTVCSGVTGAMNVAKGTIQTGVDTTKTVLTGTKNTVCSGVTGAV
NLAKEAIQGGLDTTKSMVMGTKDTMSTGLTGAANVAKGAMQTGLNTTQNIATGTKDTVCS
GVTGAMNLARGTIQTGVDTTKIVLTGTKDTVCSGVTGAANVAKGAVQGGLDTTKSVLTGT
KDAVSTGLTGAVNVAKGTVQTGVDTTKTVLTGTKDTVCSGVTSAVNVAKGAVQGGLDTTK
SVVIGTKDTMSTGLTGAANVAKGAVQTGVDTAKTVLTGTKDTVTTGLVGAVNVAKGTVQT
GMDTTKTVLTGTKDTIYSGVTSAVNVAKGAVQTGLKTTQNIATGTKNTFGSGVTSAVNVA
KGAAQTGVDTAKTVLTGTKDTVTTGLMGAVNVAKGTVQTSVDTTKTVLTGTKDTVCSGVT
GAANVAKGAIQGGLDTTKSVLTGTKDAVSTGLTGAVKLAKGTVQTGMDTTKTVLTGTKDA
VCSGVTGAANVAKGAVQMGVDTAKTVLTGTKDTVCSGVTGAANVAKGAVQTGLKTTQNIA
TGTKNTLGSGVTGAAKVAKGAVQGGLDTTKSVLTGTKDAVSTGLTGAVNLAKGTVQTGVD
TSKTVLTGTKDTVCSGVTGAVNVAKGTVQTGVDTAKTVLSGAKDAVTTGVTGAVNVAKGT
VQTGVDASKAVLMGTKDTVFSGVTGAMSMAKGAVQGGLDTTKTVLTGTKDAVSAGLMGSG
NVATGATHTGLSTFQNWLPSTPATSWGGLTSSRTTDNGGEQTALSPQEAPFSGISTPPDV
LSVGPEPAWEAAATTKGLATDVATFTQGAAPGREDTGLLATTHGPEEAPRLAMLQNELEG
LGDIFHPMNAEEQAQLAASQPGPKVLSAEQGSYFVRLGDLGPSFRQRAFEHAVSHLQHGQ
FQARDTLAQLQDCFRLIEKAQQAPEGQPRLDQGSGASAEDAAVQEERDAGVLSRVCGLLR
QLHTAYSGLVSSLQGLPAELQQPVGRARHSLCELYGIVASAGSVEELPAERLVQSREGVH
QAWQGLEQLLEGLQHNPPLSWLVGPFALPAGGQ

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_4|4242_bp
atgaagcaagcgaaggaggaagctgggggagatgagagcagagaagaagctggatgttac
gggacgtggtgcggagtacgagagagaaacctgaagcaaaaagatgcttctcacgttctc
actatgtctgctccagacgaagggagacgggatccccccaaaccgaagggcaagaccctg
ggcagcttctttgggtccctgcctggcttcagctctgcccggaacctggtggccaacgca
catagctcggcgagagcccggccggccgctgaccccacaggagcgcctgctgccgaggct
gcccaaccacaggctcaggtggctgcccacccagagcagacggccccatggacggagaag
gagctgcaaccttcggaaaagcagatggtgtccggggccaaagacctggtgtgttccaag
atgtccagggccaaggatgccgtgtcctccggggtggccagcgtggtggacgtggctaag
ggagtggtccagggaggcctggacaccactcggtctgcacttacgggcaccaaggaggtg
gtgtccagcggggtcacaggggccatggacatggctaagggggccgtccaagggggtctg
gacacctcgaaggctgtcctcaccggcaccaaggacacggtgtccactgggctcacgggg
gcagtgaatgtggccaaagggaccgtacaggccggtgtggacaccaccaagactgtgctg
accggcaccaaagacacagtgactactggggtcatgggggcagtgaacttggccaaaggg
actgtccagactggcgtggaaacctccaaggctgtgctgaccggcaccaaagatgctgtg
tccactgggctcacaggggcagtgaatgtggccagaggaagcattcagaccggtgtggac
accagtaagactgtcctaacaggtaccaaggacaccgtctgtagtggggtgactggtgcc
atgaatgtggccaaaggaaccatccagaccggcgtggacaccagtaagactgtcctaaca
ggtaccaaggacaccgtctgtagtggggtgactggtgccatgaatgtggccaaaggaacc
atccagaccggcgtggacaccagtaagactgtcctaacaggtaccaaggacaccgtctgt
agtggggtgactggtgccatgaatgtggccaaaggaaccatccagaccggcgtggacacc
accaagactgtcctaactggcaccaagaacactgtctgcagtggggtgaccggtgccgtg
aacttggccaaagaggccatccaggggggcctggataccaccaagtctatggtcatgggt
acgaaagacacgatgtccactgggctcacaggggcagcgaatgtggccaagggggccatg
caaactgggctgaacacaacccaaaatatcgcaacaggtacaaaggacaccgtctgcagt
ggggtgactggtgccatgaatttggccagaggaaccatccagacaggcgtggacaccacc
aagatcgttctaactggtaccaaggacactgtctgcagtggggtcaccggtgctgcgaat
gtggccaaaggggccgtccagggcggcctggacactacaaagtctgtcctgactggcact
aaagatgctgtgtccactgggctcacaggggctgtgaacgtggccaaagggaccgtccag
accggcgtagacaccaccaagactgtcctaaccggcaccaaggacaccgtctgcagtggg
gtgaccagtgctgtgaacgtggccaaaggggccgtccaggggggcctggacaccaccaag
tctgtggtcataggtacaaaagacacgatgtccactgggctcacgggggcagcgaatgtg
gccaagggggctgtccagacaggtgtagacacagccaagaccgtgctgaccggcaccaag
gacacagtgactactgggctcgtgggggcagtgaatgtcgccaaagggaccgtccagaca
ggcatggacaccaccaaaactgtcctaaccggtaccaaggacaccatctacagtggggtc
accagtgccgtgaacgtggccaagggggctgtgcaaactgggctgaaaacgacccaaaat
atcgcgacaggtacaaagaacacctttggcagtggggtgaccagtgctgtgaatgtggcc
aaaggggctgcccagacaggtgtagacacggccaagaccgtgctgaccggcaccaaggac
acagtcactactgggctcatgggggcagtgaatgtcgccaaagggactgtccagaccagt
gtggacaccaccaagactgtcctaactggtaccaaggacaccgtctgcagtggggtgacc
ggtgctgcgaatgtggccaaaggggccatccaagggggcctggacactacaaagtctgtc
ctgactggcactaaagatgctgtgtccactgggctcacaggggctgtgaagttggccaaa
gggactgtccagaccggcatggacaccaccaagactgtgttaactggtaccaaggatgct
gtgtgcagtggggtgaccggtgctgcgaatgtggccaagggggccgtccagatgggtgta
gacacggccaagaccgtgctgaccggtaccaaggacactgtctgcagtggggtcaccggt
gctgcgaacgtggccaagggtgctgtgcaaactgggctgaaaacgacccaaaatatcgca
acaggtacaaagaacacccttggcagtggggtgaccggtgctgcgaaagtggccaaaggg
gccgtccaggggggcctggacactacaaagtctgtcctgactggcactaaagatgccgtg
tccactgggctcacaggggctgtgaacttggccaaagggactgtccagaccggcgtggac
accagcaagactgtcctgaccggtaccaaggacaccgtctgcagtggagtcactggtgcc
gtaaatgtggccaaagggaccgtccagacaggtgtggacacagccaagacggtgctgagt
ggcgctaaggatgcagtgactactggagtcacgggggcagtgaatgtggccaaaggaacc
gtgcagaccggcgtggacgcctccaaggctgtgcttatgggtaccaaggacactgtcttc
agtggggttaccggtgccatgagcatggccaaaggggccgtccaggggggcctggacacc
accaagacagtgctgaccggaaccaaagacgcagtgtccgctgggctcatggggtcaggg
aacgtggcgacaggggccacccacactggcctcagcaccttccagaactggttacctagt
acccccgccacctcctggggtggactcaccagttccaggaccacagacaatggtggggag
cagactgccctgagcccccaagaggccccgttctctggcatctccacgcccccggatgtg
ctcagtgtaggcccggagcctgcctgggaagccgcagccactaccaagggccttgcgact
gacgtggcgacgttcacccaaggggccgccccaggcagggaggacacggggcttttggcc
accacacacggccccgaagaagccccacgcttggcaatgctgcagaatgagttggagggg
ctgggggacatcttccaccccatgaatgcggaggagcaagctcagctggctgcctcccag
cctgggccaaaggtgctgtcggcggaacaggggagctacttcgttcgtttaggtgacctg
ggtcccagcttccgccagcgggcatttgaacacgcggtgagccacctgcagcacggccag
ttccaagccagggacactctggcccagctccaggactgcttcaggctgattgaaaaggcc
cagcaggctccagaagggcagccacgtctggaccagggctcaggtgccagtgcggaggac
gctgctgtccaggaggagcgggatgccggggttctgtccagggtctgcggccttctccgg
cagctgcacacggcctacagtggcctggtctccagcctccagggcctgcccgccgagctc
cagcagccagtggggcgggcgcggcacagcctctgtgagctctatggcatcgtggcctca
gctggctctgtagaggagctgcccgcagagcggctggtgcagagccgcgagggtgtgcac
caggcttggcaggggttagagcagctgctggagggcctacagcacaatcccccgctcagc
tggctggtagggcccttcgccttgcccgctggcgggcagtag

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_5|818_aa
MSSWSRQRPKRTLFLLLLLAASAWGVTLSPKDCQVFRSDHGSSISCQPPAEIPGYLPADT
VHLAVEFFNLTHLPANLLQGASKLQELHLSSNGLESLSPEFLRPVPQLRVLDLTRNALTG
LPPGLFQASATLDTLVLKENQLEVLEVSWLHGLKALGHLDLSGNRLRKLPPGLLANFTLL
RTLDLGENQLETLPPDLLRGPLQLERLHLEGNKLQVLGKDLLLPQPDLRYLFLNGNKLAR
VAAGAFQGLRQLDMLDLSNNSLASVPEGLWASLGQPNWDMRDGFDISGNPWICDQNLSDL
YRWLQAQKDKMFSQNDTRCAGPEAVKGQTLLAVAKSQTHDGLSLPMPKGDPVCSTMSEEE
AAQIPRSSVWEQDQQNVVQRVVALPLVRATCTAVCDVYSAAKDRHPLLGSACRLAENCVC
GLTTRALDHAQPLLEHLQPQLATMNSLACRGLDKLEEKLPFLQQPSETVVTSAKDVVASS
VTGVVDLARRGRRWSVELKRSVSHAVDVVLEKSEELVDHFLPMTEEELAALAAEAEGPEV
GSVEDQRRQQGYFVRLGSLSARIRHLAYEHSVGKLRQSKHRAQDTLAQLQETLELIDHMQ
CGVTPTAPACPGKVHELWGEWGQRPPESRRRSQAELETLVLSRSLTQELQGTVEALESSV
RGLPAGAQEKVAEVRRSVDALQTAFADARCFRDVPAAALAEGRGRVAHAHACVDELLELV
VQAVPLPWLVGPFAPILVERPEPLPDLADLVDEVIGGPDPRWAHLDWPAQQRAWEAEHRD
GSGNGDGDRMGVAGDICEQEPETPSCPVKHTLMPELDF

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_5|2457_bp
atgtcctcttggagcagacagcgaccaaaaagaactctgttcctgctgctgctgttggca
gcctcagcctggggggtcaccctgagccccaaagactgccaggtgttccgctcagaccat
ggcagctccatctcctgtcaaccacctgccgaaatccccggctacctgccagccgacacc
gtgcacctggccgtggaattcttcaacctgacccacctgccagccaacctcctccagggc
gcctctaagctccaagaattgcacctctccagcaatgggctggaaagcctctcgcccgaa
ttcctgcggccagtgccgcagctgagggtgctggatctaacccgaaacgccctgaccggg
ctgcccccgggcctcttccaggcctcagccaccctggacaccctggtattgaaagaaaac
cagctggaggtcctggaggtctcgtggctacacggcctgaaagctctggggcatctggac
ctgtctgggaaccgcctccggaaactgccccccgggctgctggccaacttcaccctcctg
cgcacccttgaccttggggagaaccagttggagaccttgccacctgacctcctgaggggt
ccgctgcaattagaacggctacatctagaaggcaacaaattgcaagtactgggaaaagat
ctcctcttgccgcagccggacctgcgctacctcttcctgaacggcaacaagctggccagg
gtggcagccggtgccttccagggcctgcggcagctggacatgctggacctctccaataac
tcactggccagcgtgcccgaggggctctgggcatccctagggcagccaaactgggacatg
cgggatggcttcgacatctccggcaacccctggatctgtgaccagaacctgagcgacctc
tatcgttggcttcaggcccaaaaagacaagatgttttcccagaatgacacgcgctgtgct
gggcctgaagccgtgaagggccagacgctcctggcagtggccaagtcccaaacacacgat
gggctttccttacccatgcccaagggtgaccctgtttgcagcacgatgtctgaagaagag
gcggctcagatccccagatccagtgtgtgggagcaggaccagcagaacgtggtgcagcgt
gtggtggctctgcccctggtcagggccacgtgcaccgcggtctgcgatgtttacagtgca
gccaaggacaggcacccgctgctgggctccgcctgccgcctggctgagaactgcgtgtgc
ggcctgaccacccgtgccctggaccacgcccagccgctgctcgagcacctgcagccccag
ctggccactatgaacagcctcgcctgcaggggcctggacaagctggaagagaagcttccc
tttctccagcaaccttcggagacggtggtgacctcagccaaggacgtggtggccagcagt
gtcacgggtgtggtggacctggcccggaggggccggcgctggagcgtggagctgaagcgc
tccgtgagccatgctgtggatgttgtactggaaaaatcagaggagctggtggatcacttc
ctgcccatgacggaggaagagctcgcggcactggcggctgaggctgaaggccctgaagtg
ggttcggtggaggatcagaggagacagcagggctactttgtgcgcctcggctccctgtca
gcacggatccgccacctggcctacgagcactctgtggggaaactgaggcagagcaaacac
cgtgcccaggacaccctggcccagctgcaggagacgctggagctgatagaccacatgcag
tgtggggtgacccccaccgccccggcctgccctgggaaggtgcacgagctgtggggggaa
tggggccagcgccctccggagagccgccgccggagccaggcagagctggagacgctggtg
ctgtcccgcagcctgacccaggagctgcagggcacggtagaggctctggagtccagcgtg
cggggcctgcccgccggcgcccaggagaaggtggctgaggtgcggcgcagtgtggatgcc
ctgcagaccgccttcgctgatgcccgctgcttcagggacgtgccagcggccgcgctggcc
gagggccggggtcgcgtggcccacgcgcacgcctgcgtggacgagctgctggagctggtg
gtgcaggccgtgccgctgccctggctggtgggacccttcgcgcccatccttgtggagcga
cccgagcccctgcccgacctggcggacctggtggacgaggtcatcgggggccctgacccc
cgctgggcgcacctggactggccggcccagcagagagcctgggaggcagagcacagggac
gggagtgggaatggggatggggacaggatgggtgttgccggggacatctgcgagcaggaa
cccgagacccccagctgcccggtcaagcacaccctgatgcccgagctggacttctga

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_6|894_aa
MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAE
GADDLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKG
KQEGECRNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKH
ANVALFSDGMLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSH
VYFFFREIAMEFNYLEKVVVSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHF
YFNVLQAVTGVVSLGGRPVVLAVFSTPSNSIPGSAVCAFDLTQVAAVFEGRFREQKSPES
IWTPVPEDQVPRPRPGCCAAPGMQYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILR
TLMRHQLTRVAVDVGAGPWGNQTVVFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFET
YRPDRCGRPGGGETGQRLLSLELDAASGGLLAAFPRCVVRVPVARCQQYSGCMKNCIGSQ
DPYCGWAPDGSCIFLSPGTRYCGRGRAAFEQDVSGASTSGLGDCTGLLRASLSEDRAGLV
SVNLLVTSSVAAFVVGAVVSGFSVGWFVGLRERRELARRKDKEAILAHGAGEAVLSVSRL
GERRAQGPGGRGGGGGGGAGVPPEALLAPLMQNGWAKATLLQGGPHDLDSGLLPTPEQTP
LPQKRLPTPHPHPHALGPRAWDHGHPLLPASASSSLLLLAPARAPEQPPAPGEPTPDGRL
YAARPGRASHGDFPLTPHASPDRRRVVSAPTGPLDPASAADGLPRPWSPPPTGSLRRPLG
PHAPPAATLRRTHTFNSGEARPGDRHRGCHARPGTDLAHLLPYGGADRTAPPVP

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_6|2685_bp
atgcagaccccgcgagcgtcccctccccgcccggccctgctgcttctgctgctgctactg
gggggcgcccacggcctctttcctgaggagccgccgccgcttagcgtggcccccagggac
tacctgaaccactatcccgtgtttgtgggcagcgggcccggacgcctgacccccgcagaa
ggtgctgacgacctcaacatccagcgagtcctgcgggtcaacaggacgctgttcattggg
gacagggacaacctctaccgcgtagagctggagccccccacgtccacggagctgcggtac
cagaggaagctgacctggagatctaaccccagcgacataaacgtgtgtcggatgaagggc
aaacaggagggcgagtgtcgaaacttcgtaaaggtgctgctccttcgggacgagtccacg
ctctttgtgtgcggttccaacgccttcaacccggtgtgcgccaactacagcatagacacc
ctgcagcccgtcggagacaacatcagcggtatggcccgctgcccgtacgaccccaagcac
gccaatgttgccctcttctctgacgggatgctcttcacagctactgttaccgacttccta
gccattgatgctgtcatctaccgcagcctcggggacaggcccaccctgcgcaccgtgaaa
catgactccaagtggttcaaagagccttactttgtccatgcggtggagtggggcagccat
gtctacttcttcttccgggagattgcgatggagtttaactacctggagaaggtggtggtg
tcccgcgtggcccgagtgtgcaagaacgacgtgggaggctccccccgcgtgctggagaag
cagtggacgtccttcctgaaggcgcggctcaactgctctgtacccggagactcccatttc
tacttcaacgtgctgcaggctgtcacgggcgtggtcagcctcgggggccggcccgtggtc
ctggccgttttttccacgcccagcaacagcatccctggctcggctgtctgcgcctttgac
ctgacacaggtggcagctgtgtttgaaggccgcttccgagagcagaagtcccccgagtcc
atctggacgccggtgccggaggatcaggtgcctcgaccccggcccgggtgctgcgcagcc
cccgggatgcagtacaatgcctccagcgccttgccggatgacatcctcaactttgtcaag
acccaccctctgatggacgaggcggtgccctcgctgggccatgcgccctggatcctgcgg
accctgatgaggcaccagctgactcgagtggctgtggacgtgggagccggcccctggggc
aaccagaccgttgtcttcctgggttctgaggcggggacggtcctcaagttcctcgtccgg
cccaatgccagcacctcagggacgtctgggctcagtgtcttcctggaggagtttgagacc
taccggccggacaggtgtggacggcccggcggtggcgagacagggcagcggctgctgagc
ttggagctggacgcagcttcggggggcctgctggctgccttcccccgctgcgtggtccga
gtgcctgtggctcgctgccagcagtactcggggtgtatgaagaactgtatcggcagtcag
gacccctactgcgggtgggcccccgacggctcctgcatcttcctcagcccgggcaccagg
tactgcgggagagggagagccgcctttgagcaggacgtgtccggggccagcacctcaggc
ttaggggactgcacaggactcctgcgggccagcctctccgaggaccgcgcggggctggtg
tcggtgaacctgctggtaacgtcgtcggtggcggccttcgtggtgggagccgtggtgtcc
ggcttcagcgtgggctggttcgtgggcctccgtgagcggcgggagctggcccggcgcaag
gacaaggaggccatcctggcgcacggggcgggcgaggcggtgctgagcgtcagccgcctg
ggcgagcgcagggcgcagggtcccgggggccggggcggaggcggtggcggtggcgccggg
gttcccccggaggccctgctggcgcccctgatgcagaacggctgggccaaggccacgctg
ctgcagggcgggccccacgacctggactcggggctgctgcccacgcccgagcagacgccg
ctgccgcagaagcgcctgcccactccgcacccgcacccccacgccctgggcccccgcgcc
tgggaccacggccaccccctgctcccggcctccgcttcatcctccctcctgctgctggcg
cccgcccgggcccccgagcagccccccgcgcctggggagccgacccccgacggccgcctc
tatgctgcccggcccggccgcgcctcccacggcgacttcccgctcaccccccacgccagc
ccggaccgccggcgggtggtgtccgcgcccacgggccccttggacccagcctcagccgcc
gatggcctcccgcggccctggagcccgcccccgacgggcagcctgaggaggccactgggc
ccccacgcccctccggccgccaccctgcgccgcacccacacgttcaacagcggcgaggcc
cggcctggggaccgccaccgcggctgccacgcccggccgggcacagacttggcccacctc
ctcccctatgggggggcggacaggactgcgccccccgtgccctag

>gi568815579r:4437943_4638953|GENSCAN_predicted_peptide_7|411_aa
MGVEVSGSAKCCWERKTWRMNLKKTKAGNRSQGKGPGVRADHKSCHTGLLGVPPTRQNPK
ASEHPSNSKSPFRSSLGVKDDSLSSWGPPNPQAPTAAGGIRTGRPSPPTPLRPLPRGLDP
RLVLFWAGQEAGEDSGAQVRVLRAPAATDTFFPVRPGLRPRVRGGPGRPGGGQKAFRVPG
MGLIMIVTEDPGGLPGGGGSSILKCQGYSFIGTASVSEYVDLVNPSSFIPKVGKPWSFPE
PSTGQKPYTFPRRWKEDSLVLGTEDFDKLVMRTSGQNHVCRGKTQPYPSFLSCLALGAHL
PRGCQPLGSTLPTLLRHPPLCYFQIPHLLLPPRSGHTTGPQQFVVWLPGGRSSEVAMGSL
GPSCNRDPELGCRAEMVLQGTWEQLQGLVPRTHSCILPGGPQLALRSREAP

>gi568815579r:4437943_4638953|GENSCAN_predicted_CDS_7|1233_bp
atgggggtggaggtctcagggagtgccaagtgctgctgggaacggaagacctggcgaatg
aacttgaagaagacaaaggcgggaaataggtcccaggggaaaggccccggggtcagagcg
gaccacaagtcctgccatacgggcctacttggtgttcctccaacacgccagaaccccaaa
gcctcagagcaccccagtaactccaagtccccgttccgctccagtctaggggtcaaagat
gacagcctcagctcttgggggcccccgaacccccaggcccccactgcagcgggggggatc
cggaccggccggccctcccctccaacaccactgcgacccctgccccgcggcctggacccg
cgactcgtcctgttctgggctggacaggaggccggagaggactcgggcgcccaagtgcgg
gttttgcgggcgcccgcggccaccgacaccttcttcccagtgcggcccgggctgcggccc
cgggtccgaggaggcccggggagacccggaggaggtcagaaggccttcagggtccccggg
atgggcctaatcatgattgtcactgaagatccaggagggcttcctggaggaggaggcagc
tccattctgaaatgccaagggtacagctttattgggacagccagtgtctctgaatatgtt
gacctggtcaacccgtcctccttcattcccaaggtgggaaaaccgtggagcttcccagag
cccagcacagggcagaagccctacacgttccccaggagatggaaagaagattccctggtt
ctggggactgaagattttgacaagctggtaatgcgcacctctggtcagaaccacgtctgc
agaggaaagacccagccatatccctcttttctcagctgcctggctctgggtgcacatctc
ccccggggctgccaacctctgggctccacactccccactctcctgcggcacccccctctc
tgctacttccagatcccccacctgctcctgcctcccaggtcaggacacacaacaggccct
caacaatttgttgtctggcttcctggaggaagaagcagcgaggttgctatgggctcccta
ggccctagctgcaaccgggacccagagttgggctgcagggcggaaatggtgctgcagggg
acctgggagcagctccagggcctggttcccaggactcacagctgcatcctccccggggga
ccccagctggcgctgcgcagccgggaggctcct