GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:12:38

Sequence gi568815597r:27269239_27474818 : 205580 bp : 49.36% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13001  13047   47  1  2  108   89    23 0.704   1.61
 1.02 Intr +  14100  14211  112  0  1   64  105   128 0.999  12.48
 1.03 Intr +  18436  18623  188  0  2   74  111   198 0.999  19.19
 1.04 Intr +  22977  23159  183  0  0   99   80   360 0.999  35.20
 1.05 Intr +  24784  24878   95  1  2  104   74    64 0.994   6.21
 1.06 Intr +  25276  25391  116  2  2   87   91    81 0.999   8.47
 1.07 Intr +  27088  27163   76  0  1  100   98    42 0.999   5.49
 1.08 Intr +  27810  27918  109  1  1   72   94   137 0.997  12.04
 1.09 Intr +  28700  28873  174  2  0  126   89   225 0.995  25.45
 1.10 Intr +  31988  32223  236  2  2   95   89   328 0.999  30.93
 1.11 Intr +  34383  34557  175  1  1   72   57   342 0.985  28.50
 1.12 Intr +  35763  35955  193  0  1   87   78   297 0.968  28.09
 1.13 Intr +  36948  37124  177  2  0  113   61   298 0.886  29.72
 1.14 Intr +  37462  37544   83  0  2   98   87     6 0.602  -0.06
 1.15 Intr +  53019  53153  135  0  0   61   94   255 0.072  23.08
 1.16 Term +  55997  56558  562  2  1   13   36   846 0.590  65.75
 1.17 PlyA +  57267  57272    6                               1.05

 2.00 Prom +  59820  59859   40                              -5.36
 2.01 Init +  61281  61288    8  2  2   74   53    17 0.910  -3.85
 2.02 Intr +  61482  61566   85  0  1  138   70   131 0.993  16.12
 2.03 Intr +  62832  62863   32  2  2  126  108    27 0.982   5.43
 2.04 Intr +  64720  64816   97  1  1   81   78   188 0.871  17.11
 2.05 Intr +  64913  65043  131  1  2   98   99   262 0.993  27.79
 2.06 Intr +  66141  66224   84  0  0   93   21   141 0.191   6.84
 2.07 Intr +  72786  72912  127  0  1   59   92    44 0.449   2.58
 2.08 Intr +  75730  75832  103  0  1   85   61    49 0.619   1.65
 2.09 Intr +  76059  76287  229  1  1  117  110   158 0.955  17.83
 2.10 Intr +  78183  78331  149  0  2  109   64   186 0.885  18.18
 2.11 Intr +  78570  78642   73  1  1   55   94    20 0.668  -2.24
 2.12 Intr +  78729  78774   46  0  1   86  107    22 0.680   2.31
 2.13 Intr +  79842  79914   73  2  1  144  113    30 0.659  10.08
 2.14 Intr +  80160  80260  101  1  2   70   96    85 0.999   7.33
 2.15 Intr +  80414  80527  114  1  0   79   89   117 0.993  11.54
 2.16 Intr +  80734  80894  161  0  2  123   86   317 0.999  33.79
 2.17 Intr +  81151  81247   97  1  1  107   76   152 0.999  15.91
 2.18 Intr +  81571  81714  144  0  0   93   76   102 0.705   9.98
 2.19 Intr +  82020  82098   79  2  1   72   96    72 0.999   5.52
 2.20 Intr +  82224  82317   94  1  1   35   77   113 0.744   3.92
 2.21 Intr +  84045  84250  206  0  2   68  131   244 0.973  25.54
 2.22 Term +  84475  84614  140  2  2   68   41   135 0.999   4.93
 2.23 PlyA +  84672  84677    6                               1.05

 3.36 PlyA -  85561  85556    6                               1.05
 3.35 Term -  86507  86361  147  2  0   37   39   183 0.987   6.20
 3.34 Intr -  86861  86788   74  0  2   65  101    79 0.999   6.03
 3.33 Intr -  87262  87150  113  0  2  103   89   207 0.996  22.32
 3.32 Intr -  87876  87734  143  0  2  121   37    79 0.492   5.25
 3.31 Intr -  88338  88162  177  0  0  127   77   167 0.998  19.62
 3.30 Intr -  88638  88473  166  2  1   59   80   270 0.951  23.26
 3.29 Intr -  89081  88943  139  0  1   93   97    34 0.843   4.32
 3.28 Intr -  89373  89181  193  0  1  138   49   102 0.920  10.37
 3.27 Intr -  89628  89471  158  1  2   74   94   116 0.988  10.53
 3.26 Intr -  90284  90179  106  2  1   43   92   193 0.973  14.99
 3.25 Intr -  90756  90620  137  1  2   12   80   210 0.980  12.89
 3.24 Intr -  91130  91003  128  1  2  133   67   121 0.993  14.82
 3.23 Intr -  91600  91467  134  1  2   72   80   320 0.991  29.04
 3.22 Intr -  91770  91683   88  2  1   90   78   106 0.996   9.77
 3.21 Intr -  92014  91820  195  0  0   90   47    90 0.486   3.63
 3.20 Intr -  92157  92108   50  0  2  119   59   -15 0.955  -3.82
 3.19 Intr -  92389  92282  108  1  0  128   72   165 0.992  19.38
 3.18 Intr -  92629  92467  163  0  1   63   73   138 0.990   9.78
 3.17 Intr -  93012  92853  160  1  1   87   78   280 0.996  25.95
 3.16 Intr -  93515  93403  113  1  2   99   71   140 0.999  13.42
 3.15 Intr -  93783  93613  171  2  0   77   63   209 0.997  16.36
 3.14 Intr -  94310  94204  107  2  2   89  117   199 0.968  21.91
 3.13 Intr -  94847  94679  169  1  1   27   96   389 0.995  33.55
 3.12 Intr -  95156  94966  191  2  2  137   23   172 0.991  13.98
 3.11 Intr -  95530  95423  108  1  0   51   97    42 0.522   1.88
 3.10 Intr -  95674  95535  140  2  2   99   77   151 0.994  15.28
 3.09 Intr -  97524  97020  505  0  1   11   80   593 0.134  43.35
 3.08 Intr - 100239 100019  221  1  2   65   -7   227 0.063   8.92
 3.07 Intr - 101492 101358  135  0  0  120  109   191 0.999  24.94
 3.06 Intr - 101734 101605  130  1  1  120   96    65 0.999  10.77
 3.05 Intr - 104025 103898  128  1  2   96  105    45 0.972   7.40
 3.04 Intr - 104282 104250   33  0  0   69  100    40 0.738   1.49
 3.03 Intr - 104771 104727   45  0  0   80  109    59 0.917   5.58
 3.02 Intr - 105213 105118   96  1  0  114  100    22 0.980   5.98
 3.01 Init - 105580 105490   91  1  1   85   80   165 0.999  14.05
 3.00 Prom - 108200 108161   40                              -5.76

 4.05 PlyA - 109240 109235    6                               1.05
 4.04 Term - 110957 110809  149  0  2  119   48   200 0.684  17.16
 4.03 Intr - 113161 113090   72  2  0  111  110    77 0.999  11.58
 4.02 Intr - 113429 113262  168  0  0   62   80   112 0.987   7.72
 4.01 Init - 114001 113914   88  1  1   71   53   110 0.943   4.64
 4.00 Prom - 115125 115086   40                              -5.86

 5.00 Prom + 119302 119341   40                              -7.06
 5.01 Sngl + 124561 125553  993  0  0   82   37  1418 0.413 132.79
 5.02 PlyA + 126553 126558    6                               1.05

 6.00 Prom + 132180 132219   40                              -7.56
 6.01 Init + 132457 132518   62  0  2   93   66    44 0.468   3.42
 6.02 Intr + 134325 134479  155  0  2   57   69    93 0.326   4.02
 6.03 Intr + 134532 134598   67  1  1   82  115    -4 0.453  -0.34
 6.04 Term + 137755 137899  145  1  1   97   51    70 0.300   1.58
 6.05 PlyA + 138378 138383    6                               1.05

 7.09 PlyA - 138855 138850    6                              -0.45
 7.08 Term - 139108 138951  158  2  2  101   35   358 0.990  29.90
 7.07 Intr - 140968 140454  515  0  2   68   94   196 0.817  10.81
 7.06 Intr - 143489 143334  156  1  0   75   99   125 0.976  11.43
 7.05 Intr - 145725 145595  131  0  2   80   69    70 0.979   3.79
 7.04 Intr - 146864 146747  118  1  1   94   72   133 0.998  12.77
 7.03 Intr - 149184 149031  154  1  1   76   95    36 0.955   2.23
 7.02 Intr - 149850 149716  135  1  0  116  116    91 0.998  15.24
 7.01 Init - 159652 159523  130  1  1  102   61   107 0.997   9.71
 7.00 Prom - 178849 178810   40                              -1.66

 8.04 PlyA - 182883 182878    6                               1.05
 8.03 Term - 200387 200264  124  1  1   94   42    28 0.366  -3.34
 8.02 Intr - 201752 201662   91  0  1   90   29    99 0.681   3.25
 8.01 Intr - 204768 204694   75  1  0   89   61    60 0.555   2.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100239  99998  242  1  2   65   49   268 0.932  16.79
S.002 Intr + 181950 181997   48  2  0  105  119    28 0.838   6.48

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_1|886_aa
GHSGCVNCLEWNEKGDLLASGSDDQHTIVWDPLHHKKLLSMHTGHTANIFSVKFLPHAGD
RILITGAADSKVHVHDLTVKETIHMFGDHTNRVKRIATAPMWPNTFWSAAEDGLIRQYDL
RENSKHSEVLIDLTEYCGQLVEAKCLTVNPQDNNCLAVGASGPFVRLYDIRMIHNHRKSM
KQSPSAGVHTFCDRQKPLPDGAAQYYVAGHLPVKLPDYNNRLRVLVATYVTFSPNGTELL
VNMGGEQVYLFDLTYKQRPYTFLLPRKCHSSGEVQNGKMSTNGVSNGVSNGLHLHSNGFR
LPESRGHVSPQVELPPYLERVKQQANEAFACQQWTQAIQLYSKAVQRAPHNAMLYGNRAA
AYMKRKWDGDHYDALRDCLKAISLNPCHLKAHFRLARCLFELKYVAEALECLDDFKGKFP
EQAHSSACDALGRDITAALFSKNDGEEKKGPGGGAPVRLRSTSRKDSISEDEMVLRERSY
DYQFRYCGHCNTTTDIKEANFFGSNAQYIVSGSDDGSFFIWEKETTNLVRVLQGDESIVN
CLQPHPSYCFLATSGIDPVVRLWNPRPESEDLTGRVVEDMEGASQANQRRMNADPLEVML
LNMGYRITGLSSGGAGASDDEDSSEGQVGRNKLNCLQGLSCPLAGSSRRSGIPNWMAEVE
APTAAETDMKQYQGSGGVAMDVERSRFPYCVVWTPIPVLTTYTKIGTIQRRLAWPLRKDD
MQIREAFHIFVSLDFEQEMATTTSSSLEKSYKLPDGQVITISNKRFQCPEALFQPSFLGM
ESCGIHETTFNSIMKCDVDIRKDLYTNIGLSRGTTMYPGITDRMQKEINALASSTMKIKL
IVPPECKYSVWISGSILASLSTFQQMWISKQEYDESGPCIVHRKCF
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_1|2661_bp
ggtcactcaggatgtgtcaactgtctggagtggaatgagaaaggagacttgctggcctct
ggttccgatgaccagcacacgattgtgtgggacccgctgcaccacaagaagctgctctcc
atgcacacgggacacaccgcaaatatcttctctgtcaagttcctgcctcacgctggggac
cgcatcttgatcacgggggcagccgactctaaggtgcatgtgcacgacctgacagtaaag
gagaccatccacatgtttggagaccacacaaaccgggtgaagcgcatcgccacagcgccc
atgtggcccaacacattctggagtgctgctgaggatgggcttatccgccagtatgacctt
cgagagaacagcaaacactcggaggtgctgattgacctgacagagtactgtggccagctg
gtggaggccaagtgcctcactgtcaacccccaggacaacaactgcctggcagttggggcc
agcgggcccttcgtgaggctctatgacatccgcatgatccataaccacagaaagagcatg
aagcagagcccttcagcgggtgtgcacaccttctgtgaccggcagaaaccccttccggac
ggtgcagcccagtattacgtagcaggtcacctgccagtgaagcttcctgactacaacaac
cgtttgagagtgctggttgccacctatgtgaccttcagccccaatggcacagagctacta
gtcaacatggggggggaacaggtctatttgtttgacttgacttacaagcagcggccgtac
accttcctcttgcctagaaaatgccactcctcgggggaagtccagaatggcaagatgtcc
accaacggtgtgtccaacggtgtgtccaatggcctgcaccttcatagcaatggcttccgg
ctgccggagagtaggggacatgtcagcccccaagtagagctaccaccatacctggagcgt
gtgaaacagcaagccaatgaggcttttgcctgccagcagtggacccaagccattcagctt
tacagcaaggctgtgcagagggcccctcacaatgccatgctttatggaaaccgagcagca
gcctacatgaagcgcaagtgggatggtgaccactatgatgccctgagggactgcctcaag
gccatctccctaaacccatgccacctgaaggcacactttcgcctggcccgctgcctcttt
gagctcaagtatgtggctgaagccctggagtgcctggacgacttcaaagggaaatttccg
gagcaggcccacagcagcgcttgtgatgcattgggccgcgacatcacagctgccctcttc
tctaaaaatgatggtgaggagaagaagggacctggtggcggcgccccagtccgcctccgc
agcacgagccgcaaggactccatctcagaggatgaaatggtgctgcgggagcgaagctac
gactatcagttccgctactgcggccactgcaacaccaccacggatatcaaagaggccaat
ttctttggcagcaacgctcagtatatcgtcagtggctctgacgatggctccttcttcatc
tgggaaaaggagaccaccaacctggtccgtgtgctccaaggggatgagtccattgtcaac
tgcctgcagccacaccccagctactgcttcctggccaccagtggcatcgatcctgttgtg
cggctctggaacccccgaccagagagtgaagacctcacaggccgagtcgtggaagatatg
gagggtgcttcacaggccaaccagcggcgcatgaatgcagacccgttggaggtgatgctg
ctcaacatgggctaccggatcacgggcctgagcagtgggggtgccggggcctctgatgat
gaggacagctctgagggccaggtggggaggaacaagctgaactgtcttcagggactgtcc
tgccctttagctggcagcagcaggaggagtgggattcccaattggatggcggaagtggag
gcgccgacggcggccgagacggacatgaagcaatatcaaggctccggcggcgtcgccatg
gatgtggaacggagtcgcttcccctactgcgtggtgtggacgcccatcccggtgctcacc
acatatactaaaattggaacgatacagagaagattagcatggcccctgcgcaaggatgac
atgcagattcgtgaagcgttccatatttttgtctccctggacttcgagcaggagatggcc
actaccacatcctcctccctggagaagagctacaagctgccggatggccaggtcatcacc
atcagcaacaagcggttccagtgtccggaggcgctgttccagccttccttcctgggtatg
gaatcttgcggcatccacgagaccacgttcaactccatcatgaagtgtgacgtagacatc
cgcaaagacctgtacaccaacatagggctatccagaggcaccaccatgtacccgggcatc
accgacaggatgcagaaggagattaacgccctggcatccagcaccatgaagatcaagctc
attgtgcccccagagtgcaagtactctgtgtggatcagcggctccatcctggcctcactg
tccaccttccagcagatgtggattagcaagcaggagtatgacgagtcaggcccctgtatc
gtccaccgcaaatgcttctaa
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_2|790_aa
MLRWFFPIIGHMGICTSTGVIRDFAGPYFVSEDNMAFGKPAKYWKLDPAQVYASGPNAWD
TAVHDASEEYKHRMHNLCCDNCHSHVALALNLMRYNNSTNWNMVTLCFFCLLYGKYVSVG
AFVKTWLPFILLLGIILTVSLVFNLRCWLGCLLSQPSSLTALLPVPETRLPGELEPPAKV
SSHTGPLQVSVCMCDHIYAYMDPRVHMCLCVPSLQSCVALPQGSSVCPAGAQPQLMPQRG
HPSQEGLWALPSLPMAHGPKPETEGLLDLSFLTEEEQEAIAGVLQRDARLRQLEEGRVSK
LRASVADPGQLKILTGDWFQEARSQRHHNAHFGSDLVRASMRRKKSTRGDQAPGHDREAE
AAVKEKEEGPEPRLTIDEAPQERLRETEGPDFPSPSVPLKASDPEEASQAQEDPGQGDQQ
VCAEEADPELEPASGGEQEPRPQQAQTKAASQILENGEEAPGPDPSLDRMLSSSSSVSSL
NSSTLSGSQMSLSGDAEAVQVRGSVHFALHYEPGAAELRVHVIQCQGLAAARRRRSDPYV
KSYLLPDKQSKRKTAVKKRNLNPVFNETLRAELQGRVLSLSVWHRESLGRNIFLGEVEVP
LDTWDWGSEPTWLPLQPRVPPSPDDLPSRGLLALSLKYVPAGSEGLPPSGELHFWVKEAR
DLLPLRAGSLDTYVQCFVLPDDSQASRQRTRVVRRSLSPVFNHTMVYDGFGPADLRQACA
ELSLWDHGALANRQLGGTRLSLGTGSSYGLQVPWMDSTPEEKQLWQALLEQPCEWVDGLL
PLRTNLAPRT
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_2|2373_bp
atgctgaggtggtttttccccatcatcggccacatgggcatctgcacatccacaggagtc
attcgggacttcgcgggcccctactttgtctcagaggacaacatggcctttggaaagcct
gccaagtactggaagttggaccctgctcaggtctatgctagcgggcccaacgcatgggac
acggctgtgcacgacgcctctgaggagtacaagcaccgcatgcacaatctctgctgtgac
aactgccactcgcacgtggcattggccctgaatctgatgcgctacaacaacagcaccaac
tggaatatggtgacgctctgcttcttctgcctgctctacgggaagtacgtcagcgttggg
gccttcgtgaagacctggctgcccttcatccttctcctgggcatcatcctcaccgtcagc
ctggtctttaacctccgctgctggctgggctgcctgttgagtcagccttcttccctcacg
gctcttctcccggtccctgaaactcggctgccaggggagctggagccacctgcgaaggtg
tcctcccatactggacccctacaggtgtctgtgtgcatgtgcgaccatatctatgcctac
atggacccgcgtgtgcacatgtgtctgtgtgttccatcgctgcagtcctgtgtggcccta
cctcagggaagctccgtgtgcccagctggggcacagccccagctgatgccccagaggggc
cacccatcgcaagaggggctttgggctctgccctccctccccatggcgcatgggccaaag
cctgagactgaaggactgttggacctcagcttcctgacagaggaggagcaggaggccatt
gctggcgtcctccaacgagatgcccgcctgcgccagctggaggaggggcgggtcagcaag
ctccgggcctcagtggcagaccctgggcagctgaagatcctgacaggggactggttccag
gaagcacgctcccagcggcaccacaatgcccacttcggctctgaccttgtccgagcgtct
atgcgcaggaagaagagcaccaggggagaccaggctccaggccacgacagggaggctgag
gctgctgtgaaagagaaggaagaggggccagagcccaggctcaccattgatgaggcccct
caggagaggctcagggagactgagggacctgatttcccatcgccttctgtccccctaaag
gcttcagatcctgaggaggcgtcccaggcccaggaagatcctggccaaggagaccaacag
gtctgtgccgaggaggctgacccggagctggagcccgcgtcggggggagagcaggagccg
cggccccagcaagcccagaccaaggccgcgtcccagatcctggagaatggggaggaggcc
ccggggcccgacccctctctcgaccgcatgctcagcagcagctcctcggtgtccagcctt
aactcctccacgctgagcggcagccagatgagcctgtcaggcgacgcggaggcggtgcag
gtccgcggctccgtgcacttcgcgctgcactacgagccgggcgccgccgagctgcgcgtg
cacgtgatccagtgccagggcctggccgccgcccggcgccgccgctcggacccctacgtc
aaaagctacctcctcccggataagcagagcaagcgcaagacggcggtgaagaaacggaat
ctgaatccggttttcaacgagactctccgggccgagcttcagggccgcgtgctgagcctg
tctgtgtggcaccgcgaaagcctgggtcgcaacatctttctgggcgaagttgaagtgccc
ctggacacgtgggactggggctctgagcccacctggctccccctgcagccccgggtccca
ccctctcccgacgaccttccgagccgcgggttactcgccctgtccctcaagtacgtcccc
gccggctccgagggactgcccccgagcggggagctgcacttctgggtgaaggaggctcgg
gacctcctgccgctgcgggcaggatccctggacacttacgtacaatgcttcgtgctgcct
gatgacagccaggccagccgccagcgtacaagggttgtgcgacgcagcctcagccctgtg
ttcaatcacaccatggtgtacgatggctttgggcctgctgacctgcgccaggcttgtgcc
gagctctccctctgggaccatggggccctggccaaccgccagctggggggcacacgcctc
agcctgggcaccggcagcagctatgggctgcaggtgccctggatggattccacacctgag
gagaagcagctgtggcaagccctcctggagcagccgtgcgaatgggtggatggccttcta
cccctcagaaccaacctggcccccaggacgtag
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_3|1653_aa
MDLLWILPSLWLLLLGGPACLKTQEHPSCPGPRELEASKVVLLPSCPGAPGSPGEKGAPG
PQGPPGPPGKMGPKGEPGDPVNLLRCQEGPRNCRELLSQGATLSGWYHLCLPEGRALPVF
CDMDTEGGGWLVFQRRQDGSVDFFRSWSSYRAGFGNQESEFWLGNENLHQLTLQGNWELR
VELEDFNGNRTFAHYATFRLLGEVDHYQLALGKFSEGTAGDSLSLHSGRPFTTYDADHDS
SNSNCAVIVHGAWWYASCYRSNLNGRYAVSEAAAHKYGIDWASGRGVGHPYRRIPGTGRQ
AVFRIPRSRGRFSDSERIPDPKIPDLPAQAPAPPPVPSARRPAPERPRMAGPCPRSGAER
AGSCWQDPLAVALSRGRQLAAPPGRGCARSRPLSVVYVLTREPQPGLEPREGTEAEPLPL
RCLREACAQVPRPRPPPQLRSLPFGTLELGDTAALDAFYNADVVVLEVSSSLVQPSLFYH
LGVRESFSMTNNVLLCSQADLPDLQALRVAWGWWRQAQGWGKAYTEHQYLTVCLLQEDVF
QKNSDCVGSYTLIPYVVTATGRVLCGDAGLLRGLADGLVQAGVGTEALLTPLVGRLARLL
EATPTDSCGYFRETIRRDIRQARERFSGPQLRQELARLQRRLDSVELLSPDIIMNLLLSY
RDVQDYSAIIELVETLQALPTCDVAEQHNVCFHYTFALNRRNRPGDRAKALSVLLPLVQL
EGSVAPDLYCMCGRIYKDMFFSSGFQDAGHREQAYHWYRKAFDVEPSLHSGINAAVLLIA
AGQHFEDSKELRLIGMKLGCLLARKGCVEKMQYYWDVGFYLGAQILANDPTQVVLAAEQL
YKLNAPIWYLVSVMETFLLYQHFRPTPEPPGGPPRRAHFWLHFLLQSCQPFKTACAQGDQ
CLVLVLEMNKVLLPAKLEVRGTDPVSTVTLSLLEPETQDIPSSWTFPVASICGVSASKRD
ERCCFLYALPPAQDVQLCFPSVGHCQWCAAFHGRGTLRARMPGGEWGKERDDVRWMLRLR
FCGLIQAWVTNPDSTAPAEEAEGAGEMLEFDYEYTETGERLVLGKGTYGVVYAGRDRHTR
VRIAIKEIPERDSRFSQPLHEEIALHRRLRHKNIVRYLGSASQGGYLKIFMEEVPGGSLS
SLLRSVWGPLKDNESTISFYTRQILQGLGYLHDNHIVHRDIKGDNVLINTFSGLLKISDF
GTSKRLAGITPCTETFTGTLQYMAPEIIDQGPRGYGKAADIWSLGCTVIEMATGRPPFHE
LGSPQAAMFQVGMYKVHPPMPSSLSAEAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKR
SRSPSSPRHAPRPSDAPSASPTPSANSTTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQL
RVPEEPAAEEPASPEESSGLSLLHQESKRRAMLAAVLEQELPALAENLHQEQKQEQGARL
GRNHVEELLRCLGAHIHTPNRRQLAQELRALQGRLRAQGLGPALLHRPLFAFPDAVKQIL
RKRQIRPHWMFVLDSLLSRAVRAALGVLGPGRRGMPRASCQRGLREILAGKEREYQALVQ
RALQRLNEEARTYVLAPEPPTALSTDQGLVQWLQELNVDSGTIQMLLNHSFTLHTLLTYA
TRDDLIYTRIRYILGPPSVNHAYGLILANCRPA
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_3|4962_bp
atggatctactgtggatcctgccctccctgtggcttctcctgcttggggggcctgcctgc
ctgaagacccaggaacaccccagctgcccaggacccagggaactggaagccagcaaagtt
gtcctcctgcccagttgtcccggagctccaggaagtcctggggagaagggagccccaggt
cctcaagggccacctggaccaccaggcaagatgggccccaagggtgagccaggagatcca
gtgaacctgctccggtgccaggaaggccccagaaactgccgggagctgttgagccagggc
gccaccttgagcggctggtaccatctgtgcctacctgagggcagggccctcccagtcttt
tgtgacatggacaccgaggggggcggctggctggtgtttcagaggcgccaggatggttct
gtggatttcttccgctcttggtcctcctacagagcaggttttgggaaccaagagtctgaa
ttctggctgggaaatgagaatttgcaccagcttactctccagggtaactgggagctgcgg
gtagagctggaagactttaatggtaaccgtactttcgcccactatgcgaccttccgcctc
ctcggtgaggtagaccactaccagctggcactgggcaagttctcagagggcactgcaggg
gattccctgagcctccacagtgggaggccctttaccacctatgacgctgaccacgattca
agcaacagcaactgtgcagtgattgtccacggtgcctggtggtatgcatcctgttaccga
tcaaatctcaatggtcgctatgcagtgtctgaggctgccgcccacaaatatggcattgac
tgggcctcaggccgtggtgtgggccacccctaccgcaggattcctggaacgggacgtcag
gccgtattccggatccctagatcccgtgggagattctccgattccgaacggattccagat
cccaagattcctgatctgccggcccaggctcctgccccgcccccggttcccagtgcgcgg
cgccccgcgcctgagcgcccccgcatggcggggccgtgtccccggtccggggcggagcgc
gccggcagctgctggcaggacccgctggccgtggcgctgagccggggccggcagctcgcg
gcgcccccgggccggggctgcgcgcggagccggccgctcagcgtggtctacgtgctgacc
cgggagccgcagcccgggctcgagcctcgggagggaaccgaggcggagccgctgcccctg
cgctgcctgcgcgaggcttgcgcgcaggtcccccggccgcggccgcccccgcagctgcgc
agcctgcccttcgggacgctggagctaggcgacaccgcggctctggatgccttctacaac
gcggatgtggtggtgctggaggtgagcagctcgctggtacagccctccctgttctaccac
cttggtgtgcgtgagagcttcagcatgaccaacaatgtgctcctctgctcccaggccgac
ctccctgacctgcaggccctgcgggtggcctggggctggtggaggcaggcacagggctgg
ggtaaggcctacacagaacaccagtatctgactgtctgcctcctgcaggaggatgttttc
cagaagaactcggattgcgttggcagctacacactgatcccctatgtggtgacggccact
ggtcgggtgctgtgtggtgatgcaggccttctgcggggcctggctgatgggctggtacag
gctggagtggggaccgaggccctgctcactcccctggtgggccggcttgcccgcctgctg
gaggccacacccacagactcttgtggctatttccgggagaccattcggcgggacatccgg
caggcgcgggagcggttcagtgggccacagctgcggcaggagctggctcgcctgcagcgg
agactggacagcgtggagctgctgagccccgacatcatcatgaacttgctgctctcctac
cgcgatgtgcaggactactcggccatcattgagctggtggagacgctgcaggccttgccc
acctgtgatgtggccgagcagcataatgtctgcttccactacacttttgccctcaaccgg
aggaacaggcctggggaccgggcgaaggccctgtctgtgctgctgccgctggtacagctt
gagggctctgtggcgcccgatctgtactgcatgtgtggccgtatctacaaggacatgttc
ttcagctcgggtttccaggatgctgggcaccgggagcaggcctatcactggtatcgcaag
gcttttgacgtagagcccagccttcactcaggcatcaatgcagctgtgctcctcattgct
gccgggcagcactttgaggattccaaagagctccggctaataggcatgaagctgggctgc
ctgctggcccgcaaaggctgcgtggagaagatgcagtattactgggatgtgggtttctac
ctgggagcccagatcctcgccaatgaccccacccaggtggtgctggctgcagagcagctg
tataagctcaatgcccccatatggtacctggtgtccgtgatggagaccttcctgctctac
cagcacttcaggcccacgccagagccccctggagggccaccacgccgtgcccacttctgg
ctccacttcttgctacagtcctgccaaccattcaagacagcctgtgcccagggcgaccag
tgcttggtgctggtcctggagatgaacaaggtgctgctgcctgcaaagctcgaggttcgg
ggtactgacccagtaagcacagtgaccctgagcctgctggagcctgagacccaggacatt
ccctccagctggaccttcccagtcgcctccatatgcggagtcagcgcctcaaagcgcgac
gagcgctgctgcttcctctatgcactccccccggctcaggacgtccagctgtgcttcccc
agcgtagggcactgccagtggtgcgcagcctttcatggtcggggtactctgagggccagg
atgcccgggggggagtgggggaaagaaagggacgacgtccgttggatgctccggcttagg
ttctgcggcctgatccaggcctgggtgacgaacccggattccacggcgcccgcggaggag
gcggagggcgcgggggagatgttggagtttgattatgagtacacggagacgggcgagcgg
ctggtgctgggcaagggcacgtatggggtggtgtacgcgggccgcgatcgccacacgagg
gtgcgcatcgccatcaaggagatcccggagcgggacagcaggttctctcagcccctgcat
gaagagatcgctcttcacagacgcctgcgccacaagaacatagtgcgctatctgggctca
gctagccagggcggctaccttaagatcttcatggaggaagtgcctggaggcagcctgtcc
tccttgctgcggtcggtgtggggacccctgaaggacaacgagagcaccatcagtttctac
acccgccagatcctgcagggacttggctacttgcacgacaaccacatcgtgcacagggac
ataaaaggggacaatgtgctgatcaacaccttcagtgggctgctcaagatttctgacttc
ggcacctccaagcggctggcaggcatcacaccttgcactgagaccttcacaggaactctg
cagtatatggccccagaaatcattgaccagggcccacgcgggtatgggaaagcagctgac
atctggtcactgggctgcactgtcattgagatggccacaggtcgcccccccttccacgag
ctcgggagcccacaggctgccatgtttcaggtgggtatgtacaaggtccatccgccaatg
cccagctctctgtcggccgaggcccaagcctttctcctccgaacttttgagccagacccc
cgcctccgagccagcgcccagacactgctgggggaccccttcctgcagcctgggaaaagg
agccgcagccccagctccccacgacatgctccacggccctcagatgccccttctgccagt
cccactccttcagccaactcaaccacccagtctcagacattcccgtgccctcaggcaccc
tctcagcacccacccagccccccgaagcgctgcctcagttatgggggcaccagccagctc
cgggtgcccgaggagcctgcggccgaggagcctgcgtctccggaggagagttcggggctg
agcctgctgcaccaggagagcaagcgtcgggccatgctggccgcagtattggagcaggag
ctgccagcgctggcggagaatctgcaccaggagcagaagcaagagcagggggcccgtctg
ggcagaaaccatgtggaagagctgctgcgctgcctcggggcacacatccacactcccaac
cgccggcagctcgcccaggagctgcgggcgctgcaaggacggctgagggcccagggcctt
gggcctgcgcttctgcacagaccgctgtttgccttcccggatgcggtgaagcagatcctc
cgcaagcgccagatccgtccacactggatgttcgttctggactcactgctcagccgtgct
gtgcgggcagccctgggtgtgctaggaccgggtaggaggggaatgccacgggcctcctgc
cagcgagggctgcgcgaaatcctggcggggaaggaacgggagtaccaggccctggtgcag
cgggctctacagcggctgaatgaggaagcccggacctatgtcctggccccagagcctcca
actgctctttcaacggaccagggcctggtgcagtggctacaggaactgaatgtggattca
ggcaccatccaaatgctgttgaaccatagcttcaccctccacactctgctcacctatgcc
actcgagatgacctcatctacacccgcatcaggtacatcctgggccccccaagtgtgaac
catgcatatggcctcatcctggccaactgcaggcctgcctag
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_4|158_aa
MEAPGPRALRTALCGGCCCLLLCAQLAVAGKGARGFGRGALIRLNIWPAVQGACKQLEVC
EHCVEGDGARNLSSCVWEQCRPEEPGHCVAQSEVVKEGCSIYNRSEACPGSPPVPEAHSP
GFDGASFIGGVVLVLSLQAVAFFVLHFLKAKDSTYQTL
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_4|477_bp
atggaggctccgggaccccgcgccttgcggactgcgctctgtggcggctgttgctgcctc
ctcctatgtgcccagctggctgtggctggtaaaggagctcgaggctttgggaggggagcc
ctgatccgcctgaatatctggccggcggtccaaggggcctgcaaacagctggaggtctgt
gagcactgcgtggagggagacggagcgcgcaatctctccagctgcgtgtgggagcagtgc
cggccagaggagccaggacactgtgtggcccaatctgaggtggtcaaggaaggttgctcc
atctacaaccgctcagaggcatgtccagggagccccccagtccctgaggcccacagccct
ggatttgacggggccagctttatcggaggtgtcgtgctggtgttgagcctacaggcggtg
gctttctttgtgctgcacttcctcaaggccaaggacagcacctaccagacgctgtga
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_5|330_aa
MMWGAGSPLAWLSAGSGNVNVSSVGPAEGPTGPAAPLPSPKAWDVVLCISGTLVSCENAL
VVAIIVGTPAFRAPMFLLVGSLAVADLLAGLGLVLHFAAVFCIGSAEMSLVLVGVLAMAF
TASIGSLLAITVDRYLSLYNALTYYSETTVTRTYVMLALVWGGALGLGLLPVLAWNCLDG
LTTCGVVYPLSKNHLVVLAIAFFMVFGIMLQLYAQICRIVCRHAQQIALQRHLLPASHYV
ATRKGIATLAVVLGAFAACWLPFTVYCLLGDAHSPPLYTYLTLLPATYNSMINPIIYAFR
NQDVQKVLWAVCCCCSSSKIPFRSRSPSDV
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_5|993_bp
atgatgtggggtgcaggcagccctctggcctggctctcagctggctcaggcaacgtgaat
gtaagcagcgtgggcccagcagaggggcccacaggtccagccgcaccactgccctcgcct
aaggcctgggatgtggtgctctgcatctcaggcaccctggtgtcctgcgagaatgcgcta
gtggtggccatcatcgtgggcactcctgccttccgtgcccccatgttcctgctggtgggc
agcctggccgtggcagacctgctggcaggcctgggcctggtcctgcactttgctgctgtc
ttctgcatcggctcagcggagatgagcctggtgctggttggcgtgctggcaatggccttt
accgccagcatcggcagtctactggccatcactgtcgaccgctacctttctctgtacaat
gccctcacctactattcagagacaacagtgacacggacctatgtgatgctggccttagtg
tggggaggtgccctgggcctggggctgctgcctgtgctggcctggaactgcctggatggc
ctgaccacatgtggcgtggtttatccactctccaagaaccatctggtagttctggccatt
gccttcttcatggtgtttggcatcatgctgcagctctacgcccaaatctgccgcatcgtc
tgccgccatgcccagcagattgcccttcagcggcacctgctgcctgcctcccactatgtg
gccacccgcaagggcattgccacactggccgtggtgcttggagcctttgccgcctgctgg
ttgcccttcactgtctactgcctgctgggtgatgcccactctccacctctctacacctat
cttaccttgctccctgccacctacaactccatgatcaaccctatcatctacgccttccgc
aaccaggatgtgcagaaagtgctgtgggctgtctgctgctgctgttcctcttccaagatc
cccttccgatcccgctcccccagtgatgtctag
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_6|142_aa
MVQRQSGGATYIITVNSLLNSRRDPQSPTLLPPLSRGPLHCVCWARNWVSAHKLPPGTVT
RSSCARNNRCFLGRRRGVQQPKPFIGFHCKSFYLREAGPKAMWLAGDRKGATKNDKRGYE
GLSSTDRAGTDPEPSYWDPNLW
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_6|429_bp
atggtgcaacgccagagtggaggggctacatacatcatcacagtcaactctttgttgaat
agcaggcgtgacccgcagtcaccaactctgctcccgcccctctcccgggggccactacac
tgtgtctgctgggcccggaactgggtgtctgctcataaattacccccagggacagttacc
cgcagcagctgtgctcggaacaaccgttgcttcctgggcaggagaaggggcgtgcagcag
cctaaaccttttatcggattccattgtaaatcgttctatttaagagaggctggtcccaag
gctatgtggttggctggtgacagaaaaggggcaacaaagaatgacaagagaggctatgaa
ggtctgagcagcaccgacagggctggcacagaccctgagccttcctactgggaccccaac
ctatggtga
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_7|498_aa
MPLVTRNIEPRHLCRQTLPSVRSELECVTNITLANVIRQLGSLSKYAEDIFGELFTQANT
FASRVSSLAERVDRLQVKVTQLDPKEEEVSLQGINTRKAFRSSTIQDQKLFDRNSLPVPV
LETYNTCDTPPPLNNLTPYRDDGKEALKFYTDPSYFFDLWKEKMLQDTKDIMKEKRKHRK
EKKDNPNRGNVNPRKIKTRKEEWEKMKMGQEFVESKEKLGTSGYPPTLVYQNGSIGCVEN
VDASSYPPPPQSDSASSPSPSFSEDNLPPPPAEFSYPVDNQRGSGLAGPKRSSVVSPSHP
PPAPPLGSPPGPKPGFAPPPAPPPPPPPMIGIPPPPPPVGFGSPGTPPPPSPPSFPPHPD
FAAPPPPPPPPAADYPTLPPPPLSQPTGGAPPPPPPPPPPGPPPPPFTGADGQPAIPPPL
SDTTKPKSSLPAVSDARSDLLSAIRQGFQLRRVEEQREQEKRDVVGNDVATILSRRIAVE
YSDSEDDSSEFDEDDWSD
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_7|1497_bp
atgccgttagtaacgaggaacatcgagccaaggcacctgtgccgtcagacgttgcctagc
gttagaagcgagctggaatgcgtgaccaacatcaccctggcaaatgtcatccgacagctg
ggcagcctgagtaaatatgcagaggacatttttggagagctctttactcaggcaaatacc
tttgcctctcgggtaagctcccttgctgagagggtcgaccgactacaggttaaagtcact
cagctggatcccaaggaagaagaagtgtcactgcaaggaatcaacacccgaaaagccttc
agaagttccaccattcaagaccagaagctttttgacagaaactctctcccagtgcctgtc
ttagaaacatacaatacctgtgatactcctccccctctcaacaatcttaccccttacagg
gacgatggaaaagaggcactcaaattctacacagacccttcatacttctttgatctttgg
aaggagaagatgctgcaggacaccaaggatatcatgaaagagaagagaaagcataggaaa
gaaaagaaagataatccaaatcgagggaatgtaaacccacgtaaaatcaagacacgtaag
gaagagtgggagaaaatgaagatggggcaagaatttgtggagtccaaagaaaagctgggg
acttctgggtatccacccactttggtgtaccagaatggcagcattggctgtgttgaaaac
gtggatgcaagtagctatccgccaccaccacagtcagactctgcttcttcaccttctcct
tccttctccgaggacaacttgcctcctccaccagcagaattcagttacccagtggacaac
caaagaggatctggtttggctggacccaaaagatccagtgtggtcagcccaagccatcca
ccaccagctcctcctctaggctctccaccaggccctaaacccgggtttgctccaccacct
gcccctccgccacctccgcctccaatgataggcatcccacctccaccaccgcctgtagga
tttgggtctccagggacgcctccaccaccctcacccccatctttcccacctcaccctgat
tttgctgcccctccacctcctcctccaccaccagcagctgactacccaactctgccacca
cctcccttgtcccagccaacaggaggagcacctcctcctccccctcctcctcctcctccg
gggccccctcctccccctttcactggtgcagatggccagcctgctataccaccaccgctt
tctgataccaccaagcccaagtcctccttgcctgccgtgagcgatgcccgtagcgacctg
ctttcagccatccgtcaaggttttcagctgcgcagggttgaggagcagcgggaacaagag
aagcgggatgttgtgggcaatgacgtggccaccatcttgtctcgtcgcattgctgttgag
tacagtgactcagaagatgactcctctgaatttgatgaggacgactggtccgattaa
>gi568815597r:27269239_27474818|GENSCAN_predicted_peptide_8|96_aa
XTLVSSRITELFVFDELVSLNGPSLTPVPEFLLDRFTDILLAPPAFHELSAFENLWYFSD
FSYMTKVLKQKLVLVFLCIFYNVSAFVLNSYSINIC
>gi568815597r:27269239_27474818|GENSCAN_predicted_CDS_8|291_bp
ngtactctggtttcctcccgcatcacagaattgttcgtgtttgatgaattggtgtctcta
aatggtcccagtctgactccagtaccagaattcctattggaccgcttcactgatatccta
cttgcacctccagcttttcatgagctgtctgcttttgagaacctgtggtacttttctgat
ttctcctatatgactaaagtcctgaaacagaaactggttcttgtatttctgtgcatcttc
tacaatgttagcgcctttgtgctcaatagttactcaataaatatttgctga