GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:45:45

Sequence gi568815582f:29730550_29947986 : 217437 bp : 51.07% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   7391   7300   92  2  2   72   86   109 0.896   7.02
 1.00 Prom -   9588   9549   40                              -4.61

 2.04 PlyA -   9640   9635    6                               1.05
 2.03 Term -  14402  13728  675  2  0  109   41   300 0.048  21.55
 2.02 Intr -  18685  18482  204  1  0    4   71   165 0.132   6.32
 2.01 Init -  18728  18708   21  2  0   60  113    56 0.169   3.65
 2.00 Prom -  36637  36598   40                              -1.81

 3.00 Prom +  38397  38436   40                              -4.61
 3.01 Init +  48718  48772   55  0  1   54  100    64 0.719   3.62
 3.02 Intr +  48956  49088  133  0  1   86   90   146 0.738  14.81
 3.03 Term +  49555  49870  316  1  1   96   52   452 0.995  37.16
 3.04 PlyA +  49946  49951    6                               1.05

 4.00 Prom +  58840  58879   40                              -2.81
 4.01 Init +  60211  60280   70  0  1   55   80   102 0.564   5.65
 4.02 Intr +  66344  66539  196  0  1   90   72    91 0.859   6.69
 4.03 Intr +  67825  67952  128  1  2  127   81   116 0.982  15.63
 4.04 Intr +  68044  68198  155  2  2   91  100   116 0.999  13.40
 4.05 Intr +  68426  68635  210  1  0   69  100   237 0.996  22.63
 4.06 Intr +  68715  68945  231  2  0  103   71   251 0.989  23.40
 4.07 Intr +  69079  69232  154  0  1  123   81   146 0.999  17.56
 4.08 Intr +  69364  69499  136  2  1   59   99    51 0.355   3.43
 4.09 Intr +  72220  72388  169  1  1   22   94    96 0.034   4.06
 4.10 Intr +  72900  73059  160  2  1   59   97   142 0.996  12.27
 4.11 Intr +  73449  73558  110  1  2   81   72    91 0.914   7.30
 4.12 Intr +  74265  74477  213  2  0  116   69   273 0.999  27.64
 4.13 Intr +  74566  74625   60  0  0   80   67    79 0.954   4.42
 4.14 Intr +  76258  76344   87  0  0  132  109    17 0.850   8.86
 4.15 Intr +  76429  77279  851  0  2  107   70  1390 0.952 129.94
 4.16 Intr +  77681  77744   64  2  1   92   75     6 0.890  -1.09
 4.17 Intr +  78021  78192  172  2  1  120   83   153 0.893  18.03
 4.18 Term +  78991  79193  203  2  2   70   55   224 0.973  15.17
 4.19 PlyA +  80593  80598    6                               1.05

 5.00 Prom +  82213  82252   40                              -7.99
 5.01 Init +  82506  83384  879  2  0   80   89   437 0.819  37.62
 5.02 Intr +  83784  83916  133  2  1   70  101   259 0.999  26.12
 5.03 Intr +  86038  86458  421  2  1    6   37   568 0.002  37.08
 5.04 Intr +  86661  86743   83  0  2   82   87   127 0.560  11.78
 5.05 Term +  89006  89205  200  0  2  110   37   244 0.994  19.38
 5.06 PlyA +  89537  89542    6                               1.05

 6.00 Prom +  93382  93421   40                              -3.01
 6.01 Init + 100001 100125  125  1  2  104   90   264 0.999  27.81
 6.02 Intr + 100329 100524  196  0  1  120   77   194 0.970  21.44
 6.03 Intr + 103184 103307  124  1  1  107  119   109 0.988  16.56
 6.04 Intr + 103386 103517  132  1  0  124   75   240 0.999  27.42
 6.05 Intr + 105155 105249   95  0  2   65   81   133 0.985  10.48
 6.06 Intr + 106173 106409  237  2  0   87  100   289 0.994  28.34
 6.07 Intr + 109629 109910  282  2  0  103   90   415 0.925  41.36
 6.08 Intr + 111047 111291  245  1  2   96  117   402 0.997  40.73
 6.09 Intr + 111366 111563  198  0  0   92   58   203 0.999  16.69
 6.10 Intr + 113944 114330  387  1  0   79  108   570 0.999  52.57
 6.11 Intr + 115314 115430  117  0  0  120   90   180 0.999  21.48
 6.12 Intr + 115609 115735  127  1  1   97  113   186 0.999  23.19
 6.13 Intr + 116648 116836  189  1  0   87   94   245 0.999  25.30
 6.14 Term + 117213 117440  228  2  0  118   35   186 0.702  13.26
 6.15 PlyA + 122569 122574    6                               1.05

 7.07 PlyA - 123238 123233    6                               1.05
 7.06 Term - 128785 128640  146  2  2   90   48   298 0.998  24.38
 7.05 Intr - 128974 128893   82  1  1   74   63     6 0.998  -3.59
 7.04 Intr - 130113 130032   82  2  1   58   98   151 0.999  13.24
 7.03 Intr - 130710 130557  154  1  1   85   46   222 0.997  17.34
 7.02 Intr - 132171 132037  135  1  0   86   56   187 0.999  16.35
 7.01 Init - 132308 132266   43  2  1   90  109    68 0.999   9.64
 7.00 Prom - 133312 133273   40                               1.49

 8.19 PlyA - 133599 133594    6                               1.05
 8.18 Term - 141179 141150   30  2  0  124   46    50 0.989   2.54
 8.17 Intr - 141734 141638   97  1  1  125   86   204 0.999  24.41
 8.16 Intr - 141977 141860  118  0  1  109  108   167 0.999  20.83
 8.15 Intr - 142194 142156   39  1  0  125   86   112 0.999  13.38
 8.14 Intr - 142882 142691  192  2  0   70   46   330 0.961  26.98
 8.13 Intr - 143180 142989  192  0  0   80   76   274 0.987  25.38
 8.12 Intr - 146401 146207  195  2  0   94   44   220 0.999  18.11
 8.11 Intr - 146918 146722  197  1  2  100   79    99 0.995   9.78
 8.10 Intr - 147876 147738  139  1  1   95   42   135 0.932   9.63
 8.09 Intr - 149515 149315  201  2  0   74  105   226 0.974  22.68
 8.08 Intr - 155200 155037  164  0  2   80  115   251 0.996  27.13
 8.07 Intr - 157268 157100  169  0  1  119  107   199 0.943  24.42
 8.06 Intr - 158176 157991  186  2  0   55   92   171 0.945  14.38
 8.05 Intr - 164911 164710  202  1  1   91   72   147 0.909  12.78
 8.04 Intr - 165311 165172  140  0  2   58   93   228 0.942  20.99
 8.03 Intr - 166572 166273  300  1  0   90   94   118 0.478   9.85
 8.02 Intr - 167435 167304  132  0  0   23   94   140 0.998   9.22
 8.01 Init - 168470 168392   79  2  1  108   86    77 0.999   8.70
 8.00 Prom - 168555 168516   40                             -11.95

 9.00 Prom + 169051 169090   40                              -9.74
 9.01 Init + 170624 171371  748  1  1   61  106   530 0.791  45.09
 9.02 Intr + 174303 174341   39  1  0   84   89    50 0.801   3.38
 9.03 Term + 175239 175348  110  1  2   86   49   208 0.998  15.67
 9.04 PlyA + 175484 175489    6                              -3.94

10.09 PlyA - 175809 175804    6                               1.05
10.08 Term - 176559 176323  237  0  0   33   43   384 0.324  24.90
10.07 Intr - 180624 180429  196  2  1   71   77   421 0.989  39.24
10.06 Intr - 181318 181266   53  1  2  114   60   107 0.969   8.69
10.05 Intr - 181500 181411   90  0  0  113  100   112 0.991  15.59
10.04 Intr - 192810 192641  170  1  2   72   94   326 0.152  31.78
10.03 Intr - 194759 194653  107  1  2   68   66    20 0.547  -1.84
10.02 Intr - 195032 194990   43  0  1   77  103    50 0.636   3.19
10.01 Init - 195484 195241  244  1  1   79   80   497 0.995  43.89
10.00 Prom - 201561 201522   40                              -0.11

11.00 Prom + 201962 202001   40                              -1.61
11.01 Init + 210322 210468  147  0  0   65   80   121 0.884   9.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  30355  30400   46  0  1   63  115    -2 0.828   0.79
S.002 Term +  30562  30662  101  2  2   83   54    95 0.965   4.19
S.003 Term +  74714  74761   48  1  0  126   47    52 0.928   2.49
S.004 Term +  84079  84089   11  2  2  148   55    10 0.996   2.34
S.005 Init +  85977  86458  482  2  2   16   37   624 0.982  43.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_1|31_aa
MALWLYTHLLSLPATQPCSLTLFAGGMPSQS

>gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_1|93_bp
atggctctgtggctttacacacacctgctttccctccccgctacccagccgtgctctctg
actctctttgctggcggcatgccttctcaaagn

>gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_2|299_aa
MKLRTLTAACPEFVPSDVRTCSEFLPSGGFVVSLASGVKLQTFTMLQLIKAVQTQDKQQQ
DLLERSKEQSFLCGRMPLTPEPPSGRVEGPPAWEAAPWPSLPCGPCIPIMLVLATLAALF
ILTTAVLAERLFRRALRPDPSHRAPTLVWRPGGELWIEPMGTARERSEDWYGSAVPLLTD
RAPEPPTQVGTLEARATAPPAPSAPNSAPSNLGPQTVLEVPARSTFWGPQPWEGRPPATG
LVSWAEPEQRPEASVQFGSPQARRQRPGSPDPEWGLQPRVTLEQISAFWKREGRTSVGF

>gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_2|900_bp
atgaagctgcggaccctcacggcggcatgtccggagtttgttccttctgatgttcggacg
tgttccgagtttcttccttctggtgggttcgtggtctcgctggcctctggcgtgaagctg
cagaccttcacgatgttacagctcataaaggcagtgcagacccaggataagcagcagcaa
gatttattggaaagatcaaaagaacaaagcttcctatgtggaaggatgccgttgactcca
gagccgccctctgggcgcgtggaggggccccccgcatgggaagcagccccatggccctca
ctgccctgtgggccctgcatccccatcatgctggtcctggccaccctggctgcgctcttc
atcctcaccaccgctgtgttggctgaacgcctgttccgccgtgctctccgcccagacccc
agccaccgtgcacccaccctggtgtggcgcccaggaggagagctgtggattgagcccatg
ggcaccgcccgagagcgctctgaggactggtatggctctgcggtccccctgctgacagat
cgggcccctgagcctcccacccaggtgggcactttggaggcccgagcaacagccccacct
gccccctcagccccaaattctgctcccagcaacttgggcccccagaccgtactggaggtc
ccagcccggagcaccttctgggggccccagccctgggaggggaggccccccgccacaggc
ctggtgagctgggctgaacccgagcagaggccagaggccagcgtccagtttgggagcccc
caggccaggaggcagcggccagggagcccggatcctgagtggggcctccagccacgggtc
accttggagcagatctcagctttctggaagcgtgaaggccggaccagtgtggggttctga

>gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_3|167_aa MLTVALLALLCASASGNAIQARSSSYSGEYGGGGGKRFSHSGNQLDGPITALRVRVNTYY IVGLQVRYGKVWSDYVGGRNGDLEEIFLHPGESVIQVSGKYKWYLKKLVFVTDKGRYLSF GKDSGTSFNAVPLHPNTVLRFISGRSGSLIDAIGLHWDVYPSSCSRC >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_3|504_bp atgttgacagtcgctctcctagcccttctctgtgcctcagcctctggcaatgccattcag gccaggtcttcctcctatagtggagagtatggaggtggtggtggaaagcgattctctcat tctggcaaccagttggacggccccatcaccgccctccgggtccgagtcaacacatactac atcgtaggtcttcaggtgcgctatggcaaggtgtggagcgactatgtgggtggtcgcaac ggagacctggaggagatctttctgcaccctggggaatcagtgatccaggtttctgggaag tacaagtggtacctgaagaagctggtatttgtgacagacaagggccgctatctgtctttt gggaaagacagtggcacaagtttcaatgccgtccccttgcaccccaacaccgtgctccgc ttcatcagtggccggtctggttctctcatcgatgccattggcctgcactgggatgtttac cccagtagctgcagcagatgctga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_4|1122_aa MAAGGSTQQRRREMAAASAAAISGAGRCRLSKIGATRRPPPARVRVAVRLRPFVDGTAGA SDPPCVRGMDSCSLEIANWRNHQETLKYQFDAFYGERSTQQDIYAGSVQPILRHLLEGQN ASVLAYGPTGAGKTHTMLGSPEQPGVIPRALMDLLQLTREEGAEGRPWALSVTMSYLEIY QEKVLDLLDPASGDLVIREDCRGNILIPGLSQKPISSFADFERHFLPASRNRTVGATRLN QRSSRSHAVLLVKVDQRERLAPFRQREGKLYLIDLAGSEDNRRTGNKGLRLKESGAINTS LFVLGKVVDALNQGLPRVPYRDSKLTRLLQDSLGGSAHSILIANIAPERRFYLDTVSALN FAARSKEVINRPFTNESLQPHALGPVKLSQKELLGPPEAKRARGPEEEEIGSPEPMAAPA SASQKLSPLQKLSSMDPAMLERLLSLDRLLASQGSQGAPLLSTPKRERMVLMKTVEEKDL EIERLKTKQKELEAKMLAQKAEEKENHCPTMLRPLSHRTVTGAKPLKKAVVMPLQLIQEQ AASPNAEIHILKNKGRKRKVKVAGGLGYTWSPELESLDALEPEEKAEDCWELQISPELLA HGRQKILDLLNEGSARDLRSLQRIGPKKAQLIVGWRELHGPFSQVEDLERVEGITGKQME SFLKGHAQNPLQVGAELQSRFFASQGCAQSPFQAAPAPPPTPQAPAAEPLQVDLLPVLAA AQESAAAAAAAAAAAAAVAAAPPAPAAASTVDTAALKQPPAPPPPPPPVSAPAAEAAPPA SAATIAAAAATAVVAPTSTVAVAPVASALEKKTKSKGPYICALCAKEFKNGYNLRRHEAI HTGAKAGRVPSGAMKMPTMVPLSLLSVPQLSGAGGGGGEAGAGGGAAAVAAGGVVTTTAS GKRIRKNHACEMCGKAFRDVYHLNRHKLSHSDEKPYQCPVCQQRFKRKDRMSYHVRSHDG AVHKPYNCSHCGKSFSRPDHLNSHVRQVHSTERPFKCEKCEAAFATKDRLRAHTVRHEEK VPCHVCGKMLSSAYISDHMKVHSQGPHHVCELCNKGFTTAAYLRIHAVKDHGLQAPRADR 
ILCKLCSVHCKTPAQLAGHMQTHLGGAAPPVPGDAPQPQPTC >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_4|3369_bp atggccgcgggcggctcgacgcagcagaggcgacgcgagatggcggcagcttcagcggcg gcgatctcaggagctggtcgctgtcggctaagcaagattggagctactcgtcgtccacct ccagctcgcgtaagggtggctgtgcgactgcggccatttgtggatggaacagcgggagca agtgatcccccctgtgtgcggggcatggacagctgctctctagagattgctaactggagg aaccaccaggagactctcaaataccagtttgatgccttctatggggagaggagtactcag caggacatctatgcaggttcagtgcagcccatcctaaggcacttgctggaagggcagaat gccagtgtgcttgcctatggacccacaggagctgggaagacgcacacaatgctgggcagc ccagagcaacctggggtgatcccgcgggctctcatggacctcctgcagctcacaagggag gagggtgccgagggccggccatgggccctttctgtcaccatgtcttacctagagatctac caggagaaggtattagacctcctggaccctgcttcgggagacctggtaatccgagaagac tgccgggggaatatcctgattccgggtctctcccagaagcccatcagtagctttgctgat tttgagcggcacttcctgccagccagtcgaaatcggactgtaggagccacccggctcaac cagcgctcctcccgcagtcatgctgtgctcctggtcaaggtggaccagcgggaacgtttg gccccatttcgccagcgagagggaaaactctacctgattgacttggctgggtcagaggac aaccggcgcacaggcaacaagggccttcggctaaaagagagtggagccatcaacacctcc ctgtttgtcctgggcaaagtggtagatgcgctgaatcagggcctccctcgtgtaccttat cgggacagcaagctcactcgcctattgcaggactctctgggtggctcagcccacagtatc cttattgccaacattgcccctgagagacgcttctacctagacacagtctccgcactcaac tttgctgccaggtccaaggaggtgatcaatcggccttttaccaatgagagcctgcagcct catgccttgggacctgttaagctgtctcagaaagaattgcttggtccaccagaggcaaag agagcccgaggccctgaggaagaggagatcgggagccctgagcccatggcagctccagcc tctgcctcccagaaactcagccccctacagaagctaagcagcatggacccggccatgctg gagcgcctcctcagcttggaccgtctgcttgcctcccaggggagccagggggcccctctg ttgagtaccccaaagcgagagcggatggtgctaatgaagacagtggaagagaaggaccta gagattgagaggcttaagacgaagcaaaaagaactggaggccaagatgttggcccagaag gctgaggaaaaggagaaccattgtcccacaatgctccggcccctttcacatcgcacagtc acaggggcaaagcccctgaaaaaggctgtggtgatgcccctacagctaattcaggagcag gcagcatccccaaatgccgagatccacatcctgaagaataaaggccggaagagaaaggtg aaagtagctgggggcttaggctacacctggagcccagaactggagtccctggatgcccta gagcctgaggagaaggctgaggactgctgggagctacagatcagcccggagctactggct catgggcgccaaaaaatactggatctgctgaacgaaggctcagcccgagatctccgcagt 
cttcagcgcattggcccgaagaaggcccagctaatcgtgggctggcgggagctccacggc cccttcagccaggtggaggacctggaacgcgtggagggcataacggggaaacagatggag tccttcctgaagggtcacgcccagaaccccctgcaggtcggggctgagctccagtcccgc ttctttgcctcccagggctgcgcccagagtccattccaggccgcgccggcgcccccgccc acgccccaggccccggcggccgagcccctccaggtggacttgctcccggtgctcgccgcc gcccaggagtccgccgcggctgctgcggccgctgccgccgctgctgccgccgtcgctgcc gcgcccccggcccctgccgccgcctctacggtggacacagcggccctgaagcagcctccg gcgccccctccgccacccccgccagtgtcggcgcccgcggccgaggccgcgccccccgcc tccgccgccactatcgccgcggcggcggccaccgccgtcgtagccccaacctcgacggtc gccgtggccccggtcgcgtctgccttggagaagaagacaaagagcaaggggccctacatc tgcgctctgtgcgccaaggagttcaagaacggctacaatctccggaggcacgaagccatc cacacgggagccaaggccggccgggtcccctcgggtgctatgaagatgccgaccatggtg cccctgagcctcctgagcgtgccccagctgagcggagccggcgggggagggggagaggcg ggtgccggcggcggcgctgccgcagtggccgccggtggcgtggtgaccacgaccgcctcg gggaagcgcatccggaagaaccatgcctgcgagatgtgtggcaaggccttccgcgacgtc taccacctgaaccgacacaagctgtcgcactcggacgagaagccctaccagtgcccggtg tgccagcagcgcttcaagcgcaaggaccgcatgagctaccacgtgcgctcacatgacggc gctgtgcacaagccctacaactgctcccactgtggcaagagcttctcccggccggatcac ctcaacagtcacgtcagacaagtgcactcaacagaacggcccttcaaatgtgagaaatgt gaggcagctttcgccacgaaggatcggctgcgggcgcacacagtacgacacgaggagaaa gtgccatgtcacgtgtgtggcaagatgctgagctcggcttatatttcggaccacatgaag gtgcacagccagggtcctcaccatgtctgtgagctctgcaacaaaggcttcaccacggca gcatacctgcgcatccacgcggtgaaggaccacgggctccaggccccgcgggctgaccgc atcctgtgcaagctgtgcagcgtgcactgcaagacccctgcccagctggccggccacatg cagacccatctggggggggccgccccccctgtcccgggagacgccccccagccacagccc acctgctga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_5|571_aa MAASSSEISEMKGVEESPKVPGEGPGHSEAETGPPQVLAGVPDQPEAPQPGPNTTAAPVD SGPKAGLAPETTETPAGASETAQATDLSLSPGGESKANCSPEDPCQETVSKPEVSKEATA DQGSRLESAAPPEPAPEPAPQPDPRPDSQPTPKPALQPELPTQEDPTPEILSESVGEKQE NGAVVPLQAGDGEEGPAPEPHSPPSKKSPPANGAPPRVLQQLVEEDRMRRAHSGHPGSPR GSLSRHPSSQLAGPGVEGGEGTQKPRDYIILAILSCFCPMWPVNIVAFAYAVMSRNSLQQ GDVDGAQRLGRVAKLLSIVALVGGVLIIIASCVINLGEGEVTSGLQALAVEDTGGPSASA 
GKAEDEGEGGREETEREGSGGEEAQGEVPSAGGEEPAEEDSEDWCVPCSDEEVELPADGQ PWMPPPSEIQRLYELLAAHGTLELQAEILPRRPPTPEAQSEEERSDEEPEAKEEEEEKPH MPTEFDFDDEPVTPKDSLIDRRRTPGSSARSQKREARLDKVLSDMKRHKKLEEQILRTGR DLFSLDSEDPSPASPPLRSSGSSLFPRQRKY >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_5|1716_bp atggcagccagcagctctgagatctctgagatgaagggggttgaggagagtcccaaggtt ccaggcgaagggcctggccattctgaagctgaaactggccctccccaggtcctagcaggg gtaccagaccagccagaggccccgcagccaggtccaaacaccactgcggcccctgtggac tcagggcccaaggctgggctggctccagaaaccacagagaccccggctggggcctcagaa acagcccaggccacagacctcagcttaagcccaggaggggaatcaaaggccaactgcagc cccgaagacccatgccaagaaacagtgtccaaaccagaagtgagcaaagaggccactgca gaccaggggtccaggctggagtctgcagccccacctgaaccagccccagagcctgctccc caaccagacccccggccagattcccagcctacccccaagccagcccttcaaccagagctc cctacccaggaggaccccacccctgagattctgtctgagagtgtaggggaaaagcaagag aatggggcagtggtgcccctgcaggctggtgatggggaagagggcccagcccctgagcct cactcaccaccctcaaaaaaatcccccccagccaatggggcccccccccgagtgctgcag cagctggttgaggaggatcgaatgagaagggcacacagtgggcatccaggatctccccga ggtagcctgagccgccaccccagctcccagttggcaggtcctggggtggaggggggtgaa ggcacccagaaacctcgggactacatcatccttgccatcctgtcctgcttctgccccatg tggcctgtcaacatcgtggccttcgcttatgctgtcatgtcccggaacagcctgcagcag ggggacgtggacggggcccagcgtctgggccgggtagccaagctcttaagcatcgtggcg ctggtggggggagtcctcatcatcatcgcctcctgcgtcatcaacttaggcgaaggggaa gtgacctccggcctccaggctctggccgtggaggataccggaggcccctctgcctcggcc ggtaaggccgaggacgagggggaaggaggccgagaggagaccgagcgtgaggggtccggg ggcgaggaggcgcagggagaagtccccagcgctgggggagaagagcctgccgaggaggac tccgaggactggtgcgtgccctgcagcgacgaggaggtggagctgcctgcggatgggcag ccctggatgcccccgccctccgaaatccagcggctctatgaactgctggctgcccacggt actctggagctgcaagccgagatcctgccccgccggcctcccacgccggaggcccagagc gaagaggagagatccgatgaggagccggaggccaaagaagaggaagaggaaaaaccacac atgcccacggaatttgattttgatgatgagccagtgacaccaaaggactccctgattgac cggagacgcaccccaggaagctcagcccggagccagaaacgggaggcccgcctggacaag gtgctgtcggacatgaagagacacaagaagctggaggagcagatccttcgtaccgggagg gacctcttcagcctggactcggaggaccccagccccgccagccccccactccgatcctcc 
gggagtagtctcttccctcggcagcggaaatactga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_6|893_aa MATEEFIIRIPPYHYIHVLDQNSNVSRVEVGPKTYIRQDNERVLFAPMRMVTVPPRHYCT VANPVSRDAQGLVLFDVTGQVRLRHADLEIRLAQDPFPLYPGEVLEKDITPLQVVLPNTA LHLKALLDFEDKDGDKVVAGDEWLFEGPGTYIPRKEVEVVEIIQATIIRQNQALRLRARK ECWDRDGKERVTGEEWLVTTVGAYLPAVFEEVLDLVDAVILTEKTALHLRARRNFRDFRG VSRRTGEEWLVTVQDTEAHVPDVHEEVLGVVPITTLGPHNYCVILDPVGPDGKNQLGQKR VVKGEKSFFLQPGEQLEQGIQDVYVLSEQQGLLLRALQPLEEGEDEEKVSHQAGDHWLIR GPLEYVPSAKVEVVEERQAIPLDENEGIYVQDVKTGKVRAVIGSTYMLTQDEVLWEKELP PGVEELLNKGQDPLADRGEKDTAKSLQPLAPRNKTRVVSYRVPHNAAVQVYDYREKRARV VFGPELVSLGPEEQFTVLSLSAGRPKRPHARRALCLLLGPDFFTDVITIETADHARLQLQ LAYNWHFEVNDRKDPQETAKLFSVPDFVGDACKAIASRVRGAVASVTFDDFHKNSARIIR TAVFGFETSEAKGPDGMALPRPRDQAVFPQNGLVVSSVDVQSVEPVDQRTRDALQRSVQL AIEITTNSQEAAAKHEAQRLEQEARGRLERQKILDQSEAEKARKELLELEALSMAVESTG TAKAEAESRAEAARIEGEGSVLQAKLKAQALAIETEAELQRVQKVRELELVYARAQLELE VSKAQQLAEVEVKKFKQMTEAIGPSTIRDLAVAGPEMQVKLLQSLGLKSTLITDGSTPIN LFNTAFGLLGMGPEGQPLGRRVASGPSPGEGISPQSAQAPQAPGDNHVVPVLR >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_6|2682_bp atggcaactgaagagttcatcatccgcatccccccataccactatatccatgtgctggac cagaacagcaacgtgtcccgtgtggaggtcgggccaaagacctacatccggcaggacaat gagagggtactgtttgcccccatgcgcatggtgaccgtccccccacgtcactactgcaca gtggccaaccctgtgtctcgggatgcccagggcttggtgctgtttgatgtcacagggcaa gttcggcttcgccacgctgacctcgagatccggctggcccaggaccccttccccctgtac ccaggggaggtgctggaaaaggacatcacacccctgcaggtggttctgcccaacactgcc ctccatctaaaggcgctgcttgattttgaggataaagatggagacaaggtggtggcagga gatgagtggcttttcgagggacctggcacgtacatcccccggaaggaagtggaggtcgtg gagatcattcaggccaccatcatcaggcagaaccaggctctgcggctcagggcccgcaag gagtgctgggaccgggacggcaaggagagggtgacaggggaagaatggctggtcaccaca gtaggggcgtacctcccagcggtgtttgaggaggttctggatttggtggacgccgtcatc cttacggaaaagacagccctgcacctccgggctcggcggaacttccgggacttcagggga gtgtcccgccgcactggggaggagtggctggtaacagtgcaggacacagaggcccacgtg ccagatgtccacgaggaggtgctgggggttgtgcccatcaccaccctgggcccccacaac tactgcgtgattctcgaccctgtcggaccggatggcaagaatcagctggggcagaagcgc 
gtggtcaagggagagaagtcttttttcctccagccaggagagcagctggaacaaggcatc caggatgtgtatgtgctgtcggagcagcaggggctgctgctgagggccctgcagcccctg gaggagggggaggatgaggagaaggtctcacaccaggctggggaccactggctcatccgc ggacccctggagtatgtgccatctgccaaagtggaggtggtggaggagcgccaggccatc cctctagacgagaacgagggcatctatgtgcaggatgtcaagaccggaaaggtgcgcgct gtgattggaagcacctacatgctgacccaggacgaagtcctgtgggagaaagagctgcct cccggggtggaggagctgctgaacaaggggcaggaccctctggcagacaggggtgagaag gacacagctaagagcctccagcccttggcgccccggaacaagacccgtgtggtcagctac cgcgtgccccacaacgctgcggtgcaggtgtacgactaccgagagaagcgagcccgcgtg gtcttcgggcctgagctggtgtcgctgggtcctgaggagcagttcacagtgttgtccctc tcagctgggcggcccaagcgtccccatgcccgccgtgcgctctgcctgctgctggggcct gacttcttcacagacgtcatcaccatcgaaacggcggatcatgccaggctgcaactgcag ctggcctacaactggcactttgaggtgaatgaccggaaggacccccaagagacggccaag ctcttttcagtgccagactttgtaggtgatgcctgcaaagccatcgcatcccgggtgcgg ggggccgtggcctctgtcactttcgatgacttccataagaactcagcccgcatcattcgc actgctgtctttggctttgagacctcggaagcgaagggccccgatggcatggccctgccc aggccccgggaccaggctgtcttcccccaaaacgggctggtggtcagcagtgtggacgtg cagtcagtggagcctgtggatcagaggacccgggacgccctgcaacgcagcgtccagctg gccatcgagatcaccaccaactcccaggaagcggcggccaagcatgaggctcagagactg gagcaggaagcccgcggccggcttgagcggcagaagatcctggaccagtcagaagccgag aaagctcgcaaggaacttttggagctggaggctctgagcatggccgtggagagcaccggg actgccaaggcggaggccgagtcccgtgcggaggcagcccggattgagggagaagggtcc gtgctgcaggccaagctaaaagcacaggccttggccattgaaacggaggctgagctccag agggtccagaaggtccgagagctggaactggtctatgcccgggcccagctggagctggag gtgagcaaggctcagcagctggctgaggtggaggtgaagaagttcaagcagatgacagag gccataggccccagcaccatcagggaccttgctgtggctgggcctgagatgcaggtaaaa ctgctccagtccctgggcctgaaatcaaccctcatcaccgatggctccactcccatcaac ctcttcaacacagcctttgggctgctggggatggggcccgagggtcagcccctgggcaga agggtggccagtgggcccagccctggggaggggatatccccccagtctgctcaggcccct caagctcctggagacaaccacgtggtgcctgtactgcgctaa >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_7|213_aa MPDENIFLFVPNLIGYARIVFAIISFYFMPCCPLTASSFYLLSGLLDAFDGHAARALNQG 
TRFGAMLDMLTDRCSTMCLLVNLALLYPGATLFFQISMSLDVASHWLHLHSSVVRGSESH KMIDLSGNPVLRIYYTSRPALFTLCAGNELFYCLLYLFHFSEGPLVGSVGLFRMGLWVTA PIALLKSLISVIHLITAARNMAALDAADRAKKK >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_7|642_bp atgccagacgaaaatatcttcctgttcgtgcccaacctcatcggttatgcccggattgtc ttcgccatcatttctttctacttcatgccctgctgccccctcacggcctcctccttctac ctgctcagcggcctgctggacgctttcgatggacacgctgctcgcgctcttaatcaagga acccggtttggggccatgctggacatgctgacggaccgctgctccaccatgtgcctgttg gtcaacctggccctgctgtaccctggagccacgctgttcttccaaatcagcatgagtttg gatgtggccagtcactggctgcacctccacagttctgtggtccgaggcagtgagagtcac aagatgatcgacttgtccgggaatccggtgcttcggatctactacacctcgaggcctgct ctgttcaccttgtgtgctgggaatgagctcttctactgcctcctctacctgttccatttc tctgagggacctttagttggctctgtgggactgttccggatgggcctctgggtcactgcc cccatcgccttgctgaagtcgctcatcagcgtcatccacctgatcacggccgcccgcaac atggctgccctggacgcagcagaccgcgccaagaagaagtga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_8|923_aa MGTPRAQHPPPPQLLFLILLSCPWIQGLPLKEEEILPEPGSETPTVASEALAELLHGALL RRGPEMGYLPGSDRDPTLATPPAGQTLAVPSLPRATEPGTGPLTTAVTPNGVRGAGPTAP ELLTPPPGTTAPPPPSPASPGPPLGPEGGEEETTTTIITTTTVTTTVTSPVLCNNNISEG EGYVESPDLGSPVSRTLGLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELLVLAGGGSPGL APRLLANSSMLGEGQVLRSPTNRLLLHFQSPRVPRGGGFRIHYQAYLLSCGFPPRPAHGD VSVTDLHPGGTATFHCDSGYQLQGEETLICLNGTRPSWNGETPSCMASCGGTIHNATLGR IVSPEPGGAVGPNLTCRWVIEAAEGRRLHLHFERVSLDEDNDRLMVRSGGSPLSPVIYDS DMDDVPERGLISDAQSLYVELLSETPANPLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPE YRPGALATFSCLPGYALEPPGPPNAIECVDPTEPHWNDTEPACKAMCGGELSEPAGVVLS PDWPQSYSPGQDCVWGVHVQEEKRILLQVEILNVREGDMLTLFDGDGPSARVLAQLRGPQ PRRRLLSSGPDLTLQFQAPPGPPNPGLGQGFVLHFKEVPRNDTCPELPPPEWGWRTASHG DLIRGTVLTYQCEPGYELLGSDILTCQWDLSWSAAPPACQKIMTCADPGEIANGHRTASD AGFPVGSHVQYRCLPGYSLEGAAMLTCYSRDTGTPKWSDRVPKCALKYEPCLNPGVPENG YQTLYKHHYQAGESLRFFCYEGFELIGEVTITCVPGHPSQWTSQPPLCKVAYEELLDNRK LEVTQTTDPSRQLEGGNLALAILLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSHSYSPI TVESDFSNPLYEAGDTREYEVSI >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_8|2772_bp 
atggggactcccagggcccagcacccgccgcctccccagctgctgttcctaattctgctg agctgtccctggatccagggtctgcccctgaaggaggaggagatattgccagagcctgga agtgagacccccacggtggcctctgaggccctggctgaactgcttcatggggccctgctg aggaggggcccagagatgggctacctgccaggatctgatcgggaccccacgctagccacc cctccggccggccagactctcgcagtgccctccctgccacgggccactgagccggggaca gggcctctgacaacagccgtcacccctaacggggtcaggggggcaggccccactgcgcca gaactgctgaccccgcccccaggaaccacagccccacccccacccagccctgcctcccca gggcctccccttgggcctgagggaggagaggaggagacgacgaccaccatcatcaccacg acaactgttaccactacggtgaccagcccagttctgtgtaataacaacatctccgagggc gaagggtatgtggagtctccagatctggggagccccgtcagccgcaccctggggctcctg gactgcacttacagcatccatgtctaccctggctacggcattgagatccaggtgcagacg ctgaacctgtcacaggaagaggagctcctggtgctggctggtgggggatccccaggcctg gccccccgactcctggccaactcatccatgcttggagaaggacaagtccttcggagccca accaaccggctgcttctgcacttccagagcccacgggtcccaaggggcggtggcttcagg atccactatcaggcctacctcctgagctgtggcttccctccccggccggcccatggggac gtgagtgtgacggacctgcaccctgggggcactgccacctttcactgtgattcgggctac cagctgcagggagaggagaccctcatctgcctcaatggcacccggccatcctggaacggt gaaacccccagctgcatggcatcctgtggtggcaccatccacaatgccaccctgggccgc atcgtgtccccagagcctgggggagccgtagggcccaacctcacctgccgttgggtcatt gaagcagctgaggggcgccggctgcacctgcactttgaaagggtctcgctggatgaggac aatgaccggctgatggtgcgctcagggggcagccccctatcccccgtgatctatgattcg gacatggacgatgtccccgagcggggtctcatcagtgacgcccagtccctctacgtggag ctgctgtcagagacacctgccaatcccctgctgttaagccttcgatttgaagcctttgag gaggatcgctgcttcgcccccttcctggcacatggaaatgtcactaccacggaccctgag tatcgcccaggggcactggcaaccttctcgtgcctcccaggatatgccctggagccccct gggccccccaatgccatcgaatgtgtggatcccacagaaccccactggaacgacacagag ccggcctgcaaagccatgtgtggaggggagctgtcggaaccagctggcgtggtcctctct cccgactggccccagagctatagcccgggccaagactgcgtgtggggcgtgcacgtccag gaagagaagcgcatcttgctccaagttgagatattgaatgtgcgggaaggggacatgctg acgctgttcgacggggacggtcccagcgcccgagtcttggcccagctgcggggacctcag ccgcgccgccgccttctctcctctgggcccgacctcacactgcagtttcaggcaccgccc gggcccccaaatccaggcctgggccagggcttcgtattgcacttcaaagaggtcccgagg 
aacgacacgtgccccgagctgccacctccggagtggggctggagaacggcatcccacggg gacctgatccggggcacggtgctcacctaccagtgcgagcctggctacgagctgctaggc tccgacattctcacttgccagtgggacctgtcttggagcgccgcgccgcccgcctgccaa aagatcatgacttgtgctgaccctggcgagattgccaacgggcaccgcaccgcctcggac gccggcttccccgttggctcccacgtccagtaccgctgcctgccagggtacagcctcgag ggggcagccatgctcacctgctacagccgggacacaggcacacccaagtggagcgatagg gtccccaaatgcgccttgaagtacgagccgtgcctgaacccgggggttcccgagaatggc taccagacgctgtacaagcaccactaccaggcgggcgagtctctgcgcttcttctgctat gagggctttgagcttatcggcgaggtcaccatcacctgtgtgcccggccacccctcccag tggaccagccagcccccactctgcaaagttgcctatgaggagctcctggacaaccgaaaa ctggaagtgacccagaccacagatccatcacggcagctggaaggggggaacctggccctg gccatcctgctgcctctaggcttggtcattgtcctcggcagtggcgtttacatctactac accaagcttcagggaaagtcccttttcggcttctcgggctcccactcctacagccccatc accgtggagtcggacttcagcaacccgctgtatgaagctggggatacgcgggagtatgaa gtttccatctga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_9|298_aa MLPWPLPLASSALTLLFGALTSLFLWYCYRLGSQDMQALGAGSRAGGVRGGPVGCSEAGG PSPGGPGDPGEGPRTEGLVSRRLRAYARRYSWAGMGRVRRAAQGGPGPGRGPGVLGIQRP GLLFLPDLPSAPFVPRDAQRHDVELLESSFPAILRDFGAVSWDFSGTTPPPRGWSPPLAP GCYQLLLYQAGRCQPSNCRRCPGAYRALRGLRSFMSANTFGNAGFSVLLPGARLEGRCGP TNARVRCHLGLKIPPGCELVVGGSPEDGPRVVFIVDLWHPNVAGAERQALDFVFAPDP >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_9|897_bp atgctcccgtggccactacccctggcctcctcggccctcaccttgctcttcggggccctc acttccctgttcctctggtactgctaccgcctgggctcccaagacatgcaggccctaggg gctgggagccgagctgggggtgttcgtggtgggcctgtgggatgctcggaggccggcggg ccaagcccagggggtcctggggatcccggggaaggacctaggacggaaggcctagtgagc cggcggcttcgggcctacgcaaggcgctactcctgggctgggatgggtagagtgaggcgg gcagctcagggtggcccaggccctgggagagggccaggggtcctaggtattcagcgccca ggcctgcttttcctaccagacctgccttcagccccctttgtgccgcgggacgcccagcgg cacgacgtggagctcctggagagcagcttccctgccattttgcgggacttcggggctgtg agctgggacttctcagggactacccctccgcctcggggctggtccccacctctggccccc gggtgctaccagctcctgctgtaccaagcaggccggtgccaacccagcaactgccgccgg tgcccgggggcctatcgggcactgagggggcttcgaagctttatgagtgccaacaccttc 
ggcaatgccggcttttccgttctcctgcctggggcccggctcgagggccgctgtgggccc accaatgcccgggtcagatgccatctgggcctaaagatccctcctggctgtgagctggtg gtcggcggctcccccgaagatgggcctcgagtggtcttcatcgtggacctctggcacccc aacgtggcaggggctgagcgccaggccctcgactttgtcttcgccccagacccttga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_10|379_aa MSAEASGPAAAAAPSLEAPKPSGLEPGPAAYGLKPLTPNSKYVKLNVGGSLHYTTLRTLT GQDTMLKAMFSGRVEVLTDAGAITNTYGTPNCQALWSLYTLGIHLYTHIAEGGCFGFARD TRVEGGQVSLAGWVLIDRSGRHFGTILNYLRDGSVPLPESTRELGELLGEARYYLVQGLI EDCQLALQQKRETLSPLCLIPMVTSPREEQQLLASTSKPVVKLLHNRSNNKYSYTSTSDD NLLKNIELFDKLALRFHGRLLFLKDVLGDEICCWSFYGQGRKIAEVCCTSIVYATEKKQT KVEFPEARIFEETLNILIYETPRGPDPALLEATGGAAGAGGAGRGEDEENREHRVRRIHV RRHITHDERPHGQQIVFKD >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_10|1140_bp atgtcggcggaggcctcgggcccggctgccgccgcggccccgtccctggaagcccccaag ccctcgggtctcgagcctggccccgccgcctacggtctcaagccgctgaccccgaacagc aaatacgtgaagctgaacgtgggcggctcgttgcactacaccacgctgcgcaccctcacg ggacaggacaccatgctcaaagccatgttcagcggccgcgtggaggtgctgaccgatgcc ggagcaataaccaacacttacggaacccctaactgccaggcgctgtggtccctgtacact ctaggaatccacctgtatacacacattgcagaaggaggctgctttgggtttgccagggac accagagtggagggtggacaggtctccttggcaggttgggtgctgattgaccggagcggc cgtcactttggtacaatcctcaattacctgcgggatgggtctgtgccactgccggagagt acgagagaactgggggagctgctgggcgaagcacgctactacctggtgcagggcctgatt gaggactgccagctggcgctgcagcaaaaaagggagacgctgtccccgctgtgcctcatc cccatggtgacatctccccgggaggagcagcagctcctggccagcacctccaagcccgtg gtgaagctcctgcacaaccgcagtaacaacaagtactcctacaccagcacttcagatgac aacctacttaagaacatcgagctgttcgacaagctggccctgcgcttccacgggcggcta ctcttcctcaaggatgtcctgggggacgagatctgctgctggtctttctacgggcagggc cgcaaaatcgccgaggtgtgctgcacctccattgtctatgctacggagaagaagcagacc aaggtggaatttccagaggcccggatcttcgaggagaccctgaacatcctcatctacgag actccccggggcccagacccagccctcctggaggccacagggggagcagctggagctggt ggggctggccgcggggaggatgaagagaaccgagagcaccgtgtccgcaggatccatgtc cggcgccatatcacccacgacgagcgtcctcatggccaacaaattgtcttcaaggactga >gi568815582f:29730550_29947986|GENSCAN_predicted_peptide_11|49_aa 
MNKSSGNKLQLNETGLTTEESGGLSELRGLVPPPDFKLALEEEEVEVSQ >gi568815582f:29730550_29947986|GENSCAN_predicted_CDS_11|147_bp atgaacaaatcctcagggaacaagcttcaattgaatgaaactggcttgaccacagaggaa tctgggggactttctgagcttcgcggacttgtgccgcctcctgactttaagctggctctg gaggaggaggaagtggaggtatctcag