GENSCAN 1.0	Date run:  2-Nov-116	Time: 21:38:35

Sequence gi568815576f:30594910_31006336 : 411427 bp : 47.54% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12423  12486   64  2  1   88   89    85 0.977   7.96
 1.02 Intr +  15962  16154  193  0  1   92   34   150 0.984   8.55
 1.03 Intr +  17964  18133  170  0  2   49   85   208 0.841  16.19
 1.04 Intr +  19440  19592  153  1  0  124   92   169 0.999  20.94
 1.05 Intr +  20362  20564  203  2  2   93   75   189 0.860  17.10
 1.06 Intr +  20692  20878  187  0  1   80   53   110 0.993   5.96
 1.07 Intr +  22421  22586  166  0  1   69   57   180 0.391  12.02
 1.08 Intr +  28059  28174  116  0  2  106  113    56 0.778   9.99
 1.09 Intr +  41528  42160  633  0  0   46  106   737 0.734  63.41
 1.10 Term +  51689  52122  434  0  2  103   53   485 0.997  41.96
 1.11 PlyA +  55591  55596    6                               1.05

 2.07 PlyA -  56290  56285    6                               1.05
 2.06 Term -  56716  56705   12  1  0  105   32     9 0.239  -4.80
 2.05 Intr -  59769  59364  406  2  1   36  -19   360 0.544  14.45
 2.04 Intr -  69066  68532  535  1  1   81   48   806 0.003  67.88
 2.03 Intr -  84235  84122  114  2  0   24   53   133 0.625   3.82
 2.02 Intr -  84719  84524  196  2  1   67   54   141 0.763   7.59
 2.01 Init -  88879  88874    6  1  0   71  110     0 0.516   1.49
 2.00 Prom -  93740  93701   40                              -2.16

 3.05 PlyA -  93937  93932    6                               1.05
 3.04 Term -  99788  99651  138  2  0   98   45   116 0.926   6.26
 3.03 Intr - 100456  99967  490  0  1   40    3   433 0.507  23.01
 3.02 Intr - 100742 100539  204  1  0   42   64   128 0.765   4.12
 3.01 Init - 115547 115483   65  2  2   90   80    94 0.971   9.42
 3.00 Prom - 115692 115653   40                              -5.86

 4.04 PlyA - 117037 117032    6                               1.05
 4.03 Term - 122117 121942  176  0  2   40   49   153 0.964   4.52
 4.02 Intr - 123538 123512   27  2  0  128  116    -4 0.893   4.49
 4.01 Init - 126681 126675    7  0  1   73   97     5 0.590   0.72
 4.00 Prom - 129492 129453   40                              -6.16

 5.00 Prom + 132509 132548   40                              -4.16
 5.01 Init + 136340 136425   86  1  2   82   76    49 0.689   3.41
 5.02 Intr + 146252 146593  342  2  0   86  100   261 0.459  21.65
 5.03 Intr + 148814 148926  113  2  2   91   78    96 0.704   8.92
 5.04 Intr + 158195 158244   50  0  2   31   78    34 0.059  -4.90
 5.05 Intr + 161956 162115  160  0  1  110   32    91 0.179   5.26
 5.06 Intr + 167692 167771   80  2  2   97   66    43 0.314   2.27
 5.07 Intr + 175674 175772   99  2  0   75   85    38 0.196   2.51
 5.08 Intr + 200328 200350   23  2  2  109   82    35 0.002   1.34
 5.09 Intr + 208115 208268  154  2  1   77   84    75 0.002   6.07
 5.10 Intr + 215541 215678  138  2  0   74   72   130 0.138  10.66
 5.11 Term + 220489 220551   63  0  0   95   38    22 0.035  -4.21
 5.12 PlyA + 221224 221229    6                               1.05

 6.06 PlyA - 221366 221361    6                               1.05
 6.05 Term - 226558 226368  191  2  2   84   42    69 0.568  -0.49
 6.04 Intr - 227314 227004  311  0  2   53  100   173 0.277  10.96
 6.03 Intr - 227781 227638  144  2  0   47   60    96 0.488   2.10
 6.02 Intr - 231206 231065  142  0  1   65   84    91 0.610   5.81
 6.01 Init - 237177 237099   79  0  1   62   77    59 0.307   3.42
 6.00 Prom - 248648 248609   40                              -2.26

 7.00 Prom + 263667 263706   40                              -4.76
 7.01 Init + 265008 265139  132  2  0   86   54   127 0.824   7.26
 7.02 Intr + 271232 271408  177  1  0   58   55    82 0.375   2.02
 7.03 Intr + 273410 273580  171  1  0   97   49    39 0.191   1.04
 7.04 Intr + 273603 273633   31  2  1   59   57    52 0.390  -3.10
 7.05 Intr + 275520 275773  254  1  2  127   92   464 0.964  47.85
 7.06 Intr + 279708 279884  177  2  0   91   84    69 0.974   6.92
 7.07 Intr + 286822 286893   72  0  0   85   94     6 0.181   0.50
 7.08 Intr + 290227 290304   78  0  0  109   76    -6 0.300   0.05
 7.09 Intr + 292517 292709  193  1  1  103   96   282 0.987  29.57
 7.10 Intr + 293314 293431  118  2  1   92  110   221 0.995  24.22
 7.11 Intr + 294268 294325   58  1  1   70  101    66 0.533   4.99
 7.12 Intr + 294581 294727  147  1  0   64   80   221 0.938  19.33
 7.13 Intr + 295819 296064  246  0  0  133   94   519 0.996  54.66
 7.14 Intr + 298213 298333  121  0  1   68   71   118 0.585   8.17
 7.15 Intr + 298554 298657  104  1  2  121   71   173 0.987  18.79
 7.16 Intr + 298729 298824   96  0  0   73  121   179 0.999  19.91
 7.17 Intr + 298908 299092  185  2  2   95   78   226 0.272  20.89
 7.18 Intr + 310928 311160  233  2  2   42  100   539 0.283  47.92
 7.19 Intr + 325506 325620  115  1  1   79  117    40 0.824   5.51
 7.20 Term + 327338 327503  166  2  1   -1   43   183 0.443   2.29
 7.21 PlyA + 329230 329235    6                               1.05

 8.28 PlyA - 329425 329420    6                              -1.95
 8.27 Term - 331962 331894   69  0  0  112   49   201 0.999  16.54
 8.26 Intr - 333298 333110  189  1  0   96   83   262 0.954  26.28
 8.25 Intr - 337543 337450   94  0  1  105  105    40 0.998   7.37
 8.24 Intr - 337860 337636  225  2  0  103   45   270 0.999  21.30
 8.23 Intr - 338121 337980  142  1  1   56   55   238 0.951  16.71
 8.22 Intr - 338611 338557   55  1  1   72  105    57 0.999   4.35
 8.21 Intr - 339282 339151  132  0  0  102  105   140 0.999  17.94
 8.20 Intr - 340252 339872  381  1  0   72   97   186 0.980  13.01
 8.19 Intr - 340413 340339   75  0  0   73   57    91 0.728   4.21
 8.18 Intr - 341734 341602  133  0  1  120   98   172 0.865  22.05
 8.17 Intr - 342128 342023  106  0  1   97  105    29 0.999   4.77
 8.16 Intr - 342802 342674  129  2  0   94  101   163 0.999  18.87
 8.15 Intr - 343060 342906  155  0  2   98   20   273 0.750  21.22
 8.14 Intr - 343296 343156  141  2  0   72   84   173 0.999  14.77
 8.13 Intr - 344797 344712   86  1  2   88  101   113 0.988  11.22
 8.12 Intr - 345132 345050   83  1  2   72   86    68 0.945   4.36
 8.11 Intr - 345928 345804  125  0  2   56   51    98 0.788   3.13
 8.10 Intr - 346649 346524  126  1  0  110   91   184 0.999  20.69
 8.09 Intr - 347093 346982  112  0  1   30   94    85 0.999   2.74
 8.08 Intr - 347362 347203  160  1  1  129  115   187 0.998  25.06
 8.07 Intr - 350619 350544   76  2  1   44   91    10 0.442  -3.58
 8.06 Intr - 351540 351367  174  2  0  111   38   209 0.656  17.25
 8.05 Intr - 354933 354843   91  1  1  132   47   134 0.986  12.75
 8.04 Intr - 355536 355468   69  1  0   84  108    65 0.606   7.25
 8.03 Intr - 361888 361854   35  0  2  102  111    27 0.644   4.27
 8.02 Intr - 363785 363732   54  1  0   81   95    41 0.441   2.19
 8.01 Init - 369310 369303    8  1  2  110   26     2 0.173  -3.44
 8.00 Prom - 370000 369961   40                              -6.36

 9.03 PlyA - 370305 370300    6                               1.05
 9.02 Term - 374817 374424  394  2  1  103   41   289 0.900  20.21
 9.01 Init - 378085 378024   62  1  2   48   62    74 0.470   1.52
 9.00 Prom - 403499 403460   40                              -1.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  72694  73326  633  0  0   69   37   225 0.922  11.69

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_1|772_aa
MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL
SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR
GHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL
YAVEPFHQGHHSVAHCMFCPPLQDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEI
LKAQTPEGHFGNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMIS
QLLPVLNHKTYIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVL
AGSTVEDVLKKAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQASL
VRMCRCPPEHHDGRMTSAEVGAAAGGAQAAGPPEWPPGSPQALRQPGRARVAMAALVWLL
AGASMSSLNKWIFTVHGFGRPLLLSALHMLVAALACHRGARRPMPGGTRCRVLLLSLTFG
TSMACGNVGLRAVPLDLAQLVTTTTPLFTLALSALLLGRRHHPLQLAAMGPLCLGAACSL
AGEFRTPPTGCGFLLAATCLRGLKSVQQSALLQEERLDAVTLLYATSLPSFCLLAGAALV
LEAGVAPPPTAGDSRLWACILLSCLLSVLYNLASFSLLALTSALTVHVLGNLTVVGNLIL
SRLLFGSRLSALSYVGIALTLSGMFLYHNCEFVASWAARRGLWRRDQPSKGL

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_1|2319_bp
atgaggcaccttggggccttcctcttccttctgggggtcctgggggccctcactgagatg
tgtgaaataccagagatggacagccatctggtagagaagttgggccagcacctcttacct
tggatggaccggctttccctggagcacttgaaccccagcatctatgtgggcctacgcctc
tccagtctgcaggctgggaccaaggaagacctctacctgcacagcctcaagcttggttac
cagcagtgcctcctagggtctgccttcagcgaggatgacggtgactgccagggcaagcct
tccatgggccagctggccctctacctgctcgctctcagagccaactgtgagtttgtcagg
ggccacaagggggacaggctggtctcacagctcaaatggttcctggaggatgagaagaga
gccattgggcatgatcacaagggccacccccacactagctactaccagtatggcctgggc
attctggccctgtgtctccaccagaagcgggtccatgacagcgtggtggacaaacttctg
tatgctgtggaacctttccaccagggccaccattctgtggctcattgcatgttctgtccc
ccacttcaagacacagcagccatggcaggcttggcattcacctgtctgaagcgctcaaac
ttcaaccctggtcggagacaacggatcaccatggccatcagaacagtgcgagaggagatc
ttgaaggcccagacccccgagggccactttgggaatgtctacagcaccccattggcatta
cagttcctcatgacttcccccatgcgtggggcagaactgggaacagcatgtctcaaggcg
agggttgctttgctggccagtctgcaggatggagccttccagaatgctctcatgatttcc
cagctgctgcccgttctgaaccacaagacctacattgatctgatcttcccagactgtctg
gcaccacgagtcatgttggaaccagctgctgagaccattcctcagacccaagagatcatc
agtgtcacgctgcaggtgcttagtctcttgccgccgtacagacagtccatctctgttctg
gccgggtccaccgtggaagatgtcctgaagaaggcccatgagttaggaggattcacatat
gaaacacaggcctccttgtcaggcccctacttaacctccgtgatggggaaagcggccgga
gaaagggagttctggcagcttctccgagaccccaacaccccactgttgcaagcctcactg
gtgcggatgtgccgctgcccgccggagcaccatgatggcaggatgacctcagccgaagta
ggagcagcagctggtggtgctcaggcggctgggccccccgagtggccccctggcagccct
caggccctccggcagcctggccgggcccgagtggccatggcagcactggtgtggctgctg
gcgggagccagcatgtcaagcctcaacaagtggatcttcacagtgcacggctttgggcgg
cccctgctgctgtcggccctgcacatgctggtggcagccctggcatgccaccggggggca
cggcgccccatgccaggcggcactcgctgccgagtcctactgctcagtctcacctttggc
acgtccatggcctgcggcaacgtgggcctaagggctgtgcccctggacctggcacaactg
gttactaccaccacacctctgttcaccctggccctgtcggcgctgctgctgggccgccgc
caccacccacttcagttggccgccatgggtccgctctgcctgggggccgcctgcagcctg
gctggagagttccggacaccccctaccggctgtggcttcctgctcgcagccacctgcctc
cgcggactcaagtcggttcagcaaagtgccctgctgcaggaggagaggctggacgcggtg
accctgctttacgccacctcgctgcccagcttctgcctgctggcgggtgcagccctggtg
ctggaggctggcgttgccccaccgcccactgctggcgactctcgcctctgggcctgcatc
ctgctcagctgcctcctgtctgttctctataacctggccagcttctccctgctggccctc
acctctgccctcaccgtccacgtcctgggcaacctcaccgtggtgggcaacctcatcctg
tcccggctgttgtttggcagccgcctcagtgccctcagctacgtgggcatcgcactcact
ctttcaggaatgttcctttaccacaactgcgagttcgtggcctcctgggctgcccgtcgg
gggctgtggcggagggaccagcccagcaagggtctttga

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_2|422_aa
MRTSSSPQPGVGSDHPNIPGCPTWGDKDPGEGTLEHVKAEDTLSQDTQERRTSKRSSAGK
LSTGKNQDHEADSDEEEEDKCKKLTSDSECEEQLPEEMKERKTEKIQFRQPSVSGLSQIT
KSLYISNGVAANNKLMLSSNQITMVINVSVEVVNTLYEDIQYMQVPVADSPNSRLCDFFD
PIADHIHSVEMKQGRTLLHCAAGVSRSAALCLAYLMKYHAMSLLDAHTWTKSCRPIIRPN
SGFWEQLIHYEFQLFGKNTVHMVSSPVGMIPDIYEKEVRLMIPLKKLKYLAFLHKWMNSN
PSRGTYHFWAPTTSGSPAASSGGPRGMLPRETKEARPPKTASRCLTASHHPMTKKQVVVP
AALQVVHLKPTRKFAYLGQLDHKVGWKYQTVTATLEEKRKEKAKIHYQNKKQLMGWPGAG
PD

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_2|1269_bp
atgaggacttcgtcgtccccacaacctggtgttgggtctgatcaccccaacattcctggc
tgcccaacgtggggcgacaaagaccccggtgaaggaacactagagcatgtgaaagcggag
gacacattgtcacaggacacccaagaacgtcgaacgtctaaaagaagctcggcaggaaag
ctgagcactgggaagaaccaggatcatgaggcagattcagatgaggaagaggaggacaag
tgtaaaaaactaacttcagattctgagtgtgaggaacagctaccggaggagatgaaagaa
aggaaaactgaaaaaattcagttccggcagccctcagtcagcggcctctcgcagataacc
aaaagcctgtatatcagcaatggtgtggccgccaacaacaagctcatgctgtctagcaac
cagatcaccatggtcatcaatgtctcagtggaggtagtgaacaccttgtatgaggatatc
cagtacatgcaggtacctgtggctgactcccctaactcacgtctctgtgacttctttgac
cctattgctgaccatatccacagcgtggagatgaagcagggccgtactttgctgcactgt
gctgctggtgtgagccgctcagctgccctgtgcctcgcctacctcatgaagtaccacgcc
atgtccctgctggacgcccacacgtggaccaagtcatgccggcccatcatccgacccaac
agcggcttttgggagcagctcatccactatgagttccaattgtttggcaagaacactgtg
cacatggtcagttccccagtgggaatgatccctgacatctatgagaaggaagtccgtttg
atgattccactaaagaagttgaagtacctggccttcctccacaagtggatgaacagcaac
ccttcccgaggcacctaccacttctgggcccccaccacttccgggtccccagccgcatct
tctggcggaccccgaggcatgctgccccgcgagaccaaggaagccaggccgcccaagacc
gcctcaaggtgtttaacggcatcccaccaccctatgacgaaaaagcaggtggtggttcct
gctgccctccaggttgtgcatctgaagcctacaagaaagtttgcctacctggggcagctg
gatcacaaagttggctggaagtaccagacagtgacagccaccctggaggagaagaggaag
gagaaggccaagatccactaccagaataagaaacagctcatgggatggccgggcgcgggt
ccagactaa

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_3|298_aa
MSIFKGNDSSGKHIKHADIYERLVAVSANHGGPPSPSLQHHHPPTPMSHPGASIPVVREQ
TIAQHEPAALVALQIVGPLEKPALEAVQRQRGRGFNAGSDPDSGLRKVALPTARESGSDA
ALVKGPAPTPELDSDPGRDPCSSSDGCPAPGSGSDVVSDTGSDLGTASETASDTCSDSGP
RSGSGTGWGWGLGSGPEPDVEALMPGAAVWHDRQGTTVNSDESPRERPPQPPPRLGAAAF
PIEPADPRAAERPVRPVEGALTTPVGGRAWWSVNSEGDGALQATVSASAQPCQALEEQ

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_3|897_bp
atgagcatctttaagggtaatgacagtagtgggaaacacatcaaacatgctgatatctat
gagagactagtggcagtgtccgccaaccatggaggccctccatcaccatccctgcagcat
caccaccctccaacccccatgtcccaccctggcgcttccatacctgtagtaagagagcaa
accattgcccagcacgaaccagcggcgctggtagcccttcagatagttggtccacttgag
aagccagccctcgaagctgtccagaggcaaagaggcaggggctttaacgctggcagcgat
cctgactcgggtctgagaaaggtcgcgctccccaccgcccgggagagcggctccgatgcg
gccttagtgaagggcccagcccctacacctgagcttgactctgaccccggccgcgacccc
tgcagcagttccgatggctgcccagcccctggctccggctcagacgtcgtctcggacaca
ggttccgatcttggcactgcctccgaaactgcctccgacacctgttctgacagcggtccc
cgctccggttcgggcacgggctggggctggggcttgggctccggcccggagccggacgtg
gaagcgctcatgcccggcgccgccgtgtggcacgacaggcaggggacaaccgtgaacagc
gacgagagcccgcgggagcggccgccacagccgccgcctcggctcggagccgccgctttc
cccatagagccggccgacccgcgcgcggccgagcggccagtccggccggtggaaggagct
ctgacaacacctgtgggcggaagagcttggtggtcggtcaactccgaaggggatggcgcg
cttcaggccacggtctctgcttcggcccagccgtgccaggcgctggaggaacaatga

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_4|69_aa
MADGFPSNFQKEKTIGIKFHDIGFGNDFLDMTPKAQATKAKTDKWDSIEGHNLSSLKDTI
NKVKRKLME

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_4|210_bp
atggcagatggtttcccatcaaattttcagaaagagaaaaccataggcataaagtttcat
gacattggatttggcaatgatttcttggatatgacaccaaaagcacaggcaacaaaagca
aaaacagacaaatgggactccatcgaaggacacaacttaagttccttgaaggacacaatc
aacaaagtaaaaaggaaacttatggaatag

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_5|435_aa
MASGRGEIKVTTQLIPLLDETSQRATETRNQGEMAHTCRGTINLSTAHIDTEDSCGILLT
SGARSYHLKASSEVDRQQWITALELAKAKAVRVMNTHSGSEHLGQRVCIWWCHESGGSGN
CKPLVGSGGQETQASVDQSGRSRNFLKIMQQVAVMGLVIRQQFSYDSQCEGFLLQLAGCG
ALYELKLQEQGVAAPTEAGVIGPFAEHTYTCYAATFCSDPPSHADNTPQTQGICCVVTPS
NHADHTPQTQDPSSFCCLHWHTILLPASVNAVPLVGKGQLVAGNKTPLRLAETQVGPVLR
VQEVEHPRGEGFDYDVYRLGEAVAAGTAPRWTESAAAGRRRRRRQQQQLQEPSQPRGRTH
VHTREPLPGSRPDEVSADACLDNRTGSLSWAATAINLKVTGVHGHELQPEAWLQRDTYSF
DIFCTSSVLLGKGFA

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_5|1308_bp
atggcaagtggcagaggagagatcaaagtgaccacccagctcattcccttgctggatgag
acttcacagcgagctacagagacacgaaatcagggtgaaatggcccacacgtgccgtgga
accatcaacctgtccaccgcgcacattgacacggaggactcttgtggtatcttgctgacc
agtggggccaggagctaccacctcaaggccagctcagaggtggaccggcagcagtggatc
accgccctggagctggccaaggccaaggctgtccgcgtgatgaacactcattcaggtagt
gagcacttgggacagcgggtgtgtatatggtggtgtcatgagtctggaggaagtgggaat
tgcaagccactggtgggcagtggtggccaggagacccaggccagcgtggatcaatctgga
aggtcaaggaacttcttgaaaataatgcagcaggtggcagtcatgggattagtcatccgg
caacagttttcctacgacagccagtgcgaggggtttcttttgcagttggcagggtgcgga
gccctatatgaactcaaacttcaggagcaaggtgtggcagctcccacagaggcaggggtg
atagggccctttgctgagcacacgtacacatgctatgcagccacgttctgcagtgaccct
cccagccatgcagacaacactcctcaaacccagggtatctgctgtgtggtgaccccttcc
aaccatgcagaccacactcctcaaacgcaggacccatccagcttctgctgtcttcactgg
cacaccatcttgctgccagccagcgtgaatgctgtacctcttgttggcaagggtcaactg
gtggcagggaacaaaaccccgctgagactggctgaaacacaagtggggcctgtgctcagg
gtgcaggaagtggagcatccaaggggagagggctttgactatgatgtgtataggctcgga
gaggcggtggcagccggcactgcacccagatggactgagagcgcggcggcgggaaggcgg
cggcggcggcggcagcagcagcagctgcaagagccctcgcagcctcggggcaggacccac
gtccacacccgagagcccctccctgggagcagacctgatgaggtgagtgcagatgcatgc
ctggacaaccgcacaggtagcctctcctgggcagccacagccattaatctcaaggtcact
ggtgtccacggccatgagcttcagccggaagcctggctgcagagggacacctactccttc
gatatcttctgtacttcctctgtcctcttgggaaagggtttcgcataa

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_6|288_aa
MSQDSEMLPGHLFCLAKAGKEEQEEKDVNTSSKTITPQNIPGPGAHITATHRGRCTVATK
FSGGLCKEPNRLTWQPELQKERTEAVSQQPRPFISRGGAASNRRGATWRRVPGACTDACT
ARAKMPLRAKGHAGNGDGQGPDGDSRKSGSPLRPSHLLTPLPWLVHLCLATCRHLATMQE
EGNRQGPGELGSLSSAATGLQASGHKVFAQAFEHNSTVQLPTGITNPHPVGQHIMMAVPV
EYSQDPLTLTTCMLFLGPATPSSCLWTLTQSPCKSLCSHSGPHSLCTI

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_6|867_bp
atgtcccaggacagtgagatgcttcctggtcacttgttctgcttggccaaagcaggcaaa
gaggagcaggaggagaaagacgtaaacacatcatcaaaaacaataacccctcagaacatt
ccaggacctggtgctcacatcacagccactcacagaggccgatgcaccgttgccaccaag
ttttctgggggtctgtgcaaagaacctaaccgcctgacctggcagccggagctgcagaag
gagcgcacggaggccgtgtcccagcagccgcgtcccttcatttcccgcggcggggcagcc
tccaatcggcggggcgctacatggcgccgggtcccgggggcttgcacggacgcgtgcaca
gctcgggcaaagatgcccttgagagctaaggggcacgcgggaaatggggatggccagggg
cctgacggtgattccaggaaatcggggtcacctctgagaccctctcatcttctgaccccg
ctgccgtggctggtgcacctgtgtctggccacctgcagacacctggccaccatgcaggaa
gaaggcaacaggcagggccccggggagctgggctcccttagttctgctgccacaggcctg
caggcatctggtcacaaggtatttgcccaggcctttgaacacaactccactgtccagctg
cccacaggcataaccaatccacatccagttggtcagcacatcatgatggctgtacccgta
gaatacagccaggatcccttgactctcaccacctgcatgttgtttctggggcctgctacg
ccctcgtcttgcctgtggactcttacacaatctccctgcaagtctctctgctcccactct
ggtccccacagtctctgcacaatataa

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_7|957_aa
MPGRRPRPLSAELGGSTQAMRDKCGARDGAGTCYYGTLKLAGPESLRWLALDGERDVADT
MKNFEMRILLDYPGGPSTITGSERVEDASLLALKMEEGAMSEGGGKNSHCLLTSTVRPRA
TAYGASGPDPGPARVASRRLGAAVGLLSADTLWPLNGLARAIDEEIEALRDDSGDDDEAT
TPADKSELHHTLKNLSLKLDDLSTCNDLIAKHGAALQRSLTELDGLKIPSESGEKLKVVN
ERATLFRITSNAMINGAAAILTSYQEWLQPQGFEGTRCSEVVSGLGIWNSNAVFQPPTPL
PVLYLLEHPFLTLQHSQDGAWRRPAAAEETLGVRDGHQVRTGVPIFLLWGGTACDVHGAQ
WVLQACRDFLELAEIHSRKWQRALQYEQEQRVHLEETIEQLAKQHNSLERAFHSAPGRPA
NPSKSFIEGSLLTPKGEDSEEDEDTEYFDAMEDSTSFITVITEAKEDSRKAEGSTGTSSV
DWSSADNVLDGASLVPKGSSKVKRRVRIPNKPNYSLNLWSIMKNCIGRELSRIPMPVNFN
EPLSMLQRLTEDLEYHHLLDKAVHCTSSVEQMCLVAAFSVSSYSTTVHRIAKPFNPMLGE
TFELDRLDDMGLRSLCEQVSHHPPSAAHYVFSKHGWSLWQEITISSKFRGKYISIMPLGA
IHLEFQASGNHYVWRKSTSTVHNIIVGKLWIDQSGDIEIVNHKTNDRCQLKFLPYSYFSK
EAARKVTGVVSDSQGKAHYVLSGSWDEQMECSKVMHSSPSSPSSDGKQKTVYQTLSAKLL
WKKYPLPENAENMYYFSELALTLNEHEEGVAPTDSRLRPDQRLMEKGRWDEANTEKQRLE
EKQRLSRRRRLEACGPGSSCSSEEEGVKAFDKIQPVSRGHSPRFILCDCCPLAPNSGRTG
NYRKLARLPKRMRRSRYGHVRRASKTTQASCRFRLERVAGLRKPSTVHNRWLVMNRS

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_7|2874_bp
atgcctggccgccgcccccggcccctctctgcagagctaggaggcagcacccaggccatg
agggacaaatgtggagcaagggatggagctggaacttgctattatgggacgttaaagctg
gcaggacctgagagccttagatggcttgccttagatggcgaaagggatgttgcagataca
atgaagaattttgagatgcggattctcctggattacccaggtgggcccagcacaatcact
ggatcagagagagtagaagatgcctctttgctggctttgaagatggaggaaggggctatg
agtgaaggaggagggaagaatagccactgtctactgacctctaccgtgaggcccagagca
acagcttacggagccagtggcccagaccctggcccagcccgtgtggcctccagacgtctg
ggggccgcagttgggcttctgagtgcagacacgctgtggcctctcaatggccttgccagg
gcaatagatgaggaaattgaggctctgagagatgactctggggacgacgacgaggctacc
accccagccgacaagagcgagctgcaccacaccctgaagaatctttccctgaagttagat
gacctcagcacgtgcaatgacctcatcgccaagcacggcgctgcactccagcgctccctg
acagagctggacggcctcaagatcccatctgagagtggggagaagctgaaggtggtgaat
gagcgggccaccctcttccgcatcacatccaatgctatgatcaacggcgctgcagccatc
ctcacctcatatcaggaatggcttcagccccaggggtttgagggaacaaggtgctcagag
gtggtttctggattgggaatctggaacagcaacgctgtgtttcaaccccccacgccttta
ccagtgctgtaccttctagaacaccccttcctcacccttcagcacagccaggatggggcc
tggagaaggccggcagcagcagaggagaccctgggagtcagggatggacaccaggtgagg
actggtgtgcccatcttcctcctttggggtgggacagcctgtgatgtgcatggagctcag
tgggtgctccaggcctgcagggacttcttggaactagcagagatacacagtcggaaatgg
cagcgggcactgcagtatgagcaggagcagcgcgtgcacttggaggaaaccattgagcag
ctggcgaagcagcacaacagcctcgagcgggccttccacagtgcccctggccggccggcc
aacccctccaagagcttcattgagggaagcctcttgactcccaaaggagaggacagtgag
gaagatgaagataccgagtactttgatgccatggaagactccacatccttcatcaccgtg
atcaccgaggccaaggaagacagcagaaaagctgaaggtagcaccgggacaagttccgtg
gactggagctcagcagacaatgtactagatggtgcctcgctcgtgcccaagggttcatcc
aaagtcaagaggcgagtccgcattcccaacaagcccaactacagccttaacctctggagc
atcatgaagaactgcatcggccgggagctctccaggatccccatgccggtgaacttcaat
gagcccctgtccatgctccagcggctgacagaggacctggagtaccaccacctgctggac
aaggcagtgcactgcaccagctcagtggagcagatgtgcctggtggccgccttctctgtg
tcctcctactccaccacagtgcaccgcatcgccaagcccttcaaccccatgctgggggag
accttcgagctggaccgcctcgacgacatgggcctgcgctccctctgtgagcaggtgagc
caccaccccccctcagctgcgcactacgtgttctccaagcatggctggagcctctggcag
gagatcaccatctccagcaagttccggggaaaatacatctccatcatgccgctaggtgcc
atccacttagaattccaggccagtgggaatcactacgtgtggaggaagagcacctcaact
gttcacaacatcatcgtgggcaagctctggatcgaccagtcaggggacatcgagattgtg
aaccataagaccaatgaccggtgccagctgaagttcctgccctacagctacttctccaaa
gaggcagcccggaaggtgacaggagtggtgagtgacagccagggcaaggcccattacgtg
ctgtccggctcgtgggatgaacaaatggagtgctccaaggtcatgcatagcagtcccagc
agccccagctctgacgggaagcagaagacagtgtaccagaccctgtcagccaagctgctg
tggaagaagtacccgctgccggagaacgcggagaacatgtactacttctcagagctggcc
ctgaccctcaacgagcacgaggagggcgtagcgccaaccgacagccgcctgcggcccgac
cagcggctgatggagaagggccgttgggacgaggccaataccgagaagcagcggctggag
gagaagcagcgcctgtcgcggcgccggcggctggaggcctgcgggccgggcagcagctgc
agctcggaggaagaaggtgtaaaggcatttgacaaaattcagccagtttccagggggcac
agcccacgtttcattctgtgtgactgctgccccctggcgcccaactcaggcaggactggc
aactaccggaagctcgcgaggcttccaaagcgcatgcgcaggtcacggtacgggcacgtg
cggcgcgccagcaaaaccacgcaggcgtcttgtcgcttccgcctcgagcgtgtggcggga
ttgcggaagccctccacagttcataaccgttggctggttatgaaccgttcataa

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_8|1074_aa
MPGTTHEFLFGALAELVDNARDADATRIDIYAERREDLRGGFMLCFLDDGAGMDPSDAAS
VIQFGKSAKRTPESTQIGQYGNGLKSGSMRIGKDFILFTKKEDTMTCLFLSRTFHEEEGI
DEVGPIIVVCITSPHFLSLPRRFSHGGQGLKEGGLRGSVERNPQLPQKQVIVPLPTWNAR
TREPVTDNVEKFAIETELIYKYSPFRTEEEVMTQFMKIPGDSGTLVIIFNLKLMDNGEPE
LDIISNPRDIQMAETSPEGTKPERRSFRAYAAVLYIDPRMRIFIHGHKVQTKRLSCCLYK
PRMYKYTSSRFKTRAEQEVKKAEHVARIGNATLGAGVGGVCIWAEEKAREAESKARTLEV
RLGGDLTRDSRVMLRQVQNRAITLRREADVKKRIKEAKQRALKEPKELNFVFGVNIEHRD
LDGMFIYNCSRLIKMYEKVGPQLEGGMACGGVVGVVDVPYLVLEPTHNKQDFADAKEYRH
LLRAMGEHLAQYWKDIAIAQRGIIKFWDEFGYLSANWNQPPSSELRYKRRRAMEIPTTIQ
CDLCLKWRTLPFQLSSVEKDYPDTWVCSMNPDPEQDRCEASEQKQKVPLGTFRKDMKTQE
EKQKQLTEKIRQQQEKLEALQKTTPIRSQADLKKLPLEVTTRPSTEEPVRRPQRPRSPPL
PAVIRNAPSRPPSLPTPRPASQPRKAPVISSTPKLPALAAREEASTSRLLQPPEAPRKPA
NTLVKTASRPAPLVQQLSPSLLPNSKSPREVPSPKVIKTPVVKKTESPIKLSPATPSRKR
SVAVSDEEEVEEEAERRKERCKRGRFVVKEEKKDSNELSDSAGEEDSADLKRAQKDKGLH
VEVRVNREWYTGRVTAVEVGKHVVRWKVKFDYVPTDTTPRDRWVEKGSEDVRLMKPPSPE
HQSLDTQQEGGEEEVGPVAQQAIAVAEPSTSECLRIEPDTTALSTNHETIDLLVQILRNC
LRYFLPPSFPISKKQLSAMNSDELISFPLKEYFKQYEVGLQNLCNSYQSRADSRAKASEE
SLRTSERKLRETEEKLQKLRTNIVALLQKVQEDIDINTDDELDAYIEDLITKGD

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_8|3225_bp
atgcctggaaccactcacgaattcttgtttggtgcccttgctgaactggttgataatgca
agagatgctgatgccaccagaatagatatttatgcagaaagacgagaggaccttcgagga
ggatttatgctttgctttttggatgatggagcaggaatggatccaagtgatgctgccagt
gtgatccagtttgggaagtcggccaagcgaacacctgagtctactcagattgggcagtac
gggaatgggttaaaatcgggctcaatgcgcattgggaaggattttatcctgttcaccaag
aaggaagacaccatgacctgcctcttcctgtctcgcacgtttcatgaggaagaaggcatt
gatgaagtaggtcccatcatcgtggtctgcatcaccagtcctcatttcctgagtctgccc
aggcggttttcacatggaggccagggcttgaaagaaggggggctaagaggttctgtggaa
agaaatccgcaacttccacagaagcaggtgatagtcccactgcccacctggaatgctcgg
acccgggaacctgtcacagacaatgtagagaaatttgccattgagacagaactcatctat
aagtactctccattccgcactgaggaggaagtgatgacccagtttatgaagattcctggg
gacagcggaacattggtgatcatcttcaatctcaaactcatggataatggagagccagaa
ctagacataatctcaaatccaagagatatccagatggcagagacgtccccagagggcacg
aagccagagcggcgctcgttccgtgcctatgccgctgtgctctatattgatccccggatg
aggatcttcatccatgggcacaaggtgcagaccaagaggctctcctgctgcctgtacaag
cccaggatgtacaagtacacgtcaagccgtttcaagacccgtgcggagcaggaggtgaag
aaagcagagcacgtagcaaggattggtaatgccactctgggtgcaggagttgggggggtg
tgcatctgggctgaagagaaggcgcgggaggcagagagcaaagctcggacattagaagta
cgcctaggtggagacctcacgcgggactccagggtgatgttgcgacaggtccagaacaga
gccatcactctgcgcagagaagccgatgtcaagaagaggatcaaggaggccaagcagcga
gcacttaaagaacctaaggaactgaattttgtttttggtgtcaacattgaacaccgggat
ctggatggcatgttcatctacaactgtagccgactgatcaaaatgtatgagaaagtgggc
ccacagctggaagggggcatggcatgtggcggggttgttggggttgttgatgtgccctac
ctggtcctggagcctacacacaacaaacaggactttgctgatgccaaggagtaccggcac
ctgctccgagcaatgggggagcacctggcgcagtattggaaggatattgccatcgcccag
aggggaatcatcaagttctgggatgagtttggctacctctctgccaactggaaccagccc
ccatccagtgagctgcgttacaaacgccggagagctatggaaatccccaccaccatccag
tgcgatttgtgtctgaaatggagaaccctccccttccagctgagttctgtggaaaaagat
taccctgacacctgggtttgctccatgaaccctgatcctgaacaggaccggtgtgaggct
tctgaacaaaagcagaaggttcccctgggaacattcagaaaggacatgaagacgcaggaa
gagaagcagaaacaactgacagagaaaattcgccagcagcaggagaagctggaggccctt
cagaaaaccacacccatccgctcccaagcagacctgaagaaattgcccttggaagtgacc
accagaccttccactgaggaacctgtgcgtagacctcagcgtcctcggtcgcccccttta
cctgctgtgatcaggaacgcccccagcagacccccttctttgccaactcctagaccagcc
agccagccccgaaaggctcctgtcatcagcagtaccccaaagctccctgctttggcagcc
cgggaggaggccagcacatctaggctgctccagccacctgaggcaccccgaaagcctgcc
aacactctcgtcaagactgcatcccgacctgcccctctggtgcagcaactgtcaccatct
ttactgcccaactccaagagccctcgggaggttccttctcccaaagtcatcaagactcca
gtggtgaagaagacagagtcacccatcaaactctccccggctacccctagtcggaagcgg
agtgtcgcagtttctgatgaggaagaagttgaggaggaagctgagaggaggaaggagagg
tgcaagcggggcagatttgttgtgaaggaggaaaagaaggactcgaatgagctctcagac
agtgctggggaagaggactcggctgacctcaagagagctcagaaagataaagggctgcac
gtggaggtgcgtgtgaacagggagtggtacacgggccgtgtcacagccgtggaggtgggc
aagcatgtggtgcggtggaaggtgaagtttgactacgtgcccacagacacgacaccaaga
gaccgctgggtggagaaaggcagtgaggatgtgcggctgatgaaacccccttctccggaa
catcagagccttgatacacaacaggagggcggggaggaggaggtgggccctgtggcccag
caggccatagctgtcgcagagccctccacttccgaatgcctccgcattgagcctgacacc
actgccctgagcaccaatcacgagaccatcgacctgcttgtccagatcctccggaattgt
ttacggtacttcctgcctccaagtttccccatctccaagaagcagctgagtgctatgaat
tcagatgagctaatatcttttcctctgaaggagtacttcaagcaatatgaagtagggctc
caaaacctgtgcaattcctaccagagccgtgctgactcccgggccaaggcctccgaggaa
agcctgcgcacctccgagaggaagctccgcgagacggaggagaagctgcagaagctgagg
accaacatcgtggcactcctgcaaaaggtgcaggaggacatagacatcaacacagatgat
gagctggacgcctacattgaggacctcatcaccaagggggactga

>gi568815576f:30594910_31006336|GENSCAN_predicted_peptide_9|151_aa
MGVFVSSKVVGIISGKNPGTRPRRVAPGLQAGPGHPRLPGRSKSRGRCRRRLPRSGAEQL
PAAADHPDDSSSNRKRIGRRPSSAYEKKDAARAPTATGRAPRRPGLRPKSWMPEQEKEPP
TDRSIDRTPVPSPYQTREGGRARQGFFDLGA

>gi568815576f:30594910_31006336|GENSCAN_predicted_CDS_9|456_bp
atgggtgtcttcgtcagtagtaaggtggtaggaatcatctcgggcaaaaatccaggtacc
aggccccgaagggtggcgccaggcctccaagccggccccggccacccccgcctccccggc
cgctccaaatcccgcggccgctgccgccgccgcctcccccggagcggggcggagcagctg
cccgccgccgccgaccacccggacgactcctcgtcgaatcgcaaacgcataggccgccgc
ccgagttctgcgtacgagaagaaagacgcggcgcgagcgccaacggccaccgggcgcgcg
ccgcggcggccgggcctgcgccccaagagctggatgccagagcaggagaaagagccgcca
accgatcgctcgatcgaccgcacgcccgttccttcgccctaccagacccgggaggggggg
agggcgcgccagggcttcttcgatttaggggcctga