GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:42:34 Sequence gi568815596f:241602066_241825028 : 222963 bp : 53.60% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1014 905 110 2 2 85 94 218 0.948 22.53 1.02 Intr - 4408 4249 160 2 1 95 99 199 0.234 21.26 1.01 Init - 8248 7900 349 1 1 84 7 201 0.265 6.96 1.00 Prom - 16134 16095 40 -4.01 2.00 Prom + 16237 16276 40 -1.11 2.01 Init + 17528 17589 62 1 2 57 92 23 0.032 -0.68 2.02 Term + 28117 28309 193 1 1 56 41 135 0.287 2.82 2.03 PlyA + 28997 29002 6 1.05 3.07 PlyA - 29420 29415 6 -0.45 3.06 Term - 32014 30715 1300 0 1 99 41 1153 0.483 103.34 3.05 Intr - 32916 32860 57 2 0 110 84 20 0.829 2.29 3.04 Intr - 35363 34876 488 2 2 56 101 230 0.002 13.50 3.03 Intr - 35641 35384 258 1 0 91 -22 231 0.001 10.59 3.02 Intr - 35984 35862 123 2 0 88 9 144 0.002 7.79 3.01 Init - 37027 37016 12 1 0 83 23 23 0.001 -5.35 3.00 Prom - 41001 40962 40 -2.21 4.00 Prom + 46123 46162 40 -1.81 4.01 Init + 48448 48514 67 0 1 60 22 68 0.321 -1.41 4.02 Intr + 48945 49046 102 1 0 72 74 85 0.874 6.15 4.03 Intr + 49199 49270 72 0 0 40 111 30 0.550 0.37 4.04 Intr + 51447 51545 99 1 0 83 105 68 0.968 8.58 4.05 Intr + 52481 52582 102 0 0 73 47 131 0.996 8.15 4.06 Intr + 53206 53278 73 2 1 103 47 94 0.056 5.76 4.07 Intr + 56974 57122 149 1 2 67 89 122 0.029 10.59 4.08 Intr + 64580 64773 194 0 2 86 98 195 0.045 20.03 4.09 Intr + 66078 66156 79 2 1 96 97 213 0.989 22.62 4.10 Intr + 66475 66620 146 2 2 108 76 314 0.982 32.71 4.11 Intr + 68661 68717 57 2 0 76 76 53 0.822 2.47 4.12 Intr + 69247 69376 130 0 1 73 80 89 0.670 7.37 4.13 Intr + 69559 69761 203 2 2 89 81 43 0.489 3.23 4.14 Intr + 69968 70079 112 1 1 113 37 0 0.421 -2.14 4.15 Term + 70126 70199 74 2 2 103 48 123 0.831 8.16 4.16 PlyA + 73376 73381 6 1.05 5.07 PlyA - 73698 73693 6 -0.45 5.06 Term - 74172 74062 111 0 0 27 46 251 0.409 13.66 5.05 Intr - 76584 76387 198 0 0 88 100 255 0.905 26.77 5.04 Intr - 78254 78164 91 1 1 76 105 137 0.960 14.70 5.03 Intr - 82513 82437 77 1 2 83 100 -17 0.058 -2.19 5.02 Intr - 83812 83706 107 2 2 113 -17 125 0.062 4.93 5.01 Init - 84718 84589 130 1 1 95 63 209 0.970 18.59 5.00 Prom - 88614 88575 40 -3.41 6.00 Prom + 89073 89112 40 -4.11 6.01 Init + 100001 100037 37 1 1 95 72 27 0.589 2.18 6.02 Intr + 102588 102659 72 1 0 92 98 85 0.782 9.77 6.03 Intr + 107151 107317 167 1 2 81 77 443 0.881 42.69 6.04 Intr + 109312 109415 104 0 2 78 -1 168 0.759 6.37 6.05 Intr + 117418 117547 130 1 1 57 63 13 0.266 -3.10 6.06 Intr + 118917 119095 179 2 2 91 47 139 0.265 9.33 6.07 Intr + 120874 121009 136 1 1 117 86 221 0.969 25.88 6.08 Intr + 121145 121206 62 1 2 96 101 55 0.688 5.62 6.09 Intr + 121423 121522 100 1 1 23 75 64 0.048 -0.79 6.10 Intr + 123109 123240 132 0 0 29 51 85 0.004 0.25 6.11 Intr + 124598 124718 121 1 1 67 55 51 0.010 0.27 6.12 Intr + 126544 126745 202 2 1 69 19 127 0.018 2.77 6.13 Intr + 129289 129438 150 1 0 87 77 91 0.482 7.69 6.14 Term + 129486 129786 301 0 1 23 53 103 0.054 -4.66 6.15 PlyA + 130071 130076 6 1.05 7.00 Prom + 130084 130123 40 -4.31 7.01 Init + 133160 133451 292 1 1 84 90 528 0.660 47.91 7.02 Intr + 138968 139025 58 0 1 98 94 79 0.841 7.93 7.03 Intr + 140370 140509 140 0 2 99 107 201 0.750 23.72 7.04 Intr + 141557 141750 194 0 2 51 77 259 0.690 20.83 7.05 Intr + 142644 142812 169 2 1 79 60 239 0.910 20.23 7.06 Intr + 142840 143014 175 2 1 -19 42 150 0.195 -0.79 7.07 Intr + 146644 146761 118 1 1 94 5 74 0.329 0.67 7.08 Intr + 146878 147258 381 0 0 112 39 176 0.293 10.77 7.09 Intr + 147949 148033 85 0 1 59 43 40 0.457 -3.51 7.10 Intr + 148086 148229 144 1 0 75 79 141 0.985 12.66 7.11 Intr + 149181 149323 143 1 2 122 44 297 0.937 29.28 7.12 Intr + 153784 153949 166 0 1 105 85 300 0.358 31.45 7.13 Term + 165645 165904 260 1 2 139 49 536 0.999 50.85 7.14 PlyA + 170047 170052 6 1.05 8.06 PlyA - 170165 170160 6 1.05 8.05 Term - 174552 174323 230 1 2 87 52 77 0.697 1.12 8.04 Intr - 175403 175067 337 2 1 49 68 141 0.722 3.85 8.03 Intr - 176177 176154 24 2 0 110 83 13 0.694 1.70 8.02 Intr - 176729 176580 150 2 0 83 80 36 0.798 3.17 8.01 Init - 180417 180412 6 0 0 82 89 4 0.390 0.57 8.00 Prom - 183080 183041 40 -4.11 9.00 Prom + 191012 191051 40 -2.21 9.01 Init + 196980 197089 110 2 2 76 103 230 0.999 20.95 9.02 Intr + 199716 199971 256 0 1 75 86 495 0.950 45.98 9.03 Intr + 201280 202097 818 0 2 86 -16 1823 0.035 162.59 9.04 Intr + 206049 206183 135 0 0 126 -34 125 0.014 4.29 9.05 Intr + 208053 208125 73 0 1 83 68 44 0.034 1.80 9.06 Intr + 210451 210645 195 0 0 17 36 139 0.018 1.73 9.07 Intr + 212414 212620 207 1 0 60 100 278 0.384 26.00 9.08 Intr + 212827 213082 256 0 1 42 61 410 0.945 31.25 9.09 Term + 213986 214983 998 0 2 133 48 970 0.975 90.25 9.10 PlyA + 215323 215328 6 1.05 10.05 PlyA - 215725 215720 6 1.05 10.04 Term - 218311 217061 1251 1 0 63 44 358 0.930 21.05 10.03 Intr - 218849 218747 103 1 1 72 -19 97 0.788 -1.92 10.02 Intr - 219709 219530 180 0 0 5 81 118 0.764 2.30 10.01 Intr - 220118 219915 204 1 0 85 74 97 0.471 6.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 34952 34876 77 2 2 94 101 119 0.989 14.31 S.002 Init + 35303 35659 357 1 0 92 94 340 0.867 30.12 S.003 Term + 53206 53303 98 2 2 103 53 126 0.937 9.03 S.004 Intr - 57101 56869 233 1 2 59 61 183 0.831 10.65 S.005 Init + 64564 64773 210 0 0 62 98 218 0.909 17.05 S.006 Term + 149899 150045 147 0 0 34 53 131 0.845 2.41 S.007 Term - 151538 151162 377 0 2 62 48 320 0.823 20.66 S.008 Init + 162648 162699 52 2 1 100 62 35 0.968 2.14 S.009 Term - 194614 194501 114 1 0 97 47 68 0.932 2.47 S.010 Intr - 194952 194814 139 2 1 21 47 125 0.939 2.67 S.011 Init - 195587 195475 113 2 2 82 28 128 0.939 3.84 S.012 Term + 201280 202101 822 0 0 86 44 1822 0.855 171.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_1|207_aa MRRLRRGCASGAGPAEAGGGRPGAEAGLAARAPHPSPGLSCLVGSVKAMPGAAALVPLFH WDPGGRRQAPWPLPRGPRDPGAGGGRAGDSGVRGGDRGPGAAAQNLDSGSLLPSAREPPK MNPVVEPLSWMLGTWLSDPPGAGTYPTLQPFQYLEEVHISHVGQPMLNFSFNSFHPDTRK PMHRECGFIRLKPDTNKVAFVSAQNTX >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_1|621_bp atgcggcgcctgaggcggggctgcgcttccggggcggggccggcggaggctggagggggc cgcccgggagcagaggcaggacttgcggcgcgtgcccctcaccctagcccggggctgtcc tgcctcgtggggagtgtgaaggccatgcccggcgccgcggccctggtgcccctgttccac tgggaccctggggggcgccgccaggccccttggcctcttcctcggggccctcgggaccca ggggccggtggtggccgtgccggggactcgggcgtgaggggtggagatcgcggccccggt gcagcagctcagaacctggactctggatcgctgctccccagcgcccgggagccccccaag atgaacccagtggtggagccactgtcctggatgctgggcacctggctgtcggacccacct ggagccgggacctaccccacactgcagcccttccagtacctggaggaggttcacatctcc cacgtgggccagcccatgctgaacttctcgttcaactccttccacccggacacgcgcaag ccgatgcacagagagtgtggcttcattcgcctcaagcccgacaccaacaaggtggccttt gtcagcgcccagaacacagnn >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_2|84_aa MVHLHKALTSTGVCSMGSCSGVKGGIEAGITNDSEMNSNKSRERQIHGTQPKEQQSHSLH CRVKEHARNQNYKKKPNKAKRCKD >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_2|255_bp atggttcacctgcataaggcacttaccagcactggagtgtgcagcatgggcagttgctct ggggttaaagggggcatcgaggctggaattacaaacgactcagaaatgaacagcaacaaa agcagagagaggcagatccatggcacgcagcccaaggaacagcagagccacagcctccat tgcagggttaaggaacacgccaggaaccagaactacaagaaaaaacccaacaaagcaaaa agatgtaaggattaa >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_3|745_aa MVAMAPPPSTADLAQGPVQPQPPERSGRDRKSRSSSATDGGFRGASGRRLTRAASLRRAS NATRRRDAAGANRCAGAGGEDDTARERKYAGAAILAYGRADASPGAPCGRHGVVSWDDGW FCCPTGHCVPWRTQEQLSRRQRRALSPSEQPSCGTDVLGAPMSRYGLAPPGRLPRPSARL PTDAARGLRRPAGPKTPEPAAGEDAGGGWGGGGPAAGACASILPYGADAAGRPTSRAVSP PRSLPRGGPRAGGRLGPGPGCAAGPRPAMVICCAAVNCSNRQGKGEKRAVSFHRLWEPAF SCLYEFDYSEYLMFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDSFSKRLEDQH RLLKPTAVPSIFHLTEKKRGAGGHGRTRRKDASKATGGVRGHSSAATSRGAAGWSPSSSG NPMAKPESRRLKQAALQGEATPRAAQEAASQEQAQQALERTPGDGLATMVAGSQGKAEAS ATDAGDESATSSIEGGVTDKSGISMDDFTPPGSGACKFIGSLHSYSFSSKHTRERPSVPR EPIDRKRLKKDVEPSCSGSSLGPDKGLAQSPPSSSLTATPQKPSQSPSAPPADVTPKPAT EAVQSEHSDASPMSINEVILSASGACKLIDSLHSYCFSSRQNKSQVCCLREQVEKKNGEL KSLRQRVSRSDSQVRKLQEKLDELRRVSVPYPSSLLSPSRGQYRSLAYPCSLPVRRGFSA RWPASPEHFWEVSQAPCALRRKLWS >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_3|2238_bp atggtggccatggctccgccgccctccacggccgacctggcgcaagggcctgtgcagcca caacctccggagcgctccggacgggaccggaaaagcagaagctccagcgcaaccgacggc gggttccgcggcgcatccggccgccgactgacccgagcggcgtcgctccgacgggccagc aacgcgacgcggcgacgggacgccgcgggggccaataggtgtgccggcgcaggcggcgaa gacgatacggcgcgcgagcggaaatacgcgggagccgccattttggcgtacggccgggcc gacgcgagccccggggcgccctgcgggcggcatggggtcgtgtcgtgggacgacggctgg ttctgctgtccgacgggacactgtgtgccatggaggacccaggagcagttgtcgaggcgg cagaggcgagctctgtccccgtccgagcagccatcttgcggtacggacgtgctcggcgcg cccatgtcgcggtacggacttgccccgccggggcgactcccccgcccgtctgcccgcctg ccgacggacgctgcgaggggcctgcggcgccccgctggtccgaagacgcccgaacccgcg gctggcgaggacgcgggcgggggctggggcgggggcgggcccgctgcgggcgcctgcgcc tccatcttgccgtacggcgcggacgccgcgggccggcccacgtcccgagccgtgtcgccg ccgcgctccctccctcggggcggtccgcgggcgggcgggcggctagggccggggcctggc tgcgcggctgggccaaggcccgcgatggtgatctgctgtgcggccgtgaactgctccaac cggcagggaaagggcgagaagcgcgccgtctccttccacaggctctgggaacccgcattc tcctgtctctatgaatttgattactctgagtacctcatgttccccctaaaggactcaaaa cgtctaatccaatggttaaaagctgttcagagggataactggactcccactaagtattca tttctctgtagtgagcatttcaccaaagacagcttctccaagaggctggaggaccagcat cgcctgctgaagcccacggccgtgccatccatcttccacctgaccgagaagaagaggggg gctggaggccatggccgcacccggagaaaagatgccagcaaggccacagggggtgtgagg ggacactcgagtgccgccaccagcagaggagctgcaggttggtcaccgtcctcgagtgga aacccgatggccaagccagagtcccgcaggttgaagcaagctgctctgcaaggtgaagcc acacccagggcggcccaggaggccgccagccaggagcaggcccagcaagctctggaacgg actccaggagatggactggccaccatggtggcaggcagtcagggaaaagcagaagcgtct gccacagatgctggcgatgagagcgccacttcctccatcgaagggggcgtgacagataag agtggcatttctatggatgactttacgcccccaggatctggggcgtgcaaatttatcggc tcacttcattcgtacagtttctcctctaagcacacccgagaaaggccatctgtcccccga gagcccattgaccgcaagaggctgaagaaagatgtggaaccaagctgcagtgggagcagc ctgggacccgacaagggcctggcccagagccctcccagctcatcacttaccgcgacaccg cagaagccttcccagagcccctctgcccctcctgccgacgtcaccccaaagccagccacg gaagccgtgcagagcgagcacagcgacgccagccccatgtccatcaacgaggtcatcctg tcggcgtcaggggcctgcaagctcatcgactcactgcactcctactgcttctcctcccgg cagaacaagagccaggtgtgctgcctgcgggagcaggtggagaagaagaacggcgagctg aagagcctgcggcagagggtcagccgctccgacagccaggtgcggaagctacaggagaag ctggatgagctgaggagagtgagcgtcccctatccaagtagcctgctgtcgcccagccgc ggtcagtaccggagcctcgcgtacccatgttcccttcccgttaggaggggtttctctgca aggtggccagcgtcccctgagcatttctgggaggtgtcacaagctccctgcgctctcagg aggaagctctggtcgtaa >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_4|552_aa MAMENDSLPCFFLASLAYKDEGATLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDE ILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQR KRQPDSYFSVLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLNYSCKDEPS LNCTKAYGHSPTLCRKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCRTSVPCAGATAFPAD SDRHCNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGG KPNSAHYFIGYVGEELIYLDPHTTQPAVEPTDGCFIPDESFHCQHPPCRMSIAELDPSIA VGFFCKTEDDFNDWCQQVKKLSLLGGALPMFELVELQPSHLACPDVLNLSLGESCQVQVG SLGALPARALWSWACYHGVLRGLEQTEHAGSVVTQSWVGDWFTWAPLAMGGVDPSDHGQR AAGAGLGSCSEVCTTPGPVHTRMGTAVCGLQSRHCSVCPTPRALDSHPDSSDVERLERFF DSEDEDFEILSL >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_4|1659_bp atggccatggaaaacgactctttgccctgtttcttcttggcttcccttgcgtacaaggat gaaggagctactctgacctacgacactctccggtttgctgagtttgaagattttcctgag acctcagagcccgtttggatactgggtagaaaatacagcattttcacagaaaaggacgag atcttgtctgatgtggcatctagactttggtttacatacaggaaaaactttccagccatt ggggggacaggccccacctcggacacaggctggggctgcatgctgcggtgtggacagatg atctttgcccaagccctggtgtgccggcacctaggccgagattggaggtggacacaaagg aagaggcagccagacagctacttcagcgtcctcaacgcattcatcgacaggaaggacagt tactactccattcaccagatagcgcaaatgggagttggcgaaggcaagtccataggccag tggtacgggcccaacactgtcgcccaggtcctgaactatagctgcaaagatgaaccgtct ttgaattgtacaaaagcttatggtcattctcctactctctgtaggaagcttgctgtcttc gatacgtggagctccttggcggtccacattgcaatggacaacactgttgtgatggaggaa atcagaaggttgtgcaggaccagcgttccctgtgcaggcgccactgcgtttcctgcagat tccgaccggcactgcaacggattccctgccggagctgaggtcaccaacaggccgtcgcca tggagacccctggtacttctcattcccctgcgcctggggctcacggacatcaacgaggcc tacgtggagacgctgaagcactgcttcatgatgccccagtccctgggcgtcatcggaggg aagcccaacagcgcccactacttcatcggctacgttggtgaggagctcatctacctggac ccccacaccacgcagccagccgtggagcccactgatggctgcttcatcccggacgagagc ttccactgccagcacccgccgtgccgcatgagcatcgcggagcttgacccgtccatcgct gtggggtttttctgtaagactgaagatgacttcaatgattggtgccagcaagtcaaaaag ctgtctctgcttggaggtgccctgcccatgtttgagctggtggagctgcagccttcacat ctggcctgccccgacgtcctgaacctgtccctaggtgagagctgccaagtccaggtgggg tccctcggagcgctgcctgcccgggcgctgtggagctgggcgtgctaccatggagtcctc aggggtctggagcagacagaacatgcaggctctgtggtgacgcagtcctgggtgggggac tggttcacttgggcaccactggccatgggtggcgtagacccctcggaccatggccagcgt gccgcaggagccggcctgggctcgtgcagtgaagtctgcacaacccccggacctgttcac acccgcatggggacagctgtctgtgggctgcagagcaggcactgctcagtctgccccacg ccaagggcccttgactcacacccagattcttctgatgtagagcgactggaaagattcttc gactcagaagatgaagactttgaaatcctgtccctttga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_5|237_aa MAARRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKK SDVEDHSVHLLFSANRWEQISALTDPLTLPFSFSLIVVALSLEPRPLIKEKLSQGVTLVV DRYAFSGVAFTGAKENFSLDWCKQPDVGLPKPDLVLFLQLQLADAAKRGAFGHERYENGA FQERALRCFHQLMKDTTLNWKMVDASKSIEAVHEDIRVLSEDAIRTATEKPLGELWK >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_5|714_bp atggcggcccggcgcggggctctcatagtgctggagggcgtggaccgcgccgggaagagc acgcagagccgcaagctggtggaagcgctgtgcgccgcgggccaccgcgccgaactgctc cggttcccggaaagatcaactgaaatcggcaaacttctgagttcctacttgcaaaagaaa agtgacgtggaggatcactcggtgcacctgcttttttctgcaaatcgctgggaacaaatt tctgcacttacagatcccctaactctgcccttctcattctctttaattgtcgtcgcattg tctctagaaccacggccgttaattaaggaaaagttgagccagggcgtgaccctcgtcgtg gacagatacgcattttctggtgtggccttcaccggtgccaaggagaatttttccctagat tggtgtaaacagccagacgtgggccttcccaaacccgacctggtcctgttcctccagtta cagctggcggatgctgccaagcggggagcgtttggccatgagcgctatgagaacggggct ttccaggagcgggcgctccggtgtttccaccagctcatgaaagacacgactttgaactgg aagatggtggatgcttccaaaagcatcgaagctgtccatgaggacatccgcgtgctctct gaggacgccatccgcactgccacagagaagccgctgggggagctatggaagtga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_6|630_aa MATAMYLEHYLDSIENLPCELQRNFQLMRELDQRTEDKKAEIDILAAEYISTVKTLSPDQ RVERLQKIQNAYSKCKEYSDDKVQLAMQTYEMVDKHIRRLDADLARFEADLKDKMEGSDF ESSGGRGGNSVLSLPAPSCSEPEGPVPTVPFLRESGTPPGLFLTQCCFSRPSNLASALEP GDAETQVKWQLKMSGRALWTELDSAQKRRASAFRGRLAAEGAALLSAGERSEFTDTILSV HPSDVLDMPVDPNEPTYCLCHQVSYGEMIGCDNPDCPIEWFHFACVDLTTKPKGKCWQRD AVTQEWWLPGYVIFTLMAPPEGQPSGLWENKNHQSLFAQKGDLAGTRRRDLSVAFTGLPP SISRDPGGRGRARCLGLKGLPPSGAFLRLRRESAREGRLQCFTPFATAVKLEMGAAAGDM DAKRRPAWPPEQQLTARLMLDGTPGRVDMRPLLARDAVRRPTWLPEQQLTARLTVGQGFV ASSISKYGHIGGYSFCIRILGDTNITGADGSQDVALRSALTEPGKSLVITRLESSLEEEA EMLGEGPTTPKPAPAISLEEEAEMLGEGLTMRKPAPAISLEEEAEMLGEGPTTLKPAHAV FSSKHSLRVHVGFARVTMPWEKEKSVVIIF >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_6|1893_bp atggcgaccgccatgtacttggagcactatctggacagtatcgagaaccttccctgcgaa cttcagaggaacttccagctgatgcgagagctggaccagaggacggaagataagaaagca gagattgacatcctggctgcagagtacatctccacggtgaagacgctgtctccagaccag cgcgtggagcgcctgcagaagatccagaacgcctacagcaagtgcaaggaatacagtgac gacaaagtgcagctggccatgcagacctacgagatggtggataaacacattcgaaggctt gatgcagacctggcgcgctttgaagcagatctgaaggacaagatggagggcagtgatttt gaaagctccggagggcgaggtggcaacagcgttctgagtcttcctgctccttcctgctcg gaacctgaaggccctgtcccgaccgtgccttttctgcgtgaaagtgggactcctcctggt ctcttcctcacccagtgctgtttctcaaggccatcaaatctggccagcgccctcgaacct ggtgatgctgagacacaggtcaagtggcagctcaagatgtcaggtcgggctttgtggaca gaattggactcggcgcagaagaggagggcgtcagcgtttcggggacggttggcagcagag ggtgctgccctgctctccgcgggtgagaggtctgagttcactgacaccatcctgtccgtg cacccctctgatgtgctggacatgcccgtggacccaaacgaacccacgtactgcctgtgc caccaggtctcctatggggagatgattggctgtgacaatccagactgtccaattgagtgg tttcactttgcctgcgtggaccttaccacgaaacccaaaggaaaatgctggcagagggat gccgtcacccaggaatggtggttaccaggttacgtgatattcacgttaatggcccctcct gaaggacaaccttccggactctgggagaacaagaaccaccaaagcctgttcgcacagaag ggcgaccttgcagggactcgccgccgcgacctcagtgtggcttttacaggactccccccg agcatcagcagggaccccggcggacgtgggcgggcgcgctgtctgggcctgaaggggctc cctccaagcggtgccttcctgcggcttagaagggaaagcgccagagagggcaggctacag tgcttcacacccttcgccacggctgtgaaactggagatgggtgctgctgctggggacatg gatgccaagcggaggcctgcgtggcctcctgagcagcagctgaccgcccgtctcatgctg gacgggacccctgggcgggtggacatgagaccactgttggccagggacgccgtgcggagg cctacgtggctgcctgagcagcagctgaccgcccgtctcactgttggccagggctttgtg gcaagttccatctccaaatacggccatattgggggttacagcttctgcatacgaattctg ggggacacaaacataacaggagctgatgggtctcaagatgttgccttgaggagtgcatta acagaacccgggaagagcttggtcatcacgagactggagtcctcgctggaagaggaggct gagatgctgggggaaggccccacaacgccgaaaccagcacctgccatctcgctggaagag gaggctgagatgctgggggaaggcctcacaatgcggaaaccagcacctgccatctcgctg gaagaggaggctgagatgctgggtgagggccccacaacgctgaaaccagcacatgccgta ttttcctccaagcattctctgagggtacatgtgggctttgctagggtgactatgccctgg gaaaaagaaaaatcagttgttattattttctga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_7|774_aa MLPRRPLAWPAWLLRGAPGAAGSWGRPVGPLARRGCCSAPGTPEVPLTRERYPVRRLPFS TVSKQDLAAFERIVPGGVVTDPEALQAPNVDWLRTLRGCSKVLLRPRTSEEVSHILRHCH ERNLAVNPQGGNTGMVGGSVPVFDEIILSTARMNRVLSFHSVSGILVCQAGCVLEELSRY VEERDFIMPLDLGAKGSCHIGGNVATNAGGLRFLRYGSLHGTVLGLEVVLADGTVLDCLT SLRKDNTGYDLKQLFIGSEGTLGIITTVSILCPPKPRAVNVAFLDGWLGSSVCSDGATVG VRKGMEGPPAKDSRFPCLLTSLLPKESGIQDLAFLVVMGKGVDPWWGSSPPDPGTAGAAL TASADLLSGRQCHLDRWFLFIQVSDRSHSPLRATPTSKPGGTGPVRMVLVSLVLVGSVQL SRAPVSPGSCSPAAGATSDEALLSHFLGIPAHSEQDAQSPPPDPPTPPPPPTPQPSTRLQ PHPDMVLGSQALRCSCVTQCRCLPPASAPCQSSCSSWLPSSPTHMSRADVAALKGDLGCP GFAEVLQTFSTCKGMLGEILSAFEFMDAVCMQLVGRHLHLASPVQESPFYVLIETSGSNA GHDAEKLGHFLEHALGSGLVTDGTMATDQRKVKMLWALRERITEALSRDGYVYKYDLSLP VERLYDIVTDLRARLGPHAKHVVGYGHLGDGNLHLNVTAEAFSPSLLAALEPHVYEWTAG QQGSVSAEHGVGFRKRDVLGYSKPPGALQLMQQLKALLDPKGILNPYKTLPSQA >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_7|2325_bp atgctgccccgtcggcctctggcgtggcccgcgtggctgttgcggggtgctccgggagcc gcgggttcttggggtcggccggttggccccctggcccgcagaggctgctgctccgccccg gggacccccgaggtgccgctgacccgggagcgctaccccgtgcggcgcttgccgttctcc acggtgtctaagcaggacctggccgcctttgagcgcatcgtgcccggcggggtcgtcacg gacccggaagcgctgcaggctcccaacgtggactggttgcggacgctgcgaggctgtagc aaggtgctgctgaggccacggacgtcggaggaggtgtcccacatcctcaggcactgccac gagaggaacctggccgtgaacccacaggggggcaacacaggcatggtgggtggcagcgtc cccgtctttgacgagatcatcctctccactgcccgcatgaaccgggtcctcagcttccac agcgtgtctggaattctggtttgccaggcgggctgcgtcctggaggagctgagccggtat gtggaggaacgggacttcatcatgccgctggacttaggagccaagggcagctgccacatc gggggaaacgtggcaaccaacgctggaggcctgcggtttcttcgatatggctcactgcat gggactgtcctgggcctggaagtggtgctggccgacggcactgtcctggactgcctgacc tccctgaggaaggacaacacgggctatgacctgaagcagctgttcatcgggtcggagggc actttggggatcatcaccacggtgtccatcttgtgtccacccaagcccagggctgtgaac gtggctttcctcgatgggtggttgggctcgagcgtctgctctgatggtgccactgttggt gtgaggaaggggatggagggaccccccgccaaggacagtcggttcccgtgtctcctgaca tctctgcttccaaaggaatcaggaatccaggacctggccttcctggtggtcatgggtaaa ggtgtagacccctggtggggctcctcaccacctgaccccggaacggctggggcagccttg actgcttctgccgacttactctctgggcggcagtgccatctggatcggtggttcctcttc atccaggtctcagacaggagtcactcgcctcttcgtgccacgcccacgtctaagccgggt ggaacaggccctgtgaggatggtcctggtgtccctcgtgctcgtgggctcggtccagctc tccagggccccagtcagccctggctcctgctctcctgcagctggggcgacttccgacgag gcactcttgtcccacttcctggggattcctgcccactctgagcaagatgcccagtcccca ccgccagaccctccaacaccaccaccacctcccacaccccagccctctacgcgtctgcag ccacaccccgacatggtgctcgggtcacaggccttgaggtgctcctgtgtcacccagtgc aggtgtctgcccccagcctctgcgccttgccagtcctcgtgctcctcgtggctgcccagc tcacccacccacatgagtcgggctgatgttgctgctttgaagggggacttgggctgccca ggctttgctgaggttctgcagaccttcagcacctgcaaggggatgctgggtgagatcctg tctgcattcgagttcatggatgctgtgtgcatgcagctggtcgggcgccatctccacctg gccagcccggtgcaagagagtccgttttacgtcctcatcgagacttcaggctccaacgca ggccatgacgctgagaagctgggccacttcctggagcacgcgctgggctccggcctggtg accgatgggaccatggccaccgaccagaggaaagtcaagatgctgtgggccctgagggaa aggatcacagaggcgctgagccgggatggctacgtgtacaagtacgacctctccctccct gtggagcggctctacgacatcgtgactgacctgcgcgcccgcctcggcccgcacgccaag cacgtggtgggctatggccaccttggagatggtaacctgcacctcaatgtgacggcggag gccttcagcccctcgctcctggctgccctggagccccacgtgtacgagtggacggccggg cagcagggcagcgtcagcgcggagcacggagtgggcttcaggaagagggacgtcctgggc tacagcaagccaccgggggccctgcagctcatgcagcagctcaaggccctgctggacccc aagggcatcctcaacccctacaagacgctgcccagccaggcctga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_8|248_aa MVTEPMFVLHMRTDVSCLPTMYKARLCPDHLGHTSSGPPEAGSWVHVLTLGKAKQADTSE KGDSTAEPSLPVTTQSPGSILKRTHDQRSSKRLALPPTRPSHGPEPQTEAGRSQAQQVSP SPAVTRTLERLHATVLGGSFLTGPHIPEPPNTQKQHQAWAGGAPYFRNEAKTVSIPPGGL EGAVPTRDGLGGLQQHSDGFWTQNSGPDADERAEGPVATARMSEESSAQFPASSPSSSSS SSSPSSSS >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_8|747_bp atggtgaccgaaccaatgttcgtcctgcatatgcggactgatgtctcatgtctccctaca atgtataaagccagactgtgccctgaccaccttgggcacacgtcatcaggacctcctgag gctgggtcatgggtgcacgtcctcacccttggcaaagccaagcaagctgacacaagcgag aagggagacagcacagcagagcccagcctccccgtcaccacccagagccccgggagcatc ctgaagcgcacgcatgaccaacgctccagcaagcgcctggcactcccaccaacacggccc agccatgggcctgagccgcagaccgaggctgggaggtcccaagcccagcaggtctcaccc tccccagccgtcacgcggacactcgagcgtctccacgccacagtgttggggggcagcttt ctcactgggccccacatcccagaacccccaaacacccagaagcagcaccaggcctgggca ggaggggccccctacttccgaaacgaagccaagacagtgtccatccccccggggggcttg gagggagctgtccccacaagggacgggcttggtgggctccaacagcacagcgacggcttc tggactcagaactcaggacctgacgcagacgagagggcagagggccctgtggccactgca aggatgagcgaggaaagctcagcccaattcccagcttcctccccctcctcgtcctcctct tcttcctccccctcctcctcctcctga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_9|1015_aa MACPFHRYFRVILLLLLALTLLLLAGFLHSDLELDTPLFGGQAEGPPVTNIMFLKTHKTA SSTVLNILYRFAETHNLSVALPAGSRVHLGYPWLFLARYVEGVGSQQRFNIMCNHLRFNL PQVQKVMPNDTFYFSILRNPVFQLESSFIYYKTYAPAFRGAPSLDAFLASPRTFYNDSRH LRNVYAKNNMWFDFGFDPNAQCEEGYVRARIAEVERRFRLVLIAEHLDESLVLLRRRLRW ALDDVVAFRLNSRSARSVARLSPETRERARSWCALDWRLYEHFNRTLWAQLRAELGPRRL RGEVERLRARRRELASLCLQDGGALKNHTQIRDPRLRPYQSGKADILGYNLRPGLDNQTL GVCQRLVMPELQYMARLYALQFPEKPLKNIPFLGAPGKAQERILPGKGKVAREEPTCQQQ ESQKTEKTTDEKAVHAENKTILGCEQEDVYDGQVAWTGSAPPGAGTGHTEDTEALWNPGP GHTEHGWSPKTEGPPEDTESLWNPGPGHMEDTEALWNPGPGRGKEAPSQQSMGVPRTPSR TVLFERERTGLTYRVPSLLPVPPGPTLLAFVEQRLSPDDSHAHRLVLRRGTLAGGSVRWG ALHVLGTAALAEHRSMNPCPVHDAGTGTVFLFFIAVLGHTPEAVQIATGRNAARLCCVAS RDAGLSWGSARDLTEEAIGGAVQDWATFAVGPGHGVQLPSGRLLVPAYTYRVDRRECFGK ICRTSPHSFAFYSDDHGRTWRCGGLVPNLRSGECQLAAVDGGQAGSFLYCNARSPLGSRV QALSTDEGTSFLPAERVASLPETAWGCQGSIVGFPAPAPNRPRDDSWSVGPGSPLQPPLL GPGVHEPPEEAAVDPRGGQVPGGPFSRLQPRGDGPRQPGPRPGVSGDVGSWTLALPMPFA APPQSPTWLLYSHPVGRRARLHMGIRLSQSPLDPRSWTEPWVIYEGPSGYSDLASIGPAP EGGLVFACLYESGARTSYDEISFCTFSLREVLENVPASPKPPNLGDKPRGCCWPS >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_9|3048_bp atggcctgccctttccacagatacttccgggtcatcctcctcctcctcctggccctgact ctgctcctgctggccggattcctgcactcggacttagagctggacacacccctgtttggg ggccaggctgaggggccgccggtcaccaacatcatgttcctgaagacgcacaagacggcc agcagcacggtgctcaacatcctctaccgcttcgccgagacccacaacctgtccgtggcg ctgcccgccggctcacgcgtccacctgggctacccctggctcttcctggcgcgctacgtg gaaggcgtggggtcgcagcagcgcttcaacatcatgtgcaaccacctgaggttcaacctg cctcaggtgcagaaagtcatgcccaacgacaccttctacttctccatcctgaggaacccc gtgttccagctggagtcctccttcatctactacaaaacctacgcccccgccttccggggc gccccgagcctggacgcgttcctggcctcgccgcggacgttctacaacgacagccgccac ctcaggaacgtctacgccaagaacaacatgtggttcgacttcggcttcgaccccaacgcg cagtgcgaggagggctacgtgcgcgcgcgcatcgccgaggtggagcggcgcttccggctg gtgctcatcgccgagcacctggacgagtccctggtgctgctgcggcgccggctgcgctgg gcgctggacgacgtggtggccttcaggctcaactcccgcagcgcgcgctccgtggcccgc ctgtcgcccgagacccgggagcgcgcgcggagctggtgcgcgctggactggcgcctgtac gagcatttcaaccgcaccctctgggcgcagctgcgcgccgagctggggccgcggcggctg cgcggggaggtggagcggctgcgcgcccggaggcgcgaactcgcgagcctgtgcctgcag gacggcggcgcgctcaagaaccacacgcagatcagagacccgcgcctgcgcccctaccag tccggcaaggccgacatcctgggttacaacctccggccgggcctggacaaccagacgctg ggcgtgtgccagaggcttgtgatgcctgagctccagtacatggcccgcctgtacgccctg cagttcccggagaagcccctcaagaacatcccgttcctgggggccccaggaaaggcacag gagaggatcctgccaggtaaagggaaagtggcccgggaggagcccacgtgccaacaacag gagtcccagaagacagaaaagaccacagatgagaaagcagttcacgcagagaacaaaacg attctcggctgtgagcaggaggatgtgtacgatggccaagttgcctggactgggtccgcc cctcctggggctggaactggtcacactgaggacacggaggctctgtggaacccagggcct ggccacactgagcacggctggtcccccaagacagagggacctcctgaggacacagagtct ctgtggaacccagggcctggccacatggaggacacggaggctctgtggaacccagggcct ggccgtggcaaggaagccccaagccaacagagcatgggggtccctcgtaccccttcacgg acagtgctcttcgagcgggagaggacgggcctgacctaccgcgtgccctcgctgctcccc gtgccccccgggcccaccctgctggcctttgtggagcagcggctcagccctgacgactcc cacgcccaccgcctggtgctgaggaggggcacgctggccgggggctccgtgcggtggggt gccctgcacgtgctggggacagcagccctggcggagcaccggtccatgaacccctgccct gtgcacgatgctggcacgggcaccgtcttcctcttcttcatcgcggtgctgggccacacg cctgaggccgtgcagatcgccacgggaaggaacgccgcgcgcctctgctgtgtggccagc cgtgacgccggcctctcgtggggcagcgcccgggacctcaccgaggaggccatcggtggt gccgtgcaggactgggccacattcgctgtgggtcccggccacggtgtgcagctgccctca ggccgcctgctggtacccgcctacacctaccgcgtggaccgccgagagtgttttggcaag atctgccggaccagccctcactccttcgccttctacagcgatgaccacggccgcacctgg cgctgtggaggcctcgtgcccaacctgcgctcaggcgagtgccagctggcagcggtggac ggtgggcaggccggcagcttcctctactgcaatgcccggagcccactgggcagccgtgtg caggcgctcagcactgacgagggcacctccttcctgcccgcagagcgcgtggcttccctg cccgagactgcctggggctgccagggcagcatcgtgggcttcccagcccccgcccccaac aggccacgggatgacagttggtcagtgggccccgggagtcccctccagcctccactcctc ggtcctggagtccacgaacccccagaggaggctgctgtagacccccgtggaggccaggtg cctggtgggcccttcagccgtctgcagcctcggggggatggccccaggcagcctggcccc aggcctggggtcagtggggatgtggggtcctggaccctggcactccccatgccctttgct gccccgccccagagccccacgtggctgctgtactcccacccagtggggcgcagggctcgg ctacacatgggtatccgcctgagccagtccccgctggacccgcgcagctggacagagccc tgggtgatctacgagggccccagcggctactccgacctggcgtccatcgggccggcccct gaggggggcctggtttttgcctgcctgtacgagagcggggccaggacctcctatgatgag atttccttttgtacattctccctgcgtgaggtcctggagaacgtgcccgccagccccaaa ccgcccaaccttggggacaagcctcgggggtgctgctggccctcctga >gi568815596f:241602066_241825028|GENSCAN_predicted_peptide_10|579_aa XTLVRPWTHPDDQDDFSPQDPQPAPSAQPFAMTSWCRPRNEDADGFRALVWPIRHQGCLT SSWRSLPAQNTGGWVILKEKKFNRLPVLQALQEAWCWHLLGFRGGPRKPTILVEEKGEQA RHMAKAAARPDPTPAQRVLQPQTARDIERLHHGDRSSWGPRAQALLANKVAKSELPGGHE AGDAAHPSIEENDEEWAMQSCSGLHGFQNPREACGSFASDLKAEEWTVMTLLATSQDTRM TGCTQHTTYTRATSQDTRMRGCTQHTTYTRATSQDTRMTGCTQHTTYTHTTRQDTRMTGC TQHTTYTQATSQDTRMRGCTQHTTYTHTTRQDTRMTGCTQHTTYTRAMSQDTCMRGCTQH TTYTHTTRQDTRMTGCTQHTTYTRAMSQDTCMRGCTQHTTYTHTTRQDTRMTGCTQHTTY TRATSQDTHDRMHPARNLHTGHEPGHTHDRMHPAHDLHTGHEPGHTHERMHPARDLHTGH EPGHTHERMHPEHNLHTHHEAGHTHDRMHPAHNLHTGHEPGHTHDRMHPAHDLHTGHEPG HMHDTMYPACDLHTGHKSGHTRDTRHPARDLHVHPRFPC >gi568815596f:241602066_241825028|GENSCAN_predicted_CDS_10|1740_bp nngaccctggtgagaccctggacccacccagatgaccaggatgatttctcacctcaagac cctcagccggccccctcggcgcagcccttcgccatgacgtcatggtgcaggccccggaat gaggatgcagacggcttccgggcccttgtttggcctatcagacatcaaggctgtttaaca tcttcctggaggtccttgccagcccagaacaccggaggctgggtgattcttaaggaaaag aagtttaatcggctcccggttctgcaggctctacaggaagcgtggtgctggcatctgctc ggcttccggggaggccccaggaaacccacaatcctggtggaagagaagggggagcaggca cgtcacatggcgaaagcagcagcaagacctgacccaaccccagctcagagggtgctccag ccgcagacagcccgagacatcgagaggctccaccacggcgaccggagctcctggggacct cgagcccaagccctacttgcaaataaggtagcaaagtctgagcttccagggggacatgaa gctggggacgccgctcaccccagtatagaagagaatgacgaagaatgggccatgcagagc tgcagtgggctccacggcttccagaaccccagagaggcctgtggctcctttgcttcagat ctgaaggcagaagagtggactgtgatgaccctgctagccacgagccaggacacacgcatg acaggatgcacccagcacacgacctacacacgggccacgagccaggacacacgcatgaga ggatgcacccagcacacgacctacacacgggccacgagccaggacacacgcatgacagga tgcacccagcacacaacctacacacacaccacgaggcaggacacacgcatgacaggatgc acccagcacacgacctacacacaggccacgagccaggacacacgcatgagaggatgcacc cagcacacaacctacacacacaccacgaggcaggacacacgcatgacaggatgcacccag cacacgacctacacacgggccatgagccaggacacatgcatgagaggatgcacccagcac acaacctacacacacaccacgaggcaggacacacgcatgacaggatgcacccagcacacg acctacacacgggccatgagccaggacacatgcatgagaggatgcacccagcacacaacc tacacacacaccacgaggcaggacacacgcatgacaggatgcacccagcacacgacctac acacgggccacgagccaggacacacacgacaggatgcacccagcacgcaacctacacacg ggccacgagccaggacacacgcatgacaggatgcacccagcacatgacctacacacgggc cacgagccaggacacacgcatgagaggatgcacccagcacgcgacctacacacgggccac gagccaggacacacgcatgagaggatgcacccagaacacaacctacacacacaccacgag gcaggacacacgcatgacaggatgcacccagcacacaacctacacacgggccacgagcca ggacacacgcatgacaggatgcacccagcacacgacctacacacgggccatgagccagga cacatgcatgacacaatgtacccagcatgtgacctacacacgggccacaagtcaggacac acacgtgacacaaggcacccagcacgtgatctacacgtgcaccccagattcccatgttga