GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:01:37

Sequence gi568815597f:109811024_110025186 : 214163 bp : 45.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -   1734   1729    6                               1.05
 1.07 Term -   2243   2062  182  0  2    5   34   182 0.422   2.27
 1.06 Intr -   6188   6079  110  1  2   14   94    79 0.423   1.03
 1.05 Intr -   9985   9895   91  2  1  105   21    78 0.098   1.85
 1.04 Intr -  10154  10049  106  2  1  102   44    74 0.349   4.19
 1.03 Intr -  12581  12468  114  2  0   63   77    43 0.199   1.34
 1.02 Intr -  16079  15988   92  0  2   23   73    97 0.257   1.41
 1.01 Init -  17588  17549   40  2  1   70  116    -6 0.266   0.91
 1.00 Prom -  20070  20031   40                              -4.66

 2.00 Prom +  24240  24279   40                              -4.06
 2.01 Init +  30922  32528 1607  0  2   37   11   485 0.458  27.57
 2.02 Intr +  40338  40452  115  0  1   40   58    65 0.010  -0.85
 2.03 Intr +  46087  46158   72  0  0   83  105    44 0.428   5.20
 2.04 Term +  46755  46808   54  2  0   42   37    96 0.398  -2.54
 2.05 PlyA +  47237  47242    6                               1.05

 3.00 Prom +  48404  48443   40                              -5.56
 3.01 Init +  49355  49758  404  1  2   60   39   242 0.234  12.60
 3.02 Intr +  67546  67576   31  1  1  118  107    47 0.356   7.73
 3.03 Intr +  70663  70820  158  0  2   72   39   131 0.267   5.41
 3.04 Intr +  84925  85001   77  1  2  108   24    26 0.000  -2.64
 3.05 Intr +  94684  94773   90  2  0   89   61    70 0.821   4.37
 3.06 Intr +  95940  96057  118  1  1   -6   82    91 0.527  -1.38
 3.07 Intr +  99220  99349  130  1  1   65   23   112 0.407   3.10
 3.08 Term +  99790 100308  519  0  0   46   52   200 0.606   6.50
 3.09 PlyA + 102879 102884    6                              -0.45

 4.00 Prom + 102921 102960   40                             -10.74
 4.01 Init + 103215 103358  144  2  0   69  105   224 0.996  20.32
 4.02 Intr + 104611 104673   63  0  0  100   97    20 0.833   3.01
 4.03 Intr + 106270 106440  171  0  0   84   97   256 0.983  26.24
 4.04 Intr + 110824 110971  148  0  1   63  121   139 0.687  14.51
 4.05 Intr + 112143 113167 1025  1  2  111  100   413 0.587  35.13
 4.06 Intr + 113753 113805   53  1  2  124   94    11 0.425   3.01
 4.07 Term + 115040 115130   91  2  1   55   47    69 0.075  -3.31
 4.08 PlyA + 115895 115900    6                               1.05

 5.00 Prom + 122978 123017   40                              -1.16
 5.01 Init + 150203 150257   55  1  1   79   58    90 0.362   4.55
 5.02 Intr + 173767 174149  383  2  2   80   86   411 0.805  34.73
 5.03 Intr + 198011 198122  112  1  1   70   96    70 0.856   5.95
 5.04 Intr + 200191 200334  144  2  0   66   98   224 0.918  21.45
 5.05 Intr + 201339 201439  101  1  2   99  101    47 0.981   6.93
 5.06 Intr + 201874 201976  103  0  1  103  119    70 0.999  11.35
 5.07 Intr + 203740 203834   95  2  2   80   45    73 0.999   1.88
 5.08 Intr + 204402 204508  107  2  2   46   92   135 0.999   8.71
 5.09 Intr + 205321 205437  117  1  0  120   69   168 0.999  17.68
 5.10 Intr + 205644 205707   64  0  1   80   84    55 0.963   3.02
 5.11 Intr + 206472 206560   89  2  2  147   99    99 0.999  15.67
 5.12 Intr + 206923 206993   71  1  2  115   99    74 0.972  10.03
 5.13 Intr + 207350 207444   95  0  2  113   79    72 0.999   8.48
 5.14 Intr + 207529 207627   99  0  0  119   71    91 0.997  10.81
 5.15 Intr + 208028 208096   69  1  0   88   91     4 0.503   0.08
 5.16 Intr + 208525 208603   79  0  1   87   92    47 0.905   4.12
 5.17 Intr + 209708 209828  121  0  1   93   91    39 0.524   4.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  85510  85303  208  1  1   54   70   161 0.820   7.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:109811024_110025186|GENSCAN_predicted_peptide_1|244_aa
MVSMKCSAPLLAQANQDGAIRIFFGFHVLGSHHGCAWGPASTAAGSIQPVDPLSPSSLQQ
APRPWAAAAGPKQNDPIEPSEKSEWAFSKASKATLLSCSESFDGSLLPWHEGLALQAAPT
TQATVIPRNLLDIGSEPCCRYQNPRMLRPQGPTGGKPKSAAQDSSLCTEKGFPILPPASQ
YNAGEKEEEEEGEGEGEGEEEEEEEEEEEEAEEEEEEEKEEEEKNIYVYIYFIFLMFAIL
IDLI

>gi568815597f:109811024_110025186|GENSCAN_predicted_CDS_1|735_bp
atggtgtccatgaagtgctcagcaccattactagcacaagcaaaccaagatggcgccatc
cgcatcttcttcgggtttcacgtgttgggcagccaccacggctgtgcctggggcccggcc
agcacagctgcaggctcaattcagcctgtggaccccttgtctccaagctcactgcagcag
gccccaaggccttgggcagcagcagcaggtcccaaacaaaatgaccccatagagcctagc
gaaaagtcagagtgggctttttcaaaagcaagtaaggccacattactgtcctgctcagag
tccttcgatggctccctgctgccctggcatgaaggacttgccctgcaagctgctccaacc
acccaggctacagtcatccctcggaatctcttggatattggttccgaaccctgctgcaga
taccaaaacccacggatgctcagacctcaagggcccaccgggggcaagcccaaatcagct
gcccaggacagctcgctttgcacagagaaaggcttccccatccttcctccagccagccaa
tacaatgcaggagaaaaagaagaagaagaagaaggagaaggagaaggagaaggagaagag
gaagaggaagaggaagaagaagaagaagaagcagaagaagaagaagaagaagaaaaagag
gaagaagaaaaaaatatatatgtatatatatatttcatctttttaatgtttgccatcctg
atagatttaatttga

>gi568815597f:109811024_110025186|GENSCAN_predicted_peptide_2|615_aa
MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINGTKDKKHM
IISIDAEKAFDKIQQHFMLKTLNKLGIHGTYLKIIRAIYDKPTANIILNGQKLKAFPLKN
GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSA
QNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR
DVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAVLPKVIYRFSAIPIKLPMT
FFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRD
IDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLTICRKLKLDPFLTP
YTKINSRWIKDLNIRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIK
LKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKNKWHT
RRSYPMTGSVGPMPTEPSPLLVQQSEIDLRGSSLIKGSGLLDSSGPLPVCTTLPLRSEQL
LLQIECIWAIRGILV

>gi568815597f:109811024_110025186|GENSCAN_predicted_CDS_2|1848_bp
atgaacattgatgcaaaaatcctcaataaaatactggcaaacagaatccagcagcacatc
aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat
atatgcaaatcaataaatgtaatccagcatataaacggaaccaaagacaaaaaacacatg
attatctcaatagatgcagaaaaggcctttgacaaaattcaacaacacttcatgctaaaa
actctcaataaattaggtattcatgggacgtatctcaaaataataagagctatctatgac
aaacccacagccaatatcatactgaatgggcaaaaactgaaagcattccctttgaaaaat
ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg
gccagggcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtc
aaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcc
caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta
caaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg
gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggat
acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg
gccgtattgcccaaggtaatttacagattcagtgccatccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc
atcaccaagtcaatcctgagccaaaagaacaaagctggaggcatcacactacctgacttc
aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat
atagatcaatggaacagaacagagccctcagaaataacaccgcatatctacaactatctg
atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg
tgctgggaaaactggctcaccatatgtagaaagctgaaactggatcccttccttacacct
tatacaaaaatcaattcaagatggattaaagacttaaacattagacctaaaaccataaaa
accctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatg
tctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa
ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagacaacctacaaaa
tgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctacaat
gaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaacaaatggcacacc
aggagatcatatcccatgactggctcagtgggtcccatgcccacggagcctagcccactg
ctagtgcagcagtctgagattgacctgcgaggcagcagcctgataaaggggtccgggctg
ctggattctagtggtcctttaccagtgtgcacaacattgcctttgcgctcagagcagttg
ctgctacagattgaatgcatttgggccatccgtggaatcctggtgtaa

>gi568815597f:109811024_110025186|GENSCAN_predicted_peptide_3|508_aa
MSELPFTITTKRIKYPGIQLTKDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRVNIV
KMSILPKVIYRFNAIPIKKPMTFFTELEKTTLKFMWNQKRARIAKTILSKMNKAGGITLP
DFKIYYKAIVTKTAWSSNFCLIYNELRLLCTSSCLSDESAEEQFRVKPAFLPEVSPSLET
PSQARDVQKPAGKADALGGRLPFLVTHPRRVSSKQKVIVDRKRGRSAQQQPMERGLACWL
EDPGLGLTAATEMRHSVQNPSLLAALPPFTLSLTLKVGEDMTERAGCSANDYRMGLEAGR
AAWGLQRKKGAAGKPADSGSTREQVTLDSFRHSENGSSLKRLCRGLASEARPGESESLPG
SSRRQSRSPHPRTAVRPSAGAPTPQQPASERASEGGRRARPGPSCPYDRAGRRRALPSHG
KRRPRRWARDGLGRGLGGQAAGSGPSEPTAWARRLHWPGVPAADEPPLRTDWALAPGRRP
VAALQLHRDCQASLGNGGLLTPLHGGEH

>gi568815597f:109811024_110025186|GENSCAN_predicted_CDS_3|1527_bp
atgagtgaactcccattcacaattactacaaagagaataaaatatccaggaatccaactt
acaaaggatgtgaaggacctcttcaaggagaactacaaaccactgctaaatgaaataaaa
gaggacacaaacaaatggaagaacattccatgctcatggataggaagagtcaatatcgtg
aaaatgtccatactgcccaaggtaatttatagattcaatgccatccccatcaagaaacca
atgactttcttcacagaattggaaaaaactactttaaagttcatgtggaatcaaaagaga
gcccgcattgccaagacaatcctaagcaaaatgaacaaagctggaggcatcacactacct
gacttcaaaatatactacaaggctatagtaacgaaaacagcatggtcctccaatttctgc
ctcatctacaacgagctcaggctcctctgcacctcctcctgtctgagcgatgagtctgct
gaggaacaattccgggtcaagccggctttcttaccagaggtctcgccctcgctggagact
cccagccaggcccgggatgtgcagaagccagcagggaaggctgacgccttaggaggcaga
ttgcccttcctggtgactcaccccagaagggtcagctctaagcagaaagtcattgttgac
aggaagagggggagaagtgcccagcagcagccaatggaaaggggacttgcctgctggctg
gaggaccccggcctggggctcacggcagccacagaaatgaggcattcagtccaaaatcct
tccctgctggctgctttaccacccttcacgctgtctctcaccttgaaagtgggagaagac
atgactgaaagggcaggctgcagtgcaaatgactatagaatgggacttgaagctggccgg
gctgcttgggggctgcagaggaagaagggggctgccggcaaacctgctgactcaggctcc
acgagggagcaagtaacactggactcctttcggcactccgagaatgggtcctctcttaaa
aggctgtgccgagggctggccagtgaggctcggcccggggaaagtgaaagtttgcctggg
tcctctcggcgccagagccgctctccgcatcccaggacagcggtgcggccctcggccggg
gcgcccactccgcagcagccagcgagcgagcgagcgagcgagggcggccgacgcgcccgg
ccgggacccagctgcccgtatgaccgcgccgggcgccgccgggcgctgccctcccacggt
aagcgacggccgcggcgctgggcccgggacgggctggggcgggggctcggcggccaggcg
gccgggagcggcccctcggagccgacagcctgggcgcgccggctgcactggcccggcgtt
cccgccgcggacgaaccgcctttgcgcacggactgggccctggcgccagggcggaggccg
gtggcggccctgcagctgcaccgggactgccaggcttccctgggaaacggagggctgctc
accccattgcacggaggggaacactga

>gi568815597f:109811024_110025186|GENSCAN_predicted_peptide_4|564_aa
MTPSLSQTWLGSLLLLVCLLASRSITEEVSEYCSHMIGSGHLQSLQRLIDSQMETSCQIT
FEFVDQEQLKDPVCYLKKAFLLVQDIMEDTMRFRDNTPNAIAIVQLQELSLRLKSCFTKD
YEEHDKACVRTFYETPLQLLEKVKNVFNETKNLLDKDWNIFSKNCNNSFAECSSQDVVTK
PDCNCLYPKAIPSSDPASVSPHQPLAPSMAPVAGLTWEDSEGTEGSSLLPGEQPLHTVDP
GSAKQRPPRSTCQSFEPPETPVVKDSTIGGSPQPRPSVGAFNPGMEDILDSAMGTNWVPE
EASGEASEIPVPQGTELSPSRPGGGSMQTEPARPSNFLSASSPLPASAKGQQPADVTGTA
LPRVGPVRPTGQDWNHTPQKTDHPSALLRDPPEPGSPRISSLRPQGLSNPSTLSAQPQLS
RSHSSGSVLPLGELEGRRSTRDRRSPAEPEGGPASEGAARPLPRFNSVPLTDTGHERQSE
GSFSPQLQESVFHLLVPSVILVLLAVGGLLFYRWRRRSHQEPQRADSPLEQPEGRPLTIL
DTLVCQCPSENVTPSPGHSTPDVV

>gi568815597f:109811024_110025186|GENSCAN_predicted_CDS_4|1695_bp
atgacaccctctctgtcacagacatggctgggctccctgctgttgttggtctgtctcctg
gcgagcaggagtatcaccgaggaggtgtcggagtactgtagccacatgattgggagtgga
cacctgcagtctctgcagcggctgattgacagtcagatggagacctcgtgccaaattaca
tttgagtttgtagaccaggaacagttgaaagatccagtgtgctaccttaagaaggcattt
ctcctggtacaagacataatggaggacaccatgcgcttcagagataacacccccaatgcc
atcgccattgtgcagctgcaggaactctctttgaggctgaagagctgcttcaccaaggat
tatgaagagcatgacaaggcctgcgtccgaactttctatgagacacctctccagttgctg
gagaaggtcaagaatgtctttaatgaaacaaagaatctccttgacaaggactggaatatt
ttcagcaagaactgcaacaacagctttgctgaatgctccagccaagatgtggtgaccaag
cctgattgcaactgcctgtaccccaaagccatccctagcagtgacccggcctctgtctcc
cctcatcagcccctcgccccctccatggcccctgtggctggcttgacctgggaggactct
gagggaactgagggcagctccctcttgcctggtgagcagcccctgcacacagtggatcca
ggcagtgccaagcagcggccacccaggagcacctgccagagctttgagccgccagagacc
ccagttgtcaaggacagcaccatcggtggctcaccacagcctcgcccctctgtcggggcc
ttcaaccccgggatggaggatattcttgactctgcaatgggcactaattgggtcccagaa
gaagcctctggagaggccagtgagattcccgtaccccaagggacagagctttccccctcc
aggccaggagggggcagcatgcagacagagcccgccagacccagcaacttcctctcagca
tcttctccactccctgcatcagcaaagggccaacagccggcagatgtaactggtaccgcc
ttgcccagggtgggccccgtgaggcccactggccaggactggaatcacaccccccagaag
acagaccatccatctgccctgctcagagaccccccggagccaggctctcccaggatctca
tcactgcgcccccagggcctcagcaacccctccaccctctctgctcagccacagctttcc
agaagccactcctcgggcagcgtgctgccccttggggagctggagggcaggaggagcacc
agggatcggaggagccccgcagagccagaaggaggaccagcaagtgaaggggcagccagg
cccctgccccgttttaactccgttcctttgactgacacaggccatgagaggcagtccgag
ggatccttcagcccgcagctccaggagtctgtcttccacctgctggtgcccagtgtcatc
ctggtcttgctggccgtcggaggcctcttgttctacaggtggaggcggcggagccatcaa
gagcctcagagagcggattctcccttggagcaaccagagggcagacccctcaccatcctg
gacacactcgtttgtcaatgtccctctgaaaatgtgacgcccagccccggacacagtact
ccagatgttgtctga

>gi568815597f:109811024_110025186|GENSCAN_predicted_peptide_5|635_aa
MEAKKSQAEGLHLASLLAAEVAARAGRSSELLFWFSCGRRRCPAALGCRTDKAWATAPQK
PTQLDAGAGRRVGDRVSEGAARAGGRAPEGERGGGGGSAAGRAGRGMSMPDAMPLPGVGE
ELKQAKEIEDAEKYSFMATVTKAPKKQIQFADDMQEFTKFPTKTGRRSLSRSISQSSTDS
YSSAASYTDSSDDEVSPREKQQTNSKGSSNFCVKNIKQAEFGRREIEIAEQDMSALISLR
KRAQGEKPLAGAKIVGCTHITAQTAVLIETLCALGAQCRWSACNIYSTQNEVAAALAEAG
VAVFAWKGESEDDFWWCIDRCVNMDGWQANMILDDGGDLTHWVYKKYPNVFKKIRGIVEE
SVTGVHRLYQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDGLKRTTDVMFGGKQV
VVCGYGEVGKGCCAALKALGAIVYITEIDPICALQACMDGFRVVKLNEVIRQVDVVITCT
GNKNVVTREHLDRMKNSCIVCNMGHSNTEIDVTSLRTPELTWERVRSQVDHVIWPDGKRV
VLLAEGRLLNLSCSTVPTFVLSITATTQALALIELYNAPEGRYKQDVYLLPKKMDEYVAS
LHLPSFDAHLTELTDDQAKYLGLNKNGPFKPNYYS

>gi568815597f:109811024_110025186|GENSCAN_predicted_CDS_5|1905_bp
atggaggctaagaagtcccaggctgaggggctgcatctggcgagtcttcttgctgcggag
gtggcggcgcgggcaggtcggagctcggagctgctgttctggttctcttgtggccgccgt
cgctgtccggctgccttgggctgccgaacagacaaggcgtgggccacagcacctcagaag
ccgacgcagctcgacgcaggggccggcaggagggtgggcgatcgcgtgtcggagggcgcc
gcgcgggcaggcgggcgggcgccagagggggaaagaggcgggggcggcgggtcagccgct
ggccgggccggccggggaatgtcgatgcctgacgcgatgccgctgcccggggtcggggag
gagctgaagcaggccaaggagatcgaggacgccgagaagtactccttcatggccaccgtc
accaaggcgcccaagaagcaaatccagtttgctgatgacatgcaggagttcaccaaattc
cccaccaaaactggccgaagatctttgtctcgctcgatctcacagtcctccactgacagc
tacagttcagctgcatcctacacagatagctctgatgatgaggtttctccccgagagaag
cagcaaaccaactccaagggcagcagcaatttctgtgtgaagaacatcaagcaggcagaa
tttggacgccgggagattgagattgcagagcaagacatgtctgctctgatttcactcagg
aaacgtgctcagggggagaagcccttggctggtgctaaaatagtgggctgtacacacatc
acagcccagacagcggtgttgattgagacactctgtgccctgggggctcagtgccgctgg
tctgcttgtaacatctactcaactcagaatgaagtagctgcagcactggctgaggctgga
gttgcagtgttcgcttggaagggcgagtcagaagatgacttctggtggtgtattgaccgc
tgtgtgaacatggatgggtggcaggccaacatgatcctggatgatgggggagacttaacc
cactgggtttataagaagtatccaaacgtgtttaagaagatccgaggcattgtggaagag
agcgtgactggtgttcacaggctgtatcagctctccaaagctgggaagctctgtgttccg
gccatgaacgtcaatgattctgttaccaaacagaagtttgataacttgtactgctgccga
gaatccattttggatggcctgaagaggaccacagatgtgatgtttggtgggaaacaagtg
gtggtgtgtggctatggtgaggtaggcaagggctgctgtgctgctctcaaagctcttgga
gcaattgtctacattaccgaaatcgaccccatctgtgctctgcaggcctgcatggatggg
ttcagggtggtaaagctaaatgaagtcatccggcaagtcgatgtcgtaataacttgcaca
ggaaataagaatgtagtgacacgggagcacttggatcgcatgaaaaacagttgtatcgta
tgcaatatgggccactccaacacagaaatcgatgtgaccagcctccgcactccggagctg
acgtgggagcgagtacgttctcaggtggaccatgtcatctggccagatggcaaacgagtt
gtcctcctggcagagggtcgtctactcaatttgagctgctccacagttcccacctttgtt
ctgtccatcacagccacaacacaggctttggcactgatagaactctataatgcacccgag
gggcgatacaagcaggatgtgtacttgcttcctaagaaaatggatgaatacgttgccagc
ttgcatctgccatcatttgatgcccaccttacagagctgacagatgaccaagcaaaatat
ctgggactcaacaaaaatgggccattcaaacctaattattacagn