GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:33:14 Sequence gi568815591f:73077354_73205386 : 128033 bp : 47.52% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 1424 1181 244 1 1 80 90 268 0.924 23.70 1.04 Intr - 3456 3326 131 0 2 53 17 89 0.767 -2.21 1.03 Intr - 4501 4443 59 2 2 86 116 35 0.935 4.70 1.02 Intr - 5753 5535 219 0 0 42 54 265 0.573 16.77 1.01 Init - 6885 6756 130 0 1 63 94 83 0.232 6.72 1.00 Prom - 7767 7728 40 -5.16 2.00 Prom + 13484 13523 40 -7.56 2.01 Init + 15975 16042 68 2 2 76 48 50 0.939 0.34 2.02 Intr + 16304 16443 140 2 2 50 111 107 0.961 9.31 2.03 Intr + 18329 18415 87 0 0 62 97 126 0.999 10.84 2.04 Intr + 18592 18694 103 2 1 36 78 49 0.997 -2.07 2.05 Intr + 20031 20214 184 0 1 81 87 156 0.938 14.59 2.06 Term + 25216 25272 57 0 0 25 48 95 0.279 -3.11 2.07 PlyA + 25509 25514 6 1.05 3.08 PlyA - 26693 26688 6 1.05 3.07 Term - 28133 28029 105 2 0 107 43 91 0.997 4.91 3.06 Intr - 29457 29214 244 2 1 80 90 268 0.976 23.70 3.05 Intr - 31490 31360 131 2 2 53 17 89 0.773 -2.21 3.04 Intr - 32535 32477 59 1 2 86 116 35 0.935 4.70 3.03 Intr - 33787 33569 219 2 0 42 54 265 0.577 16.77 3.02 Intr - 34945 34770 176 0 2 42 94 97 0.291 5.28 3.01 Init - 37020 36962 59 0 2 82 85 24 0.492 2.44 3.00 Prom - 42260 42221 40 -4.56 4.04 PlyA - 42410 42405 6 -0.45 4.03 Term - 44418 44336 83 1 2 77 50 74 0.929 0.36 4.02 Intr - 45879 45711 169 0 1 91 75 100 0.953 8.52 4.01 Init - 46718 46710 9 2 0 100 62 3 0.463 -0.66 4.00 Prom - 50913 50874 40 -7.36 5.00 Prom + 51041 51080 40 -3.16 5.01 Init + 71783 71790 8 1 2 104 91 0 0.265 2.40 5.02 Intr + 77375 77795 421 2 1 75 -20 202 0.236 2.05 5.03 Intr + 77806 77990 185 0 2 61 115 219 0.294 20.49 5.04 Intr + 79943 79970 28 2 1 120 60 -1 0.465 -1.58 5.05 Intr + 89662 89680 19 0 1 86 105 19 0.164 -0.42 5.06 Intr + 98062 98172 111 2 0 65 123 71 0.801 8.65 5.07 Intr + 99504 99569 66 1 0 60 80 76 0.778 2.88 5.08 Intr + 105762 105945 184 1 1 36 116 73 0.043 3.75 5.09 Intr + 107308 107366 59 1 2 76 97 -8 0.013 -2.57 5.10 Intr + 112713 112784 72 1 0 84 111 42 0.761 5.48 5.11 Intr + 114018 114201 184 1 1 88 106 155 0.812 16.15 5.12 Intr + 115144 115202 59 1 2 116 116 -41 0.756 0.03 5.13 Intr + 115598 115672 75 0 0 38 116 69 0.771 4.19 5.14 Intr + 117273 117374 102 1 0 120 111 -6 0.967 5.05 5.15 Intr + 118283 118348 66 0 0 62 103 97 0.952 7.48 5.16 Intr + 118526 118709 184 0 1 81 78 94 0.769 6.55 5.17 Intr + 120611 120666 56 2 2 102 81 29 0.974 2.22 5.18 Intr + 121340 121420 81 0 0 74 80 39 0.705 1.31 5.19 Intr + 122343 122426 84 1 0 42 111 52 0.865 2.69 5.20 Intr + 123096 123279 184 1 1 94 97 167 0.979 17.05 5.21 Intr + 123530 123584 55 2 1 59 79 44 0.122 -0.42 5.22 Intr + 127158 127251 94 2 1 66 93 54 0.125 3.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 65963 66047 85 1 1 79 77 47 0.816 3.68 S.002 Term + 66866 66912 47 0 2 100 54 47 0.853 -0.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:73077354_73205386|GENSCAN_predicted_peptide_1|261_aa MGQILGKIMMSHQPQPQEERSPQRSTSGYPLQEVVDDEVSGPSAPGVDPSPPRRSLGWKR KRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLEDPV IKRLLAWDKDLRVSDKIPSEPTILGASPKTLPPASRICIRPSNTPPPRNFHMSTVTPTLS YLANDMEEDDEAPKQNIFYFLYEETRSHIPLLSELWFQLCRYMNPRARKNCSQIALFRKY RFHFFCSMRCRAWVSLEELEE >gi568815591f:73077354_73205386|GENSCAN_predicted_CDS_1|783_bp atgggacagattttgggaaagatcatgatgagccatcaaccgcagccccaggaagagcgg agcccccagcggagcacctcagggtaccccctccaggaggtggtggatgatgaagtgtcg ggaccatcagcccctggggtagatcccagccccccacgtaggtcccttggctggaaaagg aagagggaatgtttggatgaatctgatgatgagccagagaaggagctcgcccctgagcct gaggagacctgggtggcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcga gtgtcgctcgtgctccctgagtactacgaggccttcaacaggctgcttgaggatcctgtc attaaaagactcctggcctgggacaaagatctgagggtgtcggacaagatcccatcggag cccaccatcctgggagcatcaccaaaaacccttcctccggcttctcggatttgcatccga ccttcgaatacccctccaccccgcaatttccacatgagcacagtcaccccaacactgagc tatctggccaatgacatggaggaggacgacgaggcccccaaacaaaacatcttctacttc ctgtacgaggagacccgctctcatatacccttgctcagtgagctttggttccagttatgc cgttacatgaacccgagggccaggaagaactgctctcagatagccttgttccggaagtat cggttccacttcttttgttccatgcgctgcagggcttgggtttccctggaggagttggaa gag >gi568815591f:73077354_73205386|GENSCAN_predicted_peptide_2|212_aa MLSVFKKEDTIIAKDFGNLRDTITEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKKIVGN SLDAGATNIDLKLKDYGMDLIEVSGNGCGVEEENFEGLTLKHHTSKIQEFADLTRVETFG FRGKALSSLCALSDVTISTCHVSAKVGTRLVFDHDGKIIQKTPYPHPRGTTVSVKQLFST LPVRHKEFQRNIKKPVGEKYMRSVDTQACSCI >gi568815591f:73077354_73205386|GENSCAN_predicted_CDS_2|639_bp atgctttcagttttcaagaaagaagacaccattattgccaaagattttggtaatttgaga gatacaattacagaacctgctaaggccatcaaacctattgatcggaagtcagtccatcag atttgctctgggccggtggtactgagtctaagcactgcggtgaagaagatagtaggaaac agtctggatgctggtgccactaatattgatctaaagcttaaggactatggaatggatctc attgaagtttcaggcaatggatgtggggtagaagaagaaaacttcgaaggcttaactctg aaacatcacacatctaagattcaagagtttgccgacctaactcgggttgaaacttttggc tttcgggggaaagctctgagctcactttgtgcactgagtgatgtcaccatttctacctgc cacgtatcggcgaaggttgggactcgactggtgtttgatcacgatgggaaaatcatccag aaaaccccctacccccaccccagagggaccacagtcagcgtgaagcagttattttctacg ctacctgtgcgccataaggaatttcaaaggaatattaagaagccagttggtgagaagtac atgcggtctgtggacacccaagcttgcagctgcatctga >gi568815591f:73077354_73205386|GENSCAN_predicted_peptide_3|330_aa MDSATPHDPAAPLLVTVLESVQKKTKDRTETRFGEMGQILGKIMMSHQPQPQEERSPQRS TSGYPLQEVVDDEVSGPSAPGVDPSPPRRSLGWKRKRECLDESDDEPEKELAPEPEETWV AETLCGLKMKAKRRRVSLVLPEYYEAFNRLLEDPVIKRLLAWDKDLRVSDKIPSEPTILG ASPKTLPPASRICIRPSNTPPPRNFHMSTVTPTLSYLANDMEEDDEAPKQNIFYFLYEET RSHIPLLSELWFQLCRYMNPRARKNCSQIALFRKYRFHFFCSMRCRAWVSLEELEENTGP RGDVDFQQELYSNANGRHQEGGEEPFVQII >gi568815591f:73077354_73205386|GENSCAN_predicted_CDS_3|993_bp atggactcagccacaccccatgacccagcagctccgctcctggtgactgttctagagagt gtccagaagaagaccaaggacagaacagagactaggtttggtgagatgggacagattttg ggaaagatcatgatgagccatcaaccgcagccccaggaagagcggagcccccagcggagc acctcagggtaccccctccaggaggtggtggatgatgaagtgtcgggaccatcagcccct ggggtagatcccagccccccacgtaggtcccttggctggaaaaggaagagagaatgtttg gatgaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgggtg gcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctc cctgagtactacgaggccttcaacaggctgcttgaggatcctgtcattaaaagactcctg gcctgggacaaagatctgagggtgtcggacaagatcccatcggagcccaccatcctggga gcatcaccaaaaacccttcctccggcttctcggatttgcatccgaccttcgaatacccct ccaccccgcaatttccacatgagcacagtcaccccaacactgagctatctggccaatgac atggaggaggacgacgaggcccccaaacaaaacatcttctacttcctgtacgaggagacc cgctctcatatacccttgctcagtgagctttggttccagttatgccgttacatgaacccg agggccaggaagaactgctctcagatagccttgttccggaagtatcggttccacttcttt tgttccatgcgctgcagggcttgggtttccctggaggagttggaagagaacaccggaccc aggggagatgtggattttcagcaggaactttattccaatgctaatggcagacatcaggaa ggaggagaggaaccatttgtgcagatcatctag >gi568815591f:73077354_73205386|GENSCAN_predicted_peptide_4|86_aa MAQAQRLRQELQMLMTECLTWARNGSSKHPLSLAQKPEFLFHQLQSRDNAICPIAEQKAG TANPVVQLLPQFPLVLQVPTVAVALL >gi568815591f:73077354_73205386|GENSCAN_predicted_CDS_4|261_bp atggctcaggctcagaggctcaggcaagagctacagatgctcatgaccgaatgtctcacc tgggccaggaatggcagcagcaagcaccctctcagcctagcccagaagccagagttccta tttcatcagttgcaaagcagagacaatgccatctgcccgatagcagagcaaaaggcaggt accgccaaccctgtggtgcagctgctgccccagtttccccttgtgctccaggtccccact gtggcagttgctcttctctga >gi568815591f:73077354_73205386|GENSCAN_predicted_peptide_5|793_aa MPSPEGARAPAFPVHRAAGSSGRPAPAAATAATAAAAAAPGEPPQADYKSRQAARGPRMR SGDRRLSGKLFVTANTSCSPRAGRRAPLLRRSAPRLLLCRRCLPRPAAPPCNLAGLLPFV RPGPAAAAPRARRRAPGSPGRLSGAEGAGAAARPPADEPPRTGRGRTMELHILEHRLQVA SVAKESIPLFTYGLIKLAFLSSKTRKQPVLSQAQEATLVPDDDYSPPSKRPKANELPQPP VPEPANAGKRKVREFNFEKWNARITDLRKQVEELFERKYAEALGSTEAKAVPYQKFEAHP NDLYVEGLPENIPFRSPSWYGIPRLEKIIQVGNRIKFVIKRPELLTHSTTEVTQPRTNTP VKEDWNVRITKLRKQVEEIFNLKFAQALGLTEAVKVPYPVFESNPEFLYVEGLPEGIPFR SPTWFGIPRLERIVHGSNKIKFVVKKPELVISYLPPGMASKINTKALQSPKRPRSPGSNS KVPEIEVTVEGPNNNNPQTSAVRTPTQTNGSNVPFKPRGREFSFEAWNAKITDLKQKVEN LFNEKCGEALGLKQAVKVPFALFESFPEDFYVEGLPEGVPFRRPSTFGIPRLEKILRNKA KIKFIIKKPEMFETAIKESTSSKSPPRKINSSPNVNTTASGVEDLNIIQVTIPDDDNERL SKVEKARQLREQVNDLFSRKFGEAIGMGFPVKVPYRKITINPGCVVVDGMPPGVSFKAPS YLEISSMRRILDSAEFIKFTVIRVFLKHSYTRIIHGCFPVTYKAAVEQCGNQSLCFSLES AEPSQLEVPATEX >gi568815591f:73077354_73205386|GENSCAN_predicted_CDS_5|2379_bp atgcccagtcctgagggtgcccgcgctcctgcatttcccgtgcaccgggctgccggtagc tccggccgcccagcccccgcggcagcaacagcagcaacagcagcagcagcagcagcaccg ggggagcccccccaggcggactacaagtcccggcaggccgcgcgcgggccgcgcatgcgc agcggggaccggcgtttgagtggcaagttgtttgttacagcgaacaccagctgctccccc cgcgccgggcgccgcgcgccgctgctccgccgctcggcccctcggctgctgctctgccgg cgctgcctccctcgccccgcggctcccccttgcaacttggcgggcctcctcccttttgtc cggcccggcccggccgccgccgccccccgcgcccggcgccgagctcccgggtctccgggc cggctgtcgggcgcggagggggcgggggccgcggctcgtccccccgcggatgagccgccg cggacggggcgcgggcggacgatggaactccacatcctggagcaccggctgcaagttgcc agcgtcgccaaggagagtatcccgctgttcacctacggcctgatcaaacttgccttcctg tcctccaagaccagaaagcagcctgtcctgtcccaggcccaggaagccaccctggtgcct gatgatgattattctccaccgtctaagagaccaaaggccaatgagctaccgcagccacca gtcccggaacccgccaatgctgggaagcggaaagtgagggagttcaacttcgagaaatgg aatgctcgcatcactgatctacgtaaacaagttgaagaattgtttgaaaggaaatatgcg gaagccttggggagcactgaagccaaggctgtaccgtaccaaaaatttgaggcacacccg aatgatctgtacgtggaaggactgccagaaaacattcctttccgaagtccctcatggtat ggaatcccaaggctggaaaaaatcattcaagtgggcaatcgaattaaatttgttattaaa agaccagaacttctgactcacagtaccactgaagttactcagccaagaacgaatacacca gtcaaagaagattggaatgtcagaattaccaagctacggaagcaagtggaagagattttt aatttgaaatttgctcaagctcttggactcaccgaggcagtaaaagtaccatatcctgtg tttgaatcaaacccggagttcttgtatgtggaaggcttgccagaggggattcccttccga agccctacctggtttggaattccacgacttgaaaggatcgtccacgggagtaataaaatc aagttcgttgttaaaaaacctgaactagttatttcctacttgcctcctgggatggctagt aaaataaacactaaagctttgcagtcccccaaaagaccacgaagtcctgggagtaattca aaggttcctgaaattgaggtcaccgtggaaggccctaataacaacaatcctcaaacctca gctgttcgaaccccgacccagactaacggttctaacgttcccttcaagccacgagggaga gagttttcctttgaggcctggaatgccaaaatcacggacctaaaacagaaagttgaaaat ctcttcaatgagaaatgtggggaagctcttggccttaaacaagctgtgaaggtgccgttc gcgttatttgagtctttcccggaagacttttatgtggaaggcttacctgagggtgtgcca ttccgaagaccatcgacttttggcattccgaggctggagaagatactcagaaacaaagcc aaaattaagttcatcattaaaaagcccgaaatgtttgagacggcgattaaggagagcacc tcctctaagagccctcccagaaaaataaattcatcacccaatgttaatactactgcatca ggtgttgaagaccttaacatcattcaggtgacaattccagatgatgataatgaaagactc tcgaaagttgaaaaagctagacagctaagagaacaagtgaatgacctctttagtcggaaa tttggtgaagctattggtatgggttttcctgtgaaagttccctacaggaaaatcacaatt aaccctggctgtgtggtggttgatggcatgcccccgggggtgtccttcaaagcccccagc tacctggaaatcagctccatgagaaggatcttagactctgccgagtttatcaaattcacg gtcattagagtcttcctgaaacacagctacacccgtattatccatggctgctttcctgtc acatataaagcagctgttgaacaatgtggaaatcagtctctgtgtttctctttagaatca gctgaaccaagccagttggaagttccagccacagaagnn