GENSCAN 1.0    Date run: 3-Nov-116    Time: 19:33:19

Sequence gi568815591r:75188120_75316047 : 127928 bp : 47.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.20 Intr -   1137   1096   42  1  0   85  105    15 0.117   1.11
 1.19 Intr -   3677   3619   59  1  2   65   80    53 0.499   0.73
 1.18 Intr -   4111   3928  184  2  1   94   97   167 0.979  17.05
 1.17 Intr -   4861   4780   82  1  1   43  111    82 0.170   5.21
 1.16 Intr -   5864   5797   68  0  2   74   61    34 0.152  -2.08
 1.15 Intr -   6593   6538   56  1  2  102   81    29 0.723   2.22
 1.14 Intr -   8677   8494  184  2  1   81   78    94 0.595   6.55
 1.13 Intr -   8920   8855   66  2  0   62  103    97 0.940   7.48
 1.12 Intr -   9930   9829  102  1  0  120  111    -6 0.967   5.05
 1.11 Intr -  11606  11532   75  0  0   38  116    69 0.893   4.19
 1.10 Intr -  12060  12002   59  2  2  116  116   -41 0.914   0.03
 1.09 Intr -  13185  13002  184  1  1   88  106   134 0.985  14.05
 1.08 Intr -  14494  14423   72  2  0   84  111    42 0.954   5.48
 1.07 Intr -  16170  16082   89  2  2   12   81    75 0.018  -1.19
 1.06 Intr -  21440  21257  184  0  1   36  116    73 0.062   3.75
 1.05 Intr -  27702  27637   66  1  0   60   80    76 0.787   2.88
 1.04 Intr -  29145  29035  111  1  0   65  123    71 0.838   8.65
 1.03 Intr -  37536  37518   19  0  1   79  105    19 0.069  -1.12
 1.02 Intr -  47268  47241   28  2  1  120   60    -1 0.178  -1.58
 1.01 Init -  49286  49174  113  2  2   74  115   236 0.373  24.58
 1.00 Prom -  76169  76130   40                              -3.16

 2.00 Prom +  76297  76336   40                              -7.36
 2.01 Init +  80484  80492    9  2  0  100   62     3 0.467  -0.66
 2.02 Intr +  81309  81477  169  2  1   91   75   123 0.968  10.82
 2.03 Term +  82770  82852   83  1  2   77   50    74 0.939   0.36
 2.04 PlyA +  82961  82966    6                              -0.45

 3.00 Prom +  84942  84981   40                              -3.96
 3.01 Init +  91003  91061   59  0  2   82   85    24 0.497   2.44
 3.02 Intr +  93078  93253  176  0  2   42   94    94 0.281   4.98
 3.03 Intr +  94249  94467  219  2  0   42   54   265 0.471  16.77
 3.04 Intr +  95500  95558   59  2  2   86  116    35 0.939   4.70
 3.05 Intr +  96545  96675  131  1  2   53   17    89 0.776  -2.21
 3.06 Intr +  98577  98820  244  0  1   80   90   268 0.974  23.70
 3.07 Term +  99901 100005  105  0  0  110   43    91 0.992   5.21
 3.08 PlyA + 101171 101176    6                               1.05

 4.06 PlyA - 101184 101179    6                               1.05
 4.05 Term - 102820 102764   57  1  0   25   48    95 0.169  -3.11
 4.04 Intr - 107970 107822  149  1  2    3   87   109 0.625   2.25
 4.03 Intr - 109641 109555   87  1  0   62   97   126 0.997  10.84
 4.02 Intr - 111674 111535  140  1  2   50  111   107 0.961   9.31
 4.01 Init - 112003 111936   68  1  2   76   48    50 0.940   0.34
 4.00 Prom - 113395 113356   40                              -7.56

 5.00 Prom + 120170 120209   40                              -5.16
 5.01 Init + 121052 121181  130  1  1   63   94    80 0.221   6.42
 5.02 Intr + 122177 122395  219  0  0   42   54   265 0.466  16.77
 5.03 Intr + 123428 123486   59  0  2   86  116    35 0.939   4.70
 5.04 Intr + 124473 124603  131  2  2   53   17    89 0.770  -2.21
 5.05 Intr + 126505 126748  244  1  1   80   90   268 0.924  23.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  60274  60228   47  2  2  100   54    47 0.871  -0.13
S.002 Init -  61177  61093   85  1  1   79   77    47 0.830   3.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:75188120_75316047|GENSCAN_predicted_peptide_1|615_aa
MELHILEHRLQVASVAKESIPLFTYGLIKLAFLSSKTRKQPVLSQAQEATLVPDDDYSPP
SKRPKANELPQPPVPEPANAGKRKVREFNFEKWNARITDLRKQVEELFERKYAEALGSTE
AKAVPYQKFEAHPNDLYVEGLPENIPFRSPSWYGIPRLEKIIQVGNRIKFVIKSLRSSAV
MDGSTFFIGFSSNWKMQGPGQQQVKEDWNVRITKLRKQVEEIFNLKFAQALGLTEAVKVP
YPVFESNPEFLYVEGLPEGIPFRSPTWFGIPRLERIVRGSNKIKFVVKKPELVISYLPPG
MASKINTKALQSPKRPRSPGSNSKVPEIEVTVEGPNNNNPQTSAVRTPTQTNGSNVPFKP
RGREFSFEAWNAKITDLKQKVENLFNEKCGEALGLKQAVKVPFALFESFPEDFYVEGLPE
GVPFRRPSTFGIPRLEKILRNKAKIKFIIKKPEMFETAIKESTSSKSPPRKINSSPNVNT
TASGVEDLNIIQMMIMKDSRKLKKLSLREQVNDLFSRKFGEAIGMGFPVKVPYRKITINP
GCVVVDGMPPGVSFKAPSYLEISSMRRILDSAEFIKFTVIRVFLKHSYTRIIHGCFPVTV
KLVDQSESEGPVIQX

>gi568815591r:75188120_75316047|GENSCAN_predicted_CDS_1|1845_bp
atggaactccacatcctggagcaccggctgcaagttgccagcgtcgccaaggagagtatc
ccgctgttcacctacggcctgatcaaacttgccttcctgtcctccaagaccagaaagcag
cctgtcctgtcccaggcccaggaagccaccctggtgcctgatgatgattattctccaccg
tctaagagaccaaaggccaatgagctaccgcagccaccagtcccggaacccgccaatgct
gggaagcggaaagtgagggagttcaacttcgagaaatggaatgctcgcatcactgatcta
cgtaaacaagttgaagaattgtttgaaaggaaatatgcggaagccttggggagcactgaa
gccaaggctgtaccgtaccaaaaatttgaggcacacccgaatgatctgtacgtggaagga
ctgccagaaaacattcctttccgaagtccctcatggtatggaatcccaaggctggaaaaa
atcattcaagtgggcaatcgaattaaatttgttattaaaagcttgaggagctccgcagtg
atggatggcagcacattcttcattggcttctccagtaattggaagatgcaaggacctggc
cagcaacaagtcaaagaagattggaatgtcagaattaccaagctacggaagcaagtggaa
gagatttttaatttgaaatttgctcaagctcttggactcaccgaggcagtaaaagtacca
tatcctgtgtttgaatcaaacccggagttcttgtatgtggaaggcttgccagaggggatt
cccttccgaagccctacctggtttggaattccacgacttgaaaggatcgtccgcgggagt
aataaaatcaagttcgttgttaaaaaacctgaactagttatttcctacttgcctcctggg
atggctagtaaaataaacactaaagctttgcagtcccccaaaagaccacgaagtcctggg
agtaattcaaaggttcctgaaattgaggtcaccgtggaaggccctaataacaacaatcct
caaacctcagctgttcgaaccccgacccagactaacggttctaacgttcccttcaagcca
cgagggagagagttttcctttgaggcctggaatgccaaaatcacggacctaaaacagaaa
gttgaaaatctcttcaatgagaaatgtggggaagctcttggccttaaacaagctgtgaag
gtgccgttcgcgttatttgagtctttcccggaagacttttatgtggaaggcttacctgag
ggtgtgccattccgaagaccatcgacttttggcattccgaggctggagaagatactcaga
aacaaagccaaaattaagttcatcattaaaaagcccgaaatgtttgagacggcgattaag
gagagcacctcctctaagagccctcccagaaaaataaattcatcacccaatgttaatact
actgcatcaggtgttgaagaccttaacatcattcagatgatgataatgaaagactctcga
aagttgaaaaagctgtcgctaagagaacaagtgaatgacctctttagtcggaaatttggt
gaagctattggtatgggttttcctgtgaaagttccctacaggaaaatcacaattaaccct
ggctgtgtggtggttgatggcatgcccccgggggtgtccttcaaagcccccagctacctg
gaaatcagctccatgagaaggatcttagactctgccgagtttatcaaattcacggtcatt
agagtcttcctgaaacacagctacacccgtattatccatggctgctttcctgtcacagtg
aagctggttgatcagagtgagtcagaaggccccgtgatacaagnn

>gi568815591r:75188120_75316047|GENSCAN_predicted_peptide_2|86_aa
MAQAQRLRQELQMLMTECLTWARNGSSKHPLSLAQKPEFLFHQLQSRDNAICPIAEQKAG
TANPVVQLLPQFPLVLQVPTVAVALL

>gi568815591r:75188120_75316047|GENSCAN_predicted_CDS_2|261_bp
atggctcaggctcagaggctcaggcaagagctacagatgctcatgaccgaatgtctcacc
tgggcccggaatggcagcagcaagcaccctctcagcctagcccagaagccagagttccta
tttcatcagttgcaaagcagagacaatgccatctgcccgatagcagagcaaaaggcaggt
accgccaaccctgtggtgcagctgctgccccagtttccccttgtgctccaggtccccact
gtggcagttgctcttctctga

>gi568815591r:75188120_75316047|GENSCAN_predicted_peptide_3|330_aa
MDSATPHDPAAPLLVTVLESVQKKTKDRTETRFGEMGQILGKIMMSHQPQPQEERSPQRS
TSGYPLQEVVDDEVLGPSAPGVDPSPPRRSLGWKRKRECLDESDDEPEKELAPEPEETWV
AETLCGLKMKAKRRRVSLVLPEYYEAFNRLLEDPVIKRLLAWDKDLRVSDKIPSEPTILG
ASPKTLPPASRICIRPSNTPPPRNFHMSTVTPTLSYLANDMEEDDEAPKQNIFYFLYEET
RSHIPLLSELWFQLCRYMNPRARKNCSQIALFRKYRFHFFCSMRCRAWVSLEELEENTGP
RGDVDFQQELYSNANGRHQEGGEEPFVQII

>gi568815591r:75188120_75316047|GENSCAN_predicted_CDS_3|993_bp
atggactcagccacaccccatgacccagcagctccgctcctggtgactgttctagagagt
gtccagaagaagaccaaggacagaacagagactaggtttggtgagatgggacagattttg
ggaaagatcatgatgagccatcaaccgcagccccaggaagagcggagcccccagcggagc
acctcagggtaccccctccaggaggtggtggatgatgaagtgttgggaccatcagcccct
ggggtagatcccagccccccacgtaggtcccttggctggaaaaggaagagggaatgtttg
gatgaatctgatgatgagccagagaaggagctcgcccctgagcctgaggagacctgggtg
gcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcgagtgtcgctcgtgctc
cctgagtactacgaggccttcaacaggctgcttgaggatcctgtcattaaaagactcctg
gcctgggacaaagatctgagggtgtcggacaagatcccatcggagcccaccatcctggga
gcatcaccaaaaacccttcctccggcttctcggatttgcatccgaccttcgaatacccct
ccaccccgcaatttccacatgagcacagtcaccccaacactgagctatctggccaatgac
atggaggaggacgacgaggcccccaaacaaaacatcttctacttcctgtacgaggagacc
cgctctcatatacccttgctcagtgagctttggttccagttatgccgttacatgaacccg
agggccaggaagaactgctctcagatagccttgttccggaagtatcggttccacttcttt
tgttccatgcgctgcagggcttgggtttccctggaggagttggaagagaacaccggaccc
aggggagatgtggattttcagcaggaactttattccaatgctaatggcagacatcaggaa
ggaggagaggaaccatttgtgcagatcatctag

>gi568815591r:75188120_75316047|GENSCAN_predicted_peptide_4|166_aa
MLSVFKKEDTIIAKDFGNLRDTITEPAKAIKPIDRKSVHQICSGPVVLSLSTAVKKIVGN
SLDAGATNIDLKLKDYGMDLIEVSGNGCGVEEENFEGLSWDSTGVFDHDGKIIQKTPYPH
PRGTTVSVKQLFSTLPVRHKEFQRNIKKPVGEKYMRSVDTQACSCI

>gi568815591r:75188120_75316047|GENSCAN_predicted_CDS_4|501_bp
atgctttcagttttcaagaaagaagacaccattattgccaaagattttggtaatttgaga
gatacaattacagaacctgctaaggccatcaaacctattgatcggaagtcagtccatcag
atttgctctgggccggtggtactgagtctaagcactgcggtgaagaagatagtaggaaac
agtctggatgctggtgccactaatattgatctaaagcttaaggactatggaatggatctc
attgaagtttcaggcaatggatgtggggtagaagaagaaaacttcgaaggcttaagttgg
gactcgactggtgtttttgatcacgatgggaaaatcatccagaaaaccccctacccccac
cccagagggaccacagtcagcgtgaagcagttattttctacgctacctgtgcgccataag
gaatttcaaaggaatattaagaagccagttggtgagaagtacatgcggtctgtggacacc
caagcttgcagctgcatctga

>gi568815591r:75188120_75316047|GENSCAN_predicted_peptide_5|261_aa
MGQILGKIMMSHQPQPQEERSPQRSTSGYPLQEVVDDEVLGPSAPGVDPSPPRRSLGWKR
KRECLDESDDEPEKELAPEPEETWVAETLCGLKMKAKRRRVSLVLPEYYEAFNRLLEDPV
IKRLLAWDKDLRVSDKIPSEPTILGASPKTLPPASRICIRPSNTPPPRNFHMSTVTPTLS
YLANDMEEDDEAPKQNIFYFLYEETRSHIPLLSELWFQLCRYMNPRARKNCSQIALFRKY
RFHFFCSMRCRAWVSLEELEE

>gi568815591r:75188120_75316047|GENSCAN_predicted_CDS_5|783_bp
atgggacagattttgggaaagatcatgatgagccatcaaccgcagccccaggaagagcgg
agcccccagcggagcacctcagggtaccccctccaggaggtggtggatgatgaagtgttg
ggaccatcagcccctggggtagatcccagccccccacgtaggtcccttggctggaaaagg
aagagggaatgtttggatgaatctgatgatgagccagagaaggagctcgcccctgagcct
gaggagacctgggtggcggagacgctgtgtggcctcaagatgaaggcgaagcgacggcga
gtgtcgctcgtgctccctgagtactacgaggccttcaacaggctgcttgaggatcctgtc
attaaaagactcctggcctgggacaaagatctgagggtgtcggacaagatcccatcggag
cccaccatcctgggagcatcaccaaaaacccttcctccggcttctcggatttgcatccga
ccttcgaatacccctccaccccgcaatttccacatgagcacagtcaccccaacactgagc
tatctggccaatgacatggaggaggacgacgaggcccccaaacaaaacatcttctacttc
ctgtacgaggagacccgctctcatatacccttgctcagtgagctttggttccagttatgc
cgttacatgaacccgagggccaggaagaactgctctcagatagccttgttccggaagtat
cggttccacttcttttgttccatgcgctgcagggcttgggtttccctggaggagttggaa
gag