GENSCAN 1.0    Date run: 5-Nov-116    Time: 04:24:04

Sequence gi568815597r:206010192_206216890 : 206699 bp : 43.54% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -    156     21  136  0  1   97    3   137 0.611   6.24
 1.08 Intr -   2215   2117   99  1  0   44   99   136 0.999  10.61
 1.07 Intr -   2458   2317  142  0  1   49   81   182 0.789  13.96
 1.06 Intr -   3703   3581  123  0  0   65   78   124 0.988   8.80
 1.05 Intr -   5939   5740  200  2  2  103  110   199 0.999  21.75
 1.04 Intr -  10976  10858  119  0  2   27   65    62 0.663  -2.02
 1.03 Intr -  12076  11959  118  1  1  114   92   119 0.991  14.94
 1.02 Intr -  12854  12710  145  1  1   45   48   186 0.527  10.58
 1.01 Init -  13600  13533   68  1  2   63   94   113 0.999   8.01
 1.00 Prom -  16820  16781   40                              -5.26

 2.00 Prom +  23571  23610   40                              -5.06
 2.01 Init +  23854  23930   77  0  2   60   96    58 0.745   4.39
 2.02 Intr +  40988  41155  168  2  0   87   78    47 0.059   2.66
 2.03 Intr +  49768  49833   66  1  0  116   91     1 0.093   1.22
 2.04 Intr +  50536  50642  107  1  2   69   70    83 0.844   4.46
 2.05 Intr +  51154  51206   53  2  2  101   99    12 0.948   2.23
 2.06 Term +  51471  51629  159  2  0   61   45   145 0.702   5.44
 2.07 PlyA +  52029  52034    6                               1.05

 3.00 Prom +  60797  60836   40                              -2.66
 3.01 Init +  66865  67021  157  0  1   39   89   112 0.712   6.47
 3.02 Intr +  75884  75912   29  0  2   80  105    13 0.116   0.03
 3.03 Intr +  87542  87648  107  1  2   99   70   102 0.991   8.51
 3.04 Intr +  87890  87990  101  2  2  119   28   141 0.974  10.95
 3.05 Intr +  89464  89607  144  2  0  119   69    44 0.918   5.85
 3.06 Intr +  90945  91006   62  1  2   50   95     6 0.897  -4.05
 3.07 Term +  91561  91761  201  0  0  102   38   178 0.977  11.59
 3.08 PlyA +  92211  92216    6                               1.05

 4.21 PlyA -  93325  93320    6                               1.05
 4.20 Term - 100433  99998  436  1  1  119   49   256 0.815  19.56
 4.19 Intr - 106711 105756  956  1  2   89   11  1301 0.122 112.39
 4.18 Intr - 107595 107473  123  0  0   11   96    68 0.534   0.68
 4.17 Intr - 118801 118641  161  2  2   64  111     2 0.253  -0.19
 4.16 Intr - 124695 124591  105  1  0   62   87    39 0.244   1.39
 4.15 Intr - 127005 126916   90  1  0  105   58    23 0.337   0.97
 4.14 Intr - 130476 130402   75  1  0   82   94    70 0.726   6.49
 4.13 Intr - 132213 132122   92  2  2   54   80    18 0.319  -2.76
 4.12 Intr - 133843 133594  250  2  1   99   34   113 0.544   3.50
 4.11 Intr - 140902 140697  206  0  2   27   15   136 0.005  -0.96
 4.10 Intr - 150999 150841  159  2  0   54   63   121 0.030   5.30
 4.09 Intr - 157748 157605  144  1  0   46   70    75 0.120   0.90
 4.08 Intr - 159074 159043   32  2  2   75   69    60 0.038  -0.27
 4.07 Intr - 174013 173961   53  2  2   61   98    46 0.014   1.53
 4.06 Intr - 185685 185561  125  2  2   69  110    36 0.037   4.13
 4.05 Intr - 189693 189616   78  2  0  115   46    63 0.043   3.47
 4.04 Intr - 193846 193715  132  0  0   41   49   132 0.183   4.36
 4.03 Intr - 194291 194158  134  2  2   60   22    64 0.136  -3.56
 4.02 Intr - 194774 194662  113  0  2   88   37    57 0.136   0.70
 4.01 Intr - 195738 195592  147  1  0   19   64   119 0.123   2.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 184380 184486  107  1  2   62   49    93 0.922   1.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:206010192_206216890|GENSCAN_predicted_peptide_1|384_aa
MKTLLLLLLVLLELGEAQGSLHRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQ
SAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRFQPSQ
SSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGI
LGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGS
LNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAA
PVDGEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPA
GPLWILGDVFIRQFYSVFDRGNNX

>gi568815597r:206010192_206216890|GENSCAN_predicted_CDS_1|1152_bp
atgaaaacgctccttcttttgctgctggtgctcctggagctgggagaggcccaaggatcc
cttcacaggaggcatccgtccctcaagaagaagctgcgggcacggagccagctctctgag
ttctggaaatcccataatttggacatgatccagttcaccgagtcctgctcaatggaccag
agtgccaaggaacccctcatcaactacttggatatggaatacttcggcactatctccatt
ggctccccaccacagaacttcactgtcatcttcgacactggctcctccaacctctgggtc
ccctctgtgtactgcactagcccagcctgcaagacgcacagcaggttccagccttcccag
tccagcacatacagccagccaggtcaatctttctccattcagtatggaaccgggagcttg
tccgggatcattggagccgaccaagtctctgtggaaggactaaccgtggttggccagcag
tttggagaaagtgtcacagagccaggccagacctttgtggatgcagagtttgatggaatt
ctgggcctgggatacccctccttggctgtgggaggagtgactccagtatttgacaacatg
atggctcagaacctggtggacttgccgatgttttctgtctacatgagcagtaacccagaa
ggtggtgcggggagcgagctgatttttggaggctacgaccactcccatttctctgggagc
ctgaattgggtcccagtcaccaagcaagcttactggcagattgcactggataacatccag
gtgggaggcactgttatgttctgctccgagggctgccaggccattgtggacacagggact
tccctcatcactggcccttccgacaagattaagcagctgcaaaacgccattggggcagcc
cccgtggatggagaatatgctgtggagtgtgccaaccttaacgtcatgccggatgtcacc
ttcaccattaacggagtcccctataccctcagcccaactgcctacaccctactggacttc
gtggatggaatgcagttctgcagcagtggctttcaaggacttgacatccaccctccagct
gggcccctctggatcctgggggatgtcttcattcgacagttttactcagtctttgaccgt
gggaataaccnn

>gi568815597r:206010192_206216890|GENSCAN_predicted_peptide_2|209_aa
MKLAKSTQREFVEEGEAPGLCSEDCRHVSSFSEIDITQFNITYFTGVLWALSNQFPLRRL
KICHQLRGSKVCGVGSPKFWQRYLALHILVISFSGPPVISANPCLSTAATALSGSIAVVS
LILLLVGLLSMTLKKWRQERLFKKQLRHQTNFPHKSSDLSCHADAIYSNVINLAPQKEDD
FAVYTNMPPFHHPRRTLPDQVEYVSIVFH

>gi568815597r:206010192_206216890|GENSCAN_predicted_CDS_2|630_bp
atgaaactggccaagagcacccagagagagtttgtagaagaaggagaagccccaggactg
tgctctgaagactgcaggcacgtgtcctctttttctgaaattgacattacccagttcaac
atcacctactttaccggagtactttgggctctttcaaatcaatttccgcttcgaaggcta
aagatttgtcatcagctaagaggttcaaaagtctgtggtgtaggctctccaaaattctgg
cagaggtacctagcactgcacatcctggttatctctttttctggacctcctgtcatatct
gctaacccttgcctaagcacagcagccacagccctttctggctccattgctgtggtgtcc
ctcatcttgctcctggtgggtctcttgtccatgaccctgaagaaatggaggcaagagaga
ctatttaagaaacaactgaggcatcagaccaactttccccacaagtcctcggatctttcc
tgccatgctgatgccatatattccaacgtgatcaacctggctccccagaaggaggacgac
tttgctgtctacaccaacatgcccccttttcatcaccccaggaggacattgccagaccaa
gtggaatatgtctccattgtattccactga

>gi568815597r:206010192_206216890|GENSCAN_predicted_peptide_3|266_aa
MVYTCIVSTPYYMQTLADDEKRLLPVLGRQRHKNTAVHAWNELWPATVFSDDAWRFDLLY
SLISSTLPVALLRDEVPGWFLKVPEPQLISKELIMLTEVMEVWHGLVIAVVSLFLQACFL
TAINYLLSRHMAHKSEQILKAASLQVPRPSPGHHHPPAVKEMKETQTERDIPMSDSLYRH
DSDTPSDSLDSSCSSPPACQATEDVDYTQVVFSDPGELKNDSPLDYENIKEITDYVNVNP
ERHKPSFWYFVNPALSEPAEYDQVAM

>gi568815597r:206010192_206216890|GENSCAN_predicted_CDS_3|801_bp
atggtgtatacatgcatcgtgtccactccctactacatgcaaacacttgccgatgatgag
aagaggttgttgccagtgcttggaagacaacgacacaaaaacacagctgtgcatgcctgg
aatgaactctggcctgcaactgtgttcagtgatgatgcctggcgttttgaccttctctat
tccctgatctccagcaccctgccggtggcactactgagagacgaggtgccagggtggttc
ctgaaagtgcctgagccccaacttatcagcaaggagctcatcatgctgacagaagtcatg
gaggtctggcatggcttagtgatcgcggtggtgtccctcttcctgcaggcctgcttcctc
accgccatcaactacctgctcagcaggcacatggcccacaagagtgaacagatactgaaa
gcggccagtctccaggttcccaggcccagccctggccaccatcatccacctgctgtcaaa
gagatgaaggagactcagacagagagagacatcccaatgtctgattccctttacaggcat
gacagcgacacaccctcagatagcttggatagctcctgcagttcgcctcctgcctgccag
gccacagaggatgtggattacacacaagtcgtcttttctgaccctggagaactaaaaaat
gactccccgctggactatgagaacataaaggaaatcacagattatgtcaatgtcaatcca
gaaagacacaagcccagtttctggtattttgtcaaccctgctctgtctgagccagcggaa
tatgatcaagtggccatgtga

>gi568815597r:206010192_206216890|GENSCAN_predicted_peptide_4|1203_aa
XPRKKSNHLNQYLQDRFSRGQKGPPTVVSSAPWSAGCGLGRRLRRAHADVVLFLVTSFSW
QQNVDGGGGKSAGKSLLGTGHIQENNVRREEEAPASVSHFPRLEEGATLREQDREQRRRG
SGEREAGSERRVAGRHWRRPADDYARCQRRLPYGGPRLPVCGGKGEAAGCRLRCGPAVDF
TGRCYFTKICKCKLKDIACLKCGNIVGYHVIVPCSSCLLSCNNGHFWMFHSQAVYDINRL
DSTVWQELYNDMLTLAFREYTRLLIVVKSDNNEPSLDQSLFPGEPASTCDKDNVRTGQTN
LGFLSWPLPVEIPEFLRRTRRWNHGCRHSTNFEQFLQERIKVNRKAVNLGGVVTIERSKS
KLTVTPEVPFSRSWIMVFRGLNLNRKSWRLAHQVWNLAIFQVFHDPPSMESVLLEMKHGV
MSMKPDSKKCNMALHFGNLAICVWLLPEPFIPTLHLANRLKIPHNAKGDCRWPCTWESRI
PGLQDLELDFPAWLTALEPPVHLTGNCKASPSLEQPAAQVHTEKRKSLIPQQPRQLVPLL
FRTLLIEIPDPPDKLNAASDEILAGILGVVHNQCEHLHLTGWYNNSCSPLLGQIADLLHL
GRITLASLLIDLRSRGWKTQHPKLQTGPSPSSSKKVAGWQMNLSLGLLSTHYVPGIMVRA
GDTEVPNRQLSLHAAYLCTLATWINELPCPLPPSVGTELPAGRGRANSRQPKCPRVGGRS
DCLSPSPGDAPVERRPESSRQTLLMDSGPLWDANPTPRGTLSAPNATTPWLGRDEELAKV
EIGVLATVLVLATGGNLAVLLTLGQLGRKRSRMHLFVLHLALTDLAVALFQVLPQLLWDI
TYRFQGPDLLCRAVKYLQVLSMFASTYMLLAMTLDRYLAVCHPLRSLQQPGQSTYLLIAA
PWLLAAIFSLPQVFIFSLREVIQGSGVLDCWADFGFPWGPRAYLTWTTLAIFVLPVTMLT
ACYSLICHEICKNLKVKTQAWRVGGGGWRTWDRPSPSTLAATTRGLPSRVSSINTISRAK
IRTVKMTFVIVLAYIACWAPFFSVQMWSVWDKNAPDEGKTQFPHLFLICGAWTLSQVPSS
CHSEASHASLLVDSTNVAFTISMLLGNLNSCCNPWIYMGFNSHLLPRPLRHLACCGGPQP
RMRRRLSDGSLSSRHTTLLTRSSCPATLSLSLSLTLSGRPRPEESPRDLELADGEGTAET
IIF

>gi568815597r:206010192_206216890|GENSCAN_predicted_CDS_4|3612_bp
natccacggaaaaaatccaaccacctaaatcaatacctccaagaccgcttctctcgaggg
cagaaggggccccccacggtggtctccagcgctccgtggagcgcgggctgcgggctgggc
agacggctacggcgggcacatgcagacgttgttttgtttttggttactagcttctcttgg
cagcaaaatgttgatgggggggggggcaaatctgcaggaaaaagcctgctcggcacggga
cacatccaggaaaacaatgtgcgccgggaggaggaagctcccgctagcgtctcccacttc
ccgcggctggaggaaggggccacgctgagggaacaagacagggagcagaggcggcggggt
tcaggcgagcgggaagcgggaagcgagaggcgggtggctgggagacactggaggaggccg
gccgatgattacgcgcgatgccaacgacgactgccctacggtggaccgcggctgcccgtg
tgcggaggaaagggagaggcggctgggtgccggctgcgctgcggtcccgcagtggacttc
actggaagatgctatttcaccaaaatctgcaaatgtaaactgaaggacatcgcatgttta
aaatgtgggaacattgtaggttatcatgtgattgttccatgtagttcctgtcttctttcc
tgcaacaacggacacttctggatgtttcacagccaggcagtttatgatattaacagacta
gactccacagtgtggcaagaattgtacaatgacatgctcacacttgccttcagggagtac
acaaggctgctcatcgttgtaaaaagtgataataatgagcccagccttgaccagagcctg
tttcctggcgaaccagcttccacatgtgacaaggacaacgtcagaacaggacagaccaac
cttgggttcctttcctggccgctgccagtagaaatcccagaatttcttcgaaggacaaga
agatggaatcatggatgccgccattctaccaattttgagcagtttttgcaagagaggatc
aaagtgaacagaaaagctgtgaatcttggaggagtggtgaccatcgaaaggagcaagagc
aagcttaccgtaacaccggaagtgcctttttccagaagttggatcatggtcttccgtggg
ttgaacctgaaccgcaagtcatggcgtctggcccatcaggtatggaacctggccatcttc
caggtgttccatgaccctccgagtatggaatctgtacttctagagatgaaacatggagtt
atgagcatgaaacctgattcaaagaaatgtaatatggctctccactttgggaacttggcc
atctgtgtctggttactgccagaaccatttattccgaccctccatctggccaacagatta
aaaatcccccacaatgcaaagggcgactgcagatggccgtgcacctgggagtcccgaatc
cctgggctgcaggacctggagttggacttccctgcttggctgacagccttggaacctcca
gttcatctaacagggaactgtaaagccagcccctcattagaacagcctgcagcccaagtc
cacactgaaaagaggaaatccctcattccccagcaaccaagacagttggtccccctacta
tttagaactcttttgatcgaaataccagatccaccggacaagttgaatgctgcatctgat
gaaattctcgcaggaattctgggagtagtgcacaaccagtgtgaacacctccacttgaca
ggctggtataacaatagctgtagccccttgctgggtcagattgctgatttgttgcatctt
ggaagaataacccttgcttcactactgatagacctccgcagcaggggctggaagacgcag
cacccaaaactccagacaggcccatctccctctagcagcaaaaaggtggctggctggcag
atgaatctatcccttgggctattgagtacccattatgtgccaggaattatggtgagagct
ggggatacagaggtgcctaatagacaactcagccttcatgctgcctacctgtgcacattg
gctacatggattaatgaactcccctgccctctgccaccatcagtgggcacagaattgccg
gctggcagagggcgcgccaacagccgccagccgaagtgcccccgcgtaggtgggaggagt
gactgcctctcaccatcccccggggatgcacctgtggagagacggcccgagagttccagg
caaaccttgctcatggattctgggcctctgtgggatgccaaccccacccctcggggcacc
ctctctgcccccaatgccacaacaccctggctgggccgggatgaggagctggccaaggtg
gagatcggagtcctggccactgtcctggtgctggcgaccgggggcaacctggctgtgctg
ctgaccctgggccagctgggccgcaagcgctcccgcatgcacctgttcgtgctgcactta
gccctgacagacctggccgtggcgctcttccaggtgctgccacagctgctgtgggacatc
acctaccgcttccagggccccgacctcctgtgcagggccgtcaagtacctgcaggtgctc
agcatgtttgcctccacctacatgctgctggccatgacgctggaccgctacctggctgtc
tgtcaccccctgcgcagcctccagcagccaggccagtccacctacctgctcatcgctgct
ccctggctgctggccgccatcttcagcctccctcaagtcttcattttttccctgcgggag
gtgatccagggctcaggggtgctggactgctgggcagacttcggcttcccttgggggcca
cgggcctacctcacctggaccaccctggctatcttcgttctgccggtgaccatgctcacg
gcctgctacagcctcatctgccatgagatctgtaaaaacctaaaagtcaagacacaggcc
tggcgggtgggaggagggggctggaggacttgggacaggccctcaccttccaccttagct
gccaccactcgggggctgccatctcgggtcagcagcatcaacaccatctcacgggccaag
atccgaacagtgaagatgacctttgtcatcgtgctggcctacatcgcttgctgggctccc
ttcttcagtgtccagatgtggtccgtgtgggacaagaatgcccctgatgaaggcaagact
cagtttcctcatctgttcctcatctgtggggcctggacgctctcccaggtcccttcgagc
tgccattctgaggcttctcatgcttctctcttggtagattccaccaatgtggctttcacc
atctctatgcttttgggcaacctcaacagctgctgcaacccctggatctacatgggcttc
aacagccacctgttaccgcggcccctgcgtcaccttgcctgctgtgggggtccccagccc
aggatgcgccggcggctctccgacggcagcctctcgagccgccacaccacgctgctgacc
cgctccagctgcccggccaccctcagcctcagcctcagcctaaccctcagtgggaggccc
aggcctgaagagtcaccaagggacttggagctggcagatggggaaggcaccgctgagacc
atcatcttttag