GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:21:08 Sequence gi568815575f:23683352_23885853 : 202502 bp : 44.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 1479 1474 6 1.05 1.10 Term - 8485 8345 141 1 0 76 54 34 0.335 -3.27 1.09 Intr - 8930 8682 249 2 0 33 119 91 0.052 4.33 1.08 Intr - 14172 14135 38 1 2 79 79 17 0.036 -2.12 1.07 Intr - 21493 21343 151 1 1 60 107 79 0.783 6.74 1.06 Intr - 21729 21642 88 2 1 86 96 26 0.763 3.17 1.05 Intr - 23388 23277 112 1 1 96 78 77 0.918 6.94 1.04 Intr - 39402 39319 84 1 0 87 111 59 0.842 7.89 1.03 Intr - 47213 47176 38 1 2 101 69 23 0.823 -0.39 1.02 Intr - 47635 47465 171 0 0 99 87 68 0.590 6.86 1.01 Init - 52561 52503 59 1 2 95 71 42 0.630 3.98 1.00 Prom - 69875 69836 40 -5.86 2.00 Prom + 74749 74788 40 -5.06 2.01 Init + 82134 82254 121 2 1 53 55 50 0.786 -1.45 2.02 Intr + 82584 82713 130 1 1 65 84 67 0.903 3.75 2.03 Intr + 83129 83291 163 2 1 82 100 83 0.844 8.88 2.04 Intr + 99962 100066 105 1 0 30 69 93 0.042 2.01 2.05 Intr + 100307 100358 52 1 1 102 86 14 0.946 1.08 2.06 Intr + 100449 100532 84 1 0 108 86 60 0.965 7.59 2.07 Intr + 101977 102078 102 2 0 74 80 115 0.966 9.45 2.08 Intr + 102169 102209 41 2 2 121 91 15 0.999 3.04 2.09 Term + 102335 102505 171 1 0 94 55 125 0.988 7.63 2.10 PlyA + 102545 102550 6 1.05 3.00 Prom + 112502 112541 40 -1.86 3.01 Init + 131718 131816 99 2 0 85 82 93 0.706 8.66 3.02 Term + 134033 134059 27 1 0 102 43 4 0.315 -4.63 3.03 PlyA + 134087 134092 6 1.05 4.02 PlyA - 134977 134972 6 1.05 4.01 Sngl - 153969 153391 579 0 0 75 44 523 0.999 42.78 4.00 Prom - 156455 156416 40 -4.26 5.00 Prom + 161855 161894 40 -5.26 5.01 Init + 163734 163803 70 2 1 66 45 64 0.582 1.11 5.02 Term + 165913 166097 185 2 2 62 39 137 0.960 3.91 5.03 PlyA + 166787 166792 6 1.05 6.07 PlyA - 168389 168384 6 1.05 6.06 Term - 174253 174126 128 2 2 67 42 96 0.037 1.34 6.05 Intr - 185337 185242 96 1 0 53 92 45 0.036 1.38 6.04 Intr - 191106 191052 55 0 1 94 97 -20 0.031 -1.95 6.03 Intr - 195683 195564 120 2 0 56 87 30 0.171 0.39 6.02 Intr - 197601 197494 108 0 0 56 77 101 0.676 6.28 6.01 Intr - 201486 201439 48 0 0 121 38 35 0.283 0.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100066 66 1 0 81 69 76 0.885 6.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_1|376_aa MVSLFDAYVVKNLVLLLFERDHVKAMEERKLLHSFLAKSQDGLPPRRMKDSYIEVLLPLG SEPELREKYLTVQNTVRFGRILEDLDSLGVLICYMHNKIHSAKMSPLSIVTALVDKIVNK GRRIAFSSTSLLKMAPSAEERTTIHEMFLSTLDPNGSRPFVVAVDDIMFQKPVEVGSLLF LSSQVCFTQNNYIQVRVHSEVASLQEKQHTTTNVFHFTFMSEKEVPLVFPKTYGAIFLEG SVVSLKVSCLSNEVSIKGPRGPGTQNFWTAELVKACRKVNKNSPMCWDDGAPQLHGNRSS RAQDPSRPRPMCLFIWVFIRILQNILHKKLVDSIRIELDQKIPGWCPLQNCLPVGGENPY TLHIWSQKSSVLIVVM >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_1|1131_bp atggtgtctctgtttgatgcctatgtggtgaaaaaccttgttttacttttatttgaaaga gaccatgtgaaggcaatggaagaaaggaaattacttcatagtttcttggctaaatcacag gatggactgcctcctaggagaatgaaggacagttatattgaagttctcttgcctttgggc agtgagcctgaattacgagagaaatatttgactgttcaaaacaccgtaagatttggcagg attcttgaggatcttgacagcttgggagttcttatttgttacatgcacaacaaaatccac tccgccaagatgtctcctttatcgatagttacagccctggtggataagattgtgaacaag gggagaagaattgccttcagctccacgtcgttactgaaaatggcccccagcgctgaggag aggaccaccatacatgagatgtttctcagcacactggatccaaatggttctcgaccgttt gtggtagcagtagatgacatcatgtttcagaaacctgttgaggttggctcattgctcttt ctttcttcacaggtatgctttactcagaataattatattcaagtcagagtacacagtgaa gtggcctccctgcaggagaagcagcatacaaccaccaatgtctttcatttcacgttcatg tcggaaaaagaagtgccattggttttcccaaaaacatatggagccattttcttggaggga tcagtggtgtcactgaaggtgtcatgcctatccaatgaagtctccataaaaggcccaaga ggaccaggtacgcagaacttctggacagctgagctggtgaaggcttgcaggaaggtgaac aagaattcacccatgtgttgggatgatggtgcaccccaactccacgggaatagaagctcc cgtgctcaggacccttccagacctcgccctatgtgcctcttcatctgggtgtttattcgt atccttcaaaatatccttcataagaaactggtggacagcatcagaattgaattggatcag aagattcctggctggtgtccactgcagaattgcttgcctgttggtggggaaaatccctac accctacacatttggtcacagaagtcttctgtattgattgttgtcatgtga >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_2|322_aa MATPQVEQPVDHFQSIGTALTSERNDGIAVPQLYQLDSPTTRKLCDQWCLCLSFDQARQT CSTDVAWQAVLSSCYQPGFHACRGPAIQWVLSSRPTSRKNEVRGQLEGEQGGEEPYRVTE QLSGNLKWVAPFRRQVVPGPGPQREEKQKTKMAKFVIRPATAADCSDILRLIKELAKYEY MEEQVILTEKDLLEDGFGEHPFYHCLVAEVPKEHWTPEGHSIVGFAMYYFTYDPWIGKLL YLEDFFVMSDYRGFGIGSEILKNLSQVAMRCRCSSMHFLVAEWNEPSINFYKRRGASDLS SEEGWRLFKIDKEYLLKMATEE >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_2|969_bp atggctactccacaggtagagcagccagtagaccattttcaaagcattgggacagctctc acctcagaaagaaatgatggtattgcggtgcctcagctctaccaactagattcaccaacc actagaaagctctgtgaccagtggtgcctttgcctgagttttgatcaggcccgccagact tgttccactgacgtggcctggcaggctgtgctcagctcatgctaccagccaggattccac gcctgccgaggtcccgccattcagtgggtcctgagttctcgtcccacatccaggaagaat gaggtacgcggacaactagagggtgagcaaggcggagaggagccttatcgagtgacagaa cagctctcgggaaacttgaagtgggtagctcctttccgcaggcaggttgtcccagggcct ggtccgcaaagggaagaaaagcaaaagacgaaaatggctaaattcgtgatccgcccagcc actgccgccgactgcagtgacatactgcggctgatcaaggagctggctaaatatgaatac atggaagaacaagtaatcttaactgaaaaagatctgctagaagatggttttggagagcac cccttttaccactgcctggttgcagaagtgccgaaagagcactggactccggaaggacac agcattgttggttttgccatgtactattttacctatgacccgtggattggcaagttattg tatcttgaggacttcttcgtgatgagtgattatagaggctttggcataggatcagaaatt ctgaagaatctaagccaggttgcaatgaggtgtcgctgcagcagcatgcacttcttggta gcagaatggaatgaaccatccatcaacttctataaaagaagaggtgcttctgatctgtcc agtgaagagggttggagactgttcaagatcgacaaggagtacttgctaaaaatggcaaca gaggagtga >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_3|41_aa MTRDNRVEGYDLTSEVMQYHFGCIQVIVGEVTKVNLSTVAK >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_3|126_bp atgacgagagacaaccgggtggaaggttatgacctaacctcagaagtcatgcagtatcac tttggctgcatccaggtcattgtgggggaggtcactaaggtgaatctgtcaactgtggca aagtag >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_4|192_aa MKTILSNQTVDIPENVDITLKGRTVIVKGPRGTLRRDFNHINVELSLLGKKKKRLRVDKW WGNRKELATVRTICSHVQNMIKGVTLGFRYKMRSVYAHFPINVVIQENGSLVEIRNFLGE KYIRRVRMRPGVACSVSQAQKDELILEGNDIELVSNSAALIQQATTVKNKDIRKFLDGIY VSEKGTVQQADE >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_4|579_bp atgaagactattctcagcaatcagactgtcgacattccagaaaatgtcgacattactctg aagggacgcacagttatcgtgaagggccccagaggaaccctgcggagggacttcaatcac atcaatgtagaactcagccttcttggaaagaaaaaaaagaggctccgggttgacaaatgg tggggtaacagaaaggaactggctaccgttcggactatttgtagtcatgtacagaacatg atcaagggtgttacactgggcttccgttacaagatgaggtctgtgtatgctcacttcccc atcaacgttgttatccaggagaatgggtctcttgttgaaatccgaaattttttgggtgaa aaatacatccgcagggttcggatgagaccaggtgttgcttgttcagtatctcaagcccag aaagatgaattaatccttgaaggaaatgacattgagcttgtttcaaattcagcggctttg attcagcaagccacaacagttaaaaacaaggatatcaggaaatttttggatggtatctat gtctctgaaaaaggaactgttcagcaggctgatgaataa >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_5|84_aa MLNQGNVNENHDGKFWEHLHMLDNSMGLGIEHLQLLKPIKTKQKPDLCAYVDLEHCQKDQ TGSGSSSPCIQLPMCREYREHVKL >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_5|255_bp atgctcaatcagggcaatgtaaatgaaaaccacgatgggaaattctgggaacatcttcac atgctagataattcaatgggtctaggcattgagcatctacagctgctaaaaccaataaaa acaaagcaaaaaccagacttatgtgcctacgtggatctagagcattgccaaaaggatcaa acaggatctggttcaagtagcccgtgcatccagctgccaatgtgcagggagtacagagaa cacgttaaactatag >gi568815575f:23683352_23885853|GENSCAN_predicted_peptide_6|184_aa ECCVTTQLEKPIGNPKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPE GQSKYVEEARSQLEESISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAP PGFFPRLGVIGFAGLIGLLLARVLNPDSLIGLQLYVRVQWPQLGELDGFDDKYQYFPAFA QHCL >gi568815575f:23683352_23885853|GENSCAN_predicted_CDS_6|555_bp gaatgctgtgtcaccacccaactagaaaaacccatcggaaaccccaaggtaattcagagg tccgtggggccagccagcctgagcttgctcaccttcaaagtctatgcagcaccaaaaaag gactcacctcccaaaaattccgtgaaggttgatgagctttcactctactcagttcctgag ggtcaatcgaagtatgtggaggaggcaaggagccagcttgaagaaagcatctcacagctc cgacactattgcgagccatacacaacctggtgtcaggaaacgtactcccaaactaagccc aagatgcaaagtttggttcaatgggggttagacagctatgactatctccaaaatgcacct cctggattttttccgagacttggtgttattggttttgctggccttattggactccttttg gctagagtattgaatccagattccttgattgggctacagttgtatgtgagagtccagtgg ccccagcttggggagcttgatggctttgatgacaaataccagtactttccagcatttgct caacattgtttgtaa