GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:21:34 Sequence gi568815597f:149751114_149886974 : 135861 bp : 44.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6287 6326 40 -2.56 1.01 Init + 14589 14690 102 2 0 74 43 72 0.406 1.54 1.02 Term + 24977 25144 168 1 0 87 38 127 0.813 5.48 1.03 PlyA + 26082 26087 6 -0.45 2.00 Prom + 26391 26430 40 -3.66 2.01 Init + 31631 31661 31 1 1 80 98 21 0.464 1.01 2.02 Intr + 37253 37504 252 0 0 60 67 183 0.708 10.81 2.03 Intr + 38941 39225 285 2 0 69 97 129 0.930 9.21 2.04 Term + 40175 40404 230 0 2 86 43 153 0.593 7.39 2.05 PlyA + 40537 40542 6 1.05 3.04 PlyA - 40689 40684 6 1.05 3.03 Term - 42350 42219 132 2 0 82 49 59 0.002 -0.51 3.02 Intr - 54009 53964 46 2 1 112 43 43 0.095 0.61 3.01 Init - 61210 60834 377 1 2 90 63 951 0.380 89.21 3.00 Prom - 61339 61300 40 -10.84 4.02 PlyA - 61606 61601 6 1.05 4.01 Sngl - 62568 62158 411 0 0 97 48 999 0.998 93.09 4.00 Prom - 67740 67701 40 -4.76 5.00 Prom + 69263 69302 40 -5.56 5.01 Init + 72426 72497 72 2 0 24 105 53 0.439 1.67 5.02 Term + 81521 81883 363 1 0 45 47 680 0.940 54.17 5.03 PlyA + 82598 82603 6 1.05 6.03 PlyA - 83100 83095 6 1.05 6.02 Term - 90186 89634 553 2 1 -5 48 1085 0.930 88.99 6.01 Init - 91585 91197 389 1 2 62 61 701 0.090 61.08 6.00 Prom - 91673 91634 40 -5.66 7.00 Prom + 91837 91876 40 -16.58 7.01 Init + 91926 92301 376 2 1 78 89 675 0.962 63.70 7.02 Term + 95168 95244 77 0 2 47 29 64 0.121 -5.60 7.03 PlyA + 95346 95351 6 1.05 8.03 PlyA - 95420 95415 6 1.05 8.02 Term - 99147 99113 35 1 2 66 32 44 0.520 -5.65 8.01 Init - 99660 99285 376 0 1 78 89 675 0.970 63.70 8.00 Prom - 99749 99710 40 -16.58 9.00 Prom + 99913 99952 40 -5.66 9.01 Init + 100001 100389 389 1 2 62 61 701 0.090 61.08 9.02 Term + 101400 101952 553 0 1 -5 48 1085 0.930 88.99 9.03 PlyA + 103138 103143 6 1.05 10.06 PlyA - 104855 104850 6 1.05 10.05 Term - 110069 109707 363 2 0 45 47 680 0.924 54.17 10.04 Intr - 114325 114253 73 0 1 66 66 31 0.243 -1.89 10.03 Intr - 119554 119433 122 1 2 120 68 50 0.309 5.49 10.02 Intr - 127817 127652 166 1 1 20 21 119 0.089 -1.64 10.01 Init - 135527 135151 377 2 2 69 89 997 0.482 94.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 90328 90273 56 1 2 62 119 73 0.841 8.56 S.002 Sngl - 91585 91193 393 1 0 62 54 715 0.909 61.74 S.003 Sngl + 100001 100393 393 1 0 62 54 715 0.909 61.74 S.004 Init + 101258 101313 56 1 2 62 119 73 0.841 8.56 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_1|89_aa MTEGDANTSLFTWQQQGEMQSKSGEKSLIKPSDLDSNCNAKDFCLSQTKELVWRLPVAME TRTKRKKAVGFIEQSDSTKLFSVEGVSSG >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_1|270_bp atgacagaaggagatgcaaatacatccctcttcacatggcagcagcaaggagaaatgcag agcaaaagcggggaaaagtcccttataaaaccatcagatctcgactccaattgtaacgct aaagatttttgccttagccagaccaaagaattggtgtggcggctgcccgtggcgatggaa acacggaccaagagaaaaaaggctgtaggctttattgagcagagtgacagtacaaagctt ttcagcgtggaaggggtttcgagcgggtag >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_2|265_aa MWFLTTLLLWGWLLLQVSSRVFTEGEPLALRCHAWKDKLVYNVLYYRNGKAFKFFHWNSN LTILKTNISHNGTYHCSGMGKHRYTSAGISVTVKELFPAPVLNASVTSPLLEGNLVTLSC ETKLLLQRPGLQLYFSFYMGSKTLRGRNTSSEYQILTARREDSGLYWCEAATEDGNVLKR SPELELQVLVGIMFLVNTVLWVTIRKELKRKKKWDLEISLDSGHEKKVISSLQEDRHLEE ELKCQEQKEEQLQEGVHRKEPQGAT >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_2|798_bp atgtggttcttgacaactctgctcctttggggctggctactactgcaggtctccagcaga gtcttcacggaaggagaacctctggccttgaggtgtcatgcgtggaaggataagctggtg tacaatgtgctttactatcgaaatggcaaagcctttaagtttttccactggaattctaac ctcaccattctgaaaaccaacataagtcacaatggcacctaccattgctcaggcatggga aagcatcgctacacatcagcaggaatatctgtcactgtgaaagagctatttccagctcca gtgctgaatgcatctgtgacatccccactcctggaggggaatctggtcaccctgagctgt gaaacaaagttgctcttgcagaggcctggtttgcagctttacttctccttctacatgggc agcaagaccctgcgaggcaggaacacatcctctgaataccaaatactaactgctagaaga gaagactctgggttatactggtgcgaggctgccacagaggatggaaatgtccttaagcgc agccctgagttggagcttcaagtgcttgtgggaataatgtttttagtgaacactgttctc tgggtgacaatacgtaaagaactgaaaagaaagaaaaagtgggatttagaaatctctttg gattctggtcatgagaagaaggtaatttccagccttcaagaagacagacatttagaagaa gagctgaaatgtcaggaacaaaaagaagaacagctgcaggaaggggtgcaccggaaggag ccccagggggccacgtag >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_3|184_aa MPDPAKSAPAPKKGSKKAVTKVQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSSKKNRKENYLQMDTEEVDGPWALVGEARRFPSPCGVAPAEPGAFLPAGLLIPPSQL PSCF >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_3|555_bp atgccggatccagcgaaatccgctcctgctcccaagaagggctccaaaaaggctgttacg aaagtgcagaagaaggacggcaagaagcgcaagcgcagccgcaaggagagctactccgtt tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcgtccaaggccatg ggcatcatgaactccttcgtcaacgacatcttcgagcgcatcgcgggagaggcgtcccgc ctggcgcactacaacaagcgctccaccatcacatcccgcgagatccagacggccgtgcgc ctgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacc aagtacaccagctcgaaaaaaaacagaaaagagaactatctccagatggatacagaagag gtggatgggccgtgggcccttgtgggtgaagctcggcggttcccgagtccatgtggggtg gcccctgcggagcctggagccttcttgcccgctggcttgctgatcccgccgagccagctc ccttcctgcttttga >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_4|136_aa MARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTE LLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLARRIRGERA >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_4|411_bp atggcccgtactaagcagactgcccgcaagtcgaccggcggcaaggccccgaggaagcag ctggctaccaaagcggcccgcaagagcgcgccggccacgggcggggtgaagaagccgcac cgctaccggcccggcaccgtggctctgcgggagatccggcgctaccagaagtctacggag ctgctgatccgcaagctgcccttccagcggctggtacgcgagatcgcgcaggactttaag acggacctgcgcttccagagctcggccgtgatggcgctgcaggaggccagcgaggcctac ctggtggggctgttcgaagacacgaacctgtgcgccatccatgccaagcgcgtgaccatc atgcccaaggacatccagttggcccgccgcatccgcggggagcgggcctaa >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_5|144_aa MFPKVRVYPASQNVNAFGNWGFEDGKTVLALTEAVYRAPAVMSGRGKGGKGLGKGGAKRH RKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKR KTVTAMDVVYALKRQGRTLYGFGG >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_5|435_bp atgttccctaaagttcgtgtctatccagcatctcaaaatgtgaacgcatttggaaactgg ggctttgaagacgggaagacggtgctcgccttgacagaagctgtctatcgggctccagcg gtcatgtccggcagaggaaagggcggaaaaggcttaggcaaagggggcgctaagcgccac cgcaaggtcttgagagacaacattcagggcatcaccaagcctgccattcggcgtctagct cggcgtggcggcgttaagcggatctctggcctcatttacgaggagacccgcggtgtgctg aaggtgttcctggagaatgtgattcgggacgcagtcacctacaccgagcacgccaagcgc aagaccgtcacagccatggatgtggtgtacgcgctcaagcgccaggggcgcaccctgtac ggcttcggaggctag >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_6|313_aa MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYMAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGNQWTASAGFSIIVPPNREKTVLIKTAAAGLGARFSPRRCAGKPVFWFAMAR TKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTELLI RKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTIMPK DIQLARRIRGERA >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_6|942_bp atgtctggtcgtggcaagcaaggaggcaaggcccgcgccaaggccaagtcgcgctcgtcc cgcgctggccttcagttcccggtagggcgagtgcatcgcttgctgcgcaaaggcaactac gcggagcgagtgggggccggcgcgcccgtctacatggctgcggtcctcgagtatctgacc gccgagatcctggagctggcgggcaacgcggctcgggacaacaagaagacgcgcatcatc cctcgtcacctccagctggccatccgcaacgacgaggaactgaacaagctgctgggcaaa gtcaccatcgcccagggcggcgtcttgcctaacatccaggccgtactgctccctaagaag acggagagtcaccacaaggcaaagggcaaccaatggacagccagcgcgggattttcaatt attgttccgcccaatcgggaaaagactgtgcttataaagacggctgcggcggggctagga gctcgtttttctccccgccgctgcgctggtaagcctgtgttttggttcgctatggcccgt actaagcagactgctcgcaagtcgaccggcggcaaggccccgaggaagcagctggccacc aaggcggcccgcaagagcgcgccggccacgggcggggtgaagaagccgcaccgctaccgg cccggcaccgtagccctgcgggagatccggcgctaccagaagtccacggagctgctgatc cgcaagctgcccttccagcggctggtacgcgagatcgcgcaggactttaagacggacctg cgcttccagagctcggccgtgatggcgctgcaggaggccagcgaggcctacctggtgggg ctgttcgaagacacgaacctgtgcgccatccacgccaagcgcgtgaccattatgcccaag gacatccagctggcccgccgcatccgtggagagcgggcttaa >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_7|150_aa MPEPAKFAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKRVHPDTGIWCKAM GIMNSFLNDIFERIAGEASRLAHYNKRSTITSRRSRRPCACCCPASWPSTPCPRAPRRSP STPAPNLCNHKSHHYRYIVRIMTNGYHSEF >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_7|453_bp atgcctgagcctgcaaagttcgcgccggctcccaagaagggctccaagaaagccgtcacc aaagcccagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctactccatc tacgtgtacaaggtgctgaagcgggtccaccccgacaccggcatctggtgcaaggccatg ggcatcatgaactccttcctcaacgacatcttcgagcgcatcgcgggagaggcgtcccgc ctggcgcactacaacaagcgctccaccatcacgtcccggagatccagacggccgtgcgcc tgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacca agtacgccagctccaaacttgtgcaatcacaaaagtcaccattaccgctacatagtacgt attatgaccaatggatatcattcagagttctag >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_8|136_aa MPEPAKFAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKRVHPDTGIWCKAM GIMNSFLNDIFERIAGEASRLAHYNKRSTITSRRSRRPCACCCPASWPSTPCPRAPRRSP STPAPIPVDVFGFTSG >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_8|411_bp atgcctgagcctgcaaagttcgcgccggctcccaagaagggctccaagaaagccgtcacc aaagcccagaagaaagacggcaagaagcgcaagcgcagccgcaaggagagctactccatc tacgtgtacaaggtgctgaagcgggtccaccccgacaccggcatctggtgcaaggccatg ggcatcatgaactccttcctcaacgacatcttcgagcgcatcgcgggagaggcgtcccgc ctggcgcactacaacaagcgctccaccatcacgtcccggagatccagacggccgtgcgcc tgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacca agtacgccagctccaatcccagttgatgtttttggcttcacgtctggttaa >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_9|313_aa MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYMAAVLEYLT AEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPKK TESHHKAKGNQWTASAGFSIIVPPNREKTVLIKTAAAGLGARFSPRRCAGKPVFWFAMAR TKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTELLI RKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVGLFEDTNLCAIHAKRVTIMPK DIQLARRIRGERA >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_9|942_bp atgtctggtcgtggcaagcaaggaggcaaggcccgcgccaaggccaagtcgcgctcgtcc cgcgctggccttcagttcccggtagggcgagtgcatcgcttgctgcgcaaaggcaactac gcggagcgagtgggggccggcgcgcccgtctacatggctgcggtcctcgagtatctgacc gccgagatcctggagctggcgggcaacgcggctcgggacaacaagaagacgcgcatcatc cctcgtcacctccagctggccatccgcaacgacgaggaactgaacaagctgctgggcaaa gtcaccatcgcccagggcggcgtcttgcctaacatccaggccgtactgctccctaagaag acggagagtcaccacaaggcaaagggcaaccaatggacagccagcgcgggattttcaatt attgttccgcccaatcgggaaaagactgtgcttataaagacggctgcggcggggctagga gctcgtttttctccccgccgctgcgctggtaagcctgtgttttggttcgctatggcccgt actaagcagactgctcgcaagtcgaccggcggcaaggccccgaggaagcagctggccacc aaggcggcccgcaagagcgcgccggccacgggcggggtgaagaagccgcaccgctaccgg cccggcaccgtagccctgcgggagatccggcgctaccagaagtccacggagctgctgatc cgcaagctgcccttccagcggctggtacgcgagatcgcgcaggactttaagacggacctg cgcttccagagctcggccgtgatggcgctgcaggaggccagcgaggcctacctggtgggg ctgttcgaagacacgaacctgtgcgccatccacgccaagcgcgtgaccattatgcccaag gacatccagctggcccgccgcatccgtggagagcgggcttaa >gi568815597f:149751114_149886974|GENSCAN_predicted_peptide_10|366_aa MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSIYVYKVLKQVHPDTGISSKAM GIMNSFVNDIFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVT KYTSSNQLMKDLNYICKVFLINSKYRFCPYSKGGDLNNGYQGPGLTGAIAEFCLLQEYRG TLFQASKCRPQSASCPTIQAFHYYRKHLAKDHHHLPVLPLKLTDLAEDIINSDQRTLSLS NNTTFMGKTVLALTEAVYRAPAVMSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRR LARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRT LYGFGG >gi568815597f:149751114_149886974|GENSCAN_predicted_CDS_10|1101_bp atgcctgaaccggcaaaatccgctccggcccctaaaaagggctccaagaaagccgtcacc aaagcccagaagaaagacggcaagaagcgcaagcgcagccgcaaagagagctactccatc tacgtgtacaaggtgctgaagcaggtccaccccgacaccggcatctcgtccaaggccatg ggcatcatgaactccttcgtcaacgacatcttcgagcgcatcgcgggagaggcttcccgc ctggcgcactacaacaagcgctccaccatcacatcccgcgagatccagacggccgtgcgc ctgctgctgcccggcgagctggccaagcacgccgtgtccgagggcaccaaggcggtcacc aagtacaccagctccaatcaactgatgaaggatcttaactacatctgcaaagttttcttg attaacagcaagtacaggttctgcccatactcaaagggaggggatttaaacaacgggtac caggggccaggactcacgggagccatcgcagaattttgcctactacaggaatatagagga actcttttccaggcaagtaaatgcaggccacagtctgccagctgtccaaccattcaggct tttcattattataggaagcatctggccaaggaccaccaccatcttcctgtcctgcccctg aagctcactgacttagctgaggacatcatcaattctgatcaaaggaccctaagtctaagt aacaataccactttcatggggaagacggtgctcgccttgacagaagctgtctatcgggct ccagcggtcatgtccggcagaggaaagggcggaaaaggcttaggcaaagggggcgctaag cgccaccgcaaggtcttgagagacaacattcagggcatcaccaagcctgccattcggcgt ctagctcggcgtggcggcgttaagcggatctctggcctcatttacgaggagacccgcggt gtgctgaaggtgttcctggagaatgtgattcgggacgcagtcacctacaccgagcacgcc aagcgcaagaccgtcacagccatggatgtggtgtacgcgctcaagcgccaggggcgcacc ctgtacggcttcggaggctag