GENSCAN 1.0    Date run: 6-Nov-116    Time: 12:39:49

Sequence gi568815597r:227821521_228025520 : 204000 bp : 50.26% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  23950  24092  143  2  2   93   53   240 0.352  21.07
 1.02 Intr +  24434  24675  242  1  2  118   65    77 0.070   4.75
 1.03 Term +  46866  46977  112  0  1   94   44    82 0.121   2.43
 1.04 PlyA +  47882  47887    6                               1.05

 2.00 Prom +  49571  49610   40                              -5.36
 2.01 Init +  51674  51825  152  1  2   78   98   102 0.540   9.71
 2.02 Intr +  53954  54276  323  2  2   25   40   146 0.279  -1.00
 2.03 Intr +  61595  61665   71  0  2  105   76    28 0.093   2.20
 2.04 Intr +  62379  62489  111  2  0   83   68    22 0.040   0.28
 2.05 Term +  65078  65227  150  1  0  102   48    36 0.037  -1.09
 2.06 PlyA +  66445  66450    6                               1.05

 3.00 Prom +  70026  70065   40                              -5.46
 3.01 Init +  74315  74411   97  1  1   91   92    61 0.695   5.28
 3.02 Intr +  74654  74726   73  0  1   86   76    74 0.658   4.46
 3.03 Term +  87362  87521  160  2  1   77   39    91 0.124   0.51
 3.04 PlyA +  87884  87889    6                               1.05

 4.11 PlyA -  89068  89063    6                               1.05
 4.10 Term -  92090  92053   38  0  2  105   49    56 0.145   0.90
 4.09 Intr -  93583  93469  115  1  1  103   40    86 0.148   5.32
 4.08 Intr -  94691  94527  165  2  0   85   60    38 0.097   0.86
 4.07 Intr - 100480 100070  411  1  0   89   96   463 0.049  41.58
 4.06 Intr - 102880 102609  272  2  2  102   23   412 0.573  33.36
 4.05 Intr - 103546 103445  102  2  0   96   37    60 0.721   1.85
 4.04 Intr - 103999 103743  257  0  2   81  105   571 0.998  55.19
 4.03 Intr - 115699 115603   97  2  1   84  106    48 0.022   5.27
 4.02 Intr - 122723 122563  161  1  2   45   76   101 0.041   4.23
 4.01 Init - 126367 126273   95  1  2   87   77   269 0.873  23.55
 4.00 Prom - 133975 133936   40                              -3.56

 5.00 Prom + 134501 134540   40                              -7.86
 5.01 Init + 136862 136916   55  1  1   51   94    66 0.340   4.95
 5.02 Intr + 145870 145927   58  2  1   88  101    49 0.911   4.14
 5.03 Intr + 147519 147633  115  0  1   42   98    40 0.562   0.85
 5.04 Intr + 149703 149807  105  2  0  106   71    36 0.858   4.11
 5.05 Intr + 151977 152081  105  2  0  105   71    -4 0.262   0.01
 5.06 Intr + 152140 152365  226  0  1   93   56    85 0.528   3.36
 5.07 Term + 152924 152976   53  0  2   68   43    66 0.806  -2.31
 5.08 PlyA + 153655 153660    6                              -0.45

 6.06 PlyA - 153693 153688    6                              -4.33
 6.05 Term - 154615 154482  134  2  2   93   47   114 0.971   6.05
 6.04 Intr - 155187 155029  159  1  0   78   13   115 0.802   2.96
 6.03 Intr - 155433 155310  124  0  1   65   14   126 0.871   3.06
 6.02 Intr - 156164 155907  258  2  0  -10   72   407 0.429  26.96
 6.01 Init - 156594 156415  180  0  0   66   75   129 0.864   8.58
 6.00 Prom - 171508 171469   40                              -3.66

 7.06 PlyA - 174057 174052    6                               1.05
 7.05 Term - 179128 179058   71  2  2   86   45    72 0.813   0.80
 7.04 Intr - 180687 180553  135  1  0  105   71    58 0.573   6.34
 7.03 Intr - 181451 181388   64  2  1   92   74    50 0.852   2.29
 7.02 Intr - 182767 182715   53  2  2  113   90    36 0.783   4.93
 7.01 Init - 183549 183519   31  0  1   79   99    26 0.445   1.75
 7.00 Prom - 189337 189298   40                              -1.56

 8.00 Prom + 195073 195112   40                              -7.26
 8.01 Init + 197162 197238   77  1  2  114   55     6 0.331   0.46
 8.02 Intr + 201147 201388  242  0  2  136   86   493 0.768  51.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  24434  24688  255  1  0  118   51    82 0.834   3.09
S.002 Term - 100480  99998  483  1  0   89   44   545 0.908  44.95

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_1|165_aa
XETSDELQEMQLPLILEPWCHLLYGHMSYIMPDMLCAGDILNAKTVCEGDSGGPLVCEFN
RSWLQIGIVSWGRGCSNPLYPGVYASVSYFSKWICDNIEITPTPAQPAPALSPALGPTLS
VLMAMLAGWWNERLGKEKDTETKYRERNKGTQGTSVQHMEDPASL
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_1|498_bp
ngtgagacctcagacgagctgcaggagatgcagctcccgctgatcctggagccctggtgc
cacctgctctacggacacatgtcctacatcatgcccgacatgctgtgtgctggggacatc
ctgaatgctaagaccgtgtgtgagggcgactccgggggcccacttgtctgtgaattcaac
cgcagctggttgcagattggaattgtgagctggggccgaggctgctccaaccctctgtac
cctggagtgtatgccagtgtttcctatttctcaaaatggatatgtgataacatagaaatc
acgcccactcctgctcagccagcccctgctctctctccagctctggggcccactctcagc
gtcctaatggccatgctggctggctggtggaacgagaggcttggaaaagaaaaagacaca
gagacaaagtatagagaaagaaataagggcacccagggaaccagcgttcagcatatggag
gatcccgccagcctctga
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_2|268_aa
MKMRKNQHKKAENSKNQNASSPPKNHNSPPAREQNWTENEFDKLTEVGFRRIILITKPGR
DTTKKEIFRPISLMNIYVKILNKILASRIQQHIKKLIHHDQGGFIPGMQGWFNIHKSINI
IHHINKTNDKNQMIISIDAEKAFDEIQHPFMIKTLNKLACIHAPKTQGHGGLDALLARWL
QQLPVCPSLALAPTDPRTPIIWSVRLALLPGTSAHPHLQAYGAPLLSSRASQASQLLSGA
LASLQVSHPIKEITAELSRTPGNSWVGR
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_2|807_bp
atgaagatgaggaaaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctct
tctcctccaaagaatcataactccccaccagcaagggaacaaaactggacggagaatgag
tttgacaaattgacagaagtaggcttcagaagaatcatcctgataacaaaacctggcaga
gacacaacaaaaaaagaaattttcaggccaatatccctgatgaacatctatgtgaaaatc
ctcaataaaatactggcaagccgaatccagcagcacatcaaaaagcttatccaccacgat
caaggcggcttcatccctgggatgcaaggctggttcaacatacacaaatcaataaacata
atccatcacataaacaaaacgaatgacaaaaaccaaatgattatttcaatagatgcagaa
aaggcctttgatgaaattcaacaccccttcatgataaaaactctcaataaactagcatgc
atccatgcaccgaaaacacaaggccatgggggcttggatgcacttctggccaggtggctg
cagcagctccctgtgtgcccatccctagctctggccccgacggacccccggacccccatc
atctggagtgtccgcctggccctacttccagggacctctgcccacccccacctgcaggcc
tatggtgctcccctcctgagcagcagagccagccaagcttctcagctcctaagtggggct
cttgcgtctttacaagtctcccaccccataaaagagatcaccgcggagctgtccagaacc
cctggaaacagctgggtcggacggtga
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_3|109_aa
MSQAAPGARHPRPGLLGLCSLQFPAPFPGGALGPQEHGLGCQVALEEHKELFVFRLGSSS
RSVLLLHGSPEADLATHRLANDLEGPVHLPSSSFIMLTVSPLVHSLFHR
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_3|330_bp
atgagccaggcagcacccggggcccggcaccccaggcctggactcctgggcctctgctcg
ctccagtttccggcacctttcccaggaggggctctgggaccccaggagcatggcctgggc
tgccaggtggccttggaagagcacaaagagctgtttgtcttcaggctcggctcctcctca
cggtctgtgctgctcctgcacggctccccagaagctgatctggccacacacaggctggcc
aatgacctcgaaggcccagtccacctgccgagttcttccttcatcatgctcaccgtcagt
cccctcgtccacagcctcttccaccgttag
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_4|570_aa
MLDGSPLARWLAAAFGLTLLLAALRPSAAYFGITVVARVSIAFFLMTELDSTGRLSERFF
DRSSVDGHLDHFHCLAMMDNVAVSIGRKTLGDGCKARETTKTDPASLEKPNRARGGEKLT
GSEPLTILPLTLEPEAAAQAHYKACDRLKLERKQRRMCRRDPGVAETLVEAVSMSALECQ
FQFRFERWNCTLEGRYRASLLKRGTRTVVAGGQGMLTRPLTGPQATRGCPDSPRLMPGFK
ETAFLYAISSAGLTHALAKACSAGRMERCTCDEAPDLENREAWQWGGCGDNLKYSSKFVK
EFLGRRSSKDLRARVDFHNNLVGVKASGVIKAGVETTCKCHGVSGSCTVRTCWRQLAPFH
EVGKHLKHKYETALKVGSTTNEAAGEAGAISPPRGRASGAGGSDPLPRTPELVHLDDSPS
FCLAGRFSPGTAGRRCHREKNCESICCGRGHNTQSRVVTRPCQCQLTSSHWPWAHPVQRH
PPGGWRHLGIHEVLPHLLEHGSDAPEQFPDNRASPSWLAGPLCSSEAPPCPLPSAYSNWL
AQEERMLGRAVTSLEVTLALQLEDREDKGV
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_4|1713_bp
atgctggatgggtccccgctggcgcgctggctggccgcggccttcgggctgacgctgctg
ctcgccgcgctgcgcccttcggccgcctacttcggcatcaccgttgtagcacgcgttagt
attgccttctttcttatgactgagttagactcgactggaagactctccgagagatttttt
gaccgttcatcagttgacggacatctggatcacttccactgtttggctatgatggataat
gttgctgtgagcattggaaggaagacacttggggatggctgcaaggctcgggaaaccaca
aagactgacccagcaagtttggaaaaaccaaatagagctagagggggagaaaagctgacg
ggcagcgagcccctgaccatcctcccgctgaccctggagccagaggcggctgcccaggcg
cactacaaggcctgcgaccggctgaagctggagcggaagcagcggcgcatgtgccgccgg
gacccgggcgtggcagagacgctggtggaggccgtgagcatgagtgcgctcgagtgccag
ttccagttccgctttgagcgctggaactgcacgctggagggccgctaccgggccagcctg
ctcaagcgaggcacacgcacagtcgttgctggtggccagggcatgctgactcggcctttg
acggggccccaggctacccggggctgccccgacagccctcggctcatgccaggcttcaag
gagactgccttcctctatgccatctcctcggctggcctgacgcacgcactggccaaggcg
tgcagcgcgggccgcatggagcgctgtacctgcgatgaggcacccgacctggagaaccgt
gaggcctggcagtgggggggctgcggagacaaccttaagtacagcagcaagttcgtcaag
gaattcctgggcagacggtcaagcaaggatctgcgagcccgtgtggacttccacaacaac
ctcgtgggtgtgaaggcaagtggggtgatcaaggctggggtggagaccacctgcaagtgc
cacggcgtgtcaggctcatgcacggtgcggacctgctggcggcagttggcgcctttccat
gaggtgggcaagcatctgaagcacaagtatgagacggcactcaaggtgggcagcaccacc
aatgaagctgccggcgaggcaggtgccatctccccaccacggggccgtgcctcgggggca
ggtggcagcgacccgctgccccgcactccagagctggtgcacctggatgactcgcctagc
ttctgcctggctggccgcttctccccgggcaccgctggccgtaggtgccaccgtgagaag
aactgcgagagcatctgctgtggccgcggccataacacacagagccgggtggtgacaagg
ccctgccagtgccagctgacaagctcccactggccctgggcccaccctgtccaacgacat
ccaccaggtgggtggagacaccttggaatccacgaggtgcttcctcacctgctcgagcac
gggtcagatgctcctgagcagtttccggacaacagggcatccccatcttggcttgcgggg
cctctgtgctcctcggaagcccctccctgcccactgcccagtgcatacagcaactggctg
gcccaggaggagcggatgctgggcagagctgtcacctccctggaggtgaccttggccttg
cagcttgaagaccgggaggacaaaggcgtttag
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_5|238_aa
MVEDEDTKHEAGQPRGEAGGTIQLQENKLNTPTDSTLWMSSSSQCVYRPVSCLTTTSFGP
TPAELLAASVGHRILKARLLPLNNLFGLSAYPSPGGLGRPTASSSQAPQAQARLWPPNNV
FGLSSCPAPSGPGRPTTSRSQVPQAQAPNGLRSVGSSRPSLGLLAASAGPNRPEVGLSRP
SLGLPAASAGPSHPQIGLELGLEEQQVGLPGPNSVLSAASPGLMTVMATFWHNEKYRP
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_5|717_bp
atggtggaagatgaggacaccaagcatgaagctggccagccccgtggggaggcaggtggg
accatccagttgcaggaaaacaagcttaacacgcccactgattctacattatggatgagc
tcatcctcccagtgcgtctacaggcccgtctcctgcctcacaacaacctcctttggccca
actcctgctgagctgctggcagcctctgtaggccacagaattcttaaggcccgactcctg
cctctcaacaacctttttggactcagtgcctacccatctcctggcggccttggtcggccc
acagcttcctcaagccaagctccccaggcccaggcccgactctggcctcccaacaacgtc
tttggactcagctcctgcccagctcccagtggccctggtaggcccacaacttcccgaagc
caagttccccaggcccaggccccaaacggcctccggtcggtgggctcctctaggcccagc
ttgggcctcctggcggcctctgcaggcccaaatcgtcccgaagtcggcctctccaggccc
agcttgggcctcccggcggcctctgcaggcccaagtcatcctcaaatcggcctggaattg
ggcctggaagagcagcaagtcggcctccccgggcccaactccgtcctctcggcggcctct
ccaggtttaatgacggtcatggcaactttttggcacaatgaaaaatatcgcccatga
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_6|284_aa
MFVWRNVEGHSVAVFPWYSIPFLTPPCSHMRPSKLPVTQWPPTRENNLPSWQLLLMSVHQ
AQSLSALRKEQDSSSEKDGRSPNKWDKDHIWWPMSGSHNLQQAAPGPGRAHQGHPNQDNR
TISQMLSEQWYTLGPNEMQKYDLAFQHMASEDIASDEEHTVIHEEEGVMMSLLMMALAPL
TLISSSRNPEPPGSPDPPAAFGKVYGPTLSSSYTYSDASSSTLAPTSFLLGPGAFKAQES
GGEAEDRLREPETEKALSSSLHAPRTSAGPDHAALPGPLLLPVH
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_6|855_bp
atgttcgtgtggaggaatgtggaaggtcactctgtggccgtgttcccctggtactccatc
cccttcctgacccctccctgcagccacatgaggcccagcaaactgccagtcactcagtgg
cctccaaccagagaaaacaacctgccaagttggcagctgttgctcatgagcgtccaccag
gcccagtccctcagtgccctgcgcaaggaacaggactcatcttctgagaaggatggacgc
agccccaacaaatgggacaaggaccacatctggtggcccatgagtggcagtcataatctt
cagcaagcagcaccaggccctggcagggcgcaccagggtcaccccaaccaggataaccgg
accatcagccagatgctgagcgagcaatggtacaccctggggcccaatgagatgcagaaa
tacgacctggccttccagcacatggccagtgaggacatagcgagtgacgaggagcacacg
gtcatccatgaggaggagggggtgatgatgtcattgctgatgatggctttagcaccactg
acactgatctcaagttcaaggaatcctgagcccccagggtcccccgatccacctgcagct
tttggcaaagtctatggtcccaccctgtcctcctcctacacatactcggatgcttcctcc
tcaaccttggcacccacctccttcttactgggcccaggagccttcaaagcccaggagtct
ggtggagaagctgaggacaggctcagggaaccggagaccgagaaggcgctgtcctcttca
ctgcacgcgccccggaccagtgccggccctgatcatgcagctcttccaggcccactgctt
cttcctgtccactag
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_7|117_aa
MGLVTSLKGADLLVGQDTDPHSNTWNGEPLNMKATRMKTFMRTHCRLMNSPLYIPCLSLR
PVSTFVGIVQLRHCGNYLLRVNITMRAPVDITDVGMSVDSKPFACCATNATFEKADA
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_7|354_bp
atgggcctcgtgacctcactgaagggggcagacctccttgttggccaggacacagaccca
cattccaacacttggaacggggagccactcaatatgaaggccactaggatgaagaccttt
atgaggacccactgccgcctcatgaactcccctctctacattccttgtctgtccctccgc
cccgtctccacctttgttggcattgttcagctcaggcactgtgggaattacctcctccgc
gttaatatcaccatgagagcccccgtggacattactgatgtgggaatgtctgttgactca
aagcccttcgcatgctgtgctacgaacgccacctttgagaaggctgacgcatga
>gi568815597r:227821521_228025520|GENSCAN_predicted_peptide_8|107_aa
MPSLIFDFLFNDPTDPSRPSQNLVAQSLAVGPQYSSLGSQPILCASIPGLVPKQLRFCRN
YVEIMPSVAEGIKIGIQECQHQFRGRRWNCTTVHDSLAIFGPVLDKX
>gi568815597r:227821521_228025520|GENSCAN_predicted_CDS_8|321_bp
atgcccagccttatatttgattttctttttaatgacccaactgatccttcaaggcccagc
caaaatctggtggcccagtcgctggctgttgggccacagtattcctccctgggctcgcag
cccatcctgtgtgccagcatcccgggcctggtccccaagcagctccgcttctgcaggaac
tacgtggagatcatgcccagcgtggccgagggcatcaagattggcatccaggagtgccag
caccagttccgcggccgccggtggaactgcaccaccgtccacgacagcctggccatcttc
gggcccgtgctggacaaagnn