GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:50:09

Sequence gi568815578r:31365509_31573013 : 207505 bp : 46.31% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3143   3200   58  1  1   74   89    99 0.909   8.09
 1.02 Term +   7992   8011   20  1  2   86   42    31 0.237  -3.22
 1.03 PlyA +   8750   8755    6                               1.05

 2.04 PlyA -   9084   9079    6                               1.05
 2.03 Term -  11931  11738  194  1  2   97   48    98 0.461   4.28
 2.02 Intr -  24986  24915   72  0  0   94   97    53 0.243   6.18
 2.01 Init -  40644  40587   58  0  1   43   97   100 0.877   5.88
 2.00 Prom -  43161  43122   40                              -3.66

 3.00 Prom +  44589  44628   40                              -8.46
 3.01 Init +  48257  48387  131  1  2   70   68    78 0.717   3.62
 3.02 Intr +  54161  54315  155  2  2   45   74   120 0.869   6.02
 3.03 Term +  65720  65745   26  0  2  122   49     4 0.142  -1.71
 3.04 PlyA +  66364  66369    6                               1.05

 4.06 PlyA -  66420  66415    6                               1.05
 4.05 Term -  74667  74609   59  1  2   93   43    73 0.819   1.15
 4.04 Intr -  74966  74862  105  0  0   85   76    18 0.490   0.49
 4.03 Intr -  75487  75379  109  1  1   79   81    45 0.973   2.76
 4.02 Intr -  78315  78082  234  0  0   69   97    77 0.821   4.59
 4.01 Init -  83690  83469  222  2  0   63   98   108 0.759   7.86
 4.00 Prom -  91584  91545   40                              -5.46

 5.00 Prom +  98202  98241   40                              -3.86
 5.01 Init + 110938 111277  340  0  1   73   75   183 0.635  12.94
 5.02 Intr + 112320 112402   83  1  2   83   60   122 0.658   8.26
 5.03 Intr + 116779 116980  202  0  1   95  105   254 0.997  26.66
 5.04 Term + 118651 118922  272  2  2   69   48   355 0.999  25.25
 5.05 PlyA + 119225 119230    6                               1.05

 6.00 Prom + 122651 122690   40                              -3.26
 6.01 Init + 131641 131934  294  0  0   48   86   145 0.392   7.49
 6.02 Intr + 139887 139989  103  2  1   61   32   108 0.394   2.25
 6.03 Intr + 142383 142435   53  1  2  128   45    16 0.407  -0.07
 6.04 Intr + 148951 149226  276  0  0   63   97   426 0.801  38.71
 6.05 Intr + 161976 162074   99  2  0  115  106   137 0.920  18.51
 6.06 Intr + 172671 172753   83  2  2   95   71   106 0.899   8.04
 6.07 Intr + 176920 177075  156  1  0   98   70    43 0.809   2.63
 6.08 Intr + 179439 179527   89  0  2  134   86    65 0.993  10.51
 6.09 Intr + 181931 182184  254  0  2   68  103   111 0.502   7.75
 6.10 Intr + 183457 183606  150  0  0   81  113   108 0.128  12.96
 6.11 Intr + 183699 183824  126  2  0   63   92   207 0.999  19.48
 6.12 Intr + 184556 184613   58  1  1  146  101    98 0.999  15.36
 6.13 Intr + 188031 188048   18  1  0  102  111     1 0.566   0.38
 6.14 Intr + 189238 189321   84  2  0  104   99    91 0.977  11.59
 6.15 Intr + 194103 194139   37  1  1  113  109    92 0.998  11.12
 6.16 Intr + 196126 196228  103  1  1   88   80   200 0.991  19.38
 6.17 Intr + 200702 200787   86  1  2   76  110    73 0.771   6.92
 6.18 Term + 202570 202720  151  1  1  101   38    64 0.516   0.08
 6.19 PlyA + 206671 206676    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 100155  99998  158  1  2  103   37    69 0.848   1.40

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_1|25_aa
MKLLLLALPMLVLLPQVIPAVFMAA

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_1|78_bp
atgaaactcctgctgctggctcttcctatgcttgtgctcctaccccaagtgatcccagct
gtcttcatggcagcataa

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_2|107_aa
MKLLLLLLTVTLLLAQVTPGLPAMKLLYLFLAILLAIEEPVISGKRHILRCMGNSGICRA
SCKKNEQPYLYCRNCQSCCLQSYMRISISGKEENTDWSYEKQWPRLP

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_2|324_bp
atgaagctccttcttctgcttttgactgttactctgctcctggcccaggtcaccccaggt
ctgccagccatgaaacttctttacctgtttcttgccatccttctggccatagaagaacca
gtgatatcaggcaaacgccacatccttcgatgcatgggtaacagtggaatttgtagggcc
tcttgcaaaaagaacgaacagccctacctctattgcagaaattgtcagtcctgctgcctc
cagtcctacatgaggataagcatttctggcaaagaggaaaataccgactggtcttatgag
aagcagtggccaagactaccttga

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_3|103_aa
MGLECRKPELSKQQKTIVGKAVEKMEPLHTLGGNVKWCGHYGKREFAQDLMVFESVWQFP
LVPSLSPAAVEDVPCFLFIFCPDYKFPEASPAMQNYVYGATRF

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_3|312_bp
atgggcttagaatgcagaaagccagaattatccaaacaacaaaagacaattgtgggcaag
gctgtggagaaaatggaacccttgcacacgctgggtgggaatgtaaaatggtgtggccac
tatggaaaacgtgagttcgcacaagatctgatggtttttgaaagtgtttggcagttcccc
ctggtgccctctctctcacctgctgccgtggaagatgtgccttgcttcctcttcatcttc
tgccctgattataagtttcctgaggcctccccagccatgcagaactatgtttatggtgcc
actagattctga

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_4|242_aa
MTHNQEKNKSTNTEMTMMMKLADKDDKTAMINMLQVVKKVEEAMSMMRRNKDVKNIHGRA
RWVKPMIPALWEAKEVEGKYSAHNFWGSWKFKSSSRSSSSGPESPARTHASFCQPDGGPT
NKLGTKAFRVSPASSLLVDLNTQEVEIINVRKATPTCSLELGRKRRDGAAERAALDVVVV
IYQLAPAAAPNCLNPVTSRRRHKHRLRKVREDGRVLKNKYKATGSSFRFLRDLRTNHSES
FF

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_4|729_bp
atgacccataaccaggagaaaaacaaatcaaccaacacagaaatgacaatgatgatgaaa
ctagcagacaaggatgataaaacagctatgataaatatgctccaagtagtaaaaaaggtg
gaggaagccatgagcatgatgagaagaaataaagatgtaaaaaatattcatggccgggca
cgatgggtcaagcctatgatcccagcactttgggaggccaaggaagtagaggggaaatac
tctgcccataatttctggggaagctggaagtttaaatccagctcccgcagcagcagcagt
gggcctgagagccctgctagaacccatgcctcattctgccagcctgatggtggacccacc
aacaagctagggacaaaggcctttcgtgtgtctcctgctagcagtctactggttgacctg
aacactcaggaagtagagataattaatgtgagaaaggccacccccacatgctccctggag
ctgggcaggaagaggagggatggagccgctgagagagcagctctggatgttgtagttgtc
atctatcagctggcacctgctgctgctccaaactgcctgaacccagtgaccagcaggagg
cgccacaaacaccggctccggaaggtcagggaagacggcagagttttgaagaataaatac
aaagcaacagggtcatccttccgcttcctcagagatctgagaaccaaccactcagaatcc
ttcttctga

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_5|298_aa
MTLNTEQEAKTPLHRRASTPLPLSPRGHQPGRLSTVPSTQSQHPRLGQSASLNPPTQKPS
PAPDDWSSESSDSEGSWEALYRVVLLGDPGVGKTSLASLFAGKQERDLHEQLGEDVYERT
LTVDGEDTTLVVVDTWEAEKLDKSWSQESCLQGGSAYVIVYSIADRGSFESASELRIQLR
RTHQADHVPIILVGNKADLARCREVSVEEGRACAVVFDCKFIETSATLQHNVAELFEGVV
RQLRLRRRDSAAKEPPAPRRPASLAQRARRFLARLTARSARRRALKARSKSCHNLAVL

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_5|897_bp
atgacactcaacaccgagcaggaagcaaagacccctctgcaccggcgagccagcacccca
ctgcccctgtccccacggggccaccagcctggccgcctgagcacagtgccttccactcaa
tcccagcatccccggctgggccaatcagcctccctcaaccctcccacccagaaaccttca
cctgccccagatgattggtcttctgaatccagcgactctgaaggctcctgggaggctctc
taccgtgtggtgctacttggagatcctggagtggggaagaccagcttggccagcctcttt
gcagggaagcaagagagggacctccatgaacagctgggagaagatgtatatgagaggacc
ctcacggtggatggagaagacaccacactggtggtcgtggacacctgggaggccgagaaa
ctggataaaagctggagccaggagtcatgcctgcaggggggcagtgcctatgtcatcgta
tactccatcgcagaccgaggcagctttgagagtgcctctgagctccgcatccagctgcgg
cgcacacatcaggcagaccatgtgcccatcatcctcgtgggcaacaaggcagacttggcc
cgctgccgagaagtctctgtggaagagggccgcgcctgcgctgtggtgttcgactgtaaa
ttcatcgagacatccgccacgctgcagcacaatgtggccgagctcttcgagggcgtggtg
cgccaactgcgcttgcgccgccgggacagtgcggccaaggaacccccagcaccccgacgg
ccggccagcctagcccagcgcgctcgtcgcttcctggcacgcctgacagcccgcagcgca
cgccgccgggcactcaaggcccgctccaagtcctgccacaatctggccgtgctctga

>gi568815578r:31365509_31573013|GENSCAN_predicted_peptide_6|739_aa
MNLGINHSWNGDAGKLDSLGMRTADSGRVRWPGRRIDPILRDLRQGWLRSESHQPHQMHP
DLYDISAVPLPGKSPLATHNAHNSTIMDTTKALVQYTKQAWFSVKENEDKADKNRKEMQR
EDPGSMDILASRDYNLNKVSRALASFLDHEGNVAFPAEPVSPPASLLQQPELESDPERTL
AMDSALSDPHNGSAEAGGPTNSTTRPPSTPEGIALAYGSLLLMALLPIFFGALRSVRCAR
GKNASDMPETITSRDAARFPIIASCTLLGLYLFFKIFSQEYINLLLSMYFFVLGILALSH
TIRGEGHQDVACQWEAMTFTFDMSHQGPGRISACYKSLHSESFSKHFSRCLAGTSPFMNK
FFPASFPNRQYQLLFTQGSGENKEGDSTGALPIPFVSLLSPASPWIMFKKFDEKESVSNC
IQLKTSVIKGIKSQLVEQFPGIEPWLNQIMPKKDPVKIVRCHEHTEILTGLTGGRGSPAS
GSGLTWPLCSEIINYEFDTKDLVCLGLSSIVGVWYLLRKHWIANNLFGLAFSLNGVELLH
LNNVSTGCILLGGLFIYDVFWVFGTNVMVTVAKSFEAPIKCDKTKAVVFPQDLLEKGLEA
NNFAMLGLGDVVIPGIFIALLLRFDISLKKNTHTYFYTSFAAYIFGLGLTIFIMHIFKHA
QPALLYLVPACIGFPVLVALAKGEVTEMFSYESSAEILPHTPRLTHFPTVSGSPASLADS
MQQKLAGPRRRRPQNPSAM

>gi568815578r:31365509_31573013|GENSCAN_predicted_CDS_6|2220_bp
atgaatctgggcatcaaccactcctggaatggagatgctggcaaactcgactctctgggg
atgcgcacagctgactcagggagggtgcggtggccaggaagaaggatcgatcctattctc
agagatcttcgccagggctggttaagatctgagagccatcagccacatcaaatgcatcct
gatctctatgacatcagtgcggtccctctgcctggtaaatctcctcttgctactcacaat
gcccataattccaccattatggatacaaccaaggctcttgttcagtatacaaagcaagcc
tggttttctgtgaaagagaatgaagacaaagcagataaaaacagaaaggagatgcagaga
gaggatcctggcagcatggacatcctggcttccagggactacaatttgaacaaggtgtcc
agagctttagcatccttcttagatcatgaggggaacgtggctttccctgcagagccggtg
tctccgcctgcgtccctgctgcagcaaccggagctggagtcggatcccgaacgcaccctc
gccatggactcggccctcagcgatccgcataacggcagtgccgaggcaggcggccccacc
aacagcactacgcggccgccttccacgcccgagggcatcgcgctggcctacggcagcctc
ctgctcatggcgctgctgcccatcttcttcggcgccctgcgctccgtacgctgcgcccgc
ggcaagaatgcttcagacatgcctgaaacaatcaccagccgggatgccgcccgcttcccc
atcatcgccagctgcacactcttggggctctacctctttttcaaaatattctcccaggag
tacatcaacctcctgctgtccatgtatttcttcgtgctgggaatcctggccctgtcccac
accatcaggggagaagggcaccaggatgtggcttgtcagtgggaagcaatgacctttacg
tttgacatgagccatcaagggccaggaagaatctctgcctgctacaaaagtctgcattca
gagtcattcagcaagcattttagcagatgtctagcaggcaccagccccttcatgaataag
ttttttccagccagctttccaaatcgacagtaccagctgctcttcacacagggttctggg
gaaaacaaggaaggggactccaccggagccttgccaattccgtttgtttccctgttgtcg
cccgcttcaccctggatcatgttcaagaagtttgatgaaaaggaaagtgtgtccaactgc
atccagttgaaaacgtcagttattaagggcattaagagccaactggtagagcaatttcca
ggtattgaaccatggcttaatcaaatcatgcctaagaaagatcctgtcaaaatagtccga
tgccacgaacatacagaaatccttaccgggctgacaggtgggaggggtagccctgcctca
gggagtggacttacctggcctctctgctcagagatcatcaattatgaatttgacaccaag
gacctggtgtgcctgggcctgagcagcatcgttggcgtctggtacctgctgaggaagcac
tggattgccaacaacctttttggcctggccttctcccttaatggagtagagctcctgcac
ctcaacaatgtcagcactggctgcatcctgctgggcggactcttcatctacgatgtcttc
tgggtatttggcaccaatgtgatggtgacagtggccaagtccttcgaggcaccaataaaa
tgtgacaaaactaaggcagtggtgtttccccaggatctgctggagaaaggcctcgaagca
aacaactttgccatgctgggacttggagatgtcgtcattccagggatcttcattgccttg
ctgctgcgctttgacatcagcttgaagaagaatacccacacctacttctacaccagcttt
gcagcctacatcttcggcctgggccttaccatcttcatcatgcacatcttcaagcatgct
cagcctgccctcctatacctggtccccgcctgcatcggttttcctgtcctggtggcgctg
gccaagggagaagtgacagagatgttcagctacgagtcctcggcggaaatcctgcctcat
accccgaggctcacccacttccccacagtctcgggctccccagccagcctggccgactcc
atgcagcagaagctagctggccctcgccgccggcgcccgcagaatcccagcgccatgtaa