GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:09:50 Sequence gi568815578f:6670127_6879086 : 208960 bp : 39.85% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10886 11008 123 1 0 55 131 60 0.251 6.86 1.02 Term + 18486 18662 177 2 0 3 54 132 0.410 -2.00 1.03 PlyA + 20111 20116 6 1.05 2.11 PlyA - 20354 20349 6 1.05 2.10 Term - 23629 23425 205 0 1 59 37 134 0.631 1.26 2.09 Intr - 27695 27347 349 0 1 52 44 263 0.594 11.79 2.08 Intr - 30130 30040 91 1 1 95 76 45 0.799 2.65 2.07 Intr - 33309 33208 102 0 0 73 70 88 0.910 4.95 2.06 Intr - 35644 35526 119 2 2 -5 63 142 0.033 1.66 2.05 Intr - 40592 40428 165 0 0 68 102 88 0.147 7.21 2.04 Intr - 45417 45347 71 2 2 60 53 74 0.070 -0.99 2.03 Intr - 48298 48243 56 1 2 91 59 78 0.082 2.06 2.02 Intr - 56430 56283 148 2 1 74 78 78 0.583 4.72 2.01 Init - 58440 58361 80 0 2 83 69 47 0.820 2.88 2.00 Prom - 62871 62832 40 -6.85 3.00 Prom + 65146 65185 40 -5.85 3.01 Init + 66201 66333 133 2 1 53 35 65 0.171 -1.94 3.02 Intr + 67301 67422 122 0 2 32 87 127 0.593 6.29 3.03 Intr + 68691 68791 101 2 2 43 37 77 0.291 -3.91 3.04 Intr + 71309 71436 128 2 2 44 25 167 0.227 5.40 3.05 Intr + 81375 81449 75 1 0 56 94 79 0.399 3.87 3.06 Term + 96312 96490 179 1 2 145 55 74 0.068 6.67 3.07 PlyA + 96720 96725 6 1.05 4.00 Prom + 99783 99822 40 -9.25 4.01 Init + 100001 100346 346 1 1 92 100 369 0.664 33.92 4.02 Term + 108119 108963 845 0 2 58 38 604 0.947 44.47 4.03 PlyA + 109683 109688 6 1.05 5.04 PlyA - 109901 109896 6 1.05 5.03 Term - 117928 117597 332 2 2 77 47 172 0.810 5.83 5.02 Intr - 148648 148498 151 1 1 75 53 76 0.039 1.61 5.01 Init - 152955 152881 75 0 0 85 103 56 0.474 7.94 5.00 Prom - 178321 178282 40 -2.15 6.03 PlyA - 181377 181372 6 1.05 6.02 Term - 194176 193904 273 1 0 64 48 155 0.228 3.59 6.01 Init - 205931 205842 90 2 0 67 11 126 0.052 3.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 35624 35526 99 2 0 80 63 128 0.934 9.81 S.002 Term + 96312 96474 163 0 1 145 53 127 0.868 11.33 S.003 Init - 119534 119483 52 2 1 28 92 58 0.890 1.57 S.004 Init + 128980 129016 37 0 1 86 81 60 0.824 5.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_1|99_aa AAAEAFAWVISPTSALGSRKCWNYYHPVDKIEADLPSVTQPVTLVALQGMDFSGSSQWEG GDLLGDENNLRSLLAMHNNPWLLGAQCWECVGGANEEME >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_1|300_bp gctgcagctgaggcttttgcttgggtaatttcccccacgtcagctctaggaagtaggaag tgttggaactactaccatccagttgacaaaattgaagctgacttgcccagcgtcacacag ccggtaactctggtggccttacaaggaatggatttcagtggcagttctcagtgggaagga ggggacttgttgggagatgaaaacaatctgagaagccttttagcaatgcataataaccct tggcttttaggggcacagtgttgggaatgcgtagggggagccaatgaggagatggaatga >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_2|461_aa MRPEQLPHSPGHQEVTNSCDNRFSCAKQNTGNPSFGGQNTDTGLEFSLGLFSTDLNQDGK SSMINWHLKPQSSSTTIKKWKQKELNLSVAIELKVFLAMRDRLKRKCGPMLRTHSGAEEN KHLLKNTYKTSFPDQGKQRSPKPLQAKEKTDALWVPSRNGVVGFHGGNGCKEQALITSEN MLRGRWGYDHRADEEADTGWALVTYDKPQDLDKCVIVKEKTMESQEEDAESRKNWRKIKK GEESKNQHVLNPPKQPQKLNSGFSRMKLLDTCAGNFAGSSQSTQMLNGKDSREVTACRGV DRMKRSNRGLSHQGLATAGACTIPRLRAEGKGQCQQNPERAAATGGRPLTGVMAISEGVI TVNTRTDPKECGDKHSNFSLTSRKLGGTGSGSQGLPVEVVEITTGNGEGEAAERECCLEP KAVPLGVPQKPSPPRNDGWKDHSRTRSFAYAGLLTRLLYLH >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_2|1386_bp atgaggccggagcagctgcctcacagtccaggtcaccaagaagtcacaaacagttgtgat aataggttcagttgtgctaagcaaaatacaggaaatccgtcatttggaggtcaaaacact gatactggacttgaattctcgttgggtctgttctcaactgatttaaaccaagatggaaaa agtagcatgatcaattggcacctcaaaccccaatcttcatctactacgataaagaaatgg aagcaaaaagagctaaatctgtccgtggccatagagctaaaggtctttttggccatgagg gaccggctgaagaggaagtgtggtcccatgctcagaacacatagtggtgcagaagagaat aagcaccttctcaagaatacatataaaacaagcttcccagaccaaggtaaacagagatct cctaaacctctgcaggcaaaggaaaagactgatgctctttgggtcccttctagaaatggg gttgtaggattccatggaggcaatggctgtaaggaacaagcattaattacttctgaaaac atgctgcgagggagatggggctatgaccatcgtgctgatgaggaagcagatactggttgg gcattggtaacttatgataaaccacaggatcttgacaagtgtgtgattgttaaagagaaa acaatggaaagtcaggaagaggatgcagaaagcaggaagaactggagaaaaattaaaaag ggagaggaatctaaaaaccagcatgtcctgaaccctccaaagcaaccccagaaactcaac tctggcttctcaagaatgaaattgttggacacctgtgctggtaattttgcaggaagttcc cagtccactcagatgctgaacggaaaagactctagagaagtgactgcatgcagaggtgtg gacaggatgaagcgaagcaacagggggctaagccaccaggggctagcaacagcgggagcc tgtaccatccccaggcttagagcagaggggaagggtcaatgccagcagaacccagaaaga gctgcggccacgggagggaggccactgactggggtcatggccattagcgaaggtgtgatc actgtcaacaccagaactgacccgaaggaatgtggagacaaacactccaacttctctctc acatcacggaagctaggaggcacaggaagtgggtcccaagggttgcctgttgaagtagtt gagataaccactggaaatggggaaggagaagctgctgagagagaatgttgccttgagccc aaggcagttcctttaggagtgccccaaaagcccagccctcccaggaatgacggctggaaa gatcattcgaggacacgcagctttgcctacgctgggctactgactcgcctactttacctg cactga >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_3|245_aa MVIVINKNLPDISGGMGACSALGNKGFILNLFPTDNQSVILSGKEQLDPDEIGEKSILAW IVTYTYLKVLTIRVHIIYRWVIALIKLLFDPELQFLHIQKENRVVMRIREDNSKVSGTFT PSSQKPEARGYKVRDSIVPSVDIASWDTKQDSAKREDWGEGINEEKANKPEKWAKNLVKA NRNIIASPIALGKQNLAFSKNFLPLVLGIIWASPRCCAWGLLESWEGQKALAPDSSCSSS EQYFG >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_3|738_bp atggtcatcgtcataaacaagaacctcccagacatcagtggtggaatgggggcctgttca gctcttggaaacaaaggctttattctgaatcttttcccgactgataaccagtctgtcatt ctaagtgggaaggagcaactagaccctgatgagattggagagaaatccattcttgcctgg attgtcacctacacatatcttaaagttttaaccattcgtgtacatatcatctaccgatgg gtaattgcacttattaagttactttttgatcctgagcttcaatttctacatatacagaaa gaaaatagggttgtaatgaggattcgagaagataacagcaaagtgtctggcacatttacc cctagcagccagaagccagaagccagaggctacaaggtccgggatagcatcgttccatct gtagatatcgcctcatgggacacaaaacaggatagtgcaaagagagaggactggggagaa ggaattaatgaggaaaaggcgaacaagccagaaaagtgggcaaagaatctggtaaaagca aaccgaaatataatagctagcccgatagctctgggaaagcagaacttggccttttccaaa aattttctgcccttggttttggggatcatttgggcaagcccgaggtgctgtgcatggggg ctcctggaatcctgggaagggcagaaagccttggccccagactcatcgtgcagcagctct gagcagtatttcggctga >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_4|396_aa MVAGTRCLLALLLPQVLLGGAAGLVPELGRRKFAAASSGRPSSQPSDEVLSEFELRLLSM FGLKQRPTPSRDAVVPPYMLDLYRRHSGQPGSPAPDHRLERAASRANTVRSFHHEESLEE LPETSGKTTRRFFFNLSSIPTEEFITSAELQVFREQMQDALGNNSSFHHRINIYEIIKPA TANSKFPVTRLLDTRLVNQNASRWESFDVTPAVMRWTAQGHANHGFVVEVAHLEEKQGVS KRHVRISRSLHQDEHSWSQIRPLLVTFGHDGKGHPLHKREKRQAKHKQRKRLKSSCKRHP LYVDFSDVGWNDWIVAPPGYHAFYCHGECPFPLADHLNSTNHAIVQTLVNSVNSKIPKAC CVPTELSAISMLYLDENEKVVLKNYQDMVVEGCGCR >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_4|1191_bp atggtggccgggacccgctgtcttctagcgttgctgcttccccaggtcctcctgggcggc gcggctggcctcgttccggagctgggccgcaggaagttcgcggcggcgtcgtcgggccgc ccctcatcccagccctctgacgaggtcctgagcgagttcgagttgcggctgctcagcatg ttcggcctgaaacagagacccacccccagcagggacgccgtggtgcccccctacatgcta gacctgtatcgcaggcactcaggtcagccgggctcacccgccccagaccaccggttggag agggcagccagccgagccaacactgtgcgcagcttccaccatgaagaatctttggaagaa ctaccagaaacgagtgggaaaacaacccggagattcttctttaatttaagttctatcccc acggaggagtttatcacctcagcagagcttcaggttttccgagaacagatgcaagatgct ttaggaaacaatagcagtttccatcaccgaattaatatttatgaaatcataaaacctgca acagccaactcgaaattccccgtgaccagacttttggacaccaggttggtgaatcagaat gcaagcaggtgggaaagttttgatgtcacccccgctgtgatgcggtggactgcacaggga cacgccaaccatggattcgtggtggaagtggcccacttggaggagaaacaaggtgtctcc aagagacatgttaggataagcaggtctttgcaccaagatgaacacagctggtcacagata aggccattgctagtaacttttggccatgatggaaaagggcatcctctccacaaaagagaa aaacgtcaagccaaacacaaacagcggaaacgccttaagtccagctgtaagagacaccct ttgtacgtggacttcagtgacgtggggtggaatgactggattgtggctcccccggggtat cacgccttttactgccacggagaatgcccttttcctctggctgatcatctgaactccact aatcatgccattgttcagacgttggtcaactctgttaactctaagattcctaaggcatgc tgtgtcccgacagaactcagtgctatctcgatgctgtaccttgacgagaatgaaaaggtt gtattaaagaactatcaggacatggttgtggagggttgtgggtgtcgctag >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_5|185_aa MGSIGDAQRRLSLGVRTHKGVSAQQVTVLVSSSLDSTLATPAGRATGSVQSSVHLHSSPE WAGSCTSQNFPSQYPEERRFGPCLHPAEASGFPAPSPPPLPHPQLEQPFSASQNWCSRDG RRAISPSPPAFSTREACEMTLPPKPSRITMEKDKHNRILFCVQTGRSQEILGTASVRFNK RSGLR >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_5|558_bp atggggtcgattggtgatgcccagagaagactaagtttaggagtcagaacacacaagggt gtgtctgctcaacaggtaactgtccttgttagtagctcattggattccacactggccacc ccagcgggaagagcaacagggtcagtgcaaagctctgtccatcttcactcttcaccggaa tgggcaggcagctgcacttctcagaatttcccctctcagtaccctgaggagaggcgcttt ggtccgtgcttacaccccgctgaagccagtggatttccagctccctccccgcccccacta ccccacccccagctggagcagcccttctcagcatcccagaactggtgttccagggatggg aggagagccatttctccctcaccaccagcattttccacgagggaagcctgtgaaatgact cttcctcccaagccgagtcgcattaccatggaaaaggacaaacacaataggatcctgttc tgtgtccaaactggccgctcacaggagatcttaggcacagcttcagtcaggtttaacaaa cgctcaggtttgcgatga >gi568815578f:6670127_6879086|GENSCAN_predicted_peptide_6|120_aa MVSARYRGHRQHLLQRLTIEDLHYWYDRDMKSTMQVLIGLRNLDIREKRTCSSHQGLNDH PLPGTICQGTQQSHQALKLVLESSAAGTAPVAEASSHPAILSPDFLTVSHHKTIVHLAHI >gi568815578f:6670127_6879086|GENSCAN_predicted_CDS_6|363_bp atggtgagtgccaggtatagggggcatcggcaacatctgctacaacgtctaacaattgaa gacctgcactactggtatgaccgtgatatgaaaagcaccatgcaagttctcataggactc agaaatctggacatcagagaaaagagaacctgcagttctcaccagggactcaatgaccat ccactgccaggcactatttgccaaggtacccagcaaagccatcaagctctcaagcttgtg ctggagtcatcagcggcaggaaccgctccagtagcagaagccagctcacatccagctata ctcagcccagacttcctaacggtatcacatcataaaaccatcgtgcatctggcacatatt taa