GENSCAN 1.0	Date run: 6-Nov-116	Time: 15:22:33

Sequence gi568815588r:118494135_118695244 : 201110 bp : 43.30% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    784    779    6                               1.05
 1.07 Term -   5373   5257  117  0  0   94   44    89 0.423   3.64
 1.06 Intr -  13699  13676   24  1  0   74   58    72 0.091   1.02
 1.05 Intr -  22684  22578  107  2  2   74   77    62 0.158   3.63
 1.04 Intr -  48592  48538   55  1  1   93   92    65 0.561   5.95
 1.03 Intr -  49807  49675  133  0  1   51   81    71 0.578   3.35
 1.02 Intr -  50400  50287  114  2  0   93   37    60 0.455   0.96
 1.01 Init -  56519  56431   89  2  2   80   55    63 0.659   2.31
 1.00 Prom -  61168  61129   40                              -5.56

 2.00 Prom +  62056  62095   40                              -3.66
 2.01 Init +  78287  78343   57  1  0   78   68    51 0.671   3.41
 2.02 Intr +  85959  86078  120  2  0   49   84   113 0.679   7.69
 2.03 Term +  96310  96540  231  0  0   70   40   114 0.393   1.27
 2.04 PlyA +  96563  96568    6                               1.05

 3.02 PlyA -  99284  99279    6                               1.05
 3.01 Sngl - 101110  99998 1113  1  0  103   44  2294 0.998 222.52
 3.00 Prom - 108895 108856   40                              -4.46

 4.00 Prom + 116301 116340   40                              -3.46
 4.01 Init + 135611 135655   45  1  0   72   98    40 0.534   4.16
 4.02 Intr + 144647 144758  112  1  1   79   88   106 0.997   9.65
 4.03 Intr + 144916 145081  166  2  1   60   27   117 0.480   1.82
 4.04 Intr + 145759 145906  148  1  1   65   63    67 0.497   2.14
 4.05 Term + 149657 150064  408  1  0   52   33   349 0.639  21.02
 4.06 PlyA + 152646 152651    6                               1.05

 5.02 PlyA - 153930 153925    6                               1.05
 5.01 Sngl - 154912 154457  456  1  0   70   41   196 0.969   9.32
 5.00 Prom - 155331 155292   40                              -6.96

 6.02 PlyA - 155388 155383    6                               1.05
 6.01 Sngl - 157267 156821  447  1  0   79   48   224 0.984  13.73
 6.00 Prom - 160270 160231   40                              -4.66

 7.05 PlyA - 160348 160343    6                               1.05
 7.04 Term - 160963 160579  385  0  1   67   41   172 0.623   4.76
 7.03 Intr - 164031 163881  151  1  1   78   88   113 0.547   9.52
 7.02 Intr - 176916 176808  109  0  1   62   61    63 0.004   0.86
 7.01 Init - 183975 183931   45  0  0   83   95    28 0.019   3.68
 7.00 Prom - 190035 189996   40                              -3.76

 8.03 PlyA - 190134 190129    6                               1.05
 8.02 Term - 194222 194150   73  1  1   83   43    86 0.373   0.98
 8.01 Init - 196639 196599   41  1  2  114   80     3 0.335   1.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  28081  27989   93  1  0   74  105    27 0.829   3.16
S.002 Intr + 175948 176061  114  0  0   95   35    96 0.914   5.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_1|212_aa
MGYNVHRELCFPPIWMETPEALQSPAEKQRLLWNTLPASILMNQKARACGSHQSCVFFWY
MPDVPDGLRRCISYVLATEVTPVPHFNPDMTQFGQENAWWAEFGFLLKGGYLQEASPGGP
EKMAIRKPDTAYSFVWVGLKSEVPNPQATDGYRGLATFDLRTIHMKQGNYVDFKPKHPSK
HYTDMKRNKRHEWSYSHHIPCDVLSIEGTSYT
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_1|639_bp
atggggtacaatgtccacagggagctctgctttccaccaatatggatggaaacaccagaa
gctctgcagagcccagcagagaaacagaggctcctctggaacacactacctgcttccatc
ctcatgaaccagaaagcacgtgcctgtggcagtcaccagtcatgtgtcttcttttggtac
atgcctgatgttccagatggtctaaggaggtgcataagctacgttttagcaaccgaggtg
acccctgtcccccacttcaatccagacatgacccagtttgggcaggaaaatgcctggtgg
gcagagtttggctttttactcaaaggaggctacctgcaagaggcttcaccgggggggcca
gagaagatggccatcaggaagcccgacacagcgtattcttttgtatgggtgggccttaag
tcagaggttcctaacccccaggccacagatgggtaccggggcttggcgacctttgactta
aggaccatccatatgaagcaaggcaactacgtggacttcaagccaaagcatccaagtaag
cactacacagacatgaaaagaaataagaggcacgagtggtcctacagccaccacattcca
tgtgatgtgctaagtattgaaggcacttcctacacgtag
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_2|135_aa
MTAPLQPQTTGTNTVANTHNNSQQSNAEISLIAVKYKAVSAPLGFCSISYEHPGVSLFTV
AVGACIETLFCQFNEINLLIGCQKCDLVTKLAKRITEPPISLEKGCCQVPSWLSSQPDCL
YYSLQEEDCVYILDA
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_2|408_bp
atgactgctccactccagccacagaccacaggaacgaacactgtggccaacacccacaat
aacagtcaacaatcaaatgcagaaatcagccttattgctgtgaagtacaaggctgtgtct
gctcctcttgggttctgttccatcagctacgagcatccaggagttagcctgttcaccgta
gctgttggagcatgcattgaaacactcttctgccagtttaatgagataaatctattgatt
ggctgccaaaagtgtgacctggtgacaaagcttgcaaagaggattacagaacctcccatc
agccttgagaagggctgctgccaagtaccaagttggctcagttcccagcctgactgcctg
tattactccttgcaggaagaggactgtgtatacatcttagatgcttga
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_3|370_aa
MASSTTRGPRVSDLFSGLPPAVTTPANQSAEASAGNGSVAGADAPAVTPFQSLQLVHQLK
GLIVLLYSVVVVVGLVGNCLLVLVIARVRRLHNVTNFLIGNLALSDVLMCTACVPLTLAY
AFEPRGWVFGGGLCHLVFFLQPVTVYVSVFTLTTIAVDRYVVLVHPLRRRISLRLSAYAV
LAIWALSAVLALPAAVHTYHVELKPHDVRLCEEFWGSQERQRQLYAWGLLLVTYLLPLLV
ILLSYVRVSVKLRNRVVPGCVTQSQADWDRARRRRTFCLLVVIVVVFAVCWLPLHVFNLL
RDLDPHAIDPYAFGLVQLLCHWLAMSSACYNPFIYAWLHDSFREELRKLLVAWPRKIAPH
GQNMTVSVVI
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_3|1113_bp
atggcctcatcgaccactcggggccccagggtttctgacttattttctgggctgccgccg
gcggtcacaactcccgccaaccagagcgcagaggcctcggcgggcaacgggtcggtggct
ggcgcggacgctccagccgtcacgcccttccagagcctgcagctggtgcatcagctgaag
gggctgatcgtgctgctctacagcgtcgtggtggtcgtggggctggtgggcaactgcctg
ctggtgctggtgatcgcgcgggtgcgccggctgcacaacgtgacgaacttcctcatcggc
aacctggccttgtccgacgtgctcatgtgcaccgcctgcgtgccgctcacgctggcctat
gccttcgagccacgcggctgggtgttcggcggcggcctgtgccacctggtcttcttcctg
cagccggtcaccgtctatgtgtcggtgttcacgctcaccaccatcgcagtggaccgctac
gtcgtgctggtgcacccgctgaggcggcgcatctcgctgcgcctcagcgcctacgctgtg
ctggccatctgggcgctgtccgcggtgctggcgctgcccgccgccgtgcacacctatcac
gtggagctcaagccgcacgacgtgcgcctctgcgaggagttctggggctcccaggagcgc
cagcgccagctctacgcctgggggctgctgctggtcacctacctgctccctctgctggtc
atcctcctgtcttacgtccgggtgtcagtgaagctccgcaaccgcgtggtgccgggctgc
gtgacccagagccaggccgactgggaccgcgctcggcgccggcgcaccttctgcttgctg
gtggtgatcgtggtggtgttcgccgtctgctggctgccgctgcacgtcttcaacctgctg
cgggacctcgacccccacgccatcgacccttacgcctttgggctggtgcagctgctctgc
cactggctcgccatgagttcggcctgctacaaccccttcatctacgcctggctgcacgac
agcttccgcgaggagctgcgcaaactgttggtcgcttggccccgcaagatagccccccat
ggccagaatatgaccgtcagcgtggtcatctga
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_4|292_aa
MMPELRAAMQMGSGQKNVASETQAGLQQDRFSPSLNVQVALGVAAIGSFLGRVLSEIEKS
HFQLEDEKGTAPITRAVDKNQPHAEAGLLSCSDHVLSQSWCLYLALAGRVECQRPQRAKV
EASHWSGNVQDLQLDTALVPPASDVLIGIREPAPDSHGNPHPQTSPDELLPKGKAVKTEE
ELEEDDDEELEETLSERLWGLTEMFPERVRSVAGATSDLFLFVAQKMYRFSRAALWTGTT
SFMILVLPLVFETEKLQMEQQQQLQQQQILLGPNTGLSGGMPGALPSLPGKI
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_4|879_bp
atgatgccagaacttagagctgccatgcagatgggcagtgggcagaaaaatgtagcctct
gagacacaagcaggtctgcagcaagaccgcttctctccctccctcaacgtgcaggttgct
ctcggagttgctgccatcggaagttttctaggcagagtcctttcagaaatagaaaaatcc
cacttccagttagaggacgagaaaggcacagctccaatcactagagctgttgacaaaaat
cagccacatgctgaggcaggtctgctgagctgttcagaccatgtgctttcacagtcctgg
tgcttgtacctggccctggctggaagggtagagtgtcaaagaccccaaagggcaaaggtg
gaggcatcccactggagtgggaacgtgcaggacctgcagctggacactgctctggtacca
cctgcatctgatgttttgattggtatcagggaacctgcccccgatagtcacgggaacccc
catccccagacgtccccggatgaattgctcccaaaaggcaaagccgtgaagactgaggag
gagctggaggaggatgacgatgaggagctagaggagaccctgtcagagagactatggggc
ctgacggagatgtttccggagagggtccggtccgtggctggagccacttctgatctcttc
ctctttgtggctcaaaaaatgtacaggttttccagggcagccttgtggactgggaccact
tcctttatgatcctggttcttccccttgtctttgagactgagaagttgcaaatggagcaa
cagcagcaactgcagcagcagcagatacttctagggcctaacacagggctctcaggagga
atgccaggggctctaccttcacttcctggaaagatctag
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_5|151_aa
MDKFLDTYTLPRLNQEEVQSLNRPITSSEIEAVINSLPIKNSPGPDGFTAEFYQRYKEEL
VPFLLKLFQTIEKQGLVPNSFYEASIILIPKPGREITKKENFRPISLMNIDAKILNKILA
NQIQQHIKKLIHHDQLGFIPGCKAGSTYANQ
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_5|456_bp
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagtccaatcc
ctgaatagaccaataacaagttctgaaattgaggcagtaattaatagcctaccaatcaaa
aacagcccaggaccagatggattcacagctgaattctaccagaggtacaaagaggagctg
gtaccattccttctgaaactattccaaacaatcgaaaaacagggactcgtccctaactca
ttttatgaggccagcatcatcctgataccaaaacctggcagagaaataacaaaaaaagaa
aatttcaggccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccaaatccagcaacacatcaaaaagcttatccaccacgatcaacttggcttcatccct
ggatgcaaggctggttcaacatatgcaaatcaataa
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_6|148_aa
MRKNQHQKAENSKNQNTSSPKDHNSSPAREQIWIENEFDKLTEAGFRRLVITNSSEVKEH
VLTQCKEAKNLEKRLEELLTRITSLEKNINYLMELKNTAQELHKAYTSINSQIDKVEDRI
SEIEDQLNEIKCEDKIREKRMTRNEQSL
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_6|447_bp
atgaggaaaaaccagcaccaaaaggctgaaaattccaaaaaccagaacacctcttctcca
aaggatcacaactcctcaccagcaagggaacaaatctggatagagaatgagtttgacaaa
ttgacagaagcaggcttcagaaggttggtaataacaaactcctctgaggtaaaggagcat
gttctaacccaatgcaaggaagctaagaaccttgaaaaaaggttagaggaattgctaact
agaataactagtttagagaagaacataaattacttgatggagctgaaaaacacagcacaa
gaacttcataaagcatacacaagtatcaatagccaaatcgataaagtggaagacaggata
tcagagattgaagatcaacttaatgaaataaaatgtgaagacaagattagagaaaaaaga
atgacaaggaatgaacaaagcctctaa
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_7|229_aa
MAHKCEKMFSLISRQRTLTNTEPLIPDDPPAEKPRCPLTLTGIPGVLAQLQKPTQMRRNQ
KTNPGNTTKQGLSTPPKNHISSPAMDPNQEEIPDLPEKEFQSTGSPSHSNQTRERKGIQI
GKEEVKLFLFVDNMIIYLENPKDSSRKLLELIKEFSKVSGYKINVHKSVALLYTNSDRAE
NQIKNSTPFTIAAKQQQQQQQQQNHQNQNQKKLRNIPNKGVERPLQGKL
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_7|690_bp
atggcccataaatgtgaaaagatgttcagcctcattagtcgtcagagaaccctaactaat
acagagcccttgatccccgatgacccaccagcagagaaacccaggtgtcctcttacactg
actggcatcccgggggtccttgcccagctccaaaagcctacccaaatgagaaggaaccag
aaaaccaaccctggtaatacgacaaaacaaggcttgtcaacgccccctaaaaatcacatt
agttcaccagcaatggatccaaaccaagaagaaatccctgatttacctgaaaaagaattc
cagagtactggaagtcctagccacagcaatcagacaagagaaagaaagggcatccaaatt
ggtaaagaggaagtcaaactgttcctgtttgttgacaatatgatcatttaccttgaaaac
cctaaggactcctccagaaagctcctagaactgataaaagaattcagcaaagtttctgga
tacaagattaatgtacacaaatcagtagctcttctatacaccaacagcgaccgcgcggag
aatcaaatcaagaactcaaccccttttacaatagctgcaaaacaacaacaacaacaacaa
caacaacaaaaccaccaaaaccaaaaccaaaaaaaacttaggaatatacccaacaaagga
gtcgaaagacctctacaaggaaaactataa
>gi568815588r:118494135_118695244|GENSCAN_predicted_peptide_8|37_aa
MPGPYILKKGFPTGIFGECNCDDTSVNNNGRHSLGFG
>gi568815588r:118494135_118695244|GENSCAN_predicted_CDS_8|114_bp
atgcccggcccatacattttgaagaagggatttcccactggaatatttggtgaatgcaac
tgtgatgatactagtgtgaataacaacggaagacactcacttggctttggctag