GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:00:43 Sequence gi568815597r:67605475_67807686 : 202212 bp : 41.70% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3189 3314 126 1 0 57 64 136 0.849 7.83 1.02 Term + 7547 7692 146 0 2 59 54 135 0.815 4.29 1.03 PlyA + 8749 8754 6 1.05 2.10 PlyA - 9032 9027 6 1.05 2.09 Term - 9550 9337 214 0 1 46 45 141 0.037 1.32 2.08 Intr - 12131 12004 128 2 2 69 59 74 0.044 1.16 2.07 Intr - 14394 14281 114 0 0 81 63 60 0.014 2.52 2.06 Intr - 21184 21118 67 0 1 67 97 12 0.001 -2.01 2.05 Intr - 27730 27511 220 2 1 99 63 123 0.264 7.34 2.04 Intr - 29020 28865 156 2 0 76 27 97 0.387 1.46 2.03 Intr - 32944 32919 26 0 2 109 53 30 0.024 -1.65 2.02 Intr - 34113 34041 73 1 1 108 79 79 0.028 6.55 2.01 Init - 52609 52468 142 1 1 65 69 92 0.062 5.34 2.00 Prom - 53121 53082 40 -10.05 3.02 PlyA - 53697 53692 6 1.05 3.01 Sngl - 55541 55266 276 2 0 68 55 198 0.645 9.63 3.00 Prom - 60185 60146 40 -3.95 4.00 Prom + 60480 60519 40 -7.65 4.01 Init + 62967 63008 42 2 0 95 69 65 0.174 5.77 4.02 Intr + 79452 79734 283 2 1 -2 70 191 0.130 4.37 4.03 Intr + 80013 80064 52 1 1 18 113 45 0.309 -2.95 4.04 Intr + 80130 80546 417 0 0 64 -1 266 0.772 7.51 4.05 Intr + 80551 80652 102 1 0 86 86 181 0.998 16.07 4.06 Intr + 80876 81113 238 2 1 76 103 363 0.969 33.29 4.07 Term + 82187 82300 114 1 0 78 45 92 0.915 1.49 4.08 PlyA + 82604 82609 6 1.05 5.05 PlyA - 82723 82718 6 1.05 5.04 Term - 93310 93146 165 1 0 50 43 107 0.669 -0.67 5.03 Intr - 95375 95231 145 1 1 118 97 44 0.254 7.66 5.02 Intr - 111196 111151 46 2 1 99 103 -4 0.039 -1.25 5.01 Init - 114816 114552 265 0 1 68 40 231 0.085 13.52 5.00 Prom - 120734 120695 40 -6.55 6.04 PlyA - 121703 121698 6 1.05 6.03 Term - 130681 130589 93 1 0 86 45 107 0.648 3.05 6.02 Intr - 135402 135302 101 1 2 61 81 96 0.230 5.11 6.01 Init - 137845 137821 25 1 1 63 79 19 0.205 -1.66 6.00 Prom - 138201 138162 40 -5.25 7.04 PlyA - 138445 138440 6 1.05 7.03 Term - 139060 138858 203 2 2 61 40 196 0.742 8.67 7.02 Intr - 146109 145974 136 0 1 39 57 88 0.212 0.02 7.01 Init - 147148 147068 81 1 0 18 84 89 0.232 2.52 7.00 Prom - 149799 149760 40 -3.45 8.07 PlyA - 150122 150117 6 1.05 8.06 Term - 151126 151025 102 1 0 52 53 122 0.596 2.40 8.05 Intr - 156373 156277 97 0 1 123 86 31 0.247 5.49 8.04 Intr - 161454 161315 140 0 2 54 37 116 0.053 1.44 8.03 Intr - 162125 162019 107 0 2 82 38 65 0.065 -0.09 8.02 Intr - 166084 165969 116 0 2 93 -7 139 0.077 4.07 8.01 Init - 168645 168566 80 0 2 53 66 63 0.128 1.18 8.00 Prom - 174677 174638 40 -5.55 9.00 Prom + 178520 178559 40 -3.65 9.01 Init + 183655 183694 40 0 1 113 95 47 0.495 8.30 9.02 Intr + 185053 185099 47 2 2 105 69 4 0.041 -2.59 9.03 Term + 193127 193192 66 1 0 115 39 105 0.972 5.26 9.04 PlyA + 193491 193496 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 69246 69674 429 2 0 59 39 198 0.871 6.47 S.002 Init + 192913 192957 45 0 0 76 53 73 0.924 3.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_1|90_aa XGKFPGNLTISSCELGLASPGVPLVIAEGLAEKGTSEQETERGLQRDQCPESRRVGSQND SGRLCTVEHWPLTAVGVWRSCWPGALQGLC >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_1|273_bp nctggcaaatttcctggaaatttaaccatcagctcttgtgaactgggactagccagccct ggcgtaccactggtgattgcggaaggccttgctgagaaagggacatctgaacaggaaact gaaagaggattgcagcgtgaccagtgccctgagagcagacgagtggggagccagaatgac agtggccgcctctgcacagtggaacactggcccttgacagctgtaggggtctggaggagc tgctggcctggcgctttgcagggcttgtgctga >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_2|379_aa MISMMKLVHKNIKATPYVQKVEENIAIMRREFESVKKTQMELLEIIQSFSDFLTWQLASK RAEMEVASVLKTHQNLKSKAEWFLLSAFMGCLTLMRPHQNLITRVSLKESLLHGTPGTVF PCPDKETDSERESLVQRTEGTLIFHVASQQALVCPVPIAAEDISFPDICQITFYLVPDIY SVKMGHSHDPRDQTDPHRQDRLGKAQTNHMASPTTKKLANVRKHMEYLECSAKNETMDTK TPCSLDCSGVISSKKPFGFSPSNQTGGIIAAQDTHVWGTEEEITSSYERDAQFHLEGEYA ASSKTQHSSFSESEIQVQVQLSWVIRVQISHKAVIKVSVRLQSPQGSTAGGSVSQLTWVA VGSPQKIDFPAQSHGAGRR >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_2|1140_bp atgatatcaatgatgaaactagtgcataagaatattaaagccactccatatgttcaaaag gtagaagaaaatatagccataatgagaagagaatttgaaagtgtgaaaaagacccaaatg gaacttctagagattatccaaagtttttcagacttcctgacatggcaactggcttcaaag agagcggaaatggaagttgccagcgttcttaagacgcatcagaacctgaagtccaaggct gaatggttcctactctcagctttcatgggatgcctaaccctcatgaggcctcatcaaaac ctcattaccagagtgagcctaaaagagtctctgctccatgggaccccagggacagtgttc ccatgtccagataaagaaacagattcagagagggaaagtcttgtgcagaggacagagggg accctcatctttcatgtcgcttcccagcaagcccttgtgtgtcctgtgccaattgcagct gaagatatttcatttcctgatatttgccaaatcaccttttacctagttcctgatatttat tctgtcaagatggggcacagccatgacccgagagaccagacagacccacacaggcaagac cgattaggcaaagcccagactaatcacatggcctcgcctacaacaaaaaaactagcaaat gtgaggaagcacatggaatatttggagtgctctgccaagaatgagacgatggacacaaaa accccttgttctctagactgttcaggtgtcatctcctccaagaagccctttggattctct ccctccaatcagactggggggataatagcagcacaagacactcatgtatgggggacagag gaagaaataacttcatcatatgagagggatgcccaattccatttggaaggggagtatgca gcctccagcaaaacccaacacagcagtttctctgagtcagaaattcaggtacaggtgcag cttagttgggtgattagggtgcagatctctcacaaagctgtaatcaaggtgtctgtcagg ctacagtcacctcaagggtcaactgcagggggatctgtttcccagctcacttgggtggct gttggcagtcctcagaaaatcgactttccagctcaatcacatggtgctggcagacgttag >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_3|91_aa MASNVINKSNPHSMNPRVFIGNLNTLVVKKSDVEAIFSKYGKTVGGSVHKGFASVHCVHE RNIQPTVAGGDGRMIAGQVLDTNLAAELKVN >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_3|276_bp atggctagcaatgttatcaacaagtcaaatcctcactccatgaaccccagagtattcatt gggaatctcaacactcttgtggtcaagaagtctgatgtggaagcaatcttttcaaagtat ggcaaaactgtgggtggctctgttcataagggctttgcctctgttcactgtgttcatgag agaaatatccagcctactgtggcaggaggggatggcagaatgattgccggccaagtttta gatactaatctggctgcagagctgaaagtgaactga >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_4|415_aa MTFGQQDPVQNEDVPPTPNGPRQPDKSARGTRLPLSAPPPPPPPRLASRVAPPFSAPLNL TPGAGVKLLEQADLHSPMAKLHANEAEGGWLRVGRITPESGALCPPVAVCNMTLEEFSAG EQKTESRAVEGLQGRKLKSLHRATPALAPSGLSVELPRTERACPPTRTSPAGAFAKGSSG RRCPCGSRGWQPVAGALSGLLTGRPVLGRAGVMGGDGAAGAPGFRRAQVRGGACIREGGA WYRTRGAAMAPRAPGLTGPLALARVMDKVGDALEEVLSKALSQRTITVGVYEAAKLLNVD PDNVVLCLLAADEDDDRDVALQIHFTLIQAFCCENDINILRVSNPGRLAELLLLETDAGP AASEGAEQPPDLHCVLVTNPHSSQWKDPALSQLICFCRESRYMDQWVPVINLPER >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_4|1248_bp atgacgtttggccagcaggacccagtgcaaaatgaagatgtgccccccactcccaatggt ccccgccaaccggataagagtgcgcgcgggacccgccttcccctctcggcaccgcccccg cccccgccccctcggctcgcctcccgcgtggctcctcccttttccgctcctctcaacctg actccaggagctggggtcaaattgctggagcaggctgatttgcatagcccaatggccaag ctgcatgcaaatgaggcggaaggtggttggctgagggttggcaggataaccccggagagc ggggccctttgtcctccagtggctgtttgcaatatgactttggaggaattctcggctgga gagcagaagaccgaaagccgggctgtggaaggcttgcaggggaggaagctaaaaagtttg cacagggcaactcccgcccttgctccctcgggactctccgtggagctcccacggactgaa agagcgtgccccccaacccgaacgagccccgccggggcctttgcaaagggcagcagtggc cgtcgctgcccgtgcggctcccgtggctggcagcctgtggcaggggcactctcgggactt ctcacgggacgcccggtccttgggcgtgcaggggtcatggggggtgacggggccgcggga gcgccgggttttcgtagagcccaggtgcgcggtggtgcttgcattcgagagggaggggcg tggtaccggacgaggggggcggcgatggccccgagggcaccggggctgacgggacccctc gcccttgcccgcgtgatggataaggtgggggatgccctggaggaagtgctcagcaaagcc ctgagtcagcgcacgatcactgtcggggtgtacgaagcggccaagctgctcaacgtcgac cccgataacgtggtgttgtgcctgctggcggcggacgaggacgacgacagagatgtggct ctgcagatccacttcaccctgatccaggcgttttgctgcgagaacgacatcaacatcctg cgcgtcagcaacccgggccggctggcggagctcctgctcttggagaccgacgctggcccc gcggcgagcgagggcgccgagcagcccccggacctgcactgcgtgctggtgacgaatcca cattcatctcaatggaaggatcctgccttaagtcaacttatttgtttttgccgggaaagt cgctacatggatcaatgggttccagtgattaatctccctgaacggtga >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_5|206_aa MGPPSPAENLFGAQDECAWSLGILVPHKWPSHAGGESSEHQQLALIKCMETSEAPPRLLR AVQATKQVSICGLRTVSISRDTCRKEQQAPSLYSCCSEAGNKGWAGIPTEMRGGQNHQRT RAGGSQQAPGYEPDTLQRWPLLLRASTPHLQKKTFANYVTSQGHYGSINLCTRASLKQSL PEQMWAVLPPLSQALTDLFINVPDLK >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_5|621_bp atgggcccgccttcaccagctgaaaacttatttggggcacaggatgagtgtgcatggtcc cttggaatcttggtgccacacaagtggccatctcatgctggaggtgaaagttcagagcac cagcaactggccctaatcaaatgcatggaaacatcagaagctccacccaggcttctcaga gctgtgcaggccaccaagcaagtgagcatctgtgggctcagaacagtcagtatttcacgg gatacttgtaggaaggagcagcaagccccttccttgtattcttgctgttcagaggcaggc aacaaagggtgggctgggattcccacagagatgaggggagggcaaaatcatcaaagaaca agggcgggaggcagccaacaagccccaggctatgaaccggacacgctgcagagatggccc ctgcttctgagagccagcactccacacttacaaaagaagacattcgccaactatgtgaca tcacaagggcactatggatccataaatttgtgtactagggcaagcctaaagcagtcactc cctgaacaaatgtgggcagtgctacctcctctaagccaggcactgacggacttgttcatt aatgtccccgaccttaagtaa >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_6|72_aa MKNLASASVLEQWRSKTKALADSMSGEVLLPHTRCLLAMYHKFLQVKWQMYESLHKMGTG RHPVDAKPFSVQ >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_6|219_bp atgaagaaccttgcttctgcttcagttttggagcagtggagatccaagaccaaggctcta gcagattcgatgtctggtgaggtcctgcttcctcatacacggtgccttcttgctatgtac cacaagttcctgcaagtgaaatggcagatgtatgagtcacttcataagatgggaacagga agacatcctgttgatgccaagccattttcagttcagtga >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_7|139_aa MSGPNRKEHHDSLVIAAMSADPLGMSTTPNTATLTCLKTPPWTGDSVSPELPLLRLWRRL GTETLTVGQSTEDYKLVLLLTNVLRISENQLFKEVPEHSVGHSGEETPGVASAFPWTPEN FISGSGVADSSWQYKNSAV >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_7|420_bp atgtctgggcccaacaggaaagaacaccatgactcattagtgatagctgccatgagtgct gatccattagggatgtctaccaccccaaacacagcaaccctgacctgtttgaagacccca ccctggacaggggacagtgtttccccagagctgccactcttgaggctctggaggagactg ggtacagagaccctaactgtgggacagtcaactgaagactataaactggtgctccttctt acaaatgttcttcgtatatctgaaaatcagttatttaaagaagtgcccgagcacagtgta gggcacagtggagaggaaacaccaggggtagcctctgcctttccttggacacctgagaat ttcatctcaggctctggagttgcggacagcagttggcagtacaaaaactctgctgtttaa >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_8|213_aa MIQDSEKCLQSTHKDNATQSGRQHLCSSGTREEEGWIGPKSLSCFSRKSFAENHSSGTIQ ELTDRLKTSIVGTFVQPKEVWKEGCITFPWVEKEEEAESELALPVYAPSVARPAFPKRTP NPFREGEARPSEELLSRHLPDHQMPSSGITERVFLIAPASLPAPSLTRQKADGVHIPSGR VKTITPYLVASKSNPLPILNPEDGKLVSRAIIL >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_8|642_bp atgattcaagactcagagaaatgtcttcagagtactcataaagataatgccacacagagt ggccgacaacatctctgtagctcagggactcgcgaggaggaaggatggataggacccaag tccctttcctgcttcagtcgcaaaagttttgctgaaaaccactcatccgggacaatacaa gagctcacagatagattaaagactagtatagttggcacctttgtgcagcccaaggaggtc tggaaggagggttgcataaccttcccatgggtggaaaaggaggaggaggcagagtctgaa ctagcattgcctgtgtatgccccttctgtagctagacctgcattccctaagaggacacct aaccctttccgtgagggggaagcgaggcccagcgaggaactgctgtccaggcatcttcct gaccaccagatgccatcctcagggatcactgagagagttttcctgattgcccctgcctct ctccctgcaccctccctcacccgacagaaagcggatggtgtgcacataccctcagggagg gttaaaaccatcacaccctatcttgtggcctccaagtcaaacccactccccatcctgaac ccagaagatgggaaactggtgtccagagccatcatcctttga >gi568815597r:67605475_67807686|GENSCAN_predicted_peptide_9|50_aa MAFGLQKMQKDFNDPLLVMYLCICIDNLIPTQCEYDEEEDLYDDPLPLNK >gi568815597r:67605475_67807686|GENSCAN_predicted_CDS_9|153_bp atggccttcggtctgcagaagatgcagaaagacttcaatgatcccctgttggtgatgtat ttatgtatctgcatcgacaacttgatccctacccagtgtgaatatgatgaggaggaagac ctttatgatgatccacttccacttaataaatag