GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:21:17 Sequence gi568815593r:55430159_55630844 : 200686 bp : 40.70% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 413 715 303 1 0 41 42 218 0.548 6.69 1.02 PlyA + 959 964 6 1.05 2.05 PlyA - 2281 2276 6 1.05 2.04 Term - 6417 6224 194 1 2 -7 42 141 0.008 -3.40 2.03 Intr - 9379 9237 143 0 2 12 87 124 0.009 3.78 2.02 Intr - 37929 37711 219 2 0 87 109 176 0.824 16.10 2.01 Init - 41409 41375 35 0 2 67 106 38 0.035 2.89 2.00 Prom - 42076 42037 40 -4.45 3.00 Prom + 46196 46235 40 -4.35 3.01 Init + 67017 67065 49 2 1 38 57 67 0.398 -0.24 3.02 Intr + 67760 67888 129 0 0 74 98 67 0.445 6.05 3.03 Intr + 80514 80684 171 1 0 70 119 -2 0.302 0.09 3.04 Term + 91772 91956 185 0 2 70 55 114 0.613 3.02 3.05 PlyA + 93449 93454 6 1.05 4.05 PlyA - 96185 96180 6 1.05 4.04 Term - 100545 99998 548 1 2 -10 49 264 0.178 6.15 4.03 Intr - 104584 104414 171 2 0 -9 79 236 0.174 11.99 4.02 Intr - 105263 105010 254 1 2 80 52 147 0.307 6.45 4.01 Init - 109549 109542 8 1 2 114 91 0 0.370 3.45 4.00 Prom - 114026 113987 40 -6.15 5.04 PlyA - 114669 114664 6 1.05 5.03 Term - 122121 121838 284 1 2 69 43 233 0.433 11.50 5.02 Intr - 122361 122246 116 2 2 58 77 86 0.161 3.67 5.01 Init - 148284 148178 107 0 2 70 92 55 0.419 3.84 5.00 Prom - 157860 157821 40 -3.65 6.02 PlyA - 158631 158626 6 1.05 6.01 Sngl - 160139 159318 822 2 0 70 44 289 0.659 18.20 6.00 Prom - 164441 164402 40 -7.95 7.05 PlyA - 165014 165009 6 1.05 7.04 Term - 167034 166809 226 2 1 80 36 155 0.515 4.77 7.03 Intr - 183565 183334 232 2 1 37 48 211 0.001 7.91 7.02 Intr - 184535 184302 234 0 0 45 22 190 0.000 4.84 7.01 Intr - 189968 189828 141 0 0 82 92 85 0.170 7.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 42428 42644 217 2 1 3 43 288 0.843 11.43 S.002 Term - 141835 141724 112 0 1 61 44 135 0.837 3.45 S.003 Intr + 183091 183354 264 2 0 78 61 161 0.892 8.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_1|100_aa IYLKENILPYEGKFKKLEEVTVKPDVQIPKYGFKKHEEGNMTLPKEHSNFVATDSNEEMC EISEKEFKIMIQKKLKKLSEIQENSNQKNNTKKSEKPFRI >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_1|303_bp atatatcttaaggaaaatatcctcccctatgaaggcaaattcaaaaaattggaagaagtg actgttaaaccagatgtacagataccgaaatacggattcaagaaacatgaagaaggaaat atgacacttccaaaggaacacagtaattttgtagcaacagattccaatgaagaaatgtgt gaaatctcagaaaaagaattcaaaataatgattcaaaaaaaattgaaaaagctcagtgag atacaagaaaacagcaatcagaaaaataatacaaagaaatcagagaaaccatttaggata tga >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_2|196_aa MVSEVKYTEKLKNNYIATIYKAIGTFLFGAAASQSLTDIAKYSIGRLRPHFLDVCDPDWS KINCSDGYIEYYICRGNAERVKEGRTTLDKFKGVTGQGRNDFQIGQPSESLQIQNDSTAS KWSGRIYGQKKEICFHAADKDIPKAGQFTQERGLIGLPVPRGLGGLTSMAEGKKEQVTSY VDGGRQRELVQGNSTL >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_2|591_bp atggtcagtgaagtcaaatacactgagaagttaaagaataactacatagccactatttac aaagccattggaacctttttatttggtgcagctgctagtcagtccctgactgacattgcc aagtattcaataggcagactgcggcctcacttcttggatgtttgtgatccagattggtca aaaatcaactgcagcgatggttacattgaatactacatatgtcgagggaatgcagaaaga gttaaggaaggcagaacaactttagataaatttaagggtgtcactgggcaaggaagaaac gactttcaaattgggcagccctcagagtcactgcagattcagaatgactccactgcttct aagtggtcagggagaatttatggacagaaaaaggaaatctgttttcatgctgccgataaa gacatacccaaggctgggcagtttacacaagaaagaggtttaattggactcccagttccc cgtggcttaggaggccttacaagcatggcggaaggcaagaaggagcaagttacatcttac gtggatggcggcaggcagagagagcttgtgcagggaaactccaccttataa >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_3|177_aa MAAATPRKKKGKKTKRVLSIVQTHLEASLRLLRLRAGKAMTESRSTGAMEDKQNQEGTEA SIQLLVEPILWCARVEYWTELGSQVYLAGFTSGFLPVTTCGSLGKLLTCLSSVSSSAPSS KPISPTTDANEPKGKAESAVTSCQLTEGLICYTFPICHFPGHPTLKKTQETFPSSDN >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_3|534_bp atggcagctgcaacgccaagaaagaagaaaggaaaaaagactaaaagagtgctatccatt gtccaaacccatctagaagccagtctcaggctcctgagactgagggcagggaaggcaatg acagaaagcaggtccacaggggcaatggaagataagcagaaccaggagggaacagaagct tccattcagctactggtggaaccaatcctttggtgtgccagggtggagtactggacagag ttgggtagtcaagtatatctggctgggtttacttctggctttctcccagttactacttgt gggagcttgggcaaattacttacctgtctgagctcagtttcctcatctgcaccaagcagc aaacccatctctcctactactgatgcaaatgaaccaaaaggcaaggcagaatctgccgtg acaagctgccaattaactgaagggctcatctgctacacgtttcctatatgtcattttcct ggacaccccaccctgaaaaagactcaggaaacatttccaagcagtgacaactga >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_4|326_aa MPSGNNLTCLLPHPHGRGGEWQAAGALGSLGREPSSLKAAMPSGGTEAEFGDHLSNCGPE QSPLDRNGSRPHAPPGCREPHFWADAAGAVARGRRQPRPGLENQGPRPPSRSSVHRPCRA ARAETMFDKTRLPYVALDVLCVLLEERACPERALDLENIMRKFSGSCRCCAKQIKFYRMR LHCKSCKKYQDEYDVSSIIPNFQISEDSVGNSNRSETSTSDNTETYQENTSSSGHPTYKC PLCQESNFTRERLLDHSNSNHLFQIVPVTCPICVSLPWGDPSQITRNFVSHLNQRHQFDY GEFVNLQLDEETQCQTAVEESFQVNI >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_4|981_bp atgcccagtggtaacaatctgacctgtttattacctcatccccatggaaggggtggggag tggcaggcagccggtgctctaggatcgttgggtagggagcctagctctctgaaagcagca atgcctagtgggggtaccgaggccgagttcggagaccacctgagcaattgcggtccagag caaagccccctggatcggaacgggagccgcccccacgccccgcccggctgccgagagccg cacttctgggcggatgccgcaggggccgtagctcggggccgtcgccagccccggcccggg ctcgagaatcaagggcctcggccgccgtcccgcagctcagtccatcgcccttgccgggca gcccgggcagagaccatgtttgacaagacgcggctgccgtacgtggccctcgatgtgctc tgcgtgttgctggaagagagagcatgccctgaacgggccttagaccttgaaaatataatg aggaagttttctggtagctgcagatgctgtgcaaaacagattaaattctatcgcatgaga cttcattgcaaatcttgtaagaagtatcaggatgaatatgatgtttcttctatcattcca aactttcagatctctgaagattcagtagggaacagcaataggagtgaaacatccacatct gataacacagaaacttaccaagagaatacaagttcttctggtcatcctacttataagtgt cccctgtgtcaagaatcaaattttaccagagagcgtttactggatcactctaacagtaat cacttatttcagatagttcctgtgacatgtcctatttgtgtatctcttccttggggagat cctagccagattaccagaaatttcgttagtcatctaaatcagagacatcagtttgattat ggagaatttgtgaatcttcagctagatgaagaaactcaatgtcaaactgctgttgaagaa tcatttcaagtaaacatctga >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_5|168_aa MNGVRQMNFRREITGVLKGHCCISRERIAVEMKMDRPPCSGKGTECQSIAQNFGFQDFSS GHFLWENIKAKTEVGHYDKPKPWTKICELDLVISLKIPFETLKDRLGCCWIHPPSGRVYN MDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDMAKPVTELY >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_5|507_bp atgaatggagttaggcagatgaattttcggagggaaataactggagtgctaaaaggccat tgctgtattagccgtgagagaattgcagtagaaatgaaaatggacagacccccctgctcg ggcaagggcaccgagtgccagagcatcgcccagaactttggcttccaggatttctccagt gggcacttcttgtgggagaacatcaaggccaaaaccgaagttggacattacgacaagccg aagccctggacaaaaatctgtgaactggatttagtaattagtttgaagattccatttgaa acacttaaagatcgtctcggctgctgttggattcaccctcctagcggaagggtatataac atggacttcaatccacctcatgtacatgggattgatgacgtcactggtgaaccattagtc cagcaggaggatgataaacctgaagcagttgctgccaggctgagacagtacaaggatatg gcaaagccagtcactgaattatactag >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_6|273_aa MDKFLDTPTLPRLNQEEVESLNRRITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPLLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NQIQQHIKKLIHHDQVDFIPGMQDWFNISKSINVIQHINRTKYKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDRTYLKIIRAIYDKPTANIILHGQKLEAFPLKTGTRQGCPLSPLLF NIMLEVLAKAIRQEKEIKGIQLGKEEVKLSVCR >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_6|822_bp atggataaattcctcgacacacccaccctcccaagactaaaccaggaagaagttgaatct ctgaatagacgaataacgggctctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccactccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtggacttcatccct gggatgcaagactggttcaatataagcaaatcaataaatgtaatccagcatataaacaga accaaatacaaaaaccacatgatcatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgataggacgtatctcaaa ataataagagctatctatgacaaacccacagccaatatcatactgcatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacataatgttggaagttctggccaaggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccgtttgcagatga >gi568815593r:55430159_55630844|GENSCAN_predicted_peptide_7|277_aa XINNQVILPGVTEMPGYCPFLLPVSTECCAVATSYTCFEEKNIGQCCRVKLQTFAVSVTA FKAAHLELFIHPSRFMVSLASGVKLQTFAVSITAHKGSADPKSEQQQDLLQIVKEQSFRS VEGDPTLGQSMGSGAADQGAALVGEARATQEPTAVGGGSGMAGCRSRALPREEEAEARRE FEHSTCGPALLGDPHTLHSCWPRDGQCRRSARGWWDGSSQTQLCSVSTMWLSNKAAMTRV AKTMCWEPGVDEFLVYIRKTATEFATTVINMQDSSWR >gi568815593r:55430159_55630844|GENSCAN_predicted_CDS_7|834_bp ngcatcaacaaccaggtcatccttccaggtgtcaccgaaatgccaggctattgccccttc ctgctgcctgtctcaactgaatgctgtgctgtggccacatcatacacatgttttgaagag aagaatataggacaatgttgcagagtgaagctgcagaccttcgcagtgagtgttacagct tttaaggcagcgcatctggagttgttcattcatcccagtaggttcatggtctcgctggcc tcaggagtgaagctgcagaccttcgcggtgagtattacagctcataaaggcagtgcggac ccaaagagtgagcagcagcaagatttattgcaaatagtgaaagaacaaagcttccgcagt gtggaaggggacccgacccttgggcagtcgatgggatcgggcgctgcggaccagggagcg gcgctcgtcggggaggctcgggccacacaggagcccacggcagtggggggaggctcgggc atggcaggctgcaggtcccgagccctgccccgtgaggaggaagctgaggcccggcgagaa ttcgagcacagcacctgcgggccagcactactgggggacccacacaccctccacagctgc tggcccagggatggacagtgccggaggtcagccaggggttggtgggatggatccagtcaa actcagctctgctcggtttccaccatgtggctcagcaacaaagcagccatgacacgtgtt gccaagaccatgtgttgggagccaggagtagatgaattcctggtgtacataaggaaaaca gcaactgagtttgctacaactgtgattaacatgcaagattcatcctggagatag