GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:22:19 Sequence gi568815595f:58400945_58601629 : 200685 bp : 45.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2918 2966 49 2 1 90 98 39 0.264 3.78 1.02 Intr + 7980 8057 78 2 0 113 99 39 0.258 7.25 1.03 Intr + 9146 9215 70 1 1 85 93 54 0.217 4.35 1.04 Term + 23808 24016 209 1 2 139 48 128 0.971 11.40 1.05 PlyA + 24189 24194 6 1.05 2.11 PlyA - 24235 24230 6 1.05 2.10 Term - 27235 27090 146 2 2 104 29 115 0.933 5.27 2.09 Intr - 27670 27529 142 1 1 65 63 114 0.978 6.53 2.08 Intr - 28855 28764 92 2 2 115 105 -1 0.999 4.01 2.07 Intr - 29294 29184 111 0 0 56 116 43 0.935 4.25 2.06 Intr - 29998 29713 286 1 1 84 76 170 0.984 12.31 2.05 Intr - 30684 30649 36 0 0 82 80 33 0.558 0.36 2.04 Intr - 30849 30787 63 0 0 50 99 51 0.713 1.31 2.03 Intr - 31040 30933 108 2 0 74 88 126 0.985 11.68 2.02 Intr - 32740 32687 54 1 0 115 92 68 0.970 9.08 2.01 Init - 32865 32824 42 0 0 104 80 40 0.704 4.61 2.00 Prom - 36881 36842 40 -1.66 3.00 Prom + 38668 38707 40 -6.06 3.01 Init + 39198 39301 104 2 2 85 115 60 0.457 8.12 3.02 Intr + 60274 60396 123 1 0 44 113 62 0.027 4.00 3.03 Intr + 71373 71516 144 0 0 75 75 60 0.026 2.80 3.04 Intr + 97769 97838 70 2 1 50 101 70 0.333 3.68 3.05 Term + 100002 100688 687 2 0 73 39 497 0.739 36.91 3.06 PlyA + 101121 101126 6 1.05 4.22 PlyA - 101429 101424 6 1.05 4.21 Term - 101831 101826 6 2 0 97 37 0 0.007 -6.33 4.20 Intr - 108081 107949 133 2 1 88 68 156 0.803 14.25 4.19 Intr - 116479 116262 218 1 2 58 117 212 0.953 18.40 4.18 Intr - 121657 121552 106 0 1 89 105 22 0.982 4.22 4.17 Intr - 123661 123482 180 0 0 131 121 60 0.996 12.58 4.16 Intr - 125712 125522 191 0 2 99 110 194 0.998 21.08 4.15 Intr - 128012 127850 163 1 1 98 67 141 0.969 12.98 4.14 Intr - 128089 128049 41 1 2 52 72 19 0.562 -6.28 4.13 Intr - 128744 128613 132 2 0 42 115 20 0.402 0.94 4.12 Intr - 129694 129515 180 1 0 103 1 277 0.850 20.56 4.11 Intr - 130422 130307 116 1 2 65 31 184 0.923 10.57 4.10 Intr - 130868 130749 120 0 0 89 115 38 0.987 7.07 4.09 Intr - 132608 132501 108 0 0 64 74 102 0.990 6.66 4.08 Intr - 133201 133050 152 0 2 87 89 145 0.975 14.31 4.07 Intr - 133578 133416 163 1 1 123 47 220 0.992 20.43 4.06 Intr - 134253 134003 251 2 2 73 87 160 0.903 11.48 4.05 Intr - 136288 136175 114 0 0 50 99 82 0.932 5.06 4.04 Intr - 165751 165648 104 1 2 129 9 101 0.009 5.27 4.03 Intr - 166420 166264 157 0 1 96 94 311 0.999 32.51 4.02 Intr - 168921 168747 175 1 1 83 97 322 0.532 31.60 4.01 Init - 178879 178831 49 1 1 76 96 49 0.758 5.76 4.00 Prom - 179808 179769 40 -3.06 5.03 PlyA - 183344 183339 6 1.05 5.02 Term - 184079 183991 89 0 2 16 42 109 0.395 -3.08 5.01 Init - 185992 185914 79 1 1 98 114 101 0.978 15.01 5.00 Prom - 197843 197804 40 -3.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 20872 20908 37 0 1 52 121 39 0.893 3.80 S.002 Term - 165751 165644 108 1 0 129 49 107 0.990 9.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:58400945_58601629|GENSCAN_predicted_peptide_1|135_aa XLFSDVLLTTSEKPQFKIPTKLKEALRIAKECIEKRLIEEQKQKSKRSALENSEEHSAKY SNSNNSGISALPPPPPPPPPPAAPLPPASTEAPAQLSSQAVNGMSRGALLSSIQNFQKGT LRKAKTCDHSAPKIG >gi568815595f:58400945_58601629|GENSCAN_predicted_CDS_1|408_bp nnattattcagcgatgttttactaaccacttctgaaaaaccacagtttaagatccctaca aagttaaaagaggcattgagaattgccaaagaatgtatagagaagagactaattgaggaa cagaaacagaagtcaaaacgatctgctcttgaaaatagtgaagagcattcagcgaagtac agcaactccaataattcagggatatctgcattacctccacctcctccacctccaccacca ccagcagctcccttgcctcctgcgagcaccgaggcacctgcccagctctcgtctcaggct gtgaatggcatgagccgaggggccttgctcagctccatccagaatttccaaaaaggaact ttgaggaaagccaaaacctgtgatcacagtgctccgaagatcggctga >gi568815595f:58400945_58601629|GENSCAN_predicted_peptide_2|359_aa MAAVSGLVRRPLREVSGLLKRRFHWTAPAALQVTVRDAINQGMDEELERDEKVFLLGEEV AQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMAGLRPICEFMTFNFSMQAI DQVINSAAKTYYMSGGLQPVPIVFRGPNGASAGVAAQHSQCFAAWYGHCPGLKVVSPWNS EDAKGLIKSAIRDNNPVVVLENELMYGVPFEFPPEAQSKDFLIPIGKAKIERQGTHITVV SHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDMETIEASVMKTNHLVTVEGGWPQFG VGAEICARIMEGPAFNFLDAPAVRVTGADVPMPYAKILEDNSIPQVKDIIFAIKKTLNI >gi568815595f:58400945_58601629|GENSCAN_predicted_CDS_2|1080_bp atggcggcggtgtctggcttggtgcggagaccccttcgggaggtctccgggctgctgaag aggcgctttcactggaccgcgccggctgcgctgcaggtgacagttcgtgatgctataaat cagggtatggatgaggagctggaaagagatgagaaggtatttctgcttggagaagaagtt gcccagtatgatggggcatacaaggttagtcgagggctgtggaagaaatatggagacaag aggattattgacactcccatatcagagatgggctttgctggaattgctgtaggtgcagct atggctgggttgcggcccatttgtgaatttatgaccttcaatttctccatgcaagccatt gaccaggttataaactcagctgccaagacctactacatgtctggtggccttcagcctgtg cctatagtcttcagagggcccaatggtgcctcagcaggtgtagctgcccagcactcacag tgctttgctgcctggtatgggcactgcccaggcttaaaggtggtcagtccctggaattca gaggatgctaaaggacttattaaatcagccattcgggataacaatccagtggtggtgcta gagaatgaattgatgtatggggttccttttgaatttcctccggaagctcagtcaaaagat tttctgattcctattggaaaagccaaaatagaaaggcaaggaacacatataactgtggtt tcccattcaagacctgtgggccactgcttagaagctgcagcagtgctatctaaagaagga gttgaatgtgaggtgataaatatgcgtaccattagaccaatggacatggaaaccatagaa gccagtgtcatgaagacaaatcatcttgtaactgtggaaggaggctggccacagtttgga gtaggagctgaaatctgtgccaggatcatggaaggtcctgcgttcaatttcctggatgct cctgctgttcgtgtcactggtgctgatgtccctatgccttatgcaaagattctagaggac aactctatacctcaggtcaaagacatcatatttgcaataaagaaaacattaaatatttag >gi568815595f:58400945_58601629|GENSCAN_predicted_peptide_3|375_aa MGASLDQEHSGHPAGSRGVEVSSGSATVANSSGGRSIFCPNEVTLGGFLDGGRSPERPSH DEKFGAFSSTPYPLGRLPDSGRYSQSQLPSELGEREKGKYKKTQPCGLTPYSMIQAHSQL CVSSFPETWALEDASLEQMDNGDWGYMMTDPVTLNVGGHLYTTSLTTLTRYPDSMLGAMF GGDFPTARDPQGNYFIDRDGPLFRYVLNFLRTSELTLPLDFKEFDLLRKEADFYQIEPLI QCLNDPKPLYPMDTFEEVVELSSTRKLSKYSNPVAVIITQLTITTKVHSLLEGISNYFTK WNKHMMDTRDCQVSFTFGPCDYHQEVSLRVHLMEYITKQGFTIRNTRVHHMSERANENTV EHNWTFCRLARKTDD >gi568815595f:58400945_58601629|GENSCAN_predicted_CDS_3|1128_bp atgggtgcctcgctggatcaggagcacagcggacaccctgccggatccagaggggtggaa gtcagcagcgggtctgcaacggtggcaaacagcagtggtggacggagcatcttttgtcct aatgaggtgactcttggtgggttcctggatgggggccggtctccagaaagaccaagccat gatgagaagtttggagcttttagctccactccctatcctctgggaaggcttccagactct ggacggtattcccagagtcagctcccttctgaattgggagaaagagagaaaggaaaatac aagaagacccaaccatgtggattaacaccctatagcatgatccaggcccacagccagctc tgtgtttccagtttccctgaaacctgggctcttgaagacgcatcactggagcagatggat aatggagactggggctatatgatgactgacccagtcacattaaatgtaggtggacacttg tatacaacgtctctcaccacattgacgcgttacccggattccatgcttggagctatgttt gggggggacttccccacagctcgagaccctcaaggcaattactttattgatcgagatgga cctcttttccgatatgtcctcaacttcttaagaacttcagaattgaccttaccgttggat tttaaggaatttgatctgcttcggaaagaagcagatttttaccagattgagcccttgatt cagtgtctcaatgatcctaagcctttgtatcccatggatacttttgaagaagttgtggag ctgtctagtactcggaagctttctaagtactccaacccagtggctgtcatcataacgcaa ctaaccatcaccactaaggtccattccttactagaaggcatctcaaattattttaccaag tggaataagcacatgatggacaccagagactgccaggtttcctttacttttggaccctgt gattatcaccaggaagtttctcttagggtccacctgatggaatacattacaaaacaaggt ttcacgatccgcaacacccgggtgcatcacatgagtgagcgggccaatgaaaacacagtg gagcacaactggactttctgtaggctagcccggaagacagacgactga >gi568815595f:58400945_58601629|GENSCAN_predicted_peptide_4|952_aa MTPNIRGLLPSSPWLSAAMYSEIQRERADIGGLMARPEYREWNPELIKPKKLLNPVKASR SHQELHRELLMNHRRGLGVDSKPELQRVLEHRRRNQLIKKKKEELEAKRLQCPFEQELLR RQQRLNQLEKPPEKEEDHAPEFIKVRENLRRIATLTSEERELPYVGGEKGRTTSLPEAAA YSRGEASPAYSADTGQRGARFFALTNAESRPSEQPGWKCLSIVTRQIQDRMGSPVHRVSL GDTWSRQMHPDIESERYMQSFDVERLTNILDGGAQNTALRRKVESIIHSYPEFSCKDNYF MTQNERYKAAMRRAFHIRLIARRLGWLEDGRELGYAYRALSGDVALNIHRVFVRALRSLG SEEQIAKWDPLCKNIQIIATYAQTELGHGTYLQGLETEATYDAATQEFVIHSPTLTATKW WPGDLGRSATHALVQAQLICSGARRGMHAFIVPIRSLQDHTPLPGIIIGDIGPKMDFDQT DNGFLQLNHVRVPRENMLSRFAQVLPDGTYVKLGTAQSNYLPMVVVRVELLSGEILPILQ KACVIAMRYSVIRRQSRLRPRQGNLGCKSSLEDRATGKKGLLLLSCPRPIYLGCRSVPRV VIRKMRQVFMVSGIGEGGHNSDPEAKVLDYQTQQQKLFPQLAISYAFHFLAVSLLEFFQH SYTAILNQDFSFLPELHALSTGMKAMMSEFCTQGAEMCRRACGGHGYSKLSGLPSLVTKL SASCTYEGENTVLYLQVARFLVKSYLQTQMSPGSTPQRSLSPSVAYLTAPDLARCPAQRA ADFLCPELYTTAWAHVAVRLIKDSVQHLQTLTQSGADQHEAWNQTTVIHLQAAKVHCYYV TVKGFTEALEKLENEPAIQQVLKRLCDLHAIHGILTNSGDFLHDAFLSGAQVDMARTAYL DLLRLIRKDAILLTDAFDFTDQCLNSALGCYDGNVYERLFQWAQKSPTNTQK >gi568815595f:58400945_58601629|GENSCAN_predicted_CDS_4|2859_bp atgacgcccaacatccgaggactactgcccagttctccttggctgtcagccgccatgtac tcggagatccagagggagcgggcagacattgggggcctgatggcccggccagaatacaga gagtggaatccggagctcatcaagcccaagaagctgctgaaccccgtgaaggcctctcgg agtcaccaggagctccaccgggagctgctcatgaaccacagaaggggccttggtgtggac agcaagccagagctgcagcgtgtcctagagcaccgccggcggaaccagctcatcaagaag aagaaggaggagctggaagccaagcggctgcagtgcccctttgagcaggagctgctgaga cggcagcagaggctgaaccagctggaaaaaccaccagagaaggaagaggatcacgccccc gagtttattaaagtcagggaaaacctgcggagaattgccacactgaccagcgaagagaga gagctcccctatgtgggaggagagaagggcaggaccacaagcctgccagaggctgcggcc tacagccggggagaggccagcccggcctacagtgcggacacaggacagaggggagccagg ttctttgcactgaccaatgctgagagcagaccctcggagcagccgggttggaagtgtctc tccatagtcaccagacagatccaggataggatgggcagcccagtgcaccgagtgtcattg ggggatacctggagcaggcaaatgcaccccgacatagagagcgagaggtatatgcagtcc tttgacgtggaacggctcaccaacatccttgatggaggtgcccagaacactgcactccgc aggaaagttgagagcatcatccacagttacccggagtttagctgtaaggacaattatttc atgacccagaatgagcgttataaggctgccatgcggagggcattccacatccggttgata gctcggcgcctgggttggttagaagatggtcgtgaattaggctacgcttacagagccctt tctggagacgtggccttaaatatacacagagtcttcgtgagagccctcaggagcctgggc tcagaggagcagattgccaaatgggacccactctgcaaaaacatccagatcatcgcaacg tatgcacagacagagttgggacatgggacatatcttcagggcctggagactgaagccacc tatgacgcagccacccaggagtttgtgatacacagccccacgctgactgccaccaaatgg tggcctggagacttgggacggtcagccacccatgccctggtccaggcccagctgatctgc tcaggagccaggcggggcatgcacgcttttattgtgccaatccggagtcttcaggaccac accccactgccaggaatcatcattggggacatcggacccaagatggactttgatcaaaca gacaatggcttcctgcagctgaaccatgtgcgggtccccagggagaacatgctgagtcgc tttgcacaggtcttgccagatggcacctacgtcaaactcggtacagcacagagcaactac cttcccatggtggtggtgcgggtggagctgctgtcaggggagatcctccctatactgcag aaggcctgtgtcatcgccatgcgctactcggtcatccgccgccaatcccggctccggccc aggcaagggaatctgggctgcaagagttctctggaggacagggccactgggaagaagggg cttctgctgctgtcctgcccgaggcccatttatctgggctgcagaagtgtccccagggtg gttataagaaagatgcgccaggtttttatggtgtcggggataggggaaggtggacacaac agtgacccagaggcaaaggtcctggactaccagacacaacagcagaaactctttcctcag ctggccatcagttatgccttccatttcctggcagtcagcctcttggagttcttccagcac tcctacactgccattctgaaccaagacttcagcttcctgcctgagctccacgcactgagc acgggcatgaaggccatgatgtcagaattctgcacccagggagctgagatgtgccgcagg gcctgtggcggacatggctactcaaagctgagtggcctgccatcactggtcaccaaattg tcggcctcctgtacctacgagggtgagaacacagtgctctacctgcaggtggccaggttc ctggtgaagagctacctgcagactcagatgtcccctggctccacgccacagagatctctc tctccatctgtcgcatatctcaccgcacctgacctggccaggtgtccagcccagagggca gccgacttcctctgcccggagctctacaccacggcctgggcacatgtggcagtaaggctc ataaaggactcagtgcagcatttacagaccctgacgcaatccggagctgaccagcacgag gcttggaaccagaccactgtcatacacctccaggctgctaaggtgcactgctactatgtc actgtgaagggttttacagaagctctggagaaactagaaaatgaaccagcgattcagcag gtgctcaagcgcctctgtgacctccatgccatacatggaatcttgactaactcgggtgac tttctccatgacgccttcctgtctggtgcccaagtggacatggcaagaacagcctacctg gacctgctccgcctgatccggaaggatgccatcctgttaactgatgcttttgacttcacc gatcagtgtttaaattcagcacttggctgttatgatggaaacgtctacgaacgcctgttc cagtgggctcagaagtcaccaaccaatactcagaaataa >gi568815595f:58400945_58601629|GENSCAN_predicted_peptide_5|55_aa MAQRLGEWARGPSDATGLYRAVLLRSVLPIDGVQVLGVMNKELDKTHKQSKERMK >gi568815595f:58400945_58601629|GENSCAN_predicted_CDS_5|168_bp atggcgcagaggctgggcgagtgggcccgggggccctccgatgccaccgggctctaccgg gctgtgctgctccggtcggtgttacccattgacggtgtccaggttcttggcgtcatgaac aaagaattggacaaaacgcacaaacaaagcaaggaaagaatgaagtaa