GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:37:41

Sequence gi568815583r:69950100_70196262 : 246163 bp : 48.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10646  10776  131  1  2   78   75    54 0.021   2.72
 1.02 Intr +  24426  24554  129  0  0   61   37   126 0.036   4.61
 1.03 Term +  42219  42282   64  0  1  112   50    48 0.438   0.76
 1.04 PlyA +  42725  42730    6                                1.05

 2.08 PlyA -  43816  43811    6                                1.05
 2.07 Term -  55651  55460  192  1  0   46   49   156 0.881   4.92
 2.06 Intr -  58033  57844  190  0  1  119   61    54 0.817   5.39
 2.05 Intr -  69114  69025   90  2  0   88   99     2 0.296   0.41
 2.04 Intr -  74401  74298  104  1  2   77  111    50 0.601   5.17
 2.03 Intr -  81653  81538  116  0  2   65  110    18 0.124   1.87
 2.02 Intr -  81952  81870   83  0  2   87   87    23 0.252   1.38
 2.01 Init -  85037  85021   17  2  2   97   94    18 0.361   1.86
 2.00 Prom -  90702  90663   40                               -6.26

 3.21 PlyA -  91324  91319    6                                1.05
 3.20 Term - 100105  99998  108  1  0  100   41   130 0.990   8.01
 3.19 Intr - 101368 101292   77  2  2  104   84    74 0.999   7.83
 3.18 Intr - 102425 102275  151  2  1   86   89   306 0.991  30.24
 3.17 Intr - 103275 103128  148  2  1  102   69   180 0.988  17.74
 3.16 Intr - 104586 104339  248  0  2  107   99   428 0.999  42.06
 3.15 Intr - 105199 104950  250  0  1   83   85   275 0.975  24.14
 3.14 Intr - 106275 106199   77  0  2   87   77    31 0.977   0.21
 3.13 Intr - 107559 107375  185  1  2   87   89   317 0.907  31.21
 3.12 Intr - 108192 108060  133  0  1  115   43    90 0.457   7.42
 3.11 Intr - 108716 108564  153  2  0   88   50    99 0.927   6.37
 3.10 Intr - 109361 109311   51  2  0  134   64    86 0.978   9.90
 3.09 Intr - 110550 110431  120  0  0  126   64   168 0.891  18.89
 3.08 Intr - 114371 114355   17  0  2   95  115     6 0.401  -0.74
 3.07 Intr - 116146 115915  232  1  1   84   64   192 0.807  13.75
 3.06 Intr - 124508 124434   75  2  0   79   71   169 0.610  14.01
 3.05 Intr - 126059 125997   63  2  0  121   94    13 0.892   4.11
 3.04 Intr - 144477 144433   45  0  0   71  111    60 0.951   5.21
 3.03 Intr - 145542 145479   64  2  1   81   81   142 0.935  11.52
 3.02 Intr - 146162 146062  101  2  2   74  110    83 0.598   8.01
 3.01 Init - 149166 149062  105  0  0   35   62   135 0.478   3.82
 3.00 Prom - 162793 162754   40                               -2.46

 4.06 PlyA - 165272 165267    6                                1.05
 4.05 Term - 170087 169958  130  1  1  101   42   111 0.843   5.45
 4.04 Intr - 170410 170312   99  0  0   48   86    51 0.252   0.13
 4.03 Intr - 179030 178857  174  1  0  104   57    31 0.151   0.65
 4.02 Intr - 180792 180683  110  0  2  112  103    13 0.577   4.28
 4.01 Init - 202968 202843  126  0  0   83   77    21 0.023   0.78
 4.00 Prom - 205710 205671   40                                0.74

 5.07 PlyA - 211942 211937    6                                1.05
 5.06 Term - 219119 218896  224  0  2   93   49   114 0.755   5.08
 5.05 Intr - 224005 223934   72  2  0   47   99    86 0.300   4.98
 5.04 Intr - 237890 237783  108  0  0   54  113    30 0.277   2.36
 5.03 Intr - 239076 238948  129  1  0   84   95    34 0.945   4.37
 5.02 Intr - 240991 240718  274  1  1  104   14   121 0.943   3.41
 5.01 Intr - 245493 245364  130  2  1   62   91    43 0.406   2.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:69950100_70196262|GENSCAN_predicted_peptide_1|107_aa
MEYYAAMKKDEFMSFVGTCMKLETIILSKLSQGQKTKHCMFSLSFRKLIHGIISLVPLNA
YGRDSGMIIAIWQINAKTQRRKVTFSRAAPWLLALPTVTGITGRPRE

>gi568815583r:69950100_70196262|GENSCAN_predicted_CDS_1|324_bp
atggaatactatgcagccatgaaaaaggatgagttcatgtcctttgtagggacatgcatg
aagctggaaaccatcattctcagcaaactatcacaaggacaaaaaacaaaacactgcatg
ttctcactcagctttcgaaagctcattcacggaattatctcattagttccactcaatgcc
tatgggagggactcagggatgatcatcgccatttggcagataaatgctaaaacccagaga
aggaaagtgactttctcaagggcagctccatggttgctggccctgcctaccgtcaccggc
atcacaggaagaccgagagaatga

>gi568815583r:69950100_70196262|GENSCAN_predicted_peptide_2|263_aa
MASWKRWKGAYESPCLRQLSKEVAMIVLLGVFLGTRDIGKESVLHPLTVSSNPPHNPSTS
SILQKRNLKQERACDSKINTTKSTLVAGFPCPAVEHMNPGTHSAFLCSPVQEAGQVLQAH
LPDAETRAYQCLRSQSWPPRWSDSQGSREPVSGSVLSLRELNTPPCPVKVHRMNNSIHEQ
KTWNPLGWAISSPRRLGEKKCDAGSCGGVSQASHKHEGMYLRTNANVQGMAGRKNRECLG
PRWHCGDAELTTAGAAHLIPNLF

>gi568815583r:69950100_70196262|GENSCAN_predicted_CDS_2|792_bp
atggcttcctggaagagatggaagggggcttacgagtctccttgcttgaggcagctctcc
aaggaagttgcaatgattgttcttcttggagtatttcttggcacaagggacattggtaag
gaaagcgtactgcaccctctcactgtctcatcaaatcctccccacaaccccagcaccagc
tccattttacagaagaggaatctgaagcaagagagggcctgtgattctaaaatcaacaca
accaaatccacactcgtcgctggctttccctgccctgctgtggagcacatgaacccgggg
acgcattcggctttcctttgctcccctgttcaggaggctgggcaggtgttgcaggcccat
ctaccagatgcagaaaccagggcttaccagtgcctaagatcacagagctggcctcctagg
tggtcagattcccaaggctcccgagagcctgtgtctggttcagtgctgtccctcagagag
ctgaacacacccccttgcccagtgaaggtccatcgaatgaacaacagcatccatgagcaa
aagacatggaatcctcttggatgggcaatcagctccccaaggagacttggagaaaaaaag
tgtgatgctgggagctgcggtggtgtctcacaagcttctcacaagcacgaagggatgtac
ctgaggaccaatgccaacgtgcaggggatggcagggagaaagaaccgagagtgcctgggt
cctcgatggcattgtggagatgctgaattaactacagctggagctgctcacctcatcccc
aacctcttctga

>gi568815583r:69950100_70196262|GENSCAN_predicted_peptide_3|800_aa
MRGKYRSGDARPARARARGLAGEGPGGPALAVHRAAPHQPGQPGFKFTVAESCDRIKDEF
QFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIM
PFLSQEHQQQVAQAVERAKQVTMTELNAIIGLYSPPTSLQQQQLQAQHLSHATHGPPVQL
PPHPSGLQPPGIPPVTGSSSGLLALGALGSQAHLTVKDEKNHHELDHRERESSANNSVSP
SESLRASEKHRGSADYSMEAKKRKAEEKDSLSRYDSDGDKSDDLVVDVSNEDPATPRVSP
AHSPPENGLDKARSLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKSNTPTPRN
DAPTPGTSTTPGLRSMPGKPPGMDPIASALRTPISITSSYAAPFAMMSHHEMNGSLTSPG
AYAGLHNIPPQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPAYSFHV
SADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIW
DISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPTPRIKAEL
TSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTK
LWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD
KYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSKESSSVLSCDISAD
DKYIVTGSGDKKATVYEVIY

>gi568815583r:69950100_70196262|GENSCAN_predicted_CDS_3|2403_bp
atgaggggaaaatatagatccggggacgcgcggccggctcgcgcgcgggctcgcggactt
gccggggagggccccgggggtcccgccttggccgtccaccgggcggctccccatcaaccc
gggcagccgggatttaaattcacggtggctgagtcttgtgacaggatcaaagacgaattc
cagttcctgcaagctcagtatcacagcctcaaagtggagtacgacaagctggcaaacgag
aagacggagatgcagcgccattatgtgatgtactatgagatgtcctatggcttgaacatt
gaaatgcacaagcagacagagattgcgaagagactgaacacaattttagcacagatcatg
cctttcctgtcacaagagcaccagcagcaggtggcgcaggcagtggagcgcgccaagcag
gtcaccatgacggagctgaacgccatcatcgggctgtactcaccaccgacttcattgcag
cagcagcagctccaggcgcagcacctctcccatgccacacacggccccccggtccagttg
ccaccccacccgtcaggtctccagcctccaggaatccccccagtgacagggagcagctcc
gggctgctggcactgggcgccctgggcagccaggcccatctgacggtgaaggatgagaag
aaccaccatgaactcgatcacagagagagagaatccagtgcgaataactctgtgtcaccc
tcggaaagcctccgggccagtgagaagcaccggggctctgcggactacagcatggaagcc
aagaagcggaaggcggaggagaaggacagcttgagccgatacgacagtgatggagacaag
agtgatgatctggtggtggatgtttccaatgaggaccccgcaacgccccgggtcagcccg
gcacactcccctcctgaaaatgggctggacaaggcccgtagcctgaaaaaagatgccccc
accagccctgcctcggtggcctcttccagtagcacaccttcctccaagaccaaagacctt
ggtcataacgacaaatcctccacccctgggctcaagtccaacacaccaaccccaaggaac
gacgccccaactccaggcaccagcacgaccccagggctcaggtcgatgccgggtaaacct
ccgggcatggacccgatagcctcggctctgcgcacgcccatctccatcaccagctcctat
gcggcgcccttcgccatgatgagccaccatgagatgaacggctccctcaccagtcctggc
gcctacgccggcctccacaacatcccaccccagatgagcgccgccgccgctgctgcagcc
gctgcctatggccgatcgccaatggttggttttgaccctcaccccccgatgcgggccaca
ggcctcccctcaagcctggcctccattcctggaggaaaaccagcgtactcattccatgtg
agtgctgatgggcagatgcagcccgtgcccttcccccacgacgccctggcaggccccggc
atcccgaggcacgcccggcagatcaacacactcagccacggggaggtggtgtgtgccgtg
accatcagcaaccccacgaggcacgtctacacaggtggcaagggctgcgtgaagatctgg
gacatcagccagccaggcagcaagagccccatctcccagctggactgcctgaacagggac
aattacatccgctcctgcaagctgctccctgatgggcgcacgctcatcgtgggcggcgag
gccagcacgctcaccatctgggacctggcctcgcccacgccccgcatcaaggccgagctg
acgtcctcggctcccgcctgttatgccctggccattagccctgacgccaaagtctgcttc
tcctgctgcagcgatgggaacattgctgtctgggacctgcacaaccagaccctggtcagg
cagttccagggccacacagatggggccagctgcatagacatctcccatgatggcaccaaa
ctgtggacagggggcctggacaacacggtgcgctcctgggacctgcgggagggccgacag
ctacagcagcatgacttcacttcccagatcttctcgctgggctactgccccactggggag
tggctggctgtgggcatggagagcagcaacgtggaggtgctgcaccacaccaagcctgac
aagtaccagctgcacctgcacgagagctgcgtgctctccctcaagttcgcctactgcggc
aagtggttcgtgagcactgggaaagataaccttctcaacgcctggaggacgccttatgga
gccagcatattccagtctaaagaatcctcgtctgtcttgagttgtgacatttcagcggat
gacaaatacattgtaacaggctctggtgacaagaaggccacagtttatgaggtcatctac
taa

>gi568815583r:69950100_70196262|GENSCAN_predicted_peptide_4|212_aa
MAPGSRHWSLLASGLMWRDSRPTRSPQTVGNKGQPPRVLLSQVRNWCSQRAYEWPNVTLP
LGVCMWSQDLILGRSNPKWDNSIKSCHFRLSRPFTASQPVRALDTIRHNKGSSYLFAKVI
HASPCKLDFSICKIVYRVGGGGVSRLVAFGIVWCKHSHGGMMSLNADNQRALQGYTLGPE
FLLLITRRDLINEVLSGAVAGFSSDDRAAGSV

>gi568815583r:69950100_70196262|GENSCAN_predicted_CDS_4|639_bp
atggctcctggcagtagacactggagcctcttggcttcaggcttgatgtggagggacagc
aggcccacgagaagcccccagaccgtagggaataaaggacaaccaccacgggtcctgctt
tcacaggtgagaaactggtgctctcagagagcgtatgagtggcccaatgtcacactacct
ctgggtgtgtgcatgtggagccaggatttgatcctgggcagatcaaaccccaagtgggat
aacagcatcaagtcctgtcatttcaggctctcaaggcctttcacagccagccagccagtc
agggcactggacacaataaggcacaataagggctcaagctacctcttcgccaaggtcatc
catgcctccccatgcaagcttgatttctctatctgtaaaattgtgtatcgggtgggcggt
ggaggggtcagcagattggtagcatttggcattgtatggtgtaaacactctcatggtggc
atgatgtcactgaacgcagacaatcaaagagctctgcaaggctacaccctgggcccagag
ttcctgctgttaataacaaggagagacttaatcaatgaggtcctctcaggagcagttgct
ggcttttcatctgatgacagggcagcaggctcagtgtag

>gi568815583r:69950100_70196262|GENSCAN_predicted_peptide_5|312_aa
XVRPVVHEEKELGKTQARKRGREVDLGWTFNRMYGRVSYGETVQMGSRAFSGRSSECVDL
QLPLCSANKAGIFHQQFLNGWPPSAGPTLQSDLTLSVIKVGSMLTWELVRHAEWILGLTF
HFENEALDPDIKASSVLMVHTEPPRSELSMGHFLEKCHLSISWDNDIEIGFPRPTTKPDS
YAARVDQYTGTDRARVWQEEVGKSHKSSWKKGLEVLIWKIDQLDETISEDLSNTHVFTEG
LFQHSHVACQGLASKKYSLWAVFSRLIPLDLLEEALGINDNWGQNEKKYDCVIFNQRVGT
FKEHRPNPQMSW

>gi568815583r:69950100_70196262|GENSCAN_predicted_CDS_5|939_bp
nnagtgcgtcctgtggtgcatgaagagaaggaactaggcaaaacccaggcaagaaaacga
gggagggaagtggatcttggctggacctttaacaggatgtatgggagggtgagctacggt
gagactgtgcagatgggcagcagggccttctcgggaaggagctccgagtgtgtggaccta
cagctaccactgtgctctgcaaacaaggcaggcatcttccatcagcagttcctaaatggc
tggcctcccagtgcaggccccacactgcagagtgatctcacactctccgtaattaaagtg
ggttctatgctcacctgggaacttgttagacatgcagaatggatattgggccttacgttc
cattttgagaacgaagctctagacccagacatcaaggccagcagtgtgcttatggttcat
acggaaccaccacgctctgaattatccatgggccatttcttggagaaatgtcatttatcc
atttcttgggacaatgacattgaaattggctttcccaggcctacaacaaagccagacagc
tatgcagctagagtagaccagtatactggcactgacagggcgagagtgtggcaggaggag
gttggcaagtcccataaatcttcctggaagaaaggccttgaagtcctcatctggaaaatt
gaccagttggatgaaaccatctctgaggacctctccaatacccatgtgttcacagagggg
ctcttccaacactcgcatgttgcctgccagggcctggcctctaaaaagtatagcctgtgg
gctgtattttcaagactcattcctctggatttgttggaagaagccctcggaattaatgac
aactggggacaaaatgaaaagaagtacgattgtgttattttcaaccaaagagttggcaca
ttcaaagagcacagacccaacccgcagatgagctggtga
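
The exon table and the predicted coding sequences in this report can be cross-checked against each other: for each gene, the lengths of the Init, Intr and Term rows should add up to the length in the matching CDS header (324 bp for gene 1, 792 bp for gene 2). The sketch below is a minimal illustration of that check and is not part of GENSCAN. The names `ROWS`, `CODING_TYPES` and `coding_length_per_gene` are hypothetical, the rows are copied from the table for genes 1 and 2 only, and `Sngl` (GENSCAN's type for single-exon genes, which does not occur in this report) is included as an assumption for completeness.

```python
# Rows for genes 1 and 2, copied from the "Predicted genes/exons" table.
ROWS = """\
1.01 Init + 10646 10776 131 1 2 78 75 54 0.021 2.72
1.02 Intr + 24426 24554 129 0 0 61 37 126 0.036 4.61
1.03 Term + 42219 42282 64 0 1 112 50 48 0.438 0.76
1.04 PlyA + 42725 42730 6 1.05
2.08 PlyA - 43816 43811 6 1.05
2.07 Term - 55651 55460 192 1 0 46 49 156 0.881 4.92
2.06 Intr - 58033 57844 190 0 1 119 61 54 0.817 5.39
2.05 Intr - 69114 69025 90 2 0 88 99 2 0.296 0.41
2.04 Intr - 74401 74298 104 1 2 77 111 50 0.601 5.17
2.03 Intr - 81653 81538 116 0 2 65 110 18 0.124 1.87
2.02 Intr - 81952 81870 83 0 2 87 87 23 0.252 1.38
2.01 Init - 85037 85021 17 2 2 97 94 18 0.361 1.86
2.00 Prom - 90702 90663 40 -6.26
"""

# Feature types that contribute to the coding sequence.
# PlyA and Prom rows are signals, not exons, and are skipped.
CODING_TYPES = {"Init", "Intr", "Term", "Sngl"}


def coding_length_per_gene(text):
    """Sum the coding exon lengths for each gene number in the table text."""
    totals = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 6:
            continue  # blank or malformed line
        gene = int(fields[0].split(".")[0])
        feature_type = fields[1]
        begin, end, length = int(fields[3]), int(fields[4]), int(fields[5])
        # On the minus strand Begin > End, so take the absolute difference.
        assert abs(end - begin) + 1 == length
        if feature_type in CODING_TYPES:
            totals[gene] = totals.get(gene, 0) + length
    return totals


print(coding_length_per_gene(ROWS))  # {1: 324, 2: 792}
```

The totals agree with the `324_bp` and `792_bp` suffixes in the CDS_1 and CDS_2 headers. The same function could be run over the full table to compare genes 3 to 5 with their headers.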