GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:25:59 Sequence gi568815586r:110620137_110842707 : 222571 bp : 46.39% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8631 8782 152 1 2 59 83 128 0.476 9.28 1.02 Intr + 12336 12423 88 2 1 97 110 -7 0.449 1.94 1.03 Intr + 14534 14643 110 0 2 18 90 86 0.397 1.80 1.04 Intr + 16345 16365 21 0 0 95 116 5 0.396 1.74 1.05 Intr + 20247 20381 135 2 0 74 92 127 0.278 12.46 1.06 Intr + 20888 21013 126 1 0 71 110 29 0.943 4.28 1.07 Intr + 24856 24993 138 0 0 35 98 149 0.810 11.26 1.08 Intr + 27075 27200 126 2 0 78 80 40 0.807 3.08 1.09 Term + 27613 27756 144 0 0 89 45 59 0.893 -0.39 1.10 PlyA + 28444 28449 6 -0.45 2.15 PlyA - 28666 28661 6 1.05 2.14 Term - 29612 29466 147 2 0 104 42 29 0.517 -2.20 2.13 Intr - 30144 30032 113 1 2 71 116 65 0.859 7.70 2.12 Intr - 31312 31081 232 1 1 147 88 314 0.999 34.65 2.11 Intr - 35202 35098 105 0 0 123 54 217 0.961 22.21 2.10 Intr - 41312 41028 285 2 0 99 96 188 0.772 18.24 2.09 Intr - 68572 68489 84 1 0 115 62 6 0.011 0.72 2.08 Intr - 77199 77043 157 2 1 109 61 88 0.285 8.21 2.07 Intr - 78378 78243 136 1 1 58 87 113 0.963 7.83 2.06 Intr - 79182 79020 163 0 1 98 84 89 0.125 9.05 2.05 Intr - 102133 101999 135 1 0 47 80 97 0.577 5.56 2.04 Intr - 102559 102336 224 2 2 65 72 133 0.966 7.25 2.03 Intr - 104628 104524 105 1 0 104 92 57 0.995 7.89 2.02 Intr - 110623 110393 231 2 0 75 101 137 0.987 11.44 2.01 Init - 122571 122517 55 0 1 82 116 129 0.966 16.55 2.00 Prom - 127322 127283 40 -4.06 3.03 PlyA - 127345 127340 6 1.05 3.02 Term - 134632 134556 77 2 2 -7 47 168 0.793 1.20 3.01 Init - 137857 137797 61 1 1 71 88 111 0.970 8.82 3.00 Prom - 148540 148501 40 -7.06 4.10 PlyA - 149367 149362 6 1.05 4.09 Term - 149539 149459 81 1 0 92 38 66 0.279 -0.31 4.08 Intr - 159141 158953 189 0 0 117 89 35 0.439 6.28 4.07 Intr - 164267 164195 73 1 1 63 67 71 0.102 1.91 4.06 Intr - 175026 174940 87 2 0 49 47 114 0.034 2.49 4.05 Intr - 176929 176886 44 1 2 66 82 25 0.018 -3.26 4.04 Intr - 191882 191640 243 2 0 117 76 84 0.611 7.79 4.03 Intr - 196622 196580 43 1 1 79 68 29 0.010 -1.76 4.02 Intr - 206749 206674 76 2 1 57 67 80 0.019 1.37 4.01 Init - 221808 221577 232 0 1 77 69 357 0.489 31.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:110620137_110842707|GENSCAN_predicted_peptide_1|346_aa XKPALSFINPEVPDENNFDTLMKTSDGFTLNAESYVSFTTKLDIPTAAKYEYGVPLQTSD SFLRFPSSLTSSLCTDNNPAAFLVNQAVKCTRKINLEQCEEIEALSMAFYSSPEILRVPD SRKKVPITVQSIVIQSLNKTLTRREDTDVLQPTLVNAGHFSLCVNVVLEVKYSLTYTDAG EVTKADLSFVLGTVSSVVVPLQQKFEIHFLQLVAQKVKSLLWGQGFPDYVAPFGNSQAQD MLDWVPIHFITQSFNRKDSCQLPGALVIEVKWTKYGSLLNPQAKIVNVTANLISSSFPEA NSGNERTILISTAVTFVDVSAPAEAGFRAPPAINARLPFNFFFPFV >gi568815586r:110620137_110842707|GENSCAN_predicted_CDS_1|1041_bp nataaacctgcattatcctttattaatccagaagtacctgatgaaaacaattttgataca ttgatgaaaacatctgatggttttacattgaatgctgaatcatatgtttccttcacaacc aaactggatattcctactgctgctaaatatgagtatggggttcctctgcagacttcagat tcgtttctgagatttccttcgtccctgacatcatctctgtgcactgataataaccctgca gcgtttctggtgaaccaggctgttaagtgcaccagaaaaataaatttagaacagtgtgaa gaaattgaagccctcagcatggctttttacagcagcccggaaattctgagggtacctgat tcaagaaaaaaggtccctatcactgttcagtccatcgtcattcagtctctaaataaaacg ctcacccgacgggaggacactgatgtgctgcagccgactctcgtcaacgctggacacttt agcctttgcgtgaatgttgttcttgaggtaaagtacagcctcacatacacagatgcaggt gaagtcaccaaagctgatctctcattcgttctggggacagttagcagcgtagtggtccca ctgcagcaaaagtttgaaattcattttcttcagctcgtagcacagaaggtgaagagcctg ctgtggggccagggcttcccagattacgtggccccttttggaaattcccaggcccaggac atgctggactgggtgcccatccacttcatcacccagtcattcaacaggaaggattcctgc cagctcccaggggctttggttatagaagtgaagtggactaaatacggatccctgctgaat ccacaggccaaaatagtcaatgtaactgcaaatctaatttcatcctcctttcctgaggcc aactcaggaaatgaaaggacgattcttatttccactgcggttacttttgtggatgtgtct gcacctgcagaggcaggcttcagagctccaccagccatcaatgccaggctgccctttaac ttcttcttcccgtttgtttga >gi568815586r:110620137_110842707|GENSCAN_predicted_peptide_2|723_aa MADLDKLNIDSIIQRLLEGDIHGQYYDLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETI CLLLAYKIKYPENFFLLRGNHECASINRIYGFYDECKRRYNIKLWKTFTDCFNCLPIAAI VDEKIFCCHGGLSPDLQSMEQIRRIMRPTDVPDQGLLCDLLWSDPDKDVLGWGENDRGVS FTFGAEVVAKFLHKHDLDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEFDNAGAMMS VDETLMCSFQAHPLAGRGGCLDEQFHQGPSECAENTKCALGCSQDPAKPSKAQQIPAKVK SGQVLLQRLRKPLVGLEGGTEVWRALSTSNQQVEQGDAAKPLSVRLCSLNLFLTCPGPSL VLNPFVNTLEGDHSPGLVVNTSCALMTPKPLCQIPDSDPTAQVKKPRRREAKPLALGHTA KGGRARTHNQAVTRRAKVAPAERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQP PPTPVSGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVIIICLVVLDALLVL AELILDLKIIQPDKNNYAAMVFHYMSITILVFFMMEIIFKLFVFRLEFFHHKFEILDAVV VVVSFILDIVLLFQEHQFEALGLLILLRLWRVARIINGIIISVKTRSERQLLRLKQMNVQ LAAKIQHLEFSCSEKIQPRRHILPEAVPTPTSQGSARSPTFVFSLLSLLLSEAYLIVAVS SAD >gi568815586r:110620137_110842707|GENSCAN_predicted_CDS_2|2172_bp atggcggatttagataaactcaacatcgacagcattatccaacggctgctggaaggtgac atccatggacaatactatgatttgctgcgactttttgagtacggtggtttcccaccagaa agcaactacctgtttcttggggactatgtggacaggggaaagcagtcattggagacgatc tgcctcttactggcctacaaaataaaatatcctgagaatttttttcttctcagagggaac catgaatgtgccagcatcaacagaatttatggattttatgatgaatgtaaaagaagatac aacattaaactatggaaaactttcacagactgttttaactgtttaccgatagcagccatc gtggatgagaagatattctgctgtcatggaggtttatcaccagatcttcaatctatggag cagattcggcgaattatgcgaccaactgatgtaccagatcaaggtcttctttgtgatctt ttgtggtctgaccccgataaagatgtcttaggctggggtgaaaatgacagaggagtgtcc ttcacatttggtgcagaagtggttgcaaaatttctccataagcatgatttggatcttata tgtagagcccatcaggtggttgaagatggatatgaattttttgcaaagaggcagttggtc actctgttttctgcgcccaattattgcggagagtttgacaatgcaggtgccatgatgagt gtggatgaaacactaatgtgttcttttcaggcccaccccttggcaggaagagggggatgc ctggatgagcagttccaccagggacccagcgagtgtgccgagaacacgaaatgtgccctg ggctgcagccaagacccagcaaagcccagcaaagcccagcaaatcccagcaaaggtcaag tctggccaagtcttgctgcagcggctgcggaagcctctagttggcctggagggcggaacc gaggtgtggagagcactttccactagcaaccagcaggtggaacaaggagatgctgctaaa cccttgagtgtaaggctctgctcactaaacctcttcctcacctgcccaggccctagcttg gtcctgaacccctttgtcaacactctggaaggggatcacagcccaggccttgttgtcaat accagttgtgcactgatgacgcccaagcctctctgccaaattcctgactcagatcctact gcccaggtgaagaaaccaagacgcagagaggccaagccccttgccttgggtcacacagcc aaaggaggcagagccagaactcacaaccaggcagtcacccgcagggccaaggtggctccc gctgagaggatgagcaagttcttaaggcacttcacggtcgtgggagacgactaccatgcc tggaacatcaactacaagaaatgggagaatgaagaggaggaggaggaggaggagcagcca ccacccacaccagtctcaggcgaggaaggcagagctgcagcccctgacgttgcccctgcc cctggccccgcacccagggccccccttgacttcaggggcatgttgaggaaactgttcagc tcccacaggtttcaggtcatcatcatctgcttggtggttctggatgccctcctggtgctt gctgagctcatcctggacctgaagatcatccagcccgacaagaataactatgctgccatg gtattccactacatgagcatcaccatcttggtcttttttatgatggagatcatctttaaa ttatttgtcttccgcctggagttctttcaccacaagtttgagatcctggatgccgtcgtg gtggtggtctcattcatcctcgacattgtcctcctgttccaggagcaccagtttgaggct ctgggcctgctgattctgctccggctgtggcgggtggcccggatcatcaatgggattatc atctcagttaagacacgttcagaacggcaactcttaaggttaaaacagatgaatgtacaa ttggccgccaagattcaacaccttgagttcagctgctctgagaagatccagcccaggcgt cacatcctccctgaagctgtgcccacccccacatcccagggctctgccaggtcccccacg tttgtcttctcactgctgtcactgctgctcagtgaagcctatctgattgtagcagtctcc tcagcagactga >gi568815586r:110620137_110842707|GENSCAN_predicted_peptide_3|45_aa MITVHRILDFWAQAILLLRLQRTCKEEEEEEEEEEKARRKKEEEV >gi568815586r:110620137_110842707|GENSCAN_predicted_CDS_3|138_bp atgatcactgttcaccgcatccttgacttctgggctcaagcgatcctcctgcttcggctt cagagaacttgcaaagaagaagaagaagaggaggaggaggaggagaaggcaaggaggaag aaggaagaagaagtttga >gi568815586r:110620137_110842707|GENSCAN_predicted_peptide_4|355_aa MRFAKKHNKKSLKKMQANNAKAMGARAEAIKALAKPEEAKPKIPKGISHKLDRPACITHS KLGKLHQPACITPLECSGNAIHVGAPGGATYTDNTVNAIPRVLVSNEFLVDFTWLSQATS KEPHMEGFANQQRERFIQIFAKGRASSEPLTNSSLHDLKYNLLKTIFWEGRQRKKETKGQ GGGSQDRIEEDPGNQKAPSLTIIHLLTDMSAGRECGHRPGGSDTAVEQLDACVGERSRTD VGNHRKPSLRKDDIVERDVVRSNAETSLKIGARQQDSEYKVKSHLEISRPPDVQRIQTEQ LDHLTASLFSPRTSAAGVSHTAYLHEDTQTVQGPAEKALGGNDLSANELETSTRS >gi568815586r:110620137_110842707|GENSCAN_predicted_CDS_4|1068_bp atgcgctttgccaagaagcacaacaaaaagagcctgaagaagatgcaggccaacaatgcc aaggccatgggtgcacgtgccgaggctatcaaggccctggcaaagcccgaggaggctaag cccaagatcccaaagggcatcagccacaagcttgatcgacctgcctgcatcacccactcc aagcttgggaagctccatcaacctgcctgcatcaccccactggagtgcagtggaaacgcc attcatgttggtgcaccagggggcgccacttacacagacaatacagtgaacgccatcccc agggtccttgtcagcaacgaattcctagtggattttacctggctatctcaggcaacaagc aaagagccacacatggagggctttgcaaaccaacagcgtgagaggttcatccagattttt gccaagggcagagctagctctgagcctctaaccaactcatcattacatgacttaaaatac aatttgctgaagacaatcttctgggaagggagacagagaaagaaagaaaccaagggtcaa ggaggaggcagccaagacaggatagaggaggaccctggaaaccagaaggctccttccctg accattatccacctgctaactgacatgtcagccggaagagaatgtggacacaggcctggg ggctctgacacagctgtggaacagctggacgcctgcgtgggtgagaggtcgagaacggat gtgggcaatcacagaaagcccagcttgcgaaaagatgacattgtggaaagggatgtagtg cggtctaacgctgaaactagtctcaagattggggccaggcagcaagacagtgaatataaa gtgaaaagtcacctggagatttccaggccaccagatgtgcaacgtattcagacagaacag ctggatcatttaacagcttccttgttttcccccagaacatccgctgcgggggtgtctcac acagcatatctacatgaagacacgcagacagtacaagggcctgcagaaaaggctcttgga ggcaatgatctctctgcaaatgaattggagaccagcacaagatcctag