GENSCAN 1.0	Date run:  6-Nov-116	Time: 23:15:07

Sequence gi568815597r:31265889_31473014 : 207126 bp : 45.24% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -   1422   1417    6                               1.05
 1.07 Term -   3560   3439  122  0  2  101   44    48 0.828   0.34
 1.06 Intr -   5611   5491  121  1  1  102   93    91 0.989  11.07
 1.05 Intr -  15608  15486  123  2  0   87   95   205 0.999  21.88
 1.04 Intr -  23531  23366  166  1  1  101   98    64 0.984   8.66
 1.03 Intr -  26118  26025   94  1  1   77  109   101 0.975  10.12
 1.02 Intr -  27460  27331  130  1  1   84   45   108 0.932   6.37
 1.01 Init -  30863  30723  141  2  0   90  115    89 0.936  11.93
 1.00 Prom -  53115  53076   40                              -3.06

 2.00 Prom +  56733  56772   40                              -6.66
 2.01 Init +  58872  58932   61  2  1   83   16   120 0.607   5.71
 2.02 Intr +  71287  71387  101  2  2  106   88    37 0.827   5.33
 2.03 Intr +  73069  73160   92  0  2   93   80    61 0.882   4.59
 2.04 Intr +  80752  80852  101  1  2   62   97    29 0.879   0.95
 2.05 Intr +  82941  83086  146  1  2  116   63    90 0.983   9.30
 2.06 Term +  98144  98305  162  1  0   83   54   203 0.987  14.34
 2.07 PlyA +  99593  99598    6                              -1.95

 3.07 PlyA -  99788  99783    6                               1.05
 3.06 Term - 100051  99998   54  1  0  123   43    35 0.195   0.06
 3.05 Intr - 101606 101505  102  2  0   82   99    98 0.839  10.67
 3.04 Intr - 102282 102148  135  0  0   48   27   156 0.982   6.26
 3.03 Intr - 103669 103497  173  2  2  114  115   136 0.978  18.56
 3.02 Intr - 107138 107054   85  2  1   35   97   147 0.821   9.69
 3.01 Init - 108085 107963  123  1  0  106   39    94 0.772   6.49
 3.00 Prom - 111727 111688   40                              -6.26

 4.00 Prom + 115935 115974   40                              -2.96
 4.01 Init + 120042 120114   73  2  1   39   94     3 0.152  -2.63
 4.02 Intr + 121147 122153 1007  2  2   51   56   493 0.267  32.86
 4.03 Term + 141296 141427  132  1  0  106   47   104 0.725   6.19
 4.04 PlyA + 142106 142111    6                              -0.45

 5.00 Prom + 145807 145846   40                              -6.46
 5.01 Init + 149008 149086   79  0  1   70   98    20 0.225   2.47
 5.02 Intr + 149954 150019   66  0  0   69   98    41 0.310   2.08
 5.03 Intr + 155233 155442  210  2  0   12   42   166 0.400   3.28
 5.04 Intr + 157531 157750  220  2  1   53   69    54 0.378  -2.64
 5.05 Intr + 157846 157966  121  1  1   84  113   178 0.996  20.40
 5.06 Intr + 158795 158985  191  1  2  118   72   284 0.993  28.18
 5.07 Intr + 159442 159521   80  1  2   90   22   144 0.998   7.19
 5.08 Intr + 159888 160025  138  1  0   94   82   277 0.999  28.14
 5.09 Intr + 160766 160935  170  0  2   75   99   304 0.994  29.87
 5.10 Intr + 163063 163180  118  0  1   44  119   132 0.824  11.94
 5.11 Intr + 163509 163650  142  1  1   71   80   143 0.975  11.21
 5.12 Intr + 167079 167297  219  0  0   94   51   371 0.534  31.42
 5.13 Term + 168176 168311  136  2  1   30   54   210 0.976   9.29
 5.14 PlyA + 168766 168771    6                               1.05

 6.03 PlyA - 174076 174071    6                               1.05
 6.02 Term - 175975 175848  128  2  2   93   41    61 0.616   0.34
 6.01 Init - 181712 181469  244  2  1   80   75   147 0.676  10.50
 6.00 Prom - 184464 184425   40                              -2.96

 7.00 Prom + 188907 188946   40                              -3.56
 7.01 Init + 192813 192897   85  2  1   40  116     6 0.305  -0.42
 7.02 Intr + 196739 196950  212  0  2  104   79    49 0.618   4.23
 7.03 Term + 198597 198731  135  2  0    0   47   181 0.949   3.22
 7.04 PlyA + 199553 199558    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_1|298_aa
MIEQQKRKGPELPLVPVKRQRHELLLGAGSGPGAGQQQATPGALLQAGPPRCSSLQAPIM
LLSGHEGEVYCCKFHPNGSTLASAGFDRLILLWNVYGDCDNYATLKGHSGAVMELHYNTD
GSMLFSASTDKTVAVWDSETGERVKRLKGHTSFVNSCYPARRGPQLVCTGSDDGTVKLWD
IRKKAAIQTFQNTYQVLAVTFNDTSDQIISGGIDNDIKVWDLRQNKLTYTMRGHADSVTG
LSLSSEGSYLLSNAMDNTASPSYWDLDVFSCLKKKRALTGSSMSSWTLPTEETASAID

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_1|897_bp
atgatagaacagcagaagcgtaagggcccagagttgccgctggttccagtcaagcggcag
cggcatgagttgctgttgggagcggggtctggcccaggagccgggcagcagcaggcgacg
ccgggagccttgctgcaagcgggacctccaagatgttcctcccttcaagccccaatcatg
ctgctctctggacatgaaggggaagtctactgctgcaagttccaccccaacggatccacc
ttagcatctgcaggatttgaccgactgatattactgtggaatgtctatggtgactgtgat
aactatgccacactgaagggacacagtggagcagtgatggaattgcattacaacacagat
ggcagtatgcttttctcagcatccacagataaaaccgtggctgtgtgggatagtgaaaca
ggtgagagggttaaaaggctaaagggacatacttcctttgtgaattcctgttatccagcc
aggagaggccctcagcttgtctgcactggcagtgacgatggcacagttaagctttgggac
atccggaagaaagcagccatccagacatttcagaacacgtaccaggtgttagctgtgacc
ttcaatgacacaagtgatcagattatttctggtggaatagacaatgatatcaaggtctgg
gacctgcgccagaacaagctaacctacaccatgagaggccatgcagattcagtgactggc
ctgagtttaagttctgaaggctcttatcttttgtccaatgcaatggacaatacagctagc
cccagttattgggatttagatgtgttttcctgcttgaaaaagaaaagagccttaacaggt
agcagtatgtcaagttggactttgcccaccgaggaaacagcatctgccattgattga

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_2|220_aa
MGLQVPLGTEQLGVMDNMIDGLVHRTHMSSCRVDKPSEIVDVGDKVWVKLIGREMKNDRI
KVSLSMKVVNQGTGKDLDPNNVIIEQEERRRRSFQDYTGQKITLEAVLNTTCKKCGCKGH
FAKDCFMQPGGTKYSLIPDEEEEKEEAKSAEFEKPDPTRNPSRKRKKEKKKKKHRDRKSS
DSDSSDSESDTGKRARHTSKDSKAAKKKKKKKKHKKKHKE

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_2|663_bp
atgggcctgcaggtgcccctgggcacagaacaactgggcgtcatggacaacatgattgat
ggtctggtccatcgaactcatatgtcatcctgtcgggtggataagccctctgagatagta
gatgttggagataaagtgtgggtgaagcttattggccgagagatgaaaaatgatagaata
aaagtatccctctccatgaaggttgtcaatcaagggactgggaaagaccttgatcccaac
aatgttatcattgagcaagaagagaggcggaggcgatccttccaggattacactgggcag
aagatcacccttgaggctgtcttgaacactacctgcaagaagtgtggctgtaaaggccac
tttgcaaaagattgtttcatgcaaccaggtgggactaaatactctctgatacctgatgag
gaagaggaaaaggaagaggcaaagtcagcagagtttgagaagcctgaccctacaaggaat
ccttctagaaaaagaaagaaggagaagaagaaaaagaaacatagagataggaagtcatct
gactctgacagctcagactctgagagtgatacaggcaagagggcaaggcacacatcaaaa
gacagcaaggcagcaaagaagaagaaaaagaagaagaagcacaagaagaagcacaaggag
tga

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_3|223_aa
MGLEELTELAEFLQEGSWERPELKLGAVRRNRKSPEHLQGRPSITMVDAFLGTWKLVDSK
NFDDYMKSLGVGFATRQVASMTKPTTIIEKNGDILTLKTHSTFKNTEISFKLGVEFDETT
ADDRKVKERELEELPNENQLMYVCWQHPELRNHFKGIQSQDFVVAALLLAKESIVTLDGG
KLVHLQKWDGQETTLVRELIDGKLILTLTHGTAVCTRTYEKEA

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_3|672_bp
atgggccttgaagagctgaccgaattggcagaatttctgcaggaggggagctgggaacga
cctgagctaaagctcggagctgtgcgaagaaaccggaaaagcccagagcacttgcagggg
cggcccagcatcactatggtggacgctttcctgggcacctggaagctagtggacagcaag
aatttcgatgactacatgaagtcactcggtgtgggttttgctaccaggcaggtggccagc
atgaccaagcctaccacaatcatcgaaaagaatggggacattctcaccctaaaaacacac
agcaccttcaagaacacagagatcagctttaagttgggggtggagttcgatgagacaaca
gcagatgacaggaaggtcaaggaaagagagctggaagagctgccaaatgagaaccagctg
atgtatgtatgctggcagcacccagagctgaggaaccacttcaagggcatccagtcacag
gactttgtggttgctgccctcttgttggctaaagagtccattgtgacactggatggaggg
aaacttgttcacctgcagaaatgggacgggcaagagaccacacttgtgcgggagctaatt
gatggaaaactcatcctgacactcacccacggcactgcagtttgcactcgcacttatgag
aaagaggcatga

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_4|403_aa
MLLDPGVSDCTWHQRLGQRPAAGPDTSTQRTSPSTPRHIPTDDLTHTPRHFHTDDITHTP
RPIHTDDLTIHTPRHIHTDDLTHTPRHNHTDDLTHTPRHIHTDDHTIDTPRHIHTDDHTI
DTPRHIHTDDHTIDTPRHIHTDDLTIHTQTYPHRGPHHPHADTSPQTSLSTPRHIHTDDL
TIHTQTHPHRGPHHPHPDTSPQRTSPSTPRHVPTEDLTIHTQTHPHRRPHHPHPDTSPRM
TSPSTPRHIPTDDLTIHTPRHIHTDDLTHTPRHIHTDDLTHIPRHVHTDDLIHTPRHIHA
DDLTHTSRHIHMDDLTHISSPHTWHQRLGQRPAAGPWWTIHTQPLHRRLQNWQDPVQSEN
ETASHTLAAGRMTPLGLSVSKPQVTEFLVPYAALGLVAALPRS

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_4|1212_bp
atgctcctagacccaggagtcagtgactgcacctggcatcagagactggggcagaggcct
gcagctggcccagacacatccacacagaggacctcaccatccacacccagacacatcccc
acggacgacctcacccacacccccagacacttccacacagatgacatcacccacaccccc
aggcccatccacacagatgacctcaccatccacacccccagacacatccacacagacgac
ctcacccacacccccaggcacaaccacacagacgacctcacccacacccccagacacatc
cacacagatgaccacaccatcgacacccccagacacatccacacagatgaccacaccatc
gacacccccagacacatccacacagatgaccacaccatcgatacccccagacacatccac
acagatgacctcaccatccacactcagacatatccccacagaggacctcaccatccacac
gcagacacatccccacagacctcactatccacacccagacacatccacacagacgacctc
accatccacacccagacacatccccacagaggacctcaccatccacacccagacacatcc
ccacagaggacctcaccatccacacccagacacgtccccacagaggacctcaccatccac
acccagacacatccacacagacgacctcaccatccacacccagacacatccccacggatg
acctcaccatccacacccagacacatccccacggatgacctcaccatccacacccccaga
cacatccacacagatgacctcacccacaccccgagacacatccacacggacgacctcacc
cacatccccagacacgtccacacagacgacctcatccacacccccagacacatccatgca
gatgatctcacccacacatccagacacatccacatggacgacctcacccacatatcctca
ccacatacgtggcatcagagactagggcagaggcctgcagctggcccatggtggaccata
cacacacagcccctacacagaaggctacaaaattggcaggatccagtgcaaagtgaaaat
gaaacagcctcccacactctcgctgctggccggatgacgccgcttggcctgagcgtgtcc
aagcctcaggtgacagagttcctggtcccttatgctgccctggggctggtggctgccctt
cctcgatcctaa

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_5|629_aa
MLLGDEAQFLAYRECSLWELLLTSRKGVTHLLIVLLSIGTGLKVNWAEVAALDSHRSANP
IVNCACEGSKLRASYENLIPDDQRWNSFIPKPYLPPASTPAAICGKSVFPEIGSWYQKER
NSVTSASSRFQATVSLEIDPGLNLGSVVTGCVALYDPASLSDLKVQWAGVWGTVCSCLAQ
HRCADLWAGGESCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQLYKLPWVCEEG
AGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFTLLMLCVSSSRDPRAAIQNGFWFF
KFLILVGLTVGAFYIPDGSFTNIWFYFGVVGSFLFILIQLVLLIDFAHSWNQRWLGKAEE
CDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAV
LPKVQGSYQGSSCQDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNET
VVAGPEGYETQWWDAPSIVGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQ
QQVAACEGRAFDNEQDGVTYSYSFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVW
VKICASWAGLLLYLWTLVAPLLLRNRDFS

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_5|1890_bp
atgctgttgggagatgaggcacagttcctggcatatcgtgagtgctccttgtgggaatta
ttgctaacatcaaggaaaggtgtgactcatctgctgattgtgcttctcagcattggaact
gggctgaaggtgaactgggctgaagtggcagcattagattctcataggagtgcgaaccct
attgtgaactgtgcatgcgagggatccaagttgcgtgcttcttatgagaatctaattcct
gatgatcagaggtggaacagtttcatcccaaaaccatatctgccccctgccagcacccct
gctgccatctgtggaaaaagtgtcttccctgaaattggctcctggtaccagaaagaaaga
aacagcgtgacctcggctagctctcgctttcaagccacagtatcactggagatagaccca
ggtttgaatctgggttctgtggttactggctgtgtagccctatatgacccagcctccctc
tcggacctgaaagttcaatgggctggtgtctggggaacagtgtgctcctgcctagcgcag
cacagatgcgccgacctctgggcaggtggtgagagctgctgccccgccagccgcaactcc
accgtgagccgcctcatcttcacgttcttcctcttcctgggggtgctggtgtccatcatt
atgctgagcccgggcgtggagagtcagctctacaagctgccctgggtgtgtgaggagggg
gccgggatccccaccgtcctgcagggccacatcgactgtggctccctgcttggctaccgc
gctgtctaccgcatgtgcttcgccacggcggccttcttcttctttttcaccctgctcatg
ctctgcgtgagcagcagccgggacccccgggctgccatccagaatgggttttggttcttt
aagttcctgatcctggtgggcctcaccgtgggtgccttctacattcctgacggctccttc
accaacatctggttctacttcggcgtcgtgggctccttcctcttcatcctcatccagctg
gtgctgctcatcgactttgcgcactcctggaaccagcggtggctgggcaaggccgaggag
tgcgattcccgtgcctggtacgcaggcctcttcttcttcactctcctcttctacttgctg
tcgatcgcggccgtggcgctgatgttcatgtactacactgagcccagcggctgccacgag
ggcaaggtcttcatcagcctcaacctcaccttctgtgtctgcgtgtccatcgctgctgtc
ctgcccaaggtccaggggtcttaccaggggtcctcttgccaggacgcccagcccaactcg
ggtctgctgcaggcctcggtcatcaccctctacaccatgtttgtcacctggtcagcccta
tccagtatccctgaacagaaatgcaacccccatttgccaacccagctgggcaacgagaca
gttgtggcaggccccgagggctatgagacccagtggtgggatgccccgagcattgtgggc
ctcatcatcttcctcctgtgcaccctcttcatcagtctgcgctcctcagaccaccggcag
gtgaacagcctgatgcagaccgaggagtgcccacctatgctagacgccacacagcagcag
cagcaggtggcagcctgtgagggccgggcctttgacaacgagcaggacggcgtcacctac
agctactccttcttccacttctgcctggtgctggcctcactgcacgtcatgatgacgctc
accaactggtacaagcccggtgagacccggaagatgatcagcacgtggaccgccgtgtgg
gtgaagatctgtgccagctgggcagggctgctcctctacctgtggaccctggtagcccca
ctcctcctgcgcaaccgcgacttcagctga

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_6|123_aa
MKEMQIKITRYFYTPALDLGSPLTEVRDIGYLSIMALDTFTTYTFVWLLDLRPPPPPDLS
PLMASVTSAADPCVNKACYRIGFPMAAAESTNVHSDGQPVNQETRRLLNTFSLSGSVLGT
GKL

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_6|372_bp
atgaaggagatgcagattaaaatcacaagatacttctatacccctgctttggacctcggc
tcacccctaacggaagtgcgtgacattggctatctttctatcatggcccttgacacattt
acaacgtacacgtttgtgtggctactggatttacgtccacctcctccaccagatctcagc
cccctaatggcaagtgtgacatctgctgctgatccttgtgtaaacaaggcctgctataga
ataggctttcccatggctgctgcagaatcaactaatgttcattcagatggtcagccagtc
aatcaagaaacccgacgcttgctgaataccttctctttatcaggctctgttctaggcact
gggaaactataa

>gi568815597r:31265889_31473014|GENSCAN_predicted_peptide_7|143_aa
MNKPGLLATRGKKGLQHCCDQVILCTYQEKPFHLFVNVDNGVALGVLTQEHGGCRQPVAF
LSKVLDPVACRWPQRIQSIVATVILVEESRKLTFGGYLTNFQYTSTSLGTIKGRKEAKLE
PAWEGPYLVLLTTETAIGTAEKG

>gi568815597r:31265889_31473014|GENSCAN_predicted_CDS_7|432_bp
atgaataaacccgggctcttagcaacacggggaaagaagggcttgcaacactgctgtgac
caggtaattctgtgcacataccaagaaaaaccatttcatctttttgttaacgtagataat
ggagtggctctaggagtgcttacccaagaacatgggggctgccgacagccagtggccttc
ctgtcaaaagttctagatccagtcgcctgcagatggcctcagcgtattcaatccattgtg
gccacggtgatattggttgaagaaagcaggaaattaacttttggaggatatctgacaaat
ttccagtacaccagcaccagcctggggaccatcaaaggacggaaagaagcaaaactcgag
ccagcctgggaaggaccctacctcgtgctgctaaccaccgagactgccattggcacagca
gaaaaaggatga