GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:50:55 Sequence gi568815597r:23458963_23659426 : 200464 bp : 49.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5964 6037 74 2 2 71 82 79 0.851 6.14 1.02 Term + 9247 9298 52 1 1 100 48 52 0.759 -0.70 1.03 PlyA + 10365 10370 6 1.05 2.06 PlyA - 10440 10435 6 1.05 2.05 Term - 16698 16615 84 0 0 72 54 42 0.171 -3.15 2.04 Intr - 25190 25043 148 1 1 76 78 325 0.082 30.54 2.03 Intr - 40758 40728 31 1 1 63 95 29 0.004 -1.71 2.02 Intr - 47223 47123 101 2 2 125 45 59 0.968 5.05 2.01 Init - 47697 47660 38 0 2 98 80 111 0.982 8.79 2.00 Prom - 48248 48209 40 -7.96 3.09 PlyA - 49907 49902 6 -0.45 3.08 Term - 51186 50918 269 1 2 82 48 299 0.934 20.96 3.07 Intr - 57565 57373 193 1 1 138 96 160 0.995 20.87 3.06 Intr - 60146 60054 93 2 0 61 78 140 0.913 10.46 3.05 Intr - 62109 61926 184 2 1 64 -24 189 0.784 5.09 3.04 Intr - 63094 62875 220 2 1 137 59 473 0.938 46.66 3.03 Intr - 65526 65421 106 0 1 98 80 29 0.894 2.89 3.02 Intr - 71234 71168 67 1 1 47 91 32 0.343 -1.69 3.01 Init - 71831 71488 344 2 2 56 80 242 0.461 14.93 3.00 Prom - 73402 73363 40 -3.06 4.03 PlyA - 73554 73549 6 1.05 4.02 Term - 100057 99998 60 1 0 99 41 64 0.494 0.60 4.01 Init - 100464 100165 300 0 0 90 115 426 0.928 42.75 4.00 Prom - 105200 105161 40 -4.26 5.00 Prom + 123866 123905 40 -3.56 5.01 Init + 130102 130166 65 0 2 62 61 57 0.018 1.02 5.02 Intr + 148470 148539 70 0 1 106 73 72 0.076 6.68 5.03 Term + 154080 154247 168 2 0 59 47 139 0.939 4.78 5.04 PlyA + 157058 157063 6 1.05 6.03 PlyA - 158572 158567 6 1.05 6.02 Term - 160728 160508 221 1 2 37 43 191 0.963 6.70 6.01 Init - 162169 162106 64 1 1 60 89 19 0.940 0.52 6.00 Prom - 164544 164505 40 -8.16 7.03 PlyA - 164673 164668 6 1.05 7.02 Term - 167443 166948 496 0 1 64 38 1605 0.838 147.14 7.01 Init - 168230 168163 68 2 2 79 84 68 0.543 4.06 7.00 Prom - 168287 168248 40 -4.76 8.05 PlyA - 169530 169525 6 1.05 8.04 Term - 175833 175771 63 0 0 84 42 70 0.249 -0.11 8.03 Intr - 185548 185432 117 1 0 70 90 47 0.735 3.76 8.02 Intr - 188543 188370 174 2 0 48 59 205 0.718 13.74 8.01 Init - 189624 189553 72 0 0 68 90 20 0.806 -0.73 8.00 Prom - 191325 191286 40 -0.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_1|41_aa MQYIIIAKSMGLSPNYLDLNSSSTRQWTQTLKVHGVTSRTL >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_1|126_bp atgcagtacatcatcattgctaaaagtatggggctgtccccaaactacctggatttgaat tccagctccaccaggcaatggacccagaccctgaaggtccatggtgttactagcaggaca ctgtga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_2|133_aa MVATRLASLLAPGFLKGLAAHTLPPTLYISGSNCSPQTSSSSITWKYPSTDPGRENSSTL APAMPEQFSVAEFLAVTAEDLSSPAGAAAFAAKMPRYRGAALAREEVLRHPSGDVKKAVC EFGIQRSGLEIHI >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_2|402_bp atggtggccacacgcctggccagcctgctggctcctggcttcctgaaaggtcttgctgcc cacactctccctcccaccctctacatcagtgggtccaactgtagtcctcagaccagcagc agcagtatcacctggaaatatccttccacagatcctggaagagagaacagctccacgctc gcgcccgccatgccggagcagttcagcgtcgccgagttcctggccgtcaccgcggaggac ctcagctccccggctggggccgccgccttcgccgccaagatgccccggtaccgaggggcg gcgctggcgcgggaggaggtcttgagacatccaagtggagatgtcaagaaggcagtatgt gagtttggaattcaaaggagcgggctggagatacacatttga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_3|491_aa MLQGPRALASAAGQTPKVVPAMSPTELWPSGLSSPQLCPATATYYTPLYPQTAPPAAAPG TCLDATPHGPEGQVVRCLPAGRLPVMLGAPTASPYAFPGVGKEGGVPEIQFEWSRNQYWV PAIVQAKSMAWGHCGTLAKRKLDLEGIGRPVVPEFPTPKGKCIRVDGLPSPKTPKSPGEK TRYDTSLGLLTKKFIYLLSESEDGVLDLNWAAEVLDVQKRRIYDITNVLEGIQLIRKKAK NNIQWVGRGMFEDPTRPGKQQQLGQELKELMNTEQALDQLIQSCSLSFKHLTEDKANKRY PPWLGEGDIRAVGNFKEQTVIAVKAPPQTRLEVPDRTEDNLQIYLKSTQGPIEVYLCPEE VQEPDSPSEEPLPSTSTLCPSPDSAQPSSSTDPSIMEPTASSVPAPAPTPQQAPPPPSLV PLEATDSLLELPHPLLQQTEDQFLSPTLACSSPLISFSPSLDQDDYLWGLEAGEGISDLF DSYDLGDLLIN >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_3|1476_bp atgctgcaagggccccgggccttggcttcggccgctgggcagaccccgaaggtggtgccc gcgatgagccccacagagctgtggccatccggcctcagcagcccccagctctgcccagct actgctacctactacacaccgctgtacccgcagacggcgcctcccgcagcggcgccaggc acctgcctcgacgccactccccacggacccgagggccaagttgtgcgatgcctgccggca ggccggctgccggtaatgctgggggcccccaccgcttccccctatgcttttccaggtgtg ggaaaggagggaggggtgcctgagatccagtttgagtggagcagaaaccagtactgggtg cctgctattgtccaggcaaagagtatggcctggggacactgtgggactttggccaaaagg aagctggatctggaggggattgggaggcccgtcgtccctgagttcccaacccccaagggg aagtgcatcagagtggatggcctccccagccccaaaacccccaaatcccccggggagaag actcggtatgacacttcgctggggctgctcaccaagaagttcatttacctcctgagcgag tcagaggatggggtcctggacctgaactgggccgctgaggtgctggacgtgcagaagcgg cgcatctatgacatcaccaacgtgctggaaggcatccagctcatccgcaagaaggccaag aacaacatccagtgggtaggcaggggaatgtttgaagaccccaccagacctgggaagcag caacagctggggcaggagctgaaggagctgatgaacacggagcaggccttggaccagctc atccagagctgctctctgagcttcaagcacctgactgaggacaaggccaacaagagatat cctccttggttgggggaaggtgatatccgtgctgttggcaactttaaggagcagacagtg attgccgtcaaggcccctccgcagacgagactggaagtgcccgacaggactgaggacaac ctgcagatatatctcaagagcacccaagggcccatcgaagtctacctgtgcccagaggag gtgcaggagccggacagtccttccgaggagcctctcccctctacctccaccctctgcccc agccctgactctgcccagcccagcagcagcaccgaccctagcatcatggagcccacagca tcctcagtgccagcaccagcgccaaccccccagcaggccccaccgcctccatccctggtc cccttggaggctactgacagcctgctggagctgccgcacccactcctgcagcagactgag gaccagttcctgtccccgaccctggcgtgcagctcccctctgatcagcttctccccatcc ttggaccaggacgactacctgtggggcttggaggcgggtgagggcatcagcgatctcttc gactcctacgaccttggggacctgttgattaattga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_4|119_aa MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR GTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTAELTPELVISNDKRSFCH >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_4|360_bp atgaaggcgctgagcccggtgcgcggctgctacgaggcggtgtgctgcctgtcggaacgc agtctggccatcgcccggggccgagggaagggcccggcagctgaggagccgctgagcttg ctggacgacatgaaccactgctactcccgcctgcgggaactggtacccggagtcccgaga ggcactcagcttagccaggtggaaatcctacagcgcgtcatcgactacattctcgacctg caggtagtcctggccgagccagcccctggaccccctgatggcccccaccttcccatccag acagccgagctcactccggaacttgtcatctccaacgacaaaaggagcttttgccactga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_5|100_aa MQREHGPEPLVRFLRKGMGEVGTYLSTVPDFNGTGSIGGQNIQGYETQIVMITGIVVQNN TGDDDNTTIMEGAYFHWLNNYPNAHMEGAWGFKKRLLEAG >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_5|303_bp atgcaaagagagcatggcccagagcctttagtgcggtttttgcggaaaggaatgggcgag gtaggaacatacctgtccacagtccctgacttcaatggcacagggtccattgggggccag aacattcaaggctatgaaacccagattgttatgattactggtattgttgtccaaaacaat actggagacgatgacaatacaacaatcatggagggtgcttatttccactggctcaacaat tatcccaacgcccatatggagggggcctggggcttcaagaagaggcttctcgaggcagga tga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_6|94_aa MDWIGKLDEELRSRPHLGQYPGPSPTPTRAHPPRRRRHRLRTCRACCGEARLCCGGGGGG GGWRGGDADSHSRPYRLLPASPPELRHQALGALP >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_6|285_bp atggattggattggaaagctggatgaggagctgaggagcagacctcacctgggacaatat ccaggtccctcgcccacgcccacgcgggcgcacccgccgcgccgacgccgccaccggctg cgcacctgccgcgcttgctgcggggaagccaggctctgctgtggcggcggcggcggcggc ggcggctggcggggaggagacgcggactcccactcgcggccctatcgcttgctccccgcc tccccgccagagctgcgccaccaggctctgggcgcgctcccatga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_7|187_aa MRLEVQELGPAGGTWQVTFPLKGILIIPIIIPIIIIPIIIITITIIPIIIIPITIIPITI IPTIIIPIITIIPIIIIPITIIPITIIPTIIIPIIIITIIIIPIIITITTIIIIPNITII IIPIIITITITIIIIPITIITITINITIITITIIPIITIIITITIIPIIIILLLITIIVI CMEASLY >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_7|564_bp atgaggctggaagtgcaggaacttggaccggcggggggaacctggcaggtcacattccct ctcaagggaatcctcatcatccccatcatcatccccatcatcatcatccccatcatcatc atcaccatcaccatcatccccatcatcatcatccccatcaccatcatccccatcaccatc atccccaccatcatcatccccatcatcaccatcatccccatcatcatcatccccatcacc atcatccccatcaccatcatccccaccatcatcatccccatcatcatcatcaccatcatc atcatccccatcatcatcaccatcaccaccatcatcatcatccccaacatcaccatcatc atcatccccatcatcatcaccatcaccatcaccatcatcatcatccccatcaccatcatc accatcaccatcaacatcaccatcatcaccatcaccatcatccccatcatcaccatcatc atcaccatcaccatcatccctatcattatcatcctcctcctgatcaccataattgtcatc tgcatggaagctagcctatattga >gi568815597r:23458963_23659426|GENSCAN_predicted_peptide_8|141_aa MQENRHSRRAWWLMPVIPALWEAELSTDAGAALQVGSYLQDQQQDKVTRVQHSGCGHQKA QQGRYQKPYVQGSLPAKIMAVKAPFPNTVAMGLKAATCDSGEDTIQSTAGAMEGFKQGGP RCSIQNGQLSSLSFPEAFTGS >gi568815597r:23458963_23659426|GENSCAN_predicted_CDS_8|426_bp atgcaggaaaacaggcactctcgccgggcgtggtggctcatgcctgtaatccccgcactt tgggaggctgagttgagcacagatgcaggggctgcactccaggtgggctcctatctgcag gaccagcagcaggacaaggtcaccagggtccagcacagcggctgtggacaccagaaggca cagcaaggaagataccagaagccctatgtccagggctcgcttccagccaaaatcatggca gtcaaggccccttttccaaacacagttgcaatgggacttaaagctgcaacatgtgattct ggggaggacacaattcagtccacagcaggggccatggaaggttttaagcagggaggtcct aggtgttccatccagaatggacagctgtcctcactgtccttccctgaagccttcactggc tcctaa