GENSCAN 1.0    Date run: 6-Nov-116    Time: 12:00:33

Sequence gi568815595r:112366111_112599358 : 233248 bp : 38.16% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2332   2446  115  1  1   26   75    90 0.327   1.03
 1.02 Term +   3493   3657  165  0  0   22   42   208 0.908   6.53
 1.03 PlyA +   3694   3699    6                               1.05

 2.00 Prom +   4963   5002   40                              -3.45
 2.01 Init +   7967   8015   49  1  1   49   94    51 0.412   2.96
 2.02 Term +   9431   9741  311  0  2   86   42   104 0.209  -0.16
 2.03 PlyA +  11387  11392    6                               1.05

 3.03 PlyA -  14526  14521    6                               1.05
 3.02 Term -  15416  15291  126  2  0   83   44   121 0.912   4.50
 3.01 Init -  15846  15796   51  0  0   47   98    42 0.558   2.31
 3.00 Prom -  16472  16433   40                              -6.65

 4.00 Prom +  19198  19237   40                              -4.45
 4.01 Sngl +  22947  23384  438  2  0   67   48   293 0.979  19.21
 4.02 PlyA +  23536  23541    6                               1.05

 5.00 Prom +  30875  30914   40                              -5.55
 5.01 Init +  33487  33573   87  0  0   39   96    78 0.626   4.39
 5.02 Intr +  34003  34095   93  0  0  105   85    32 0.767   3.84
 5.03 Term +  34674  34748   75  2  0   83   48    83 0.782   0.66
 5.04 PlyA +  35474  35479    6                               1.05

 6.02 PlyA -  35920  35915    6                               1.05
 6.01 Sngl -  47923  47591  333  1  0   44   43   267 0.969  13.57
 6.00 Prom -  55261  55222   40                              -3.95

 7.11 PlyA -  56866  56861    6                               1.05
 7.10 Term -  65893  65666  228  1  0  110   44   132 0.596   6.65
 7.09 Intr -  71240  71170   71  0  2   56  105    18 0.002  -1.72
 7.08 Intr -  71495  71312  184  2  1   54   55   125 0.003   4.24
 7.07 Intr -  93907  93725  183  1  0   76   50    88 0.364   2.86
 7.06 Intr - 100273 100010  264  1  0  121   28   116 0.851   5.69
 7.05 Intr - 103694 103648   47  0  2  137  107    26 0.964   6.61
 7.04 Intr - 105263 105102  162  0  0   56   98    39 0.539   0.73
 7.03 Intr - 113659 113345  315  2  0   95  109   168 0.990  14.71
 7.02 Intr - 133360 133335   26  0  2  114   65    35 0.011   0.55
 7.01 Init - 151317 151178  140  0  2   64   51    76 0.069   1.16
 7.00 Prom - 153057 153018   40                              -2.65

 8.12 PlyA - 153898 153893    6                               1.05
 8.11 Term - 158997 158761  237  0  0   43   47   184 0.045   4.98
 8.10 Intr - 168227 168143   85  1  1  102   55    59 0.431   2.90
 8.09 Intr - 170492 170365  128  2  2   78   62    89 0.628   3.86
 8.08 Intr - 171780 171625  156  0  0  100   83   166 0.978  16.59
 8.07 Intr - 172070 172036   35  0  2   88   68    44 0.635  -0.58
 8.06 Intr - 182530 182423  108  2  0   83  110   109 0.932  11.94
 8.05 Intr - 190706 190616   91  2  1   45   74    11 0.199  -5.95
 8.04 Intr - 192307 192266   42  1  0  104  105    27 0.677   3.52
 8.03 Intr - 195419 195347   73  1  1   51  121    92 0.771   7.29
 8.02 Intr - 196004 195736  269  2  2   41   57   323 0.415  19.91
 8.01 Init - 201637 201635    3  1  0   84   80     0 0.468  -1.15
 8.00 Prom - 201765 201726   40                              -5.55

 9.00 Prom + 205610 205649   40                              -4.65
 9.01 Init + 205684 205740   57  0  0   81  100     2 0.556   2.06
 9.02 Intr + 207779 207846   68  1  2  134  100   -39 0.596  -1.32
 9.03 Intr + 214595 215216  622  2  1   97   98   218 0.655  15.05
 9.04 Term + 216561 216626   66  2  0  101   36    85 0.960   1.56
 9.05 PlyA + 218595 218600    6                               1.05

10.03 PlyA - 219684 219679    6                               1.05
10.02 Term - 220382 220246  137  0  2  151   37   115 0.990   9.90
10.01 Init - 233180 233045  136  2  1   52   78   102 0.927   5.95

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_1|93_aa
XGYTIADNQVEMILSSHLWQHHFFLYDQTIPAHGLNWMQLIVKQSQEGPSGSIPGESIII
GDDNSLDVIASEDLSVGRDVEVEESDIDDPDLV

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_1|282_bp
nnaggatacactattgcagacaatcaagttgaaatgatactctcttcacacctctggcag
catcacttcttcctctatgaccagactattcctgctcatggcttgaactggatgcagtta
attgtaaaacaatctcaggaaggtccttcaggaagtattccaggagaaagcattattata
ggagacgacaattctttggatgttattgcctctgaagatctttcagtgggacgagatgtg
gaggtggaagagagtgatattgatgatcctgaccttgtgtag

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_2|119_aa
MWPHAEEHLEPSKAGGVKQESSLIPLEGHMMGVARLFSHQHSSNPLGEGEHTDRQVKEPK
WSVLQCALLALPSTNGLGVNQLSGLSAFLQGQRGSSTAFCVLSSCPASWKNQVTHGLEG

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_2|360_bp
atgtggccacacgctgaggaacatctggagccatcaaaagctggaggagtgaaacaggag
agttccctgatccccctcgaaggacatatgatgggtgtggctcgcctgttcagtcaccag
cactcctcaaaccccttaggggagggggagcacacagacaggcaggtgaaggagcctaag
tggagtgtgttacaatgtgcccttttagccttgccatccacgaatggcttgggtgttaat
cagctcagtggactctctgcctttctgcaagggcagaggggcagttcaacagctttctgt
gtcctgagctcttgcccagcatcttggaaaaatcaggtcacacacgggcttgaaggatga

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_3|58_aa
MISFDSRSHIEVTLMKKAQYHMEAAKSWGLHPLKPQPELYVGPFQPQLEQLGCRAPSP

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_3|177_bp
atgatctcctttgattccaggtctcacatcgaggtcacgctgatgaaaaaggctcaatac
cacatggaagctgccaagtcttggggcttgcaccctctgaagccacagcccgagctctat
gttggcccctttcagccacagctggagcagctgggatgcagagcaccaagtccctag

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_4|145_aa
MPQKIFTSKEEKQAPGFKAGRDRLMLLFCAKAVGFVIRAVPIYKATNTQTLKENKTPAVS
LLAVLQQEGPENENSFSDWFYQCFVPEIGKYFASKGLFYKVILILDNVPGHLELYEFYTK
GIKVVYLATNSASVIQRRSGDHKDL

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_4|438_bp
atgccacaaaagatatttactagtaaggaagaaaagcaagcaccaggatttaaggcagga
agggataggctaatgctattgttttgtgcaaaagcagttgggtttgtgatcagggctgtc
cctatctataaagctactaacacccaaaccttgaaggaaaataaaacaccagctgtcagt
ctcttggctgtattacaacaagaaggcccagaaaatgagaactctttttctgattggttc
tatcaatgctttgttcctgaaattgggaagtactttgccagtaaagggcttttttataaa
gttattttgatattggacaatgtccctggccacctagaactctacgagttctacaccaaa
ggcatcaaagtggtctatttggccacaaactcagcatctgtaattcagcggagatcgggg
gatcataaggacctttaa

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_5|84_aa
MNIGSVRKGGTTQSKSGTTPGKEGDSGSQTQPACTQVIKTFIAHTKPVWWSLHMDAHEIQ
LLQAIWAYTCKSQGMRWLGLGSEA

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_5|255_bp
atgaacattggttcagtccggaaaggtgggacaactcaaagcaaaagtgggacaactcca
ggcaaggaaggggattccgggtcacagactcagcccgcctgcacccaggtgattaaaacc
tttattgctcacacgaagcctgtttggtggtctcttcacatggacgcgcatgaaattcag
ttacttcaggccatctgggcgtatacgtgcaagtcacaggggatgcgatggcttggcttg
ggctcagaggcctga

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_6|110_aa
MNKEKELWEFQQQYADLCGVSIFPPVALPKQAASPNQSPLIQPCLSVKVLQLALPNDGAH
SVALPDQRVQTVALSYQGAQPAALPDQGALPAPCSIREHDQQPQLNAEPS

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_6|333_bp
atgaacaaagaaaaagagttgtgggaatttcagcagcagtatgcagatctttgtggagtc
tccatatttccaccagttgccttacccaaacaggcagcctcgcccaaccagagtccactc
atccagccctgcctaagtgtgaaggtcctccaactggccctacccaatgatggagcacac
tcagtggctctacctgaccagagagtccagacagtagccctctcataccagggagcccag
ccagcagccctgccggatcagggggcacttccagcaccctgctcaatcagggagcatgac
cagcagccccaactgaatgcagagcctagctag

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_7|539_aa
MGYYAAIKRNKIMSFAGTWLDLEAIILSKLTQEQKTKGHMFSLISGSYFTDATGVGKESC
DVQLYIKRQSEHSILAGDPFELECPVKYCANRPHVTWCKLNGTTCVKLEDRQTSWKEEKN
ISFFILHFEPVLPNDNGSYRCSANFQSNLIESHSTTLYVTAFTNIPDVKSASERPSKDEM
ASRPWLLYRLLPLGGLPLLITTCFCLFCCLRRHQGKQNELSDTAGREINLVDAHLKSEQT
EASTRQNSQVLLSETGIYDNDPDLCFRMQEGSEVYSNPCLEENKPGIVYASLNHSVIGPN
SRLARNVKEAPTEYASICMRVRDITTTSLPLMQDVLSEVNVVKRVRSSAHHTSMLNQSSF
CKEKPARGQIIGGPYIKAKALWAAGFPHVLAVAGIQVTPFLDRPCRRRHNSFPSAQEPEY
LIPQSVLGVGHLSCLGTAKAREHGALPTCRVQAGEGLLCWKPKLAQHHMEAAKAWDLYPL
KPWSKLYVVPFQPRIEQLGHRAQRPQTFHSTGTLGLGPRNYFFLLGLQACDGTNCHEDL

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_7|1620_bp
atgggatactatgcagccataaaaaggaacaagatcatgtcctttgcaggaacatggttg
gatctggaagccattatcctcagcaaactaacacaggaacagaaaaccaaaggccacatg
ttctcacttataagtgggagttatttcacagatgccactggggtagggaaagaatcatgt
gatgtacagctttatataaagagacaatctgaacactccatcttagcaggagatcccttt
gaactagaatgccctgtgaaatactgtgctaacaggcctcatgtgacttggtgcaagctc
aatggaacaacatgtgtaaaacttgaagatagacaaacaagttggaaggaagagaagaac
atttcatttttcattctacattttgaaccagtgcttcctaatgacaatgggtcataccgc
tgttctgcaaattttcagtctaatctcattgaaagccactcaacaactctttatgtgaca
gcatttactaacattccagatgtaaaaagtgcctcagaacgaccctccaaggacgaaatg
gcaagcagaccctggctcctgtatcgtttacttcctttggggggattgcctctactcatc
actacctgtttctgcctgttctgctgcctgagaaggcaccaaggaaagcaaaatgaactc
tctgacacagcaggaagggaaattaacctggttgatgctcaccttaagagtgagcaaaca
gaagcaagcaccaggcaaaattcccaagtactgctatcagaaactggaatttatgataat
gaccctgacctttgtttcaggatgcaggaagggtctgaagtttattctaatccatgcctg
gaagaaaacaaaccaggcattgtttatgcttccctgaaccattctgtcattggaccgaac
tcaagactggcaagaaatgtaaaagaagcaccaacagaatatgcatccatatgtatgagg
gtcagagatataactacaacctccttgcctttgatgcaagacgttctttcagaggttaat
gtggttaaaagggtgagaagctctgcccatcacaccagtatgttaaatcaatcctctttc
tgtaaggagaagcctgcaagagggcagatcataggaggaccttatatcaaagctaaagca
ctctgggcagcagggtttcctcacgtactggctgtggctggcattcaggtcacacccttc
ctggatcggccctgtagaaggaggcacaactcattcccgtcagcccaagaacctgagtat
ctcatccctcaaagtgttctgggagtagggcacctctcctgcttgggcactgccaaagcc
agggaacacggggctttgcccacctgcagagttcaggcaggagaggggctgctgtgctgg
aagcccaagctggctcaacaccacatggaagccgccaaggcttgggacttgtatcctctg
aagccatggtccaagctctatgttgtaccctttcagccacggatagagcagctgggacac
agggcacaaagaccccagactttccacagcacagggaccctgggactgggcccacgaaac
tactttttccttctaggcctccaggcctgtgatgggacaaactgccatgaagacctctga

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_8|408_aa
MNKNHDYVRIGKKRTRHPLRRHVQCYASGRRGEAAYPAPARPLSHVTCRESLVSREAPVA
AQLRQESEKEGKPEGARVKQSEDRQLAEGEGMQNVINTVKGKALEVAEYLTPVLKESKFK
ETGVITPEEHLFNKAHLAPPLIHSTLSGHSTCFREHRVGVPCYKRCKQMEYSDELEAIIE
EDDGDGGWVDTYHNTEYEESGLLETDEATLDTRKIVEACKAKTDAGGEDAILQTRTYDLY
ITYDKYYQTPRLWLFGYDEQRQPLTVEHMYEDISQDHVKKTVTIENHPHLPPPPMCSVHP
CRHAEVMKKIIETVAEGGGELGVHMYPSLYGLSEDPELQPVLAGLSLSMYLVTVLRNLLI
ILAVSSDSHLHTPMYFFLSNLSWADIGFTSAMVPKMIVDMQSHSRVIS

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_8|1227_bp
atgaacaaaaaccacgattacgtccggattgggaaaaagcggacccgacacccattgcgc
aggcacgttcagtgttacgcttccggaaggaggggagaagcggcttatcccgccccagcc
cggcccctttctcacgtcacgtgccgggagagtctagtgtcacgtgaggccccggtggcg
gcgcagctacggcaagagagtgagaaggaagggaagccggaaggggcgcgagtgaagcaa
agcgaggacagacagctcgcagagggcgaggggatgcagaatgtgattaatactgtgaag
ggaaaggcactggaagtggctgagtacctgaccccggtcctcaaggaatcaaagtttaag
gaaacaggtgtaattaccccagaagagcatctgtttaacaaagcacatcttgcaccaccc
ttaatccattcaaccctgagtggacacagcacatgtttcagagagcacagggttggggtg
ccgtgctataagcggtgcaaacagatggaatattcagatgaattggaagctatcattgaa
gaagatgatggtgatggcggatgggtagatacatatcacaacacagaatatgaagagagt
ggattgttggaaacagatgaggctaccctagatacaaggaaaatagtagaagcttgtaaa
gccaaaactgatgctggcggtgaagatgctattttgcaaaccagaacttatgacctttac
atcacttatgataaatattaccagactccacgattatggttgtttggctatgatgagcaa
cggcagcctttaacagttgagcacatgtatgaagacatcagtcaggatcatgtgaagaaa
acagtgaccattgaaaatcaccctcatctgccaccacctcccatgtgttcagttcaccca
tgcaggcatgctgaggtgatgaagaaaatcattgagactgttgcagaaggagggggagaa
cttggagttcatatgtatccttccctgtatggactctcagaggatccagaactgcagccc
gtcctcgctgggctgtccctgtccatgtacctggtcacggtgctgaggaacctgctcatc
atcctggctgtcagctctgactcccacctccacacccccatgtacttcttcctctccaac
ctctcctgggctgacattggtttcacctcggccatggttcccaagatgattgtggacatg
cagtcgcatagcagagtcatctcttaa

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_9|270_aa
MIQLSPPGLALDIWGLQFKAMAVIFSNFSIITTALLFRIVLKSECPRKDNCTAKEWTFPE
AKWNTTARVFSHIRLGMGHVLIIVQCFISSMANIYNEKILKEGNQLTESIFIQNSKLYFF
GILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALIFVTAFQGLSVAFILKFLDNMFHVL
MAQVTTVIITTVSVLVFDFRPSLEFFLEAPSVLLSIFIYNASKPQVPEYAPRQERIRDLS
GNLWERSSGDGEELERLTKPKSDESDEDTF

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_9|813_bp
atgattcagttatctccacctggcttggcccttgacatatggggattacaattcaaggcc
atggctgttatcttctcaaattttagcattataacaacagctcttctattcaggatagtg
ctgaaaagtgagtgtcccagaaaagacaattgtacagcaaaggaatggacttttcctgaa
gctaaatggaacaccacagccagagttttcagtcacatccgtcttggcatgggccatgtt
cttattatagtccagtgttttatttcttcaatggctaatatctataatgaaaagatactg
aaggaagggaaccagctcactgaaagcatcttcatacagaacagcaaactctatttcttt
ggcattctgtttaatgggctgactctgggccttcagaggagtaaccgtgatcagattaag
aactgtggatttttttatggccacagtgcattttcagtagcccttatttttgtaactgca
ttccagggcctttcagtggctttcattctgaagttcctggataacatgttccatgtcttg
atggcccaggttaccactgtcattatcacaacagtgtctgtcctggtctttgacttcagg
ccctccctggaatttttcttggaagccccatcagtccttctctctatatttatttataat
gccagcaagcctcaagttccggaatacgcacctaggcaagaaaggatccgagatctaagt
ggcaatctttgggagcgttccagtggggatggagaagaactagaaagacttaccaaaccc
aagagtgatgagtcagatgaagatactttctaa

>gi568815595r:112366111_112599358|GENSCAN_predicted_peptide_10|90_aa
MRGKGPAVTEINEMIHRNGYHEKATGLGKTGCLSLPCHVLTCVTGGKVDVHGKNQNIGKG
KRSQGGYGLFNRAYWVYKEKQKDSLQIYPV

>gi568815595r:112366111_112599358|GENSCAN_predicted_CDS_10|273_bp
atgaggggcaaagggccagctgtcacagaaatcaatgaaatgatacacagaaatgggtat
catgaaaaggccactggacttgggaaaactgggtgcctgtccttgccctgccatgtactt
acctgtgtgactggaggaaaggtagatgttcatgggaaaaatcagaatataggtaaagga
aaaaggagccaaggaggatatggcctatttaatcgagcctattgggtctataaagaaaag
caaaaagattctctgcagatctacccagtctag