GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:10:07

Sequence gi568815578r:22481853_22684260 : 202408 bp : 43.33% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -   3594   3589    6                               1.05
 1.07 Term -  12647  12507  141  2  0   86   44    54 0.155  -1.27
 1.06 Intr -  18314  18197  118  1  1   17   69   119 0.334   3.37
 1.05 Intr -  18655  18455  201  0  0   38   86    81 0.133   1.30
 1.04 Intr -  26135  26036  100  0  1   76   65   104 0.505   6.07
 1.03 Intr -  33902  33792  111  0  0   56   81    47 0.014   1.15
 1.02 Intr -  41526  41496   31  0  1   68  100    29 0.002  -0.20
 1.01 Init -  54935  54738  198  2  0  110   59    34 0.238   1.60
 1.00 Prom -  56038  55999   40                              -5.06

 2.03 PlyA -  56674  56669    6                               1.05
 2.02 Term -  57233  56922  312  2  0   25   39  1007 0.865  84.50
 2.01 Init -  59620  59588   33  1  0   36   80     4 0.333  -7.66
 2.00 Prom -  60959  60920   40                              -5.06

 3.00 Prom +  61411  61450   40                              -7.56
 3.01 Init +  62475  62476    2  2  2   60  103     0 0.869  -4.21
 3.02 Term +  63287  63530  244  2  1    9   28   456 0.998  27.07
 3.03 PlyA +  63692  63697    6                               1.05

 4.00 Prom +  74722  74761   40                              -2.46
 4.01 Init +  79026  79093   68  2  2   79   53   105 0.161   4.65
 4.02 Intr +  86990  87285  296  2  2   31   57   198 0.044   7.65
 4.03 Intr +  94680  94788  109  1  1   57   82    55 0.043   1.14
 4.04 Intr +  95238  95403  166  0  1   99   57    77 0.039   5.66
 4.05 Term +  95968  96093  126  0  0   42   45   117 0.034   1.08
 4.06 PlyA +  96913  96918    6                               1.05

 5.05 PlyA -  97041  97036    6                               1.05
 5.04 Term - 101302  99998 1305  1  0   83   48  2027 0.992 189.79
 5.03 Intr - 102409 102340   70  0  1   55   99    47 0.542   1.68
 5.02 Intr - 104164 104079   86  1  2  113   80    43 0.840   4.62
 5.01 Init - 105040 104918  123  1  0   43   59   120 0.806   3.93
 5.00 Prom - 115023 114984   40                              -5.86

 6.07 PlyA - 117593 117588    6                               1.05
 6.06 Term - 119170 119047  124  0  1   65   43   187 0.736   9.76
 6.05 Intr - 135508 135405  104  1  2   38   69    59 0.002  -2.03
 6.04 Intr - 141819 141720  100  2  1  107   82    52 0.506   6.61
 6.03 Intr - 152637 152608   30  2  0   86   94    46 0.160   2.25
 6.02 Intr - 163306 163138  169  2  1   26   56    83 0.080  -2.10
 6.01 Init - 163463 163379   85  2  1   53   72    79 0.646   3.78
 6.00 Prom - 164744 164705   40                              -1.66

 7.00 Prom + 187384 187423   40                              -1.96
 7.01 Init + 195535 195655  121  0  1   71   49    62 0.425   0.97
 7.02 Term + 196332 196420   89  1  2  110   50    60 0.692   2.22
 7.03 PlyA + 198167 198172    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  96745  96658   88  1  1   74   91   164 0.819  14.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_1|299_aa
MADVGSMVEKAFSGDEITQRQLVKSQPDSLFIYASLSDRLQAHHLTLLSLSFLICKRKVK
GSVTLFGSPTGELKAEENKSFPITSAKHQSPGVEPAGTLAALKHSPPELPLKQGKVHYFS
ELQSTQQFLFGFPLLVVVEIQQDRSKSGTRMMLRRQDHFVVGKAEAPHWVSRGFMKFAPV
LQEAAELGHHQQCVFNFISCVALLGKLLAREESRKEKSIACENGIGKTSAEHTALLIPGG
GGDEEVCKQALKQVNPLGICSLSFAEPVSPFDSILILQTEGTGNNCGLSKKQHLFSPYI

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_1|900_bp
atggcagatgttggaagcatggtagaaaaagccttcagtggggatgagatcactcagaga
caattggttaagagccagccagacagcctgtttatatatgcttccctgagtgatcgactt
caggcacaccacttaacccttctcagcctcagtttccttatctgcaaaaggaaagtgaaa
ggatctgtgaccttatttggctcgccaacaggagaactgaaagctgaagaaaataagagc
ttccccatcacctcagcaaagcatcagtcccccggtgttgaaccagctggtactctggca
gccctaaagcactctcctcccgagctcccacttaagcaaggtaaggtgcattacttctcc
gaactgcaaagcacgcagcagtttctgttcggctttcctctcctggtggttgtagaaatt
cagcaagacaggtctaaaagtggcacaaggatgatgctcaggcgccaggatcactttgtg
gtgggaaaagcagaggctccccactgggtcagccggggcttcatgaagttcgccccagtg
ctacaagaggctgcagaacttggtcaccatcagcaatgcgtcttcaactttatttcctgt
gttgctcttctgggaaaacttttggcaagggaggaatcaaggaaagaaaaaagcattgcc
tgtgaaaacggcattggcaaaacatctgctgagcacacagcactcctgatccctggtgga
ggtggggatgaggaggtctgcaaacaagctctgaagcaggtaaacccactgggcatctgc
agcctgagttttgcggagcccgtctctccattcgatagcatccttatattgcaaacagaa
ggcaccggaaataactgtggtttatccaagaagcagcatcttttttccccttatatttag

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_2|114_aa
MVRFRLCLPSQRDTDKKKKKKKKEEEERKKKKKTKKKKKKKKKKKKKKKKKEKKKKKEKK
KKKKKREKKKEKKQKKKKKKKRKRKKKKKKKKKKKKKKKKKLKKKKKKKKKKKN

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_2|345_bp
atggttagattcaggttgtgcctccctagccagagggatactgacaaaaaaaagaagaag
aagaagaaagaagaagaagaaaggaagaagaagaagaagacgaagaagaagaagaagaag
aagaagaagaagaagaagaagaagaagaagaaggagaagaagaagaagaaggagaagaag
aagaagaagaagaagagggagaagaagaaggagaagaagcagaagaagaagaagaagaag
aagaggaagaggaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag
aagttgaagaagaagaagaagaagaagaaaaagaagaagaactag

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_3|81_aa
IIVIIIIIIIITITIIVTIINIITITNIIIVTIITIIITITIIIITIITITITSIIITTI
IIILYYLIILPHVLTTSEFAK

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_3|246_bp
atcattgtcattattatcattatcatcatcatcaccatcaccatcattgtcaccattatc
aatattatcaccatcaccaacatcatcattgtcaccattatcactatcataatcaccatc
accatcatcatcatcacaatcatcaccatcaccatcaccagcattatcatcaccactatc
atcatcatcttatactacctcattatactacctcatgtgcttactacttctgaatttgca
aaataa

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_4|254_aa
MPSRWLWGMLAAPRLPALSSASHRSSPNRDSPTDPRGLGARPAALTTSARPIGPGCLAPG
TRAHAAFPEPEHPLRLSQPEPKEGCSAGPRATRGEPEPARQPAARASPRNEERECCFISE
ADLISATPPSKPGPPPDPRMCRSMHCPEENALPPGNFRRPIMFSEDNQPPEASRGSEGGG
RAPQQASGAVDTGAQPKAPGKERRGHQGSPHRKRLPNLPAAKHGCAQKATSRFLQPELRF
TRLLERRGPSPAAA

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_4|765_bp
atgccctctcggtggctgtgggggatgcttgcggcccctcgcctgccagcgctgagctca
gcctcacaccggagctcgccaaaccgggacagccccacggacccgcggggactgggggct
cggccggctgctctgacgacctcggcgcgcccgatcggccccggctgcctggccccgggt
acccgggcacacgccgccttcccggagcccgagcacccgctccggctctcgcagcctgag
cccaaggaaggctgctctgcaggtccccgggccactcgcggggagcccgagccagcccgc
cagccagcggctagggcctcacccaggaatgaggaaagggagtgctgctttatttcggaa
gcagatctcatctcggccacccctccctccaagccagggccacctccagacccccggatg
tgtagaagcatgcactgccccgaagaaaacgcgttgcccccaggtaacttcaggcgtccc
atcatgttcagcgaagataaccagccacccgaggcgagccggggctcggagggcggcggg
cgcgctccccagcaggcttcgggtgcggtggacacgggcgcacagcccaaggcaccaggg
aaggagaggaggggccaccaagggtccccacacaggaagaggctccccaacctgcctgcc
gccaagcacggctgtgcgcaaaaggccaccagccgcttcctgcagccagagctaaggttc
actcgcctcttagaaaggcgcggcccctcccctgctgctgcttga

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_5|527_aa
MRTRVGPAGLAADRGRPVTVEAPAERSLVSLTPATNSAVVEPHPYSSFNAKLRVCHLACL
LPPSPPILDSMLGAVKMEGHEPSDWSSYYAEPEGYSSVSNMNAGLGMNGMNTYMSMSAAA
MGSGSGNMSAGSMNMSSYVGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSL
SPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKTYRRSYTHAKPPYSYISLITM
AIQQSPNKMLTLSEIYQWIMDLFPFYRQNQQRWQNSIRHSLSFNDCFLKVPRSPDKPGKG
SFWTLHPDSGNMFENGCYLRRQKRFKCEKQLALKEAAGAAGSGKKAAAGAQASQAQLGEA
AGPASETPAGTESPHSSASPCQEHKRGGLGELKGTPAAALSPPEPAPSPGQQQQAAAHLL
GPPHHPGLPPEAHLKPEHHYAFNHPFSINNLMSSEQQHHHSHHHHQPHKMDLKAYEQVMH
YPGYGSPMPGSLAMGPVTNKTGLDASPLAADTSYYQGVYSRPIMNSS

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_5|1584_bp
atgaggactcgggtcggtcccgcgggcctggcggccgacagggggcgccctgtcactgtg
gaagcccccgccgagcgctctctggtgtccctgaccccggccaccaatagtgctgtggtg
gagcctcacccttactcaagcttcaatgcgaagctccgtgtctgccatctcgcctgtctt
ctgccaccatcgcccccaattttggacagtatgctgggagcggtgaagatggaagggcac
gagccgtccgactggagcagctactatgcagagcccgagggctactcctccgtgagcaac
atgaacgccggcctggggatgaacggcatgaacacgtacatgagcatgtcggcggccgcc
atgggcagcggctcgggcaacatgagcgcgggctccatgaacatgtcgtcgtacgtgggc
gctggcatgagcccgtccctggcggggatgtcccccggcgcgggcgccatggcgggcatg
ggcggctcggccggggcggccggcgtggcgggcatggggccgcacttgagtcccagcctg
agcccgctcggggggcaggcggccggggccatgggcggcctggccccctacgccaacatg
aactccatgagccccatgtacgggcaggcgggcctgagccgcgcccgcgaccccaagacc
tacaggcgcagctacacgcacgcaaagccgccctactcgtacatctcgctcatcaccatg
gccatccagcagagccccaacaagatgctgacgctgagcgagatctaccagtggatcatg
gacctcttccccttctaccggcagaaccagcagcgctggcagaactccatccgccactcg
ctctccttcaacgactgtttcctgaaggtgccccgctcgcccgacaagcccggcaagggc
tccttctggaccctgcaccctgactcgggcaacatgttcgagaacggctgctacctgcgc
cgccagaagcgcttcaagtgcgagaagcagctggcgctgaaggaggccgcaggcgccgcc
ggcagcggcaagaaggcggccgccggagcccaggcctcacaggctcaactcggggaggcc
gccgggccggcctccgagactccggcgggcaccgagtcgcctcactcgagcgcctccccg
tgccaggagcacaagcgagggggcctgggagagctgaaggggacgccggctgcggcgctg
agccccccagagccggcgccctctcccgggcagcagcagcaggccgcggcccacctgctg
ggcccgccccaccacccgggcctgccgcctgaggcccacctgaagccggaacaccactac
gccttcaaccacccgttctccatcaacaacctcatgtcctcggagcagcagcaccaccac
agccaccaccaccaccaaccccacaaaatggacctcaaggcctacgaacaggtgatgcac
taccccggctacggttcccccatgcctggcagcttggccatgggcccggtcacgaacaaa
acgggcctggacgcctcgcccctggccgcagatacctcctactaccagggggtgtactcc
cggcccattatgaactcctcttaa

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_6|203_aa
MWESLEALIDLLNGFDQKPDNDMNNKVQAKSLVAFCHRPRDLWNFELERGHLVYLVEEIS
KQQSIQDVTGAAKGIRFYKESKAKWFKSNSSSFDSKGSGTGHMGAQLGNISQTPLQLDAD
VTTCSEWNELCQEAALQRKANSGLPSAEIRKQNSIEEVSILVSPSKMEASTQMPYDWLLL
STGIHSTSPENTDPATDINGGDE

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_6|612_bp
atgtgggaaagtttggaagctcttatagacttgttgaatggttttgaccaaaagcctgat
aatgatatgaacaataaggtccaggcaaagagcctggtggcattttgccaccgccctaga
gatttgtggaactttgaacttgagagaggtcatttagtgtatctggtggaagaaatttct
aagcagcaaagcattcaagatgtgaccggtgctgctaaaggcattcggttttataaggaa
agcaaagcaaaatggtttaagagcaactcgtcttcatttgacagtaagggatctggaact
ggtcacatgggtgcccaactgggtaacatctcccagactcccttgcagttagatgcagat
gtgaccacatgttctgagtggaatgagctgtgccaagaggcagcccttcagaggaaggcc
aactcagggctgcccagtgcagagataaggaaacagaactccatagaggaggtcagcatc
ctggtcagtccaagcaagatggaggcctctacccagatgccttatgactggctgctgctg
agcactggcatccactcaacgtccccagagaacacagaccccgccacagacatcaatggt
ggagatgaatag

>gi568815578r:22481853_22684260|GENSCAN_predicted_peptide_7|69_aa
MERLPANHPELQRGVEQSSSQSSEGTSPAHTLISDFQHRHGITEPFLATLGSVTLLQRIA
LRLLELLTI

>gi568815578r:22481853_22684260|GENSCAN_predicted_CDS_7|210_bp
atggaaaggttgcctgcaaaccacccggagctgcagagaggcgtggaacagtcttcctca
cagtcctcggaaggaaccagccctgcccacaccttgatctcagacttccagcatcggcac
ggcatcaccgagccttttcttgctacactcggctctgtgactctgcttcagaggatagcc
cttaggctgctggagctgctgacaatatag