GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:55:16 Sequence gi568815591f:152345477_152547121 : 201645 bp : 44.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 18766 18814 49 0 1 80 87 24 0.598 2.61 1.02 Intr + 22379 22590 212 0 2 85 98 205 0.562 19.83 1.03 Term + 22760 22975 216 1 0 -9 42 238 0.836 6.64 1.04 PlyA + 23523 23528 6 1.05 2.04 PlyA - 26149 26144 6 1.05 2.03 Term - 74988 74925 64 2 1 80 37 70 0.238 -1.54 2.02 Intr - 90311 90150 162 1 0 44 100 186 0.880 14.49 2.01 Init - 91163 91036 128 2 2 39 58 192 0.955 8.93 2.00 Prom - 94761 94722 40 -4.56 3.00 Prom + 97218 97257 40 -4.06 3.01 Sngl + 100001 101674 1674 1 0 92 40 1125 0.999 102.78 3.02 PlyA + 101685 101690 6 1.05 4.00 Prom + 109657 109696 40 -5.06 4.01 Init + 110667 110819 153 2 0 77 44 105 0.135 4.25 4.02 Intr + 138476 138626 151 1 1 68 97 62 0.452 4.84 4.03 Term + 145350 145477 128 1 2 85 45 40 0.338 -2.16 4.04 PlyA + 146540 146545 6 1.05 5.00 Prom + 155541 155580 40 -1.66 5.01 Init + 157164 157208 45 2 0 90 88 29 0.760 3.78 5.02 Intr + 160217 160344 128 1 2 45 99 86 0.504 4.88 5.03 Intr + 168605 168716 112 2 1 138 -7 19 0.099 -2.22 5.04 Term + 169413 169616 204 2 0 112 39 116 0.197 6.47 5.05 PlyA + 169834 169839 6 1.05 6.00 Prom + 173032 173071 40 -4.06 6.01 Init + 192071 192149 79 1 1 114 82 91 0.966 10.35 6.02 Intr + 193421 193577 157 0 1 38 97 82 0.401 3.17 6.03 Intr + 196525 196695 171 1 0 59 80 66 0.248 1.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 97178 97474 297 1 0 101 43 186 0.876 11.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_1|158_aa MNMRETEDIKNNMELKGHELKPLDIEFMKRSHEKVIIIQLIAKADTLIPEECQQFKKQIM KEIQEHKIKIYDFPEIDDEEENKLAKKDLKDVTNNVHYENYRSRKLAAVTYNGVDNNKNK GQLTKSPLAQMEEERREYAAKMKKIEMEMEQVLEMKVK >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_1|477_bp atgaatatgagagaaacagaagatattaaaaataacatggaactcaaaggacatgaactt aaaccattggatattgagtttatgaagcgttcgcatgaaaaagtgattatcatccaactt attgccaaagcagacacacttataccagaggaatgccaacagtttaaaaaacagataatg aaagaaatccaagaacataaaattaagatatatgattttccagaaatagatgatgaagaa gaaaataaacttgctaaaaaggacttgaaagatgttactaataatgtccactatgagaac tacagaagcagaaaactggcagctgtgacttataatggagttgataacaacaagaataaa gggcagctgactaagagccctctggcacaaatggaagaagaaagaagggagtatgcagct aaaatgaagaagatagagatggagatggagcaggtgttagagatgaaggtcaaataa >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_2|117_aa MRGARPVSGSGGARGAGEGPSVGERAGAVEAAVCGRDGCELERMSSEEDKSVEQPQPPPP PPEEPGAPAPSPAAADKRPRGRPRKDGASPFQRARKKSSRFVQNIPNTGDRDDADKY >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_2|354_bp atgcgcggagcgaggcccgtgagtggcagcggcggcgcgcggggggcgggcgaggggccg agtgtgggggagcgggcgggggccgtcgaggcggcggtgtgtgggcgcgacggctgcgag ttggagaggatgtcgtcggaggaggacaagagcgtggagcagccgcagccgccgccacca ccccccgaggagcctggagccccggccccgagccccgcagccgcagacaaaagacctcgg ggccggcctcgcaaagatggcgcttcccctttccagagagccagaaagaaaagttcccgc tttgtgcagaacattccaaacactggagatagggatgatgcagataaatactaa >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_3|557_aa MDSTVPSALELPQRLALNPRESPRSPEEEEPHLLSSLAAVQTLANVIRPCYGPHGRQKFL VTMKGETVCTGCATAILRALELEHPAAWLLREAAQTQAENSGDGTAFVVLLTEALLEQAE QLLKFGLPRPQLREAYATATAEVLATLPSLAIQSLGPLEDPSWALHSVMNTHTLPPMNHL TKLVAHACWAIKELDGSFKPERVGVCTLHGGTLEDSCLLQGLAISGKLCGQMAAVLSGAR VALFACPFGPAHPNAPATACLSSPADLAQFSKGSDQLLEKQVGQLAAAGINVAVVLGEVD EETLTLADKYGIVVIQARSRMEIIYLSEVLDTPLLPRLLPPQRPGKCQRVYRQELGDGLA VVFEWECTGTPALTVVLRGATTQGLRSAEQAVYHSIDAYFQPCQDPRLIPGAGATEMALA KMLSDKGSRLEGPNGPAFLAFARALKYLPKTLAENAGLAVSDVVAEMSGVHQGGNLLMGV GAEGIINVAQEGVWDTLIVKAQGFRAVAEVVLQLVTVDEIVVAKKSPTHQQIWNPDSKKT KKRPPPVEKKKILGMNN >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_3|1674_bp atggacagcacagtcccttcagccctggagctgccccagcggctggcactgaacccaagg gagagcccaaggagtccggaagaggaggagccccacctgctgagcagcttggctgcagtc cagaccctggccaatgtcatccggccttgctatggcccccatggccggcagaagttcctg gtgaccatgaaaggagaaacagtgtgcacggggtgtgccactgccatcctcagggccctg gagctggagcacccagcggcatggctcctccgggaagcagcccaaacccaagcagagaat agtggggatggcacagccttcgtggttctgctgacggaagccttgctggaacaggcagag cagctgctgaagttcggcctgcctcgcccgcagctccgggaggcctacgccacagccact gcagaggtcctggccacactgccctccctggccatccaatctctggggcctttggaagat ccgtcctgggccctccattctgtgatgaatacccacaccctgccccccatgaaccacttg accaagctggtggcccacgcctgctgggctatcaaggagctagacggcagcttcaagcct gagcgtgttggggtgtgcacgctgcacggggggacactggaggattcctgcctcctccag gggttagcaatatctgggaagctctgtgggcaaatggccgcagtgttaagtggtgccagg gtggctctctttgcttgcccctttggtcctgcccatccaaatgcaccagcaacggcctgt ctttctagtcctgctgatctagctcaatttagtaaaggaagtgaccaattactagaaaag caagtaggccagctggcagccgcgggaattaatgtggcagtggtgttgggggaggtcgac gaggagaccctcacactggcggacaagtatggcatcgtggtgattcaggctaggtctcgg atggagatcatttacctgagtgaggtgttggacacaccgctgctgcctcgtctgctccct ccccagaggccaggcaagtgccagagggtttacaggcaggagctgggagatggtttggct gtggtatttgaatgggaatgtacaggcacacctgccctcaccgtggttctcaggggagcc accacccaggggctgcggagtgcagagcaggccgtctaccacagcattgatgcctatttc cagccatgtcaagatcccagactgatcccaggagctggggccacagaaatggctttggca aaaatgctctctgataaaggaagcagattggaagggcccaatgggcctgcattcctagca tttgcccgggccctgaagtatcttcctaaaaccttggcagagaatgcaggcttagctgtc tcagacgtggttgcagaaatgagtggagtgcaccaaggtgggaacctcctaatgggtgtg ggagctgaagggataataaatgtggcccaggaaggggtgtgggacaccctaatagtcaaa gcccaaggatttcgagcagtggctgaggtggtgctacagctcgtgactgtagatgaaatc gtagtggccaagaaaagtcccacacatcagcagatctggaatcctgactctaagaagaca aagaaacgcccacctcctgtggaaaaaaaaaaaatccttggaatgaataactag >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_4|143_aa MAGEDKGRMLLCATATETSRVVSPSVNEWSCFETSGMMTTVERPHSDLLLQHLLDTSTWM CHDPWPCWAQEDLLLPPKPTSSVLPIPVHDGLLMSPVTQALAAIISSLIKGDGAKVPGLS FVAVAPSNIDLSVYNGVGTIQNI >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_4|432_bp atggctggagaagacaagggtcgcatgcttctgtgtgccactgccacagaaacatcccgg gtggtcagcccctcagtgaatgaatggagctgttttgaaaccagtgggatgatgacaact gtggaaagaccccactcggatctcctgctgcagcacttactggacacctccacctggatg tgccacgacccttggccttgctgggcccaggaggaccttctgcttcctcccaaaccaaca tcttctgtgttacctattcctgttcatgatggacttctgatgagcccggtcacccaggct ctagctgctatcatttcatcattaattaaaggcgatggagcaaaggtccctgggttgtca tttgttgctgttgcaccatctaacattgatttgagtgtttataatggggttggtactatt cagaatatctga >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_5|162_aa MTLILTCEDVEWSSQVLFVAFNDFYGADIPLKQISGYQGDISLVAKSLKFCQSAPAGWCQ TRVSVRRLSLPTYAPSNSTLTFVSQKSGTGAGALEVLPEEHSPSTRGQLNSHLRLCSKGA VSTEATLRMAYQVGSLTPPRGPDLKFSVHPKCDPETACVRST >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_5|489_bp atgacccttatccttacatgtgaagatgtggaatggtcctctcaggtgctgtttgtagct tttaatgatttctatggtgctgatattccactaaaacaaatttcaggctaccagggtgat atctccctggttgcaaaatccctgaaattttgccagtctgctcctgcaggctggtgtcag acccgtgtctcagtcagaagactttccctgccgacgtatgctccctctaattccaccttg acatttgtttcccagaagtctggaactggcgcgggggctttggaagtccttccagaagag cactccccatcaacccgcgggcagctgaattcccacctcagactctgctccaagggcgcc gtgtctacggaggcgacgctgaggatggcttatcaggttgggtcactcaccccaccacga ggacctgaccttaaattctcggtgcatcctaagtgtgacccagagaccgcctgcgtcaga agcacctag >gi568815591f:152345477_152547121|GENSCAN_predicted_peptide_6|136_aa MPSRLSLFLNLPGGILLLLSVAVTNPALGWSMGLGAVEQGAVLIEEAPAAREPMEGVGGS GTAGCRSRGLPRGKAAKARLECSGAISTHCNLRLYGKNFGRAQVILHNERDKPQLVYANI SDTHLTMEVSVKATAS >gi568815591f:152345477_152547121|GENSCAN_predicted_CDS_6|408_bp atgcccagccgtctgtctttgtttttgaacctgcctggcggcatcctactcttgctttct gtggccgtcactaatccagcccttgggtggtcgatgggactgggcgccgtggagcagggg gcggtgctcatcgaggaggctccggccgcacgggagcccatggagggggtgggaggctca ggcacggcgggctgcaggtcccgaggcctgccccgcgggaaggcagctaaggcccggctg gagtgcagtggtgcaatctccactcactgcaaccttcgcctctatggcaagaacttcgga agggcacaagtcatattgcataatgagcgtgacaagcctcagcttgtttatgccaatatt tctgacactcacttgaccatggaggtttctgtgaaggccacagccagn