GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:24:13 Sequence gi568815583f:62948576_63169940 : 221365 bp : 44.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4598 4934 337 1 1 100 45 306 0.279 24.94 1.02 Term + 5201 5583 383 0 2 24 38 227 0.318 6.30 1.03 PlyA + 5879 5884 6 1.05 2.00 Prom + 6604 6643 40 -4.26 2.01 Init + 6660 6737 78 2 0 74 100 -1 0.186 0.77 2.02 Intr + 13456 13615 160 0 1 25 77 90 0.276 1.16 2.03 Intr + 13724 13771 48 0 0 104 77 30 0.143 2.15 2.04 Intr + 20205 20317 113 1 2 105 56 74 0.729 6.00 2.05 Intr + 27087 27190 104 2 2 96 72 50 0.381 3.17 2.06 Intr + 36293 36343 51 2 0 36 75 91 0.014 0.62 2.07 Intr + 41387 41508 122 2 2 147 28 29 0.059 2.94 2.08 Intr + 46965 47016 52 1 1 105 55 32 0.010 -0.43 2.09 Intr + 58414 58485 72 1 0 29 99 69 0.007 0.62 2.10 Intr + 62774 62911 138 2 0 16 55 147 0.044 3.78 2.11 Intr + 68410 68531 122 1 2 56 76 78 0.070 3.54 2.12 Intr + 71630 71730 101 0 2 102 72 -12 0.047 -1.57 2.13 Term + 72519 72635 117 2 0 61 41 132 0.321 4.34 2.14 PlyA + 73091 73096 6 1.05 3.00 Prom + 74390 74429 40 -7.86 3.01 Init + 77884 77947 64 0 1 92 103 20 0.848 5.21 3.02 Term + 83754 83833 80 1 2 94 42 96 0.513 3.53 3.03 PlyA + 84317 84322 6 1.05 4.00 Prom + 89714 89753 40 -3.76 4.01 Init + 93595 93667 73 0 1 66 13 147 0.908 4.32 4.02 Intr + 94229 94368 140 0 2 81 46 293 0.938 24.58 4.03 Intr + 95131 95256 126 0 0 71 75 345 0.994 32.38 4.04 Intr + 95452 95577 126 0 0 66 95 222 0.999 21.58 4.05 Intr + 98952 99064 113 2 2 32 61 46 0.112 -4.52 4.06 Intr + 99823 100132 310 1 1 62 75 267 0.245 19.12 4.07 Intr + 104076 104144 69 2 0 86 50 71 0.520 2.48 4.08 Intr + 108410 108543 134 1 2 92 78 194 0.999 18.24 4.09 Intr + 110988 111105 118 0 1 104 55 226 0.998 21.37 4.10 Intr + 112294 112364 71 0 2 104 98 133 0.999 13.88 4.11 Intr + 112623 112698 76 0 1 69 79 69 0.934 3.62 4.12 Intr + 113640 113702 63 2 0 69 75 110 0.986 6.71 4.13 Intr + 114001 114070 70 0 1 50 116 25 0.996 0.25 4.14 Term + 115489 115571 83 2 2 69 41 204 0.999 11.66 4.15 PlyA + 115618 115623 6 1.05 5.00 Prom + 119895 119934 40 -4.66 5.01 Init + 127728 127733 6 2 0 72 115 8 0.074 2.35 5.02 Intr + 133214 133319 106 1 1 61 80 56 0.010 1.89 5.03 Term + 156166 156269 104 2 2 86 54 97 0.184 4.54 5.04 PlyA + 159272 159277 6 1.05 6.00 Prom + 164507 164546 40 -0.06 6.01 Init + 173297 173653 357 1 0 79 86 349 0.921 28.91 6.02 Intr + 174061 174127 67 0 1 95 98 6 0.945 0.78 6.03 Intr + 178284 178474 191 1 2 117 79 184 0.989 19.70 6.04 Intr + 178778 179114 337 1 1 58 105 236 0.966 17.39 6.05 Intr + 180910 181075 166 2 1 35 94 63 0.754 0.62 6.06 Term + 192705 193230 526 0 1 103 44 190 0.924 9.84 6.07 PlyA + 193313 193318 6 1.05 7.00 Prom + 199324 199363 40 -2.76 7.01 Init + 209157 209249 93 2 0 87 45 112 0.766 5.18 7.02 Term + 209538 209591 54 2 0 76 47 96 0.795 1.86 7.03 PlyA + 210420 210425 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_1|239_aa MSSEAETQQPPPAPALSSTYTKPGTTGSGAGNGVLGGLTPTAPAGGDKKVIATKVLGTVK WFNVRNRYGFINRNDTKEDVFVPQTAIENNPRKYLPRVDRETMEFDVVEGEKAFQPVQGE VMEGAGNQGAGEQRRPMRQNMCWGYRPRFRRGPPRPKQPREDGNEEDKENQGGDTQGQQP PQRRYLRNFNYRRRRPENPKPQDGKETKAADPPVENSSTPEAEQGRLSKCRLHLYQHTV >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_1|720_bp atgagcagcgaggcggagacccagcagccgccgcccgcccccgccctcagctccacctac accaagcccggcactacgggcagcggcgcagggaacggtgtcctgggcggcctcacaccg acggcgcctgccggcggggacaagaaagtcatcgcaacgaaggttttgggaacagtaaaa tggttcaatgtaaggaacagatatggtttcatcaacaggaatgacaccaaggaagatgta tttgtaccccagactgccatagagaataaccccaggaagtaccttcccagggtagatagg gagactatggagtttgatgttgttgaaggagaaaaggcattccaacctgtgcagggagaa gtgatggagggtgctggcaaccagggtgcaggagaacaacgtagaccaatgaggcagaat atgtgttggggatatagaccacggttccgcaggggccctcctcgcccaaaacagcctaga gaggacggcaatgaagaagataaagaaaatcaaggaggtgacacccaaggtcagcagcca cctcaacgtcggtacctccgcaacttcaattaccgacgcagacgcccagaaaaccctaaa ccacaagatggcaaagagacaaaagcagcagatccaccagttgagaattcgtccactccg gaggctgagcagggcaggctgagtaaatgccggcttcatctctaccaacatacagtttag >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_2|425_aa MGETTPMIQLPPPGPALYMWGSLQFKALEAAGGFVGSCLLLENGFAVSSDLVWGVHSHLS DILAEAKEQFTLGANIPWQGLALAEPTGLVLQGGSGLPGKPRDPEQFRTREEKSLENYTS VTRDSSYASICENASPWQTAPPFSALPRPLFLPLYDCLVFLEMAFCDGSIEHKWISEEDG EDTSRALSPLPPGPSHPYSLGPRYHPFHTHSSPHREGRDQCVGIKVNIYLFIVCSMEQHA LLCSPAKREARLGNATYPSTSKSQCDCLPMKRKLSEACVEPALKQSRKTERLCTREYPIR YPSQGTKAMFGFHVVTGFPRNYLSRKPRKKPQCLSDLALEVTRHHIYCPIVHTGSKSCKS STHSVPLLSLEIDLTIMPPCNVLASGLDSELSIGTAQPITLNTTRMSNTIQAPEKRFSVS LVLCL >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_2|1278_bp atgggtgaaacgacccccatgatccaattacctccacctggtcccgccctttacatgtgg ggatcattacaattcaaggccctggaagctgctggaggttttgtgggcagctgcctcctc ctagaaaatggctttgctgtcagttctgacctggtgtggggagtacactcacacctatca gacatcttagcagaggctaaggagcaatttactctgggcgccaacattccctggcaagga ctagctctggccgagcccacaggactggtgttgcagggaggaagtggtctccctggaaag ccacgtgatccagagcagtttcgaaccagagaagagaaaagcctggagaattacaccagt gtaactagagattcttcttatgcatcaatatgcgaaaatgcctctccctggcagacggcc ccgcccttctccgcactccctaggcctttgttcctacctctgtacgactgcctcgtcttc ctggagatggcattctgtgatgggtcaattgagcacaaatggatctccgaggaagatggt gaagacacatcaagggccctgtcacccctaccgcctgggccctctcacccctacagcctg ggccctcgttaccaccccttccacactcacagcagcccacaccgggaaggaagagatcag tgtgtcggcattaaggtgaacatttatttattcatcgtctgcagcatggagcagcatgcc ctgctctgtagtccagcaaagagagaggcccgactagggaatgccacatacccatcaact tccaagagccagtgcgactgtcttcccatgaaaagaaaactcagtgaagcttgtgtggag ccagccctgaagcagtcacgaaaaactgagagactttgcaccagggagtacccaatacga taccccagtcaaggaacaaaggccatgtttggcttccatgtggtgacaggcttcccccgc aactacctatccaggaaaccaaggaagaagccgcaatgcctctccgacctagctctggaa gtcacgcgtcatcacatctactgtcccattgttcacacaggctcaaagagttgcaaatct tccacacactcggtaccccttttaagtttagaaattgacctaacaataatgcctccttgc aatgtccttgcaagtggcctggattcagaactcagcatcggcacagcacaaccaataacc ctaaacaccactcggatgagtaacaccattcaggccccagagaaacgcttctccgtgtct cttgtcctgtgtctgtga >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_3|47_aa MGVGDKIEETSIGGLGGQCRNAEKILAAGFPIAVNNVQLQDIVTVLF >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_3|144_bp atgggcgtgggggacaagatagaggagaccagtattggaggattaggaggtcaatgtaga aatgctgagaagatcctggctgctggctttcctattgctgttaacaatgtacaactacaa gatatcgtgactgtacttttttaa >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_4|523_aa MTPGWLRAAGEALAGAESALTLTRAPGPLAAATMDAIKKKMQMLKLDKENALDRAEQAEA DKKAAEDRSKQLEEDIAAKEKLLRVSEDERDRVLEELHKAEDSLLAAEEAAAKLEDELVS LQKKLKGTEDELDKYSEALKDAQEKLELAEKKATDASQSSPGLTGFQEISEAGLTPKKAV KRRDHLDDCFHGNPGGCDFRTAPGRRGRRHRTERPGRGGPALGSQDSRGSRVRRAAAGLS HCSPPARLPSGAMAGSSSLEAVRRKIRSLQEQADAAEERAGTLQRELDHERKLRETITDG HVSMGLYRQRQQQVLKTPQAEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAAD ESERGMKVIESRAQKDEEKMEIQEIQLKEAKHIAEDADRKYEEVARKLVIIESDLERAEE RAELSEGQVRQLEEQLRIMDQTLKALMAAEDKYSQKEDRYEEEIKVLSDKLKEAETRAEF AERSVTKLEKSIDDLEDELYAQKLKYKAISEELDHALNDMTSM >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_4|1572_bp atgacacctggctggctgagagctgccggggaggcgctggcgggtgccgagagcgcactg accctgacgcgggccccagggcccctcgccgccgccaccatggacgccatcaagaagaag atgcagatgctgaagctcgacaaggagaacgccttggatcgagctgagcaggcggaggcc gacaagaaggcggcggaagacaggagcaagcagctcgaggaggacatcgcggccaaggag aagttgctgcgggtgtcggaggacgagcgggaccgggtgctggaggagctgcacaaggcg gaggacagcctcctggccgccgaagaggccgccgccaagctggaagatgagctggtgtca ctgcaaaagaaactcaagggcaccgaagatgaactggacaaatactctgaggctctcaaa gatgcccaggagaagctggagctggcagagaaaaaggccaccgatgcatcccagtccagc ccaggactgactggattccaggagatctcagaggctggtctgacaccaaagaaggctgta aagagaagagaccatttggatgactgttttcatggaaaccctggaggctgcgacttccgg actgctcctggccgcagggggcgccgccatcgcacagagaggcctgggcggggcggaccg gcgctgggcagccaggacagccgcggcagccgggtccgcagggcagcagccggcctctcc cactgcagccctcccgcccgcctaccgtccggcgcgatggcggggagtagctcgctggag gcggtgcgcaggaagatccggagcctgcaggagcaggcggacgccgctgaggagcgcgcg ggcaccctgcagcgcgagctggaccacgagaggaagctgagggagaccataacagatggt catgtgtctatgggcttgtaccggcagaggcaacagcaggtccttaagactccccaggct gaagccgacgtagcttctctgaacagacgcatccagctggttgaggaagagttggatcgt gcccaggagcgtctggcaacagctttgcagaagctggaggaagctgagaaggcagcagat gagagtgagagaggcatgaaagtcattgagagtcgagcccaaaaagatgaagaaaaaatg gaaattcaggagatccaactgaaagaggccaagcacattgctgaagatgccgaccgcaaa tatgaagaggtggcccgtaagctggtcatcattgagagcgacctggaacgtgcagaggag cgggctgagctctcagaaggccaagtccgacagctggaagaacaattaagaataatggat cagaccttgaaagcattaatggctgcagaggataagtactcgcagaaggaagacagatat gaggaagagatcaaggtcctttccgacaagctgaaggaggctgagactcgggctgagttt gcggagaggtcagtaactaaattggagaaaagcattgatgacttagaagacgagctgtac gctcagaaactgaagtacaaagccatcagcgaggagctggaccacgctctcaacgatatg acttccatgtaa >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_5|71_aa MMATTPPFPTLRAISITTTIDRMKTCRPLLYCMCFVAGYSKVVAVHKPGKRALILVDYTG TLIPDFKPPEP >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_5|216_bp atgatggccaccaccccacctttccccacactcagagccatcagcatcaccaccaccatt gatcgtatgaagacatgcagaccacttttatactgcatgtgctttgttgcaggatacagc aaggtggtggctgtccacaagcctggaaagagagcgctcatattagttgactacactggc actcttatcccagacttcaagcctccagaaccgtga >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_6|547_aa MYRLMSAVTARAAAPGGLASSCGRRGVHQRAGLPPLGHGWVGGLGLGLGLALGVKLAGGL RGAAPAQSPAAPDPEASPLAEPPQEQSLAPWSPQTPAPPCSRCFARAIESSRDLLHRIKD EVGAPGIVVGVSVDGKEVWSEGLGYADVENRVPCKPETVMRIASISKSLTMVALAKLWEA GKLDLDIPVQHYVPEFPEKEYEGEKVSVTTRLLISHLSGIRHYEKDIKKVKEEKAYKALK MMKENVAFEQEKEGKSNEKNDFTKFKTEQENEAKCRNSKPGKKKNDFEQGELYLREKFEN SIESLRLFKNDPLFFKPGSQFLYSTFGYTLLAAIVERASGCKYLDYMQKIFHDLDMLTTV QEENEPVIYNRARFYVYNKKKRLVNTPYVDNSYKWAGGGFLSTVGDLLKFGNAMLYGYQV GLFKNSNENLLPGYLKPETMVMMWTPVPNTEMSWDKEGKYAMAWGVVERKQTYGSCRKQR HYASHTGGAVGASSVLLVLPEELDTETINNKVPPRGIIVSIICNMQSVGLNSTALKIALE FDKDRSD >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_6|1644_bp atgtaccggctcatgtcagcagtgactgcccgggctgccgcccccgggggcttggcctca agctgcggacgacgcggggtccatcagcgcgccgggctgccgcctctcggccacggctgg gtcgggggcctcgggctggggctggggctggcgctcggggtgaagctggcaggtgggctg aggggcgcggccccggcgcagtcccccgcggcccccgaccctgaggcgtcgcctctggcc gagccgccacaggagcagtccctcgccccgtggtctccgcagaccccggcgccgccctgc tccaggtgcttcgccagagccatcgagagcagccgcgacctgctgcacaggatcaaggat gaggtgggcgcaccgggcatagtggttggagtttctgtagatggaaaagaagtctggtca gaaggtttaggttatgctgatgttgagaaccgtgtaccatgtaaaccagagacagttatg cgaattgctagcatcagcaaaagtctcaccatggttgctcttgccaaattgtgggaagca gggaaactggatcttgatattccagtacaacattatgttcccgaattcccagaaaaagaa tatgaaggtgaaaaggtttctgtcacaacaagattactgatttcccatttaagtggaatt cgtcattatgaaaaggacataaaaaaggtgaaagaagagaaagcttataaagccttgaag atgatgaaagagaatgttgcatttgagcaagaaaaagaaggcaaaagtaatgaaaagaat gattttactaaatttaaaacagagcaggagaatgaagccaaatgccggaattcaaaacct ggcaagaaaaagaatgattttgaacaaggcgaattatatttgagagaaaagtttgaaaat tcaattgaatccctaagattatttaaaaatgatcctttgttcttcaaacctggtagtcag tttttgtattcaacttttggctataccctactggcagccatagtagagagagcttcagga tgtaaatatttggactatatgcagaaaatattccatgacttggatatgctgacgactgtg caggaagaaaacgagccagtgatttacaatagagcaagattttatgtttacaataaaaag aaacgtcttgtcaacacaccttacgtggataactcctataaatgggctggtggtggattt ctgtctacagtgggtgaccttctgaaatttgggaatgcaatgctttatggttaccaagtt gggctgtttaagaactcaaatgaaaatcttttacctggatacctcaaaccagaaacaatg gttatgatgtggaccccagtccctaacacagagatgtcttgggataaagagggtaaatat gcaatggcgtggggtgttgtggaaaggaaacaaacgtatggttcgtgtagaaagcaacgg cattatgcttcacatactggaggggcagtgggtgccagtagtgtcctgctggtccttcct gaagaactggatacagagactataaataacaaggttcccccaagaggaatcattgtttct atcatatgtaacatgcaatctgttggcctcaatagcaccgctttgaagattgcccttgaa tttgataaagacagatcagactga >gi568815583f:62948576_63169940|GENSCAN_predicted_peptide_7|48_aa MARERPRPPGPAPPRNPAGRSAPGPLARPLSSGQLGGLRKANVTDDRT >gi568815583f:62948576_63169940|GENSCAN_predicted_CDS_7|147_bp atggcgagggagcgcccgcggccaccggggccggcgccaccccggaacccggcgggccgc tcagcccccggccccctggcccggccgctctcgagtggacaactagggggcctgagaaag gcgaatgtcaccgatgaccgaacatga