GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:43:35 Sequence gi568815581r:43541913_43761534 : 219622 bp : 49.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 3110 3063 48 0 0 97 113 50 0.930 7.05 1.02 Intr - 3567 3362 206 2 2 113 98 214 0.654 23.74 1.01 Init - 5312 5239 74 2 2 93 37 37 0.374 -0.35 1.00 Prom - 5656 5617 40 -7.66 2.00 Prom + 8837 8876 40 -6.66 2.01 Init + 14518 14699 182 0 2 74 96 137 0.945 9.76 2.02 Intr + 15921 15951 31 0 1 95 101 20 0.848 2.13 2.03 Intr + 41580 41792 213 2 0 116 68 136 0.822 13.31 2.04 Intr + 72782 72983 202 1 1 70 51 83 0.511 1.66 2.05 Intr + 73318 73398 81 2 0 90 86 17 0.523 1.31 2.06 Intr + 80072 80221 150 0 0 83 15 86 0.096 0.93 2.07 Term + 86596 86678 83 2 2 84 55 84 0.248 2.56 2.08 PlyA + 87349 87354 6 1.05 3.17 PlyA - 89037 89032 6 1.05 3.16 Term - 100120 99998 123 1 0 125 50 85 0.433 6.78 3.15 Intr - 101748 101576 173 1 2 89 85 275 0.996 26.96 3.14 Intr - 104398 104000 399 2 0 67 20 146 0.282 0.18 3.13 Intr - 119481 119154 328 0 1 86 96 145 0.024 10.37 3.12 Intr - 124048 123940 109 0 1 43 64 52 0.001 -1.41 3.11 Intr - 138894 138698 197 0 2 3 92 124 0.193 2.51 3.10 Intr - 139184 139093 92 0 2 57 100 23 0.308 0.11 3.09 Intr - 141039 140956 84 1 0 69 76 39 0.217 0.59 3.08 Intr - 141395 141256 140 1 2 55 51 47 0.133 -2.19 3.07 Intr - 148750 148654 97 2 1 63 99 70 0.502 4.67 3.06 Intr - 156073 156017 57 2 0 108 121 44 0.455 8.56 3.05 Intr - 169364 169178 187 2 1 122 30 121 0.018 8.96 3.04 Intr - 178184 177704 481 1 1 28 81 86 0.011 -5.06 3.03 Intr - 178496 178244 253 0 1 32 40 256 0.006 11.69 3.02 Intr - 203792 203715 78 0 0 103 36 101 0.789 5.92 3.01 Init - 207906 207798 109 0 1 66 93 41 0.405 2.78 3.00 Prom - 208841 208802 40 -5.46 4.05 PlyA - 211846 211841 6 1.05 4.04 Term - 213851 213430 422 0 2 118 43 925 0.999 86.45 4.03 Intr - 215595 215391 205 0 1 66 59 108 0.594 4.47 4.02 Intr - 216493 216237 257 2 2 28 116 16 0.333 -4.44 4.01 Init - 216829 216610 220 1 1 86 86 284 0.987 24.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 163647 163748 102 2 0 116 47 78 0.898 4.78 S.002 Term - 169364 169143 222 2 0 122 48 147 0.806 11.02 S.003 Sngl - 178524 178159 366 0 0 75 32 276 0.823 16.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:43541913_43761534|GENSCAN_predicted_peptide_1|110_aa MAERELHFRKSSTLGLLVSAAISRNPSGVLGCPAPESPEDPNLVPQTKRLRVTRGHSPRF SQKSPGNGSLREALIGPLGKLMDPGSLPPLDSEDLFQDLSHFQETWLAEX >gi568815581r:43541913_43761534|GENSCAN_predicted_CDS_1|330_bp atggctgagagggagctacactttcggaaatcatctaccctggggcttctggtttctgct gcaatcagtagaaatcccagcggagtcctgggctgccccgcccctgagtcacccgaggac cccaacctcgtcccccagactaagcgcctcagggtgactcgcgggcattctccccgcttc tcgcagaaatcgcccggaaatgggagcttgcgcgaagcgctgatcggcccgctggggaag ctcatggacccgggctccctgccgcccctcgactctgaagatctcttccaggatctaagt cacttccaggagacgtggctcgctgaagnn >gi568815581r:43541913_43761534|GENSCAN_predicted_peptide_2|313_aa MTFLIYEVVLAALIWVLWGGGFEFLGAQSAVAIGRKGPTFLGREVELPLKEAEGGCQKRE GEPALSFSFVNNLSKELSTLTASTSSPPMDSSTRFPPFHGNDLWFNEVQTLVNFEIALAK VINGQLVFKLRLGSCPHALLALGWKDSENSKDIKQIHNGLSSINMSQVQTEMPQAPTWFN ENIHQLCLCSHFTNKQGGEMDSEIVPGEPARFASSVKRGNNPRPVSPHDRKGSLRQGSSA LMLYLGLIARESGLDGSPEHIKHCPLPRCSDCSSCHGDKDIFFLFQRWASEQSRQIETFS AGQLASAGRSDPS >gi568815581r:43541913_43761534|GENSCAN_predicted_CDS_2|942_bp atgacctttctgatttacgaggtggtgctggctgcactcatttgggttctgtggggaggt ggctttgagttcctcggggcccagtctgcagtggctattgggagaaaaggccccaccttc ctgggcagagaagtggagcttccacttaaagaggcagaaggaggctgtcagaagagggaa ggggaaccagcgctctccttcagctttgtcaataacttatcgaaggagctgtccacactc actgcctccacttcctcacctcccatggattcctcaactcgctttccacccttccatgga aatgatctgtggtttaatgaggtgcagactttggtcaattttgaaattgctcttgctaaa gtcatcaatggccaacttgtcttcaaactacggcttggttcttgtcctcatgctcttctc gctctgggatggaaggactcggagaattcgaaggatattaaacagattcataatggactc agcagtataaacatgtctcaagtgcagacagaaatgccacaagctcctacttggtttaat gaaaacatacaccagttgtgtctgtgttcccattttacaaataaacaaggtggggagatg gactcggaaattgtccctggggagccagccaggtttgcctcatctgtgaaacggggtaac aatcccagacctgtgtcgccgcatgatcgcaagggctctctgagacaagggagctcggct ctaatgctgtacctgggcctcattgccagagagtctggtttagatgggtccccagagcac attaagcactgtcctctgccaaggtgctcagactgcagctcctgtcatggggacaaggat atcttcttcctgttccagagatgggccagcgagcagtctagacagatagagaccttcagt gctggccagctggcttctgctggcaggtcggacccgtcgtga >gi568815581r:43541913_43761534|GENSCAN_predicted_peptide_3|968_aa MELTGSAGKEMDNGRITNKHLVQCPGMESSRKKIKAEPGIEPGTYREGGGAIVLTYALGI GVGITGNTVQQPPQLTDSASIRQEDAFDNKIDIAEDGGQTPYEATLQQSFQYSPTTDLPP LTNGYLPSISMYEIQTKYQSHNQYPNGNSKQKTTLNSRKPFPSTATTSVPQTVIPKKSGS PEVKLKITKTIQNGRELFKSSLCGDLLNEVQASEHTKSKHESRKEKRKKPKKHDSSRSEE RKSHKIPKLEPEEQNRPNERVHTISEKPREDPVLKEEAPVQPILSSVPTTEVSTGVKFQV GDLVWSKVTVTPCWVPRLRGRRSHHCSSCLEILVLVPALSLKRSFMVSSLKFLTSTGKQK PTFKGTAQMGWSPMASTTNVSLLLGHWEGTDQMSSRGPEFGGRRWVWQHQKPQIRISICH RPGKEPLRLSFLRCEVERRISSLATSQGCWCSPPDHVCEKCLEDYAGRRHLTLRAQEAFL GPDSRTGSLRAVGKRYCRNSQHQRYLLQGLLGGFLEERNANEYDCKLETREAASSTPRIP YSPTHILQSESAPNHYFPYHVSLSKFLKRKANSHFLHLCAVVAVRRRSNMPGTRGWGGHK QKQPCPAKYTPACHAQWETFRKFHVMAQKPDFLATATAAYPDFSASCLAATPHSLPQEEH IFTEQHPAFPQSPNWHFPVSDARRRPNSGPAGGSKEMGTSSLGLVDTTGGPGDDYGVLGS TANETEKKSSRRRKESSGGGAALAAAKAGGLLGDPGPGLRVLPCCPRSRFRRGLDLPSPA WSQRPPGRQPAPGTRPPALLPGRAPRAANPDRTRTPGPGCHDTPDPGTGTIGPLPRLLEF PERWHPLPFHPPAEATDAQREVGQAPKSLYHNQENRGKPEGSSKARKERTAFTKEQLREL EAEFAHHNYLTRLRRYEIAVNLDLSERQVKVWFQNRRMKWKRVKGGQPISPNGQDPEDGD STASPSSE >gi568815581r:43541913_43761534|GENSCAN_predicted_CDS_3|2907_bp atggagctcacagggagtgctggcaaggagatggataatggacggataacaaataaacat ttagtacaatgtccgggaatggaaagttctcgaaagaaaattaaagctgagcccggcatt gagcctggtacttacagggagggtggtggagccatcgtcctcacgtatgcgctggggatc ggggttgggatcacgggaaacacagttcaacaaccacctcaactcactgactccgccagc atccgtcaggaggatgcctttgataacaaaattgacattgctgaagatggtggccagaca ccatacgaagctaccttgcagcaaagctttcaatactcacctacaacagatcttcctcca ctcacaaatggctacctgccatcaatcagcatgtatgaaattcaaaccaaataccagtcg cataatcaatatcctaatggaaattctaaacagaagaccacattaaattctagaaaaccc ttcccctccacagccaccacttcggtaccacaaactgtgattccaaagaagagtggctca cctgaagttaaactaaaaataaccaaaactatccagaatggcagggaattgttcaagtct tccctttgtggagaccttttaaatgaagtacaggcaagtgagcacacgaagtcaaagcat gaaagcagaaaagaaaagaggaaaaaacccaaaaagcatgactcatcaagatctgaagag cgcaagtcacacaaaatccccaaattagaaccagaggaacaaaatagaccaaatgagagg gttcacaccatatcagaaaaaccaagggaagatccagtactaaaagaggaagccccagtt cagccaatactatcttctgttccaacaacagaagtgtccactggtgttaagtttcaagtt ggtgatcttgtgtggtccaaggtgacggtcacaccctgttgggtgccccgcctgcgagga cggaggagccatcactgttccagctgcctggagatcttggtgctggtgccagccctcagc ctcaagaggtctttcatggtttcttccttgaagttcctcacctccacgggcaaacagaag cccacattcaagggaactgcccagatgggctggtcacctatggcctccacgaccaatgtc tccctgctccttggtcattgggaaggaacagaccagatgtcatccaggggcccggaattt ggggggcgccgctgggtgtggcagcatcagaagcctcagatccgcatctccatctgccac aggccagggaaggaacctctgagactcagtttcctacgatgtgaagtggagagaagaatc tcctctttagccacctctcagggctgctggtgttcgcccccagaccacgtctgtgagaaa tgcttagaagactatgcagggcgccgccatttgacactcagagcccaggaagcctttctt ggtccagacagcaggactggaagccttagagctgtcggcaagagatactgcaggaacagc cagcaccagagatatctcctgcaaggcctcctaggtgggttcttggaagaaaggaatgcc aatgaatatgattgcaagctagagacgagagaagcggcgtcctcaactccaagaatcccg tattccccaacccacatccttcagtctgaaagtgcccctaaccactactttccctaccac gtctccctttccaagttcctcaaacgcaaagcaaacagccatttcctgcacctgtgtgca gtcgtagcagtacgtaggagatccaatatgcctggcacaagggggtggggtggccacaaa cagaagcagccctgtcctgccaagtacacgcctgcctgccacgcacaatgggagacattc cgcaagttccacgtgatggctcagaaaccagacttcctggcgacagcgacggcagcgtac cctgacttctcagcctcctgcctggcagccaccccacacagcctgccccaggaggagcac atcttcactgagcagcaccccgctttcccacagtcccccaactggcacttccctgtctca gacgcccggcgcaggcccaactcaggcccggcagggggttccaaggaaatggggaccagc agcctgggcctggtggacaccacaggaggcccaggcgatgactacggggtgcttgggagc actgccaatgagacagagaagaaatcatccaggcggagaaaggagagttcaggagggggc gcggcgctggcggcggccaaggccggagggctcctcggggaccccggcccggggctgcgc gtccttccctgctgcccacgctcccgcttccggaggggtctggacctgccttctccggcc tggagccagaggccgccgggcaggcagcccgcgccaggaactcggccccctgccctgctc cccgggcgcgccccgcgggcggccaacccggatcggacgcggacccccgggccagggtgc cacgacacccctgacccgggcaccgggactatcgggcccctcccccgccttctcgagttc ccggaacggtggcacccactccccttccatcccccggcggaggccacggacgcgcagcgc gaggtgggccaggcgcccaaatcactttaccacaaccaggagaacagagggaagccggag ggcagcagcaaagcccgcaaggagaggacggccttcaccaaggagcagctgcgagagctg gaggcagagtttgcccatcataactacctgactcggctccgcagatatgagattgcggta aacctggacctctctgagcgccaggtcaaagtgtggttccagaaccgaaggatgaagtgg aagcgtgtgaagggaggtcagcccatctcccccaatgggcaggaccctgaggatggggac tccacagcctctccaagttcagagtga >gi568815581r:43541913_43761534|GENSCAN_predicted_peptide_4|367_aa MQLPLALCLVCLLVHTAFRVVEGQGWQAFKNDATEIIPELGEYPEPPPELENNKTMNRAE NGGRPPHHPFETKALVEEQSCLEVGEEWLAGAAFKFRCRGMRQQTLVRAQGREDAGVVRV WHQGIRTGSGAQKRKGFKESPPGNIGATSSCWYHWEGNKVAEKSTAQAPGPQPSPPHNLL GLWGAKKGNCMGGQPGPSVFKMEDKGASPHSSPSRQGQLGSSDCLKGYVSEYSCRELHFT RYVTDGPCRSAKPVTELVCSGQCGPARLLPNAIGRGKWWRPSGPDFRCIPDRYRAQRVQL LCPGGEAPRARKVRLVASCKCKRLTRFHNQSELKDFGTEAARPQKGRKPRPRARSAKANQ AELENAY >gi568815581r:43541913_43761534|GENSCAN_predicted_CDS_4|1104_bp atgcagctcccactggccctgtgtctcgtctgcctgctggtacacacagccttccgtgta gtggagggccaggggtggcaggcgttcaagaatgatgccacggaaatcatccccgagctc ggagagtaccccgagcctccaccggagctggagaacaacaagaccatgaaccgggcggag aacggagggcggcctccccaccacccctttgagaccaaagcactggtcgaggaacagtct tgcctggaggtgggggaagaatggctcgctggtgcagccttcaaattcaggtgcagaggc atgaggcaacagacgctggtgagagcccagggcagggaggacgctggggtggtgagggta tggcatcagggcatcagaacaggctcaggggctcagaaaagaaaaggtttcaaagaatct cctcctgggaatataggagccacgtccagctgctggtaccactgggaagggaacaaggtg gcagagaagtccactgcccaggctcctggaccccagccctccccgcctcacaacctgttg ggactatggggtgctaaaaagggcaactgcatgggaggccagccaggaccctccgtcttc aaaatggaggacaagggcgcctccccccacagctccccttctaggcaaggtcagctgggc tccagcgactgcctgaagggctacgtgtccgagtacagctgccgcgagctgcacttcacc cgctacgtgaccgatgggccgtgccgcagcgccaagccggtcaccgagctggtgtgctcc ggccagtgcggcccggcgcgcctgctgcccaacgccatcggccgcggcaagtggtggcga cctagtgggcccgacttccgctgcatccccgaccgctaccgcgcgcagcgcgtgcagctg ctgtgtcccggtggtgaggcgccgcgcgcgcgcaaggtgcgcctggtggcctcgtgcaag tgcaagcgcctcacccgcttccacaaccagtcggagctcaaggacttcgggaccgaggcc gctcggccgcagaagggccggaagccgcggccccgcgcccggagcgccaaagccaaccag gccgagctggagaacgcctactag