GENSCAN 1.0 Date run: 6-Nov-116 Time: 02:08:18 Sequence gi568815577f:17413130_17665689 : 252560 bp : 40.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2051 2175 125 0 2 129 43 107 0.968 7.87 1.02 PlyA + 2461 2466 6 1.05 2.03 PlyA - 3095 3090 6 1.05 2.02 Term - 9222 9144 79 2 1 84 43 46 0.424 -4.04 2.01 Init - 12968 12850 119 2 2 81 111 78 0.827 9.12 2.00 Prom - 29813 29774 40 -2.95 3.04 PlyA - 31111 31106 6 1.05 3.03 Term - 36191 36141 51 2 0 58 33 113 0.425 -0.75 3.02 Intr - 41803 41621 183 1 0 63 103 45 0.332 2.56 3.01 Init - 57589 57536 54 1 0 86 83 64 0.572 5.03 3.00 Prom - 61454 61415 40 -4.15 4.00 Prom + 78387 78426 40 -4.55 4.01 Init + 100001 100043 43 1 1 129 92 102 0.995 13.40 4.02 Term + 100425 100561 137 1 2 85 49 112 0.851 4.20 4.03 PlyA + 102153 102158 6 1.05 5.02 PlyA - 102360 102355 6 1.05 5.01 Sngl - 105864 105361 504 0 0 67 47 587 0.736 48.29 5.00 Prom - 122382 122343 40 -3.95 6.00 Prom + 123121 123160 40 -9.45 6.01 Init + 125213 125262 50 1 2 86 83 47 0.667 4.47 6.02 Intr + 125932 126011 80 1 2 121 36 70 0.917 3.38 6.03 Intr + 133898 134064 167 0 2 90 103 197 0.965 20.16 6.04 Intr + 138620 138824 205 1 1 70 111 134 0.997 11.75 6.05 Intr + 145847 146002 156 0 0 102 84 100 0.990 10.06 6.06 Intr + 147573 147695 123 1 0 107 99 100 0.999 12.64 6.07 Intr + 148209 148347 139 1 1 61 91 88 0.941 5.00 6.08 Intr + 152299 152482 184 1 1 79 65 153 0.140 10.97 6.09 Term + 169614 169775 162 2 0 59 35 89 0.061 -2.35 6.10 PlyA + 170272 170277 6 1.05 7.02 PlyA - 171207 171202 6 1.05 7.01 Sngl - 175389 174436 954 0 0 60 49 262 0.981 15.86 7.00 Prom - 180507 180468 40 -6.35 8.14 PlyA - 180547 180542 6 1.05 8.13 Term - 181203 180964 240 0 0 125 32 103 0.984 3.44 8.12 Intr - 185695 185488 208 0 1 59 91 272 0.994 22.76 8.11 Intr - 191868 191731 138 2 0 74 109 116 0.998 10.96 8.10 Intr - 196023 195843 181 1 1 70 103 142 0.294 11.90 8.09 Intr - 199743 199570 174 1 0 67 77 143 0.275 10.09 8.08 Intr - 206836 206619 218 0 2 26 57 132 0.485 1.12 8.07 Intr - 219187 219074 114 0 0 100 29 83 0.057 2.24 8.06 Intr - 221006 220895 112 0 1 99 53 27 0.215 -1.18 8.05 Intr - 223844 223751 94 2 1 50 56 105 0.153 2.12 8.04 Intr - 228015 227818 198 0 0 24 20 179 0.267 3.33 8.03 Intr - 245392 245344 49 0 1 89 72 54 0.499 1.66 8.02 Intr - 245646 245470 177 2 0 57 101 92 0.649 5.51 8.01 Init - 251572 251472 101 1 2 59 107 65 0.578 5.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_1|41_aa XNEIFRALKFSQTPQAAAPLFLRVPVAIPTCLYGSTSHTVL >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_1|126_bp ngaaatgaaatattccgtgctctgaagttttcccagacacctcaagcagcagctcctctg ttcctccgtgttcctgtagcaattcccacctgcctctatggaagtacttctcacactgtt ctgtga >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_2|65_aa MAECGETAGNRELHNKPAMGLFLNILVIGGPKSGEKHWLRRIIQTLPTGYIDILVLASSD TMHFC >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_2|198_bp atggcagaatgtggagaaacagctgggaacagagaactccacaacaaacctgcaatgggt cttttcctaaatatattagtgataggtggccctaaaagtggagagaaacactggctcaga agaattattcaaactctccctactggatatatagacatccttgtgcttgccagcagtgat accatgcacttctgctag >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_3|95_aa MGRARWLMPAIPALWEAESLATKNTFQSRSNHVTAFPQLFYRCMGGSVVEFSPATREARV RFPAHAARKCVLDRAREVALNGIAPPHLLREPVRH >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_3|288_bp atgggccgggcgaggtggctgatgcctgcaatcccagcactttgggaggccgagtctctt gcaacaaagaacaccttccagtcgcgttccaatcatgttactgcttttccccaactcttc tatcggtgcatgggtggttcagtggtagaattctcgcctgccacgcgggaggcccgggtt cgattcccggcccatgcagcacgaaaatgtgttttggaccgtgcgcgggaggttgcgctt aatggaattgctccacctcaccttttacgagaaccagtacgacattag >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_4|59_aa MALLLCFVLLCGVVALPDFKNLGRRVFRCRVAQAGRLNCKRHYSMDMELPGASDRPGYL >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_4|180_bp atggcgctcctgctgtgcttcgtgctcctgtgcggagtagtggcgttacccgatttcaaa aaccttgggcggcgagttttcaggtgtcgagtcgcccaggcggggaggctgaattgcaaa cgccattattcaatggacatggagctgcccggtgcctcggatcggccaggctacctctag >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_5|167_aa MNQEKLAKLQAQVRIGGKGTARRKKVVHKTAMADDKKLQSSLKKLAVNNIVGIEEMNMIK DDGTVIHFNNPKVQASLSANTFAITGHAEAKPITEMIPGILSQLGADSLTSLRKLAEQFP RQVLDSKAAKPEDTDEEDDDVPDLVENFDEASKNEAGMVFGSWHGLD >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_5|504_bp atgaatcaagaaaagttagccaaacttcaggctcaggtccggatagggggcaagggtaca gctcgcagaaagaaggtggtacataaaacagccatggctgatgacaaaaagcttcagagt tctctaaaaaaactggctgtgaataatatagttggtattgaagagatgaacatgattaaa gatgatgggacagttattcacttcaacaatcccaaagtccaagcttccctttctgctaac acctttgcaattactggtcatgcagaagccaaaccaatcacagaaatgatacctggaata ttaagtcagcttggtgctgacagtttaacaagccttaggaagttagctgaacagttccca cggcaagtcttggacagtaaagcagcaaaaccagaagacactgatgaggaggatgatgat gttcccgatcttgtagaaaattttgatgaggcatcaaagaatgaagctggcatggttttt ggaagctggcatggactagattga >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_6|421_aa MMMEMMMAVLYCGSGKWFVQMIGDAAISCICDLMQGLIAKSVGNFARSLSITTPEEMIEK AKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQVIILYSGDKIYDDYYPDLKGRV HFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVLVKPSGARCYVDGS EEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPTSWLAEMTSSVISVKNASSEYSGTYS CTVRNRVGSDQCLLRLNVVPPSNKAGLIAGAIIGTLLALALIGLIIFCCRKKRREEKYEK EVHHDIREDVPPPKSRTSTARSYIGSNHSSLGSMSPSNMEGYSKTQYNQVPSEDFERTPQ SPTLPPAKLSEFLPRAVLWNQHKLMKKVSYLAFEKMCLGYILGQLSHTIPSSMLAAGTVL F >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_6|1266_bp atgatgatggagatgatgatggctgtattatactgtgggagtgggaagtggtttgtgcag atgattggtgatgcagctatttcttgcatctgtgatcttatgcagggattgattgccaaa agtgttggtaatttcgccagaagtttgagtatcactactcctgaagagatgattgaaaaa gccaaaggggaaactgcctatctgccatgcaaatttacgcttagtcccgaagaccaggga ccgctggacatcgagtggctgatatcaccagctgataatcagaaggtggatcaagtgatt attttatattctggagacaaaatttatgatgactactatccagatctgaaaggccgagta cattttacgagtaatgatctcaaatctggtgatgcatcaataaatgtaacgaatttacaa ctgtcagatattggcacatatcagtgcaaagtgaaaaaagctcctggtgttgcaaataag aagattcatctggtagttcttgttaagccttcaggtgcgagatgttacgttgatggatct gaagaaattggaagtgactttaagataaaatgtgaaccaaaagaaggttcacttccatta cagtatgagtggcaaaaattgtctgactcacagaaaatgcccacttcatggttagcagaa atgacttcatctgttatatctgtaaaaaatgcctcttctgagtactctgggacatacagc tgtacagtcagaaacagagtgggctctgatcagtgcctgttgcgtctaaacgttgtccct ccttcaaataaagctggactaattgcaggagccattataggaactttgcttgctctagcg ctcattggtcttatcatcttttgctgtcgtaaaaagcgcagagaagaaaaatatgaaaag gaagttcatcacgatatcagggaagatgtgccacctccaaagagccgtacgtccactgcc agaagctacatcggcagtaatcattcatccctggggtccatgtctccttccaacatggaa ggatattccaagactcagtataaccaagtaccaagtgaagactttgaacgcactcctcag agtccgactctcccacctgctaagttgtcggaatttttgccaagagcagttttatggaat cagcacaaactaatgaagaaagtgagttatcttgcatttgaaaagatgtgtctaggttac atcttaggtcagctgtcacacactatcccaagttcaatgttggctgctggcactgtgctc ttttaa >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_7|317_aa MSELPFTIASKRIKYLRIQPTRDVKDLFKENYKPLLNEIKEDTNKRKNIPCSWVGRINIV KMAILPKAIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIANSILSQKNKAGGITLP DFKLHYKPTVTKTAWYWCQNRDINQSNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAIYRKLKLDPFLTPYTKINSRWIKDLHVRPKTITLEENLGNTIQDIGMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWENIFATYSPDKGLISRIY NELQQIYKKKTTPSKSG >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_7|954_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctacgaatccaacct acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaacggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggcaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaactcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactacactacaagcctacagtaaccaaaacagcatggtattggtgccaaaac agagatataaaccaatcgaacagaacagagccctcagaaataacgccacatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatatagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttacatgttagacctaaaacc ataaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggagaatatttttgcaacctactcacctgacaaagggctaatatccagaatctac aatgaactccaacaaatttacaagaaaaaaacaaccccatcgaaaagtgggtga >gi568815577f:17413130_17665689|GENSCAN_predicted_peptide_8|667_aa MNKAAKNICAGFCVDSSFQLIWVNIKEYDCWIIRNILEDWWALKVLSMGLDWVHPWRLRD EPPSLFCKKSQFTVRKQGAEINHTIPEKVQAEIYSLKYNGLMPNPVNMEDFELESDDLVY LVEEISKQQSIHDVAWLLLVAFVYICEQRNDLKSELIFTREAEHKSLENLEPDHVMPSSC IEEPVKVFQALSQWSHQLTAKINSQTYYAISPNDLLIFTSVCEAILRYKLHIANENFRAS LTLLHFHSGFNLNTDYKTLQTLGEKCEFCQLRMVHSESLTMCLIQSKPLTLFNSTKAERG EAVADEKLEAIRGWSMRFMEKSHLHNIKVQGEAVNADRETEAIYPEDLAKNMVEVLSTRA WPPDDAGAARAGRGSLRSLLPSAGPLRRSPQFPARTRSGPPNLRPKSGGGSGGKKMKNEI AAVVFFFTRLVRKHDKLKKEAVERFAEKLTLILQEKYKNHWYPEKPSKGQAYRCIRVNKF QRVDPDVLKACENSCILYSDLGLPKELTLWVDPCEVCCRYGEKNNAFIVASFENKDENKD EISRKVTRALDKVTSDYHSGSSSSDEETSKEMEVKPSSVTAAASPVYQISELIFPPLPMW HPLPRKKPGMYRGNGHQNHYPPPVPFGYPNQGRKNKPYRPIPVTWVPPPGMHCDRNHWIN PHMLAPH >gi568815577f:17413130_17665689|GENSCAN_predicted_CDS_8|2004_bp atgaataaagctgctaaaaacatctgtgcaggtttttgtgtggacagcagttttcaactc atttgggtaaacatcaaggaatatgattgttggatcatacgaaacattctcgaagactgg tgggctttgaaggttctaagcatgggccttgactgggtccacccgtggcgtttaagagat gagccgccatctttattttgcaagaaaagtcagtttacagttagaaaacaaggagcagaa ataaatcataccattccagaaaaggtacaagctgaaatatattcccttaaatataatggg cttatgcccaatcccgtgaacatggaggactttgaacttgagagtgatgatttagtgtat ctggtggaagaaatttctaagcagcaaagcattcacgatgtggcctggttgcttctagta gcctttgtttatatttgtgagcaaagaaatgacctgaaatcggaacttatatttacaagg gaagcagagcataaaagtttggaaaatttggagcctgaccatgtgatgccaagcagctgc atagaggagccagtgaaggtgttccaggcactgtcccagtggagccatcagctaacagcc aaaatcaactcccagacatactatgccatcagcccaaatgacctgctaatctttaccagt gtatgtgaggctattctgagatataagcttcacatagctaatgaaaacttcagagctagt ctcacgttgttacattttcactcaggcttcaatcttaacacagactataaaacccttcaa actcttggggagaagtgtgaattttgtcaactgcgaatggtccactctgaatctctcact atgtgcctaatccagagcaagcccctaactctcttcaattctaccaaggctgagagaggt gaggcagttgcagatgaaaagttggaagctatcagaggttggtccatgaggtttatggaa aaaagccatctccataacataaaagtgcaaggcgaagcagtaaatgctgatagagaaact gaagcaatttatccagaagatctagctaagaacatggttgaagtcctctcaacgcgcgct tggccgcccgacgacgcgggagccgcacgcgccggacgaggctcgctgcgctccctgttg cccagcgcgggcccgttgaggcggagccctcagttcccggccaggacacggtctgggccg ccgaatctccggccgaagagcggcggcggcagcggcgggaaaaaaatgaagaatgaaatt gctgccgttgtcttctttttcacaaggctagttcgaaaacatgataagttgaaaaaagag gcagttgagaggtttgctgagaaattgaccctaatacttcaagaaaaatataaaaatcac tggtatccagaaaaaccatcgaaaggacaggcctacagatgtattcgtgtcaataaattt cagagagttgatcctgatgtcctgaaagcctgtgaaaacagctgcatcttgtatagtgac ctgggcttgccaaaggagctcactctctgggtggacccatgtgaggtgtgctgtcggtat ggagagaaaaacaatgcattcattgttgccagctttgaaaataaagatgagaacaaggat gagatctccaggaaagttaccagggcccttgataaggttacctctgattatcattcagga tcctcttcttcagatgaagaaacaagtaaggaaatggaagtgaaacccagttcggtgact gcagccgcaagtcctgtgtaccagatttcagaacttatatttccacctcttccaatgtgg caccctttgcccagaaaaaagccaggaatgtatcgagggaatggccatcagaatcactat cctcctcctgttccatttggttatccaaatcagggaagaaaaaataaaccatatcgccca attccagtgacatgggtacctcctcctggaatgcattgtgaccggaatcactggattaat cctcacatgttagcacctcactaa