GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:27:57 Sequence gi568815597r:97620891_98020922 : 400032 bp : 34.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 2789 3112 324 1 0 88 41 140 0.836 5.05 1.02 PlyA + 4360 4365 6 1.05 2.12 PlyA - 4982 4977 6 1.05 2.11 Term - 18298 18132 167 2 2 48 42 100 0.000 -1.50 2.10 Intr - 58292 58205 88 2 1 57 95 89 0.193 5.12 2.09 Intr - 70908 70827 82 2 1 117 94 127 0.209 14.92 2.08 Intr - 78657 78461 197 0 2 100 90 95 0.672 8.19 2.07 Intr - 100781 100620 162 2 0 66 87 66 0.183 3.55 2.06 Intr - 119589 119502 88 2 1 74 94 61 0.276 4.35 2.05 Intr - 153400 153309 92 1 2 69 98 49 0.038 1.87 2.04 Intr - 153643 153497 147 1 0 70 42 84 0.031 1.51 2.03 Intr - 200171 200123 49 1 1 114 75 -3 0.028 -1.24 2.02 Intr - 207306 207224 83 0 2 36 107 171 0.296 11.32 2.01 Init - 215060 215034 27 2 0 77 105 17 0.367 1.95 2.00 Prom - 221898 221859 40 -6.55 3.00 Prom + 222423 222462 40 -3.85 3.01 Init + 223754 223794 41 1 2 98 92 62 0.383 7.31 3.02 Intr + 225006 225159 154 0 1 9 80 88 0.033 -0.75 3.03 Term + 234766 234972 207 0 0 -9 43 232 0.568 5.36 3.04 PlyA + 237555 237560 6 1.05 4.00 Prom + 238608 238647 40 -4.25 4.01 Init + 243761 243810 50 1 2 86 63 84 0.355 6.17 4.02 Term + 248147 248273 127 2 1 122 49 88 0.900 5.07 4.03 PlyA + 248382 248387 6 1.05 5.00 Prom + 252555 252594 40 -2.65 5.01 Init + 256048 256155 108 0 0 78 64 97 0.309 6.57 5.02 Intr + 277701 277836 136 2 1 113 88 11 0.096 2.82 5.03 Intr + 285233 285318 86 0 2 52 67 46 0.006 -2.48 5.04 Term + 299943 300146 204 2 0 60 49 195 0.927 9.19 5.05 PlyA + 305056 305061 6 1.05 6.03 PlyA - 305433 305428 6 1.05 6.02 Term - 313279 313144 136 0 1 111 44 125 0.938 6.91 6.01 Init - 319232 319165 68 2 2 75 78 25 0.714 0.80 6.00 Prom - 326168 326129 40 -6.75 7.05 PlyA - 326416 326411 6 1.05 7.04 Term - 332929 332828 102 1 0 78 54 92 0.658 2.10 7.03 Intr - 334930 334836 95 2 2 53 81 48 0.043 -0.64 7.02 Intr - 374110 374026 85 1 1 100 93 71 0.467 7.27 7.01 Intr - 399089 399004 86 0 2 94 76 52 0.179 3.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_1|107_aa MSKKNKTGGITLPDFKLYNKARVIKTERYSPKKNRHINQGNRVEHPEINPCIYDQLIFDK GAKNKQWVKDSLIEKYYWENWISTCRRTKFDLAPYKKPTQNVLNTET >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_1|324_bp atgagcaaaaagaacaaaactggaggtatcacattacctgatttcaaactatacaacaaa gctagagtaattaaaacagaaaggtattcgccaaaaaaaaatagacacatcaaccaaggg aacagagtagagcatccagaaataaacccatgcatatatgatcaactgatttttgacaaa ggtgccaagaataagcaatgggtaaaggatagtctcattgaaaagtactattgggaaaac tggatttccacatgtagaagaacaaaatttgaccttgcaccatataaaaaaccaactcaa aatgtactaaatactgaaacataa >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_2|393_aa MGQIVQSLKNCEKLENNFDDIKHTTLGERGALREAMSLTGSRGLWVIDICNWKVEAESVY LYRWSDLHLWQLSEMLILPSNIATASSKTLNIIKEEIRAFQTERERGGSVLEEWQILQFC IHTLLPGSWMSPRCLKCADAPCQKSCPTNLDIKSFITSIANKNYYGAAKMIFSDNPLGLT CGMVCPTSDLCVGGCNLYATEEGPINIGGLQQFATEVFKAMSIPQIRNPSLPPPEKMSEA YSAKIALFGAGPASISCASFLARLGYSDITIFEKQEYVGGLSTSEIPQFRLPYDVVNFEI ELMKDLGVKIICGKSLSVNEMTLSTLKEKGYKAAFIGIETPRLFKTVCLTVSKWVVLVEC NPVSVYLLPHQLPTKKICLTTLLMGNNAGLFCG >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_2|1182_bp atgggtcagattgttcagagcttaaagaattgtgagaagctggagaataattttgatgac atcaagcacacgactcttggtgagcgaggagctctccgagaagcaatgagtctcacaggc agtagagggctatgggtaatagatatatgcaattggaaggtagaggctgagtcggtgtac ctctacaggtggtctgatttgcacctttggcagctgagtgaaatgctcattttacctagt aacattgccacagcctctagcaaaactcttaatatcataaaggaagagataagagccttt caaacagaaagagagagaggtggcagtgttttggaagagtggcagatcctacagttttgc attcatactctccttccaggatcctggatgagccccagatgcctgaaatgtgcagatgcc ccgtgtcagaagagctgtccaactaatcttgatattaaatcattcatcacaagtattgca aacaagaactattatggagctgctaagatgatattttctgacaacccacttggtctgact tgtggaatggtatgtccaacctctgatctttgtgtaggtggatgcaatttatatgccact gaagagggacccattaatattggtggattgcagcaatttgctactgaggtattcaaagca atgagtatcccacagatcagaaatccttcgctgcctcccccagaaaaaatgtctgaagcc tattctgcaaagattgctctttttggtgctgggcctgcaagtataagttgtgcttccttt ttggctcgattggggtactctgacatcactatatttgaaaaacaagaatatgttggtggt ttaagtacttctgaaattcctcagttccggctgccgtatgatgtagtgaattttgagatt gagctaatgaaggaccttggtgtaaagataatttgcggtaaaagcctttcagtgaatgaa atgactcttagcactttgaaagaaaaaggctacaaagctgctttcattggaatagagaca ccaagattatttaaaactgtgtgccttacagttagcaaatgggtagtgctggttgaatgc aacccagtgtctgtatatcttctgcctcatcaactgcctacaaagaagatttgccttact acactgctgatgggtaacaatgcaggactgttctgtggctaa >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_3|133_aa MASATPSIGPAGASCSLTWSRDLYQCLELPTLLQQQASLAVCNGWTPHSLTHTPLATLQL ACPGQEIHRNTIHVAATKTAFEIHQNNEEQLQTALKDTSDKNSNLQGNTELLSQEAEEWK REEGANILSRIKC >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_3|402_bp atggccagcgctacaccctccatagggccagcaggagccagctgcagcctcacatggagc cgggacctgtaccagtgcctggagctgcccaccctgctgcagcagcaagcatccctggct gtgtgcaatgggtggaccccacactcactcactcacacacctcttgccactctgcaactg gcttgccctgggcaggagattcatcgaaacacaattcacgtagctgcaaccaaaacagcc ttcgaaatacatcagaacaatgaagaacaacttcagacagcattgaaagacacttcggat aaaaattccaatcttcagggaaacacagagctgctttcacaagaagctgaagagtggaaa agggaagagggagcaaacatactaagcagaataaaatgctag >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_4|58_aa MATREAVLRRKILREGSLAFSDFQYMLQVENGHPAAPSFHFIPAQTAQNEKESLVIDI >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_4|177_bp atggcgactcgagaagctgttctaagaagaaaaatacttcgagaaggaagcctggccttt tctgattttcagtacatgttgcaagtggaaaatggccaccctgcagcaccaagctttcat tttattcctgctcaaacagcccagaatgaaaaagaatctcttgtcatagatatctag >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_5|177_aa MRTKSKVKGPEILADLGERGSMPSSLGYSQRADKYQVEMCTGKNLIGLSCSNPEQYGLTV SPTKPTLNCNNPHVTRVGLGEARTAFLHLSLENDHLIHFLAPKLLDIEVKSTPHHPATTD EPARSPYLDVRRVLTEHRGHGSAYSLESASDKPSLRPQAPAREPSDSSRSASRKQAD >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_5|534_bp atgaggaccaaaagcaaagtgaagggcccagagatattggcagatcttggagaaagagga tcaatgcccagctccttaggctattctcagagggctgacaaataccaggtggaaatgtgt acaggaaagaacctgataggcctgagttgctcaaatcctgagcaatatggtttgactgtg tccccaaccaaacccaccttgaactgtaataatccccatgtcacacgggtggggctaggt gaagccaggacagccttccttcacctgtccttggaaaatgatcacctaattcattttctg gcccctaaattactggatattgaggtgaagagcactccccaccaccccgccaccaccgac gagccggcgcgaagtccgtacctcgatgtccgccgagtccttactgagcacaggggccat ggcagtgcctacagtctcgagtctgccagtgacaaaccctccttgcgtcctcaagctcca gccagagagccaagtgacagcagccggagcgcgagtcgaaaacaggcagactag >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_6|67_aa MVDSKTEARNIQDKLGAYCSAKKIVNTVAIENNRNNCFMTNGPEPYEMNLQALAQLTYAT NNHLQTV >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_6|204_bp atggttgattccaaaactgaggctaggaatatacaagataagcttggggcatattgtagt gccaaaaagattgtcaacactgtggcaattgaaaataacaggaataactgttttatgaca aatggacctgaaccctatgaaatgaatttgcaagcactggcacagttaacatatgctaca aacaaccatcttcagacagtgtag >gi568815597r:97620891_98020922|GENSCAN_predicted_peptide_7|122_aa XYVSMQYYTYLIYIGVDYKLQETDRKLLEASKASGMCSDCKNSDVVKKAYALPPLKPVWQ ECRNLPSVFQKLFSFLAVTCVASENGKYLPRREKSTVNYNAHKCLKDKTDQLIRNNMVCT ET >gi568815597r:97620891_98020922|GENSCAN_predicted_CDS_7|369_bp naatatgttagcatgcaatactacacttatctcatctacattggtgtagactacaagcta caggaaacagataggaagcttctagaggcatccaaagcctctgggatgtgttctgactgt aaaaactctgatgttgtgaaaaaagcttacgctttgcctccactcaaaccagtttggcaa gagtgcagaaatctcccatctgtttttcagaagctgttcagcttcttggctgtcacctgt gtagcttcagaaaatggcaaatatctgccaagacgagagaaaagcacagtcaattataat gcccacaaatgtctgaaagataaaacagaccagttgattagaaacaatatggtttgtact gaaacatga