GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:40:00 Sequence gi568815584f:93247269_93448117 : 200849 bp : 39.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 Intr - 1386 1208 179 1 2 91 91 248 0.984 24.12 1.12 Intr - 4384 4195 190 1 1 18 110 220 0.927 15.44 1.11 Intr - 6522 6379 144 0 0 26 90 194 0.957 12.96 1.10 Intr - 10087 9927 161 2 2 52 92 122 0.991 7.79 1.09 Intr - 14409 14334 76 0 1 69 99 38 0.552 1.17 1.08 Intr - 16725 16517 209 1 2 72 89 152 0.902 11.57 1.07 Intr - 47669 46590 1080 0 0 83 86 939 0.891 82.37 1.06 Intr - 58962 58881 82 0 1 92 85 7 0.238 -0.91 1.05 Intr - 85755 85552 204 0 0 63 84 214 0.278 16.97 1.04 Intr - 86363 86084 280 1 1 112 4 141 0.212 4.66 1.03 Intr - 100118 99991 128 2 2 18 94 130 0.042 5.16 1.02 Intr - 100562 100455 108 2 0 47 54 138 0.220 5.86 1.01 Init - 109970 109821 150 2 0 83 89 77 0.201 7.39 1.00 Prom - 110421 110382 40 -5.75 2.00 Prom + 110681 110720 40 -7.75 2.01 Sngl + 110952 111251 300 2 0 16 48 298 0.761 14.14 2.02 PlyA + 111827 111832 6 1.05 3.00 Prom + 112118 112157 40 -8.75 3.01 Init + 113150 113497 348 1 0 56 77 168 0.326 9.73 3.02 Intr + 122179 122291 113 0 2 50 91 89 0.402 3.66 3.03 Term + 145419 145488 70 0 1 61 38 100 0.594 -1.27 3.04 PlyA + 146260 146265 6 1.05 4.03 PlyA - 146388 146383 6 1.05 4.02 Term - 163944 163845 100 2 1 90 42 79 0.100 0.12 4.01 Init - 168510 168347 164 0 2 70 72 119 0.358 7.75 4.00 Prom - 169461 169422 40 -6.15 5.02 PlyA - 169630 169625 6 1.05 5.01 Sngl - 170871 169858 1014 0 0 88 43 670 0.994 59.26 5.00 Prom - 174875 174836 40 -3.25 6.04 PlyA - 176929 176924 6 1.05 6.03 Term - 183630 183388 243 0 0 130 48 253 0.390 20.42 6.02 Intr - 184038 183866 173 1 2 88 13 94 0.702 0.64 6.01 Init - 188172 188022 151 0 1 35 66 153 0.973 8.05 6.00 Prom - 196109 196070 40 -3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_1|997_aa MEYYAAMKTKEIMSFALTWIELEAIVLCELTHVLSELMQKQKTKYHMLSLGSPVPTFALW NFDVAQDFHHFIENARSSQCRGRPRMTGAAGAGAPSGRSGERAAGRAGPSGGSGGGQDSA HAGEAYRVRGILLRKTFQCLKDYFLTYYTSSKVVPVKVPCRQVLTIDKCSLPRSSHRNAR TVSNDFPRKSCSRERCTHCMRPPEGRRPPNGNMNSWKWPSGDESKWRRGEPGAVPEPEPL TGEERAAAAAVGGGWPGVLAPVRTAVLVAAAAEATAAVPAAAAAAGGASGHCNHPVMQGR NLSHRFLLPLHHSPYQFGTSSYSQQGYGCESKLYSLDHGHEKPQDKKKRTSGLATLKKKF IKRRKSNRSADHAKQMRELLSGWDVRDVNALVEEYEGTSALKELSLQASLARPEARTLQK DMADLYEYKYCTDVDLIFQETCFPVHRAILAARCPFFKTLLSSSPEYGAEIIMDINTAGI DMPMFSALLHYLYTGEFGMEDSRFQNVDILVQLSEEFGTPNSLDVDMRGLFDYMCYYDVV LSFSSDSELVEAFGGNQNCLDEELKAHKAVISARSPFFRNLLQRRIRTGEEITDRTLRTP TRIILDESIIPKKYATVILHCMYTDVVDLSVLHCSPSVGSLSEVQALVAGKPNMTRAEEA MELYHIALFLEFNMLAQGCEDIIAESISLDTLIAILKWSSHPYGSKWVHRQALHFLCEEF SQVMTSDVFYELSKDHLLTAIQSDYLQASEQDILKYLIKWGEHQLMKRIADREPNLLSGT AHSVNKRGVKRRDLDMEELREILSSLLPFVRIEHILPINSEVLSDAMKRGLISTPPSDML PTTEGGKSNAWLRQKNAGIYVRPRLFSPYVEEAKSVLDEMMVEQTDLVRLRMVRMSNVPD TLYMVNNAVPQCCHMISHQQISSNQSSPPSVVANEIPVPRLLIMKDMVRRLQELRHTEQV QRAYALNCGEGATVSYEIQIRVLREFGLADAAAELLQ >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_1|2991_bp atggaatactatgcagccatgaaaacaaaggagatcatgtcctttgcactaacatggata gagctggaggccatcgtcctatgtgaattaacacacgtcctaagtgaattaatgcagaag cagaaaaccaaataccacatgctctctcttgggtctccggttccaaccttcgcactctgg aacttcgatgtggcccaggatttccaccattttatcgaaaacgctagatcttcgcagtgc cgaggacgtccgaggatgacaggggccgctggcgcgggggccccgagtgggcgaagcggg gagcgggctgcaggccgagcagggccaagcggcggtagtggcggcgggcaggacagcgcc cacgcaggagaggcatatcgcgtcaggggtatcttgttaaggaaaacttttcagtgtctt aaagattattttttaacctactacacaagtagcaaggtggtgcctgtaaaagtaccatgc agacaagtgcttacgatagacaagtgctcactgccaagaagcagccaccggaatgccagg actgtcagcaatgactttccgcggaagagttgttctcgagaaagatgcactcattgcatg cgaccacccgagggacgccggccgccgaacggaaacatgaattcgtggaaatggccatct ggtgacgagtcaaaatggcggcgaggggaacctggagcagtcccggagcctgagccactg acaggagaggagagggcggcggcggcggcggtgggaggaggatggccgggggtgctggcg ccggtgcggacggcggtgctggtggcggcggcggcggaggcgacggcagcggtcccagcg gcagcagcagcggcgggaggagcctccgggcattgtaaccacccagttatgcaaggtaga aacctcagtcatcgtttccttcttcctctccaccattccccttaccagtttgggacctca tcctattctcagcaaggctatggttgcgaatcaaagttgtatagccttgaccatggccat gagaaaccacaagacaaaaaaaagagaacctctggtcttgccaccctcaaaaagaagttt attaagcgtcggaaatctaataggtctgccgatcatgccaagcagatgcgagaactcctc tctgggtgggatgttagagatgtcaatgcattagtggaggaatatgagggaacatcagca ttaaaggagctttctctacaagccagtttggctagaccagaagcccggacattgcagaaa gatatggctgatctttatgagtacaagtattgtactgatgtagacttaatatttcaagaa acttgttttcctgttcatcgtgccattttggcagcaaggtgtccattttttaaaacactg ctttcttcctcaccagagtatggggcagagataataatggacatcaatacagctggtatt gatatgcccatgttttctgctttgttacactacctttatacaggagagtttggaatggag gactcaaggtttcaaaatgtcgatatccttgttcagcttagtgaagaatttggaacacca aattcccttgatgtagatatgcgtggactctttgattacatgtgttattatgatgtcgtc cttagtttttcttcagactctgaactggttgaagcttttggtggaaatcagaactgttta gatgaagagctcaaagcccacaaggctgttatttctgcacggtccccattttttcgaaat ttattacaaaggaggatacgaactggtgaagaaatcacagaccgaactttgaggactccc acaagaattatattagatgagtccattataccaaaaaaatatgcaacagtgatattacac tgtatgtataccgacgtggtggacctctctgttttgcactgtagcccctctgtggggagt ctcagtgaagttcaggctctcgtcgcagggaagccaaacatgaccagggcagaagaagcc atggaactttaccacatagcactgttcttggaatttaacatgcttgcacaaggctgtgag gatatcattgctgagagcatctcattagataccttaattgccatcctcaagtggagttct catccatatggctctaaatgggtgcaccgacaagctttacatttcctctgtgaggaattt tcccaggtcatgacttcggatgttttttatgaactcagcaaagaccatctgcttactgct atccagtctgactacctacaggcaagtgaacaagatatccttaaatatctgattaaatgg ggagagcatcagttgatgaaaagaatagcagatagagagccaaacttactgagtggcact gcccatagtgtgaacaaaagaggtgtaaaaagacgggacctggacatggaagagctcaga gagatcctttcttctctcttaccttttgtgcgaattgaacacatcttacctataaacagt gaagtcttaagtgatgcaatgaaaagaggcttgattagtactcctccatcagatatgctt cctacaacagaaggtgggaagtcaaatgcctggttacggcaaaaaaatgctggcatctat gttcgtcctcgactcttctctccctatgtggaagaagcaaagtcagtgctagatgagatg atggtggaacaaacggatcttgtgcgcttgcgaatggttagaatgtccaatgtgccagac acgctctacatggtcaataatgccgtgccacagtgttgtcacatgatcagccaccagcag atcagcagcaaccagtcaagccctccttcagttgtagccaacgaaattccagttcctcgt ctcctcattatgaaagacatggtcagacgactgcaggaactgcggcacacggagcaggtg cagagggcctatgccctgaactgcggggaaggcgccactgtcagctatgaaattcagatt cgagtgctaagagagtttggtcttgcagatgctgctgcagagctgttgcag >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_2|99_aa MHLTPLRGKEFSDSIHHTFDYMWRTKEHNEVGWLLLSSLDKVMKENDELRDSNCQLQKQI LSLKSSKIALNESLISCRERAEIVEKQTQALITCMADLQ >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_2|300_bp atgcatttgacacccctgagaggcaaggagtttagtgactctatacatcatacctttgac tatatgtggaggaccaaggaacataatgaagttggttggttgctcctaagttcactggac aaagtgatgaaagaaaatgatgaactcagggattctaactgccagcttcagaagcagatc ctgagcctcaaatcttctaagattgccctgaatgagagtcttatctcctgtagagaaaga gctgaaattgtagaaaaacagacacaagctcttatcacgtgcatggctgacctgcaatga >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_3|176_aa MTVDYRKFNQVVTPMAALYQMRFHCLSKLTHFLVPGMQPLTWQMPFSPFLSMRPTRSNLP SLPQGYINFPALCHNLIRRELDFFLLLQDITLVHYIDDILLIGSSEQEVVNTLDLLIHKR SKEAEHTAASRIRVSCLPEQKSHEQTLPWEQVPSSLIGIFSDTGLLIRFEILTVDE >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_3|531_bp atgaccgtggattatcgtaagtttaaccaagtggtgactccaatggcagctctgtaccag atgcggtttcattgcttgagcaaattaacacatttcctggtacctggtatgcagccattg acttggcaaatgcctttttctccattcctgtccatgaggcccaccagaagcaatttgcct tcactacctcaggggtatatcaactttccagctttgtgtcataatcttattcggagagaa cttgatttctttttgcttctacaagatatcacactggtccattacattgatgacattctg ctgattggatccagtgagcaagaagtagtaaacacactggacttattgattcataagaga agtaaggaagcagagcacactgctgcttccaggattagagtgtcatgcttaccagagcag aaaagtcatgaacagacactgccgtgggaacaagtgcccagttccttaatcggcatcttc tctgacacaggactgttaattcggtttgaaatattgactgtggatgagtga >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_4|87_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRGRGAS SCGQDGEYCEATTNTPLRPKSSLVSLW >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_4|264_bp atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagagaggcagaggggcctca tcctgtggccaggatggggagtactgcgaagctaccaccaatactcccttaaggcccaag agctctttagtcagcttgtggtga >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_5|337_aa MGKKQSRKTGNSKKQSTSPPPKERSSSPATEQSWTENDFDELREGFRRSNYSKLQEEIQT KVKEVENFEKNLDECITRITNTDKCLKELMELKAKARELREEWRSLRSRCNQLEERVSVM EDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPHLHLIGVPESDGENGTKLENTLQDI IQENFPNLARQANIQIQEIQRTPQRHSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVT HKGKPIRLTADLSAETLQARRELGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDKQ MLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKL >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_5|1014_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcacctctcctcct ccaaaggaacgcagctcctcaccagcaacggaacaaagctggacagagaatgactttgac gagttgagagaaggcttcagacgatcaaactactccaagctacaggaggaaattcaaacc aaagtcaaagaagtcgaaaactttgaaaaaaatttagacgaatgtataactagaataacc aatacagacaagtgcttaaaggagctcatggagctgaaagccaaggctcgagaactacgt gaagaatggagaagcctcaggagtcgatgcaatcaactggaagaaagggtatcagtgatg gaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaaaga aatgaacaaagcctccaagaaatatgggactatgtgaaaagaccacatctacatctgatt ggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggatatt atccaggagaacttccccaatctggcaaggcaggccaacattcagattcaggaaatacag agaacgccacaaagacactcctcgagaagagcaactccaagacacataattgtcagattt accaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggttacc cacaaagggaagcccatcagactaacagcggatctctcggcagaaactctacaagccaga agagagttggggccaatattcaacattcttaaagaaaagaattttcaacccagaatttca tatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaa atgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcacta aacatggaaaggaacaaccggtaccagccactgcaaaatcatgccaaattgtaa >gi568815584f:93247269_93448117|GENSCAN_predicted_peptide_6|188_aa MTDSCAKLIFKRQEEDSPKKKSEWEQPEKKEEKEFKSGMQQLVSNDTEKSWSLSLEKPPL EPEEDRIIFQHRGQRRAAINVHPPPPSLSATPHPPQPQPPPPHQHNAKARVATIRTKRTS NCRIRSRKVRKSPPEKWVGFNRRPKASCPSPPGAARVDVGGETERREQAAAPGEMGKWAR PGEEYFHS >gi568815584f:93247269_93448117|GENSCAN_predicted_CDS_6|567_bp atgacagattcctgtgcgaagctaattttcaaaaggcaggaagaagacagccctaagaaa aaatctgagtgggaacagccagaaaagaaagaagaaaaggagtttaaatcaggaatgcag caactagtgtcaaatgatacagagaagtcatggtctctgagtcttgagaaacccccgctg gagcccgaggaagaccgtattatttttcagcaccggggacagcgccgcgctgccattaac gtccaccctcccccaccttctctctccgccaccccccacccgccccaacctcagccccca cccccgcaccagcacaatgccaaggcccgtgtcgctaccattcggacaaaacgcacatca aattgcagaattcggtcccggaaggttaggaaatcgcccccggagaaatgggtgggattt aataggcggccaaaggcttcctgtccgtcccctcctggagctgcccgcgttgatgtgggc ggggagaccgaaaggagagagcaggctgcggctcctggtgaaatgggaaaatgggctcga ccgggggaggagtatttccattcataa