GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:35:20

Sequence gi568815597f:84312426_84514865 : 202440 bp : 39.29% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  13212  13323  112  0  1   79   93   216 0.903  20.66
 1.02 Term +  13391  13444   54  1  0   76   54    41 0.437  -3.92
 1.03 PlyA +  14827  14832    6                               1.05

 2.02 PlyA -  15512  15507    6                               1.05
 2.01 Sngl -  25079  24684  396  2  0  100   48   152 0.601   8.41
 2.00 Prom -  29190  29151   40                              -6.55

 3.09 PlyA -  29502  29497    6                               1.05
 3.08 Term -  32951  32523  429  2  0   70   32   192 0.006   6.12
 3.07 Intr -  50732  50604  129  2  0   33   68    89 0.013   1.37
 3.06 Intr -  55315  55232   84  1  0   91   94    80 0.127   8.00
 3.05 Intr -  56351  56061  291  2  0   28   81   103 0.309   0.01
 3.04 Intr -  57602  57479  124  1  1  110  115   138 0.965  18.37
 3.03 Intr -  60512  60427   86  2  2  159   70    20 0.968   5.00
 3.02 Intr -  62523  62460   64  2  1   71   75    59 0.276   0.70
 3.01 Init -  80809  80754   56  1  2   59   98    46 0.358   3.51
 3.00 Prom -  81404  81365   40                              -5.55

 4.10 PlyA -  81542  81537    6                               1.05
 4.09 Term -  91093  91025   69  1  0   92   42   127 0.620   5.46
 4.08 Intr -  92013  91739  275  1  2   63   26   285 0.411  16.13
 4.07 Intr -  94183  94141   43  1  1  105  116    -2 0.418   1.19
 4.06 Intr -  99630  99554   77  1  2   87   91    88 0.908   7.32
 4.05 Intr - 100174 100057  118  1  1   30   65    82 0.867  -0.78
 4.04 Intr - 101562 101381  182  1  2   51   73   178 0.900  11.27
 4.03 Intr - 104720 104669   52  2  1   69  127    42 0.107   3.76
 4.02 Intr - 115467 115303  165  0  0   89   85   106 0.016   9.64
 4.01 Init - 121075 120953  123  1  0   96  -39   116 0.021  -0.08
 4.00 Prom - 128613 128574   40                              -3.65

 5.00 Prom + 129918 129957   40                              -4.05
 5.01 Sngl + 131685 132020  336  2  0   52   41   261 0.966  13.58
 5.02 PlyA + 132071 132076    6                               1.05

 6.00 Prom + 136339 136378   40                              -4.35
 6.01 Init + 143245 143318   74  0  2   87   92    21 0.509   2.99
 6.02 Intr + 144786 144851   66  0  0   88   56    85 0.345   2.40
 6.03 Intr + 150902 151072  171  2  0  107   -6   106 0.511   1.24
 6.04 Term + 151277 151544  268  2  1   18   49   226 0.386   5.68
 6.05 PlyA + 152593 152598    6                               1.05

 7.00 Prom + 159318 159357   40                              -5.35
 7.01 Init + 166857 167288  432  2  0   95   39   416 0.567  33.66
 7.02 Intr + 168531 168587   57  2  0   93   94    53 0.858   4.56
 7.03 Intr + 170490 170570   81  2  0   90   80   147 0.971  13.02
 7.04 Intr + 177208 177303   96  0  0  133   49    84 0.992   8.29
 7.05 Intr + 177894 178047  154  2  1   94    7   134 0.636   4.62
 7.06 Intr + 182948 183030   83  0  2   58  115    59 0.988   4.04
 7.07 Intr + 183457 183638  182  0  2   90   94    40 0.986   2.54
 7.08 Intr + 183819 183945  127  0  1   80   95    77 0.950   7.36
 7.09 Term + 185117 185212   96  1  0   44   43    42 0.276  -7.71
 7.10 PlyA + 185221 185226    6                               1.05

 8.03 PlyA - 185922 185917    6                               1.05
 8.02 Term - 186123 186070   54  0  0  125   48    43 0.028   0.58
 8.01 Init - 193666 193586   81  1  0   77   61   121 0.103   9.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  54214  54395  182  2  2   66   47   174 0.890   7.89
S.002 Init + 111611 111855  245  1  2   51   80   225 0.848  15.25
S.003 Term + 115293 115527  235  0  1   43   49   142 0.885   0.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_1|55_aa
XSMENGRPPDPADWAVMDVVNYFRTVGFEEQASAFQEQWAEGSEEEEDQRAGRNQ

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_1|168_bp
nnttccatggaaaatgggagaccacctgatcctgcagactgggccgtgatggatgtcgtc
aattatttccgaaccgtgggatttgaggagcaagctagtgcttttcaggaacagtgggca
gaagggagtgaggaggaagaggaccaaagagcagggaggaatcagtga

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_2|131_aa
MAERDQCRAWVVASEGASLKLWQLPCGVEPASAEKSRIEVWESLPRFQKMYGNAWMSRQK
FAARVGLSWRISARAVWREMWGGSSHIESLLRHHLVELREEGHHPPDPRMVDPLTACTVH
PEKPQTLNTSP

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_2|396_bp
atggctgaaagggaccaatgtagagcttgggtcgtggcttcagagggtgcaagcctcaag
ctttggcagcttccatgtggtgttgaacctgcgagtgcagagaagtcaagaattgaggtt
tgggaatctctgcctagatttcagaagatgtatggaaatgcctggatgtccaggcagaag
tttgctgcaagggtggggctttcatggagaatctctgctagggcagtgtggagggaaatg
tggggtgggagctcccatatagagtccctactgaggcaccacctagtggagctgagagaa
gagggccaccatcctccagaccccagaatggtagatccactgacagcttgcaccgtgcac
ccagaaaagccgcagacactcaataccagcccatga

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_3|420_aa
MTEVLEKGMVCAGAALGVGHVATLGNMYDYRLHFTDDKIKVPMNQVRPFSFSRKNKSLYR
YQLRQIYPLDTIRDLVMEKSAGPYDKGEYLTSVQKTLCDIQVLSLSRVPAAFLWLCLPPL
GEESVGPQQHTLQRPMHSPEFPVQIALRLLRERGQISSRDLSSRGRIMLLVFTLRPESRQ
RGRCLQEVWMALDMQAGVSTPVSPREKIEDMEISLPNIHYFNIDMSKMGLINKEEYFHGG
IVQVVPSPRAAGEAVSMAFNGAGCHVGLVVLVAKILPEFKTFSNTNSVFHKHQNTPASYE
NLPVKEHSSNFLPMMWRNLMKQSFSASFFPLTSSSTFFVTTLLPFTVWLQMVTSLKFLSL
LLTKGSNVSEVEVLALEPRDKKVENPVSNMGATSKMEEWNLLALPTRPREKSSPENPENA

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_3|1263_bp
atgactgaagtgctagaaaaagggatggtttgtgcaggggcagctcttggtgtgggacat
gtggcaactctaggaaatatgtatgattatcgtctccattttacagatgacaaaatcaag
gtacccatgaaccaagttcgtcccttctccttcagcagaaaaaacaaaagcctttataga
tatcaacttaggcagatctatcccttggacaccattcgggaccttgtcatggagaaatct
gctgggccctatgacaaaggtgaatacttgacctctgtgcagaagaccctctgtgatatc
caggtgctctccctgagccgagttcctgcggccttcctctggctgtgcctccctccacta
ggtgaagaatcagtggggccacagcaacatactctgcagaggcccatgcattccccagaa
tttcctgtccaaatagccctgagactacttcgggaacgtgggcaaatctcttctcgagat
ctgtcttccagagggcgaatcatgctattggtgttcacccttaggcctgagagtaggcaa
aggggcagatgtttgcaggaggtatggatggcacttgacatgcaagctggagtgtctacc
cctgtgagtccaagagagaagatagaagatatggaaatcagcctgccaaacattcactac
ttcaacatagacatgtccaaaatgggtctgatcaacaaggaagagtactttcatggagga
atcgtgcaagtagttccaagcccacgtgctgccggggaggcagtttccatggcattcaat
ggagcaggctgtcatgtggggcttgtggttttggtagccaagattctgcctgagttcaaa
acattctcaaacaccaattcagtctttcataagcatcagaacaccccagcctcatatgaa
aatctccctgttaaagaacacagcagcaacttcttaccaatgatgtggaggaatctcatg
aaacagtctttttctgcttcattcttccccctcacatcttctagcactttctttgtaaca
actctcttgccatttactgtctggctacaaatggtgaccagtctgaagtttctcagtctg
ctgcttacaaagggttcaaatgtctcagaggtagaggtacttgctctggagcccagagat
aaaaaagtggaaaatcctgtgtccaatatgggagccacatcaaagatggaggagtggaat
ctcctggctttacccacaaggccaagggaaaaatcatcacctgagaatcctgagaatgca
tag

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_4|367_aa
MVSFSDSAGAKPIKRGWQHRQQGASGNRALIWLLLCDLGQNSPDGKMVGLACLRSYWAEE
WTEVGCKSDFLDIGKQLACSMYQSLLGHQTLLHLTEVLQQGVTITKIYQHPYHGVTHGSG
KEFGEDTIHVSVIKDLELTEFDGLIEGSEKEEMGSDWVTMQNTRKSRFVGDDDASVQQQN
YTACCLMMVFHTVKKRIRLCKMEEFLSLGRLKCDELTENLHHTPKIMKFVIDEIDIRTQN
QSLKTLHVAGFTKGKAMKAIKPKTIHSCWKNLCSDVEHDFTGFMTESIKKVMKEIVDMAR
KVGGDGFQDMIPGEIQKLIDTTLEKLTEHNSMEPEPDDEQEDIEELTQQEDDKDEDTYDD
PFPLNEE

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_4|1104_bp
atggtgtccttctcagactcagctggagcaaagcctatcaagcgagggtggcagcatcgt
caacagggagcatcgggcaaccgggctctcatctggcttctgctctgtgatctcggacag
aacagccctgatgggaagatggttggactggcctgcctccgttcctactgggccgaggaa
tggactgaggttggctgcaagtctgactttttggatatagggaaacaactggcttgcagc
atgtaccagtctctccttggtcaccagacactgctccatctcacagaggttctacaacaa
ggagttaccatcaccaaaatataccaacacccctaccacggtgtgacccatggcagtggc
aaagagtttggagaagatacaatacatgtgagtgttattaaggatctggaactgactgaa
tttgatggtttgatagaagggtcagaaaaagaagagatggggagtgactgggtaactatg
cagaatacaagaaaaagtagatttgtaggggatgatgacgcatcagttcaacagcagaat
tatacagcatgttgtttgatgatggtctttcataccgtcaagaaaagaatccgactttgc
aaaatggaggaatttttgtccctgggccgactgaagtgtgatgaattaactgagaattta
catcacacacccaagattatgaagtttgtaattgatgaaattgatattcgaacccagaac
caatcactcaagacattgcatgtggcaggttttactaaaggaaaagccatgaaagccatc
aagcccaaaacaatacattcctgctggaaaaatctgtgttcagatgttgagcatgacttc
acaggatttatgacagagtcaatcaagaaagtcatgaaagagattgtggatatggcaaga
aaggttgggggtgatggatttcaagatatgattcctggagaaattcaaaagctaatagac
accacactagagaaattaacagaacataactcaatggaaccagagccagatgatgagcaa
gaagacatagaagagctgactcaacaggaagatgacaaggatgaagacacttatgatgat
ccatttccacttaatgaagagtga

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_5|111_aa
MRKNQYKKAENSKNQNTSSPKDHNSSSAREQTWTESEFDEFTEVGFRRWVITNSSELKEH
VLTQCKEVKTLEKRLEELLTRITSLEKNINDLMELKNTARELSEAYTSINS

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_5|336_bp
atgaggaaaaaccagtacaaaaaggctgaaaattccaaaaaccagaacacctcttctcca
aaggatcacaactcctcatcagcaagggaacaaacctggacagagagtgagtttgacgaa
ttcacagaagtaggcttcagaaggtgggtaataacaaactcctccgagctaaaggagcat
gttctaacccaatgcaaggaagttaagacccttgaaaaaaggttagaggaattgctaact
agaataaccagtttagagaagaacataaatgacctgatggagctgaaaaacacagcacga
gaacttagtgaagcatacacaagtatcaacagctga

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_6|192_aa
MNKNTKWSYRVQNISYLHHEDSGKRLVAKGEEKESEEGLQVLRAKQIIAIKAPKLKTIYI
KEPPSFKENQWATERQQNWRSSSKHQSPILSSSLLRYLHIMCCTRKTSIGEQKEVAPVEM
LLGLEQDAWIFQKTLKKKAKLEVNCWYLPKGAVHQVMGPQHQEAREHIAKEGVRRGGLSR
IRELCRGPCMAS

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_6|579_bp
atgaacaagaatacaaagtggtcctatagggtccagaatataagttacctccatcatgag
gatagtggtaagaggttggtggctaagggagaagagaaggagagtgaagaaggtctacag
gttctgagagcaaagcaaataattgccatcaaagctccaaagctgaagacaatatacatc
aaggaacccccatcatttaaagagaaccagtgggctactgagaggcaacagaactggcgt
tcctcctccaagcatcagtctcccatcctttcctcctccctactgcgttatctacatatt
atgtgttgtacgagaaaaacaagtattggagaacaaaaagaagtggcaccagttgagatg
ctacttggattggagcaagatgcttggattttccagaagacactgaagaagaaagccaag
cttgaagttaattgctggtacctgcccaagggggctgtgcaccaagtgatgggaccccaa
caccaagaggctagggagcacattgctaaggaaggtgtcagaagaggtggcctgtcccgc
ataagggaactgtgcagaggcccttgtatggcttcctga

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_7|435_aa
MAKAGDKSSSSGKKSLKRKAAAEELQEAAGAGDGATENGVQPPKAAAFPPGFSISEIKNK
QRRHLMFTRWKQQQRKVRERRGLPGACALFLTLRAVAGRTSVVVCWSQKCSLWLRGKGGV
AAHVFWSPRPSLETLGSAFCLSVKEKLAAKKKLKKEREALGDKAPPKPVPKTIDNQRVYD
ETTVDPNDEEVAYDEATDEFASYFNKQTSPKILITTSDRPHGRTVRLCEQLSTVIPNSHV
YYRRGLALKKIIPQCIARDFTDLIVINEDRKTPNGLILSHLPNGPTAHFKMSSVRLRKEI
KRRGKDPTEHIPEIILNNFTTRLGHSIGRMFASLFPHNPQFIGRQVATFHNQRDYIFFRF
HRYIFRSEKKVGIQELGPRFTLKLRSLQKGTFDSKYGEYEWVHKNTLFKMAFADFINLSR
LDELPNAMNCHCVFM

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_7|1308_bp
atggcgaaagccggggataagagcagcagcagcgggaagaaaagtctaaaacggaaagcc
gctgccgaagaacttcaggaggctgcaggcgctggggatggggcgacggaaaacggggtc
caacccccgaaagcggctgcctttccgccaggctttagcatttcggagattaaaaacaaa
cagcggcgacacttaatgttcacgcggtggaaacagcagcagcggaaggtacgcgagagg
cgggggctgccgggcgcttgcgcgttgttcctgacgcttagggcggtcgcggggcgcaca
tctgtggttgtctgctggtctcaaaagtgttctctgtggctccgtgggaagggtggcgtc
gcggcccacgtgttctggtcccctcgccctagtttggagactctaggttcagccttttgc
ctgagcgtcaaggaaaagttggcagctaagaaaaaacttaaaaaagaaagagaggctctt
ggcgataaggctccaccaaagcctgtacccaagaccattgacaaccagcgagtgtatgat
gaaaccacagtagaccctaatgatgaagaggtcgcttatgatgaagctacagatgaattt
gcttcttacttcaacaaacagacttctcccaagattctcatcacaacatcagatagacct
catgggagaacagtacgactctgtgaacagctctccacagttataccaaactcacatgtt
tattacagaagaggactggctctgaaaaaaattattccacagtgcatcgcaagagatttc
acagacctgattgttattaatgaagatcgtaaaaccccaaatggacttattttgagtcac
ttgccaaatggcccaactgctcattttaaaatgagcagtgttcgtcttcgtaaagaaatt
aagagaagaggcaaggaccccacagaacacatacctgaaataattctgaataattttaca
acacggctgggtcattcaattggacgtatgtttgcatctctctttcctcataatcctcaa
tttatcggaaggcaggttgccacattccacaatcaacgggattacatattcttcagattt
cacagatacatattcaggagtgaaaagaaagtgggaattcaggaacttggaccacgtttt
accttaaaattaaggtctcttcagaaaggaacctttgattctaaatatggagagtatgaa
tgggtccataagaatactcttttcaaaatggcatttgctgatttcataaacctttcacgt
ctggacgaattaccaaatgccatgaattgccactgtgtgtttatgtag

>gi568815597f:84312426_84514865|GENSCAN_predicted_peptide_8|44_aa
MSGSSSVAAMKKVVQQLRLEAGLNRVKVSQTTPYDPVNIQESYI

>gi568815597f:84312426_84514865|GENSCAN_predicted_CDS_8|135_bp
atgtctggctcctccagcgtcgccgctatgaagaaagtggttcaacagctccggctggag
gccggactcaaccgcgtaaaagtttcccaaaccactccttatgatccagtgaatattcaa
gagagctacatttga