GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:19:09 Sequence gi568815591f:121229293_121439342 : 210050 bp : 37.90% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 983 1115 133 1 1 81 97 55 0.502 6.05 1.02 Intr + 8724 8804 81 1 0 47 83 78 0.413 1.89 1.03 Intr + 9657 9744 88 1 1 70 61 83 0.304 1.91 1.04 Intr + 14935 15046 112 1 1 30 100 70 0.236 1.86 1.05 Term + 18975 19106 132 2 0 57 49 129 0.430 3.01 1.06 PlyA + 19173 19178 6 1.05 2.00 Prom + 23698 23737 40 -3.65 2.01 Init + 27872 28042 171 1 0 69 48 110 0.247 4.59 2.02 Intr + 36935 37155 221 1 2 82 82 97 0.467 4.68 2.03 Intr + 37415 37516 102 2 0 55 84 61 0.403 0.77 2.04 Term + 47015 47171 157 2 1 78 41 60 0.033 -3.28 2.05 PlyA + 47188 47193 6 1.05 3.03 PlyA - 49706 49701 6 1.05 3.02 Term - 55769 55343 427 1 1 12 42 208 0.115 2.39 3.01 Init - 63351 63206 146 0 2 78 98 150 0.596 14.64 3.00 Prom - 67718 67679 40 -5.15 4.00 Prom + 86388 86427 40 -4.45 4.01 Init + 86616 86673 58 2 1 42 131 23 0.970 3.52 4.02 Intr + 86804 87066 263 0 2 82 110 144 0.941 12.28 4.03 Term + 88541 88735 195 1 0 59 40 148 0.452 3.53 4.04 PlyA + 91160 91165 6 1.05 5.00 Prom + 91862 91901 40 -4.45 5.01 Init + 100001 100095 95 1 2 92 76 249 0.986 22.00 5.02 Intr + 100275 100525 251 0 2 87 101 293 0.983 26.66 5.03 Intr + 102386 102672 287 0 2 72 91 165 0.031 11.34 5.04 Intr + 109589 109979 391 1 1 59 84 223 0.301 12.37 5.05 Term + 117729 117856 128 1 2 68 38 123 0.493 2.76 5.06 PlyA + 119066 119071 6 1.05 6.07 PlyA - 119587 119582 6 1.05 6.06 Term - 121258 121169 90 1 0 47 39 74 0.123 -4.86 6.05 Intr - 121977 121851 127 2 1 80 92 37 0.189 3.06 6.04 Intr - 130835 130751 85 0 1 119 55 76 0.653 5.36 6.03 Intr - 133655 133605 51 0 0 93 99 30 0.718 2.56 6.02 Intr - 134896 134838 59 0 2 96 95 33 0.747 2.41 6.01 Init - 142063 142008 56 1 2 47 75 66 0.554 2.01 6.00 Prom - 142509 142470 40 -5.85 7.00 Prom + 142691 142730 40 -5.05 7.01 Init + 151106 151303 198 1 0 85 79 102 0.733 7.95 7.02 Term + 169135 169404 270 0 0 60 39 266 0.786 13.40 7.03 PlyA + 170063 170068 6 1.05 8.05 PlyA - 172766 172761 6 1.05 8.04 Term - 191697 191510 188 1 2 37 48 167 0.451 4.27 8.03 Intr - 192936 192631 306 1 0 43 68 187 0.099 7.70 8.02 Intr - 200910 200815 96 1 0 20 66 124 0.049 2.46 8.01 Init - 202583 202517 67 2 1 21 116 37 0.747 1.09 8.00 Prom - 204995 204956 40 -5.35 9.00 Prom + 205273 205312 40 -5.45 9.01 Sngl + 209362 209736 375 0 0 71 48 219 0.713 10.14 9.02 PlyA + 209846 209851 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100839 101005 167 1 2 61 48 163 0.952 6.70 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_1|181_aa MKRRETWKTEMSGLGTESSKLNSIEVIGDGTDFGMIFKVQFQLFDDNLREIIKQLYSSYW TKTKVLQQKVQEDYEQPGHNHDNQLLHPQAPKGPKTQKLNNCSDNRTCDWREITWQPHNC QYGVLTKPQLQQCLGGRKDQPGKNVAYLLAGASVQGAPQLGSPNKRNADTVPVIKGGSPK M >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_1|546_bp atgaagagaagggaaacttggaaaactgaaatgtctggcttgggcaccgaatcttccaag ctcaacagtattgaggttattggggatggaacagattttggaatgatttttaaagttcaa tttcagctgtttgatgacaatttacgagaaatcattaaacagttgtattcctcctactgg acaaagacaaaagttcttcaacaaaaagttcaagaagactatgagcagccagggcacaat cacgacaaccaactgctccatcctcaagctccaaaaggcccaaagactcaaaaacttaac aattgttcggacaacaggacgtgtgactggagagaaataacctggcagccccacaactgc caatatggtgtcctaactaagcctcaactccagcagtgcctgggaggaaggaaggaccag cctggaaagaatgtggcctatctccttgccggagcctctgttcagggagccccacagctt ggatcacctaacaaaagaaatgcagacacagtaccagtgattaaagggggctccccaaag atgtag >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_2|216_aa MQYTHAANLHTYAISAIKVEVWGKKDNRRAMNQAQDPSKHVALQDRPGHMPMKPAWVILF IGDSTNRGIMYYLIERLNETLQEWQKVHGTKFYHNVNGGKTLISYSYYPQFWISPSLRPT FENALEHLLQRSRPLENTGQTVLVVGGVQWLNSNHLQIIHKVLKSPFTTLNQPVTKSCLQ AIYFPRLSPTLHSNCLDLVYSFTKSFNIYFVVQFLN >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_2|651_bp atgcaatatactcatgcagcaaacctgcacacatacgccatatctgcaataaaagttgaa gtttggggaaaaaaagacaatagaagagcaatgaaccaagcacaagacccttcaaagcat gtggccctgcaagaccgtccaggtcacatgcccatgaagccagcttgggtgattctgttc attggagattcaaccaacagagggatcatgtactatcttattgaaaggctgaatgaaacg ttgcaggaatggcagaaagtacatggcactaaattctatcacaacgtcaatggtgggaag actttgatcagttattcctactatccccagttctggataagcccttcattgagaccaaca tttgaaaatgcacttgaacacctcttgcaaagatcacgtcccctagagaatactggccag actgtattggttgttggtggtgttcagtggcttaattccaatcacctgcaaattattcac aaagttttgaagagccctttcacgacccttaaccagccagttaccaaatcctgtttacaa gctatatattttccacgtctttctcctacactacattccaactgccttgaccttgtttat tcttttactaaatcattcaacatttattttgtggtccagttcttgaactaa >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_3|190_aa MRKSQHKKAENSKNQNASSPPNDRNSSPAREQNWMENQFDKLTEVGFRRFQRMYGNAWMS RQKFAAGAETSWRSSARTVQKGNVGSEPPHSDPTGALLNEAVRRRALSSKPQNGRSTNSL HCVPGKATDTQRHPVKAAGREAVPCKATGELPKAGGDHLLHQHDLDVRPSVKGDHFVTLR FNDCPAGFWV >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_3|573_bp atgaggaaaagccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaacgatcgcaactcctcgccagcaagggaacaaaactggatggagaatcagtttgac aaattgacagaagtaggcttcagaagatttcagaggatgtatggaaacgcctggatgtcc aggcagaagtttgctgcaggggcggaaacttcatggagaagctctgctaggacagtgcag aagggaaatgtggggtcagagcccccacacagtgaccccactggggcactgctgaatgaa gctgtgagaagaagggcactgtcctccaaaccccagaatggtagatccaccaacagcttg cactgtgtgcctggaaaagccacagacactcaacgccatcctgtaaaagcagctgggagg gaggctgtaccctgcaaagccacaggggagctgcccaaggctgggggagaccacctcttg catcagcatgacttggatgtgagacctagtgtcaaaggagatcattttgtaactttaagg tttaatgattgccccgctggattttgggtttga >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_4|171_aa MTSGPQTDQPKKHLTNFKSETKETRFICGPKTPAPVTDWEGSLLLVFNHCRDTSLIIHPR FKGVRPRRDACLSPSPLAASPAFLGKGQVPQPLLSLSLPLPCFSGDRELATSARNLSDHQ AKECLQPRIPPKPCPIFAGPHWKSDCSTHLADTPRAPGTLAQGFLTDSFSA >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_4|516_bp atgacctcaggtcctcagaccgaccagcccaagaaacatctcaccaatttcaaatccgag acaaaagagacacgttttatctgtggacccaaaactccggcaccggtcacggactgggaa ggcagccttctcttggtgtttaatcattgcagggacacctctctgattatacacccacgt ttcaagggtgtcagaccacgcagggacgcctgcctcagtccttcacccttagcggcaagt cccgcttttctggggaaggggcaagtaccccaaccccttctctccttgtctctacccctt ccctgcttttccggggacagggagcttgctacaagtgccagaaatctatctgaccaccag gccaaggaatgcctgcagcccaggattcctcctaagccgtgtcccatctttgcgggaccc cactggaaatcggactgttcaactcacctggcagacactcccagagcccctggaactctg gcccaaggctttctgaccgactccttctcggcttag >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_5|383_aa MDRAALLGLARLCALWAALLVLFPYGAQGNWMWLGIASFGVPEKLGCANLPLNSRQKELC KRKPYLLPSIREGARLGIQECGSQFRHERWNCMITAAATTAPMGASPLFGYELSSGTKET AFIYAVMAAGLVHSVTRSCSAGNMTECSCDTTLQNGGSASEGWHWGGCSDDVQYGMWFSR KFLDFPIGNTTGKENKVLLAMNLHNNEAGRQAVAKLMSVDCRCHGVSGSCAVKTCWKTMS SFEKIGHLLKDKYENSIQISDKTKRKMRRREKDQRKIPIHKDDLLYVNKSPNYCVEDKKL GIPGTQGRECNRTSEGADGCNLLCCGRGYNTHVVRHVERCECFSDPFLFGYQVKEIWADY TEHYELAGYKSIFPDQREYLLSI >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_5|1152_bp atggacagggcggcgctcctgggactggcccgcttgtgcgcgctgtgggcagccctgctc gtgctgttcccctacggagcccaaggaaactggatgtggttgggcattgcctccttcggg gttccagagaagctgggctgcgccaatttgccgctgaacagccgccagaaggagctgtgc aagaggaaaccgtacctgctgccgagcatccgagagggcgcccggctgggcattcaggag tgcgggagccagttcagacacgagagatggaactgcatgatcaccgccgccgccactacc gccccgatgggcgccagccccctctttggctacgagctgagcagcggcaccaaagagaca gcatttatttatgctgtgatggctgcaggcctggtgcattctgtgaccaggtcatgcagt gcaggcaacatgacagagtgttcctgtgacaccaccttgcagaacggcggctcagcaagt gaaggctggcactgggggggctgctccgatgatgtccagtatggcatgtggttcagcaga aagttcctagatttccccatcggaaacaccacgggcaaagaaaacaaagtactattagca atgaacctacataacaatgaagctggaaggcaggctgtcgccaagttgatgtcagtagac tgccgctgccacggagtttccggctcctgtgctgtgaaaacatgctggaaaaccatgtct tcttttgaaaagattggccatttgttgaaggataaatatgaaaacagtatccagatatca gacaaaacaaagaggaaaatgcgcaggagagaaaaagatcagaggaaaataccaatccat aaggatgatctgctctatgttaataagtctcccaactactgtgtagaagataagaaactg ggaatcccagggacacaaggcagagaatgcaaccgtacatcagagggtgcagatggctgc aacctcctctgctgtggccgaggttacaacacccatgtggtcaggcacgtggagaggtgt gagtgtttttcagacccttttctctttggctatcaagtgaaggaaatatgggctgactat acggaacattacgagcttgcaggttataagtcaattttccctgaccaaagggagtatctt ttatcaatttag >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_6|155_aa MASGAANVVGPKICLEDNVLMSGVKNNVGRGINVALANGKTGEVLDTKYFDMWGGDVAPF IEFLKAIQDGTIVLMGTYDDGATKLNDEARRLIADLGSTSITNLGFRDNWVFCGGKGIKT KSPFEQHIKNNKDTNKYEGWPEVVEMEGCIPQKQD >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_6|468_bp atggcaagtggagcagccaacgtggtgggacccaaaatctgcctggaagataatgtttta atgagtggtgttaagaataatgttggaagagggatcaatgttgccttggcaaatggaaaa acaggagaagtattagacactaaatattttgacatgtggggaggagatgtggcaccattt attgagtttctgaaggccatacaagatggaacaatagttttaatgggaacatacgatgat ggagcaaccaaactcaatgatgaggcacggcggctcattgctgatttggggagcacatct attactaatcttggttttagagacaactgggtcttctgtggtgggaagggcattaagaca aaaagcccttttgaacagcacataaagaacaataaggatacaaacaaatatgaaggatgg cctgaagttgtagaaatggaaggatgcatcccccagaagcaagactaa >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_7|155_aa MDEAGGHYTKQTNTGTENQIVHVLIHKQKLNTEKGTYKHKEGKNRLWGLLEGGGWEENED QKTTYQHKQQKRGLNMDDVEKGKKICIQKCAQCHTVEKAGKHRTGPNLHGLFGWKTGQAV GLFYTDAIKNKGITWGEDTLMEYLENPKKHISGTK >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_7|468_bp atggatgaagctggaggccattacactaagcaaactaacacaggaacagaaaaccaaata gtgcatgttctcattcataagcaaaagctaaacactgagaagggaacatataaacataaa gaagggaaaaacagactctggggcctacttgagggtggagggtgggaagagaatgaggat caaaaaactacctatcagcacaagcaacaaaagagaggattaaatatggatgatgttgag aaaggcaagaagatttgtattcagaagtgtgcccagtgccacactgtggaaaaggcaggc aagcacaggactgggcctaatctccatggtctcttcggatggaagacaggtcaagccgtt ggattattttacacagatgccattaagaacaaaggcatcacctggggagaggatacactg atggagtatttggagaatcccaagaagcacatctctggaacaaagtga >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_8|218_aa MCACWLEYPEKQRERETLVLQQDQGPILKREEEEEEEEERRKKKEQRRKKKYNNARHKGS PSPHQSQEPSWLHPVDPALGPQVELPASPAPCTHTPQPLGGRWDWAPWSRGRCLSGRLRP HRSPRKRGGSGMAGCSSQALPHGEAAKGRQEIERSAAQLRTEVLKRDTTTTLKASSKFFN GPIFDISYTKCHQAPEESTRRKSEIQANTNKEGDSAKA >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_8|657_bp atgtgtgcatgctggttggaatacccagagaaacagagagagagagagacactggtattg caacaagaccaaggcccaatcttgaaaagagaagaagaagaggaagaggaagaagaaaga agaaagaagaaagaacaaagaagaaagaagaaatacaataatgctagacataaaggttct ccaagtcctcaccagagtcaggagcccagctggcttcacccagtggatcccgcactgggg ccacaggtggagctgcctgccagtcccgcgccatgcacccacactcctcagcccttgggt ggtcgatgggactgggcaccgtggagcagggggcggtgcttgtccgggaggctcaggcca caccggagcccacggaagcggggaggctcaggcatggctggctgcagttcccaagccctg ccccatggggaggcagctaagggccggcaagaaattgagcgcagcgccgctcaacttaga acagaagtacttaaacgggacactacaactacacttaaagcttccagtaagtttttcaat ggtcccatctttgacataagctacaccaagtgtcaccaggcacctgaggagagcactaga agaaagtcagaaatccaagcaaatacaaacaaagaaggggacagtgcaaaagcatga >gi568815591f:121229293_121439342|GENSCAN_predicted_peptide_9|124_aa MAGAPPPPSLPPCSSISVCCASNERGFVGVGPMPSLTSPIQHSVGSSGQGNQAGERNKDI QLGKEEVKLSLFADDMIVYLENPIVSVQNLLKLISNLGKVSGYTINVEKSQAFLYTNNRQ RAKS >gi568815591f:121229293_121439342|GENSCAN_predicted_CDS_9|375_bp atggcaggcgcccctcccccaccctcgctgccgccttgcagttccatctcagtctgctgt gctagcaatgagcgaggcttcgtgggtgtgggacccatgccctctctgacctctcctatt caacatagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagatatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatattta gaaaaccccatcgtttcagtccaaaatctccttaagctgataagcaacttaggcaaagtc tcaggatacacaatcaatgtggaaaaatcacaagcattcttatacaccaataacagacag agagccaaatcatga