GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:16:22 Sequence gi568815591r:121250464_121479016 : 228553 bp : 37.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 192 422 231 1 0 56 82 107 0.227 3.72 1.02 Intr + 6720 6871 152 1 2 40 48 104 0.248 0.56 1.03 Intr + 15764 15984 221 1 2 82 82 97 0.464 4.68 1.04 Intr + 16244 16345 102 2 0 55 84 61 0.401 0.77 1.05 Term + 25844 26000 157 2 1 78 41 60 0.033 -3.28 1.06 PlyA + 26017 26022 6 1.05 2.03 PlyA - 28535 28530 6 1.05 2.02 Term - 34598 34172 427 1 1 12 42 208 0.115 2.39 2.01 Init - 42180 42035 146 0 2 78 98 150 0.596 14.64 2.00 Prom - 46547 46508 40 -5.15 3.00 Prom + 65217 65256 40 -4.45 3.01 Init + 65445 65502 58 2 1 42 131 23 0.970 3.52 3.02 Intr + 65633 65895 263 0 2 82 110 144 0.941 12.28 3.03 Term + 67370 67564 195 1 0 59 40 148 0.452 3.53 3.04 PlyA + 69989 69994 6 1.05 4.00 Prom + 70691 70730 40 -4.45 4.01 Init + 78830 78924 95 1 2 92 76 249 0.986 22.00 4.02 Intr + 79104 79354 251 0 2 87 101 293 0.983 26.66 4.03 Intr + 81215 81501 287 0 2 72 91 165 0.031 11.34 4.04 Intr + 88418 88808 391 1 1 59 84 223 0.301 12.37 4.05 Term + 96558 96685 128 1 2 68 38 123 0.493 2.76 4.06 PlyA + 97895 97900 6 1.05 5.07 PlyA - 98416 98411 6 1.05 5.06 Term - 100087 99998 90 1 0 47 39 74 0.123 -4.86 5.05 Intr - 100806 100680 127 2 1 80 92 37 0.189 3.06 5.04 Intr - 109664 109580 85 0 1 119 55 76 0.653 5.36 5.03 Intr - 112484 112434 51 0 0 93 99 30 0.718 2.56 5.02 Intr - 113725 113667 59 0 2 96 95 33 0.747 2.41 5.01 Init - 120892 120837 56 1 2 47 75 66 0.554 2.01 5.00 Prom - 121338 121299 40 -5.85 6.00 Prom + 121520 121559 40 -5.05 6.01 Init + 129935 130132 198 1 0 85 79 102 0.733 7.95 6.02 Term + 147964 148233 270 0 0 60 39 266 0.786 13.40 6.03 PlyA + 148892 148897 6 1.05 7.05 PlyA - 151595 151590 6 1.05 7.04 Term - 170526 170339 188 1 2 37 48 167 0.451 4.27 7.03 Intr - 171765 171460 306 1 0 43 68 187 0.099 7.70 7.02 Intr - 179739 179644 96 1 0 20 66 124 0.048 2.46 7.01 Init - 181412 181346 67 2 1 21 116 37 0.789 1.09 7.00 Prom - 183824 183785 40 -5.35 8.00 Prom + 184102 184141 40 -5.45 8.01 Sngl + 188191 188565 375 0 0 71 48 219 0.948 10.14 8.02 PlyA + 188675 188680 6 1.05 9.02 PlyA - 189888 189883 6 1.05 9.01 Sngl - 190730 190371 360 2 0 75 40 303 0.770 20.02 9.00 Prom - 214937 214898 40 -3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 79668 79834 167 1 2 61 48 163 0.952 6.70 S.002 Term + 193019 193453 435 1 0 2 39 321 0.873 12.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_1|287_aa XIQTTIREYYKHLYANKLENLEEMDKFLNTYTLPRLNQEEVESLNRPITGSEIEAIINSL PTKKSPRQMDSQQNSTQANLHTYAISAIKVEVWGKKDNRRAMNQAQDPSKHVALQDRPGH MPMKPAWVILFIGDSTNRGIMYYLIERLNETLQEWQKVHGTKFYHNVNGGKTLISYSYYP QFWISPSLRPTFENALEHLLQRSRPLENTGQTVLVVGGVQWLNSNHLQIIHKVLKSPFTT LNQPVTKSCLQAIYFPRLSPTLHSNCLDLVYSFTKSFNIYFVVQFLN >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_1|864_bp naaatacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaat ctagaagaaatggataaattcctcaacacatacaccctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggatctgaaattgaggcaataattaatagctta ccaaccaaaaaaagtccaagacagatggattcacagcaaaattctacccaagcaaacctg cacacatacgccatatctgcaataaaagttgaagtttggggaaaaaaagacaatagaaga gcaatgaaccaagcacaagacccttcaaagcatgtggccctgcaagaccgtccaggtcac atgcccatgaagccagcttgggtgattctgttcattggagattcaaccaacagagggatc atgtactatcttattgaaaggctgaatgaaacgttgcaggaatggcagaaagtacatggc actaaattctatcacaacgtcaatggtgggaagactttgatcagttattcctactatccc cagttctggataagcccttcattgagaccaacatttgaaaatgcacttgaacacctcttg caaagatcacgtcccctagagaatactggccagactgtattggttgttggtggtgttcag tggcttaattccaatcacctgcaaattattcacaaagttttgaagagccctttcacgacc cttaaccagccagttaccaaatcctgtttacaagctatatattttccacgtctttctcct acactacattccaactgccttgaccttgtttattcttttactaaatcattcaacatttat tttgtggtccagttcttgaactaa >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_2|190_aa MRKSQHKKAENSKNQNASSPPNDRNSSPAREQNWMENQFDKLTEVGFRRFQRMYGNAWMS RQKFAAGAETSWRSSARTVQKGNVGSEPPHSDPTGALLNEAVRRRALSSKPQNGRSTNSL HCVPGKATDTQRHPVKAAGREAVPCKATGELPKAGGDHLLHQHDLDVRPSVKGDHFVTLR FNDCPAGFWV >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_2|573_bp atgaggaaaagccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaacgatcgcaactcctcgccagcaagggaacaaaactggatggagaatcagtttgac aaattgacagaagtaggcttcagaagatttcagaggatgtatggaaacgcctggatgtcc aggcagaagtttgctgcaggggcggaaacttcatggagaagctctgctaggacagtgcag aagggaaatgtggggtcagagcccccacacagtgaccccactggggcactgctgaatgaa gctgtgagaagaagggcactgtcctccaaaccccagaatggtagatccaccaacagcttg cactgtgtgcctggaaaagccacagacactcaacgccatcctgtaaaagcagctgggagg gaggctgtaccctgcaaagccacaggggagctgcccaaggctgggggagaccacctcttg catcagcatgacttggatgtgagacctagtgtcaaaggagatcattttgtaactttaagg tttaatgattgccccgctggattttgggtttga >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_3|171_aa MTSGPQTDQPKKHLTNFKSETKETRFICGPKTPAPVTDWEGSLLLVFNHCRDTSLIIHPR FKGVRPRRDACLSPSPLAASPAFLGKGQVPQPLLSLSLPLPCFSGDRELATSARNLSDHQ AKECLQPRIPPKPCPIFAGPHWKSDCSTHLADTPRAPGTLAQGFLTDSFSA >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_3|516_bp atgacctcaggtcctcagaccgaccagcccaagaaacatctcaccaatttcaaatccgag acaaaagagacacgttttatctgtggacccaaaactccggcaccggtcacggactgggaa ggcagccttctcttggtgtttaatcattgcagggacacctctctgattatacacccacgt ttcaagggtgtcagaccacgcagggacgcctgcctcagtccttcacccttagcggcaagt cccgcttttctggggaaggggcaagtaccccaaccccttctctccttgtctctacccctt ccctgcttttccggggacagggagcttgctacaagtgccagaaatctatctgaccaccag gccaaggaatgcctgcagcccaggattcctcctaagccgtgtcccatctttgcgggaccc cactggaaatcggactgttcaactcacctggcagacactcccagagcccctggaactctg gcccaaggctttctgaccgactccttctcggcttag >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_4|383_aa MDRAALLGLARLCALWAALLVLFPYGAQGNWMWLGIASFGVPEKLGCANLPLNSRQKELC KRKPYLLPSIREGARLGIQECGSQFRHERWNCMITAAATTAPMGASPLFGYELSSGTKET AFIYAVMAAGLVHSVTRSCSAGNMTECSCDTTLQNGGSASEGWHWGGCSDDVQYGMWFSR KFLDFPIGNTTGKENKVLLAMNLHNNEAGRQAVAKLMSVDCRCHGVSGSCAVKTCWKTMS SFEKIGHLLKDKYENSIQISDKTKRKMRRREKDQRKIPIHKDDLLYVNKSPNYCVEDKKL GIPGTQGRECNRTSEGADGCNLLCCGRGYNTHVVRHVERCECFSDPFLFGYQVKEIWADY TEHYELAGYKSIFPDQREYLLSI >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_4|1152_bp atggacagggcggcgctcctgggactggcccgcttgtgcgcgctgtgggcagccctgctc gtgctgttcccctacggagcccaaggaaactggatgtggttgggcattgcctccttcggg gttccagagaagctgggctgcgccaatttgccgctgaacagccgccagaaggagctgtgc aagaggaaaccgtacctgctgccgagcatccgagagggcgcccggctgggcattcaggag tgcgggagccagttcagacacgagagatggaactgcatgatcaccgccgccgccactacc gccccgatgggcgccagccccctctttggctacgagctgagcagcggcaccaaagagaca gcatttatttatgctgtgatggctgcaggcctggtgcattctgtgaccaggtcatgcagt gcaggcaacatgacagagtgttcctgtgacaccaccttgcagaacggcggctcagcaagt gaaggctggcactgggggggctgctccgatgatgtccagtatggcatgtggttcagcaga aagttcctagatttccccatcggaaacaccacgggcaaagaaaacaaagtactattagca atgaacctacataacaatgaagctggaaggcaggctgtcgccaagttgatgtcagtagac tgccgctgccacggagtttccggctcctgtgctgtgaaaacatgctggaaaaccatgtct tcttttgaaaagattggccatttgttgaaggataaatatgaaaacagtatccagatatca gacaaaacaaagaggaaaatgcgcaggagagaaaaagatcagaggaaaataccaatccat aaggatgatctgctctatgttaataagtctcccaactactgtgtagaagataagaaactg ggaatcccagggacacaaggcagagaatgcaaccgtacatcagagggtgcagatggctgc aacctcctctgctgtggccgaggttacaacacccatgtggtcaggcacgtggagaggtgt gagtgtttttcagacccttttctctttggctatcaagtgaaggaaatatgggctgactat acggaacattacgagcttgcaggttataagtcaattttccctgaccaaagggagtatctt ttatcaatttag >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_5|155_aa MASGAANVVGPKICLEDNVLMSGVKNNVGRGINVALANGKTGEVLDTKYFDMWGGDVAPF IEFLKAIQDGTIVLMGTYDDGATKLNDEARRLIADLGSTSITNLGFRDNWVFCGGKGIKT KSPFEQHIKNNKDTNKYEGWPEVVEMEGCIPQKQD >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_5|468_bp atggcaagtggagcagccaacgtggtgggacccaaaatctgcctggaagataatgtttta atgagtggtgttaagaataatgttggaagagggatcaatgttgccttggcaaatggaaaa acaggagaagtattagacactaaatattttgacatgtggggaggagatgtggcaccattt attgagtttctgaaggccatacaagatggaacaatagttttaatgggaacatacgatgat ggagcaaccaaactcaatgatgaggcacggcggctcattgctgatttggggagcacatct attactaatcttggttttagagacaactgggtcttctgtggtgggaagggcattaagaca aaaagcccttttgaacagcacataaagaacaataaggatacaaacaaatatgaaggatgg cctgaagttgtagaaatggaaggatgcatcccccagaagcaagactaa >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_6|155_aa MDEAGGHYTKQTNTGTENQIVHVLIHKQKLNTEKGTYKHKEGKNRLWGLLEGGGWEENED QKTTYQHKQQKRGLNMDDVEKGKKICIQKCAQCHTVEKAGKHRTGPNLHGLFGWKTGQAV GLFYTDAIKNKGITWGEDTLMEYLENPKKHISGTK >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_6|468_bp atggatgaagctggaggccattacactaagcaaactaacacaggaacagaaaaccaaata gtgcatgttctcattcataagcaaaagctaaacactgagaagggaacatataaacataaa gaagggaaaaacagactctggggcctacttgagggtggagggtgggaagagaatgaggat caaaaaactacctatcagcacaagcaacaaaagagaggattaaatatggatgatgttgag aaaggcaagaagatttgtattcagaagtgtgcccagtgccacactgtggaaaaggcaggc aagcacaggactgggcctaatctccatggtctcttcggatggaagacaggtcaagccgtt ggattattttacacagatgccattaagaacaaaggcatcacctggggagaggatacactg atggagtatttggagaatcccaagaagcacatctctggaacaaagtga >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_7|218_aa MCACWLEYPEKQRERETLVLQQDQGPILKREEEEEEEEERRKKKEQRRKKKYNNARHKGS PSPHQSQEPSWLHPVDPALGPQVELPASPAPCTHTPQPLGGRWDWAPWSRGRCLSGRLRP HRSPRKRGGSGMAGCSSQALPHGEAAKGRQEIERSAAQLRTEVLKRDTTTTLKASSKFFN GPIFDISYTKCHQAPEESTRRKSEIQANTNKEGDSAKA >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_7|657_bp atgtgtgcatgctggttggaatacccagagaaacagagagagagagagacactggtattg caacaagaccaaggcccaatcttgaaaagagaagaagaagaggaagaggaagaagaaaga agaaagaagaaagaacaaagaagaaagaagaaatacaataatgctagacataaaggttct ccaagtcctcaccagagtcaggagcccagctggcttcacccagtggatcccgcactgggg ccacaggtggagctgcctgccagtcccgcgccatgcacccacactcctcagcccttgggt ggtcgatgggactgggcaccgtggagcagggggcggtgcttgtccgggaggctcaggcca caccggagcccacggaagcggggaggctcaggcatggctggctgcagttcccaagccctg ccccatggggaggcagctaagggccggcaagaaattgagcgcagcgccgctcaacttaga acagaagtacttaaacgggacactacaactacacttaaagcttccagtaagtttttcaat ggtcccatctttgacataagctacaccaagtgtcaccaggcacctgaggagagcactaga agaaagtcagaaatccaagcaaatacaaacaaagaaggggacagtgcaaaagcatga >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_8|124_aa MAGAPPPPSLPPCSSISVCCASNERGFVGVGPMPSLTSPIQHSVGSSGQGNQAGERNKDI QLGKEEVKLSLFADDMIVYLENPIVSVQNLLKLISNLGKVSGYTINVEKSQAFLYTNNRQ RAKS >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_8|375_bp atggcaggcgcccctcccccaccctcgctgccgccttgcagttccatctcagtctgctgt gctagcaatgagcgaggcttcgtgggtgtgggacccatgccctctctgacctctcctatt caacatagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaagatatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatattta gaaaaccccatcgtttcagtccaaaatctccttaagctgataagcaacttaggcaaagtc tcaggatacacaatcaatgtggaaaaatcacaagcattcttatacaccaataacagacag agagccaaatcatga >gi568815591r:121250464_121479016|GENSCAN_predicted_peptide_9|119_aa MKLPGQENKTAVVVGTITDDVWVQEVPKLKVCALGVTSRARSHIFRVGGKNLTFDQLALD SPKVCGTVLLSGPRKGREVYWHFDKALGTPHSHTKPYVRSKGRKFEYARGQGASQGYKN >gi568815591r:121250464_121479016|GENSCAN_predicted_CDS_9|360_bp atgaagcttcctggccaggaaaacaaaacggccgtggttgtgggaaccataacagatgac gtgtgggttcaggaggtgcccaaactgaaggtgtgtgcactgggcgtgaccagccgggcc cgcagccacatcttcagggtggggggcaagaacctcactttcgaccagctggccctggac tcccccaaggtctgcggcaccgtcctgctctcaggtcctcgcaagggtcgagaggtgtac tggcatttcgacaaggccctgggaaccccacacagccacaccaaaccctacgtccgctcc aagggccggaagttcgagtatgccagaggccaaggggccagccaaggctacaaaaactaa