GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:59:11 Sequence gi568815578r:33989733_34205497 : 215765 bp : 44.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2904 3030 127 2 1 68 76 65 0.498 3.55 1.02 Intr + 7072 7126 55 2 1 135 103 47 0.613 8.94 1.03 Term + 10365 10470 106 0 1 52 49 68 0.161 -2.82 1.04 PlyA + 10493 10498 6 1.05 2.00 Prom + 11330 11369 40 -2.56 2.01 Init + 13785 13824 40 2 1 93 79 33 0.348 3.25 2.02 Intr + 23296 23408 113 2 2 64 102 1 0.008 -0.80 2.03 Intr + 27705 27773 69 2 0 108 79 1 0.019 0.58 2.04 Intr + 69671 69734 64 1 1 81 92 51 0.434 3.09 2.05 Term + 71491 71507 17 2 2 130 43 22 0.403 0.30 2.06 PlyA + 74265 74270 6 1.05 3.00 Prom + 76127 76166 40 -1.36 3.01 Init + 82199 82210 12 1 0 73 96 5 0.565 0.01 3.02 Intr + 82334 82598 265 1 1 105 94 360 0.632 35.39 3.03 Intr + 83831 83903 73 0 1 82 75 8 0.998 -2.64 3.04 Intr + 84087 84134 48 0 0 113 99 183 0.999 19.70 3.05 Intr + 86142 86308 167 0 2 96 65 211 0.999 19.20 3.06 Intr + 86970 87083 114 1 0 83 76 190 0.997 17.72 3.07 Intr + 87296 87513 218 0 2 82 81 274 0.999 24.32 3.08 Intr + 90178 90340 163 0 1 110 49 120 0.962 9.85 3.09 Term + 91452 91507 56 1 2 76 44 45 0.284 -3.38 3.10 PlyA + 93431 93436 6 1.05 4.10 PlyA - 93444 93439 6 1.05 4.09 Term - 100173 99998 176 1 2 87 28 139 0.945 5.72 4.08 Intr - 100870 100785 86 0 2 64 89 10 0.811 -1.84 4.07 Intr - 103999 103943 57 0 0 87 116 14 0.670 2.10 4.06 Intr - 107073 106925 149 0 2 75 93 124 0.975 10.63 4.05 Intr - 107784 107684 101 1 2 78 94 48 0.982 4.23 4.04 Intr - 108901 108766 136 1 1 63 90 164 0.975 14.24 4.03 Intr - 113833 113730 104 2 2 100 62 111 0.604 9.59 4.02 Intr - 115813 115636 178 1 1 85 100 203 0.840 20.69 4.01 Init - 117574 117530 45 1 0 81 87 -2 0.329 -0.32 4.00 Prom - 119292 119253 40 -6.76 5.00 Prom + 121631 121670 40 -8.36 5.01 Sngl + 121872 122666 795 2 0 42 48 355 0.975 22.79 5.02 PlyA + 124422 124427 6 1.05 6.00 Prom + 130407 130446 40 -1.86 6.01 Init + 132738 132792 55 2 1 81 7 113 0.869 3.95 6.02 Intr + 132897 133329 433 1 1 -5 32 375 0.743 15.30 6.03 Term + 133354 133558 205 1 1 19 46 249 0.969 10.64 6.04 PlyA + 136041 136046 6 1.05 7.00 Prom + 145170 145209 40 -2.76 7.01 Init + 170229 170320 92 2 2 95 96 59 0.833 7.36 7.02 Term + 184509 184647 139 0 1 100 44 89 0.904 3.14 7.03 PlyA + 184720 184725 6 1.05 8.04 PlyA - 185668 185663 6 1.05 8.03 Term - 185740 185684 57 1 0 104 40 36 0.008 -1.91 8.02 Intr - 201756 201684 73 2 1 62 99 70 0.653 4.91 8.01 Init - 207026 206971 56 2 2 76 80 84 0.753 7.16 8.00 Prom - 207280 207241 40 -5.66 9.00 Prom + 210389 210428 40 -3.76 9.01 Init + 215380 215392 13 0 1 94 113 2 0.313 3.27 9.02 Intr + 215606 215731 126 0 0 54 109 76 0.371 6.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_1|95_aa GREVIKQQKDNYNCWYGKARTLNSNSAPFSGIPSVKQVLGKRSPENYRSTFCEFDYSRNL IWKGENDSRNVNLLVLQLPPGYCKEIVLRVQLTDP >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_1|288_bp ggcagggaggtcatcaaacaacagaaggacaattataactgctggtatggcaaagccagg actctgaattcaaactctgctccattctcaggaatcccgtctgttaagcaggttctgggg aagcgatcccctgaaaactatcgttctactttctgtgaatttgactactctaggaacctc atctggaagggagaaaatgacagccgtaatgttaacctgcttgtcctacaacttccccct gggtactgtaaggaaattgtgctcagggtgcagctcacagacccttag >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_2|100_aa MEQKTGGELFSGNIKYSINYVRYSTLYYKIGFVLVDFAHLKASVNVLRIFKTERELRLSP WSCSKTGGNSLKPVEDLQQLSILGVFLSMLTWHMSGCSMT >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_2|303_bp atggagcaaaagactggaggtgaactgttcagcgggaacataaagtattcaataaattat gtgagatattcaacactgtattataaaataggctttgtgttagttgattttgctcacctg aaggctagtgttaatgttctgcgcatattcaagactgaacgggagctcaggctgtctccc tggagctgttcaaagactggaggaaattctttaaaacctgtggaagacctgcagcagctg tccatcctgggagtattcctgtctatgcttacgtggcacatgtcaggatgtagcatgacg tag >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_3|371_aa MQEQVGTMSLKLQASNVTNKNDPKSINSRVFIGNLNTALVKKSDVETIFSKYGRVAGCSV HKGYAFVQYSNERHARAAVLGENGRVLAGQTLDINMAGEPKPDRPKGLKRAASAIYSGYI FDYDYYRDDFYDRLFDYRGRLSPVPVPRAVPVKRPRVTVPLVRRVKTNVPVKLFARSTAV TTSSAKIKLKSSELQAIKTELTQIKSNIDALLSRLEQIAAEQKANPDGKKKGDGGGAGGG GGGGGSGGGGSGGGGGGGSSRPPAPQENTTSEAGLPQGEARTRDDGDEEGLLTHSEEELP DRSNGHQQVKGIAAPGLKPGTQPWMPPPSGYQRKAGSRRLLPQRIPASAMSSAGYKYLPW SSELRDSYVAT >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_3|1116_bp atgcaggaacaggtgggcaccatgtccttgaagcttcaggcaagcaatgtaaccaacaag aatgaccccaagtccatcaactctcgagtcttcattggaaacctcaacacagctctggtg aagaaatcagatgtggagaccatcttctctaagtatggccgtgtggccggctgttctgtg cacaagggctatgcctttgttcagtactccaatgagcgccatgcccgggcagctgtgctg ggagagaatgggcgggtgctggccgggcagaccctggacatcaacatggctggagagcct aagcctgacagacccaaggggctaaagagagcagcatctgccatatacagtggctacatc tttgactatgattactaccgggacgacttctacgacaggctcttcgactaccggggccgt ctgtcgcccgtgccagtgcccagggcggtccctgtgaagcgaccccgggtcacagtccct ttggtccggcgtgtcaaaactaacgtacctgtcaagctctttgcccgctccacagctgtc accaccagctcagccaagatcaagttaaagagcagtgagctgcaggccatcaagacggag ctgacacagatcaagtccaatatcgatgccctgctgagccgcttggagcagatcgctgcg gagcaaaaggccaatccagatggcaagaagaagggtgatggaggtggcgccggcggcggc ggcggtggtggtggcagcggtggcggtggcagtggtggtggcggtggcggtggcagcagc cggccaccagccccccaagagaacacaacttctgaggcaggcctgccccagggggaagca cggacccgagacgacggcgatgaggaagggctcctgacacacagcgaggaagagctgcct gacaggagcaatggccaccagcaggtgaagggcatcgctgccccaggcctcaagccgggc acccaaccctggatgccaccccccagcgggtaccagaggaaagctggcagcaggcgcctc ctcccccaacgcatcccagccagtgccatgtcctctgcaggttacaagtacctaccctgg tccagtgaactgcgggattcatatgtggccacttag >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_4|343_aa MLNLSMVLIEYGERLMIFDPTMSKKKKKKKKPFMLDEEGDTQTEETQPSETKEVEPEPTE DKDLEADEEDTRKKDASDDLDDLNFFNQKKKKKKTKKIFDIDEAEEGVKDLKIESDVQEP TEPEDDLDIMLGNKKKKKKNVKFPDEDEILEKDEALEDEDNKKDDGISFSNQTGPAWAGS ERDYTYEELLNRVFNIMREKNPDMVAGEKRKFVMKPPQVVRVGTKKTSFVNFTDICKLLH RQPKHLLAFLLAELGTSGSIDGNNQLVIKGRFQQKQIENVLRRYIKEYVTCHTCRSPDTI LQKDTRLYFLQCETCHSRCSVASIKTGFQAVTGKRAQLRAKAN >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_4|1032_bp atgctgaatttaagcatggtattaatagaatatggagagagattgatgatttttgatcct actatgagcaagaagaaaaagaagaagaagaagccttttatgttagatgaggaaggggat acccaaacagaggaaacccagccttcagaaacaaaagaagtggagccagagccaactgag gacaaggatttggaagctgatgaagaggacactaggaaaaaagatgcttctgatgatcta gatgacttgaacttctttaatcaaaagaaaaagaagaaaaaaactaaaaagatatttgat attgatgaagctgaagaaggtgtaaaggatcttaagattgaaagtgatgttcaagaacca actgaaccagaggatgaccttgacattatgcttggcaataaaaagaagaaaaagaagaat gttaagttcccagatgaggatgaaatactagagaaagatgaagctctagaagatgaagac aacaaaaaagatgatggtatctcattcagtaatcagacaggccctgcttgggcaggctca gaaagagactacacatacgaggagctgctgaatcgagtgttcaacatcatgagggaaaag aatccagatatggttgctggggagaaaaggaaatttgtcatgaaacctccacaagtcgtc cgagtaggaaccaagaaaacttcttttgtcaactttacagatatctgtaaactattacat cgtcagcccaaacatctccttgcatttttgttggctgaattgggtacaagtggttctata gatggtaataaccaacttgtaatcaaaggaagattccaacagaaacagatagaaaatgtc ttgagaagatatatcaaggaatatgtcacttgtcacacatgccgatcaccggacacaatc ctgcagaaggacacacgactctatttcctacagtgcgaaacttgtcattctagatgttct gttgccagtatcaaaaccggcttccaggctgtcacgggcaagcgagcacagctccgtgcc aaagctaactaa >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_5|264_aa MLGRQTRAGYRPRQDSLESFNEGVIPSQERLLAAPLAPPAPSEASWGGSAQQKGMQGSPA PVSFGVINRAFEPLPGGRLNVCSPFRKRQPRQEVRLSAAALRKAGCGQACGSGVLAGSAA PGRLGAAASHGGGRLLGRRPVLPHWSGLGFSPQSLALTPGSDPHLVPRHGCGSSGLGTDG KSDGSAPGPGGSAAPADTSPTTALGSCISERKRHPALSSKRWKCACAELAKAQARRRANL ARFVWRELPGDAASPGMPGLVVHL >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_5|795_bp atgctaggaaggcaaacaagggcgggataccggccccgacaagatagcctagaaagtttt aacgagggcgttataccaagccaagagaggctcctcgcggcacccctggcacccccggca cccagtgaggcgagctggggcggcagtgcccagcagaaaggaatgcagggatcgccagcc ccagtctcgttcggggttattaaccgggcattcgagccccttccgggaggacgactcaac gtgtgctctccattccggaaacgccaaccacggcaggaggtccgcctcagcgctgccgcc ctgaggaaagcgggctgcggccaagcctgcggctccggagtcctcgccggctctgccgct ccgggccgcctaggcgcggctgcctcacatggcggcggccggctgctaggccgcaggccc gttctcccacactggagtgggttaggcttctcgcctcaatccttagcccttacgccgggc tcagatcctcacctcgtccccagacatggctgcggctcgagtgggctcggcacggacggg aagtcagacgggtcagccccaggccccggcggcagcgctgcccctgccgatacctctccc accaccgcactaggctcttgcatcagcgaaaggaaacgacaccccgccctctcctccaag cgttggaaatgcgcctgcgcagagctcgccaaggcgcaggcgcgcagaagagccaaccta gcgcgcttcgtgtggcgcgagttaccaggcgacgccgcgtcgccaggcatgccgggactt gtagtccatttataa >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_6|230_aa MGNRGGFRGGFGGGIRGQGHEDQVPGGCLFSLPIKESETIDFFLGSSLKEEGLKIMPVQK QTRAGQRTRFKAFVATGDYNGHVGLCVKCSKEVATAIRGSVILTKHSIVPVRRGYGGTRW TSPHRPLQGDRPLRLCAGAPHPCTQGHWHRLRTCAQEAPHDGCQGLHCHPGNFAKATFDT ISKTYSYLTLDLWKETIFTKSPYQEFTDHLVKTHTRVSMQRTQAPAVATA >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_6|693_bp atggggaaccgcggtggcttccgcggaggtttcggcggtggcatccggggccagggacat gaagatcaagtacctggaggatgtctcttctccctgcccatcaaggaatctgagaccatt gacttttttctggggtcctctctcaaggaagagggtttgaagattatgccggtgcagaag cagacccgcgcgggccagcgcaccaggttcaaggcgtttgttgccaccggggactacaat ggccacgtcggtctgtgcgttaagtgctccaaggaggtggctactgccatccgcgggtcc gtcatcctgaccaagcactccattgtccccgtgcgcagaggctacgggggaacaagatgg acaagcccacaccgtcccttgcaaggtgacaggccgctgcggctctgtgctggtgcacct catccctgcacccaggggcactggcatcgtctccgcacctgtgcccaagaagctcctcat gatggctgccaggggctgcactgccaccctggcaacttcgccaaggccacctttgacacc atctctaagacctacagctacctgaccctcgacctctggaaggagactatattcaccaag tctccctatcaggaattcactgaccacctcgtcaagacccacaccagagtctccatgcag aggacccaggctccagctgtggctacagcatag >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_7|76_aa MKNLSQARTIIVPHSPGNSNWSKEQAYDPESPPPEISADQNRSLKKLPEAGESSTRKEQK EQSSEYQPQQDKIHYV >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_7|231_bp atgaagaatttgtctcaggcccggacaatcatagttcctcattcccctggcaacagtaat tggtccaaggagcaggcatatgacccagaaagtccccctcctgaaattagtgctgatcag aacagaagtttgaagaaactacctgaggctggagaaagtagcaccagaaaagagcagaag gaacaatcctcagagtaccagccccaacaagataaaattcactatgtctga >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_8|61_aa MQTSKGSREYFTPQNRLRRSCSRPPTDRWDQRAFNGDDSVLREPNTAVKKRQSVVFQELR D >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_8|186_bp atgcagacctctaagggctctagggagtacttcacccctcagaaccggctacgcaggtcc tgctccaggcctcccacagaccgctgggaccagagggccttcaatggggatgacagtgtg ctcagggagcccaacactgcagtgaaaaagagacagtctgtcgtcttccaggagcttaga gattag >gi568815578r:33989733_34205497|GENSCAN_predicted_peptide_9|47_aa MDPHGVKLQTFVVSVTAHRGGASGVVHSSRWARGLAGFRSEAADLLX >gi568815578r:33989733_34205497|GENSCAN_predicted_CDS_9|141_bp atggaccctcacggagtgaagctgcagaccttcgtggtgagtgttacagctcatagaggc ggcgcatccggagttgttcattcctcccgctgggctcgtggtctcgctggcttcaggagt gaagctgcagaccttctcgnn