GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:06:02 Sequence gi568815597r:39526510_39739625 : 213116 bp : 48.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.18 Intr - 2655 2507 149 1 2 119 114 19 0.025 7.55 1.17 Intr - 15332 15148 185 1 2 85 91 87 0.311 8.13 1.16 Intr - 19348 19224 125 1 2 70 99 59 0.306 4.58 1.15 Intr - 35278 35219 60 1 0 35 86 66 0.125 0.03 1.14 Intr - 35694 35564 131 1 2 90 105 76 0.988 9.91 1.13 Intr - 35907 35814 94 0 1 113 92 49 0.983 7.34 1.12 Intr - 37232 37105 128 0 2 95 94 80 0.974 9.70 1.11 Intr - 37413 37327 87 1 0 84 21 83 0.426 1.14 1.10 Intr - 38033 37914 120 0 0 89 97 4 0.751 1.87 1.09 Intr - 38264 38177 88 2 1 87 100 -4 0.634 0.24 1.08 Intr - 38869 38597 273 1 0 67 94 205 0.984 16.73 1.07 Intr - 41337 41242 96 0 0 56 62 94 0.859 3.81 1.06 Intr - 42430 42293 138 1 0 94 110 101 0.999 13.56 1.05 Intr - 43180 43086 95 2 2 72 113 145 0.999 15.08 1.04 Intr - 43493 43354 140 1 2 77 111 131 0.988 14.41 1.03 Intr - 44840 44725 116 2 2 104 65 271 0.999 25.65 1.02 Intr - 46077 45884 194 1 2 108 92 205 0.999 22.11 1.01 Init - 49442 49250 193 2 1 79 113 402 0.985 40.93 1.00 Prom - 50317 50278 40 -3.26 2.00 Prom + 56144 56183 40 -7.76 2.01 Init + 58240 58379 140 0 2 57 43 129 0.624 4.91 2.02 Intr + 69900 70058 159 0 0 56 78 124 0.254 7.30 2.03 Term + 83075 83156 82 2 1 75 51 86 0.007 0.77 2.04 PlyA + 88034 88039 6 1.05 3.06 PlyA - 92593 92588 6 1.05 3.05 Term - 100671 99998 674 1 2 121 49 176 0.273 10.92 3.04 Intr - 103799 103718 82 2 1 127 80 34 0.718 5.71 3.03 Intr - 105070 104933 138 1 0 49 83 102 0.216 6.46 3.02 Intr - 106206 106140 67 2 1 104 84 3 0.169 0.41 3.01 Init - 113116 113037 80 1 2 85 94 146 0.977 15.43 3.00 Prom - 129895 129856 40 -3.66 4.15 PlyA - 129913 129908 6 -0.45 4.14 Term - 132977 132612 366 2 0 82 37 264 0.682 15.30 4.13 Intr - 134754 134570 185 1 2 68 47 427 0.996 36.11 4.12 Intr - 136925 136803 123 0 0 106 110 225 0.999 27.06 4.11 Intr - 139141 139012 130 1 1 95 89 313 0.977 32.37 4.10 Intr - 139727 139560 168 2 0 83 100 330 0.987 33.84 4.09 Intr - 142701 142625 77 1 2 94 44 40 0.938 -0.57 4.08 Intr - 143369 143110 260 1 2 71 -28 185 0.846 2.31 4.07 Intr - 145596 145378 219 2 0 89 67 144 0.236 9.82 4.06 Intr - 156224 156031 194 2 2 114 -1 330 0.019 24.99 4.05 Intr - 157643 157428 216 2 0 76 64 627 0.999 57.80 4.04 Intr - 158102 157933 170 0 2 89 86 381 0.418 37.67 4.03 Intr - 164661 164556 106 0 1 87 94 -1 0.013 0.19 4.02 Intr - 168496 168439 58 0 1 98 70 99 0.422 7.99 4.01 Init - 177152 177091 62 2 2 86 64 52 0.142 3.32 4.00 Prom - 201801 201762 40 -1.66 5.00 Prom + 211336 211375 40 -3.16 5.01 Init + 212392 212422 31 0 1 95 80 61 0.375 6.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 89795 89650 146 2 2 67 80 121 0.884 6.94 S.002 Init + 117388 117457 70 0 1 96 58 52 0.854 4.34 S.003 Term - 156224 156027 198 2 0 114 43 346 0.979 30.10 S.004 Init + 164822 164890 69 1 0 47 52 127 0.862 4.05 S.005 Term + 165729 165911 183 2 0 127 47 98 0.939 7.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:39526510_39739625|GENSCAN_predicted_peptide_1|804_aa MNAAASSYPMASLYVGDLHSDVTEAMLYEKFSPAGPVLSIRVCRDMITRRSLGYAYVNFQ QPADAERALDTMNFDVIKGKPIRIMWSQRDPSLRKSGVGNVFIKNLDKSIDNKALYDTFS AFGNILSCKVVCDENGSKGYAFVHFETQEAADKAIEKMNGMLLNDRKVFVGRFKSRKERE AELGAKAKEFTNVYIKNFGEEVDDESLKELFSQFGKTLSVKVMRDPNGKSKGFGFVSYEK HEDANKAVEEMNGKEISGKIIFVGRAQKKVERQAELKRKFEQLKQERISRYQGVNLYIKN LDDTIDDEKLRKEFSPFGSITSAKVMLEDGRSKGFGFVCFSSPEEATKAVTEMNGRIVGS KPLYVALAQRKEERKAHLTNQYMQRVAGMRALPANAILNQFQPAAGGYFVPAVPQAQGRP PYYTPNQLAQMRPNPRWQQGGRPQGFQGMPSAIRQSGPRPTLRHLAPTGNAPASRGLPTT TQRVGSECPDRLAMDFGGAGAAQQGLTDSCQSGGVPTAVQNLAPRAAVAAAAPRAVAPYK YASSVRSPHPAIQPLQAPQPAVHVQGQEPLTASMLAAAPPQEQKQMLGERLFPLIQTMHS NLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAVAVLQAHHAKKEAAQKGEPTAKKGL VKSSGVHGHQDWEQAGWLHPNPPAFRRRAHDHREFSLHVHPFKGLGFKGSSFHRIIPQFI CQGSDFTNHGGTRGKSIYKKKFDDENFIFKHTGPGLRRHGTKCPMAHLALMEASKGQDQH PLLRRKERAQQVHDVVKLTRGRGR >gi568815597r:39526510_39739625|GENSCAN_predicted_CDS_1|2412_bp atgaacgctgcggccagcagctaccccatggcctccctgtacgtgggcgacctgcattcg gacgtcaccgaggccatgctgtacgaaaagttcagccccgcggggcctgtgctgtccatc cgggtctgccgcgatatgatcacccgccgctccctgggctatgcctacgtcaacttccag cagccggccgacgctgagcgggctttggacaccatgaactttgatgtgattaagggaaag ccaatccgcatcatgtggtctcagagggatccctctttgagaaaatctggtgtgggaaac gtcttcatcaagaacctggacaaatctatagataacaaggcactttatgatactttttct gcttttggaaacatactgtcctgcaaggtggtgtgtgatgagaacggctctaagggttat gcctttgtccacttcgagacccaagaggctgccgacaaggccatcgagaagatgaatggc atgctcctcaatgaccgcaaagtatttgtgggcagattcaagtctcgcaaagagcgggaa gctgagcttggagccaaagccaaggaattcaccaatgtttatatcaaaaactttggggaa gaggtggatgatgagagtctgaaagagctattcagtcagtttggtaagaccctaagtgtc aaggtgatgagagatcccaatgggaaatccaaaggctttggctttgtgagttacgaaaaa cacgaggatgccaataaggctgtggaagagatgaatggaaaagaaataagtggtaaaatc atatttgtaggccgtgcacaaaagaaagtagaacggcaggcagagttaaaacggaaattt gaacagttgaaacaggagagaattagtcgatatcagggggtgaatctctacattaagaac ttggatgacactattgatgatgagaaattaaggaaagaattttctccttttggatcaatt accagtgctaaggtaatgctggaggatggaagaagcaaagggtttggcttcgtctgcttc tcatctcctgaagaagcaaccaaagcagtcactgagatgaatggacgcattgtgggctcc aagccactatatgttgccctggcccagaggaaggaagagagaaaggctcacctgaccaac cagtatatgcaacgagtggctggaatgagagcacttcctgccaatgccatcttaaatcag ttccagcctgcagcgggtggctactttgtgccagcagtcccacaggctcagggaaggcct ccatattatacacctaaccagttagcacagatgaggcctaatccacgctggcagcaaggt gggagacctcaaggcttccaaggaatgccaagtgctatacgccagtctgggcctcgtcca actcttcgccatctggctccaactggtaatgctccggcctctcgtggcctccctactacc actcagagagtcgggtctgagtgcccggaccgcttggctatggactttggtggggctggt gccgcccagcaagggctgactgacagctgccagtctggaggcgttcccacagctgtgcag aacttagcgccacgcgctgctgttgctgctgctgctccccgggctgttgccccctacaaa tacgcctccagtgtccgcagccctcatcctgccatacagcctctgcaggcaccccagcct gcggtccatgtgcaggggcaggagccactgactgcctccatgctggctgcagcacccccc caggaacagaagcagatgctgggagaacgcttgttcccactcatccaaacaatgcattca aatctggctgggaagatcacgggaatgctgctggagatagacaactctgagctgctgcac atgttagagtcccccgagtctctccgctccaaggtggatgaagctgtagcagttctacag gctcatcatgccaagaaagaagctgcccagaagggagagcccactgctaaaaaaggccta gtcaaatcctcaggtgtacatggacatcaagattgggaacaagccggctggctgcatcca aaccctcctgcgttccgacgtcgtgcccatgaccacagagaattttctctacatgtgcac ccatttaaaggactcggcttcaagggaagcagcttccaccgcattatcccccagttcatt tgccagggcagtgatttcacaaaccacggtggcaccaggggtaagtccatctacaagaag aaatttgatgatgaaaactttatcttcaaacacacaggaccaggtctgcgcaggcatggc actaagtgtcccatggcacatttggccctcatggaggcctcaaaagggcaggaccagcac cctcttctacggaggaaggaacgggcacagcaggtacatgatgtggtcaaattaacaaga ggcagaggccgg >gi568815597r:39526510_39739625|GENSCAN_predicted_peptide_2|126_aa MTKNQILKAETNVENPSSVEIRGMNEENHNCYSPVQVPRLNTGYKDDPWHVDTTQIVAAP VELGIASASAFEQGAPRCHFAPGLTNDVADPEQKIEEGGKNALILATVAEGREEWGEDDE TGPAVK >gi568815597r:39526510_39739625|GENSCAN_predicted_CDS_2|381_bp atgaccaagaaccagatattaaaagctgaaacaaatgtggaaaatccatcatctgtggag atcaggggtatgaacgaggagaatcataactgctattctcctgttcaagtccccaggctg aacactggctacaaagatgatccctggcatgttgacaccactcagattgtggctgcccct gtggagttaggaattgcctcggcatcagccttcgaacaaggggctccacgctgtcatttt gcacctggcctcacaaatgatgtagctgatcctgagcagaagattgaggaaggtggaaag aatgctctaatactggccacggtcgcagaagggagggaagaatggggcgaggacgatgaa acagggccagcggtgaagtga >gi568815597r:39526510_39739625|GENSCAN_predicted_peptide_3|346_aa MKRPKEPSGSDGESDGPIDVGQEGQLSQMARPLSTPSSSQMQARKKHRGIIEKRRRDRIN SSLSELRRLVPTAFEKQVCDIADCARGNTVWVADKGSSKLEKAEVLQMTVDHLKMLHATG GTGFFDARALAVDFRSIGFRECLTEVIRYLGVLEGPSSRADPVRIRLLSHLNSYAAEMEP SPTPTGPLAFPAWPWSFFHSCPGLPALSNQLAILGRVPSPVLPGVSSPAYPIPALRTAPL RRATGIILPARRNVLPSRGASSTRRARPLERPATPVPVAPSSRAARSSHIAPLLQSSSPT PPGPTGSAAYVAVPTPNSSSPGPAGRPAGAMLYHSWVSEITEIGAF >gi568815597r:39526510_39739625|GENSCAN_predicted_CDS_3|1041_bp atgaagcgacccaaggagccgagcggctccgacggggagtccgacggacccatcgacgtg ggccaagagggccagctgagccagatggccaggccgctgtccacccccagctcttcgcag atgcaagccaggaagaaacacagagggatcatagagaaacggcgtcgagaccgcatcaac agtagcctttctgaattgcgacgcttggtccccactgcctttgagaaacaggtatgtgac attgctgattgtgctagaggaaatactgtgtgggtggcagataagggctcttccaagctg gagaaagccgaggtcttgcagatgacggtggatcacttgaaaatgctccatgccactggt gggacaggattctttgatgcccgagccctggcagttgacttccggagcattggttttcgg gagtgcctcactgaggtcatcaggtacctgggggtccttgaagggcccagcagccgtgca gaccccgtccggattcgccttctctcccacctcaacagctacgcagccgagatggagcct tcgcccacgcccactggccctttggccttccctgcctggccctggtctttcttccatagc tgtccagggctgccagccctgagcaaccagctcgccatcctgggaagagtgcccagccct gtcctccccggtgtctcctctcctgcttaccccatcccagccctccgaaccgctcccctt cgcagagccacaggcatcatcctgccagcccggaggaatgtgctgcccagtcgaggggca tcttccacccggagggcccgccccctagagaggccagcgacccctgtgcctgtcgccccc agcagcagggctgccaggagcagccacatcgctcccctcctgcagtcttcctccccaaca ccccctggtcctacagggtcggctgcttacgtggctgttcccacccccaactcatcctcc ccagggccagctgggaggccagcgggagccatgctctaccactcctgggtctctgaaatc actgaaatcggggctttctga >gi568815597r:39526510_39739625|GENSCAN_predicted_peptide_4|777_aa MIVKRRNLYRKPFKELNQQERACSFNNVEQVFAKTPSLRTAAATEEPTPGQETPHLSPGP WVQRRRQSRLLIPSSGPAMGKTNSKLAPEVLEDLVQNTEFSEQELKQWYKGFLKDCPSGI LNLEEFQQLYIKFFPYGDASKFAQHAFRTFDKNGDGTIDFREFICALSVTSRGSFEQKLN WAFEMYDLDGDGRITRLEMLEIIEAIYKMVGTVIMMRMNQDGLTPQQRVDKIFKKMDQDK DDQITLEEFKEAAKSDPSIVLLLQCDMQNPAPATSASTPPPALAGPGQSRSMEPGQPREP QEPREPGPGAETAAAPVWEEAKIFYDNLAPKKKPKSVKAMHGKITLGWIGKGESGSKPTS EKAGASLKRKLDECQCCGLSDAEEGMETGDWGSESISAQNPFPPQAPRNRRAQAQLSVMT PGSAPDSHAGAISLTLKGRESCQHPGFLELRAPRPKPQNAVTIAVSSRALFRMDEEQQIY TEQGVEEYVRYQLEHENEPFSPGPAFPFVKALEAVNRRLRELYPDSEDVFDIVLMTNNHA QVGVRLINSINHYDLFIERFCMTGGNSPICYLKAYHTNLYLSADAEKVREAIDEGIAAAT IFSPSRDVVVSQSQLRVAFDGDAVLFSDESERIVKAHGLDRFFEHEKAHENKPLAQGPLK GFLEALGRLQKKFYSKGLRLECPIRTYLVTARSAASSGARALKTLRSWGLETDEALFLAG APKGPLLEKIRPHIFFDDQMFHVAGAQEMGTVAAHVPYGVAQTPRRTAPAKQAPSAQ >gi568815597r:39526510_39739625|GENSCAN_predicted_CDS_4|2334_bp atgattgtaaagaggagaaacctctacagaaaaccattcaaggaacttaatcagcaagaa agagcctgctccttcaacaacgtggaacaggtgtttgccaagactccaagcctgagaaca gcggcagcaactgaagagcccacccctgggcaggagaccccccacctcagtccaggccca tgggtgcagagaaggaggcagagccgcctacttatccccagctcaggccccgccatgggg aagaccaacagcaagctggcccccgaggtgctggaggaccttgttcagaacactgagttc agcgagcaggagctgaagcagtggtacaagggcttcctgaaggactgccccagcggcatc ctcaacctggaggagtttcagcagctctacatcaagttcttcccctacggcgacgcctcc aagttcgcgcagcacgctttccgcaccttcgacaagaacggcgacggcaccatcgacttc cgggagttcatctgcgccctgtcggtcacctcccgcggcagcttcgagcagaagctcaac tgggcctttgagatgtacgacctggacggcgacgggcgcatcacgcgcctggagatgctg gagatcatcgaggcaatctacaagatggtgggcaccgtgatcatgatgcgcatgaaccag gacgggctcacgccccagcagcgtgtggacaagatcttcaagaagatggaccaggataag gacgaccagattacattggaggagttcaaggaggcagccaagagtgacccatccattgtg ttgctgctgcagtgtgacatgcagaacccagccccagccacctccgcgtctacgccgccg cctgctctggccgggccgggtcagagccggagcatggaacctgggcagccccgggagccc caggagccccgcgagcccgggccaggagcggagaccgctgcggccccggtctgggaggaa gccaagattttctacgacaacctcgcgcccaagaagaaacccaaatcggtaaaagcaatg cacggaaagatcactctgggatggattggaaagggtgagtccggcagcaaacccaccagt gagaaggctggtgcctctctgaagaggaagctggatgaatgtcaatgctgtggcctgagt gatgcagaagaagggatggagactggggactggggcagtgaaagcatcagtgctcaaaat ccatttcctccccaggcaccaaggaaccggagagctcaggcccagttgtcagtcatgacc cctggatcagcccctgactctcatgccggagccatcagcctgaccctgaaaggcagagag agctgccagcaccctggtttcctggagctcagagctcccagacccaagcctcagaatgca gtcaccatcgctgtgtcctcccgagccttgtttcgcatggacgaggagcagcagatctac acggagcagggcgtggaggagtacgtgcgctaccagctggaacatgagaacgaacccttc agtcccgggccagccttcccttttgtgaaggctctggaggccgtgaacaggcggctgcgg gagctgtaccctgatagtgaggacgtcttcgacatcgtcctcatgactaacaaccatgct caagtgggtgtccgcctcatcaacagtatcaaccactatgacctgttcatcgagaggttc tgcatgacaggtgggaacagcccgatctgctacctcaaggcctatcacaccaacctctac ttgtcagccgatgcggaaaaagtgcgagaagccattgatgaggggatcgcagctgccacc atcttcagccccagcagggatgtggttgtgtcccagagtcagctgcgcgtggccttcgat ggggacgccgtgctcttctcggacgagtcggagcgcatcgtcaaggcccacgggctggac cgattcttcgagcatgagaaggcccacgagaacaaacctctggctcagggccccttaaag ggctttctggaggcactgggtaggttgcagaagaagttctactccaaaggcctgcggctg gagtgcccaattcgtacctacttggtgacagcacgcagtgcagccagttccggggcccgg gctctcaagaccctgcgcagctggggcctggagacagatgaagccttgttccttgctgga gcgcccaagggccctctccttgagaagatccgcccacacatcttctttgatgaccagatg ttccatgtggctggggctcaggagatgggcactgtggccgcccatgtgccttatggtgtg gcacagacaccccggcggactgcacctgcaaagcaggccccatctgcacagtag >gi568815597r:39526510_39739625|GENSCAN_predicted_peptide_5|11_aa MATTKRVLYVX >gi568815597r:39526510_39739625|GENSCAN_predicted_CDS_5|33_bp atggccaccaccaagcgcgtcttgtacgtggnn