GENSCAN 1.0    Date run: 5-Nov-116    Time: 08:06:35

Sequence gi568815587r:130134156_130361873 : 227718 bp : 44.37% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1408   1560  153  2  0   94   93   121 0.972  13.24
 1.02 Intr +   6243   6328   86  1  2   98   89    70 0.993   7.64
 1.03 Intr +   7343   7417   75  1  0   78   87   124 0.999  11.01
 1.04 Intr +   7764   7919  156  2  0   79   78   326 0.987  30.91
 1.05 Term +   9192   9293  102  2  0  111   43   176 0.996  13.68
 1.06 PlyA +   9619   9624    6                               1.05

 2.00 Prom +   9900   9939   40                              -4.86
 2.01 Init +  16720  16762   43  0  1   59  105    53 0.607   4.68
 2.02 Term +  23046  23143   98  1  2   46   47   114 0.190   1.23
 2.03 PlyA +  24188  24193    6                              -0.45

 3.00 Prom +  24839  24878   40                              -4.56
 3.01 Init +  25825  25905   81  0  0  107   83   117 0.928  14.07
 3.02 Term +  44986  45054   69  0  0   62   44   102 0.325   1.14
 3.03 PlyA +  46636  46641    6                               1.05

 4.00 Prom +  47662  47701   40                              -2.46
 4.01 Init +  53067  53129   63  2  0   81   50    47 0.721   1.36
 4.02 Intr +  53959  54118  160  0  1   91   65   258 0.999  23.36
 4.03 Intr +  54375  54502  128  1  2  100  101   228 0.991  25.70
 4.04 Intr +  54714  54784   71  2  2   92   94   113 0.988  10.28
 4.05 Intr +  55584  55741  158  0  2   97   75   330 0.996  32.25
 4.06 Intr +  55958  55993   36  0  0   98   98    19 0.808   2.13
 4.07 Intr +  56299  56539  241  2  1  101   73   417 0.991  38.11
 4.08 Intr +  59994  60133  140  0  2  136   64   152 0.998  17.71
 4.09 Intr +  60485  60582   98  0  2  109   75    73 0.602   7.83
 4.10 Intr +  62184  62293  110  2  2   84   81   263 0.982  24.28
 4.11 Intr +  62415  62545  131  0  2  131   84   249 0.999  29.14
 4.12 Intr +  63686  63790  105  0  0   72   73   200 0.999  17.09
 4.13 Intr +  64153  64263  111  2  0  115   73   250 0.999  26.55
 4.14 Intr +  64353  64466  114  1  0   66   72   101 0.883   6.72
 4.15 Intr +  64792  64914  123  2  0   73   89   194 0.998  18.56
 4.16 Intr +  65796  65982  187  1  1  116   94   140 0.941  16.15
 4.17 Intr +  74255  74529  275  2  2   73   99   488 0.998  45.58
 4.18 Intr +  75287  75450  164  0  2  118   80   263 0.889  28.19
 4.19 Intr +  75507  75662  156  2  0   72   32    90 0.365   2.01
 4.20 Term +  75992  76081   90  1  0   51   55    90 0.497  -0.28
 4.21 PlyA +  76108  76113    6                              -5.22

 5.09 PlyA -  76199  76194    6                               1.05
 5.08 Term -  79199  79005  195  2  0   37   41   192 0.493   6.81
 5.07 Intr -  79487  79303  185  0  2   47   59    37 0.165  -3.79
 5.06 Intr -  80119  80046   74  0  2   85   99    36 0.317   3.45
 5.05 Intr -  86167  86030  138  0  0   31   83    85 0.450   1.88
 5.04 Intr - 102938 102638  301  0  1   75   94   306 0.873  25.69
 5.03 Intr - 104452 104289  164  0  2   90   95   126 0.630  13.12
 5.02 Intr - 105741 105657   85  1  1   87   69   108 0.996   7.68
 5.01 Init - 108889 108715  175  1  1   49   51    79 0.342  -0.19
 5.00 Prom - 112130 112091   40                              -3.76

 6.00 Prom + 114988 115027   40                              -5.76
 6.01 Init + 116674 116816  143  0  2   78   71   114 0.504   8.31
 6.02 Term + 119274 119784  511  0  1   13   48   203 0.403   2.65
 6.03 PlyA + 119828 119833    6                               1.05

 7.02 PlyA - 120631 120626    6                               1.05
 7.01 Sngl - 127718 126666 1053  2  0   61   39   475 0.508  37.06
 7.00 Prom - 129238 129199   40                              -2.46

 8.00 Prom + 129828 129867   40                              -1.06
 8.01 Init + 161685 161917  233  2  2   60   51   217 0.365  12.93
 8.02 Intr + 162315 162430  116  0  2   52   43   129 0.408   4.89
 8.03 Term + 162738 162901  164  1  2   39   49   179 0.488   7.20
 8.04 PlyA + 163697 163702    6                               1.05

 9.05 PlyA - 166846 166841    6                               1.05
 9.04 Term - 174952 174931   22  0  1   72   44    39 0.142  -4.32
 9.03 Intr - 180277 180220   58  2  1   76   97    93 0.756   6.94
 9.02 Intr - 181645 181565   81  2  0   47  100    64 0.880   3.11
 9.01 Init - 192293 192203   91  2  1   65   86    65 0.822   4.65

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_1|190_aa
XELLQEQRADMDQFTASISETPVDVRVSSEESEEIPPFHPFHPFPALPENEGSGVGEQDG
GLIGAEEKVINSKNKVDENMVIDETLDVKEMIFNAERVGGLEEERESVGPLREDFSLSSS
ALIGLLVIAVAIATVIVISLVMLRKRQYGTISHGIVEVDPMLTPEERHLNKMQNHGYENP
TYKYLEQMQI

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_1|573_bp
natgagctccttcaggagcagcgtgcagatatggaccagttcactgcctcaatctcagag
acccctgtggacgtccgggtgagctctgaggagagtgaggagatcccaccgttccacccc
ttccaccccttcccagccctacctgagaacgaaggatctggagtgggagagcaggatggg
ggactgatcggtgccgaagagaaagtgattaacagtaagaataaagtggatgaaaacatg
gtcattgacgagactctggatgttaaggaaatgattttcaatgccgagagagttggaggc
ctcgaggaagagcgggaatccgtgggcccactgcgggaggacttcagtctgagtagcagt
gctctcattggcctgctggtcatcgcagtggccattgccacggtcatcgtcatcagcctg
gtgatgctgaggaagaggcagtatggcaccatcagccacgggatcgtggaggttgatcca
atgctcaccccagaagagcgtcacctgaacaagatgcagaaccatggctatgagaacccc
acctacaaatacctggagcagatgcagatttag

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_2|46_aa
MQVEASCSWDGSGTAVNALHSYGIQITLSAQLYEIKDEALGSGETT

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_2|141_bp
atgcaggtggaagcgtcctgctcctgggacggcagcggcactgcagtcaatgcattgcat
tcatacggcatccaaatcacgctgagcgcccagctctacgagataaaagatgaggcccta
ggaagtggggagacaacgtga

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_3|49_aa
MGSDRARKGGGGPKDFGAGLKYNSRHEVPKNDLEHLNKNFHIRGPAPDA

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_3|150_bp
atggggagcgatcgggcccgcaagggcggagggggcccgaaggacttcggcgcgggactc
aagtacaactcccggcacgaggttcccaagaatgacctggagcacttgaataaaaacttc
cacatccgaggccctgctcctgatgcatga

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_4|886_aa
MVHPQDSGRADGMCYRQKPSPKVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLL
LVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKL
LYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPRAR
SLKSFVVTSVVAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQW
ALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFH
SSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDC
TWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTSNSNKIT
VRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNC
SCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNG
KDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGL
RSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRY
SDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSM
VRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITP
RMMCVGFLSGGVDSCQVAPGAGGRQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPG
VYTRLPLFRDWIKENTGEQRERSFGASSVKVVGLPDLGCGALGPRS

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_4|2661_bp
atggtgcatccccaggattcagggcgggcagacggcatgtgctatcggcagaagccttcg
ccgaaagtgaatggcttggaggaaggcgtggagttcctgccagtcaacaacgtcaagaag
gtggaaaagcatggcccggggcgctgggtggtgctggcagccgtgctgatcggcctcctc
ttggtcttgctggggatcggcttcctggtgtggcatttgcagtaccgggacgtgcgtgtc
cagaaggtcttcaatggctacatgaggatcacaaatgagaattttgtggatgcctacgag
aactccaactccactgagtttgtaagcctggccagcaaggtgaaggacgcgctgaagctg
ctgtacagcggagtcccattcctgggcccctaccacaaggagtcggctgtgacggccttc
agcgagggcagcgtcatcgcctactactggtctgagttcagcatcccgcagcacctggtg
gaggaggccgagcgcgtcatggccgaggagcgcgtagtcatgctgcccccgcgggcgcgc
tccctgaagtcctttgtggtcacctcagtggtggctttccccacggactccaaaacagta
cagaggacccaggacaacagctgcagctttggcctgcacgcccgcggtgtggagctgatg
cgcttcaccacgcccggcttccctgacagcccctaccccgctcatgcccgctgccagtgg
gccctgcggggggacgccgactcagtgctgagcctcaccttccgcagctttgaccttgcg
tcctgcgacgagcgcggcagcgacctggtgacggtgtacaacaccctgagccccatggag
ccccacgccctggtgcagttgtgtggcacctaccctccctcctacaacctgaccttccac
tcctcccagaacgtcctgctcatcacactgataaccaacactgagcggcggcatcccggc
tttgaggccaccttcttccagctgcctaggatgagcagctgtggaggccgcttacgtaaa
gcccaggggacattcaacagcccctactacccaggccactacccacccaacattgactgc
acatggaacattgaggtgcccaacaaccagcatgtgaaggtgcgcttcaaattcttctac
ctgctggagcccggcgtgcctgcgggcacctgccccaaggactacgtggagatcaacggg
gagaaatactgcggagagaggtcccagttcgtcgtcaccagcaacagcaacaagatcaca
gttcgcttccactcagatcagtcctacaccgacaccggcttcttagctgaatacctctcc
tacgactccagtgacccatgcccggggcagttcacgtgccgcacggggcggtgtatccgg
aaggagctgcgctgtgatggctgggccgactgcaccgaccacagcgatgagctcaactgc
agttgcgacgccggccaccagttcacgtgcaagaacaagttctgcaagcccctcttctgg
gtctgcgacagtgtgaacgactgcggagacaacagcgacgagcaggggtgcagttgtccg
gcccagaccttcaggtgttccaatgggaagtgcctctcgaaaagccagcagtgcaatggg
aaggacgactgtggggacgggtccgacgaggcctcctgccccaaggtgaacgtcgtcact
tgtaccaaacacacctaccgctgcctcaatgggctctgcttgagcaagggcaaccctgag
tgtgacgggaaggaggactgtagcgacggctcagatgagaaggactgcgactgtgggctg
cggtcattcacgagacaggctcgtgttgttgggggcacggatgcggatgagggcgagtgg
ccctggcaggtaagcctgcatgctctgggccagggccacatctgcggtgcttccctcatc
tctcccaactggctggtctctgccgcacactgctacatcgatgacagaggattcaggtac
tcagaccccacgcagtggacggccttcctgggcttgcacgaccagagccagcgcagcgcc
cctggggtgcaggagcgcaggctcaagcgcatcatctcccaccccttcttcaatgacttc
accttcgactatgacatcgcgctgctggagctggagaaaccggcagagtacagctccatg
gtgcggcccatctgcctgccggacgcctcccatgtcttccctgccggcaaggccatctgg
gtcacgggctggggacacacccagtatggaggcactggcgcgctgatcctgcaaaagggt
gagatccgcgtcatcaaccagaccacctgcgagaacctcctgccgcagcagatcacgccg
cgcatgatgtgcgtgggcttcctcagcggcggcgtggactcctgccaggtggcccccggg
gcaggagggcggcagggtgattccgggggacccctgtccagcgtggaggcggatgggcgg
atcttccaggccggtgtggtgagctggggagacggctgcgctcagaggaacaagccaggc
gtgtacacaaggctccctctgtttcgggactggatcaaagagaacactggggagcagcgg
gaacggagcttcggggcctcctcagtgaaggtggtggggctgccggatctgggctgtggg
gcccttgggccacgctcttga

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_5|438_aa
MITKIKISVNILAAHCTQQKRIRKLKDKLEEITSVLHRERHSGGNCSSKEEKGAEDNKCS
VDEGVSEGLPTLQSTSSTNAPPDDDDRLENVQYPYQLYIAPSTSSTERPSPNGPDRPFQC
PTCGVRFTRIQNLKQHMLIHSGIKPFQCDRCGKKFTRAYSLKMHRLKHEGKRCFRCQICS
ATFTSFGEYKHHMRVSRHIIRKPRIYECKTCGAMFTNSGNLIVHLRSLNHEASELANYFQ
SRSCPRCRYKNSQCPKQRRLTHLKKDEKHLGGHGRFQVMWIQSDRNMCSLLQEIQVSENH
WLKIQFSSPVIAVVSGAGCVSWDKVPLRDGTTLVAIHRSPLCPQSQKPSLEPALRMCPPG
PNAREPAGPGYRLEVTGKQQKAAANNRRPRTPRPTRERPRPGTAQHRHRVAVDLNNSFSA
PGVRSARSLETSKLGPGF

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_5|1317_bp
atgataaccaaaattaaaatctccgtgaacattttggcagcacattgcactcaacagaag
agaattaggaaattaaaagataaactagaagaaattacttcagtgctacacagagagaga
cacagtggtggaaactgcagtagcaaagaggagaagggtgcggaagacaacaaatgctca
gtagatgaaggcgtttctgagggcttgcctacacttcaaagcacgtctagcactaatgct
cctccggatgatgatgatcgattggaaaatgttcagtatccctaccaactctacattgct
ccttccaccagcagtacagagcgaccaagtccaaatggtcccgacagaccttttcagtgt
ccaacctgcggggtgcgattcacccgtattcagaacctaaagcagcacatgctcatccac
tcaggaattaaaccatttcagtgtgaccgctgtgggaaaaagttcaccagggcttactcg
ctaaagatgcatcgcctaaagcatgaaggtaaacgctgtttccggtgccagatatgtagt
gccactttcacttccttcggggaatataaacaccacatgagggtttcccggcacattatc
cgcaagcctcggatttacgagtgcaaaacatgtggcgccatgttcaccaactctggaaat
ttaatcgtgcacctgaggagtctgaaccatgaagcatcagagctagcaaactacttccag
agcagaagttgtccacggtgccgttacaagaatagccagtgtccaaaacagaggaggctc
acgcatcttaagaaggatgaaaagcatctgggtggccatggcaggttccaggtcatgtgg
atccagagtgacaggaacatgtgctcactgctccaagaaattcaagtatctgagaatcac
tggctgaagatccaattttcctcacctgtgattgcagttgtctctggggccggttgtgtc
tcatgggataaagttccacttagagatggcaccaccttggtggccatccacaggtcacca
ctctgtcctcaatctcaaaagccatctttagagcctgccctgagaatgtgtcctccaggc
cccaatgccagagagccagcaggtcctggctacagactggaggtgactgggaagcagcag
aaggcagccgccaacaacagacggccccggacaccccgccccacccgtgagcgcccacgc
cctggaacggcgcaacatcgccaccgtgtggccgtcgacctgaacaacagcttctcagcc
cctggggtccgtagcgctaggtctctggagacctccaaactgggcccaggattttaa

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_6|217_aa
MRKNQHKNAENSKNQNASSPNDCNSSPARAQHWTENEFDKSTEVGFRRETESQIMSELLF
TIASKKRKYLGIQLTRDMKELFKENYKPLLNEIKDDTNKWKNIPCSWIGRMNIVKMAILP
KVIYRFNAIPIKLPMTFFTELEKTTLKFTWNQKRACIAKLILSQKNKAGGTTLPDFKLYY
KATVTKTAWYWYQNRYRPMDQNRALRNNTTHLQPSDL

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_6|654_bp
atgaggaaaaaccagcacaaaaatgctgaaaattccaaaaaccagaatgcctcttctcca
aatgactgcaactcctctccagcgagggcacaacactggacagagaatgaatttgacaaa
tcgacagaagtaggcttcagaagagaaacagagagccaaatcatgagtgaactcctgttc
acaattgcttcaaagaaaagaaaatacctaggaatccaacttacaagggacatgaaggaa
ctcttcaaggagaactataaaccactgctcaatgaaataaaagatgatacaaacaaatgg
aagaacattccatgctcatggataggaagaatgaatatcgtgaaaatggccatactgccc
aaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaa
ttggaaaaaactactttaaagttcacatggaaccaaaaaagagcctgcattgccaagtta
atcctaagccaaaagaataaagctggaggcaccacactacctgacttcaagctatactac
aaggctacagtaaccaaaacagcatggtactggtaccaaaacagatatagaccaatggac
cagaacagagccctcagaaataataccacacatctacaaccatctgatctttga

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_7|350_aa
MGVKTFTHSSSSHSQEMLGKLNMLRNDGHFCDITIRVQDKIFRAHKVVLAACSDFFRTKL
VGQAEDENKNVLDLHHVTVTGFIPLLEYAYTATLSINTENIIDVLAAASYMQMFSVASTC
SEFMKSSILWNTPNSQPEKGLDAGQENNSNCNFTSRDGSISPVSSECSVVERTIPVCRES
RRKRKSYIVMSPESPVKCGTQTSSPQVLNSSASYSENRNQPVDSSLAFPWTFPFGIDRRI
QPEKVKQAENTRTLELPGPSETGRRMADYVTCESTKTTLPLGTEEDVRVKVERLSDEEVH
EEVSQPVSASQSSLSDQQTVPGSEQVQEDLLISPQSSSIGIMPSVFFLVL

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_7|1053_bp
atgggtgtgaaaacatttactcatagctcctcttcccacagccaggaaatgcttggaaag
ctaaatatgctgcgaaatgatggacatttttgtgatatcactattcgtgtccaggacaaa
atcttccgggcacataaggtggtactagcagcttgcagtgatttctttcgcaccaaactt
gtaggccaagccgaggatgagaacaagaatgtgttggatctgcatcatgttacagtgact
ggctttatacctcttttagaatatgcttacacagccactctatcaattaacacagaaaat
attattgatgttctagcagcagccagctatatgcaaatgttcagtgttgccagcacctgc
tcagagttcatgaaatcaagcattttatggaatacacccaacagccaacctgaaaagggt
ctagatgctggacaagaaaataattctaactgcaattttacttctcgagatgggagcatt
tctcccgtgtcctcagagtgcagtgtggtagaaagaaccattcctgtctgccgagaatcc
cggagaaagcgcaaaagctacattgttatgtctcctgaaagtcctgtaaagtgtggcaca
caaacaagctcaccccaggtattgaattcttcagcttcctactcagaaaatagaaaccaa
ccagttgactcttccttagcttttccttggacttttccttttggaattgatcgaaggatt
cagcctgagaaagttaagcaagcagaaaatacccggactttagaattacctggcccatct
gagaccggtagaagaatggctgattatgtgacttgtgagagcacaaaaactaccttgcct
ttaggtaccgaagaagatgtccgggtcaaagtagaaagattaagtgatgaggaggtccat
gaggaagtgtcccagcctgtcagtgcatctcagagttcgctgagtgatcagcagacagtt
ccaggaagtgaacaagtccaagaggaccttctgattagtccacagtcttcctctataggt
ataatgccttctgtgtttttcttggtgctataa

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_8|170_aa
MQIFGVLKELKTHHVHTYGLIMGGSNGSAEAQKLANGINITVATPGRLLYHMQNIPGFMY
KNLQCLVIDEADRILDVGFGNTLCTDVVARGLDIPQVNWIVHYDHPDDPKEYIHRVVPAF
VDLNVNGNEGKQKKRGGGGGFDYQKIKKVEKSKIFKHISKKSSDSRQLSH

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_8|513_bp
atgcaaatttttggtgttcttaaggagctaaagactcaccacgtgcatacctatgggttg
ataatgggtggcagtaacggatctgctgaagcacagaaacttgctaatgggatcaacatc
actgtggccacaccaggccgtctgctgtatcatatgcagaatatcccagggtttatgtat
aaaaacctgcagtgtctggttattgatgaagctgatcgtatcttggatgttggatttggg
aacacattgtgtacggatgtggtggcaagaggactggacattcctcaagtcaactggatt
gttcactatgaccatccggatgaccctaaggaatatattcatcgtgtagtgcctgccttt
gttgatctgaatgtcaatggcaatgaaggcaagcagaaaaagcgaggaggtggtggtgga
tttgactaccagaaaatcaagaaagttgagaagtccaaaatctttaaacacattagcaag
aaatcatccgatagcaggcagctctctcattga

>gi568815587r:130134156_130361873|GENSCAN_predicted_peptide_9|83_aa
MAPNDVHVLNPEICDCVTLHGQRDLADVIKVDQELQNPPGGTGYFGEARTLPPGRPGTWR
EKEEEQQASGPGAAARREGYVQY

>gi568815587r:130134156_130361873|GENSCAN_predicted_CDS_9|252_bp
atggcccccaatgatgtccatgtcctaaaccctgagatctgtgactgtgttaccttacat
ggccagagggacttggcagatgtgattaaggttgaccaagaacttcaaaaccccccgggc
ggtaccggttacttcggggaggcccgcacacttcctccgggaaggccaggaacctggcgg
gagaaggaggaggagcagcaggcctccgggcccggcgccgccgcccgcagagagggctat
gtgcagtactga