GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:59:45 Sequence gi568815593r:180639126_180840058 : 200933 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 3221 3144 78 2 0 86 94 22 0.009 2.35 1.03 Intr - 6078 5969 110 1 2 113 31 25 0.006 -0.70 1.02 Intr - 6678 6475 204 1 0 60 89 99 0.231 6.37 1.01 Init - 10420 10363 58 1 1 76 101 172 0.597 16.77 1.00 Prom - 13643 13604 40 -4.86 2.12 PlyA - 13672 13667 6 1.05 2.11 Term - 21511 21401 111 1 0 73 41 92 0.749 1.56 2.10 Intr - 25821 25768 54 0 0 96 73 22 0.248 0.68 2.09 Intr - 33349 33168 182 2 2 37 61 99 0.045 1.69 2.08 Intr - 34802 34652 151 2 1 73 7 150 0.093 5.14 2.07 Intr - 42037 41982 56 2 2 113 70 2 0.143 -0.40 2.06 Intr - 45813 45691 123 1 0 91 70 102 0.392 9.26 2.05 Intr - 46883 46823 61 2 1 -1 96 61 0.078 -3.69 2.04 Intr - 47863 47729 135 1 0 52 28 102 0.078 1.36 2.03 Intr - 50329 50196 134 2 2 91 69 57 0.215 4.46 2.02 Intr - 54178 53943 236 0 2 34 82 190 0.146 10.33 2.01 Init - 54635 54184 452 2 2 44 28 335 0.406 18.49 2.00 Prom - 61521 61482 40 -5.76 3.00 Prom + 64300 64339 40 -2.46 3.01 Init + 71463 71599 137 2 2 96 -14 100 0.036 0.21 3.02 Term + 82320 82578 259 0 1 44 48 207 0.707 7.32 3.03 PlyA + 83305 83310 6 1.05 4.06 PlyA - 84387 84382 6 1.05 4.05 Term - 86827 86771 57 1 0 86 48 80 0.204 1.49 4.04 Intr - 100947 100256 692 1 2 54 34 510 0.163 33.44 4.03 Intr - 102765 102711 55 0 1 114 53 5 0.061 -1.85 4.02 Intr - 109666 109621 46 0 1 37 106 34 0.238 -1.49 4.01 Init - 119339 118387 953 2 2 60 86 294 0.855 19.39 4.00 Prom - 119642 119603 40 -10.55 5.02 PlyA - 119743 119738 6 1.05 5.01 Sngl - 120928 120245 684 1 0 51 42 234 0.832 11.29 5.00 Prom - 121252 121213 40 -3.86 6.00 Prom + 126480 126519 40 -0.86 6.01 Init + 134973 135022 50 2 2 38 93 49 0.726 -0.75 6.02 Intr + 136626 136726 101 0 2 93 61 36 0.556 1.15 6.03 Intr + 140893 141023 131 2 2 64 83 117 0.932 9.21 6.04 Term + 149032 149166 135 0 0 78 49 67 0.089 -0.18 6.05 PlyA + 149998 150003 6 1.05 7.02 PlyA - 151442 151437 6 1.05 7.01 Sngl - 153846 152509 1338 0 0 82 42 2202 0.997 209.13 7.00 Prom - 156177 156138 40 -6.16 8.06 PlyA - 156541 156536 6 1.05 8.05 Term - 156750 156593 158 1 2 90 38 38 0.158 -2.90 8.04 Intr - 163677 163555 123 1 0 49 105 72 0.459 5.56 8.03 Intr - 164406 164298 109 0 1 107 56 -8 0.409 -2.24 8.02 Intr - 165294 165143 152 1 2 110 41 76 0.722 4.98 8.01 Init - 171139 171067 73 1 1 73 44 107 0.551 4.03 8.00 Prom - 174603 174564 40 0.74 9.03 PlyA - 175891 175886 6 1.05 9.02 Term - 179502 179342 161 1 2 102 49 84 0.243 4.00 9.01 Init - 185275 185269 7 1 1 87 94 11 0.112 2.01 9.00 Prom - 185524 185485 40 -4.76 10.02 PlyA - 186037 186032 6 1.05 10.01 Sngl - 192289 192101 189 1 0 88 54 154 0.097 5.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_1|150_aa MQRGAALCLRLWLCLGLLDDTGFLWNLGFPVTLRRQVWEQEDAGIHSRRRSRAPALAGLD GVTAGTPPALSEARTTLAHGQAWVDPGASSRNLSPFCPKGTAVNEPSGPPPTHPQDKHQL LGLVRGPHALTTTQGHSDTAAVANQAFRRQ >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_1|450_bp atgcagcggggcgccgcgctgtgcctgcgactgtggctctgcctgggactcctggacgac actggcttcctctggaatctgggttttccggtcacactgaggagacaggtctgggagcag gaagatgcgggcattcattcccggaggaggagcagggcgccagccctggccgggctggac ggggtaacagcaggcacacccccggccctgtctgaggcccgcaccaccctagcccacggc caggcctgggtagatccaggtgcatcttccagaaatttgtctccattctgccccaaaggg actgctgtaaatgagccttcagggccccccccaactcacccacaagataagcaccagctt cttgggctcgtgcgaggcccccatgctctgacgacaacacagggtcacagtgatacagca gctgtagcaaaccaagcattcaggaggcag >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_2|564_aa MKINDSSGEDFILVGFSEYPQAEFILSLFVSGFYTMTFTGNTAIILVSLLDYRLRTPMYF FLRKLSFLDMCFTTCIVLQMLVNIWGESKKVSYVGCMVQYSVALALGSTECVLLAIMAVD RYVAVRWPLHYVTIMHQQICHFLAALSWFSGQLSLSLFTNHHFASVWPPPCGPFLCEVLL IVKLSCVDTGPTELKMLIARVIILALPVCTILTSYACIARAVLRLQSAEARCQQLSLFSY ISQITPWISRNLRTDLERICALILICPLKSSDVEHPRLPEVLQQLPRLTVTLCFLGFFYR QNWRFPSLDSWDLIQAAHSKQNSIPASVNTSSFITAVPEGPLLMAWGTLYLCLINLEGPA HQGGFQVIRRPSSSERCQLSAAGPTTSPCCAARTRAEEPPHRAHAGSAAALSDKGARPAP GLRAPEALKERLLERVLRRSPGGCHRSGGGSQRQAQDHPVCSRVQRFGVPPKGSQSQAQG PPSISKIYTWSPLKDSQSLALNPQNVSQSQQATVHVPGAAYRSMTGAREIVEETLPNTFY EASMTYTKAKDTSRKLQTNIPYKH >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_2|1695_bp atgaaaataaatgacagctcaggggaagacttcatcttagttggcttctcagaatatccc caggctgagttcatcctttctctgtttgtctccgggttctacaccatgacattcacaggg aacacagccatcatcttggtctctctgctggactaccggctccgcaccccaatgtacttc ttcctccgaaagctctcatttctggacatgtgtttcaccacctgcattgtccttcagatg ctggtgaacatctggggagagagtaagaaggtcagctatgtaggctgcatggttcagtat tctgtagccttggctcttggctccacagagtgtgtgcttcttgctatcatggctgtggac cgttatgttgccgtccgctggccccttcactatgttacaatcatgcaccaacagatctgc cactttctcgcagccttgtcctggttttctggccaactctctctttcactcttcactaac caccattttgcctctgtgtggccaccgccgtgtggaccatttctttgtgaggtcctgctc attgtcaagctgtcctgcgtggacaccggcccaactgaattgaagatgttaattgctcgt gtgatcatccttgcccttccagtgtgcaccatcctcacctcctatgcctgcattgccagg gctgtgctgaggctgcagtctgctgaagcaagatgtcaacagctgtctttattttcttac atttcccagataactccttggatcagcaggaatctacgcacagatttagagcgcatatgc gccctgatcctaatttgccccctgaagagcagcgacgtagagcacccaaggctgccagag gtccttcagcagctcccacggctgacagtgactctatgtttcctaggcttcttctaccgc cagaattggcggtttccctcgctggattcttgggacctgatccaggcagcacactcgaaa caaaattccatccccgcatcagtcaacacttcatccttcatcacagctgtacctgaaggg cccctgctcatggcctggggcacactttacctgtgtctcattaacctcgagggccctgcc catcagggtggcttccaggtcattcggcggccttcatcgagcgagcgctgccagctctca gctgctggtcccaccaccagcccctgctgtgcggccaggacaagggcagaagagccgcca catcgcgcacacgcgggctctgcggctgccctttctgataaaggagctcgtccggcgccc ggactgcgcgcaccggaggccctgaaggagcggctcctggagcgcgtcctgcgccggagt cctggaggatgccacagatccggagggggtagccagagacaagcccaggatcacccagtc tgcagcagggtgcaaaggtttggggtcccccccaaaggcagccagagccaagcccagggt cccccgagcatcagcaagatctacacctggagtcccctcaaagacagccagagtctagct ctgaatccccagaatgtcagccaaagccaacaggcaactgtgcacgttcctggagcagct tacagatcgatgacaggagccagggaaatagtagaagaaacacttcctaacacattctac gaggccagcatgacctataccaaagccaaagacacttcaagaaaactgcaaaccaacatc ccttacaaacattga >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_3|131_aa MPYNQTSERQRENLESSNLEVTPYEQESSIRFTVKFSSENVEAREQNKSHEFMATSFAPM FINSRIEINGVGKLAIHMQKKEIGLISLLMQESTKMDEKLNKSPDTLKLLEENIREKLHD TGLDHDFLDMT >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_3|396_bp atgccttataatcagacttcggaaagacaaagagagaaccttgaaagcagcaacctagaa gtgactccttatgaacaagaatcttcaataagatttacagtcaaattctcatcagaaaat gtggaggccagagagcagaacaagtcccacgagttcatggccacatcattcgcaccaatg ttcatcaacagcagaatagaaataaatggtgttgggaaactggctattcatatgcagaag aaggaaattggactcatctcactccttatgcaagaatcaactaaaatggatgaaaaactt aacaaaagtcccgacactctaaaactactagaagaaaacattagagaaaagctccatgac actggtctggaccatgatttcttggatatgacctga >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_4|600_aa MSEFPFTIASKIIKYLGIQLTRDLKDLFKENYKPLLNEIKEDTNKWKKIPCSWVGRINIM KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSILSQKNKAGGIMLP DFKLYYKITVTKTAWYWYQNRDIDKRNRTEPSEIMPHIYNYLIFDKPDENKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRLNRQPTEWEKSFAIYSSDKRLISRI YNELQEVYKKKTNNLIKREGAYVTIASPGQEPWVSKLRLTETKFPRIDVTADRQKAMGSF NTSFEDGFILVGFSDWPQLEPILFVFIFIFYSLTLFGNTIIIALSWLDLRLHTPMYFFLS HLSLLDLCFTTSTVPQLLINLCGVDRTITRGGCVAQLFIYLALGSTECVLLVVMAFDRYA AVCRPLHYMAIMHPHLCQTLAIASWGAGFVNSLIQTGLAMAMPLCGHRLNHFFCEMPVFL KLACADTEGTEAKMFVARVIVVAVPAALILGSYVHIAHAVLRLSKIKFATSRYLWSPPLK >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_4|1803_bp atgagtgaattcccattcacaattgcttcaaagataataaaatacctaggaatccaactt acaagggacctgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaaaattccatgctcatgggtaggaagaatcaatatcatg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacct gacttcaaactatactacaagattacagtaaccaaaacagcatggtactggtaccaaaac agagatatagacaaacggaacagaacagagccctcagaaataatgccacatatctacaac tatctgatctttgacaaacctgatgaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactatcatcagactgaacaggcaacct acagaatgggagaaaagttttgcaatctactcatctgacaaaaggctaatatccagaatc tataatgaactccaagaagtttacaagaaaaaaacaaacaaccttatcaaaagggaggga gcctatgtaaccatcgcttctccaggacaggagccctgggtaagtaaactgagactcaca gaaacaaaatttcccagaatagatgtcacagctgacaggcaaaaggccatgggaagtttc aacaccagttttgaagatggcttcattttggtgggattctcagattggccgcaactggag cccatcctgtttgtctttatttttattttctactccctaactctctttggcaacaccatc atcatcgctctctcctggctagaccttcggctgcacacacctatgtacttctttctctct catctgtccctcctggacctctgcttcaccaccagcaccgtgccccagctcctgatcaac ctttgcggggtggaccgcaccatcacccgtggagggtgtgtggctcagctcttcatctac ctagccctgggctccacagagtgtgtgctcctggtggtgatggcctttgaccgctatgct gctgtctgtcgtccactccactacatggccatcatgcacccccatctctgccagaccctg gctatcgcctcctggggtgcgggtttcgtgaactctctgatccagacaggtctcgcaatg gccatgcctctctgtggccatcgactgaatcacttcttctgtgagatgcctgtatttctg aagttggcttgtgcggacacagaaggaacagaggccaagatgtttgtggcccgagtcata gtcgtggctgttcctgcagcacttattctaggctcctatgtgcacattgctcatgcagtg ctgaggctttccaaaatcaaatttgccaccagccgatacctgtggtcacctccactgaaa taa >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_5|227_aa MITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWVHKEMKAEINMFFEANENKD TTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEIT KIRAELKEIETQKTLQKIHESRSWFFEKINKTDRPLARLIKKKKKKNQRDAIKNDKGDIT TNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEKLNL >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_5|684_bp atgataacaaactgtctctcagaccatagtgcaatcaaactagaactcaggattaagaaa ctcactcaaaaccgatcaactacatggaaactgaacaacttgctcctgaatgactactgg gtacataaagaaatgaaggcagaaataaatatgttctttgaagccaacgagaacaaagac acaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagca ctaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaatta aaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataact aagatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatccatgaa tccaggagctggttttttgaaaagatcaacaaaactgatagaccactagcaagactaata aagaagaaaaaaaagaagaatcaaagagatgcaataaaaaatgataaaggggatatcacc accaatcccacagaaatacaaactaccatcagagaatactataaacacctctatgcaaat aaactagaaaatctagaagaaatggataaattcctggacacatacaccctcccaagacta aaccaggagaagttgaatctctga >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_6|138_aa MVLQAEQAGHQHLLTFRSIQSNHPKAICFLTKVGKRGVGEAQHQWHSPYAARIDPITNVN GMVTSWLSERVKGEEHLKLFHILQHEVLEKELSVALPGPGAAATHTRMPPEQALMGEYQE TPEGAPCFRSLRTQSFVL >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_6|417_bp atggttctgcaggctgaacaggcagggcaccagcatctgctcaccttccgcagcatccaa tccaaccatccaaaggccatctgcttcttgaccaaggtggggaagagaggagttggtgag gcccagcatcaatggcattctccatatgcagcacgaattgatcctatcactaatgtcaat gggatggtcacgtcatggttatctgaacgtgtcaaaggagaagagcatctgaagctgttt cacatcctacagcatgaggttcttgagaaggagctatctgtggcgcttccgggcccgggg gcagccgccacgcacacacgcatgccacctgagcaagccttgatgggtgaataccaggag acccctgaaggtgctccttgcttcaggagcctgagaacccagtctttcgtcttgtaa >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_7|445_aa MLKKQSAGLVLWGAILFVAWNALLLLFFWTRPAPGRPPSVSALDGDPASLTREVIRLAQD AEVELERQRGLLQQIGDALSSQRGRVPTAAPPAQPRVPVTPAPAVIPILVIACDRSTVRR CLDKLLHYRPSAELFPIIVSQDCGHEETAQAIASYGSAVTHIRQPDLSSIAVPPDHRKFQ GYYKIARHYRWALGQVFRQFRFPAAVVVEDDLEVAPDFFEYFRATYPLLKADPSLWCVSA WNDNGKEQMVDASRPELLYRTDFFPGLGWLLLAELWAELEPKWPKAFWDDWMRRPEQRQG RACIRPEISRTMTFGRKGVSHGQFFDQHLKFIKLNQQFVHFTQLDLSYLQREAYDRDFLA RVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGVMDDLKSGVPRAGYRGIV TFQFRGRRVHLAPPLTWEGYDPSWN >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_7|1338_bp atgctgaagaagcagtctgcagggcttgtgctgtggggcgctatcctctttgtggcctgg aatgccctgctgctcctcttcttctggacgcgcccagcacctggcaggccaccctcagtc agcgctctcgatggcgaccccgccagcctcacccgggaagtgattcgcctggcccaagac gccgaggtggagctggagcggcagcgtgggctgctgcagcagatcggggatgccctgtcg agccagcgggggagggtgcccaccgcggcccctcccgcccagccgcgtgtgcctgtgacc cccgcgccggcggtgattcccatcctggtcatcgcctgtgaccgcagcactgttcggcgc tgcctggacaagctgctgcattatcggccctcggctgagctcttccccatcatcgttagc caggactgcgggcacgaggagacggcccaggccatcgcctcctacggcagcgcggtcacg cacatccggcagcccgacctgagcagcattgcggtgccgccggaccaccgcaagttccag ggctactacaagatcgcgcgccactaccgctgggcgctgggccaggtcttccggcagttt cgcttccccgcggccgtggtggtggaggatgacctggaggtggccccggacttcttcgag tactttcgggccacctatccgctgctgaaggccgacccctccctgtggtgcgtctcggcc tggaatgacaacggcaaggagcagatggtggacgccagcaggcctgagctgctctaccgc accgactttttccctggcctgggctggctgctgttggccgagctctgggctgagctggag cccaagtggccaaaggccttctgggacgactggatgcggcggccggagcagcggcagggg cgggcctgcatacgccctgagatctcaagaacgatgacctttggccgcaagggtgtgagc cacgggcagttctttgaccagcacctcaagtttatcaagctgaaccagcagtttgtgcac ttcacccagctggacctgtcttacctgcagcgggaggcctatgaccgagatttcctcgcc cgcgtctacggtgctccccagctgcaggtggagaaagtgaggaccaatgaccggaaggag ctgggggaggtgcgggtgcagtatacgggcagggacagcttcaaggctttcgccaaggct ctgggtgtcatggatgaccttaagtcgggggttccgagagctggctaccggggtattgtc accttccagttccggggccgccgtgtccacctggcgcccccactgacgtgggagggctat gatcctagctggaattag >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_8|204_aa MGGMWNAALPGPTSGGRAEVAAVDGLGSGHLSGGFTAAPERRQSPLKALALEDVCGGVGG RTGVPGLKPGYAPHQVRTFLPFKTPAGRRPVLGFSMVLPPKLVTESFPPRAERAGESKGE AAAAALGTRLGTGLAPCAPEQAKFGARTSGGPEPGREQVCDTRRQVTDRGIQGKQIISIC LADVPAGITLMDEALQSYGQYLHI >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_8|615_bp atgggcgggatgtggaacgctgcgttgccggggcccacttccggaggacgggcggaagtg gctgcggtggatggtctgggctctggccacttaagcggaggcttcacggcagcgcctgag aggagacagagccctctcaaagcactggcgctggaggatgtgtgcgggggtgttgggggg cgcacaggagtgcccggtctaaaacctggctatgctccccaccaggtccgcaccttcctc ccctttaaaacaccagccgggcgcagacccgttctaggcttttccatggtgcttccgcca aagcttgtgaccgagtccttcccgcctagggctgaaagggctggcgagtcgaaaggcgag gcggccgcggcagcgcttgggacgcgcctgggcaccgggctcgctccctgcgccccggag caggccaagttcggggccaggacgtcgggaggacctgagcctggcagagagcaggtgtgt gatacacgtcgtcaggtgactgaccgaggaattcagggcaaacagatcatctcaatttgc cttgctgatgtgccagctgggattacgctgatggatgaagctcttcagagctatgggcaa tatctacatatttga >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_9|55_aa MIPAAHPPKSSQDTAVGLRNRKRRKARDSCATVNSNVGQRKLILTAYCTFEERGI >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_9|168_bp atgatccctgctgcacatccacccaagtcctctcaagacactgctgtgggactgagaaat aggaaaaggaggaaagccagggacagctgtgccacagtgaattccaacgtagggcaacga aagctcatcctgacagcgtactgcactttcgaggagcgtggcatttga >gi568815593r:180639126_180840058|GENSCAN_predicted_peptide_10|62_aa MAETRKLRDGATAPLLPACVRKVSLRGAPAPASGGEGGTCRGRSSEALGPLCSPAIARAA LT >gi568815593r:180639126_180840058|GENSCAN_predicted_CDS_10|189_bp atggccgaaaccaggaagctgcgcgatggcgccacggcccctcttctcccggcctgtgtc cggaaggtttccctccgaggcgccccggctcccgcaagcggaggagagggcgggacgtgc cggggccggagctcagaggccctggggccgctctgctctcccgccatcgcaagggcggcg ctgacctga