GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:31:07 Sequence gi568815578r:45153002_45354543 : 201542 bp : 43.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2219 2354 136 1 1 59 57 156 0.939 9.90 1.02 Term + 10128 10225 98 1 2 81 53 79 0.502 1.83 1.03 PlyA + 11612 11617 6 1.05 2.00 Prom + 15017 15056 40 -2.46 2.01 Init + 21922 22000 79 0 1 98 105 148 0.999 18.80 2.02 Term + 22860 23134 275 1 2 87 51 166 0.651 8.43 2.03 PlyA + 23521 23526 6 1.05 3.00 Prom + 27457 27496 40 -5.66 3.01 Init + 53981 54128 148 1 1 67 113 118 0.519 12.45 3.02 Term + 54373 55685 1313 2 2 64 40 251 0.694 9.65 3.03 PlyA + 56743 56748 6 1.05 4.00 Prom + 62826 62865 40 -6.06 4.01 Init + 68389 68464 76 0 1 84 113 86 0.835 10.50 4.02 Intr + 69362 70282 921 0 0 12 81 330 0.110 15.92 4.03 Term + 93703 93845 143 2 2 74 38 135 0.075 5.19 4.04 PlyA + 94011 94016 6 1.05 5.05 PlyA - 94198 94193 6 1.05 5.04 Term - 98594 98575 20 0 2 78 55 16 0.353 -4.22 5.03 Intr - 100150 100001 150 2 0 61 121 37 0.930 4.43 5.02 Intr - 100732 100574 159 2 0 88 82 76 0.953 6.96 5.01 Init - 101542 101458 85 1 1 88 84 165 0.969 15.08 5.00 Prom - 101596 101557 40 -4.26 6.00 Prom + 102516 102555 40 -3.36 6.01 Init + 124770 124772 3 2 0 113 81 0 0.241 1.80 6.02 Term + 134327 134563 237 1 0 148 47 74 0.441 5.47 6.03 PlyA + 139355 139360 6 1.05 7.08 PlyA - 140469 140464 6 1.05 7.07 Term - 141014 140866 149 0 2 102 49 130 0.584 8.56 7.06 Intr - 145069 144917 153 2 0 60 99 306 0.999 28.94 7.05 Intr - 145582 145169 414 2 0 61 86 705 0.996 61.68 7.04 Intr - 148008 147886 123 1 0 131 84 162 0.998 20.66 7.03 Intr - 148442 148320 123 0 0 82 72 23 0.547 0.66 7.02 Intr - 151796 151227 570 0 0 92 89 968 0.995 90.03 7.01 Init - 152581 152509 73 1 1 78 66 62 0.897 2.26 7.00 Prom - 152825 152786 40 -11.14 8.00 Prom + 153156 153195 40 -12.21 8.01 Init + 153922 153943 22 0 1 119 99 19 0.988 4.53 8.02 Intr + 155142 155250 109 1 1 111 97 42 0.989 6.74 8.03 Intr + 156566 156691 126 2 0 92 78 116 0.994 10.79 8.04 Intr + 158588 158658 71 2 2 105 100 15 0.918 3.23 8.05 Intr + 158838 158953 116 1 2 41 76 145 0.965 8.77 8.06 Intr + 159220 159394 175 0 1 19 52 321 0.976 21.11 8.07 Intr + 160467 160604 138 1 0 108 30 180 0.999 14.64 8.08 Intr + 161034 161143 110 1 2 113 35 168 0.998 14.00 8.09 Intr + 161412 161564 153 2 0 117 110 96 0.996 14.97 8.10 Intr + 163186 163341 156 0 0 89 57 163 0.732 13.51 8.11 Term + 163476 163958 483 2 0 102 48 743 0.984 66.45 8.12 PlyA + 166654 166659 6 1.05 9.06 PlyA - 166767 166762 6 1.05 9.05 Term - 174414 174263 152 1 2 133 48 346 0.997 33.17 9.04 Intr - 177563 177365 199 2 1 96 111 150 0.994 17.02 9.03 Intr - 180068 180022 47 0 2 83 111 60 0.996 5.93 9.02 Intr - 182919 182772 148 0 1 101 75 217 0.828 21.51 9.01 Init - 195383 195324 60 2 0 96 72 158 0.847 14.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_1|77_aa MWESLELPRDLLNGFDQNADSDMDNEVQAEVVSDGDEELVGNWKKAASTASVLRMRTTIG TKKLLAFILSVRCLTKS >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_1|234_bp atgtgggaaagtttggaacttcctagagacttgttgaatggctttgaccaaaatgctgac agtgatatggacaatgaagtccaggctgaggtggtctcagatggagatgaagaacttgtt ggaaactggaagaaagctgcctctactgccagtgtcctaaggatgaggaccaccattggg accaagaagctgctggctttcatattgtcagtaaggtgcttgacaaagagctga >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_2|117_aa MRASSFLIVVVFLIAGTLVLEAAVTGVPVKGQDTVKGRVPFNGQDPVKGQVSVKGQDKVK AQEPVKGPVSTKPGSCPIILIRCAMLNPPNRCLKDTDCPGIKKCCEGSCGMACFVPQ >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_2|354_bp atgagggccagcagcttcttgatcgtggtggtgttcctcatcgctgggacgctggttcta gaggcagctgtcacgggagttcctgttaaaggtcaagacactgtcaaaggccgtgttcca ttcaatggacaagatcccgttaaaggacaagtttcagttaaaggtcaagataaagtcaaa gcgcaagagccagtcaaaggtccagtctccactaagcctggctcctgccccattatcttg atccggtgcgccatgttgaatccccctaaccgctgcttgaaagatactgactgcccagga atcaagaagtgctgtgaaggctcttgcgggatggcctgtttcgttccccagtga >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_3|486_aa MAHSLKEDINDKVGSALRQGFPSKMKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEF SQFPHGQKGQHYSGQKGKQQTESKGSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKS QRHLGGSQQLLHNKQEGRDHDKSKGHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGIS SQYSNTEERLWVHGLSKEQTSVSGAQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQN KGHYQNVVEVREEHSSKVQTSLCPAHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQD QQHGRKANKISYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQE QEHSQKANKISYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQE PWHGENAKGESGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNND RNPLFT >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_3|1461_bp atggcacactcactcaaggaagatataaatgacaaggtcggctcagctctcagacaaggt tttccaagcaagatgaagcccaacatcatctttgtactttccctgctcctcatcttggag aagcaagcagctgtgatgggacaaaaaggtggatcaaaaggccgattaccaagtgaattt tcccaatttccacacggacaaaagggccagcactattctggacaaaaaggcaagcaacaa actgaatccaaaggcagtttttctattcaatacacatatcatgtagatgccaatgatcat gaccagtcccgaaaaagtcagcaatatgatttgaatgccctacataagacgacaaaatca caacgacatctaggtggaagtcaacaactgctccataataaacaagaaggcagagaccat gataaatcaaaaggtcattttcacagggtagttatacaccataaaggaggcaaagctcat cgtgggacacaaaatccttctcaagatcaggggaatagcccatctggaaagggaatatcc agtcaatattcaaacacagaagaaaggctgtgggttcatggactaagtaaagaacaaact tccgtctctggtgcacaaaaaggtagaaaacaaggcggatcccaaagcagttatgttctc caaactgaagagctagtagctaacaaacaacaacgtgagactaaaaattctcatcaaaat aaagggcattaccaaaatgtggttgaagtgagagaggaacattcaagtaaagtacaaacc tcactctgtcctgcgcaccaagacaaactccaacatggatccaaagacattttttctacc caagatgagctcctagtatataacaagaatcaacaccagacaaaaaatctcaatcaagat caacagcatggccgaaaggcaaataaaatatcataccaatcttcaagtacagaagaaaga cgactccactatggagaaaatggtgtgcagaaagatgtatcccaaagcagtatttatagc caaactgaagagaaagcacagggcaagtctcaaaaacagataacaattcccagtcaagag caagagcatagccaaaaggcaaataaaatatcataccaatcttcaagtacggaagaaaga cgactccactatggagaaaatggtgtgcagaaagatgtatcccaacgcagtatttatagc caaactgaaaagctagtagcaggcaagtctcaaatccaggcaccaaatcctaagcaagag ccatggcatggtgaaaatgcaaaaggagagtctggccaatctacaaatagagaacaagac ctactcagtcatgaacaaaaaggcagacaccaacatggatctcatgggggattggatatt gtaattatagagcaggaagatgacagtgatcgtcatttggcacaacatcttaacaacgac cgaaacccattatttacataa >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_4|379_aa MKSIILFVLSLLLILEKQAAVMGQKDRLQHGPKDIFTTQDELLVYNKNQHQTKNLSQDQE HGRKAHKISYPSSRTEERQLHHGEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQE HGHKENKISYQSSSTEERHLNCGEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQE YGHKENKISYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQE HGHKENKMSYQSSSTEERRLNYGGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQW SGQNAKGKSGQSADSKQDLLSHEQKGRYKQESSKTQADSDEADVRNTAEKVLIDDMVKKC DGLIEGLKQPVFIPKQEIM >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_4|1140_bp atgaagtccatcatcctctttgtcctttccctgctccttatcttggagaagcaagcagct gtgatgggacaaaaagacagactccaacatggacccaaagacatttttactacccaagat gagctcctagtatataacaagaatcaacaccagacaaaaaatctcagtcaagatcaagag catggccggaaggcacataaaatatcatacccgtcttcacgtacagaagaaagacaactt caccatggagaaaagagtgtacagaaagatgtatccaaaggcagcatttctatccaaact gaagagaaaatacatggcaagtctcaaaaccaggtaacaattcatagtcaagatcaagag catggccataaggaaaataaaatatcataccaatcttcaagtacagaagaaagacatctc aactgtggagaaaagggcatccagaaaggtgtatccaaaggcagtatttcgatccaaact gaagagcaaatacatggcaagtctcaaaaccaggtaagaattcctagtcaagctcaagag tatggccataaggaaaataaaatatcataccaatcttcgagtacagaagaaagacgtctc aacagtggagaaaaggatgtacagaaaggtgtatccaaaggcagtatttctatccaaact gaagagaaaatacatggcaagtctcaaaaccaggtaacaattcctagtcaagatcaagag catggccataaggaaaataaaatgtcataccaatcttcaagtacagaagaaagacgactc aactatggaggaaagagcacgcagaaagatgtatcccaaagcagtatttctttccaaatt gaaaagctagtagaaggcaagtctcaaatccagacaccaaatcctaatcaagatcaatgg tctggccaaaatgcaaaaggaaagtctggtcaatctgcagatagcaaacaagacctactc agtcatgaacaaaaaggcagatacaaacaggaatccagcaaaacgcaggctgacagtgat gaagctgatgttcgtaacactgcagaaaaagtgctgatagatgacatggtgaaaaagtgt gatgggcttattgaaggactgaagcagcctgtattcataccaaaacaagaaatcatgtaa >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_5|137_aa MKSSGLFPFLVLLALGTLAPWAVEGSGKSFKAGVCPPKKSAQCLRYKKPECQSDWQCPGK KRCCPDTCGIKCLDPVDTPNPTRRKPGKCPVTYGQCLMLNPPNFCEMDGQCKRDLKCCMG MCGKSCVSPVKARVIPD >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_5|414_bp atgaagtccagcggcctcttccccttcctggtgctgcttgccctgggaactctggcacct tgggctgtggaaggctctggaaagtccttcaaagctggagtctgtcctcctaagaaatct gcccagtgccttagatacaagaaacctgagtgccagagtgactggcagtgtccagggaag aagagatgttgtcctgacacttgtggcatcaaatgcctggatcctgttgacaccccaaac ccaacaaggaggaagcctgggaagtgcccagtgacttatggccaatgtttgatgcttaac ccccccaatttctgtgagatggatggccagtgcaagcgtgacttgaagtgttgcatgggc atgtgtgggaaatcctgcgtttcccctgtgaaagcaagggttattccagactga >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_6|79_aa MVSSIPVMSDAVATNHMGLFKFKLSKTELKIQFLGHISHIDCYVGQQGYRTFPPSRKSCG QRSSLPLLNFIIFPAMITS >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_6|240_bp atggtgtcctctatacctgtgatgagtgacgcagtggccacaaatcacatggggctattt aaatttaaattatctaaaacagaacttaaaattcagtttcttggtcacattagtcacatt gactgctatgttggacagcaaggatatagaacctttccaccatcacggaaatcctgtgga cagcgcagctctctacccttactcaatttcatcatcttcccagcaatgatcactagttga >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_7|534_aa MRGLLCWPVLLLLLQPWETQLQLTGPRCHTGPLDLVFVIDSSRSVRPFEFETMRQFLMGL LRGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYA MNVAFSVAEGARPPEERVPRVAVIVTDGRPQDRVAEVAAQARARGIEIYAVGVQRADVGS LRAMASPPLDEHVFLVESFDLIQEFGLQFQSRLCAIDLCAEGTHGCEHHCVNSPGSYFCH CQVGFVLQQDQRSCRVRDLCNGVDHGCEFQCVSEGLSYRCLCPEGRQLQADGKSCNRCRE GHVDLVLLVDGSKSVRPQNFELVKRFVNQIVDFLDVSPEGTRVGLVQFSSRVRTEFPLGR YGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEAQGARPRALNVPRVGLVFTDGRS QDDISVWAARAKEEGIVMYAVGVGKAVEAELREIASEPAELHVSYAPDFGTMTHLLENLR GSICPEEGISAGTELRSPCECESLVEFQGRTLGALESLTLNHILWGWWRGLEGK >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_7|1605_bp atgagaggccttctttgctggcccgtgttgctgctccttcttcagccctgggaaacccag ctccagttgacaggtcccaggtgtcacactgggcccctggatctggtgttcgtgattgac agctcccgcagcgtgcgccctttcgagttcgagaccatgcggcagttcctcatgggcctc ctccgaggcctgaacgtgggtcccaacgccacgcgcgttggcgtgatccagtattcgagt caagtgcagagcgtcttccctctccgcgcgttctctcgccgcgaggacatggagcgcgcc atccgcgacctggtgcctctggcgcaaggcaccatgacgggactggcaatccagtacgcc atgaacgtggccttcagtgtggccgagggcgcgcgaccgccagaggagcgcgtgccgcgt gtcgctgtcatcgtgacagacgggcggccccaggaccgcgtggccgaggtggcggcacag gcgcgcgcccgcggcattgaaatttacgcggtgggggtgcagcgcgcggacgtgggctcc ctgcgcgccatggcatcgcccccgctagacgagcacgtcttcctcgtagagtccttcgac ctcatccaggagttcggcctgcagttccagagccggctgtgtgccattgatctgtgtgct gaagggacccatggatgtgagcaccactgcgtcaattccccaggctcctatttctgtcac tgccaagttggctttgtactccagcaggaccagaggagctgcagggtccgggacctttgc aatggcgtggaccatggctgtgagttccagtgtgtgagcgagggcctctcctaccgctgc ctgtgccccgaggggcggcaacttcaggcagatggcaagagctgcaaccggtgccgggaa ggccacgtggaccttgttctgctggttgatggctccaagagcgtgcgtccacaaaacttc gagctagtgaagcgcttcgtgaaccagattgtggacttcctagatgtgtcccccgagggc acgcgggtggggctggtgcagttctcgagccgcgtgcgcaccgagttccctctgggtcgc tacggcaccgcagccgaggtgaagcaggcggtcctggccgtggagtacatggaacgcggc accatgacagggctggcgttgcggcacatggtggagcacagcttctccgaggcgcagggt gcacggccccgtgcccttaacgtgcctcgtgttggcctggtcttcacggatggccgctcc caggatgacatctcggtgtgggcagcgcgcgccaaggaggaaggcatcgtcatgtacgcc gtgggcgtgggcaaggcggtggaggcggagctgcgcgagatcgcctcggagccagcggaa ctgcacgtgtcctatgccccggacttcggcaccatgacgcacctgctggagaacctcaga ggcagcatctgtccagaggagggcatcagcgcagggacagagcttcggagcccatgcgaa tgcgaaagcctcgtggagttccagggccgcacgctgggggcgctcgagagcctgacgctg aaccatatcctttggggctggtggcgcgggctggagggaaagtga >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_8|552_aa MDPAGAADPSVPPNPLTHLSLQDRSEMQLQSEADRRSLPGTWTRSSPEHTTILRGGVRRC LQQQCEQTVRILHAKVAQKSYGNEKRFFCPPPCVYLSGPGWRVKPGQDQAHQAGETGPTV CGYMGLDSASGSATETQKLNFEQQPDSREFGCAKTLYISDADKRKHFRLVLRLVLRGGRE LGTFHSRLIKVISKPSQKKQSLKNTDLCISSGSKVSLFNRLRSQTVSTRYLSVEDGAFVA SARQWAAFTLHLADGHSAQGDFPPREGYVRYGSLVQLVCTVTGITLPPMIIRKVAKQCAL LDVDEPISQLHKCAFQFPGSPPGGGGTYLCLATEKVVQFQASPCPKEANRALLNDSSCWT IIGTESVEFSFSTSLACTLEPVTPVPLISTLELSGGGDVATLELHGENFHAGLKVWFGDV EAETMYRYGVVRQPLLGPGEQGKGVHASSESPQPSPWCSTPRSPRSLVCVVPDVAAFCSD WRWLRAPITIPMSLVRADGLFYPSAFSFTYTPEYSVRPGHPGVPEPATDADALLESIHQE FTRTNFHLFIQT >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_8|1659_bp atggaccccgcaggggcagcagacccctcagtgcctcccaatcctttgactcacctgagc ctgcaggacagatcagagatgcagctgcagagcgaagccgacaggcggagcctcccgggc acttggaccaggtcatccccagagcacaccaccattctgaggggaggcgtgcgcaggtgc ctgcagcaacagtgtgaacagactgtgcggatcctgcatgccaaggtggcccagaaatca tacggaaatgagaagcggttcttctgccccccgccctgtgtctacctctcggggcctggc tggagggtgaagccagggcaggatcaagctcaccaggcgggggaaacggggcccacggtc tgcggttacatgggactggacagcgcgtccggcagcgccactgagacgcagaagctgaat ttcgagcagcagccggactccagggaattcggctgcgccaagaccctgtacatctcagat gcagacaagaggaagcactttcggctggtgctgcggctggtgctgcgcgggggccgggag ctgggtaccttccacagccgccttatcaaggtcatctcgaagccctcgcagaagaagcag tcgctgaaaaacaccgatctgtgcatatcctccggctcaaaggtctccctcttcaaccgc ctgcgctctcagacggtctccacacgctacctctctgtggaggatggggcctttgtggcc agtgcacgacagtgggctgccttcacgctccacctggctgatgggcactctgcccaagga gacttcccaccgcgagagggctacgttcgctatggctccctggtgcagctcgtctgcacg gtcaccggcatcacactacctcccatgatcatccgtaaagtagcaaaacagtgtgcgctc cttgatgtggatgagcccatctcccagctgcacaagtgtgcattccagtttccaggcagt cccccaggagggggtggcacctacttatgccttgccacagagaaggtggtgcaatttcag gcctctccctgccccaaggaggcgaacagggctctgcttaacgacagctcttgctggacc atcatcggcaccgagtcggtggaattttccttcagcaccagcctggcgtgtaccctggag ccggtcactccggtgcctctcatcagcaccctagagctgagcggcgggggcgacgtggcc acgctggagctccacggagagaacttccacgcggggctcaaggtgtggtttggggacgtg gaggcagaaaccatgtacaggtacggggtggtgaggcagcctctcttgggccccggggag caggggaagggggtgcacgcgtcgtcggagtcgccgcagccctcaccctggtgctccacc cccaggagcccgcggtccctggtgtgcgtggtgccggacgtggcggccttctgcagcgac tggcgctggctgcgcgctcccatcacaatccccatgagcctggtgcgcgccgacgggctc ttctaccctagtgccttctccttcacctacaccccggaatacagcgtgcggccgggtcac cccggcgtccccgagcccgccaccgacgccgacgcgctcctggagagcatccatcaggag ttcacgcgcaccaacttccacctcttcatccagacttag >gi568815578r:45153002_45354543|GENSCAN_predicted_peptide_9|201_aa MAPARLFALLLFFVGGVAESIRETEVIDPQDLLEGRYFSGALPDDEDVVGPGQESDDFEL SGSGDLGTEDDLEDSMIGPEVVHPLVPLDNHIPERAGSGSQVPTEPKKLEENEVIPKRIS PVEESEDVSNKVSMSSTVQGSNIFERTEVLAALIVGGIVGILFAVFLILLLMYRMKKKDE GSYDLGKKPIYKKAPTNEFYA >gi568815578r:45153002_45354543|GENSCAN_predicted_CDS_9|606_bp atggcccccgcccgtctgttcgcgctgctgctgttcttcgtaggcggagtcgccgagtcg atccgagagactgaggtcatcgacccccaggacctcctagaaggccgatacttctccgga gccctaccagacgatgaggatgtagtggggcccgggcaggaatctgatgactttgagctg tctggctctggagatctgggtacggaagatgacttggaagactccatgatcggccctgaa gttgtccatcccttggtgcctctagataaccatatccctgagagggcagggtctgggagc caagtccccaccgaacccaagaaactagaggagaatgaggttatccccaagagaatctca cccgttgaagagagtgaggatgtgtccaacaaggtgtcaatgtccagcactgtgcagggc agcaacatctttgagagaacggaggtcctggcagctctgattgtgggtggcatcgtgggc atcctctttgccgtcttcctgatcctactgctcatgtaccgtatgaagaagaaggatgaa ggcagctatgacctgggcaagaaacccatctacaagaaagcccccaccaatgagttctac gcgtga