GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:28:51 Sequence gi568815578r:33308510_33543620 : 235111 bp : 47.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6583 6619 37 0 1 85 106 47 0.904 4.56 1.02 Intr + 8345 8466 122 0 2 106 99 43 0.517 7.41 1.03 Intr + 9505 9690 186 0 0 -62 67 205 0.406 3.29 1.04 Intr + 11413 11594 182 0 2 65 83 103 0.663 6.17 1.05 Intr + 13261 13347 87 1 0 67 36 117 0.654 3.49 1.06 Intr + 15414 15447 34 0 1 57 79 54 0.681 -0.37 1.07 Intr + 16139 16192 54 1 0 86 59 40 0.431 0.08 1.08 Intr + 16569 16592 24 2 0 109 105 -8 0.600 1.22 1.09 Intr + 17193 17345 153 2 0 75 92 140 0.991 13.37 1.10 Intr + 19909 20098 190 0 1 31 49 136 0.631 3.16 1.11 Intr + 24656 24706 51 0 0 82 90 47 0.254 3.18 1.12 Intr + 37053 37162 110 1 2 70 116 13 0.003 2.30 1.13 Intr + 39211 39466 256 0 1 58 93 73 0.008 1.82 1.14 Intr + 40819 41090 272 2 2 56 93 87 0.456 3.26 1.15 Intr + 41169 41311 143 2 2 54 98 222 0.560 18.95 1.16 Intr + 41657 41758 102 2 0 120 96 96 0.898 12.89 1.17 Intr + 41803 41861 59 1 2 55 68 18 0.274 -4.97 1.18 Intr + 44187 44275 89 1 2 77 110 95 0.527 10.29 1.19 Term + 45440 45565 126 1 0 75 48 67 0.545 -0.32 1.20 PlyA + 47011 47016 6 1.05 2.11 PlyA - 49257 49252 6 1.05 2.10 Term - 50486 50310 177 2 0 101 38 19 0.384 -4.11 2.09 Intr - 51982 51842 141 1 0 112 100 47 0.927 8.85 2.08 Intr - 58499 58350 150 2 0 93 60 143 0.984 12.36 2.07 Intr - 62120 61990 131 0 2 108 99 81 0.981 11.61 2.06 Intr - 64188 64133 56 2 2 99 88 -8 0.696 -1.08 2.05 Intr - 71182 70941 242 1 2 41 71 189 0.565 8.85 2.04 Intr - 77261 77141 121 1 1 112 72 51 0.965 6.40 2.03 Intr - 79024 78814 211 2 1 95 56 322 0.921 27.67 2.02 Intr - 83733 83633 101 2 2 94 38 72 0.693 2.55 2.01 Init - 93049 92919 131 1 2 71 83 114 0.868 9.00 2.00 Prom - 98443 98404 40 -8.46 3.10 PlyA - 99487 99482 6 1.05 3.09 Term - 100090 99998 93 1 0 88 43 94 0.863 2.73 3.08 Intr - 100379 100192 188 0 2 82 78 245 0.958 22.31 3.07 Intr - 101822 101626 197 1 2 89 69 275 0.910 24.76 3.06 Intr - 103917 103787 131 0 2 124 80 122 0.999 14.49 3.05 Intr - 104273 104066 208 1 1 106 82 252 0.999 25.48 3.04 Intr - 109322 109210 113 2 2 91 110 72 0.896 8.88 3.03 Intr - 127294 127191 104 2 2 57 77 6 0.038 -3.71 3.02 Intr - 130517 130332 186 0 0 78 98 230 0.968 22.66 3.01 Init - 135111 134802 310 0 1 106 86 666 0.968 63.68 3.00 Prom - 159157 159118 40 -1.46 4.05 PlyA - 163014 163009 6 1.05 4.04 Term - 181838 181576 263 0 2 7 47 166 0.015 0.09 4.03 Intr - 201623 201540 84 0 0 95 75 45 0.057 3.69 4.02 Intr - 213116 213033 84 0 0 84 76 31 0.154 1.29 4.01 Init - 222936 222888 49 0 1 65 65 44 0.249 0.91 4.00 Prom - 229046 229007 40 -1.66 5.02 PlyA - 229614 229609 6 1.05 5.01 Term - 230693 230593 101 0 2 115 44 51 0.382 1.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 39226 39466 241 0 1 85 93 134 0.981 9.66 S.002 Init - 111153 111073 81 0 0 78 98 2 0.854 1.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:33308510_33543620|GENSCAN_predicted_peptide_1|758_aa MPALPLTTCVTLGPGQENAAALDVTACWTLLPQAQGKAQHLLCLQRTRISGKQPHTEAGL ASSDSSDLFSEHYTLDCMLSLPVTGARSKGVAVLDHLPFVGKRLSKKSSGLDLSQQIRIL TPHRVTEAWPGHTNILHASHRLVIEDAKRPEITLQILSDSLLQVTLRCKLYLSLQEIPWL KVIKSIHIGVRLEQTGNTTKVAFEDKEFHVDPQDSTAADLIQLLSRIELEPRPRVSKLRL SKASSQSRGSIQLTINTPDPPTVRLDGHTATIIQPGLLVLLGLSITSSVSVSWSNVLSTA EPQIIFKRHQKAELSEISPPSLDIRNKPKLFQTPLPWFGLPQKQTLSWGSEGGFGGGQGS EQKKIPPQSQNVHDRHWAIKLGCSSVINSLAAGAEGKAASENLDPSTPGKVSATTMLTLW ALAVMLAVEEALGQLELADRPSLIPTLPVRFPPGLLPGSLSVSKVPLTGKYPARTKGGRC PPVTKYFISDSKLEDYMNATLPLQIEKILKCEKVNLAGLLGTVLSTVSDLDLLSLLDLTS PLDILGGASLSGILGEGSGGKSSKLPLLSELTGAVSGLLPQGTEGLRPLKDVTDKVQDLK ESAQGVLNSTLPSGISDALPDLLKNADLEQLLLGLQVEKVTVESMKSTTTGDGIHVQATT TAFIGGKGVVTSSEQRPGRDNREDSSMDTKNVSVSLILSYAMLKVIITHTAKQSSVQRNN LDARITKLTYSHRPDIKSKPATGLTSPRTVGALSPGKR >gi568815578r:33308510_33543620|GENSCAN_predicted_CDS_1|2277_bp atgccggctctgccactcactacctgtgtcaccttgggtcctggccaagagaatgctgct gctctggatgtcactgcttgctggacgctgctgccccaggcccaggggaaggcccagcac ctactctgccttcagaggacaagaatcagtgggaaacagccccacactgaagcggggctt gcatcctcggacagttcagacctgttctcagagcactacaccctggactgcatgctcagc ctgcctgtgacaggggccaggagcaaaggtgttgctgtcctggaccacctgccctttgta ggcaaacgcctctccaagaagagcagtgggctggacctgtcgcagcagatccggatactc acaccccacagagtcacagaggcctggccaggccacaccaacatcctccatgcctctcac aggttggtcatagaagacgccaaaagacctgagatcaccctgcaaatcctaagtgatagc ctgctgcaggtcacgttgcgctgcaaactgtacctctcactccaggagatcccgtggctc aaagtcatcaagagcattcacattggagtacggctggaacagacagggaataccaccaag gtggctttcgaggacaaggagttccacgtggacccccaggactccacagctgcagacctc attcagctattgtcacggattgagctggagcccaggcccagggtgagcaaactgaggctc agcaaggcttccagtcagtccagagggagcatacaactgaccatcaacacccctgatcct cccacggtccgccttgacggccacacggccaccatcatccagccgggcttgctagtgcta ctggggctcagcatcacctcctctgtctcagtttcctggtcgaatgttctcagcacagca gaaccgcagattatctttaagagacaccagaaagctgaactttctgagatttcaccccca tctttagatatccgcaacaaacctaaactcttccaaacaccattgccttggtttgggctc cctcagaagcagactctgagctggggaagtgaaggtggcttcgggggagggcagggcagt gaacagaaaaagatcccgccccagagtcaaaacgtgcacgacagacattgggctataaaa ttgggatgcagctctgtcatcaacagtctggcggcaggagcagaaggaaaggcagcttcc gaaaacttggacccatccaccccaggcaaggtatcagccaccacgatgctgactctctgg gccctggctgtcatgttggcggtcgaggaagcacttggccagctggagctcgcagatagg ccttccctaatacccactttgcctgtccgttttccacctggattgctccctggaagtctg tcagtctcaaaagtccctctaaccggaaaatacccagcgaggaccaaaggaggcaggtgt ccacccgtcaccaagtacttcatatctgacagcaaactcgaagactatatgaatgccacc ttgcccctgcagattgagaagattctgaagtgtgagaaggttaacttggctggtttgctt gggaccgtattatccacagtgagcgacttggacctgctgtctctattagacctcacttca ccccttgatatacttggaggtgccagcctcagtggtatcctaggtgagggaagtggcggc aagtcctcgaaacttccattgctctcagaactcactggtgctgtcagtggtctgctaccc caggggacggagggtctgcggcctctgaaggatgtgaccgacaaagtccaagacctcaaa gagtctgctcagggcgtgctgaacagcaccctgccctcaggcatcagcgatgcactccca gacctgctgaaaaatgctgacctggaacagctcttgctgggattacaggttgaaaaagta actgtggagagcatgaagtcaaccacgacaggcgatgggatccatgtccaagccacgact acggccttcataggtggaaaaggggttgtaacatccagtgagcaaagaccaggtagagac aacagagaggacagcagcatggacactaaaaatgtcagcgtttccctaattctgtcctac gcgatgctgaaggtcatcatcactcacactgccaagcagagctctgtgcagagaaataac ctggatgcaagaatcaccaaactaacctactcccaccggccagacataaaatctaagcca gctactgggttaacatcaccaaggacggtgggagctttgtcaccgggcaaacggtga >gi568815578r:33308510_33543620|GENSCAN_predicted_peptide_2|486_aa MTRPEVAAERKSLVYERSGGPVRGSDLPAELTLRDPGHCCCCGREKAEQTIWNRLHQLKA LKTRRPRSRVPLRIGILGCMAERLKEEILNREKMVDILAGPDAYRDLPRLLAVAESGQQA ANVLLSLDETYADVMPVQTSASATSAFVSIMRGCDNMCSYCIVPFTRGRERSRPIASILE EVKKLSEQGLKEVTLLGQNVNSFRDNSEVQFNSAVPTNLSRGFTTNYKTKQGGLRFAHLL DQVSRVDPEMRIRFTSPHPKDFPDEVSVRYSREAYVELVHHIRESIPGVSLSSDFIAGFC GETEEDHVQTVSLLREVQYNMGFLFAYSMRQKTRAYHRLKDDVPEEVKLRRLEELITIFR EEATKANQTSVGCTQLVLVEGLSKRSATDLCGRNDGNLKVIFPDAEMEDVNNPGLRVRAQ PGDYVLVKEGETLPATEETGHEGGDKLQGAKQLYVSGKRVSLKLLCEQLLQSLNLPKLRL TLCLSL >gi568815578r:33308510_33543620|GENSCAN_predicted_CDS_2|1461_bp atgacccggcctgaagtagcggcggaacggaagtcgcttgtgtatgaacgcagcggcgga cctgtgaggggatccgacttgccggcagaacttacgctgcgggaccccgggcactgttgc tgctgcgggagggagaaggctgagcagaccatctggaaccgtttacatcagcttaaagcc ttgaagacaaggcggccccgctcccgggttcctctgaggattggaattctaggctgcatg gctgagaggttgaaggaggagattctcaacagagagaaaatggtagatattttggctggt cctgatgcctaccgggaccttccccggctgctggctgttgctgagtcgggccagcaagct gccaacgtgctgctctctctggacgagacctatgctgatgtcatgccagtccagacaagc gccagtgccacgtctgcctttgtgtcaatcatgcgaggctgtgacaacatgtgtagctac tgcattgttcctttcacccggggcagggagaggagtcggcctattgcctccattctagag gaagtgaagaagctttctgagcaggggctgaaagaagtgacacttcttggtcagaatgtt aatagttttcgggacaattcggaggtccagttcaacagtgcagtgcctaccaatctcagt cgtggctttaccaccaactataaaaccaagcaaggaggacttcgttttgctcatcttctg gatcaggtctccagagtagatcctgaaatgaggatccgttttacctctccccaccccaag gattttcctgatgaggtgagcgtcagatattcaagagaagcttatgtggagttagttcac catattagagaatctattccaggtgtgagcctcagcagcgatttcattgctggcttttgt ggtgagacggaggaagatcacgtccagacagtctctttgctccgggaagttcagtacaac atgggcttcctctttgcctacagcatgagacagaagacacgggcatatcataggctgaag gatgatgtcccggaagaggtaaaattaaggcgtttggaggaactcatcactatcttccga gaagaagcaacaaaagccaatcagacctctgtgggctgtacccagttggtgctagtggaa gggctcagtaaacgctctgccactgacctgtgtggcaggaatgatggaaaccttaaggtg atcttccctgatgcagagatggaggatgtcaataaccctgggctcagggtcagagcccag cctggggactatgtgctggtgaaggaaggggagacattgcctgccactgaggaaacaggt catgaaggtggagataagctgcaaggggcgaagcaactttatgtcagtggaaaacgtgtc tctttaaagctgctatgtgaacagcttttacagtcattaaatttacctaaactaaggtta accctttgtctctcactttga >gi568815578r:33308510_33543620|GENSCAN_predicted_peptide_3|509_aa MASGRRAPRTGLLELRAGAGSGAGGERWQRVLLSLAEDVLTVSPADGDPGPEPGAPREQE PAQLNGAAEPGAGPPQLPEALLLQRRRVTVRKADAGGLGISIKGGRENKMPILISKIFKG LAADQTEALFVGDAILSVNGEDLSSATHDEAVQVLKKTGKEVVLEGVNTQAFPLTHVMHS SSLSQETFMDIYCVPAVYIKRQPSSPGPTPRNFSEAKHMSLKMAYVSKRCTPNDPEPRYL EICSADGQDTLFLRAKDEASARSWATAIQAQVNTLTPRVKDELQALLAATSTAGSQDIKQ IGWLTEQLPSGGTAPTLALLTEKELLLYLSLPETREALSRPARTAPLIATRLVHSGPSKG SVPYDAELSFALRTGTRHGVDTHLFSVESPQELAAWTRQLVDGCHRAAEGVQEVSTACTW NGRPCSLSVHIDKGFTLWAAEPGAARAVLLRQPFEKLQMSSDDGASLLFLDFGGAEGEIQ LDLHSCPKTIVFIIHSFLSAKVTRLGLLA >gi568815578r:33308510_33543620|GENSCAN_predicted_CDS_3|1530_bp atggcgtccggcaggcgcgccccgcgcaccgggctgctggagctgcgcgccggggcgggc tcgggggccggcggcgagcgatggcagcgggtgctgctgagtctggcggaggacgtgctg accgtgagccccgccgacggcgaccctggtcccgagcccggcgctccgcgggagcaggag cccgcgcagctcaacggcgccgcggagccgggcgccgggcccccgcagctgccagaggcg ctactgctccagcggcgccgcgtgacggtgcgcaaggccgacgccggtgggctgggcatc agcatcaaaggcggccgggagaacaagatgcctattctcatttccaagatcttcaaggga ttggcagctgaccagacagaggccctttttgtgggggatgccatcctgtctgtgaatggg gaagacttgtcctctgctacccatgatgaggcggtgcaggtcctcaagaagacaggcaag gaggtggtgctggagggagtgaatactcaagccttccctctaactcatgtcatgcattcc tcctcactaagccaagagacattcatggacatctactgtgtgcctgctgtgtatattaag cggcagccttcctcccctggccccacaccccggaacttcagcgaggccaaacacatgtcc ttgaagatggcatatgtctcgaagaggtgcacccccaatgacccggagcccaggtatctg gagatctgctcggcagatggtcaagacaccctcttcctgagggccaaggatgaggctagt gcgaggtcgtgggcgactgccatccaagcccaggtcaatactctgacgccgcgggtcaag gatgagctgcaggcactgttggcagccaccagcacagctgggagccaggacatcaagcag attggctggctaactgagcagctgcccagtgggggcacagcccccaccctggccctgcta actgaaaaggaactgctcctctacttgtctctccccgagacccgcgaggccctgagccgg ccagcccgtactgccccactcatcgccaccagactggtgcactcaggcccctccaagggc tcagtgccctacgatgcagagctctcttttgccctgcgcacgggcacgcgtcacggtgtg gacactcacctgttcagcgtggagtcaccgcaggagctggctgcctggacccgccagctt gtggatggctgtcaccgggccgccgagggtgtgcaggaggtgtctacagcctgcacgtgg aatgggcgtccctgcagcctgtctgtgcacatcgacaagggcttcacactgtgggcggct gagccaggtgcagcccgagctgtgctcctgcgacagcccttcgagaagctgcagatgtct tcagatgacggtgccagtctccttttcctggattttggaggtgctgaaggcgagatccag ctggacctgcactcgtgtcccaaaaccatagtcttcatcatccactccttcctgtcggcc aaagtcacccgcctcgggctgttggcctag >gi568815578r:33308510_33543620|GENSCAN_predicted_peptide_4|159_aa MHFKILNTDEEDIQQQYGTAAVKVKHLDSMQFIRTMLAVDQASPGTWTQLETIILSKLSQ GQKTKHRMFSLTGPHTPPSAARIHAPLTLEGGRSRDAYHRRAPPPRAARLYPRHGSLAGA ARLDPRPGSLAGVAARSAAAELTRHHHRCCRRRRRRRRR >gi568815578r:33308510_33543620|GENSCAN_predicted_CDS_4|480_bp atgcatttcaaaatcctaaacacagatgaagaggatatacagcagcagtatggtacagct gcagttaaagtaaaacacctggactccatgcagtttattagaacaatgttagctgtagat caggcaagcccagggacatggacgcagctggaaaccatcattctgagcaaactatcacaa ggacagaaaaccaaacaccgcatgttctcactcacaggccctcacacgcccccctcggcc gcccgcattcacgccccacttacgctggaaggcggccgctccagggacgcctaccatcgc cgagcgccgccgccgcgcgccgcccgcctctacccgcgacacgggtccctcgcaggcgcc gcccgcctcgacccgcggcccgggtccctcgcaggcgtcgccgcgagatctgcagccgcc gagctaaccagacaccaccaccgctgctgtcgccgtcgccgccgccgccgccgccgttag >gi568815578r:33308510_33543620|GENSCAN_predicted_peptide_5|33_aa XKQTGLTVKKLAQDYTDTKQQNGNSNQISEPQL >gi568815578r:33308510_33543620|GENSCAN_predicted_CDS_5|102_bp nataagcaaacaggcctaacagttaaaaaacttgcccaagattatacagacactaagcag caaaatgggaattcaaaccaaatttccgaacctcaactctga