GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:46:49 Sequence gi568815581f:49110725_49322712 : 211988 bp : 46.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 21999 22082 84 2 0 32 100 162 0.432 11.82 1.02 Intr + 30548 30723 176 1 2 44 85 96 0.251 3.64 1.03 Intr + 31311 31448 138 0 0 86 80 110 0.284 9.58 1.04 Intr + 42076 42182 107 1 2 125 111 73 0.960 13.16 1.05 Intr + 45842 45879 38 0 2 110 105 41 0.577 5.98 1.06 Intr + 48313 48483 171 0 0 96 69 194 0.669 18.44 1.07 Intr + 48995 49070 76 1 1 31 85 90 0.672 2.09 1.08 Intr + 49831 49917 87 2 0 101 110 78 0.999 11.24 1.09 Intr + 53364 53551 188 1 2 80 95 244 0.999 23.71 1.10 Intr + 55390 55530 141 0 0 89 79 241 0.998 23.85 1.11 Intr + 57957 58176 220 2 1 57 75 165 0.897 9.97 1.12 Intr + 58799 58947 149 0 2 134 9 102 0.007 6.85 1.13 Intr + 61436 61576 141 1 0 97 39 59 0.005 2.45 1.14 Intr + 72219 72353 135 2 0 72 69 129 0.524 10.16 1.15 Intr + 72690 72840 151 2 1 71 66 43 0.287 0.14 1.16 Term + 81548 81825 278 0 2 -13 42 213 0.450 2.22 1.17 PlyA + 82064 82069 6 1.05 2.03 PlyA - 83684 83679 6 1.05 2.02 Term - 96158 96033 126 2 0 91 45 138 0.682 8.08 2.01 Init - 96698 96615 84 2 0 96 105 156 0.998 18.92 2.00 Prom - 97086 97047 40 -11.82 3.00 Prom + 98344 98383 40 -7.96 3.01 Init + 100001 100117 117 1 0 90 96 202 0.938 21.40 3.02 Intr + 100383 100491 109 2 1 89 68 5 0.417 -1.54 3.03 Intr + 103149 103257 109 1 1 70 95 37 0.504 1.94 3.04 Intr + 105749 105974 226 2 1 80 110 300 0.531 29.39 3.05 Intr + 107015 107191 177 1 0 74 75 282 0.965 25.62 3.06 Intr + 108816 108901 86 2 2 109 60 51 0.963 3.02 3.07 Intr + 109134 109229 96 0 0 98 80 117 0.999 11.02 3.08 Intr + 109517 109602 86 2 2 65 90 82 0.806 5.56 3.09 Intr + 111367 111501 135 2 0 72 77 90 0.960 6.84 3.10 Term + 111828 111991 164 1 2 100 48 243 0.999 19.60 3.11 PlyA + 112272 112277 6 1.05 4.02 PlyA - 112663 112658 6 -3.94 4.01 Sngl - 114271 113522 750 1 0 96 47 1468 0.431 139.68 4.00 Prom - 117355 117316 40 -3.56 5.11 PlyA - 117606 117601 6 -0.45 5.10 Term - 120194 119975 220 1 1 48 55 128 0.106 2.01 5.09 Intr - 147185 147039 147 1 0 109 95 -10 0.002 1.15 5.08 Intr - 179849 179683 167 2 2 125 66 34 0.308 3.66 5.07 Intr - 188200 187908 293 2 2 90 69 251 0.679 20.25 5.06 Intr - 191056 190923 134 0 2 72 70 -47 0.199 -7.81 5.05 Intr - 194965 194911 55 2 1 84 98 100 0.720 8.64 5.04 Intr - 200732 200588 145 2 1 46 99 202 0.991 16.96 5.03 Intr - 201318 201203 116 1 2 68 94 12 0.959 -0.03 5.02 Intr - 202121 201974 148 2 1 72 111 122 0.984 12.71 5.01 Init - 206938 206102 837 1 0 73 96 704 0.916 64.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 41558 41673 116 1 2 68 80 17 0.883 -3.26 S.002 Term + 58799 59004 206 0 2 134 43 150 0.989 12.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:49110725_49322712|GENSCAN_predicted_peptide_1|759_aa GGAVGGLAARLRDIRQSEGGGIRDDFGRILVIILVLGIVGFMFGSMFLQAVFSSPKPELP SPAPGVQKLKLLPEERLRNLFSYDGIWLFPKNQCKCEANKEQGGYNFQDAYGQSDLPAVK ARRQAEFEHFQRREGLPRPLPLLVQPNLPFGYPVHGVEVMPLHTVPIPGLQFEGPDAPVY EVTLTASLGTLNTLADVPDSVVQGRGQKQLIISTSDRKLLKFILQHVTYTSTGYQHQKYY PHIAIDPIEPLEERTNHAVRFAPVSLESRSSVAKFPVTIRHPVIPKLYDPGPERKLRNLV TIATKTFLRPHKLMIMLRSIREYYPDLTVIVADDSQKPLEIKDNHVEYYTMPFGKGWFAG RNLAISQVTTKYVLWVDDDFLFNEETKIEVLVDVLEKTELDVVGGSVLGNVFQFKLLLEQ SENGACLHKRMGFFQPLDGFPSCVVTSGVVNFFLAHTERLQRVGFDPRLQRVAHSEFFID GLGTLLVGSCPEVIIGHQSRSPVVDSELAALEKTYNTYRSNTLTRVGGCVRQLLLMIVGV LLSISLDMAATRGSSGSSWNLFLSIWLVARFQVHRYQQHENGLIQKVGVTGGQFFWLEIS VATVPLPEFLSCIRKNELFVLSCPVSLPSLSSGHPLPALAEPRAFMDLKEEEVHADWSMS SHGQAGRVRPHIANPASLLPFRPPSPRRSAGRHDIPGTRNAGNTKDAEALATPPVGPRSR AQSLKHGHRCRSGKAAQSPPSLRRLAAALRSQQVPWCNG >gi568815581f:49110725_49322712|GENSCAN_predicted_CDS_1|2280_bp ggcggggcagtcggcggcctggctgctaggctccgtgacatccggcagtctgagggcggc gggattcgggatgacttcgggcggatattggtcataatcctggtacttggcattgttgga tttatgttcggaagcatgttccttcaagcagtgttcagcagccccaagccagaactccca agtcctgccccgggtgtccagaagctgaagcttctgcctgaggaacgtctcaggaacctc ttttcctacgatggaatctggctgttcccgaaaaatcagtgcaaatgtgaagccaacaaa gagcagggaggttacaactttcaggatgcctatggccagagcgacctcccagcggtgaaa gcgaggagacaggctgaatttgaacactttcagaggagagaagggctgccccgcccactg cccctgctggtccagcccaacctcccctttgggtacccagtccacggagtggaggtgatg cccctgcacacggttcccatcccaggcctccagtttgaaggacccgatgcccccgtctat gaggtcaccctgacagcttctctggggacactgaacacccttgctgatgtcccagacagt gtggtgcagggcagaggccagaagcagctgatcatttctaccagtgaccggaagctgttg aagttcattcttcagcacgtgacatacaccagcacggggtaccagcaccagaagtactat ccccacattgccattgaccccatagagccactcgaggagagaacaaatcatgctgtgcgc tttgcacctgtgagtctggagtccaggtcctcagtggccaagtttccagtgaccatccgc catcctgtcatacccaagctatacgaccctggaccagagaggaagctcagaaacctggtt accattgctaccaagactttcctccgcccccacaagctcatgatcatgctccggagtatt cgagagtattacccagacttgaccgtaatagtggctgatgacagccagaagcccctggaa attaaagacaatcacgtggagtattacactatgccctttgggaagggttggtttgctggt aggaacctggccatatctcaggtcaccaccaaatacgttctctgggtggacgatgatttt ctcttcaacgaggagaccaagattgaggtgctggtggatgtcctggagaaaacagaactg gacgtggtaggcggcagtgtgctgggaaatgtgttccagtttaagttgttgctggaacag agtgagaatggggcctgccttcacaagaggatgggatttttccaacccctggatggcttc cccagctgcgtggtgaccagtggcgtggtcaacttcttcctggcccacacggagcgactc caaagagttggctttgatccccgcctgcaacgagtggctcactcagaattcttcattgat gggctagggaccctactcgtggggtcatgcccagaagtgattataggtcaccagtctcgg tctccagtggtggactcagaactggctgccctagagaagacctacaatacataccggtcc aacaccctcacccgggtaggtggctgtgttcgacagctgttgctcatgatagttggggtc ctcctcagcatcagtcttgacatggctgcaaccaggggatcctcgggatcctcctggaat ctcttcctcagcatctggctcgtggcaaggtttcaggtacatcgttatcagcagcatgaa aacggactaatacagaaagttggtgttactgggggtcaatttttctggctggaaatctct gtagccacagtgcctttgcctgagttcttgtcctgcatccggaagaatgagctgttcgtc ctgtcctgtcctgtctccctgccctctttgtcctctggccatcctctgcctgctctggct gagcccagggcttttatggacctcaaagaggaggaagtgcatgccgattggtccatgagc agccatgggcaggccggaagagtccggccccacatcgccaaccccgccagcctgctgcct ttccgacctcccagcccccggaggagcgcaggaagacatgatatccccgggaccagaaac gccgggaacacgaaggacgcagaagccctcgccacgccgcccgtggggccccggtccagg gcccagtccctcaaacacggacaccgctgccgctctggaaaggccgcccagagccctcct agcttgaggagactcgcggccgccttaagaagccagcaggtcccatggtgtaatggttag >gi568815581f:49110725_49322712|GENSCAN_predicted_peptide_2|69_aa MAQDLSEKDLLKMEVEQLKKEVKNTRIPISKAGKEIKEYVEAQAGNDPFLKGIPEDKNPF KEKGGCLIS >gi568815581f:49110725_49322712|GENSCAN_predicted_CDS_2|210_bp atggcccaggatctcagcgagaaggacctgttgaagatggaggtggagcagctgaagaaa gaagtgaaaaacacaagaattccgatttccaaagcgggaaaggaaatcaaggagtacgtg gaggcccaagcaggaaacgatccttttctcaaaggcatccctgaggacaagaatcccttc aaggagaaaggtggctgtctgataagctga >gi568815581f:49110725_49322712|GENSCAN_predicted_peptide_3|434_aa MAELQQLQEFEIPTGREALRGNHSALLRVADYCEDNYVQVSPQVHPSEFPRVNPETQSLR PHTDAGDHSISRPALVKEGPELNQNLLSCLVLISWVDFFAYSIFLVTYAEEIQPRTQWKM LALRPGLCVFQATDKRKALEETMAFTTQALASVAYQVGNLAGHTLRMLDLQGAALRQVEA RVSTLGQMVNMHMEKVARREIGTLATVQRLPPGQKVIAPENLPPLTPYCRRPLNFGCLDD IGHGIKDLSTQLSRTGTLSRKSIKAPATPASATLGRPPRIPEPVHLPVVPDGRLSAASSA FSLASAGSLDPPPPPAAVEVFQRPPTLEELSPPPPDEELPLPLDLPPPPPLDGDELGLPP PPPGFGPDEPSWVPASYLEKVVTLYPYTSQKDNELSFSEGTVICVTRRYSDGWCEGVSSE GTGFFPGNYVEPSC >gi568815581f:49110725_49322712|GENSCAN_predicted_CDS_3|1305_bp atggcggagctacagcagctgcaggagtttgagatccccactggccgggaggctctgagg ggcaaccacagtgccctgctgcgggtcgctgactactgcgaggacaactatgtgcaggtg tcacctcaggttcatccttctgagttcccacgggtcaatccagagacccaaagcctccgt cctcacacagatgctggcgatcattctatttccaggccggctctggtaaaggaaggtcct gaactaaatcaaaatctgctctcctgtctggtgttaatatcctgggttgatttctttgcc tattcaatcttcttggtgacctatgctgaggagatccagccaaggacccagtggaagatg ctggcccttaggccagggctgtgtgtatttcaggccacagacaagcggaaggcgctggag gagaccatggccttcactacccaggcactggccagcgtggcctaccaggtgggcaacctg gccgggcacactctgcgcatgttggacctgcagggggccgccctgcggcaggtggaagcc cgtgtaagcacgctgggccagatggtgaacatgcatatggagaaggtggcccgaagggag atcggcaccttagccactgtccagcggctgccccccggccagaaggtcatcgccccagag aacctaccccctctcacgccctactgcaggagacccctcaactttggctgcctggacgac attggccatgggatcaaggacctcagcacgcagctgtcaagaacaggcaccctgtctcga aagagcatcaaggcccctgccacacccgcctccgccaccttggggagaccaccccggatt cccgagccagtgcacctgccggtggtgcccgacggcagactctccgccgcctcctctgcg ttttccctggcctcggccggctccttggacccacctcctccaccagcagccgtcgaggtg ttccagcggcctcccacgctggaggagttgtccccacccccaccggacgaagagctgccc ctgccactggacctgcctcctcctccacccctggatggagatgaattggggctgcctcca cccccaccaggatttgggcctgatgagcccagctgggtgcctgcctcatacttggagaaa gtggtgacactgtacccatacaccagccagaaggacaatgagctctccttctctgagggc actgtcatctgtgtcactcgccgctactccgatggctggtgcgagggcgtcagctcagag gggactggattcttccctgggaactatgtggagcccagctgctga >gi568815581f:49110725_49322712|GENSCAN_predicted_peptide_4|249_aa MAAQGAPRFLLTFDFDETIVDENSDDSIVRAAPGQRLPESLRATYREGFYNEYMQRVFKY LGEQGVRPRDLSAIYEAIPLSPGMSDLLQFVAKQGACFEVILISDANTFGVESSLRAAGH HSLFRRILSNPSGPDARGLLALRPFHTHSCARCPANMCKHKVLSDYLRERAHDGVHFERL FYVGDGANDFCPMGLLAGGDVAFPRRGYPMHRLIQEAQKAEPSSFRASVVPWETAADVRL HLQQVLKSC >gi568815581f:49110725_49322712|GENSCAN_predicted_CDS_4|750_bp atggccgcgcagggcgcgccgcgcttcctcctgaccttcgacttcgacgagactatcgtg gacgaaaacagcgacgattcgatcgtgcgcgccgcgccgggccagcggctcccggagagc ctgcgagccacctaccgcgagggcttctacaacgagtacatgcagcgcgtcttcaagtac ctgggcgagcagggcgtgcggccgcgggacctgagcgccatctacgaagccatccctttg tcgccaggcatgagcgacctgctgcagtttgtggcaaaacagggcgcctgcttcgaggtg attctcatctccgatgccaacacctttggcgtggagagctcgctgcgcgccgccggccac cacagcctgttccgccgcatcctcagcaacccgtcggggccggatgcgcggggactgctg gctctgcggccgttccacacacacagctgcgcgcgctgccccgccaacatgtgcaagcac aaggtgctcagcgactacctgcgcgagcgggcccacgacggcgtgcacttcgagcgcctc ttctacgtgggcgacggcgccaacgacttctgccccatggggctgctggcgggcggcgac gtggccttcccgcgccgcggctaccccatgcaccgcctcattcaggaggcccagaaggcc gagcccagctcgttccgcgccagcgtggtgccctgggaaacggctgcagatgtgcgcctc cacctgcaacaggtgctgaagtcgtgctga >gi568815581f:49110725_49322712|GENSCAN_predicted_peptide_5|753_aa MAQEDSRRGQVPSSFYHGANQELDLSTKVYKRESGSPYSVLVDTKMSKPHLHETEEQPYF RETRAVSDVHAVKEDRENSDDTEEEEEEVSYKREQIIVEVNLNNQTLNVSKGEKGVSSQS KETPVLKTSSEEEEEESEEEATDDSNDYGENEKQKKKEKIVEKVSVTQRRTRRAASVAAA TTSPTPRTTRGRRKSVEPPKRKKRATKEPKAPVQKAKCEEKETLTCEKCPRVFNTRWYLE KHMNVTHRRMQICDKCGKKFVLESELSLHQQTDCEKNIQCVSCNKSFKKLWSLHEHIKIV HGYAEKKFSCEICEKKFYTMAHVRKHMVAHTKDMPFTCETCGKSFKRSMSLKVHSLQHSG EKPFRCENCDERFQYKYQLRSHMSIHIGHKQFMCQWCGKDFNMKQYFDEHMKTHTGAYSL VNGNDGVDDDDNSSAGITGLKQHTLPEVEVFKTVRGNRLKRRKTIWAQNSSRKMNMSHRE KPFICEICGKSFTSRPNMKRHRRTHTGEKPYPCDVCGQRFRFSNMLKAHKEKCFRVTSPV NVPPAVQIPLTTSPATPVPSVVNTATTPTPPINMNPATTITSIGGTRALGELWSMSKEML LTFSAPKRYLLHQGTSPLGQALPPDQCNGHSSFWPHSHFDPLILTCPLAHGYVRNNGTHF TGAKPKEKSEVANGTPQNTRCWRSRSLRGRREAWPRAARPFDVRRHRCDWTVACDVGDCG GLRCCSSRSAGRGSGSGSGSRAFKGDAAAARGG >gi568815581f:49110725_49322712|GENSCAN_predicted_CDS_5|2262_bp atggcacaagaagatagccgtcgtggtcaagtgccatcttccttttatcatggtgccaac caagaacttgacctgtccaccaaagtgtacaaaagggaatcaggaagtccttattctgtg ttagtggacaccaagatgagcaaaccgcatctccatgaaacagaagaacagccatatttc agggagacaagagcagtgtctgacgtgcatgctgttaaggaagaccgggagaattctgat gacacagaggaggaagaggaagaagtctcttacaaaagggagcagatcatagtggaggta aaccttaataatcaaacattaaatgtatctaaaggggaaaagggtgtctcttctcagtcc aaagagactcctgttcttaagacaagcagtgaggaggaagaggaagagagtgaggaagag gccacagatgacagcaatgactatggagagaatgaaaagcagaagaaaaaggagaagata gtagagaaagtcagcgttacacaaaggagaaccaggagagctgcctctgttgccgcagct accacttcccctactcccagaactacaagaggtcgtaggaagagtgtagagccacctaag cgtaagaagcgggccacaaaggagcccaaagcaccagtccagaaagctaagtgtgaagag aaagagactctgacctgtgagaagtgccccagggtatttaacactcgctggtacctggag aagcacatgaacgttactcataggcgcatgcagatttgtgataaatgtggcaagaagttt gtcctggaaagtgagctgtcccttcaccagcaaacagactgtgaaaaaaacattcagtgt gtttcctgtaacaaatcgttcaagaaactctggtcccttcatgaacatatcaagatcgtc catggatatgcagaaaagaaattttcctgtgaaatttgtgagaagaaattctataccatg gctcatgtgcggaaacacatggttgcacacacaaaagacatgccatttacatgcgaaacc tgtggaaaatcattcaaacgcagtatgtcactcaaggtgcactccttgcagcattctgga gagaagccctttagatgcgagaactgtgacgaaaggtttcagtacaagtaccagctacgc tcccacatgagcattcatattgggcacaaacagttcatgtgccagtggtgtggcaaggat ttcaacatgaagcagtacttcgacgaacacatgaaaacacacactggagcttatagtctg gtgaatggtaatgatggtgttgatgatgatgataatagcagtgctgggattacaggcttg aaacaacacaccctgccagaagttgaagtttttaagacggttagaggaaatcgattaaaa agaaggaaaacaatctgggctcagaattcatccagaaagatgaatatgagtcacagagag aaaccctttatctgtgaaatctgtggcaaaagcttcaccagccgccccaacatgaagaga caccgcagaactcacacaggcgagaagccctatccatgtgatgtgtgtggccagcggttc cgcttctcgaacatgcttaaggcccacaaggagaagtgctttcgggtgaccagccccgtg aatgtgccacctgctgtccagatcccacttacaacttccccagccaccccagttccttct gtggtgaacacagccacaaccccaacccctccaatcaatatgaatcctgccaccaccatc acatctataggaggaacaagagcactgggggaactctggagtatgagtaaggaaatgctt ctcaccttctctgctccaaagagatatctgttacatcagggaacaagtcctctaggtcag gcacttcctcctgaccagtgcaacgggcactccagcttctggcctcatagccactttgac cccttgattctgacatgtcctctggctcatgggtatgtcagaaataatggcacccatttt acaggtgcaaaaccaaaggaaaaaagtgaagtggccaatggcacaccacaaaatacaaga tgctggcgctcccggagcctccggggcaggagggaggcgtggcctcgggcggcccgcccc tttgatgtgcgccggcaccgctgcgattggacagtcgcttgtgacgttggggactgcggt gggctccgctgctgcagcagccgcagcgccggccgcggctccggctccggctccggctcc cgggcatttaaaggggacgcggcggctgcccgggggggatga