GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:39:07 Sequence gi568815581r:49106760_49307422 : 200663 bp : 47.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 25964 26047 84 1 0 32 100 162 0.419 11.82 1.02 Intr + 34513 34688 176 0 2 44 85 96 0.244 3.64 1.03 Intr + 35276 35413 138 2 0 86 80 110 0.281 9.58 1.04 Intr + 46041 46147 107 0 2 125 111 73 0.960 13.16 1.05 Intr + 49807 49844 38 2 2 110 105 41 0.577 5.98 1.06 Intr + 52278 52448 171 2 0 96 69 194 0.669 18.44 1.07 Intr + 52960 53035 76 0 1 31 85 90 0.672 2.09 1.08 Intr + 53796 53882 87 1 0 101 110 78 0.999 11.24 1.09 Intr + 57329 57516 188 0 2 80 95 244 0.999 23.71 1.10 Intr + 59355 59495 141 2 0 89 79 241 0.998 23.85 1.11 Intr + 61922 62141 220 1 1 57 75 165 0.897 9.97 1.12 Intr + 62764 62912 149 2 2 134 9 102 0.007 6.85 1.13 Intr + 65401 65541 141 0 0 97 39 59 0.005 2.45 1.14 Intr + 76184 76318 135 1 0 72 69 129 0.524 10.16 1.15 Intr + 76655 76805 151 1 1 71 66 43 0.287 0.14 1.16 Term + 85513 85790 278 2 2 -13 42 213 0.450 2.22 1.17 PlyA + 86029 86034 6 1.05 2.03 PlyA - 87649 87644 6 1.05 2.02 Term - 100123 99998 126 1 0 91 45 138 0.682 8.08 2.01 Init - 100663 100580 84 1 0 96 105 156 0.998 18.92 2.00 Prom - 101051 101012 40 -11.82 3.00 Prom + 102309 102348 40 -7.96 3.01 Init + 103966 104082 117 0 0 90 96 202 0.938 21.40 3.02 Intr + 104348 104456 109 1 1 89 68 5 0.417 -1.54 3.03 Intr + 107114 107222 109 0 1 70 95 37 0.504 1.94 3.04 Intr + 109714 109939 226 1 1 80 110 300 0.531 29.39 3.05 Intr + 110980 111156 177 0 0 74 75 282 0.965 25.62 3.06 Intr + 112781 112866 86 1 2 109 60 51 0.963 3.02 3.07 Intr + 113099 113194 96 2 0 98 80 117 0.999 11.02 3.08 Intr + 113482 113567 86 1 2 65 90 82 0.806 5.56 3.09 Intr + 115332 115466 135 1 0 72 77 90 0.960 6.84 3.10 Term + 115793 115956 164 0 2 100 48 243 0.999 19.60 3.11 PlyA + 116237 116242 6 1.05 4.02 PlyA - 116628 116623 6 -3.94 4.01 Sngl - 118236 117487 750 0 0 96 47 1468 0.431 139.68 4.00 Prom - 121320 121281 40 -3.56 5.07 PlyA - 121571 121566 6 -0.45 5.06 Term - 124159 123940 220 0 1 48 55 128 0.106 2.01 5.05 Intr - 151150 151004 147 0 0 109 95 -10 0.002 1.15 5.04 Intr - 183814 183648 167 1 2 125 66 34 0.308 3.66 5.03 Intr - 192165 191873 293 1 2 90 69 251 0.715 20.25 5.02 Intr - 195021 194888 134 2 2 72 70 -47 0.164 -7.81 5.01 Intr - 198930 198876 55 1 1 84 98 100 0.592 8.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 45523 45638 116 0 2 68 80 17 0.883 -3.26 S.002 Term + 62764 62969 206 2 2 134 43 150 0.989 12.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:49106760_49307422|GENSCAN_predicted_peptide_1|759_aa GGAVGGLAARLRDIRQSEGGGIRDDFGRILVIILVLGIVGFMFGSMFLQAVFSSPKPELP SPAPGVQKLKLLPEERLRNLFSYDGIWLFPKNQCKCEANKEQGGYNFQDAYGQSDLPAVK ARRQAEFEHFQRREGLPRPLPLLVQPNLPFGYPVHGVEVMPLHTVPIPGLQFEGPDAPVY EVTLTASLGTLNTLADVPDSVVQGRGQKQLIISTSDRKLLKFILQHVTYTSTGYQHQKYY PHIAIDPIEPLEERTNHAVRFAPVSLESRSSVAKFPVTIRHPVIPKLYDPGPERKLRNLV TIATKTFLRPHKLMIMLRSIREYYPDLTVIVADDSQKPLEIKDNHVEYYTMPFGKGWFAG RNLAISQVTTKYVLWVDDDFLFNEETKIEVLVDVLEKTELDVVGGSVLGNVFQFKLLLEQ SENGACLHKRMGFFQPLDGFPSCVVTSGVVNFFLAHTERLQRVGFDPRLQRVAHSEFFID GLGTLLVGSCPEVIIGHQSRSPVVDSELAALEKTYNTYRSNTLTRVGGCVRQLLLMIVGV LLSISLDMAATRGSSGSSWNLFLSIWLVARFQVHRYQQHENGLIQKVGVTGGQFFWLEIS VATVPLPEFLSCIRKNELFVLSCPVSLPSLSSGHPLPALAEPRAFMDLKEEEVHADWSMS SHGQAGRVRPHIANPASLLPFRPPSPRRSAGRHDIPGTRNAGNTKDAEALATPPVGPRSR AQSLKHGHRCRSGKAAQSPPSLRRLAAALRSQQVPWCNG >gi568815581r:49106760_49307422|GENSCAN_predicted_CDS_1|2280_bp ggcggggcagtcggcggcctggctgctaggctccgtgacatccggcagtctgagggcggc gggattcgggatgacttcgggcggatattggtcataatcctggtacttggcattgttgga tttatgttcggaagcatgttccttcaagcagtgttcagcagccccaagccagaactccca agtcctgccccgggtgtccagaagctgaagcttctgcctgaggaacgtctcaggaacctc ttttcctacgatggaatctggctgttcccgaaaaatcagtgcaaatgtgaagccaacaaa gagcagggaggttacaactttcaggatgcctatggccagagcgacctcccagcggtgaaa gcgaggagacaggctgaatttgaacactttcagaggagagaagggctgccccgcccactg cccctgctggtccagcccaacctcccctttgggtacccagtccacggagtggaggtgatg cccctgcacacggttcccatcccaggcctccagtttgaaggacccgatgcccccgtctat gaggtcaccctgacagcttctctggggacactgaacacccttgctgatgtcccagacagt gtggtgcagggcagaggccagaagcagctgatcatttctaccagtgaccggaagctgttg aagttcattcttcagcacgtgacatacaccagcacggggtaccagcaccagaagtactat ccccacattgccattgaccccatagagccactcgaggagagaacaaatcatgctgtgcgc tttgcacctgtgagtctggagtccaggtcctcagtggccaagtttccagtgaccatccgc catcctgtcatacccaagctatacgaccctggaccagagaggaagctcagaaacctggtt accattgctaccaagactttcctccgcccccacaagctcatgatcatgctccggagtatt cgagagtattacccagacttgaccgtaatagtggctgatgacagccagaagcccctggaa attaaagacaatcacgtggagtattacactatgccctttgggaagggttggtttgctggt aggaacctggccatatctcaggtcaccaccaaatacgttctctgggtggacgatgatttt ctcttcaacgaggagaccaagattgaggtgctggtggatgtcctggagaaaacagaactg gacgtggtaggcggcagtgtgctgggaaatgtgttccagtttaagttgttgctggaacag agtgagaatggggcctgccttcacaagaggatgggatttttccaacccctggatggcttc cccagctgcgtggtgaccagtggcgtggtcaacttcttcctggcccacacggagcgactc caaagagttggctttgatccccgcctgcaacgagtggctcactcagaattcttcattgat gggctagggaccctactcgtggggtcatgcccagaagtgattataggtcaccagtctcgg tctccagtggtggactcagaactggctgccctagagaagacctacaatacataccggtcc aacaccctcacccgggtaggtggctgtgttcgacagctgttgctcatgatagttggggtc ctcctcagcatcagtcttgacatggctgcaaccaggggatcctcgggatcctcctggaat ctcttcctcagcatctggctcgtggcaaggtttcaggtacatcgttatcagcagcatgaa aacggactaatacagaaagttggtgttactgggggtcaatttttctggctggaaatctct gtagccacagtgcctttgcctgagttcttgtcctgcatccggaagaatgagctgttcgtc ctgtcctgtcctgtctccctgccctctttgtcctctggccatcctctgcctgctctggct gagcccagggcttttatggacctcaaagaggaggaagtgcatgccgattggtccatgagc agccatgggcaggccggaagagtccggccccacatcgccaaccccgccagcctgctgcct ttccgacctcccagcccccggaggagcgcaggaagacatgatatccccgggaccagaaac gccgggaacacgaaggacgcagaagccctcgccacgccgcccgtggggccccggtccagg gcccagtccctcaaacacggacaccgctgccgctctggaaaggccgcccagagccctcct agcttgaggagactcgcggccgccttaagaagccagcaggtcccatggtgtaatggttag >gi568815581r:49106760_49307422|GENSCAN_predicted_peptide_2|69_aa MAQDLSEKDLLKMEVEQLKKEVKNTRIPISKAGKEIKEYVEAQAGNDPFLKGIPEDKNPF KEKGGCLIS >gi568815581r:49106760_49307422|GENSCAN_predicted_CDS_2|210_bp atggcccaggatctcagcgagaaggacctgttgaagatggaggtggagcagctgaagaaa gaagtgaaaaacacaagaattccgatttccaaagcgggaaaggaaatcaaggagtacgtg gaggcccaagcaggaaacgatccttttctcaaaggcatccctgaggacaagaatcccttc aaggagaaaggtggctgtctgataagctga >gi568815581r:49106760_49307422|GENSCAN_predicted_peptide_3|434_aa MAELQQLQEFEIPTGREALRGNHSALLRVADYCEDNYVQVSPQVHPSEFPRVNPETQSLR PHTDAGDHSISRPALVKEGPELNQNLLSCLVLISWVDFFAYSIFLVTYAEEIQPRTQWKM LALRPGLCVFQATDKRKALEETMAFTTQALASVAYQVGNLAGHTLRMLDLQGAALRQVEA RVSTLGQMVNMHMEKVARREIGTLATVQRLPPGQKVIAPENLPPLTPYCRRPLNFGCLDD IGHGIKDLSTQLSRTGTLSRKSIKAPATPASATLGRPPRIPEPVHLPVVPDGRLSAASSA FSLASAGSLDPPPPPAAVEVFQRPPTLEELSPPPPDEELPLPLDLPPPPPLDGDELGLPP PPPGFGPDEPSWVPASYLEKVVTLYPYTSQKDNELSFSEGTVICVTRRYSDGWCEGVSSE GTGFFPGNYVEPSC >gi568815581r:49106760_49307422|GENSCAN_predicted_CDS_3|1305_bp atggcggagctacagcagctgcaggagtttgagatccccactggccgggaggctctgagg ggcaaccacagtgccctgctgcgggtcgctgactactgcgaggacaactatgtgcaggtg tcacctcaggttcatccttctgagttcccacgggtcaatccagagacccaaagcctccgt cctcacacagatgctggcgatcattctatttccaggccggctctggtaaaggaaggtcct gaactaaatcaaaatctgctctcctgtctggtgttaatatcctgggttgatttctttgcc tattcaatcttcttggtgacctatgctgaggagatccagccaaggacccagtggaagatg ctggcccttaggccagggctgtgtgtatttcaggccacagacaagcggaaggcgctggag gagaccatggccttcactacccaggcactggccagcgtggcctaccaggtgggcaacctg gccgggcacactctgcgcatgttggacctgcagggggccgccctgcggcaggtggaagcc cgtgtaagcacgctgggccagatggtgaacatgcatatggagaaggtggcccgaagggag atcggcaccttagccactgtccagcggctgccccccggccagaaggtcatcgccccagag aacctaccccctctcacgccctactgcaggagacccctcaactttggctgcctggacgac attggccatgggatcaaggacctcagcacgcagctgtcaagaacaggcaccctgtctcga aagagcatcaaggcccctgccacacccgcctccgccaccttggggagaccaccccggatt cccgagccagtgcacctgccggtggtgcccgacggcagactctccgccgcctcctctgcg ttttccctggcctcggccggctccttggacccacctcctccaccagcagccgtcgaggtg ttccagcggcctcccacgctggaggagttgtccccacccccaccggacgaagagctgccc ctgccactggacctgcctcctcctccacccctggatggagatgaattggggctgcctcca cccccaccaggatttgggcctgatgagcccagctgggtgcctgcctcatacttggagaaa gtggtgacactgtacccatacaccagccagaaggacaatgagctctccttctctgagggc actgtcatctgtgtcactcgccgctactccgatggctggtgcgagggcgtcagctcagag gggactggattcttccctgggaactatgtggagcccagctgctga >gi568815581r:49106760_49307422|GENSCAN_predicted_peptide_4|249_aa MAAQGAPRFLLTFDFDETIVDENSDDSIVRAAPGQRLPESLRATYREGFYNEYMQRVFKY LGEQGVRPRDLSAIYEAIPLSPGMSDLLQFVAKQGACFEVILISDANTFGVESSLRAAGH HSLFRRILSNPSGPDARGLLALRPFHTHSCARCPANMCKHKVLSDYLRERAHDGVHFERL FYVGDGANDFCPMGLLAGGDVAFPRRGYPMHRLIQEAQKAEPSSFRASVVPWETAADVRL HLQQVLKSC >gi568815581r:49106760_49307422|GENSCAN_predicted_CDS_4|750_bp atggccgcgcagggcgcgccgcgcttcctcctgaccttcgacttcgacgagactatcgtg gacgaaaacagcgacgattcgatcgtgcgcgccgcgccgggccagcggctcccggagagc ctgcgagccacctaccgcgagggcttctacaacgagtacatgcagcgcgtcttcaagtac ctgggcgagcagggcgtgcggccgcgggacctgagcgccatctacgaagccatccctttg tcgccaggcatgagcgacctgctgcagtttgtggcaaaacagggcgcctgcttcgaggtg attctcatctccgatgccaacacctttggcgtggagagctcgctgcgcgccgccggccac cacagcctgttccgccgcatcctcagcaacccgtcggggccggatgcgcggggactgctg gctctgcggccgttccacacacacagctgcgcgcgctgccccgccaacatgtgcaagcac aaggtgctcagcgactacctgcgcgagcgggcccacgacggcgtgcacttcgagcgcctc ttctacgtgggcgacggcgccaacgacttctgccccatggggctgctggcgggcggcgac gtggccttcccgcgccgcggctaccccatgcaccgcctcattcaggaggcccagaaggcc gagcccagctcgttccgcgccagcgtggtgccctgggaaacggctgcagatgtgcgcctc cacctgcaacaggtgctgaagtcgtgctga >gi568815581r:49106760_49307422|GENSCAN_predicted_peptide_5|338_aa XAYSLVNGNDGVDDDDNSSAGITGLKQHTLPEVEVFKTVRGNRLKRRKTIWAQNSSRKMN MSHREKPFICEICGKSFTSRPNMKRHRRTHTGEKPYPCDVCGQRFRFSNMLKAHKEKCFR VTSPVNVPPAVQIPLTTSPATPVPSVVNTATTPTPPINMNPATTITSIGGTRALGELWSM SKEMLLTFSAPKRYLLHQGTSPLGQALPPDQCNGHSSFWPHSHFDPLILTCPLAHGYVRN NGTHFTGAKPKEKSEVANGTPQNTRCWRSRSLRGRREAWPRAARPFDVRRHRCDWTVACD VGDCGGLRCCSSRSAGRGSGSGSGSRAFKGDAAAARGG >gi568815581r:49106760_49307422|GENSCAN_predicted_CDS_5|1017_bp ngagcttatagtctggtgaatggtaatgatggtgttgatgatgatgataatagcagtgct gggattacaggcttgaaacaacacaccctgccagaagttgaagtttttaagacggttaga ggaaatcgattaaaaagaaggaaaacaatctgggctcagaattcatccagaaagatgaat atgagtcacagagagaaaccctttatctgtgaaatctgtggcaaaagcttcaccagccgc cccaacatgaagagacaccgcagaactcacacaggcgagaagccctatccatgtgatgtg tgtggccagcggttccgcttctcgaacatgcttaaggcccacaaggagaagtgctttcgg gtgaccagccccgtgaatgtgccacctgctgtccagatcccacttacaacttccccagcc accccagttccttctgtggtgaacacagccacaaccccaacccctccaatcaatatgaat cctgccaccaccatcacatctataggaggaacaagagcactgggggaactctggagtatg agtaaggaaatgcttctcaccttctctgctccaaagagatatctgttacatcagggaaca agtcctctaggtcaggcacttcctcctgaccagtgcaacgggcactccagcttctggcct catagccactttgaccccttgattctgacatgtcctctggctcatgggtatgtcagaaat aatggcacccattttacaggtgcaaaaccaaaggaaaaaagtgaagtggccaatggcaca ccacaaaatacaagatgctggcgctcccggagcctccggggcaggagggaggcgtggcct cgggcggcccgcccctttgatgtgcgccggcaccgctgcgattggacagtcgcttgtgac gttggggactgcggtgggctccgctgctgcagcagccgcagcgccggccgcggctccggc tccggctccggctcccgggcatttaaaggggacgcggcggctgcccgggggggatga