GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:13:12 Sequence gi568815590r:73452009_73788815 : 336807 bp : 38.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14435 14483 49 1 1 70 82 45 0.363 3.26 1.02 Intr + 20037 20230 194 1 2 9 27 147 0.452 -1.01 1.03 Term + 21524 21685 162 1 0 51 48 160 0.621 5.25 1.04 PlyA + 21756 21761 6 1.05 2.07 PlyA - 22718 22713 6 1.05 2.06 Term - 25127 25011 117 2 0 62 49 97 0.004 0.76 2.05 Intr - 51943 51882 62 2 2 117 81 35 0.020 3.23 2.04 Intr - 56157 55936 222 1 0 18 78 111 0.089 0.28 2.03 Intr - 61686 61479 208 0 1 16 33 190 0.004 4.03 2.02 Intr - 75796 75683 114 1 0 76 75 98 0.128 7.02 2.01 Init - 100267 100004 264 1 0 59 80 160 0.340 9.36 2.00 Prom - 104908 104869 40 -6.45 3.00 Prom + 106871 106910 40 -5.85 3.01 Init + 107624 107641 18 1 0 102 77 46 0.264 3.46 3.02 Intr + 117611 117824 214 1 1 67 71 126 0.286 6.17 3.03 Term + 117909 118051 143 1 2 78 42 207 0.740 12.21 3.04 PlyA + 118088 118093 6 1.05 4.00 Prom + 119193 119232 40 -6.15 4.01 Init + 120009 120309 301 2 1 56 72 131 0.394 5.76 4.02 Term + 121007 121686 680 0 2 36 42 322 0.366 15.26 4.03 PlyA + 122896 122901 6 1.05 5.08 PlyA - 122975 122970 6 1.05 5.07 Term - 135576 135388 189 0 0 67 44 158 0.065 5.77 5.06 Intr - 143289 143158 132 0 0 84 116 116 0.990 13.92 5.05 Intr - 151855 151718 138 1 0 63 68 139 0.973 9.14 5.04 Intr - 161948 161736 213 2 0 74 115 154 0.999 14.69 5.03 Intr - 163774 163667 108 1 0 64 87 156 0.987 12.66 5.02 Intr - 165443 165284 160 1 1 119 64 95 0.950 9.27 5.01 Init - 181639 181569 71 1 2 35 77 59 0.114 0.07 5.00 Prom - 198453 198414 40 -5.05 6.00 Prom + 198583 198622 40 -6.75 6.01 Sngl + 199005 199784 780 2 0 80 55 568 0.970 48.44 6.02 PlyA + 200189 200194 6 1.05 7.00 Prom + 208650 208689 40 -4.15 7.01 Init + 221290 221332 43 0 1 59 116 44 0.547 4.93 7.02 Intr + 228404 228527 124 0 1 3 50 183 0.367 4.72 7.03 Term + 231583 231868 286 1 1 56 44 239 0.218 10.19 7.04 PlyA + 232287 232292 6 1.05 8.06 PlyA - 233369 233364 6 -0.45 8.05 Term - 234718 234660 59 2 2 57 50 61 0.695 -3.93 8.04 Intr - 236805 236646 160 0 1 54 80 192 0.486 13.64 8.03 Intr - 243775 243671 105 1 0 122 44 79 0.503 6.39 8.02 Intr - 257154 257024 131 1 2 63 116 78 0.503 7.59 8.01 Init - 276025 275926 100 1 1 59 61 88 0.346 3.67 8.00 Prom - 279557 279518 40 -4.95 9.00 Prom + 287817 287856 40 -5.25 9.01 Init + 294614 294766 153 1 0 43 30 189 0.839 6.63 9.02 Intr + 295075 295266 192 0 0 32 47 159 0.762 4.97 9.03 Term + 295388 295474 87 1 0 111 44 144 0.864 8.98 9.04 PlyA + 298019 298024 6 1.05 10.05 PlyA - 298783 298778 6 1.05 10.04 Term - 304292 304182 111 2 0 93 47 87 0.717 2.68 10.03 Intr - 304557 304473 85 2 1 86 49 25 0.028 -2.70 10.02 Intr - 326067 325905 163 1 1 121 43 37 0.016 0.61 10.01 Init - 332216 332159 58 2 1 58 103 59 0.360 5.92 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_1|134_aa MKEIIQEGSQEEKGGGGVEVQEGNAEWIRLSSVCGVIGQGGSVLELGGEGQICQNMRRKT DVLRRDDITFKIRLHEKLYSWTSDIWICDEARGLCEDWHLLPREALSLVLSCDLNIGLEG ERNGQRLGFASDAT >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_1|405_bp atgaaagaaataatccaggaaggaagccaggaagaaaaagggggtggaggagtggaggtg caggaagggaatgcagaatggattcgcctttcatcagtgtgtggtgtgataggacagggc ggtagtgtactagaattgggtggagagggacaaatctgtcaaaacatgagaagaaaaact gatgtgctaagaagagatgacattacgtttaaaataaggcttcatgaaaagctatacagt tggacctcagatatatggatatgcgatgaagccagagggctgtgtgaggactggcacctt cttccccgggaagccctcagcttagtgttgtcttgtgatctgaacattggcctggaagga gagagaaacggccagaggcttggatttgccagtgatgccacatga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_2|328_aa MEASRHKVISGTTLGYLSPKDMNQPSSSFFSISPTSNSSATIARELLMNGTSSTAEAIGL KGSSPTPPCSPVQPSKQLEYLARIQGFQVHYCDRQSGKECVTCLTLAPVQMTFHAIGSSI EASHDQLGPTVYQQTSQKSIREIWENGTVIGSLDKLTKTLQTQAGDIKEGSSYLYTLDDC EICTYAEKMQKGLAEISQVVGTEEKLLKEIRSVTPVNTQMIRKQNSLMADMEKVQVVWTE DQTSHSILLSQSLIQSKALTLFNSMKTEGDIFDGQQTIAFILIETESLPKCPQVALLSST PPPPTRLILPVLQCHRKCYPAVHVDYLI >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_2|987_bp atggaagccagccgccacaaagtaatctctggcactactctaggctatttgtcacccaaa gatatgaaccaaccttcaagctctttcttcagtatatctcccacatcgaatagttcagct acaattgccagggaactccttatgaatggaacatcttctacagctgaagccataggttta aaaggaagttctcctactcccccttgttctccagtacaaccttcaaaacaactggaatat ttagcaaggattcaaggctttcaggttcactactgtgatagacaaagtggcaaagagtgt gtgacctgtctgacattagcccctgtgcagatgactttccatgctattggaagctccatt gaagccagccatgatcagttggggccaacagtgtatcagcagactagccagaaatctatc agggagatttgggaaaatggcacagtgatagggagccttgataaactcaccaaaacactt caaacacaagcaggagatatcaaagagggctcaagctacctatacaccctggatgactgt gaaatctgcacatatgcagagaagatgcaaaaaggcctggcagaaattagccaagttgtg ggtacagaagaaaagctcttgaaagaaattagaagtgttactccagtgaatacacaaatg ataagaaagcaaaacagccttatggctgatatggagaaagttcaggtggtctggacagaa gatcaaaccagccacagcattctcttaagccaaagcctaatccagagcaaggccctaact ctcttcaattccatgaagactgagggagatatctttgatgggcagcagactattgctttc attctaatagaaactgagtccctccctaagtgtccccaggtggcactactctcttccact ccacccccacctacccggttaattcttccagttcttcagtgtcacaggaagtgttatcct gctgttcacgtggattatctcatttga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_3|124_aa MVMSLKERSGSNICHSAIFAVLQPPLVIPRQTGSGVDLQQTPTDLQLRVLTVGRKTNKQK GHPHQNPICTSPSSKTKGIQLLASNGKKLDGERLLQVERRRLQMIGNNKLLQAKGGCSNP LQRS >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_3|375_bp atggtcatgtccctgaaggaacgatcaggcagcaatatttgtcattctgcaatatttgct gttctgcagcctccactggtgatacccaggcaaacaggatctggagtggacctccagcaa actccaacagacctgcagctaagggtcctgactgttggaaggaaaactaacaaacagaaa ggacatccacaccaaaaccccatctgtacgtcaccatcatcaaagaccaaaggaatacag ctcctcgccagcaacggaaaaaagctggatggagaacgacttctacaagttgagagaaga aggcttcagatgattggtaacaacaaacttctccaagctaaaggaggatgttcaaaccca ttgcaaaggagctaa >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_4|326_aa MKRQKNQIDAIKNDKGDITTHPTEIQTTIREYYKHLYANKLENLEEMDRFLDTYTLLRLN QEELESLNRPITGSEIEAIINSYQSKKFQDQMDSQLNSTRAQNPLKLISNFSNVSGYKIN VQKSQAFLYTSNRQTESQIMSELPFTIASKRIQYLGIQLIRDVKDLFKENYKPVLNNIKE DTNKWKDIPCSWIGRINIVKMGILPKVIYRFNAIPIKLPMTFFTELEKPTLKFIWNQKRA CIAKSILSQKNKAGGIMLPDFKLYYRATVTKTAAYWYQNRDIDQWNRTEPSEIIPHIYNH LIFDKPAKNRNGERIPYSTNGAGKTG >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_4|981_bp atgaaaagacagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc catcccacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaa ctagaaaatctagaagaaatggatagatttctggacacatacaccctcctaagactaaac caggaagaacttgaatccctgaatagaccaataacaggctctgaaattgaggcaataatt aacagctaccaatcaaaaaaattccaggaccagatggattcacagctgaattctaccaga gcccaaaatccccttaagctgataagcaacttcagcaatgtctcaggatacaaaatcaat gtgcaaaaatcacaagcattcctatacaccagtaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaatacaatacctaggaatccaacttata agggatgtgaaggacctcttcaaggagaactacaaaccagtgctcaacaacataaaagag gacacaaacaaatggaaggacattccatgctcatggataggaagaatcaatatcgtgaaa atgggcatactgcccaaggtaatttatagattcaatgccattcccatcaagctaccaatg actttcttcacagaattggaaaaacctacgttaaagttcatatggaaccaaaaaagagcc tgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactaccgagctacagtaaccaaaacagcagcgtactggtaccaaaacaga gatatagaccaatggaacagaacagagccctcagaaataataccacacatttataaccat ctgatctttgacaaacctgccaaaaacagaaatggggagaggattccctattcaacaaat ggtgctgggaaaactggctag >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_5|336_aa MYFRYIGKDETTDLAHPHLVAASRYHCPVPKIFYVQLTVGNNEFFGEGKTRQAARHNAAM KALQALQNEPIPERSPQNGESGKDVDDDKDANKSEISLVFEIALKRNMPVSFEVIKESGP PHMKSFVTRVSVGEFSAEGEGNSKKLSKKRAATTVLQELKKLPPLPVVEKPKLFFKKRPK TIVKAGPEYGQGMNPISRLAQIQQAKKEKEPDYVLLSERGMPRRREFVMQVKVGNEVATG TGPNKKIAKKNAAEAMLLQLGYKASTNLQDQLEKSLPVKHIMQFFSFLSQRTPYPTLRAL YPPLVLQIDRYACCRAAVKVLLFTIILKISLASVLC >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_5|1011_bp atgtacttccggtatataggcaaagatgaaaccactgatctagcccatcctcatcttgta gccgcctcaaggtatcattgcccagtgcctaagatcttttatgttcagctcactgtagga aataatgaattttttggggaaggaaagactcgacaagctgctagacacaatgctgcaatg aaagccctccaagcactgcagaatgaacctattccagaaagatctcctcagaatggtgaa tcaggaaaggatgtggatgatgacaaagatgcaaataagtctgagatcagcttagtgttt gaaattgctctgaagcgaaatatgcctgtcagttttgaggttattaaagaaagtggacca ccacatatgaaaagctttgttactcgagtgtcagtaggagagttctctgcagaaggagaa ggaaatagcaaaaaactctccaagaagcgcgctgcgaccaccgtcttacaggagcttaaa aaacttccacctcttcctgtggtggaaaagccaaaactattttttaaaaaacgccctaaa acaatagtaaaggccggaccagaatatggccaagggatgaaccctattagccgcctggcg caaattcaacaggccaaaaaggaaaaggagccggattatgttttgctttcagaaagagga atgcctcgacgtcgagaatttgtgatgcaggtgaaggtaggcaatgaagttgctacagga acaggacctaataaaaagatagccaaaaaaaatgctgcagaagcaatgctgttacaactt ggttataaagcatccactaatcttcaggatcaacttgagaagtctttaccagtcaagcac atcatgcagtttttttcgttcctttcccaacgtacaccctacccaacccttagagctctt tatcctcctcttgtattgcagattgatcgatatgcctgctgcagagctgctgttaaggtt cttctcttcaccattatcctgaaaatttctctggcctctgtcctttgttga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_6|259_aa MRPSSSPPCGGQQPSSFGSVDWLSQSSCSGPTHTPRPAEVSLGSLPGPDQASGPEEPPQA VSIKEARSSNLPPPERAMAGLRKGPNAWRAPLCPHSLHHGAGPWLGGHLPAPPVPGPSGV EEAGQGNAAVGGPDKNLVSKSPHETRTANAGLPAEQPLLGVTARTPGFPLTVFWPCQWPA PAVPLGTPAWAPDSDAVPWLLLGSLQVEQEALASEWACCCGQPLAYHPPKPRKWHAYAGT SPVHGALGPGVLCRRRDAF >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_6|780_bp atgcgtccctcctcctccccaccttgtggcgggcagcagccctctagctttggctccgtg gactggctctcccaaagcagctgctcagggccgacccacacccccaggcctgccgaggtc tccctggggagcctccctggcccggaccaggcatcgggccccgaggagccccctcaggcc gtcagcatcaaggaggctaggtcctcaaatctgcctccgccagagagggctatggctggg ttgaggaaagggccaaacgcctggcgggccccactgtgtccacacagccttcaccacgga gcaggtccgtggcttggagggcatcttccagcaccaccagtacctgggccctctggagtg gaagaggctggccagggaaatgcagctgtcggaggtccagataaaaacctggtttcaaaa tcgccgcatgaaacacgaacggcaaatgcaggactcccagctgaacagcctcttctcgga gtcactgcacgcacccctggctttccactcaccgtcttctggccttgccaatggcctgca cctgctgtgcccttgggcacccctgcctgggccccagactctgatgctgtcccctggctc cttctggggtctctgcaagtggaacaagaggccctggcctctgagtgggcctgctgctgt gggcagcctctggcgtaccaccccccaaagcccaggaagtggcatgcatacgctgggacc agccctgtccacggggccctggggcctggtgtgctctgccggagacgggatgcattttga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_7|150_aa MQAIFIQRLNTMEEGKRGENHIKGSSRGIKESEQQIQDLASDTVYPNEKKPEKQFCTGSP SRSNRQEKEIKGIQIDKEEVKLLLFADDMIVYLENPKDSTKKLLDLINEFSKVSRYKINV HKSVALLYTNSDQAENQIKNSNPFTIAAKK >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_7|453_bp atgcaagctatatttatacaacgcttaaataccatggaagaagggaaacggggagagaac cacatcaagggatcatcccgtgggataaaagaatctgaacagcagattcaagatcttgcc tctgacacagtatatccaaatgagaagaaaccagaaaaacaattctgtactggaagtcct agccggagcaatcgacaagagaaagaaataaagggcatccaaatcgataaagaggaagtc aaactgttgctgtttgctgatgatatgatcgtatacctagaaaaccctaaagactcaacc aaaaagctcctagatctgataaatgaattcagtaaagtttcaagatacaaaataaacgta cacaagtcagtagctctgctatacaccaacagcgaccaagctgagaatcaaatcaagaac tcaaaccctttcacaatagctgcaaaaaaatga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_8|184_aa MGKGLGIDSYSEKINTSLILNIISDQGDANENHTSLQDKMANPKEKTAMCLVNELARFNR VQPQYKLLNERGPAHSKAEEPLLVWPPTPPLVHGLILQGSGLPEVDPEMLSSMFSVQLSL GEQTWESEGSSIKKAQQAVANKALTESTLPKPVQKPPKSNVNNNPVFGEQVVFGYIDKFF RDDF >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_8|555_bp atgggcaaaggacttggaatagacagttactctgagaagataaacacatcattaatactc aacatcattagtgatcagggagatgcaaatgaaaaccacacttctctccaagataaaatg gcaaacccaaaagagaaaactgcaatgtgtctggtaaatgagttagcccgtttcaataga gtccaaccccagtataaacttctgaatgaaagagggcctgctcattcaaaggcagaggaa cctctccttgtgtggccaccaacaccaccactggtccatgggctcattcttcagggcagt gggctccctgaagtagatccggaaatgctgtccagcatgttctcagtgcagctgagtctt ggtgagcagacatgggaatccgaaggcagcagtataaagaaggctcagcaggctgttgcc aataaagctttgactgaatctacgcttcccaaaccagttcagaagccacccaaaagtaat gttaacaataacccagtttttggggaacaggtggtattcggttacattgataaattcttc cgtgatgatttctga >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_9|143_aa MWEGRRGGGEAAKGAAFLRVPRAAELQGSPAPSAFFAGGAEVGVSRTPPMRLPPPGGCRA PAAAGSAGLAAASWAPGRGPFPGGLGTRASDCARSAPGPQSPRPRRDSPPPRLPRALRLP LCAVQGTRDHPASAPAAAVQRTV >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_9|432_bp atgtgggagggaaggcgcgggggaggggaggccgcgaagggggctgcttttctccgggtt ccccgtgctgcggaactgcagggctccccagcgccctctgccttcttcgccggcggcgcg gaagtcggggtctcccggacgcccccgatgaggctcccgccccctggcggctgccgggcc ccagcagctgcaggctctgcggggctagcggcggcgagctgggcccctgggcgagggcca ttcccggggggcttgggcacgcgggcgagcgactgcgcaaggagcgcgcccggtccgcag tctcctcgtccccggcgcgactccccgccccctcgtctgccaagggctctccggctgccc ctctgtgctgtgcaggggacgcgcgaccatcccgcgtctgctccggccgccgctgtgcaa cgcacggtttag >gi568815590r:73452009_73788815|GENSCAN_predicted_peptide_10|138_aa MESKKEFREHDKLQDEYLPGFRSASLFGFLSPILFLCLSQTYSPNKPLALLTPVSASWRT QINTVGTGCGARGRCWDYRKMCGVLNVILRSSDVPSTERKDQPYPSNICRYIEKPHVGVL VDSPSCGPSGKPSSTIDE >gi568815590r:73452009_73788815|GENSCAN_predicted_CDS_10|417_bp atggagtccaagaaagaattcagggaacatgacaaactgcaggatgagtatttgccaggt ttcagatctgcatcattgtttggttttcttagcccaatcctcttcctctgtctttcacaa acttactcccctaacaaacctcttgccctcctaactccagtgtctgcttcctggaggacc caaatcaacacagttggcactgggtgtggtgcacgaggaaggtgctgggattacaggaaa atgtgtggagttttaaatgtgattctaagatcctctgacgtgccttccacggaaagaaag gatcagccctaccccagcaacatctgcaggtacatagaaaagccacatgtgggtgttctg gttgacagccccagctgcggtcccagtggaaaaccatcatcaactatagatgagtga