GENSCAN 1.0	Date run: 24-Oct-119	Time: 21:52:49

Sequence gi568815593f:149497770_149732188 : 234419 bp : 43.78% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 Intr -   7826   7678  149  1  2   70   96    65 0.829   5.38
 1.07 Intr -   9364   9258  107  1  2   57   82    60 0.892   1.31
 1.06 Intr -  12184  12110   75  1  0   95  115    34 0.898   6.51
 1.05 Intr -  14103  14025   79  2  1   90   92   -44 0.784  -4.25
 1.04 Intr -  15440  15301  140  2  2   57  107   103 0.949   8.36
 1.03 Intr -  22619  22521   99  2  0  133   96   -20 0.912   3.61
 1.02 Intr -  27402  27276  127  2  1  106   91    27 0.895   5.48
 1.01 Init -  30814  30786   29  1  2   96   60    23 0.597  -0.63
 1.00 Prom -  42463  42424   40                              -3.86

 2.07 PlyA -  45800  45795    6                               1.05
 2.06 Term -  48062  47793  270  2  0   39   38   217 0.709   7.28
 2.05 Intr -  52052  51945  108  2  0   77   50    97 0.324   5.28
 2.04 Intr -  52181  52169   13  1  1   47   95     7 0.498  -7.92
 2.03 Intr -  52412  52306  107  2  2  137  107    98 0.730  15.61
 2.02 Intr -  53196  53073  124  2  1  100   74   200 0.847  20.39
 2.01 Init -  65292  65138  155  0  2   90   75    36 0.461   1.96
 2.00 Prom -  68833  68794   40                              -3.16

 3.00 Prom +  69100  69139   40                              -5.76
 3.01 Init +  76819  77005  187  0  1   44   66   107 0.705   3.32
 3.02 Intr +  83688  83855  168  1  0   35  100    72 0.340   3.02
 3.03 Term +  84611  84678   68  0  2   39   29   138 0.408   1.10
 3.04 PlyA +  84802  84807    6                               1.05

 4.03 PlyA -  85877  85872    6                               1.05
 4.02 Term - 100760 100485  276  2  0  -52   47   519 0.548  29.26
 4.01 Init - 101518 101483   36  1  0   51   35    62 0.535  -4.06
 4.00 Prom - 102621 102582   40                              -3.56

 5.00 Prom + 102631 102670   40                             -10.74
 5.01 Init + 102690 102695    6  2  0   79   66     4 0.547  -1.96
 5.02 Intr + 103339 103462  124  0  1  110   99   111 0.905  14.56
 5.03 Intr + 111779 111926  148  0  1  129   59   270 0.743  27.49
 5.04 Intr + 118798 118997  200  1  2   76   82   267 0.587  23.89
 5.05 Intr + 120407 120537  131  0  2  108   84   204 0.996  22.41
 5.06 Intr + 121169 121273  105  1  0   48   94   157 0.991  12.71
 5.07 Intr + 122585 122695  111  1  0   43   59    82 0.721   1.38
 5.08 Intr + 123964 124293  330  0  0  108  110   353 0.981  35.33
 5.09 Intr + 126243 126371  129  2  0   70   82    64 0.848   4.89
 5.10 Intr + 129307 129502  196  0  1   80   71   256 0.965  22.09
 5.11 Intr + 131040 131197  158  1  2   87  109    60 0.996   7.73
 5.12 Intr + 134213 134351  139  1  1  108  -14   190 0.468  10.84
 5.13 Intr + 137340 137453  114  1  0   65   86    44 0.268   2.32
 5.14 Term + 137772 137806   35  1  2  110   37    22 0.360  -2.95
 5.15 PlyA + 138429 138434    6                               1.05

 6.07 PlyA - 138691 138686    6                               1.05
 6.06 Term - 140726 140391  336  2  0   61   49   134 0.665   1.37
 6.05 Intr - 143379 143230  150  0  0   53  119    66 0.882   6.56
 6.04 Intr - 146913 146820   94  2  1   42   65   100 0.562   3.07
 6.03 Intr - 158945 158913   33  1  0   98   97    18 0.008   1.04
 6.02 Intr - 203155 203019  137  1  2  109   29    74 0.044   2.97
 6.01 Init - 203303 203241   63  2  0   97   84     5 0.805   2.15
 6.00 Prom - 224153 224114   40                              -3.46

 7.00 Prom + 226275 226314   40                              -3.66
 7.01 Init + 232574 232651   78  1  0   85   96   150 0.456  16.56
 7.02 Intr + 232770 232932  163  2  1    6   38   151 0.238   1.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_1|269_aa
MEVKVETKNKWYGQEKDYNVLVMDLLGPSLEDLFNFCSRRFTMKTVLMLADQMISRIEYV
HTKNFIHRDIKPDNFLMGIGRHCNKLFLIDFGLAKKYRDNRTRQHIPYREDKNLTGTARY
ASINAHLGIEQSRRDDMESLGYVLMYFNRTSLPWQGLKAATKKQKYEKISEKKMSTPVEV
LCKGFPAEFAMYLNYCRGLRFEEAPDYMYLRQLFRILFRTLNHQYDYTFDWTMLKQKAAQ
QAASSSGQGQQAQTPTGKQTDKTKSNMKX

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_1|807_bp
atggaggtgaaggtagaaactaagaataagtggtatggtcaggaaaaagactacaatgta
ctagtcatggatcttctgggacctagcctcgaagacctcttcaatttctgttcaagaagg
ttcacaatgaaaactgtacttatgttagctgaccagatgatcagtagaattgaatatgtg
catacaaagaattttatacacagagacattaaaccagataacttcctaatgggtattggg
cgtcactgtaataagttattccttattgattttggtttggccaaaaagtacagagacaac
aggacaaggcaacacataccatacagagaagataaaaacctcactggcactgcccgatat
gctagcatcaatgcacatcttggtattgagcagagtcgccgagatgacatggaatcatta
ggatatgttttgatgtattttaatagaaccagcctgccatggcaagggctaaaggctgca
acaaagaaacaaaaatatgaaaagattagtgaaaagaagatgtccacgcctgttgaagtt
ttatgtaaggggtttcctgcagaatttgcgatgtacttaaactattgtcgtgggctacgc
tttgaggaagccccagattacatgtatctgaggcagctattccgcattcttttcaggacc
ctgaaccatcaatatgactacacatttgattggacaatgttaaagcagaaagcagcacag
caggcagcctcttccagtgggcagggtcagcaggcccaaacccccacaggcaagcaaact
gacaaaaccaagagtaacatgaaagnn

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_2|258_aa
MKMGADSQNCGNSGTTGSQTLMEHQAAELRFIELQMNYVNEPRGEGDDLSQMMASSSGSK
AEFIVGGKYKLVRKIGSGSFGDIYLAINITNGEEVAVKLESQKARHPQLLYESKLYKILQ
GGVGIPHIRWWFPLCVQVIFDPYAEGLAAAFFTAALLMYLGLSSEVDLKALGEPRLKVQT
WPSPRTTPHTTSPENGTEMVGSRFLRNMHFAKKHKKGLKKMQANNAKAMSALAEASRALV
KPKEVKPKIPKGVSCELD

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_2|777_bp
atgaagatgggtgctgactctcaaaactgtgggaattcaggcacaactggaagtcagacc
ctgatggaacaccaagcggcagaattgagatttattgagctgcagatgaactatgtaaat
gagccaaggggggagggggatgatttaagtcaaatgatggcgagtagcagcggctccaag
gctgaattcattgtcggagggaaatataaactggtacggaagatcgggtctggctccttc
ggggacatctatttggcgatcaacatcaccaacggcgaggaagtggcagtgaagctagaa
tctcagaaggccaggcatccccagttgctgtacgagagcaagctctataagattcttcaa
ggtggggttggcatcccccacatacgctggtggtttcctttatgcgtgcaggtcatcttt
gatccctacgcggaagggttggctgctgccttttttacggcggcacttctgatgtactta
ggtttgtccagtgaagtggatttgaaggcacttggagagccgcggcttaaggtacagaca
tggccaagtccacgaaccacaccacacacaaccagtcctgaaaatggcacagaaatggtg
ggctccaggttcctgaggaacatgcactttgccaagaagcacaagaagggcctaaagaag
atgcaggccaacaatgccaaggcaatgagtgcacttgctgaggctagcagggccctcgta
aagcccaaggaggttaagcccaagatcccaaagggtgtcagctgcgaactcgattga

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_3|140_aa
MYNGYASDLLVYIASPNFPLSEPYIHLPVDTFTCIAQVSASPKFVSSIVHLTIQDRNPRV
SPGSCVGIAAPASALGGHSHPERRGCRELGDAPLLPRLRCCPGGRPPCVSAGRHLARAGR
QLVEEYLLCVQTASCVIQEA

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_3|423_bp
atgtacaacggatatgccagtgacctcctagtctatattgccagtccaaacttccctctc
tcagaaccttacatccatcttcctgttgacaccttcacttgcatagcacaggtatcagct
agcccaaaatttgtctcatccatcgtccacctgactatccaagacagaaatcccagagtc
agcccaggcagctgcgtgggcatagccgcgcccgccagcgccctcggtgggcacagccac
ccggagcggcgcgggtgccgggagctgggcgacgcgcccctcctgccccggctgcgctgt
tgcccgggcggccgacctccgtgcgtgagcgccggcaggcaccttgcgcgcgcggggagg
cagctggtggaggagtacctgctgtgcgtccagacagccagctgtgtgatccaggaagca
tag

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_4|103_aa
MTPVFGGLGAVGTLEGEAGKKEEEEEEEEEEKERRGRRKKRKEKKEKEKKEKRKRKERKR
KKRKRKKRKRKKKRKRKRKKEEEEEEEEEEEEEEEEEEEEESV

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_4|312_bp
atgacacctgtcttcgggggcctgggggctgtgggcacccttgaaggggaagcaggaaaa
aaagaagaagaggaggaagaagaagaagaggaaaaagaaagaagaggaagaagaaagaag
aggaaggagaagaaggagaaggagaagaaggagaagaggaagaggaaggagaggaagagg
aagaagaggaagaggaagaagaggaagaggaagaagaagaggaagaggaagaggaagaaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gagtcagtttaa

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_5|641_aa
MVLPQGDLDVLFSNIDDIIKVNSRFLHDLQETASKEEEQVQLVGNIFLEFQEELEQVYKV
YCASYDQALLLVDTYRKEPELQRHIQGIVEAVVPQAGSSGLSFLLVIPLQRITRYPLLLQ
KILENTVPDASAYPVLQRAVSALQDVNTNINEYKMRKEVASKYTKVEQLTLRERLARINT
HTLSKKTTRLSQLLKQEAGLIPRTEDKEFDDLEERFQWVSLCVTELKNNVAAYLDNLQAF
LYFRPHEYNLDIPEGPAVQYCNLARDLHLEAFLKFKQRLEGLVWQPLCSLAKALLGPQNL
IKKRLDKLLDFERVEEKLLEVGSVTYQEEAARHTYQALNSLLVAELPQFNQLVMQWLGQI
MCTFVTLQRDLAKQVLQRAEGSMAQLPHHHVPEPAFRKLVEDALGRTSNQLRSFQETFEK
VQPPPTTQPLLPGSERQVQALLSRYGPGKLYQVTSNISGTGTLDLTLPRGQIVAILQNKD
TKGNSGRWLVDTGGHRGYVPAGKLQLYHVVPSAEELRRQAGLNKDPRCLTPEPSPALVPS
IPTMNQVIAAYPFVARSSHEVSLQAGQPVTILEAQDKKGNPEWSLVEVNGQRGSWPELLG
DTERNQTDEARSSRSQSLRQADEAPINYVPCNCMLLEAENF

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_5|1926_bp
atggtgttgccgcagggagatctggatgtcctgttctcaaacattgatgatatcatcaaa
gtgaacagcagattcctccatgatctgcaggagacagcctccaaggaagaggaacaagtg
cagctagttggtaacatatttctggaattccaagaggagttggagcaagtctataaggtc
tactgtgccagctacgaccaggccttgctactggtggacacgtaccggaaggagccggag
ctgcagcggcacatccagggcatcgttgaggcggtggtgccgcaagctggatcttcaggc
ctcagtttcttgctggtaattcctctgcagaggatcaccaggtacccactgctgctgcag
aaaatcctggagaacacagtccctgatgccagtgcctatcctgtccttcagagggctgtc
tctgccctccaggacgtgaacaccaatatcaatgagtacaagatgcgcaaggaagtggcc
tccaagtacaccaaggtagagcagctgaccctccgggagcggctggcccgcatcaacaca
cacaccctctccaagaagaccacccggctgagccagctgctgaagcaggaggcggggctg
atccccaggacagaagacaaggaatttgatgatttagaagagaggttccagtgggtgtct
ctgtgtgtgactgagctgaagaacaacgtggctgcttacctggacaatctgcaggctttc
ctctacttcaggccgcacgaatacaatctggacatccccgaggggcctgcagtgcagtat
tgcaatttggcaagagaccttcaccttgaggccttcctgaaatttaagcaacggctagaa
ggcctggtgtggcagccactgtgcagcctggccaaagccctgcttggccctcagaacctg
atcaagaagcgtctggacaagctactggactttgagcgggtggaagagaagctgctggag
gtgggcagtgtgacctaccaggaggaggccgcccggcacacataccaggcactcaactcg
ctgctagtggctgagctcccacagtttaaccagctggtcatgcagtggctgggccagatc
atgtgcacattcgtgaccctccagagggaccttgcaaagcaagtgctgcagagggcagag
ggaagcatggcccagctgccccaccaccacgtcccagagcctgccttcaggaagctggtg
gaggacgcactgggccggacgagtaaccagcttcgctcctttcaagagacctttgagaaa
gtgcagccacctcccaccacacaaccgctccttccagggtctgaacgccaggtgcaggct
ctcctgagcaggtatggccctgggaagctgtaccaggtgacaagcaacatcagtgggact
gggactctggacctgactctgcctcggggccaaatcgtggccatccttcaaaacaaggac
accaaaggcaacagcggccgctggctggtggacaccgggggacatcgtgggtatgtgccg
gctgggaaactacagctgtaccatgtggtccccagtgcagaggagctcagaaggcaggcg
gggctgaacaaagacccccgatgtctaacaccggagcccagcccagctctagtgccctct
attcccaccatgaaccaggtcatagccgcgtacccttttgtggccagaagcagccatgaa
gtgagcctgcaggcaggccagcctgtgaccatcctggaggcccaggacaagaaggggaac
cctgagtggagcctggtggaagtgaatggacagagggggagctggcctgagctgctgggg
gatacagaaagaaatcagacagatgaggcccggtcttccagatcccagtctctaaggcag
gcagatgaggcacccatcaattacgtgccgtgcaactgcatgctccttgaggctgagaac
ttctga

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_6|270_aa
MGKGSENESQEKSSKLLVPHHSGDGQMNINVPPVTATPVGTGRVVQGLAHFSISIRFPQH
LLPPSSKIWELAVPLWTNLIINHQGSLSTGFMPDPVLGVRDEQKIMKMSSAKVSRDVSCV
HIPTTPSAGLFPGLPTRRKVRKVRRQRDEATCLKAATQLLLLKPPADSWAPKLLRPLETP
KRMLLAAKRPEPITTGTAPSVSSVPAYCLVTCPACPCADAVDARTPYSWREAGKPSGACA
YEHCPSLAAPGMGEEGKPQECLWGVTAAPE

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_6|813_bp
atggggaaaggctcagaaaacgagtcccaggagaaaagctccaaactcttagtcccacat
catagcggagatgggcagatgaacatcaatgtccctccagtgacggccacaccagtgggg
acagggagggtagtccaggggctggcccacttctcaatctccatcaggttcccccagcac
ctgctgcccccttcttcaaagatatgggaactagcagtgcccctttggactaatctcatc
atcaaccatcaaggtagcctaagcactggctttatgccagatcctgtgctgggtgtcaga
gatgaacagaagatcatgaaaatgtctagtgccaaagtctcccgtgatgtatcctgtgtc
cacatcccaacaaccccatcggcaggactctttcctggtttgcccacaaggaggaaagtg
aggaaagtgaggcgtcagagagatgaagcaacttgtttgaaggcggccacacagctgctt
ctactgaagcctccagcagactcttgggccccaaagctgctgcgtcccctagaaacacct
aaaaggatgctcctggctgcaaagcgccctgagcctataaccacaggaactgctccatca
gtatccagtgtgcctgcttattgcctggttacctgcccggcttgtccttgtgcagatgca
gtggatgccagaacaccctacagctggagagaggcggggaaaccctcaggtgcttgtgcc
tacgagcactgccccagcctggcagcaccagggatgggagaggaaggaaagccccaggaa
tgcctgtggggtgtgacagcagcccctgagtga

>gi568815593f:149497770_149732188|GENSCAN_predicted_peptide_7|81_aa
MAGNDCGALLDEELSSFFLNYLADTQAAEPPFQAPCDALRYRGREPEVSRRVPERWGRYG
RWGGSSLGRLESATPRAKLGX

>gi568815593f:149497770_149732188|GENSCAN_predicted_CDS_7|243_bp
atggcggggaacgactgcggcgcgctgctggacgaagagctctcctccttcttcctcaac
tatctcgctgacacgcaggctgcagagccccccttccaggcgccctgcgatgcgctccgt
taccggggcagggagccggaggtctcccggcgcgtgccggagcgctgggggcgctacggc
cgctggggagggtctagccttggccgcttggagtctgccaccccgcgggcaaaactgggg
gnn