GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:31:02 Sequence gi568815586r:121140800_121374442 : 233643 bp : 47.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8347 8402 56 0 2 51 68 94 0.772 4.46 1.02 Intr + 13986 14154 169 0 1 70 99 242 0.892 23.45 1.03 Intr + 15280 15348 69 0 0 101 94 -10 0.453 0.28 1.04 Intr + 20103 20175 73 2 1 89 77 20 0.751 -0.02 1.05 Intr + 21583 21721 139 2 1 91 117 95 0.791 12.22 1.06 Intr + 24558 24638 81 0 0 111 113 142 0.987 17.75 1.07 Intr + 25259 25388 130 2 1 41 77 118 0.999 6.70 1.08 Intr + 26689 26825 137 0 2 68 93 148 0.999 12.67 1.09 Intr + 34589 34679 91 2 1 69 103 178 0.991 17.40 1.10 Intr + 36348 36413 66 2 0 107 99 94 0.999 11.50 1.11 Intr + 36498 36647 150 2 0 81 93 263 0.999 26.46 1.12 Intr + 39555 39656 102 2 0 58 100 49 0.877 3.47 1.13 Term + 43506 44003 498 2 0 58 48 567 0.934 44.22 1.14 PlyA + 45092 45097 6 1.05 2.00 Prom + 57361 57400 40 -1.86 2.01 Init + 69366 69499 134 2 2 117 88 359 0.997 38.41 2.02 Intr + 76335 76482 148 0 1 89 89 219 0.999 22.34 2.03 Intr + 81114 81185 72 2 0 109 75 109 0.996 11.30 2.04 Intr + 81295 81367 73 0 1 117 80 95 0.992 10.58 2.05 Intr + 82148 82244 97 0 1 91 79 59 0.964 4.37 2.06 Intr + 87734 87814 81 2 0 78 108 4 0.511 0.15 2.07 Intr + 87926 88067 142 2 1 80 90 123 0.991 12.06 2.08 Intr + 88164 88300 137 2 2 83 80 170 0.983 15.07 2.09 Intr + 91615 91708 94 1 1 138 65 315 0.999 34.17 2.10 Intr + 91812 91877 66 2 0 86 101 94 0.685 9.60 2.11 Intr + 92198 92293 96 1 0 51 96 117 0.997 9.01 2.12 Term + 92724 92750 27 2 0 125 54 22 0.740 0.57 2.13 PlyA + 94995 95000 6 1.05 3.10 PlyA - 96354 96349 6 1.05 3.09 Term - 100624 100604 21 1 0 96 54 31 0.095 -1.19 3.08 Intr - 103816 103774 43 0 1 80 92 19 0.599 -0.26 3.07 Intr - 104441 104341 101 2 2 138 94 70 0.883 11.51 3.06 Intr - 107935 107807 129 1 0 69 79 102 0.987 8.29 3.05 Intr - 109075 108988 88 0 1 115 75 162 0.999 17.57 3.04 Intr - 109235 109162 74 2 2 118 81 72 0.999 7.70 3.03 Intr - 111915 111862 54 0 0 122 110 41 0.997 8.88 3.02 Intr - 112673 112474 200 0 2 100 121 306 0.993 34.17 3.01 Init - 114832 114751 82 1 1 73 48 117 0.590 7.33 3.00 Prom - 115638 115599 40 -6.86 4.26 PlyA - 117551 117546 6 1.05 4.25 Term - 121278 120982 297 0 0 40 54 167 0.859 3.77 4.24 Intr - 123140 123007 134 0 2 137 83 188 0.996 23.56 4.23 Intr - 127890 127839 52 0 1 98 94 88 0.661 8.88 4.22 Intr - 128782 128729 54 1 0 59 73 104 0.971 5.18 4.21 Intr - 129782 129729 54 2 0 81 108 35 0.939 3.98 4.20 Intr - 130146 130099 48 0 0 110 94 75 0.996 9.18 4.19 Intr - 133786 133257 530 2 2 121 96 498 0.690 46.56 4.18 Intr - 156876 156740 137 2 2 93 106 7 0.107 3.21 4.17 Intr - 161441 161293 149 2 2 66 76 62 0.153 1.83 4.16 Intr - 163734 163687 48 0 0 93 73 35 0.453 1.38 4.15 Intr - 167892 167705 188 1 2 75 32 246 0.793 17.11 4.14 Intr - 169064 168902 163 2 1 70 97 194 0.680 18.05 4.13 Intr - 177625 177478 148 0 1 46 115 214 0.998 20.14 4.12 Intr - 177809 177702 108 1 0 82 110 33 0.944 4.30 4.11 Intr - 179019 178898 122 0 2 102 82 4 0.959 0.49 4.10 Intr - 179660 179586 75 2 0 92 89 19 0.782 2.11 4.09 Intr - 186432 186297 136 2 1 54 96 115 0.990 9.47 4.08 Intr - 187698 187517 182 0 2 57 96 227 0.597 19.07 4.07 Intr - 189873 189784 90 0 0 64 107 31 0.110 2.79 4.06 Intr - 196593 196492 102 0 0 76 115 38 0.949 5.67 4.05 Intr - 203875 203833 43 0 1 113 59 15 0.687 -0.66 4.04 Intr - 205232 205040 193 0 1 112 86 184 0.812 19.15 4.03 Intr - 206206 206097 110 0 2 73 52 66 0.972 1.43 4.02 Intr - 207082 207003 80 1 2 99 97 58 0.998 6.15 4.01 Init - 211541 211335 207 2 0 122 94 462 0.998 49.02 4.00 Prom - 213903 213864 40 -6.86 5.00 Prom + 217263 217302 40 -3.46 5.01 Init + 223965 224071 107 2 2 80 77 137 0.910 11.49 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 58533 58349 185 1 2 92 47 96 0.905 3.61 S.002 Intr - 189925 189784 142 0 1 45 107 33 0.847 1.26 S.003 Intr - 194924 194734 191 2 2 82 94 73 0.880 5.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:121140800_121374442|GENSCAN_predicted_peptide_1|586_aa MWIETLKNDFMVGTFISICFALVSDKLYQRKEPVISSVHTKVKGIAEVKEEIVENGVKKL VHSVFDTADYTFPLQGNSFFVMTNFLKTEGQEQRLCPEYPTRRTLCSSDRGCKKGWMDPQ SKVLSHLWFYDALTPIGIQTGRCVVYEGNQKTCEVSAWCPIEAVEEAPRPALLNSAENFT VLIKNNIDFPGHNYTTRNILPGLNITCTFHKTQNPQCPIFRLGDIFRETGDNFSDVAIQG GIMGIEIYWDCNLDRWFHHCRPKYSFRRLDDKTTNVSLYPGYNFRYAKYYKENNVEKRTL IKVFGIRFDILVFGTGGKFDIIQLVVYIGSTLSYFGLAAVFIDFLIDTYSSNCCRSHIYP WCKCCQPCVVNEYYYRKKCESIVEPKPTLKYVSFVDESHIRMVNQQLLGRSLQDVKGQEV PRPAMDFTDLSRLPLALHDTPPIPGQPEEIQLLRKEATPRSRDSPVWCQCGSCLPSQLPE SHRCLEELCCRKKPGACITTSELFRKLVLSRHVLQFLLLYQEPLLALDVDSTNSRLRHCA YRCYATWRFGSQDMADFAILPSCCRWRIRKEFPKSEGQYSGFKSPY >gi568815586r:121140800_121374442|GENSCAN_predicted_CDS_1|1761_bp atgtggattgagactctgaagaacgactttatggtgggcaccttcatctccatctgcttt gctctggtgagtgacaagctgtaccagcggaaagagcctgtcatcagttctgtgcacacc aaggtgaaggggatagcagaggtgaaagaggagatcgtggagaatggagtgaagaagttg gtgcacagtgtctttgacaccgcagactacaccttccctttgcaggggaactctttcttc gtgatgacaaactttctcaaaacagaaggccaagagcagcggttgtgtcccgagtatccc acccgcaggacgctctgttcctctgaccgaggttgtaaaaagggatggatggacccgcag agcaaagttctttcacatctgtggttctacgatgctttgacccctataggaattcagacc ggaaggtgtgtagtgtatgaagggaaccagaagacctgtgaagtctctgcctggtgcccc atcgaggcagtggaagaggccccccggcctgctctcttgaacagtgccgaaaacttcact gtgctcatcaagaacaatatcgacttccccggccacaactacaccacgagaaacatcctg ccaggtttaaacatcacttgtaccttccacaagactcagaatccacagtgtcccattttc cgactaggagacatcttccgagaaacaggcgataatttttcagatgtggcaattcagggc ggaataatgggcattgagatctactgggactgcaacctagaccgttggttccatcactgc cgtcccaaatacagtttccgtcgccttgacgacaagaccaccaacgtgtccttgtaccct ggctacaacttcagatacgccaagtactacaaggaaaacaatgttgagaaacggactctg ataaaagtcttcgggatccgttttgacatcctggtttttggcaccggaggaaaatttgac attatccagctggttgtgtacatcggctcaaccctctcctacttcggtctggccgctgtg ttcatcgacttcctcatcgacacttactccagtaactgctgtcgctcccatatttatccc tggtgcaagtgctgtcagccctgtgtggtcaacgaatactactacaggaagaagtgcgag tccattgtggagccaaagccgacattaaagtatgtgtcctttgtggatgaatcccacatt aggatggtgaaccagcagctactagggagaagtctgcaagatgtcaagggccaagaagtc ccaagacctgcgatggacttcacagatttgtccaggctgcccctggccctccatgacaca cccccgattcctggacaaccagaggagatacagctgcttagaaaggaggcgactcctaga tccagggatagccccgtctggtgccagtgtggaagctgcctcccatctcaactccctgag agccacaggtgcctggaggagctgtgctgccggaaaaagccgggggcctgcatcaccacc tcagagctgttcaggaagctggtcctgtccagacacgtcctgcagttcctcctgctctac caggagcccttgctggcgctggatgtggattccaccaacagccggctgcggcactgtgcc tacaggtgctacgccacctggcgcttcggctcccaggacatggctgactttgccatcctg cccagctgctgccgctggaggatccggaaagagtttccgaagagtgaagggcagtacagt ggcttcaagagtccttactga >gi568815586r:121140800_121374442|GENSCAN_predicted_peptide_2|388_aa MAGCCAALAAFLFEYDTPRIVLIRSRKVGLMNRAVQLLILAYVIGWVFVWEKGYQETDSV VSSVTTKVKGVAVTNTSKLGFRIWDVADYVIPAQEENSLFVMTNVILTMNQTQGLCPEIP DATTVCKSDASCTAGSAGTHSNGVSTGRCVAFNGSVKTCEVAAWCPVEDDTHVPQPAFLK AAENFTLLVKNNIWYPKFNFSKRNILPNITTTYLKSCIYDAKTDPFCPIFRLGKIVENAG HSFQDMAVEGGIMGIQVNWDCNLDRAASLCLPRYSFRRLDTRDVEHNVSPGYNFRFAKYY RDLAGNEQRTLIKAYGIRFDIIVFGKAGKFDIIPTMINIGSGLALLGMATVLCDIIVLYC MKKRLYYREKKYKYVEDYEQGLASELDQ >gi568815586r:121140800_121374442|GENSCAN_predicted_CDS_2|1167_bp atggcgggctgctgcgccgcgctggcggccttcctgttcgagtacgacacgccgcgcatc gtgctcatccgcagccgcaaagtggggctcatgaaccgcgccgtgcaactgctcatcctg gcctacgtcatcgggtgggtgtttgtgtgggaaaagggctaccaggaaactgactccgtg gtcagctccgttacgaccaaggtcaagggcgtggctgtgaccaacacttctaaacttgga ttccggatctgggatgtggcggattatgtgataccagctcaggaggaaaactccctcttc gtcatgaccaacgtgatcctcaccatgaaccagacacagggcctgtgccccgagattcca gatgcgaccactgtgtgtaaatcagatgccagctgtactgccggctctgccggcacccac agcaacggagtctcaacaggcaggtgcgtagctttcaacgggtctgtcaagacgtgtgag gtggcggcctggtgcccggtggaggatgacacacacgtgccacaacctgcttttttaaag gctgcagaaaacttcactcttttggttaagaacaacatctggtatcccaaatttaatttc agcaagaggaatatccttcccaacatcaccactacttacctcaagtcgtgcatttatgat gctaaaacagatcccttctgccccatattccgtcttggcaaaatagtggagaacgcagga cacagtttccaggacatggccgtggagggaggcatcatgggcatccaggtcaactgggac tgcaacctggacagagccgcctccctctgcttgcccaggtactccttccgccgcctcgat acacgggacgttgagcacaacgtatctcctggctacaatttcaggtttgccaagtactac agagacctggctggcaacgagcagcgcacgctcatcaaggcctatggcatccgcttcgac atcattgtgtttgggaaggcagggaaatttgacatcatccccactatgatcaacatcggc tctggcctggcactgctaggcatggcgaccgtgctgtgtgacatcatagtcctctactgc atgaagaaaagactctactatcgggagaagaaatataaatatgtggaagattacgagcag ggtcttgctagtgagctggaccagtga >gi568815586r:121140800_121374442|GENSCAN_predicted_peptide_3|263_aa MEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSN EFKGSDALLSNTVGTPAFMAPESLSETRKIFSGKALDVWAMGVTLYCFVFGQCPFMDERI MCLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIVVPEIKLHPWVTRHGAEPLP SEDENCTLVEVTEEEVENSVKHIPSLATVILVKTMIRKRSFGNPFEGSRREERSLSAPGN LLTKKPTRECESLSELKGWLQEG >gi568815586r:121140800_121374442|GENSCAN_predicted_CDS_3|792_bp atggaagtgcccaccctcaaaccactctctgaagaccaggcccgtttctacttccaggat ctgatcaaaggcatcgagtacttacactaccagaagatcatccaccgtgacatcaaacct tccaacctcctggtcggagaagatgggcacatcaagatcgctgactttggtgtgagcaat gaattcaagggcagtgacgcgctcctctccaacaccgtgggcacgcccgccttcatggca cccgagtcgctctctgagacccgcaagatcttctctgggaaggccttggatgtttgggcc atgggtgtgacactatactgctttgtctttggccagtgcccattcatggacgagcggatc atgtgtttacacagtaagatcaagagtcaggccctggaatttccagaccagcccgacata gctgaggacttgaaggacctgatcacccgtatgctggacaagaaccccgagtcgaggatc gtggtgccggaaatcaagctgcacccctgggtcacgaggcatggggcggagccgttgccg tcggaggatgagaactgcacgctggtcgaagtgactgaagaggaggtcgagaactcagtc aaacacattcccagcttggcaaccgtgatcctggtgaagaccatgatacgtaaacgctcc tttgggaacccattcgagggcagccggcgggaggaacgctcactgtcagcgcctggaaac ttgctcaccaaaaaaccaaccagggaatgtgagtccctgtctgagctcaagggctggctg caggagggctga >gi568815586r:121140800_121374442|GENSCAN_predicted_peptide_4|1149_aa MASVHESLYFNPMMTNGVVHANVFGIKDWVTPYKIAVLVLLNEMSRTGEGAVSLMERRRL NQLLLPLLQGPDITLSKLYKLIEESCPQLANSVQIRIKLMAEGELKDMEQFFDDLSDSFS GTEPEVHKTSVVGLFLRHMILAYSKLSFSQVFKLYTALQQYFQNGEKKTVEDADMELTSR DEGERKMEKEELDVSVRSQLNFPQTGLPDHEASLLKNDETKALTPASLQKELNNLLKFNP DFAEASWLYVLGQKRSDSYVLLEHSVKKAVHFGLPYLASLGIQSLVQQRAFAGKTANKLM DALKDSDLLHWKHSLSELIDISIAQKTAIWRLYGRSTMALQQAQMLLSMNSLEAVNAGVQ QNNTESFAVALCHLAELHAEQGCFAAASEVLKHLKERFPPNSQHAQLWMLCDQKIQFDRA MNDGKYHLADSLVTGITALNSIEGVYRKAVVLQAQNQMSEAHKLLQKLLVHCQKLKNTEM VISVLLSVAELYWRSSSPTIALPMLLQALALSKEYRLQYLASETVLNLAFAQLILGIPEQ ALSLLHMAIEPILADGAILDKGRAMFLVAKCQVASAASYDQPKKAEALEAAIENLNEAKN YFAKVDCKERIRDVVYFQARLYHTLGKTQERNRCAMLFRQLHQELPSHGAGKRALEVGPD PAPTLYIGVMNPAVCIWMFKVWEVEHSSEQHERHTVQKGRMKPPQEITVLSLGVRVSETS PGSLDPVGRFREPSRSSFLLFPAQCCYPAGGSNAHLRLQQSGSAAGWECPSVLDEAGACT MSSCVSSQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGMESFIVVTECEP GCAVDLGLARDRPLEADGQEVPLDTSGSQARPHLSGRKLSLQERSQGGLAAGGSLDMNGR CICPSLPYSPVSSPQSSPRLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKLMGGIIT MVKTPNASRKQGSYGVVKLAYNENDNTYYAMKVLSKKKLIRQAGFPRRPPPRGTRPAPGG CIQPRGPIEQVYQEIAILKKLDHPNVVKLVEVSGKPCRLVENDSQKGKYCLRFKVKIVNL VNSMKHLGDLQTSPDCVWSTSGLDDKDLSPCWMFRVPWEYKAGAASSLGEAGRGLLEEVM FEMAVVVPG >gi568815586r:121140800_121374442|GENSCAN_predicted_CDS_4|3450_bp atggccagcgtccacgagagcctctacttcaatcccatgatgaccaatggggttgtgcac gccaatgtgttcggcatcaaggactgggtgacgccgtacaagatcgcggtgctggtgctg ctgaacgagatgagccgcacaggcgagggcgccgtcagcctcatggagcggcggaggctc aaccagctgctcctgcccctgctgcagggcccagatattacactgtcaaaactttacaag ttaattgaagagtcttgtccacagctggcaaattcagtgcagatcagaatcaaactgatg gctgaaggcgagttgaaggatatggaacagttttttgatgacctttcagattctttctct ggaactgaaccagaggttcacaaaacaagtgtagtaggtttgtttctgcgtcacatgatc ttggcctacagtaagctttctttcagccaagtgtttaaactgtacactgcccttcagcag tacttccagaatggtgagaaaaagacagtggaggatgctgatatggaactgaccagtaga gatgagggtgaaagaaaaatggaaaaagaagaacttgatgtatctgtaagatcacagctc aactttcctcagacgggcctccctgaccacgaggcttctttgctaaagaatgatgagact aaggccctcactccagcttccttgcagaaggaattaaacaatttgttgaaatttaatcct gattttgctgaagcgagctggctttatgtgctggggcagaagagatccgatagctatgtt ctgctggagcattctgtgaagaaggcagtacattttgggttaccgtacctcgcctccctg ggaatacagtcccttgttcaacagagagcttttgctgggaagacggcaaacaagctgatg gatgccctaaaggactccgacctcctgcactggaaacacagcctgtcagagctcatcgat atcagcatcgcacagaaaacggccatctggaggctgtatggccgcagcaccatggcactg caacaggcccagatgttgctgagcatgaacagcctggaggcggtgaatgcgggcgtgcag cagaacaacacagagtcctttgctgtcgcactctgccacctcgcagagctacacgcggag cagggctgttttgctgcagcttctgaagtgttaaagcacttgaaggaacgatttccgcct aatagtcagcacgcccagttatggatgctatgtgatcaaaaaatacagtttgacagagca atgaatgatggcaaatatcatttggctgattcacttgttacaggaatcacagctctcaat agcatagagggtgtttataggaaagcggttgtattacaagctcagaaccaaatgtcagag gcacataagcttttacaaaaattgttggttcattgtcagaaactgaagaacacagaaatg gtgatcagtgtcctactgtccgtggcagagctgtactggcgatcttcctcccctaccatc gcgctgcccatgctcctgcaggctctggccctctccaaggagtaccggttacagtacttg gcctctgaaacagtgctgaacttggcttttgcgcagctcattcttggaatcccagaacag gccttaagtcttctccacatggccatcgagcccatcttggctgacggggctatcctggac aaaggtcgtgccatgttcttagtggccaagtgccaggtggcttcagcagcttcctacgat cagccgaagaaagcagaagctctggaggctgccatcgagaacctcaatgaagccaagaac tattttgcaaaggttgactgcaaagagcgcatcagggacgtcgtttacttccaggccaga ctctaccataccctggggaagacccaggagaggaaccggtgtgcgatgctcttccggcag ctgcatcaggagctgccctctcatggggcaggaaagcgggccctggaggttgggcctgac ccagccccgaccttgtacattggtgtgatgaatcctgctgtctgcatctggatgttcaag gtttgggaggtggaacattccagtgagcaacatgagcggcacacggttcagaaaggaagg atgaaacctccgcaggaaattacagtgctgtccttgggggtgagggtcagtgagacatcc cctgggtcgctcgaccccgtaggacggttcagggagccctccaggtcttcgtttctcctc ttccccgcacagtgctgttatccagctgggggatccaacgcacacttaaggctccagcaa agtggctccgctgccggatgggagtgccccagtgtgctggatgaagctggcgcatgcacc atgtcatcatgtgtctctagccagcccagcagcaaccgggccgccccccaggatgagctg gggggcaggggcagcagcagcagcgaaagccagaagccctgtgaggccctgcggggcctc tcatccttgagcatccacctgggcatggagtccttcattgtggtcaccgagtgtgagccg ggctgtgctgtggacctcggcttggcgcgggaccggcccctggaggccgatggccaagag gtcccccttgacacctccgggtcccaggcccggccccacctctccggtcgcaagctgtct ctgcaagagcggtcccagggtgggctggcagccggtggcagcctggacatgaacggacgc tgcatctgcccgtccctgccctactcacccgtcagctccccgcagtcctcgcctcggctg ccccggcggccgacagtggagtctcaccacgtctccatcacgggtatgcaggactgtgtg cagctgaatcagtataccctgaaggatgaaattggaaagctaatggggggtatcatcacc atggtcaaaactccaaatgccagtagaaaacaaggctcctatggtgtcgtcaagttggcc tacaatgaaaatgacaatacctactatgcaatgaaggtgctgtccaaaaagaagctgatc cggcaggccggctttccacgtcgccctccaccccgaggcacccggccagctcctggaggc tgcatccagcccaggggccccattgagcaggtgtaccaggaaattgccatcctcaagaag ctggaccaccccaatgtggtgaagctggtggaggtttctggaaaaccctgtagactagtg gagaatgatagtcaaaaaggcaaatactgtcttcgttttaaagtgaaaatagtgaacctt gtaaactccatgaagcatcttggggatctgcagacatctccagattgtgtttggagcacc tctggtctagacgacaaggacttgtcaccgtgctggatgttcagagtgccatgggagtac aaagcgggggcagccagctcactgggagaagctgggagaggcctcttagaggaagtgatg tttgagatggccgtggtggttcctggatga >gi568815586r:121140800_121374442|GENSCAN_predicted_peptide_5|36_aa MKKKKKKKMKTKKKKERERMSNKIVPLPSRNGQRKX >gi568815586r:121140800_121374442|GENSCAN_predicted_CDS_5|108_bp atgaagaaaaagaagaagaagaagatgaagacgaagaagaagaaagagagagagagaatg agtaacaagatcgtgccactcccaagtcgaaatggacaaaggaagtgn