GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:15:43 Sequence gi568815586f:121110165_121333093 : 222929 bp : 47.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9248 9284 37 1 1 83 80 16 0.022 0.65 1.02 Term + 21147 21262 116 1 2 108 49 89 0.138 5.73 1.03 PlyA + 21968 21973 6 1.05 2.00 Prom + 22253 22292 40 -7.36 2.01 Init + 22807 22931 125 0 2 108 111 148 0.993 18.74 2.02 Intr + 44621 44789 169 2 1 70 99 242 0.879 23.45 2.03 Intr + 45915 45983 69 2 0 101 94 -10 0.453 0.28 2.04 Intr + 50738 50810 73 1 1 89 77 20 0.751 -0.02 2.05 Intr + 52218 52356 139 1 1 91 117 95 0.791 12.22 2.06 Intr + 55193 55273 81 2 0 111 113 142 0.987 17.75 2.07 Intr + 55894 56023 130 1 1 41 77 118 0.999 6.70 2.08 Intr + 57324 57460 137 2 2 68 93 148 0.999 12.67 2.09 Intr + 65224 65314 91 1 1 69 103 178 0.991 17.40 2.10 Intr + 66983 67048 66 1 0 107 99 94 0.999 11.50 2.11 Intr + 67133 67282 150 1 0 81 93 263 0.999 26.46 2.12 Intr + 70190 70291 102 1 0 58 100 49 0.877 3.47 2.13 Term + 74141 74638 498 1 0 58 48 567 0.934 44.22 2.14 PlyA + 75727 75732 6 1.05 3.00 Prom + 87996 88035 40 -1.86 3.01 Init + 100001 100134 134 1 2 117 88 359 0.997 38.41 3.02 Intr + 106970 107117 148 2 1 89 89 219 0.999 22.34 3.03 Intr + 111749 111820 72 1 0 109 75 109 0.996 11.30 3.04 Intr + 111930 112002 73 2 1 117 80 95 0.992 10.58 3.05 Intr + 112783 112879 97 2 1 91 79 59 0.964 4.37 3.06 Intr + 118369 118449 81 1 0 78 108 4 0.511 0.15 3.07 Intr + 118561 118702 142 1 1 80 90 123 0.991 12.06 3.08 Intr + 118799 118935 137 1 2 83 80 170 0.983 15.07 3.09 Intr + 122250 122343 94 0 1 138 65 315 0.999 34.17 3.10 Intr + 122447 122512 66 1 0 86 101 94 0.685 9.60 3.11 Intr + 122833 122928 96 0 0 51 96 117 0.997 9.01 3.12 Term + 123359 123385 27 1 0 125 54 22 0.740 0.57 3.13 PlyA + 125630 125635 6 1.05 4.10 PlyA - 126989 126984 6 1.05 4.09 Term - 131259 131239 21 0 0 96 54 31 0.095 -1.19 4.08 Intr - 134451 134409 43 2 1 80 92 19 0.599 -0.26 4.07 Intr - 135076 134976 101 1 2 138 94 70 0.883 11.51 4.06 Intr - 138570 138442 129 0 0 69 79 102 0.987 8.29 4.05 Intr - 139710 139623 88 2 1 115 75 162 0.999 17.57 4.04 Intr - 139870 139797 74 1 2 118 81 72 0.999 7.70 4.03 Intr - 142550 142497 54 2 0 122 110 41 0.997 8.88 4.02 Intr - 143308 143109 200 2 2 100 121 306 0.993 34.17 4.01 Init - 145467 145386 82 0 1 73 48 117 0.590 7.33 4.00 Prom - 146273 146234 40 -6.86 5.21 PlyA - 148186 148181 6 1.05 5.20 Term - 151913 151617 297 2 0 40 54 167 0.859 3.77 5.19 Intr - 153775 153642 134 2 2 137 83 188 0.996 23.56 5.18 Intr - 158525 158474 52 2 1 98 94 88 0.661 8.88 5.17 Intr - 159417 159364 54 0 0 59 73 104 0.971 5.18 5.16 Intr - 160417 160364 54 1 0 81 108 35 0.939 3.98 5.15 Intr - 160781 160734 48 2 0 110 94 75 0.996 9.18 5.14 Intr - 164421 163892 530 1 2 121 96 498 0.690 46.56 5.13 Intr - 187511 187375 137 1 2 93 106 7 0.107 3.21 5.12 Intr - 192076 191928 149 1 2 66 76 62 0.153 1.83 5.11 Intr - 194369 194322 48 2 0 93 73 35 0.453 1.38 5.10 Intr - 198527 198340 188 0 2 75 32 246 0.793 17.11 5.09 Intr - 199699 199537 163 1 1 70 97 194 0.680 18.05 5.08 Intr - 208260 208113 148 2 1 46 115 214 0.998 20.14 5.07 Intr - 208444 208337 108 0 0 82 110 33 0.944 4.30 5.06 Intr - 209654 209533 122 2 2 102 82 4 0.959 0.49 5.05 Intr - 210295 210221 75 1 0 92 89 19 0.782 2.11 5.04 Intr - 217067 216932 136 1 1 54 96 115 0.990 9.47 5.03 Intr - 218333 218152 182 2 2 57 96 227 0.598 19.07 5.02 Intr - 220508 220419 90 2 0 64 107 31 0.779 2.79 5.01 Init - 221829 221824 6 0 0 85 89 4 0.592 0.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 89168 88984 185 0 2 92 47 96 0.905 3.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:121110165_121333093|GENSCAN_predicted_peptide_1|50_aa MDSIAIPSSLPADEDDRITQIPGRKAMAYSSGVTNDGTIYKGVDRVKKKQ >gi568815586f:121110165_121333093|GENSCAN_predicted_CDS_1|153_bp atggactctatcgccatcccctcttctcttcctgcagatgaagatgaccgtatcactcag attcccggcaggaaagcaatggcatactcaagtggggtaactaatgatggaaccatttac aaaggtgtggacagagttaagaaaaagcaatag >gi568815586f:121110165_121333093|GENSCAN_predicted_peptide_2|609_aa MPACCSCSDVFQYETNKVTRIQSMNYGTIKWFFHVIIFSYVCFALVSDKLYQRKEPVISS VHTKVKGIAEVKEEIVENGVKKLVHSVFDTADYTFPLQGNSFFVMTNFLKTEGQEQRLCP EYPTRRTLCSSDRGCKKGWMDPQSKVLSHLWFYDALTPIGIQTGRCVVYEGNQKTCEVSA WCPIEAVEEAPRPALLNSAENFTVLIKNNIDFPGHNYTTRNILPGLNITCTFHKTQNPQC PIFRLGDIFRETGDNFSDVAIQGGIMGIEIYWDCNLDRWFHHCRPKYSFRRLDDKTTNVS LYPGYNFRYAKYYKENNVEKRTLIKVFGIRFDILVFGTGGKFDIIQLVVYIGSTLSYFGL AAVFIDFLIDTYSSNCCRSHIYPWCKCCQPCVVNEYYYRKKCESIVEPKPTLKYVSFVDE SHIRMVNQQLLGRSLQDVKGQEVPRPAMDFTDLSRLPLALHDTPPIPGQPEEIQLLRKEA TPRSRDSPVWCQCGSCLPSQLPESHRCLEELCCRKKPGACITTSELFRKLVLSRHVLQFL LLYQEPLLALDVDSTNSRLRHCAYRCYATWRFGSQDMADFAILPSCCRWRIRKEFPKSEG QYSGFKSPY >gi568815586f:121110165_121333093|GENSCAN_predicted_CDS_2|1830_bp atgccggcctgctgcagctgcagtgatgttttccagtatgagacgaacaaagtcactcgg atccagagcatgaattatggcaccattaagtggttcttccacgtgatcatcttttcctac gtttgctttgctctggtgagtgacaagctgtaccagcggaaagagcctgtcatcagttct gtgcacaccaaggtgaaggggatagcagaggtgaaagaggagatcgtggagaatggagtg aagaagttggtgcacagtgtctttgacaccgcagactacaccttccctttgcaggggaac tctttcttcgtgatgacaaactttctcaaaacagaaggccaagagcagcggttgtgtccc gagtatcccacccgcaggacgctctgttcctctgaccgaggttgtaaaaagggatggatg gacccgcagagcaaagttctttcacatctgtggttctacgatgctttgacccctatagga attcagaccggaaggtgtgtagtgtatgaagggaaccagaagacctgtgaagtctctgcc tggtgccccatcgaggcagtggaagaggccccccggcctgctctcttgaacagtgccgaa aacttcactgtgctcatcaagaacaatatcgacttccccggccacaactacaccacgaga aacatcctgccaggtttaaacatcacttgtaccttccacaagactcagaatccacagtgt cccattttccgactaggagacatcttccgagaaacaggcgataatttttcagatgtggca attcagggcggaataatgggcattgagatctactgggactgcaacctagaccgttggttc catcactgccgtcccaaatacagtttccgtcgccttgacgacaagaccaccaacgtgtcc ttgtaccctggctacaacttcagatacgccaagtactacaaggaaaacaatgttgagaaa cggactctgataaaagtcttcgggatccgttttgacatcctggtttttggcaccggagga aaatttgacattatccagctggttgtgtacatcggctcaaccctctcctacttcggtctg gccgctgtgttcatcgacttcctcatcgacacttactccagtaactgctgtcgctcccat atttatccctggtgcaagtgctgtcagccctgtgtggtcaacgaatactactacaggaag aagtgcgagtccattgtggagccaaagccgacattaaagtatgtgtcctttgtggatgaa tcccacattaggatggtgaaccagcagctactagggagaagtctgcaagatgtcaagggc caagaagtcccaagacctgcgatggacttcacagatttgtccaggctgcccctggccctc catgacacacccccgattcctggacaaccagaggagatacagctgcttagaaaggaggcg actcctagatccagggatagccccgtctggtgccagtgtggaagctgcctcccatctcaa ctccctgagagccacaggtgcctggaggagctgtgctgccggaaaaagccgggggcctgc atcaccacctcagagctgttcaggaagctggtcctgtccagacacgtcctgcagttcctc ctgctctaccaggagcccttgctggcgctggatgtggattccaccaacagccggctgcgg cactgtgcctacaggtgctacgccacctggcgcttcggctcccaggacatggctgacttt gccatcctgcccagctgctgccgctggaggatccggaaagagtttccgaagagtgaaggg cagtacagtggcttcaagagtccttactga >gi568815586f:121110165_121333093|GENSCAN_predicted_peptide_3|388_aa MAGCCAALAAFLFEYDTPRIVLIRSRKVGLMNRAVQLLILAYVIGWVFVWEKGYQETDSV VSSVTTKVKGVAVTNTSKLGFRIWDVADYVIPAQEENSLFVMTNVILTMNQTQGLCPEIP DATTVCKSDASCTAGSAGTHSNGVSTGRCVAFNGSVKTCEVAAWCPVEDDTHVPQPAFLK AAENFTLLVKNNIWYPKFNFSKRNILPNITTTYLKSCIYDAKTDPFCPIFRLGKIVENAG HSFQDMAVEGGIMGIQVNWDCNLDRAASLCLPRYSFRRLDTRDVEHNVSPGYNFRFAKYY RDLAGNEQRTLIKAYGIRFDIIVFGKAGKFDIIPTMINIGSGLALLGMATVLCDIIVLYC MKKRLYYREKKYKYVEDYEQGLASELDQ >gi568815586f:121110165_121333093|GENSCAN_predicted_CDS_3|1167_bp atggcgggctgctgcgccgcgctggcggccttcctgttcgagtacgacacgccgcgcatc gtgctcatccgcagccgcaaagtggggctcatgaaccgcgccgtgcaactgctcatcctg gcctacgtcatcgggtgggtgtttgtgtgggaaaagggctaccaggaaactgactccgtg gtcagctccgttacgaccaaggtcaagggcgtggctgtgaccaacacttctaaacttgga ttccggatctgggatgtggcggattatgtgataccagctcaggaggaaaactccctcttc gtcatgaccaacgtgatcctcaccatgaaccagacacagggcctgtgccccgagattcca gatgcgaccactgtgtgtaaatcagatgccagctgtactgccggctctgccggcacccac agcaacggagtctcaacaggcaggtgcgtagctttcaacgggtctgtcaagacgtgtgag gtggcggcctggtgcccggtggaggatgacacacacgtgccacaacctgcttttttaaag gctgcagaaaacttcactcttttggttaagaacaacatctggtatcccaaatttaatttc agcaagaggaatatccttcccaacatcaccactacttacctcaagtcgtgcatttatgat gctaaaacagatcccttctgccccatattccgtcttggcaaaatagtggagaacgcagga cacagtttccaggacatggccgtggagggaggcatcatgggcatccaggtcaactgggac tgcaacctggacagagccgcctccctctgcttgcccaggtactccttccgccgcctcgat acacgggacgttgagcacaacgtatctcctggctacaatttcaggtttgccaagtactac agagacctggctggcaacgagcagcgcacgctcatcaaggcctatggcatccgcttcgac atcattgtgtttgggaaggcagggaaatttgacatcatccccactatgatcaacatcggc tctggcctggcactgctaggcatggcgaccgtgctgtgtgacatcatagtcctctactgc atgaagaaaagactctactatcgggagaagaaatataaatatgtggaagattacgagcag ggtcttgctagtgagctggaccagtga >gi568815586f:121110165_121333093|GENSCAN_predicted_peptide_4|263_aa MEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSN EFKGSDALLSNTVGTPAFMAPESLSETRKIFSGKALDVWAMGVTLYCFVFGQCPFMDERI MCLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIVVPEIKLHPWVTRHGAEPLP SEDENCTLVEVTEEEVENSVKHIPSLATVILVKTMIRKRSFGNPFEGSRREERSLSAPGN LLTKKPTRECESLSELKGWLQEG >gi568815586f:121110165_121333093|GENSCAN_predicted_CDS_4|792_bp atggaagtgcccaccctcaaaccactctctgaagaccaggcccgtttctacttccaggat ctgatcaaaggcatcgagtacttacactaccagaagatcatccaccgtgacatcaaacct tccaacctcctggtcggagaagatgggcacatcaagatcgctgactttggtgtgagcaat gaattcaagggcagtgacgcgctcctctccaacaccgtgggcacgcccgccttcatggca cccgagtcgctctctgagacccgcaagatcttctctgggaaggccttggatgtttgggcc atgggtgtgacactatactgctttgtctttggccagtgcccattcatggacgagcggatc atgtgtttacacagtaagatcaagagtcaggccctggaatttccagaccagcccgacata gctgaggacttgaaggacctgatcacccgtatgctggacaagaaccccgagtcgaggatc gtggtgccggaaatcaagctgcacccctgggtcacgaggcatggggcggagccgttgccg tcggaggatgagaactgcacgctggtcgaagtgactgaagaggaggtcgagaactcagtc aaacacattcccagcttggcaaccgtgatcctggtgaagaccatgatacgtaaacgctcc tttgggaacccattcgagggcagccggcgggaggaacgctcactgtcagcgcctggaaac ttgctcaccaaaaaaccaaccagggaatgtgagtccctgtctgagctcaagggctggctg caggagggctga >gi568815586f:121110165_121333093|GENSCAN_predicted_peptide_5|906_aa MVSWLYVLGQKRSDSYVLLEHSVKKAVHFGLPYLASLGIQSLVQQRAFAGKTANKLMDAL KDSDLLHWKHSLSELIDISIAQKTAIWRLYGRSTMALQQAQMLLSMNSLEAVNAGVQQNN TESFAVALCHLAELHAEQGCFAAASEVLKHLKERFPPNSQHAQLWMLCDQKIQFDRAMND GKYHLADSLVTGITALNSIEGVYRKAVVLQAQNQMSEAHKLLQKLLVHCQKLKNTEMVIS VLLSVAELYWRSSSPTIALPMLLQALALSKEYRLQYLASETVLNLAFAQLILGIPEQALS LLHMAIEPILADGAILDKGRAMFLVAKCQVASAASYDQPKKAEALEAAIENLNEAKNYFA KVDCKERIRDVVYFQARLYHTLGKTQERNRCAMLFRQLHQELPSHGAGKRALEVGPDPAP TLYIGVMNPAVCIWMFKVWEVEHSSEQHERHTVQKGRMKPPQEITVLSLGVRVSETSPGS LDPVGRFREPSRSSFLLFPAQCCYPAGGSNAHLRLQQSGSAAGWECPSVLDEAGACTMSS CVSSQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGMESFIVVTECEPGCA VDLGLARDRPLEADGQEVPLDTSGSQARPHLSGRKLSLQERSQGGLAAGGSLDMNGRCIC PSLPYSPVSSPQSSPRLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKLMGGIITMVK TPNASRKQGSYGVVKLAYNENDNTYYAMKVLSKKKLIRQAGFPRRPPPRGTRPAPGGCIQ PRGPIEQVYQEIAILKKLDHPNVVKLVEVSGKPCRLVENDSQKGKYCLRFKVKIVNLVNS MKHLGDLQTSPDCVWSTSGLDDKDLSPCWMFRVPWEYKAGAASSLGEAGRGLLEEVMFEM AVVVPG >gi568815586f:121110165_121333093|GENSCAN_predicted_CDS_5|2721_bp atggtgagctggctttatgtgctggggcagaagagatccgatagctatgttctgctggag cattctgtgaagaaggcagtacattttgggttaccgtacctcgcctccctgggaatacag tcccttgttcaacagagagcttttgctgggaagacggcaaacaagctgatggatgcccta aaggactccgacctcctgcactggaaacacagcctgtcagagctcatcgatatcagcatc gcacagaaaacggccatctggaggctgtatggccgcagcaccatggcactgcaacaggcc cagatgttgctgagcatgaacagcctggaggcggtgaatgcgggcgtgcagcagaacaac acagagtcctttgctgtcgcactctgccacctcgcagagctacacgcggagcagggctgt tttgctgcagcttctgaagtgttaaagcacttgaaggaacgatttccgcctaatagtcag cacgcccagttatggatgctatgtgatcaaaaaatacagtttgacagagcaatgaatgat ggcaaatatcatttggctgattcacttgttacaggaatcacagctctcaatagcatagag ggtgtttataggaaagcggttgtattacaagctcagaaccaaatgtcagaggcacataag cttttacaaaaattgttggttcattgtcagaaactgaagaacacagaaatggtgatcagt gtcctactgtccgtggcagagctgtactggcgatcttcctcccctaccatcgcgctgccc atgctcctgcaggctctggccctctccaaggagtaccggttacagtacttggcctctgaa acagtgctgaacttggcttttgcgcagctcattcttggaatcccagaacaggccttaagt cttctccacatggccatcgagcccatcttggctgacggggctatcctggacaaaggtcgt gccatgttcttagtggccaagtgccaggtggcttcagcagcttcctacgatcagccgaag aaagcagaagctctggaggctgccatcgagaacctcaatgaagccaagaactattttgca aaggttgactgcaaagagcgcatcagggacgtcgtttacttccaggccagactctaccat accctggggaagacccaggagaggaaccggtgtgcgatgctcttccggcagctgcatcag gagctgccctctcatggggcaggaaagcgggccctggaggttgggcctgacccagccccg accttgtacattggtgtgatgaatcctgctgtctgcatctggatgttcaaggtttgggag gtggaacattccagtgagcaacatgagcggcacacggttcagaaaggaaggatgaaacct ccgcaggaaattacagtgctgtccttgggggtgagggtcagtgagacatcccctgggtcg ctcgaccccgtaggacggttcagggagccctccaggtcttcgtttctcctcttccccgca cagtgctgttatccagctgggggatccaacgcacacttaaggctccagcaaagtggctcc gctgccggatgggagtgccccagtgtgctggatgaagctggcgcatgcaccatgtcatca tgtgtctctagccagcccagcagcaaccgggccgccccccaggatgagctggggggcagg ggcagcagcagcagcgaaagccagaagccctgtgaggccctgcggggcctctcatccttg agcatccacctgggcatggagtccttcattgtggtcaccgagtgtgagccgggctgtgct gtggacctcggcttggcgcgggaccggcccctggaggccgatggccaagaggtccccctt gacacctccgggtcccaggcccggccccacctctccggtcgcaagctgtctctgcaagag cggtcccagggtgggctggcagccggtggcagcctggacatgaacggacgctgcatctgc ccgtccctgccctactcacccgtcagctccccgcagtcctcgcctcggctgccccggcgg ccgacagtggagtctcaccacgtctccatcacgggtatgcaggactgtgtgcagctgaat cagtataccctgaaggatgaaattggaaagctaatggggggtatcatcaccatggtcaaa actccaaatgccagtagaaaacaaggctcctatggtgtcgtcaagttggcctacaatgaa aatgacaatacctactatgcaatgaaggtgctgtccaaaaagaagctgatccggcaggcc ggctttccacgtcgccctccaccccgaggcacccggccagctcctggaggctgcatccag cccaggggccccattgagcaggtgtaccaggaaattgccatcctcaagaagctggaccac cccaatgtggtgaagctggtggaggtttctggaaaaccctgtagactagtggagaatgat agtcaaaaaggcaaatactgtcttcgttttaaagtgaaaatagtgaaccttgtaaactcc atgaagcatcttggggatctgcagacatctccagattgtgtttggagcacctctggtcta gacgacaaggacttgtcaccgtgctggatgttcagagtgccatgggagtacaaagcgggg gcagccagctcactgggagaagctgggagaggcctcttagaggaagtgatgtttgagatg gccgtggtggttcctggatga