GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:48:02 Sequence gi568815597f:231429252_231664902 : 235651 bp : 40.23% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 32 27 6 -0.45 1.06 Term - 983 622 362 0 2 -33 42 313 0.634 8.31 1.05 Intr - 2026 1845 182 0 2 89 63 100 0.004 6.19 1.04 Intr - 12953 12916 38 2 2 68 101 65 0.004 1.94 1.03 Intr - 17375 17206 170 0 2 124 23 88 0.012 4.64 1.02 Intr - 43385 43269 117 0 0 65 54 88 0.009 2.62 1.01 Init - 63912 63798 115 0 1 57 77 124 0.317 8.62 1.00 Prom - 64009 63970 40 -6.95 2.04 PlyA - 64022 64017 6 1.05 2.03 Term - 70009 69796 214 0 1 10 37 174 0.540 0.22 2.02 Intr - 70713 70050 664 1 1 -28 79 287 0.274 6.09 2.01 Init - 70947 70740 208 0 1 60 57 113 0.364 4.43 2.00 Prom - 73194 73155 40 -3.15 3.00 Prom + 81856 81895 40 -1.85 3.01 Init + 83018 83077 60 1 0 52 95 12 0.542 -0.40 3.02 Intr + 83198 83282 85 1 1 98 58 60 0.084 2.47 3.03 Intr + 87850 87967 118 2 1 43 106 40 0.017 -0.10 3.04 Intr + 99007 99127 121 1 1 2 86 122 0.113 2.98 3.05 Intr + 99154 99346 193 0 1 6 52 148 0.069 1.14 3.06 Intr + 107962 108076 115 2 1 39 113 69 0.402 3.09 3.07 Intr + 113230 113360 131 1 2 28 111 108 0.336 6.52 3.08 Intr + 131877 132004 128 1 2 83 37 70 0.410 0.88 3.09 Term + 135277 135654 378 0 0 75 39 271 0.959 14.70 3.10 PlyA + 136951 136956 6 1.05 4.03 PlyA - 137174 137169 6 1.05 4.02 Term - 162549 162298 252 0 0 97 42 146 0.305 5.55 4.01 Init - 167225 167091 135 2 0 54 77 97 0.831 5.48 4.00 Prom - 168616 168577 40 -7.15 5.00 Prom + 169100 169139 40 -5.95 5.01 Init + 169603 169712 110 0 2 87 37 56 0.285 0.14 5.02 Intr + 174577 174652 76 1 1 92 70 116 0.236 8.80 5.03 Intr + 188728 188808 81 0 0 37 113 85 0.308 4.82 5.04 Intr + 197530 197683 154 0 1 101 91 31 0.170 3.42 5.05 Intr + 201786 201921 136 1 1 59 110 3 0.024 -1.69 5.06 Intr + 202169 202265 97 2 1 9 70 141 0.092 3.49 5.07 Intr + 211903 211987 85 0 1 36 101 101 0.851 4.67 5.08 Intr + 212072 212119 48 0 0 66 101 53 0.693 2.13 5.09 Intr + 212199 212441 243 1 0 36 84 243 0.647 15.25 5.10 Intr + 212709 212897 189 1 0 77 48 75 0.353 1.04 5.11 Intr + 215187 215318 132 1 0 64 91 69 0.441 4.50 5.12 Term + 216390 216499 110 1 2 46 54 81 0.147 -1.81 5.13 PlyA + 216742 216747 6 -1.75 6.02 PlyA - 216844 216839 6 1.05 6.01 Sngl - 217889 217311 579 2 0 71 32 202 0.590 8.82 6.00 Prom - 217943 217904 40 -9.85 7.08 PlyA - 218060 218055 6 1.05 7.07 Term - 218840 218104 737 0 2 54 48 327 0.325 17.94 7.06 Intr - 220989 220877 113 2 2 1 98 74 0.547 -1.20 7.05 Intr - 221965 221795 171 0 0 23 86 113 0.294 2.74 7.04 Intr - 224169 224054 116 0 2 79 86 80 0.385 5.23 7.03 Intr - 227698 227533 166 0 1 58 35 123 0.297 3.04 7.02 Intr - 232943 232812 132 1 0 27 92 76 0.013 0.74 7.01 Intr - 233146 232961 186 0 0 54 72 109 0.299 3.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 46864 46697 168 1 0 92 48 160 0.802 6.91 S.002 Term + 93948 94052 105 2 0 73 38 121 0.877 3.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_1|327_aa MVEKDTTYMYGGGSEQAESSSMASCKITERVKTSGKSEDRKMCGWLWPCGHKGKLPQKLQ CPCTLSSDFHNNDHILQGMGLPESQTAVIFINVLLGLATQCGYSGLVLGMSAESCDVIRL QVSQPWILAPAPVECILNRATSVPVKILGTHSSHASGALKPAAALWEPLSGLAKAGAGSL SLRGGVEGEAQVGTRAARGACGPARVPDVTLTAKICSFTPEASETTNPPGGTNNSRHAAL KAVTLTAKVCSFTPEPARPRTHQKEETPNTSEYQKEQTPDMPPLRTVTLTARVCSFILEI SETKNPPIPDTSGHGALHPKCSSSSCG >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_1|984_bp atggtagagaaagacactacttatatgtacggaggaggtagtgaacaagctgagagttcg tcgatggcatcctgcaaaatcacagaaagggttaagacttcaggcaaaagtgaagacagg aaaatgtgtggctggctgtggccttgtggccacaaaggaaaacttcctcagaaactacag tgtccttgtactctttcctctgacttccacaataacgaccacatcctccaggggatgggg cttcctgagagccagactgcagtgatttttatcaatgtccttctgggtctagccacccaa tgcggctacagtgggctggtgctggggatgtctgcagagtcctgtgatgtgatccgtctt caggtctcccagccatggatactagcacctgctccagtggagtgtattctcaaccgagca accagtgttcctgttaaaatccttggcacccactctagccacgcttcaggagcccttaag cccgctgctgcactgtgggagcccctttctgggctggccaaggccggagccggctccctc agcttgcggggaggtgtggagggagaggcgcaggtgggaaccagggctgcgcgtggtgct tgcgggccagcacgcgttccggatgtaacactcactgcgaagatctgcagcttcactcct gaagccagcgagaccacgaacccaccaggaggaacaaacaactccagacatgccgcctta aaagctgtaacactcactgcgaaggtctgcagcttcactcctgaaccagcgagaccacga acccaccagaaggaagaaactccgaacacatccgaatatcagaaggaacaaactccggac atgccgcctttaagaactgtaacactcactgcgagggtctgcagcttcattcttgaaatc agtgagaccaagaacccaccaattccggacacatcaggacatggtgccctgcatcccaag tgctccagctccagctgtggctaa >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_2|361_aa MWKQLWNWLTGRGWNSLEDSEDRKIWDSLELPEDLLNGFDQSADSDMENKVQAEVVSDGH GELVGNWGKEIGSIRPCSRDLWNSEFERDDLGYLAEEISKWQSVLNKAEHKQIRENCFDK AKHKSLENLQPDNEIEKKTPFSGEKFKLAAEICISHEELNVHHLDNGEIVSKACQRPLGQ PLPSQAWRPRREKWFPGLNPVPLSYVQPQDLVPCIPAAPAITNVQLRPLFQRVQAPSLGS FHVVLGLRVHRSQELRFGNLHLDFRGCIETLGSPSRSSLQGWSPHGEPLLRAPTGALPSG SVRRGPPDSRMIDPLTACTVCLEKPQTLNASHESSLEEGFTLQSHRSRTAQGHGSPPFAS A >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_2|1086_bp atgtggaagcaactttggaattggctgacaggcagaggttggaacagtttggaggactca gaagacaggaaaatctgggacagtttggaacttcctgaagacttgttgaatggctttgac caaagtgctgatagtgatatggaaaacaaagtgcaggctgaggtggtctcagatggacac ggggaacttgttgggaactggggcaaagagattggcagcattcgcccctgctctagagat ctgtggaactctgaatttgagagagatgatttagggtatctggcagaagaaatttctaag tggcaaagtgttttgaacaaagcagagcataaacaaatcagagaaaactgctttgacaaa gcaaagcataaaagtttggaaaatctgcagcctgacaatgagatagaaaagaaaacccca ttttctggggagaaattcaagctggctgcagaaatttgcataagtcatgaggaattgaat gttcaccacctagacaatggggaaattgtctccaaggcatgtcagagacctttagggcag cctctgccatcacaggcctggaggcctaggagggaaaaatggtttcctgggctgaatcca gtgcccctcagctatgtgcagcctcaggacttggtgccctgcatcccagctgctccagcc attaccaatgtacagctcagaccattgtttcagagggtacaagccccaagccttggcagc ttccacgtggtgttgggcctccgtgtgcacagaagtcaagaattgaggtttgggaacctc catctagatttcagaggatgtatagaaacacttggaagtccaagcagaagttcacttcag ggatggagccctcatggagaacctctgctaagagcccccactggggcactacctagtgga tctgtgagaagagggccaccagactccagaatgatagatccactgacagcttgcactgtg tgcctagaaaagccacagacactcaatgctagccatgaaagcagcttggaggagggcttt accctgcaaagccacaggagcagaactgcccaaggccatgggagcccaccttttgcatca gcataa >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_3|442_aa MFYKVIELHLQRSHGPRWSQFPVIHFVVTISESVKVLVQQAIEMDSNLEFDFIIGLRVPD AIGPNCRYKGKLVAIEMGHETIWPPPTKKLEGSLLSHIPGPGHRRSRSGGGTTTPTAPRA KLGPPEGRKKITYVFRRFPTITQISGPGGRIVFLVTVHAVRLVESPLSSGAAEGGRNSTL ARRWRVVVLGPRAFQQELDARHDKYERLVKLSRDITVESKRTIFLLHRITSAPDMEDILT ESEIKLDGVRQKIFQVAQELSGEDMHQFHRAITTGLQEYVEAVSFQHFIKTRSLISMDEI NKQLIFTTEDNGKENKTPSSDAQDKQFGTWRLRVTPVDYLLGVADLTGELMRMCINSVGN GDIDTPFEVSQFLRQVYDGFSFIGNTGPYEVSKKLYTLKQSLAKVENACYALKVRGSEIP KHMLADVFSVKTEMIDQEEGIS >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_3|1329_bp atgttttacaaagtgattgaacttcatttgcagaggtcacatgggcctagatggtcacag ttccctgtgattcactttgtggttacaatttctgaatcagttaaagttcttgtgcagcaa gcaatagaaatggactctaacttagagtttgactttataattggccttcgtgtgcctgat gcaattggccctaattgtagatacaaaggcaaattagtagcgatagaaatggggcatgag acaatttggccacctcctacaaagaaactcgagggctcgcttctttctcacatccctggc ccggggcaccggaggtcccgctccggcggaggaactacaacccccacagcaccgcgcgcg aagctgggcccccctgagggaaggaaaaaaattacgtacgtttttcgtcgctttcccacg atcacacagatctcaggtccaggaggacgcatcgtctttttagtaacagtgcatgctgtg aggcttgtagagtcgcctctctcttcaggcgcagcagagggcggccgcaatagtacgctc gcgcggcggtggcgggtggtggtccttggaccacgcgcatttcagcaggaacttgatgca aggcatgacaaatatgagagacttgtgaaacttagtcgggatataactgttgaaagtaaa aggacaatttttctcctccataggattacaagtgctcctgatatggaagatatattgact gaatcagaaattaaattggatggtgtcagacaaaagatattccaggtagcccaagagcta tcaggggaagatatgcatcagttccatcgagccattactacaggactacaggaatatgtg gaagctgtctcttttcaacacttcatcaaaacacgatcattaattagtatggatgaaatt aataaacaattgatatttacgactgaagacaatgggaaagaaaataaaactccctcctct gatgcacaggataagcagtttggtacttggagactgagagtcacacctgtcgattacctt ctgggagtggctgacttaactggagaattgatgcggatgtgtattaacagtgtggggaat ggggacattgataccccctttgaagtgagccagtttttacgtcaggtttatgatgggttt tcattcattggcaacactggaccttacgaggtttctaagaagctgtataccttgaaacaa agtttggccaaagtggagaatgcttgttatgccttgaaagtcagagggtcagaaattcca aaacatatgttggcagatgtgttttcagttaaaacagaaatgatagatcaagaagagggc atttcttag >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_4|128_aa MLLQTPPFPVAPQAALTNCSRQLSRGGAYSGGQEILRDLGLDIPEVILFCLTSSYRDADE VKEDTRCSQSAGEARTSEWAQHSCSPGQGLAMAAAPAPAFSPTHQVTPVQSVVLRSPSNH RSEAERTT >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_4|387_bp atgctgctgcagactccacccttcccagtggctccccaggcagcactcacaaactgctct cgccagctctcccgcggaggggcctactctgggggacaggaaatattgagagaccttggc cttgatattccagaggttattcttttctgcttgacctcgagttacagagatgcagatgaa gtcaaagaggataccagatgcagccagtctgctggggaggcacgtacgtctgagtgggcc cagcacagctgttctccagggcaaggcttggccatggcagctgccccagcgcccgcattc agcccaacccaccaggtcaccccagtccagtccgtggtgctcagatccccaagcaatcac aggtcagaggcggaaagaactacgtga >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_5|486_aa MAPAPAWRLGSPQEAFNHGRRRRGAGTSHGESRSKLGWVCITAALEEEQPSLPAVTIALV GLHLQEKIKAHDYFTHCVTHEEHGRDGKKRAAPFPPPLASGKEQEAAQAEREELAAGRMP GGGPQGAPAAAGGGGVSHRAEMDARHYGAPSSHSGHSSQTYKQNHRTYCQMKLQNTSWLN SLCSGRQVLQEVSEEGIVVIGDDGSMRVFALKAFQRDKDAILGSECVRNWWVLGLTDFKN EAADPRVGSWSRWLKSEAADLRGVKLQTFVVTVTAHKGSVDPKSEQQQGLLQTAKEQSFH NVQRDRRGLPLLAPAACFYSLIWPHPHPADWYSPVVCFDRALIGLGWSMGLGAVEQGAAL VGEARAAQEPTKGGEAQAWRAAGPQPCPAGRQPRPCEKLNTAAAGPGGTGGRPVRRREKL PDSGFYCEAIASSCCSDSKEGVSWNISNPAGISPNAIPPRTPRQAPVCDVPCPVSKCSHC SIPTYE >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_5|1461_bp atggcaccagcacctgcttggcgtctagggagtcctcaggaagctttcaatcatggcaga aggcgaaggggagcaggtacatcacacggtgaaagcaggagcaagttggggtgggtctgc atcacagctgctctggaggaagagcagccatctcttcctgctgtcaccattgccttggta ggactgcatctgcaagagaagattaaagcacacgattactttactcactgtgtaacacat gaggaacatggacgagatgggaaaaagcgcgctgctcccttccccccgcctctggcctcg gggaaggagcaggaggcagcccaggcggagcgggaggagctggcagcggggcgcatgcca ggcgggggtcctcagggcgccccagccgccgccggcggcggcggcgtgagccaccgcgca gaaatggatgctagacattatggggctccctccagtcactcgggtcactcgtcacagact tataaacagaatcacagaacttattgtcagatgaagcttcaaaacaccagttggctaaac tccttgtgttctggcaggcaggtccttcaggaggtatcagaagaaggcattgttgtcata ggagatgatggctccatgcgtgtttttgccttgaaggccttccaacgggacaaggatgct atcttgggctctgagtgtgtccggaattggtgggttcttggtctcactgacttcaaaaat gaagccgcggaccctcgcgtgggttcgtggtctcgctggctcaagagtgaagctgcggac cttcgcggagtgaagctgcagaccttcgtggtgactgttacagctcataaaggcagtgtg gacccaaagagtgagcagcagcaaggtttgttgcaaacagcgaaagaacaaagcttccac aatgtgcaaagggaccggagagggttgccgctgctggcgcctgcagcctgcttttattct cttatctggccccacccacatcctgctgattggtacagcccagtcgtctgttttgacagg gcgctgattggccttgggtggtcgatgggactgggcgctgtggagcagggggcggcgctt gtcggggaggctcgggctgcacaggagcccacgaagggcggggaggctcaggcatggcgg gctgcaggtccccagccctgccccgcgggaaggcagccaaggccttgcgagaaattgaac acagcagctgctggcccaggaggcactggtgggagaccagtaagacggagagaaaaactt cctgattctgggttttactgtgaggccattgcaagcagttgctgtagtgatagtaaggaa ggggtgagttggaacatcagcaaccctgcaggtatttctcctaatgctatccctccccgc accccacgacaggccccagtgtgcgatgttccctgtcctgtgtccaagtgttctcattgc tcaattcccacctatgagtga >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_6|192_aa MGKDLITETPKAMATQAKIDKWDLIKLKSFCTATETTIRVNRQPIEWEKIFAIYPSDKGL ISRIYKELKQMYKKKQINKPIKKWAKDMNRNFSKEDIYAANRHMKKCSSSLVIREMQIKT TMRYHITSVRMVIIKKSGNNRCWRGYGETGMLLHCWWECKLVQPLWKTVRRFIKDLELEI PFGPVIPFDPAI >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_6|579_bp atgggcaaggacttaataactgaaacaccaaaagcaatggcaacacaagccaaaatagac aaatgggatctaattaaactaaagagcttctgcacagcaacagaaactaccatcagagtg aacaggcaacctatagaatgggagaaaatttttgcaatctacccatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatgtacaagaaaaaacaaataaacaagccc atcaaaaagtgggcaaaggatatgaacagaaacttctcaaaagaagacatttatgcagcc aacagacacatgaaaaaatgctcatcatcactggtcatcagagaaatgcaaatcaaaacc acaatgagataccatatcacatcagttagaatggtgatcattaaaaagtcaggaaacaac agatgctggagaggatatggagaaacaggaatgcttttacactgttggtgggagtgtaaa ctagttcaaccattgtggaagacagtgcggcgattcatcaaggatctagaactagaaata ccatttggcccagtgatcccatttgacccagcgatctaa >gi568815597f:231429252_231664902|GENSCAN_predicted_peptide_7|540_aa XSQGERWLPSLWFSGLNHSSLLALENANSPDEEGSATMQHTSSDKKQQDSSLNGSLIPFH LTGGLQPPLTDMFWPATGQYPPGTELPQEGAGSHLCCFAAFTGDISRDMDGVGVGSHYPP ETNTGTENQRMHALIYKWELNDENTWTHRGEQHTLGTISGQWPAKQQGPGQPCKPHAGEG LSPCMAMQGTAAPPRKPGTSRPKVDKTTKMGRNQSRKAENPKNQSTSSPPKDHSSSPATE QSWTENDFDELTQVDFRRAPEGSTKHGKEQRVPATAKTCQTVKTINARKKLHQLTVLEVL ARAIRQEKEIKSIPLGKEEVKLSLLADDMIVYLENPIISAQNLLKQISNFSNVSGYKINV QKSQAFLYTNNRQTETQIMSKLPFTIATKRIKHRGIQLTRDVKNLFKENYKLLLNEIKED TNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAC IAKTILSKKNKAGGNTLLQTILQGYSNQNSMVLVPKQRDRPMEQNRALRNNTTHLQPSDL >gi568815597f:231429252_231664902|GENSCAN_predicted_CDS_7|1623_bp nnctcccagggggagaggtggctgccatctctgtggttcagtggactcaaccattccagc ctgctggctttggagaatgcaaacagtccagatgaggaagggtctgccacaatgcagcac acctcctcagacaaaaagcagcaagactcatctttaaatgggtccctgataccattccac ctgactgggggtctccagccacctcttacagatatgttctggccagcaacaggtcagtat ccccctggcacagagcttccacaggaaggagctggcagccatctttgctgttttgcagcc ttcactggtgatatctccagggatatggatggagtcggagttggaagccattatcctcca gaaactaacacaggaacagaaaaccaaagaatgcatgctctcatctataagtgggaacta aatgatgaaaatacatggacacatagaggggaacagcacacactgggaactatcagtggg cagtggccagctaagcagcaagggccagggcaaccttgcaaaccacacgctggagagggc ctgagtccctgcatggccatgcaaggcacagcagcccctccacgcaagcctggtacctca agaccaaaggtagataaaaccacaaagatggggagaaaccagagcagaaaagctgaaaat cctaaaaaccagagcacctcttctcctccaaaggatcacagctcctcgccagcaacagaa caaagctggacagagaatgactttgacgagctgacacaagtagacttcagaagagctcct gaaggaagcactaaacatggaaaggaacaacgggtaccagccactgcaaaaacatgccaa actgtaaagaccatcaatgctaggaagaaactgcatcaactaacagtgttggaagttctg gccagggccatcagacaagagaaagaaataaagagtattccattaggaaaagaggaagtt aaattgtccctgttggcagatgatatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagcagataagcaacttcagtaatgtctcaggatacaaaatcaatgtg caaaaatcacaggcattcctatacaccaataacagacaaacagagacccaaatcatgagc aaactcccattcacaattgctacaaagagaataaaacaccgaggaatccaacttacaagg gatgtaaagaacctcttcaaggagaactacaaactactgctcaacgaaataaaagaggac acaaacaaatggaagaacattccatgctcatggataggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgc attgccaaaacaatcctaagcaagaagaacaaagctggaggcaacacactacttcaaact atactacaaggctacagtaaccaaaacagcatggtgctggtaccaaaacagagagataga ccaatggaacagaacagagccctcagaaataacaccacacatctacaaccatctgatctt tga