GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:33:25 Sequence gi568815595f:108730667_108953874 : 223208 bp : 38.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 2403 2398 6 1.05 1.04 Term - 6493 6299 195 1 0 83 49 174 0.653 9.43 1.03 Intr - 11582 11312 271 1 1 10 84 141 0.254 2.52 1.02 Intr - 12825 12738 88 1 1 68 109 71 0.329 5.31 1.01 Init - 15202 15136 67 1 1 60 87 46 0.144 2.99 1.00 Prom - 15709 15670 40 -8.45 2.00 Prom + 17159 17198 40 -9.95 2.01 Sngl + 19634 20650 1017 1 0 88 43 766 0.998 68.87 2.02 PlyA + 20668 20673 6 -4.04 3.00 Prom + 20750 20789 40 -5.25 3.01 Sngl + 21008 23290 2283 1 0 44 35 868 0.654 69.98 3.02 PlyA + 23510 23515 6 1.05 4.08 PlyA - 24307 24302 6 1.05 4.07 Term - 25239 25112 128 1 2 78 48 143 0.629 6.76 4.06 Intr - 25922 25842 81 0 0 86 105 6 0.380 0.79 4.05 Intr - 26471 26393 79 2 1 109 61 96 0.013 7.21 4.04 Intr - 43708 43675 34 0 1 115 84 13 0.245 0.91 4.03 Intr - 47861 47691 171 1 0 79 66 131 0.443 8.14 4.02 Intr - 49760 49664 97 0 1 92 56 41 0.574 -0.55 4.01 Init - 53858 53825 34 2 1 66 110 40 0.632 4.08 4.00 Prom - 81283 81244 40 -3.05 5.00 Prom + 85369 85408 40 -5.05 5.01 Init + 88367 88415 49 1 1 63 116 13 0.156 2.76 5.02 Intr + 100004 100114 111 0 0 90 116 56 0.807 8.03 5.03 Intr + 112048 112138 91 2 1 98 56 35 0.360 -0.57 5.04 Intr + 116402 116463 62 2 2 77 111 46 0.619 3.26 5.05 Intr + 118500 118588 89 1 2 41 109 109 0.670 7.07 5.06 Term + 122954 123211 258 1 0 76 37 221 0.999 10.37 5.07 PlyA + 123601 123606 6 1.05 6.02 PlyA - 123934 123929 6 1.05 6.01 Sngl - 125411 125241 171 2 0 83 48 164 0.986 6.48 6.00 Prom - 125845 125806 40 -4.15 7.00 Prom + 127650 127689 40 -1.45 7.01 Init + 134980 135012 33 0 0 81 101 59 0.411 6.42 7.02 Term + 138675 138749 75 2 0 121 39 13 0.269 -3.44 7.03 PlyA + 139034 139039 6 1.05 8.11 PlyA - 139211 139206 6 1.05 8.10 Term - 145667 145584 84 2 0 62 39 125 0.129 1.67 8.09 Intr - 161065 160906 160 0 1 125 86 38 0.167 6.37 8.08 Intr - 175560 175418 143 0 2 104 -3 123 0.365 3.03 8.07 Intr - 177543 177371 173 1 2 94 62 136 0.909 10.34 8.06 Intr - 185548 185461 88 1 1 110 50 106 0.464 7.62 8.05 Intr - 189919 189770 150 1 0 96 115 110 0.991 13.94 8.04 Intr - 192301 192166 136 0 1 79 53 70 0.104 2.25 8.03 Intr - 199090 199035 56 1 2 57 65 103 0.017 1.76 8.02 Intr - 212012 211914 99 2 0 51 110 43 0.067 2.19 8.01 Init - 223096 222893 204 1 0 97 116 173 0.998 19.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 26519 26393 127 2 1 67 61 176 0.985 11.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_1|206_aa MHPQLAALSMPFGTLRTNGNGIESGTGSSNDVIRPELYSPRNHVVKYSRKHSCLHSQNKS RQKNVEKLDWLSLLASIFLPCRTLPALEHRTSSSSALGLLDLQPQTEGCTDGFPTFQVLG LGLASLLLSLQMVCCGTLPCNHPLSRLPALAAATCTLTAALPGLQGGDSPLQSQQLNPFS LWEQAVCTASTRQLYLLQTIVTQKQV >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_1|621_bp atgcatcctcaacttgctgccctctctatgccttttgggactcttagaaccaatggcaat ggcattgaatctggaaccggatcctctaatgatgtcataagacctgagctgtattcacct aggaatcatgttgtaaaatactcaagaaagcacagctgcctgcacagccagaataaaagc aggcagaagaacgtggaaaaactagactggctgagtcttctggcctccatctttctccca tgccggacacttcctgccctcgaacatcggacctcaagttcttcagctttgggactcttg gaccttcaaccacagactgaaggctgcactgacggcttccctacttttcaggttttggga ctcggactagcttccttgctcctcagcttgcagatggtctgctgtgggactttgccttgt aatcatcctttgtctcggctgcctgctctggctgctgccacctgcacgcttacagctgca ctccctggcctgcaaggaggtgactctcccttacagagtcagcagcttaaccctttctct ctgtgggagcaagctgtatgtacagcatcaacaaggcagttatacctgttacagacaata gtgactcagaagcaagtatga >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_2|338_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDSDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA IEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREEGWV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_2|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattctgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atagaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagaggaaggttgggtt accctcaaaggaaagcccatcagactaacagcggatctctcagcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_3|760_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTDIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLASLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILASRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAK >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_3|2283_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctgagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagacattataacaaactatctctcagaccacagtgcaatcaaactagaactcagg attaagaatctcactcaaagccgctcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacaccctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccgctagca agcctaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaagccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccactcctattcaacatagtgttggaagttctggccagggca attaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtgtatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag taa >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_4|207_aa MWFIVEYVPRPGAAVRLSSAGSQLPQHSSWEVGVGGHTSNHIGIEVGGTTTAEEQIKDRP QGVVLEEPRPDQRILNQLGLDLMTRSGFGADKTLEVIVFCICSLFLCDNEKSLINPGSTQ CSLDSVMDKKIKDVLNSLEYSPSPISKKLSCASVKSQGRPSSCPAGMAVTGCACGYGCGS WDVQLETTCHCQCSVVDWTTARCCHLT >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_4|624_bp atgtggttcattgtggaatatgttccacgtccaggtgcagcagtgcggctgagctcagct ggttctcagttgccacaacacagctcctgggaagtgggtgtgggtgggcacacttcgaac cacattggcatagaggttggtgggacaaccacagcagaggagcagataaaggaccgtccc caaggagttgtgctcgaagaaccacgcccagaccaaagaatccttaaccaattgggactt gatctcatgacaagatctggatttggagctgataagactttggaagtaattgtattttgc atatgttcactgtttctttgtgacaatgagaaatctctgatcaacccggggagtactcag tgttccttagactccgttatggataagaagatcaaggatgttctcaacagtctagagtac agtccctctcctataagcaagaagctctcgtgtgctagtgtcaaaagccaaggcagaccg tcctcctgccctgctgggatggctgtcactggctgtgcttgtggctatggctgtggttcg tgggatgttcagctggaaaccacctgccactgccagtgcagtgtggtggactggaccact gcccgctgctgccacctgacctga >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_5|219_aa MVSSSKPVPKCQRRPEGISGCPFFLWGLLALLGLALVISLIFNISHYVEKQRQEIVMLAV LSSILSAKLWRKTLLFKCSKALAMVDEYYIEDTPIYGNLDDMISEPMDENCYEQMKARPE KSVNKMQEATPSAQATNETQMCYASLDHSVKGKRRKPRKQNTHFSDKDGDEQLHAIDASV SKTTLVDSFSPESQAVEENIHDDPIRLFGLIRAKREPIN >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_5|660_bp atggtgtcctcaagtaagccagtccctaaatgtcagaggaggccagaaggaatctctggg tgcccctttttcctctggggacttctagcattgttgggcttggctttggttatatcactg atcttcaatatttcccactatgtggaaaagcaacgacaagaaattgtcatgctggcagtt ctttccagtatcctttccgccaaactttggcgaaagacccttcttttcaagtgctctaaa gcactggcaatggttgatgagtattatattgaagacacaccaatttatggtaacttagat gatatgatttcagaaccaatggatgaaaattgctatgaacaaatgaaagcccgaccagag aaatctgtaaataagatgcaggaagccaccccatctgcacaggcaaccaatgaaacacag atgtgctacgcctcacttgatcacagcgttaaggggaagcgtagaaagcccaggaaacag aatactcatttctcagacaaggatggagatgagcaactacatgcaatagatgccagcgtt tctaagaccaccttagtagacagtttctccccagaaagccaggcagtagaggaaaacatt catgatgatcccatcagactgtttggattgatccgtgctaagagagaacctataaactag >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_6|56_aa MELQVKRSQKGRILERRELRVEKTLEICEESPSSIQLSTGQCTQVTKLPELGEKPT >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_6|171_bp atggagctccaggtcaagagaagtcaaaaaggcagaatactggagagaagagagctgcgt gtagagaaaactctggagatatgtgaagagtcgccgtcaagtattcagttaagtactggt cagtgcacgcaagtgacaaaactacccgaacttggggaaaaacccacctga >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_7|35_aa MIVRLPQTRGNEIFGNVWRHFVYTGTTCGGDASGI >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_7|108_bp atgattgtgaggcttccccagacgcgtggaaatgagatatttggtaacgtctggagacat tttgtgtatacaggcacaacctgtggtggggatgctagtggcatctag >gi568815595f:108730667_108953874|GENSCAN_predicted_peptide_8|430_aa MGNGKSIAGDQKAVPTQETHVWYRTFMMEYPSGLQTLHEFKTLLGLQGLNQKANKHIDQV YNTFDTNKVPNGPMVLRAAFKPGNMVKGPLGEFFNPIPAGQKRTKSEEIQKKKGIEEVQW DMDEAGNHHSQQTNTGTENQTPHVLTHKWELNNENTWTQGGACRRDGFVDFLEFIAAVNL IMQEKMEQKLKWYFKLYDADGNGSIDKNELLDMFMAVQALNGQQTLSPEEFINLVFHKID INNDGELTLEEFINGMAKDQDLLEIVYKSFDFSNVLRVICNGKQPDMETDSSKSPDKAGL GKDSTTSAHPTQRKEKEMMGECKEEVGTLAGTADSKLEYKRRAWESGKPGASSGHLLPVE TSFVKIKKKEQLFFINHFFLKPGENISLTTASSALMASPDSLKGYKFLKNKTPTRLAASI REQCSSANHC >gi568815595f:108730667_108953874|GENSCAN_predicted_CDS_8|1293_bp atggggaatggcaaatctatagctggtgatcagaaagcagttcctacacaagagacccat gtgtggtacagaacatttatgatggaatatccatccggcctgcaaacactacatgaattt aagacacttttgggtctgcaaggtctgaatcagaaggccaataaacatattgatcaagtt tataatacctttgacacgaacaaggttccaaatggacccatggtgttaagggctgctttt aaacctggtaatatggtgaagggtcccctaggggaattcttcaatcccatcccagcaggg cagaagaggacaaaatctgaagaaattcagaagaagaaaggcattgaagaagttcagtgg gacatggatgaagctggaaaccatcattctcagcaaactaacacaggaacagaaaaccaa acacctcatgttctcactcataagtgggagttgaacaatgagaacacatggacacaggga ggggcctgtcggcgggatggatttgttgactttttggagtttattgctgctgtaaatcta atcatgcaagaaaaaatggagcaaaaattaaaatggtattttaagctgtatgatgctgat ggaaatggttctattgacaaaaatgaactactggacatgttcatggcggtacaagccctc aatggccagcaaactctgagtcctgaagaattcatcaacttggtgttccataagatcgat ataaacaatgatggggaattgactttagaagaatttatcaatggcatggcaaaagatcag gatctcctggagattgtttacaagagcttcgacttctccaatgtgctgagagtaatctgt aatgggaagcagccagacatggagacagactcctccaaatctcctgacaaggctggtcta gggaaggactcaaccacctccgcacaccccactcagaggaaagagaaggagatgatggga gagtgcaaggaggaagttgggacactggcaggaacagcagacagcaagttggaatacaag agaagggcttgggagtcaggcaagcctggggcatcctcagggcatttgcttccagtagag accagcttcgtcaaaattaaaaagaaggagcagctttttttcatcaatcactttttttta aaaccaggggaaaatatttccctcacaactgcttcgtctgcactcatggcttcaccagat agcttaaagggctataaatttctgaaaaataaaacaccaactcgcctagcagccagcatc cgagaacaatgtagctctgccaaccactgctaa