GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:20:16 Sequence gi568815594f:106216548_106447689 : 231142 bp : 35.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 75 773 699 2 0 68 43 271 0.988 16.55 1.02 PlyA + 842 847 6 1.05 2.08 PlyA - 2384 2379 6 1.05 2.07 Term - 3860 3478 383 0 2 -47 28 219 0.007 -3.58 2.06 Intr - 13899 13816 84 1 0 88 121 -3 0.539 1.87 2.05 Intr - 16517 16391 127 2 1 82 88 111 0.984 9.83 2.04 Intr - 17103 17041 63 0 0 119 18 72 0.705 1.30 2.03 Intr - 19972 19843 130 0 1 100 86 75 0.979 8.28 2.02 Intr - 28217 28079 139 0 1 65 80 136 0.984 9.10 2.01 Init - 35968 35956 13 1 1 82 49 9 0.172 -3.10 2.00 Prom - 38088 38049 40 -5.25 3.02 PlyA - 38189 38184 6 1.05 3.01 Sngl - 39011 38691 321 2 0 71 48 186 0.502 8.64 3.00 Prom - 39549 39510 40 -4.25 4.00 Prom + 54540 54579 40 -5.45 4.01 Init + 57063 57262 200 2 2 9 98 192 0.604 10.72 4.02 Intr + 75116 75242 127 2 1 3 64 192 0.374 8.06 4.03 Intr + 87586 87723 138 0 0 37 81 62 0.011 0.14 4.04 Intr + 99338 99508 171 1 0 113 47 161 0.012 13.72 4.05 Intr + 99702 99797 96 2 0 116 64 23 0.304 1.99 4.06 Intr + 104630 104748 119 1 2 77 53 102 0.379 3.94 4.07 Intr + 108438 108571 134 0 2 104 71 87 0.764 7.97 4.08 Intr + 110904 111017 114 1 0 47 76 198 0.999 14.00 4.09 Intr + 111529 111696 168 2 0 87 63 153 0.998 11.70 4.10 Intr + 115125 115336 212 1 2 47 89 270 0.993 20.71 4.11 Intr + 120322 120490 169 0 1 52 96 139 0.954 9.70 4.12 Term + 130979 131145 167 0 2 100 38 182 0.847 11.50 4.13 PlyA + 131189 131194 6 1.05 5.07 PlyA - 131384 131379 6 1.05 5.06 Term - 141896 141636 261 2 0 90 38 112 0.367 0.94 5.05 Intr - 145791 145627 165 0 0 82 86 20 0.215 0.44 5.04 Intr - 150890 150496 395 0 2 92 90 268 0.447 20.85 5.03 Intr - 152244 152163 82 0 1 -22 121 68 0.012 -2.61 5.02 Intr - 162029 161920 110 0 2 50 78 99 0.176 4.18 5.01 Init - 162728 162650 79 2 1 64 107 24 0.623 3.17 5.00 Prom - 162882 162843 40 -3.45 6.02 PlyA - 164185 164180 6 1.05 6.01 Sngl - 168801 168538 264 0 0 74 42 152 0.716 4.15 6.00 Prom - 169273 169234 40 -4.95 7.04 PlyA - 170923 170918 6 1.05 7.03 Term - 173012 172959 54 2 0 117 54 71 0.962 3.18 7.02 Intr - 173916 173851 66 0 0 47 106 64 0.680 2.28 7.01 Init - 175451 175410 42 2 0 59 100 34 0.796 2.17 7.00 Prom - 179649 179610 40 -3.15 8.03 PlyA - 179997 179992 6 1.05 8.02 Term - 187936 187314 623 2 2 47 48 259 0.122 11.39 8.01 Init - 189518 188852 667 2 1 49 -33 304 0.120 9.82 8.00 Prom - 189611 189572 40 -6.15 9.05 PlyA - 190146 190141 6 1.05 9.04 Term - 190913 190690 224 0 2 -78 34 267 0.352 0.80 9.03 Intr - 191200 191038 163 1 1 64 71 75 0.141 2.03 9.02 Intr - 214186 214098 89 2 2 77 47 89 0.362 2.47 9.01 Init - 215342 215186 157 2 1 68 32 181 0.405 10.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 3762 3478 285 0 0 75 28 177 0.832 5.69 S.002 Init - 92413 92221 193 1 1 48 115 158 0.915 12.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_1|232_aa MGKFLDTYTLPRLNQEEVQSLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILP NQIQQHIKMLIHHDQVGFIPGMQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYFKIIRAIYEKPTANIILNGQKLEAFPLKTGTRQG >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_1|699_bp atgggtaaattccttgacacatacactctcccaagactaaaccaggaagaagttcaatct ctgaatagaccaataacaggatctgagattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcattctgataccaaagccaggcagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactgcca aaccaaatccagcagcacatcaaaatgcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacaga gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacatacttcaaa ataataagagctatctatgagaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatga >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_2|312_aa MWGRDINNDYLAERSIEEVYYLWCLAGGDLEKELVNKEIIRSKPPICTLPNQSNLPHSNS NNELSAAATLPLIIREKDTEYQLNRIILFDRLLKGAIHAKYDAIDKDTPIPTDRQIEVDI PRCHQYDELLSSPEGHAKFRRVLKAWVVSHPDLVYWQALAYACMSAFIPKYLYNFFLKDN SHVIQERFNWTYISTWLGKPQNDGRRQKALLTWQRQEKMRKKQKQKPLKNPSDLVRLIHY YENSTGKSSSHDSITTPRSLPQHVGILEDTIQVEIWVGTQPNHVIYKIHVIIGSKGLFEK SKGRLLVLMDCQ >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_2|939_bp atgtggggcagagatataaataatgattacctggcagaaagatctattgaagaagtgtat tacctttggtgtttggctggaggtgacttggagaaagagcttgtcaacaaggaaatcatt cgatccaaaccacctatctgcacactccccaaccagtctaatttacctcattcaaacagc aataatgagttgtctgcagctgccacgctccctttaatcatcagagagaaggatacagag taccaactaaatagaattattctcttcgacaggctgctaaagggagctattcatgccaag tacgatgcaattgataaagacactccaattcctacagatagacaaattgaagtggatatt cctcgctgtcatcagtacgatgaactgttatcatcaccagaaggtcatgcaaaatttagg cgtgtattaaaagcctgggtagtgtctcatcctgatcttgtgtattggcaagccttggct tatgcatgtatgtctgcttttattcccaaatacctgtataacttcttcttaaaagacaac tcacatgtaatacaagagaggtttaattggacttacatttccacatggctggggaagccc cagaatgatggcaggaggcaaaaggcacttcttacatggcagcggcaggagaagatgagg aagaagcaaaagcagaaacccctgaagaacccgtcagatctcgtgagacttattcactat tatgagaatagcacgggcaagagcagctcccatgattcaattaccacaccccggtccctc ccacaacatgtgggaattctggaagatacaattcaagttgagatttgggtggggacacag ccaaaccatgttatctataaaattcatgtgatcattggaagcaaaggattatttgagaaa agtaagggaagactacttgtactgatggactgtcaataa >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_3|106_aa MFPGASHGSCLQRTWSNHSLAGSQCLCQHLDLLAMPQPVCLAVHSGQTPHSLIHPLPRHA WLSLGRVASRPVARAEHSLPVQVDGTGPVGPSKTQAKAPLATEVST >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_3|321_bp atgttccctggtgccagccatggaagctgcttgcaacgcacctggtccaaccacagcctt gcaggaagccagtgcctatgccagcacctggacctgcttgccatgccacagccagtgtgc ctggctgtgcacagtggccagaccccacactcactcatacaccccttgccacgccatgcc tggctctcccttggccgcgtggcatccaggccagtagcacgagctgagcacagcctgcca gtccaagtggatggaacgggcccagtgggtcccagcaaaactcaggcaaaggcaccactg gccacagaggtttccacctga >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_4|604_aa MSSVLIKGRHIDEKTHKEGDVKTGAKIEVMWPQAKEAKECQQAPETGKGTEGFLLKASKG GVTLPTQASFLATNYIAKAKKPFTIGEDLILPAAKDICHEILGEDADQKLLTKGYASQPV TVVTLQAITLMINKALLSKFMNLFILQCTSLTQKKTPPTTTRKTGLPKYLPLSLGTQAST KTRGMELLHRAPALSLWLLLPLRLSAPDAAFQPPRPNAEERPRLVVSFPLPKLATKGHIA REKEEVRSVFAWPPIVGDVRSPSARPPSLGSEEHLFPAVILSRKIFCRLLAKMANNDAVL KRLEQKGAEADQIIEYLKQQVSLLKEKAILQATLREEKKLRVENAKLKKEIEELKQELIQ AEIQNGVKQIPFPSGTPLHANSMVSENVIQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIE KKGEKKEKKQQSIAGSADSKPIDVSRLDLRIGCIITARKHPDADSLYVEEVDVGEIAPRT VVSGLVNHVPLEQMQNRMVILLCNLKPAKMRGVLSQAMVMCASSPEKIEILAPPNGSVPG DRITFDAFPGEPDKELNPKKKIWEQIQPDLHTNDECVATYKGVPFEVKGKGVCRAQTMSN SGIK >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_4|1815_bp atgtcgagcgttcttataaaagggagacacatagacgagaagacacacaaagaaggtgat gtgaagacaggggcaaagattgaagtgatgtggccacaagccaaagaagccaaggaatgc caacaggcaccagaaactggaaaaggaacagaaggatttttattgaaagcctccaaaggg ggtgtgaccctgccaacacaagcatcattcttagcgactaactacatagctaaagctaag aagccctttactatcggtgaagatttgatcctgcctgctgctaaagacatctgtcatgaa attttaggagaggatgccgatcaaaagcttctcacaaaaggctatgcttcccagcctgtc acagtggtcaccctgcaggctataacccttatgataaataaagctctcctttctaagttt atgaacctcttcattcttcagtgtacctccctcacacaaaaaaagaccccacccactacg acacggaaaactggattacccaaatacctgcctctgagtctggggacgcaggcaagcaca aagacaagggggatggagcttctacacagggccccagcgctgtcgctgtggctgctgctg ccgctacggcttagtgcaccagacgctgcatttcagccaccgcgacccaacgctgaggaa agaccccgacttgtggtgtccttccccttgcccaaactggccacgaaaggacatatagcg agagagaaagaggaagtgaggagcgtctttgcctggccgcccatcgtcggggatgtgagg agcccctctgcccggccgcccagtctgggaagtgaggagcacctcttcccggccgtcatc ctgtctaggaagattttctgccgtctcttggcaaaaatggcaaataatgatgctgttctg aagagactggagcagaagggtgcagaggcagatcaaatcattgaatatcttaagcagcaa gtttctctacttaaggagaaagcaattttgcaggcaactttgagggaagagaagaaactt cgagttgaaaatgctaaactgaagaaagaaattgaagaactgaaacaagagctaattcag gcagaaattcaaaatggagtgaagcaaataccatttccatctggtactccactgcacgct aattctatggtttctgaaaatgtgatacagtctacagcagtaacaaccgtatcttctggt accaaagaacagataaaaggaggaacaggagacgaaaagaaagcgaaagagaaaattgaa aagaaaggagagaagaaggagaaaaaacagcaatcaatagctggaagtgccgactctaag ccaatagatgtttcccgtctggatcttcgaattggttgcatcataactgctagaaaacac cctgatgcagattctttgtatgtggaagaagtagatgtcggagaaatagccccaaggaca gttgtcagtggcctggtgaatcatgttcctcttgaacagatgcaaaatcggatggtgatt ttactttgtaacctgaaacctgcaaagatgaggggagtattatctcaagcaatggtcatg tgtgctagttcaccagagaaaattgaaatcttggctcctccaaatgggtctgttcctgga gacagaattacttttgatgctttcccaggagagcctgacaaggagctgaatcctaagaag aagatttgggagcagatccagcctgatcttcacactaatgatgagtgtgtggctacatac aaaggagttccctttgaggtgaaagggaagggagtatgtagggctcaaaccatgagcaac agtggaatcaaataa >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_5|363_aa MESAGQWKFVSWFPMLRAKSILNPRGGLSSSSVKNELEDHKEGRKAISEIKSSCSSKQSE EHKKVKVHMKEQLCQGERRLCGESIFKTGKAMTDPNKMIINLALFGMTQSGKSSAGNILL GSTDFHSSFAPCSVTTCCSLGRSCHLHSFMRRGGLEVALQVQVLDTPGYPHSRLSKKYVK QEVKEALAHHFGQGGLHLALLVQRADVPFCGQEVTDPVQMIQRTCLATNVMTIQQATLLS NIQLFSGQNRLKDFKLNQPLSSVFGSNSKVQFLSQFKELLGHAWMNYTAILFTHAEKIEE AGLTEDKYLHEASDTLKTLLNSIQHKYVFQYKKGKSLNEQRMKILERIMEFIKENCYQVL TFK >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_5|1092_bp atggaatctgcaggccagtggaagtttgttagctggtttcccatgctgcgggcaaagagc atactaaatccccgtggaggtctctctagcagcagtgtgaagaatgaattggaggaccat aaggaaggcagaaaggccatttcagaaattaagtcctcatgttcaagcaaacaatcagaa gaacataaaaaggtgaaagttcatatgaaagaacaactgtgccagggcgagagacgtctt tgtggagaaagcattttcaagactggaaaagccatgacagaccccaacaagatgatcatc aacttggccctctttggcatgactcagagtggaaaaagttctgctggaaacattctgctg ggaagcacagactttcacagcagctttgctccctgttctgtgaccacatgttgtagcctg ggccgcagttgtcacctccacagcttcatgcgtcgaggtgggctagaggtagccctgcag gtccaggtgttggacactccaggttatccacacagcaggctgagcaagaagtatgtgaaa caggaagtcaaagaggctctggcacatcacttcgggcaagggggtctccaccttgcactc ctggttcagagagcagatgtgcctttctgtgggcaggaagtaactgacccagtccagatg atccagagaacatgtttggccactaacgtgatgactattcagcaagcaacactactttca aatattcaactattttctggccaaaatcgacttaaagattttaagctaaatcaaccactc tcttctgtttttggcagtaattcaaaagtgcaatttctctcacagtttaaggaacttctt ggacatgcttggatgaattacacagccattctttttacccatgcagaaaaaatagaagag gctgggcttactgaagataaatatttacatgaggcctctgataccctgaaaacgctgcta aattctattcagcacaaatacgttttccagtacaaaaaaggaaaatcactcaatgaacaa agaatgaaaatcttagaaagaatcatggaatttataaaagagaactgttaccaagttctt acatttaaataa >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_6|87_aa MLWVKRWRKELQPFGDPRPGSSPSQGCDALFGTLQFLASQSFQSPLHFPVPAGEAACSAS APAAALQRAGTHAGIWSCPPCGIRQCV >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_6|264_bp atgttgtgggtgaagagatggagaaaagagctgcagccctttggggaccccaggcctggg agctccccaagccagggctgtgatgccctctttgggaccctgcagttcctggcatctcaa agcttccagtcaccactgcatttcccagtgccagctggggaagctgcttgcagtgcatct gctccagctgcagccttgcagagagctggcacccatgccggcatctggagctgcccaccc tgtggcatcagacagtgtgtctga >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_7|53_aa MFAKSRWKTEIQNKSVSQLLLFENGNFQGTSQNFRKPCEDVLASASTAAMIVS >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_7|162_bp atgtttgccaaaagcagatggaaaacagaaattcaaaacaagtccgtatctcagcttctc ttgtttgaaaatggaaacttccaaggcacctctcagaatttccggaaaccatgtgaagat gtgcttgcttctgcttcaactgctgctatgattgtttcctga >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_8|429_aa MGDFNTALSTLDRSTRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYAFFSAPHHTYS KTDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDYWV HNEMKAEIKMFFETDKNKDTTYQNLWDTFKAVYRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEVTKIRAELKEMEMQNPFKKLLNPGAVLEVLARAIRQKKEINGI QIGKEEVKLFLFADDMIVYLENPTVSAQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQ TKRQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEVKEDINKWKNVPFPCVG RTNKMAILPKVIYRFSAIPIKLPMTFFTELEKTTFNFIWNQERAGIAKSILSQKNKAGGI TLPDFKTVL >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_8|1290_bp atgggagactttaacacggcactgtcaacattagacagatcaacaagacagaaagttaac aaggatatccaggaattgaactcagctcttcaccaagcagacctaatagacatctacaga actctccaccccaaatcaacagaatacgcattcttctcagcaccacatcacacttattcc aaaactgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccgacaagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtatagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaaattgacacactaacatcacaattaaaa gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaagtaactaag atcagagcagaactgaaagaaatggagatgcaaaatcccttcaaaaaattactgaatcca ggagctgtgttggaagttctggccagggcaatcaggcagaagaaggaaataaatggtatt caaataggaaaagaggaggtcaaattgttcctgtttgcagatgacatgattgtatatcta gaaaaccccactgtctcagcccaaaatctccttaagctgataggcaacttcagcaaagtc tcaggatacaaaatcaacgtgcaaaaatcacaagcattcttatacaccaataacagacaa acaaagagacaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaatgaagtaaaagaggacataaacaaatggaagaacgttccattcccatgtgtagga agaaccaataaaatggccatactgcccaaggtaatttatagattcagtgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttcaatttcatatggaac caagaaagagccggcattgccaagtcaatactaagccaaaagaacaaagctggaggcatc acactacctgacttcaaaactgtactataa >gi568815594f:106216548_106447689|GENSCAN_predicted_peptide_9|210_aa MVMAIVGDCGNIGGSGGHGGYYGGSYESGSVTVVVIVKAVMAFVVAFCGGGTEPNSSLMK VEAVTLENNTCKDTERNDLVQMPPLLIPRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKG HPHQNPVRTSPSSKTKENDFDELGEEGFRRSNYSELKEEVQTHDKEVKNLEKKLDKWLTR ITNAEKSLKDLMELKTKARELRDECTSLSS >gi568815594f:106216548_106447689|GENSCAN_predicted_CDS_9|633_bp atggtaatggcgatagttggtgattgtggtaacattggtggtagtggtggtcatggtggt tattatggtggcagttatgagagtggcagtgtgacagtagtggtaatagtaaaggcagtt atggcatttgtggttgctttttgtggtggcggcactgaacccaactcctctctaatgaaa gtagaagcagtcaccttggaaaataacacctgcaaagatacagaaagaaacgatttggtc cagatgcctccgctgctgatacccaggcaaacagggtctggagtggacctccagcaaact ccaacagacttgcagctgagggtcctgactgttagaaggaaaactaacaaacagaaagga catccacaccaaaaccctgttcgtacgtcaccatcatcaaagaccaaagagaacgacttt gacgaattgggagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagtt caaacccatgacaaagaagttaaaaaccttgaaaaaaaattagacaaatggctaactaga ataaccaatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggcacgagaa ctacgtgacgaatgcacaagcctcagtagctga