GENSCAN 1.0    Date run: 7-Nov-116    Time: 00:13:57

Sequence gi568815593f:161750811_161997419 : 246609 bp : 34.80% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    799   1001  203  1  2    7   84   197 0.750   9.21
 1.02 Term +  17418  18229  812  1  2   67   38   228 0.119   8.05
 1.03 PlyA +  19677  19682    6                               1.05

 2.00 Prom +  22033  22072   40                              -5.65
 2.01 Init +  29622  29887  266  2  2   86   64   209 0.065  14.83
 2.02 Intr +  35381  35481  101  2  2  -27  116    70 0.003  -2.87
 2.03 Intr +  65752  65903  152  2  2   82   72    64 0.001   3.16
 2.04 Intr +  74998  75130  133  0  1  113   92   -11 0.299   1.10
 2.05 Intr +  75286  75360   75  2  0  121   93    46 0.427   6.97
 2.06 Intr +  85187  85393  207  0  0   77   95   507 0.964  48.43
 2.07 Intr +  97381  97612  232  2  1   27   97   149 0.740   5.61
 2.08 Intr + 103348 103460  113  1  2   90   92   154 0.999  15.10
 2.09 Intr + 114911 114978   68  0  2   96  115   111 0.998  12.31
 2.10 Intr + 122307 122527  221  2  2   47   94   153 0.356   7.98
 2.11 Intr + 124750 124832   83  1  2   57   62    50 0.182  -2.34
 2.12 Intr + 131058 131213  156  1  0   19   13   161 0.571   0.76
 2.13 Intr + 131748 131891  144  1  0   89  111   211 0.933  22.83
 2.14 Intr + 134385 134588  204  1  0   50   81   107 0.838   4.45
 2.15 Intr + 140088 140240  153  1  0   91  111    29 0.948   4.62
 2.16 Intr + 144856 145058  203  2  2  113   94    83 0.984   9.48
 2.17 Term + 146301 146612  312  2  0   90   36   206 0.997   9.62
 2.18 PlyA + 148132 148137    6                               1.05

 3.00 Prom + 154774 154813   40                              -3.65
 3.01 Init + 168402 168461   60  2  0   85  119    37 0.775   7.80
 3.02 Intr + 175029 175216  188  2  2   52   69   203 0.663  12.37
 3.03 Term + 182246 182327   82  2  1   88   43    73 0.496  -1.01
 3.04 PlyA + 182993 182998    6                               1.05

 4.00 Prom + 183378 183417   40                              -2.65
 4.01 Sngl + 227063 227464  402  1  0   52   50   138 0.631   2.42
 4.02 PlyA + 228170 228175    6                               1.05

 5.02 PlyA - 228793 228788    6                               1.05
 5.01 Sngl - 238834 238523  312  1  0   47   41   202 0.841   7.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  29622  29942  321  2  0   86   39   220 0.929  12.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:161750811_161997419|GENSCAN_predicted_peptide_1|338_aa
XVVGGCDIDKFQKPLFGGQLTEDLLGPPAVWTGGLDEHNQLPRLDFAVHELLSHADGLRS
PREESSVAVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADEMIVYLENPIISAQNLLKLI
SNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIRYLGIQLTRDVKDLFK
ENYKPLLNEIKEDTNKWKNSPCSCVERINIMKMAILPKVIYRFNAIPIKLPMTFFTELEK
TTLKLIWNQKRAHTAKSILSQKNKAGGIKLPDFKLYYKATVTKTAWYLYQNRNIDQWNRT
EPSEIMPHIYNHLIFDKPEKNKQWGKDSLLINGAGKTG

>gi568815593f:161750811_161997419|GENSCAN_predicted_CDS_1|1017_bp
nntgtggttggaggctgtgatatcgacaaattccagaagcctttgtttggtggacaattg
actgaggatctcttgggccctcctgcagtatggacaggtgggcttgatgaacacaaccag
cttcccaggctggattttgcagttcacgaactcttgagccatgcagatgggctgcgatct
ccccgggaagaatcctcagttgcagtgttggaagttctggccagggcaattaggcaggag
aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat
gaaatgattgtatatctagaaaaccccatcatctcagcccaaaatctccttaagctgata
agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaggcattctta
tacaccaataacaggcaaacagagagccaaattatgagtgaactcccattcacaattgct
tcaaagagaataagatacctaggaatccaacttacaagggatgtaaaggacctcttcaag
gagaactacaaaccactgctcaatgaaataaaggaggatacaaacaaatggaagaacagt
ccatgctcatgtgtagaaagaatcaatatcatgaaaatggccatactgcccaaggtaatt
tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa
actactttaaagctcatatggaaccaaaaaagagcccacactgccaagtcaatcctaagc
caaaagaacaaagctggaggcatcaaactacctgacttcaaactatactacaaggctaca
gtaaccaaaacagcatggtacttgtaccaaaacagaaatatagaccaatggaacagaaca
gagccctcagaaataatgccacatatctacaaccatctgatctttgacaaacctgagaaa
aacaagcaatggggaaaggattccctattaataaatggtgctgggaaaactggctag

>gi568815593f:161750811_161997419|GENSCAN_predicted_peptide_2|940_aa
MVEGKEEQVTAYVDGSSQKQRAYAEKFCLMKPSNLVKRIHYHENSTGKTCPQDSITSHWV
PPTTCGIQDEICLGLRRTISGLYVSFKVSICLNQGEEDFGMTVFNWLKTIPLPHTSSIRQ
GKGIKCDIRSHRITNYVNNGFSFEIITDSIQENFHSYSQQYTAGNAADKGSNQVNIIFTT
LKTVSKAAVGSRMRHYLYLSNSFAIQTSLKKMIRIKSRYWQLLLTFARCVHQFLGTILAY
AHWYVETQNDFSLKQQQQQEQKQQKMKTKTEMKTKKKEEEEEEEEEEEEEEEEEEEEEEE
GEEEEGGGGGRSADWILGSKFGCEIFSKGARRVHDGSDQVSERQSEDAPLLWRARTRTRR
LALAPVSPRFSLPDFSPVLRDPVSRGGLSYGQPSLQDELKDNTTVFTRILDRLLDGYDNR
LRPGLGERVTEVKTDIFVTSFGPVSDHDMEYTIDVFFRQSWKDERLKFKGPMTVLRLNNL
MASKIWTPDTFFHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRLTVRAECPMHLEDFPMD
AHACPLKFGSSWVTRVKLHQKKKKKKEEEEVEGEGEGEGGAGGGGGGGGGEGEGEGEEEE
DTYAYTRAEVVYEWTREPARSVVVAEDGSRLNQYDLLGQTVDSGIVQSSTVTLNYTSSGT
TLAHQESTLIDFSRKKQRPLCKVCNAVFYISLWSPENIKINDLEYHYLPVSVRRLENDGE
YVVMTTHFHLKRKIGYFVIQTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTL
SISARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGYAWDGKSVVPEKPKK
VKDPLIKKNNTYAPTATSYTPNLARGDPGLATIAKSATIEPKEVKPETKPPEPKKTFNSV
SKIDRLSRIAFPLLFGIFNLVYWATYLNREPQLKAPTPHQ

>gi568815593f:161750811_161997419|GENSCAN_predicted_CDS_2|2823_bp
atggtggaaggcaaggaggagcaagtcacagcttacgtggatggcagcagtcaaaaacag
agagcttatgcagaaaaattctgccttatgaagccatcaaatcttgtgaaacgtattcac
tatcatgagaacagcacgggaaagacttgcccccaggattcaattacctcccattgggtc
cctcccacgacatgtggaattcaggatgagatttgtctgggactcagacgaaccatatca
ggactctatgtgtcatttaaagtcagtatttgcctgaaccaaggtgaagaggactttgga
atgactgtctttaattggttaaaaactatacctcttcctcatacgagctctataaggcaa
ggtaaaggaatcaagtgtgacatacggagccatagaattacaaattatgtgaataatggc
ttttcttttgagattatcactgactccatacaagaaaattttcattcctactctcagcaa
tacacagcaggtaatgctgcagataaagggagcaaccaggtaaacattatctttaccaca
ctgaaaacagtaagcaaagccgccgttggttccagaatgagacactatctgtatctttca
aactcttttgctatacaaacatccttgaaaaaaatgatcagaatcaaaagcagatattgg
cagctgttacttacgtttgcaagatgtgttcatcagtttcttggtaccatattggcatat
gcccactggtatgtagaaacacaaaacgatttctccctgaaacaacaacaacaacaggag
cagaagcagcagaagatgaagacaaagacagagatgaagacaaagaagaaggaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
ggggaggaggaggaaggaggaggaggaggaagatctgcagattggatattgggaagcaaa
tttgggtgtgaaatcttcagcaaaggagcacgcagagtccatgatggctcagaccaagtg
agtgagaggcagagcgaggacgcccctctgctctggcgcgcccggactcggactcgcaga
ctcgcgctggctccagtctctccacgattctctctcccagacttttccccggtcttaaga
gatcctgtgtccagagggggccttagctatggacagccgtcattacaagatgaacttaaa
gacaataccactgtcttcaccaggattttggacagactcctagatggttatgacaatcgc
ctgagaccaggattgggagagcgtgtaaccgaagtgaagactgatatcttcgtcaccagt
ttcggacccgtttcagaccatgatatggaatatacaatagatgtatttttccgtcaaagc
tggaaggatgaaaggttaaaatttaaaggacctatgacagtcctccggttaaataaccta
atggcaagtaaaatctggactccggacacatttttccacaatggaaagaagtcagtggcc
cacaacatgaccatgcccaacaaactcctgcggatcacagaggatggcaccttgctgtac
accatgaggctgacagtgagagctgaatgtccgatgcatttggaggacttccctatggat
gcccatgcttgcccactaaaatttggaagttcctgggtgacaagagtgaaactgcatcaa
aaaaagaaaaaaaaaaaagaagaagaagaagtagaaggagaaggagaaggagaaggaggg
gcagggggagggggaggaggaggaggaggagaaggagaaggagaaggagaagaagaagaa
gacacttatgcttatacaagagcagaagttgtttatgaatggaccagagagccagcacgc
tcagtggttgtagcagaagatggatcacgtctaaaccagtatgaccttcttggacaaaca
gtagactctggaattgtccagtcaagtacagtaacgctgaattatacttcctctgggacc
actttagctcaccaagagtccaccttgatagatttcagtagaaagaaacagagacctttg
tgtaaggtctgcaatgctgtattttacatctctttgtggagccctgaaaacatcaaaata
aatgaccttgagtaccattaccttcctgtttctgtcagaagactagagaatgatggagaa
tatgttgttatgaccactcatttccacttgaagagaaagattggctactttgttattcaa
acatacctgccatgcataatgacagtgattctctcacaagtctccttctggctcaacaga
gagtctgtaccagcaagaactgtctttggagtaacaactgtgctcaccatgacaacattg
agcatcagtgccagaaactccctccctaaggtggcttatgcaacagctatggattggttt
attgccgtgtgctatgcctttgtgttctcagctctgattgagtttgccacagtaaactat
ttcactaagagaggttatgcatgggatggcaaaagtgtggttccagaaaagccaaagaaa
gtaaaggatcctcttattaagaaaaacaacacttacgctccaacagcaaccagctacacc
cctaatttggccaggggcgacccgggcttagccaccattgctaaaagtgcaaccatagaa
cctaaagaggtcaagcccgaaacaaaaccaccagaacccaagaaaacctttaacagtgtc
agcaaaattgaccgactgtcaagaatagccttcccgctgctatttggaatctttaactta
gtctactgggctacgtatttaaacagagagcctcagctaaaagcccccacaccacatcaa
tag

>gi568815593f:161750811_161997419|GENSCAN_predicted_peptide_3|109_aa
MLIITTAAITYSPNALPDTLSSDSVLCITAALAVAKRGQGTAQAMAPEGASPNPWQLLHG
VEPAGAQRSRIEVWEPPPGFQRRYCVLYKLEVCDNFTSSKAFGAIYPTT

>gi568815593f:161750811_161997419|GENSCAN_predicted_CDS_3|330_bp
atgttgataataactacggcagctattacttattcacctaatgcactgccagacactctg
tctagtgactcggtgctctgcatcactgctgctctagcagtggctaaaaggggccaaggt
acagctcaggccatggctccagagggtgcaagccccaatccttggcagcttctacatggt
gttgagcctgcaggtgcacagaggtcaagaattgaagtttgggaacctccacctggattt
cagaggagatactgtgttttatacaaattggaggtttgtgacaactttacatcaagcaag
gctttcggggccatttatccaacaacatga

>gi568815593f:161750811_161997419|GENSCAN_predicted_peptide_4|133_aa
MHTFLEKGNLPNLEEKNSFSSTKAMATKAKIDKWDLIKLKSFCTAKETTIRVKRQPTEWE
NIFTTYSSDKELISRIYNELKKIYKRKTNNPIKKWAKDMNRHFSKEDIYAAKKHRKKNAH
HYWPSEKCKSKPQ

>gi568815593f:161750811_161997419|GENSCAN_predicted_CDS_4|402_bp
atgcatacctttcttgaaaaaggaaatttaccaaacttggaagaaaaaaattccttcagt
tccacaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaag
agcttctgcacagcaaaagaaactaccatcagagtgaaaaggcaacctacagaatgggag
aacattttcacaacctactcatctgacaaagagctaatatccagaatctacaatgaactc
aaaaaaatttacaagagaaaaacaaacaaccccatcaaaaagtgggcaaaggatatgaac
agacacttctcaaaagaagacatttatgcagccaaaaaacacaggaaaaaaaatgctcat
cattactggccatcagagaaatgcaaatcaaaaccacaatga

>gi568815593f:161750811_161997419|GENSCAN_predicted_peptide_5|103_aa
MPSQKFAAGAGPLWRTSVRTVWKGKVGWEPQHRVPTGALPSTAVRRGPQSSRHQNGRSTD
SLHSAPGKDTDLNASYESCQEGGYTLQSHRELPTTMGPPVASL

>gi568815593f:161750811_161997419|GENSCAN_predicted_CDS_5|312_bp
atgcccagtcagaagtttgctgccggggcagggcccttgtggagaacctctgtgaggaca
gtatggaagggaaaggtcgggtgggagccccaacacagagtccctactggggcactgcct
agtacagctgtgagaagagggccacagtcctccagacaccagaatggtagatccactgac
agcttgcactctgctcctggaaaagacacagacctcaatgccagctatgaaagctgccag
gaggggggctataccctgcaaagccacagggagctgcccacgaccatgggcccacctgtt
gcatcactgtga