GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:13:29 Sequence gi568815579f:29842610_30115066 : 272457 bp : 44.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2894 3034 141 1 0 84 49 99 0.648 3.53 1.02 PlyA + 4451 4456 6 1.05 2.09 PlyA - 9222 9217 6 1.05 2.08 Term - 15843 15833 11 1 2 142 44 -8 0.248 -1.44 2.07 Intr - 19321 19205 117 2 0 54 65 114 0.367 6.14 2.06 Intr - 31089 31020 70 0 1 80 3 43 0.083 -6.25 2.05 Intr - 31497 31399 99 0 0 112 74 30 0.517 4.31 2.04 Intr - 32091 31908 184 2 1 58 82 81 0.331 4.29 2.03 Intr - 33982 33831 152 1 2 96 49 75 0.166 3.36 2.02 Intr - 40548 40421 128 1 2 78 60 56 0.038 2.20 2.01 Init - 59260 59017 244 1 1 72 53 158 0.043 8.70 2.00 Prom - 61177 61138 40 -5.16 3.00 Prom + 78084 78123 40 -3.76 3.01 Sngl + 78573 79043 471 2 0 59 48 547 0.961 43.92 3.02 PlyA + 79546 79551 6 1.05 4.00 Prom + 85915 85954 40 -5.86 4.01 Init + 99939 100055 117 2 0 98 88 161 0.841 16.82 4.02 Term + 100214 100237 24 1 0 105 55 39 0.938 0.52 4.03 PlyA + 100680 100685 6 1.05 5.03 PlyA - 100782 100777 6 1.05 5.02 Term - 114007 113761 247 0 1 27 41 192 0.565 3.66 5.01 Init - 114379 114102 278 1 2 65 58 233 0.563 14.46 5.00 Prom - 123849 123810 40 -4.66 6.00 Prom + 132330 132369 40 -4.66 6.01 Init + 133875 133933 59 2 2 70 81 24 0.138 0.68 6.02 Intr + 142614 142692 79 0 1 82 80 32 0.182 1.35 6.03 Intr + 143673 143808 136 2 1 68 61 94 0.215 4.84 6.04 Intr + 162752 162843 92 0 2 76 76 32 0.108 0.51 6.05 Intr + 163042 163099 58 0 1 58 121 83 0.999 7.06 6.06 Intr + 164861 165029 169 0 1 85 48 164 0.996 11.10 6.07 Intr + 166396 166687 292 1 1 57 -25 464 0.615 29.14 6.08 Intr + 168485 168627 143 1 2 58 75 94 0.995 4.25 6.09 Intr + 169676 169922 247 2 1 26 85 134 0.961 4.36 6.10 Term + 172278 172460 183 2 0 88 47 110 0.996 4.44 6.11 PlyA + 173285 173290 6 1.05 7.00 Prom + 184837 184876 40 -5.76 7.01 Init + 194829 195148 320 2 2 102 29 188 0.623 11.12 7.02 Term + 195365 195458 94 2 1 92 36 87 0.726 1.20 7.03 PlyA + 197439 197444 6 1.05 8.00 Prom + 198646 198685 40 -4.46 8.01 Init + 199063 199217 155 0 2 53 47 136 0.278 5.46 8.02 Intr + 208431 208489 59 0 2 107 81 47 0.037 4.43 8.03 Intr + 222522 222680 159 1 0 71 72 44 0.038 1.06 8.04 Intr + 223382 223447 66 0 0 71 110 44 0.085 3.78 8.05 Intr + 228726 228874 149 1 2 74 94 76 0.402 6.75 8.06 Term + 231869 231931 63 1 0 122 50 18 0.260 -0.71 8.07 PlyA + 234060 234065 6 1.05 9.05 PlyA - 236347 236342 6 1.05 9.04 Term - 236833 236705 129 1 0 54 46 67 0.060 -2.72 9.03 Intr - 241953 241770 184 2 1 51 61 173 0.672 10.69 9.02 Intr - 246276 246143 134 0 2 68 94 16 0.017 -0.36 9.01 Init - 259789 259595 195 1 0 75 50 96 0.034 3.43 9.00 Prom - 260333 260294 40 -4.46 10.00 Prom + 261114 261153 40 -5.36 10.01 Init + 262400 262841 442 1 1 71 66 258 0.603 18.15 10.02 Term + 263095 263168 74 2 2 120 44 14 0.237 -1.73 10.03 PlyA + 266236 266241 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 209026 208853 174 1 0 18 49 206 0.825 7.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_1|46_aa VLTPTCMRMILIVLTVAKLFSTCFLNIDAVWTPDSHIFTPPRSPLL >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_1|141_bp gttttaacgcctacctgcatgcggatgatcctgatagttctaacagtagctaaactgttt tccacgtgctttctgaacattgatgctgtttggacccctgacagccacatttttacgccc ccgcgcagcccgctgctctga >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_2|334_aa MKRSRCRDRPQPPPPDRREDAVQRAAELSQSLPPRRRAPPGRQRLEERTGPAGPEGKEQP PALASQSAEIAASARLPPRLGTPGPPGSWEPQRDGSRGQPLWMDPLLGILNSKHQPHICK QSLQQHNNGVECVGAPLPLDHLGEVNGTHLASSAAEKWGGYIKRKTERRVNERRRKQALP IRQEDMGGDLKVWGQLWESLHTGRELTNLLELRGKTPGNNDVGGTVSRVAEPSVHQDPQP ASSEVNSVTVCKYGCRQIAASCPQPRASGMPGRRHQMAADALRSSGREGAGPGISRKDSV SLELLGLQLEEHLQDNEANIMKSKAQEEVGCGLT >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_2|1005_bp atgaagcggagccgctgccgcgaccgaccgcagccgccgccgcccgaccgccgggaggat gcagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggctgccaccccgtctg ggaactccaggccctcctggctcctgggagccacagagagatggctcccgtgggcagccg ctgtggatggatcctttattgggcatcttgaattccaagcaccagccccacatttgcaag cagagcctccagcaacacaacaacggtgtagagtgcgtgggggccccactgccactggac cacttgggagaggtgaatggtactcacctcgcctcttctgcagcagagaagtggggagga tatatcaagaggaaaacagagaggagagtcaatgagagacggaggaaacaggccttgcct ataaggcaggaggatatggggggagacctcaaggtttgggggcagctgtgggagtctctg cacactggcagagaattaactaacctcttggagcttagagggaaaaccccggggaacaat gatgtcgggggcacggtcagcagagtggccgagccatctgtacaccaggacccgcagcca gccagctctgaggttaattctgtgacagtgtgcaaatacggctgccgccaaatcgcagcc tcgtgcccgcagcctcgcgcctcagggatgcccgggcgccgccaccagatggcagcagac gctttgcgatcgagcggccgggagggcgcggggccggggatttctcgcaaggacagtgta agcctggagctgctggggctacaactggaagaacacttgcaggataacgaagccaatata atgaaaagcaaagcccaggaagaggtggggtgtggtctcacttaa >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_3|156_aa MVNPTKFFNEPWGRISIQLFADKFPKTAENVCALSIGEKGFGYKGSCFHRIIPGFMCHGG DFTHHNGSGGKYIYGEKFDDENFILKQTGSGILSKENAGPNTNGSQFFICSAKSEWLDGE HVFFGKVKEGMNIVEAMEGFGSRNGKTSKKITIADC >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_3|471_bp atggtcaaccccaccaagttcttcaatgagccctggggccgcatctccatccagctgttt gcagacaagtttccaaagacagcagaaaatgtttgtgctctgagcattggagagaaagga tttggttataagggttcctgctttcacagaattattccggggtttatgtgtcacggtggt gacttcacacaccataatggcagtggtggcaagtacatctatggggagaaatttgatgat gagaacttcatcctgaagcagacaggttctggcatcttgtccaaggaaaatgctggaccc aacacaaacggttcccagtttttcatctgcagtgccaagagtgagtggttggatggtgag catgtgttctttggcaaggtgaaagaaggcatgaatattgtggaggccatggagggtttt gggtccaggaatggcaagaccagcaagaagatcaccattgctgactgttga >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_4|46_aa MEAPTVETPPDPSPPSAPAPALVPLRAPDVARLREEQEKPFDFAAE >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_4|141_bp atggaggcgcccaccgtggagacgccccccgacccctcgcccccttcggccccggcccct gccctggttccgttgcgcgccccggatgtggcgcggctgcgcgaggagcaggaaaagccc tttgacttcgcagcggagtga >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_5|174_aa MKTILSNQTVDIPENVNVTLKGRTVIIKGPRGTLRKDFNHINVELRLLGKKKKRLRVDKW SGNRKELATVRTICSHVQNMIKGVTLGFHYKMRVQMRPGVACSVPQAQKDELILEGNDTE LVSNSAALIQQPTTVKNKDIRKFLDGIYVSEKGTVQQVDESDLRVIQLQKEDAR >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_5|525_bp atgaagaccattctcagcaatcagactgttgacattccagaaaatgtcaacgttactctg aagggacgcacagttatcataaagggccccagaggaaccctgcggaaggacttcaatcac atcaacgtagaactcagactccttggaaagaaaaaaaagaggctccgggttgacaaatgg tcgggtaacagaaaggaactggctaccgttcggactatttgtagtcatgtacagaacatg atcaagggtgttacactgggcttccattacaagatgagggttcagatgagaccaggtgtt gcctgttcagtacctcaagcccagaaagatgaattaatccttgaaggaaatgacactgag cttgtttcaaattcagcagctttgattcagcaacccacaacagttaaaaacaaggatatc aggaaatttttggatggtatctatgtctctgaaaaaggaactgttcagcaggttgatgaa tcagatctaagagttatccagctacagaaagaagatgccagatga >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_6|485_aa MEAARPRYVGQRKGQRRFTRKKVDNDYNALRERLSTLPDKLSYNIMVPFGPFAFMPGKLV HTNEVTVLLGDNWFAKCSAKQAVGLVEHRKEHVRKTIDDLKKVMKNFESRVEFTEDLQKM SDAAGDIVDIREEIKCDFEFKAKHRIAHKPHSKPKTSDIFEADIANDVKSKDLLADKELW ARLEELERQEELLGELDSKPDTVIANGEDTTSSEEEKEDRNTNVNAMHQVTDSHTPCHKD VASSEPFSGQVNSQLNCSVNGSSSYHSDDDDDDDDDDDDDNIDDDDGDNDHEALGVRINT GKNTTLKFSEKKEEAKRKRKNSTGSGHSAQELPTIRTPADIYRAFVDVVNGEYVPRKSIL KSRSRENSVCSDTSESSAAEFDDRRGVLRSISCEEATCSDTSESILEEEPQENQKKLLPL SVTPEAFSGTVIEKEFVSPSLTPPPAIAHPALPTIPERKEVLLEASEETGKRVSKFKAAR LQQKD >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_6|1458_bp atggaagctgcaaggccacgttatgttggtcaaaggaagggccagcgcagattcacaagg aagaaggtagataatgactataatgcccttcgagaaagactcagcaccttgcctgataaa ttgtcttataatataatggtaccatttggcccttttgccttcatgccaggaaaacttgtc catactaatgaagtcactgttttactgggggacaactggtttgcaaagtgctcagcaaag caggctgtaggtttagttgagcaccggaaagaacatgtaagaaaaacaatagatgactta aaaaaagtgatgaaaaattttgaatccagagttgaattcacagaagatttgcagaaaatg agcgatgctgcaggtgatattgttgacatacgagaagaaattaaatgtgacttcgaattt aaagcaaaacaccgaattgctcataaaccgcattccaaaccaaaaacttcagatattttt gaagcagatattgcaaatgatgtgaaatccaaggatttgctagctgataaagaactgtgg gctcgacttgaagaactagagagacaggaagaattgctgggtgaacttgatagtaagcct gatactgtgattgcaaatggagaagatacgacatcttctgaagaggaaaaggaagatcgt aacacaaatgtgaatgcgatgcatcaagtaacagactctcatactccttgtcataaggat gttgcaagttcagaaccattcagtggtcaagtgaatagtcagttgaactgttcagtgaat ggttccagttcttaccacagtgatgatgatgatgatgatgatgatgacgacgacgacgac aacattgacgacgatgatggtgataacgaccatgaggctttaggggtccgaataaatact ggaaagaataccactttaaaattcagtgaaaagaaagaagaagccaaacgtaaacgaaag aacagcactggcagtggccactctgcccaggagctgccgaccatcaggacgcctgcagac atttacagagcctttgttgatgttgtgaatggagaatatgtccctcgcaaatccatcctg aagtctcgaagtagagagaatagtgtgtgtagcgacactagtgaaagcagtgctgctgaa tttgatgataggcggggagttttgaggagtatcagctgcgaagaagccacttgcagtgac accagtgagagcattttggaagaggaaccacaagaaaatcaaaagaaacttttgccctta tcagtaacacctgaggctttttctggaactgttatagaaaaagaatttgtatcaccttcc ttaacaccacccccagccattgctcatcccgcactacccactattccagaacgaaaggaa gttctgttggaagcatctgaagaaactggaaagagggtttcaaagtttaaagctgccaga ttgcaacagaaagactag >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_7|137_aa MASPKSMPKDAQMMAQILKDLGITEYEPRLINQMLEFAFRYVTTILDDAKIYSSHAKKAT VDADDVQLAIQFHADQSFTSLPPRDFFIRYRKAKKSNPFSINQAIFRAKVYSTDAYFTVS SCKIFNSCNISSSECSD >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_7|414_bp atggcttctcccaagagcatgccgaaagatgcacagatgatggcacaaatcctgaaggat ctgggaattacagaatatgagccaagacttataaatcagatgttagagtttgccttccgt tatgtgaccacaattctagatgatgcaaaaatttactccagccatgctaagaaagctacc gttgatgcagatgatgtgcagttggcaatccagttccacgctgaccagtcttttacctct cttcccccaagagatttttttattagatatcgcaaggcaaagaaatcaaaccccttttcc attaatcaagccatattcagggcaaaggtttacagtacagatgcctacttcacagtctcc agctgtaaaatcttcaattcctgcaacatcagcagttcagaatgttctgattaa >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_8|216_aa MWEGLELPRELLNGFDKNTDKGVDNEIQADVVSDGDEELGNWSKGDSCYVLVNQHRTTQK AVWLRCNAAFGKAPAKPIRSCDIPQACMEPFCVPVLCVPRGTKAWFLDSGASRLGREVDV EQEMATTSSSSYHLSNPRTIAPIKVTGQCPAARGEGGDLVPPIWHRGGQGIWTAAIGEVP TPGPGFGPSMSLGRRWAVPCLSSAQWEQGMLTLDEL >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_8|651_bp atgtgggaaggtttggaacttcctagagaattgttgaatggctttgacaaaaatactgat aaaggtgtggacaatgaaatccaggctgacgtggtctcagatggagatgaagaacttggg aactggagcaaaggcgactcttgttatgttttagtaaatcagcacagaaccacgcagaag gcagtgtggctgcggtgcaatgctgcatttggaaaagctcctgcaaaaccaatcaggagc tgtgacatccctcaagcttgcatggaacccttctgtgtgccagtgctgtgtgtgccccgg ggcaccaaggcctggtttttggactctggggcctcacgtctaggaagggaggtggatgtg gaacaggaaatggcaactaccagcagcagctcctatcatcttagcaacccaaggacaatt gctcctataaaagtcacaggtcagtgcccggctgctagaggggagggaggggatctcgtg cccccgatctggcaccggggtgggcagggcatatggacggcagccattggcgaggtgccc acaccaggccctggcttcgggcccagcatgagcctgggccggcggtgggcagttccctgc ctctcctcagcacagtgggaacaagggatgctgactctagatgagctctag >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_9|213_aa MSPGHTCLYKNDCRRIDHLGSGGTAGPDIAQSVAPQILLSELRASKFSPVSLNSEPELGD RFVAIVIDWTQGLLQHLWPNGNNSGHHSQGASSVLGTGMHYVIQTLTPARGSPGNVESLT TGRQTLTGSFANATQITANANLAMVQTPRSNFLSVPAHSPPKVTDKIEFDTPLEDVHHQN KGVNQERARGSLWNRDGKGDLQAETEGRHRESS >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_9|642_bp atgtccccaggccacacctgcctctacaagaatgactgcagaagaattgatcaccttggt agcggtggaactgcagggccagacattgcccaaagtgtggccccacagatcctactttct gagcttagagcctccaagttcagtcctgtctctctgaactcagagcctgagctaggagac agattcgtggccatcgtaattgattggactcagggacttctccaacatctctggcccaat ggtaacaacagtggccaccattcacagggtgcttcctctgtcctgggcactggtatgcat tacgtcatccaaaccctcacaccagccagagggagcccagggaatgtggaaagtttaacg acaggcaggcagaccctgactggcagcttcgccaatgcaacccagataacggcaaacgca aacctcgccatggtccagacacctcgctccaacttcctctccgtgccagcacatagcccg ccaaaggtcacggacaagatagaatttgacacaccactggaggatgtgcaccaccaaaac aaaggtgtaaaccaagaaagagcaagaggtagcttatggaacagagatggcaaaggcgat ttgcaggctgagactgaaggaagacaccgggagagcagctga >gi568815579f:29842610_30115066|GENSCAN_predicted_peptide_10|171_aa MGPADTPVSMELCVVGTGPGDTPVSMELCVVGTGPGDTPVSMELCVLGTGPSDTPVSMEL CVVGTGPGDTAVFIELCMVGTGPGDTPVFMELCMVGMGPRDTPASMELCVVAMAMGSGHS VVLGDSAATVPFGLWISSPVGQQAQCPGPPRNSDSGVRFGGQSSVASSGPL >gi568815579f:29842610_30115066|GENSCAN_predicted_CDS_10|516_bp atgggccctgcggacacccctgtatccatggagctctgtgtggtggggacgggtcctggt gacactcctgtgtccatggagctctgtgtggtggggacgggtcctggtgacactcctgtg tccatggagctctgtgtgttggggacgggacctagtgacactcctgtgtccatggaactc tgtgtggtggggacaggtcctggtgacaccgctgtgttcattgagctctgtatggtgggg acgggtcctggtgacacccctgtgttcatggagctctgtatggtggggatgggtcctcgt gacactcctgcgtccatggagctctgtgttgtggccatggccatgggttctggtcactct gtggtactgggagactcagcagccactgtgccttttggactctggatctccagtcctgtg ggacagcaagcccaatgcccaggtcctcctcgaaactctgactctggagtaaggtttggt ggacagagctctgttgcctcctcaggcccattgtga