GENSCAN 1.0    Date run: 4-Nov-116    Time: 23:53:18

Sequence gi568815594f:73736751_73937286 : 200536 bp : 36.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   3782   3821   40                              -3.75
 1.01 Init +   3909   3972   64  2  1   90  121   120 0.999  15.08
 1.02 Intr +   4792   4927  136  2  1  116   92    73 0.999   9.21
 1.03 Intr +   5199   5282   84  0  0   93  107    74 0.974   7.82
 1.04 Term +   8014   8092   79  1  1  106   54    45 0.858  -0.84
 1.05 PlyA +   8306   8311    6                               1.05

 2.00 Prom +  12628  12667   40                              -2.95
 2.01 Init +  25313  25376   64  1  1   78   55    68 0.169   3.86
 2.02 Term +  27204  27343  140  1  2   96   54    51 0.201  -0.36
 2.03 PlyA +  27613  27618    6                               1.05

 3.03 PlyA -  27966  27961    6                               1.05
 3.02 Term -  29224  28776  449  2  2   58   48   246 0.122  11.99
 3.01 Init -  34513  34276  238  1  1   62   99   103 0.212   7.02
 3.00 Prom -  42284  42245   40                              -5.65

 4.00 Prom +  44873  44912   40                              -5.45
 4.01 Init +  44929  45012   84  0  0   82  116   -16 0.440   1.47
 4.02 Term +  51461  51892  432  1  0 -328   54   962 0.573  45.51
 4.03 PlyA +  52612  52617    6                               1.05

 5.00 Prom +  59572  59611   40                              -2.35
 5.01 Init +  63195  63385  191  2  2   64   88    60 0.288   2.06
 5.02 Term +  66102  66219  118  0  1   92   48   119 0.379   5.33
 5.03 PlyA +  66770  66775    6                               1.05

 6.00 Prom +  67494  67533   40                              -2.55
 6.01 Init +  71682  71744   63  2  0   62  116    56 0.859   7.00
 6.02 Term +  88890  88952   63  2  0   87   36    69 0.021  -1.49
 6.03 PlyA +  89754  89759    6                               1.05

 7.02 PlyA -  89888  89883    6                               1.05
 7.01 Sngl -  90313  89927  387  1  0   37   45   228 0.556   9.36
 7.00 Prom -  97924  97885   40                              -3.65

 8.00 Prom +  98223  98262   40                              -7.05
 8.01 Init + 100001 100109  109  1  1   70   81   160 0.988  11.93
 8.02 Intr + 100214 100346  133  0  1   94   78   134 0.841  11.78
 8.03 Intr + 100453 100536   84  1  0   97   68    58 0.651   2.72
 8.04 Intr + 111154 111263  110  1  2   81   75    37 0.095   0.71
 8.05 Term + 111964 112088  125  2  2  106   33    89 0.822   2.77
 8.06 PlyA + 112511 112516    6                               1.05

 9.00 Prom + 115876 115915   40                              -6.95
 9.01 Init + 116613 116712  100  2  1   83   78   159 0.952  12.88
 9.02 Intr + 117033 117159  127  1  1  140   84    38 0.467   7.42
 9.03 Term + 117285 117372   88  0  1   86   37    79 0.454  -1.25
 9.04 PlyA + 117460 117465    6                               1.05

10.03 PlyA - 117748 117743    6                              -0.45
10.02 Term - 118298 118051  248  0  2   66   55   169 0.569   6.37
10.01 Init - 126962 126890   73  2  1   66  116    71 0.916   8.98
10.00 Prom - 130327 130288   40                              -2.65

11.00 Prom + 132606 132645   40                              -5.75
11.01 Init + 132721 132820  100  0  1   80   94   167 0.540  14.97
11.02 Intr + 132919 133042  124  2  1  114  103   164 0.999  19.22
11.03 Intr + 133156 133239   84  1  0  137  109   105 0.974  15.52
11.04 Intr + 137081 137188  108  2  0   81  111    12 0.318   1.28
11.05 Intr + 157362 157426   65  0  2  107   56     7 0.004  -3.16
11.06 Term + 162493 162623  131  2  2   72   39   162 0.939   7.06
11.07 PlyA + 163314 163319    6                               1.05

12.04 PlyA - 164001 163996    6                               1.05
12.03 Term - 170026 168819 1208  2  2   30   39   417 0.001  21.83
12.02 Intr - 181219 181097  123  2  0   52   77   102 0.004   5.14
12.01 Init - 196711 196669   43  1  1   76   80    50 0.129   3.63

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  37505  37295  211  1  1   27   36   193 0.869   3.78

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_1|120_aa
MTSKLAVALLAAFLISAALCEGAVLPRSAKELRCQCIKTYSKPFHPKFIKELRVIESGPH
CANTEIIVKLSDGRELCLDPKENWVQRVVEKFLKSPQQSPECDVPLPVSMCSPCSIPTYE

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_1|363_bp
atgacttccaagctggccgtggctctcttggcagccttcctgatttctgcagctctgtgt
gaaggtgcagttttgccaaggagtgctaaagaacttagatgtcagtgcataaagacatac
tccaaacctttccaccccaaatttatcaaagaactgagagtgattgagagtggaccacac
tgcgccaacacagaaattattgtaaagctttctgatggaagagagctctgtctggacccc
aaggaaaactgggtgcagagggttgtggagaagtttttgaagagcccacaacagtcccca
gagtgtgatgttccccttcctgtgtccatgtgttctccttgttcaattcccacctatgag
tga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_2|67_aa
MVTLETQDVIRKEHGVGFKGTVSLSKWSSLMPALKWTAVSSTPAIQFSPHSLTHTLLQGV
KSCWLSK

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_2|204_bp
atggtaaccctagagacacaggatgtgattagaaaggaacatggagtaggatttaagggt
actgtaagtctgagtaagtggagttcactcatgccagcactgaagtggacagctgtttcc
agcactcctgcaatccagttctcgcctcattcactcacgcacacccttctgcaaggagtt
aagagctgctggctgagtaaatga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_3|228_aa
MSFGSEEEYFPDPFTGLTTEVPHLLSQQLSTLVGGSTLSNEAGTGLHEPWNQLAALVPAG
ANSTHLHLLCSTPFRRECTEVLVLEGGMLPPGDTTMISLNWKLRLSPGHLAFLLPLSQQA
KKGVTLLAGVTDSDHQDEISLLLHNGGKEECTWNTGDPLGCLLVLPCPMIKVNGKLQRPN
PDRTTNGPEPSGIKVSVTLPGKKPQPAEVLAEGKGNTEWVVEVVINTS

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_3|687_bp
atgtcttttggtagtgaagaagaatatttccctgaccctttcacaggactcacaacagag
gtgcctcatttactcagtcagcagctttcaactcttgtgggagggagcacgctatcaaat
gaggcaggaactggattgcatgagccctggaaccagctggctgctttggtgccagcagga
gcaaactccactcacttgcacctgctgtgttccacccctttcaggagggagtgcacagag
gtcttagttctagagggaggaatgctgccaccaggagacacaactatgatttcattaaac
tggaagttaagattgtcacctggccaccttgcgttcctcctacctctaagtcaacaggct
aagaaaggagttacattgttggctggggtgactgactcagaccatcaagatgaaatcagc
ctactactccacaatggaggtaaggaagagtgcacatggaatacaggagatcccttaggg
tgtctcttagtattaccatgccctatgataaaggtcaatgggaaactacaacggcccaat
ccagacaggactacaaatggcccagaaccttcaggaataaaggtttcggtcactctacca
ggtaaaaaaccacaacctgctgaagtgcttgctgaaggcaaagggaatacagaatgggta
gtagaagtagtcatcaataccagctaa

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_4|171_aa
MGETAPMIQLPPPGPALDMWRLWGLQFKEKEEEEEKEEEEDEEKEEDEKEEDEEDEKKED
EKEEEEKEDEEKEEEEKEEDEKEEDEEEEEKEKEEKKEEEKEEEEKEKEEEEEKEEEEEK
EEEEKKKRQQSQSPFHTRISLFHSGSFTSNITKDFIEAKRKVELVTRVCML

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_4|516_bp
atgggggaaacagctcccatgattcaattacccccacctggtcctgcacttgacatgtgg
agattatggggattacaattcaaggagaaggaggaggaggaggaaaaggaggaggaggag
gatgaggagaaggaggaggatgagaaagaggaggatgaggaggatgagaagaaggaggat
gagaaggaggaggaagaaaaggaggatgaggagaaggaggaggaggagaaggaggaggat
gagaaggaggaggatgaggaggaggaggagaaggagaaggaggagaaaaaggaggaggag
aaggaggaggaggagaaggagaaggaggaggaagaggagaaggaggaggaagaggagaag
gaggaggaggagaagaagaagagacaacagagtcagagtccctttcacactaggatttct
ctctttcattctggaagttttacttctaatattacaaaggacttcattgaagccaagcgg
aaagttgaacttgtgacaagagtgtgtatgctgtga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_5|102_aa
MPACELHQLAYLEYLVGVIGEELFLLEPMYKGWKLYLLLQMCNHKHKPIMITDNQGNMTP
IGMSSFMSVEVELAEATDNPFSTAATAVPTLAATTLGKKQRV

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_5|309_bp
atgcctgcctgtgaactccaccaactggcctatctagaatatctggtgggagtgattggt
gaagagctgtttcttttggagccaatgtataaaggctggaagctgtatctacttcttcaa
atgtgcaaccacaaacacaagcccataatgatcacagataatcagggaaacatgacacca
ataggaatgagtagctttatgtctgttgaggtggagcttgcagaagcaactgacaacccc
ttttccactgctgctacagcagtacccacccttgctgccacaacactggggaaaaaacaa
agagtctga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_6|41_aa
MDTKKGAIDIRAFFKVEGGKKSLAKDSWRNRELAVLLTVTS

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_6|126_bp
atggacacaaagaagggagcaatagacatcagagccttctttaaggtggagggtgggaag
aagagtctagctaaagacagttggaggaacagggagttggcagtcctactcacggtaact
tcatag

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_7|128_aa
MKNYFLWMNKEWFLEMESTPGENAVMIVEMITKDTEYYIINFVDKATSGIERIDSNFGRG
SAVDKMLPNNIMCYREIFCERKSQSMPQISLLTSFKKLPQPPQPSAANFLISEQPSTLRQ
DSPPGKVL

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_7|387_bp
atgaagaattacttcttatggatgaacaaagagtggtttcttgaaatggaatctactcct
ggtgaaaatgctgtgatgattgttgaaatgataacaaaggatacggagtattacatcata
aattttgttgataaagcaacatcaggaattgaaaggattgactccaattttggaagaggt
tctgctgtggataaaatgctgccaaacaacataatgtgctatagagaaatcttttgtgaa
aggaagagtcaatcaatgccacaaatctcattgctgacttcttttaagaaattgccacag
ccacctcaaccatcagcagccaacttcctgatcagtgagcagccatcaacattgaggcaa
gactctccaccaggaaaagtattatga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_8|186_aa
MSLPSSRAARVPGPSGSLCALLALLLLLTPPGPLASAGPVSAVLTELRCTCLRVTLRVNP
KTIGKLQVFPAGPQCSKVEVVASLKNGKQVCLDPEAPFLKKVIQKILDSARPFHALQVLL
LLSLLLTALPSCTNGQSKRNLGKSKDNSVVSYRAMLKDGRKMYLDPEAPRIKKIVQKMLE
GDGSGA

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_8|561_bp
atgagcctcccgtccagccgcgcggcccgtgtcccgggtccttcgggctccttgtgcgcg
ctgctcgcgctgctgctcctgctgacgccgccggggcccctcgccagcgctggtcctgtc
tctgctgtgctgacagagctgcgttgcacttgtttacgcgttacgctgagagtaaacccc
aaaacgattggtaaactgcaggtgttccccgcaggcccgcagtgctccaaggtggaagtg
gtagcctccctgaagaacgggaagcaagtttgtctggacccggaagccccttttctaaag
aaagtcatccagaaaattttggacagtgccagaccatttcatgccctgcaggtgttgctg
cttctgtcactgctgctgaccgctctgccttcctgcaccaatgggcaaagcaagagaaac
ttagggaaaagcaaagataattctgttgtctcttacagagccatgctgaaagatgggagg
aaaatgtacctggacccagaagctcccagaatcaagaagatagtccagaaaatgttggaa
ggtgatgggtcaggtgcttaa

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_9|104_aa
MSSAARSRLTRATRQEMLFLALLLLPVVVAFARAEAEEDGDLQCLCVKTTSQVRPRHITS
LEVIKAGPHCPTAQLIATLKNGRKICLDLQALLYKKIIKEHLES

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_9|315_bp
atgagctccgcagccaggtcccgcctcacccgcgccacccgccaggagatgctgttcttg
gcgttgctgctcctgccagttgtggtcgccttcgccagagctgaagctgaagaagatggg
gacctgcagtgcctgtgtgtgaagaccacctcccaggtccgtcccaggcacatcaccagc
ctggaggtgatcaaggccggaccccactgccccactgcccaactcatagccacgctgaag
aatgggaggaaaatttgcttggatctgcaagccctgctgtacaagaaaatcattaaggaa
catttggagagttag

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_10|106_aa
MRAWNFVMLGENLPESEFGPEKSKDVLENLSWKAFLNYPSEPNVFTRLLMRGKPDIKGRR
VSRDRGEVEVGVMQVEDWRKGYEARNAGEKGEEVDSLLEPSEGIQL

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_10|321_bp
atgagggcctggaattttgtaatgcttggagagaacctacctgagagcgaatttggccca
gagaaaagcaaagatgtattagagaacttgagttggaaagcgtttctgaattatccaagt
gagcccaatgtattcacacgcctccttatgagaggtaagccggatataaaaggtagaaga
gtcagcagagacagaggagaagtagaggttggtgtgatgcaggttgaagactggagaaag
ggctatgaggcaaggaatgctggtgaaaaaggagaggaagtagattctcttctagagcct
tcagaaggaatacagctctga

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_11|203_aa
MARAALSAAPSNPRLLRVALLLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSV
NVKSPGPHCAQTEVIATLKNGRKACLNPASPIVKKIIEKMLNSMYNHFHLTAGNFESYVA
ECQDLFSFFRVLGSILTSSLPHILSVLLLLSNSCSKQTALSMFIEYQQPALNFRHFEDVH
ETQPLISRDSESIVDRRGANITV

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_11|612_bp
atggcccgcgctgctctctccgccgcccccagcaatccccggctcctgcgagtggcactg
ctgctcctgctcctggtagccgctggccggcgcgcagcaggagcgtccgtggccactgaa
ctgcgctgccagtgcttgcagaccctgcagggaattcaccccaagaacatccaaagtgtg
aacgtgaagtcccccggaccccactgcgcccaaaccgaagtcatagccacactcaagaat
gggcggaaagcttgcctcaatcctgcatcccccatagttaagaaaatcatcgaaaagatg
ctgaacagcatgtataatcattttcatttgactgcaggaaattttgaatcttatgttgct
gagtgtcaagatttgttctctttctttagagtcttgggctccattttgacaagcagcttg
cctcacatcctctcagtcctcctgctactcagcaattcctgcagcaaacaaactgctctg
tcaatgttcatcgaatatcaacaacctgcactaaatttcaggcactttgaggatgttcat
gaaacacagcccttgatctcaagagattcagagtctattgtggatcggaggggagcaaac
atcacagtttag

>gi568815594f:73736751_73937286|GENSCAN_predicted_peptide_12|457_aa
MSALNERLKNPENSVNTFFQTRQNITTNRSTLNQRSSTFFLAPGTCFVEDNFSKDAQNLL
KLISNFSKVSRYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKD
LKENYKPLLSEIKEDTNKWKNIPCSWVGRINIMKMAILLKVIYRFNAIPTKLQMTFFTEL
EKATLKFIWNQKRACITKSILSQKNKAGGITLPDFKLYYKAMVTKTAWYWYQNRDIDQWN
RTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKIN
SRWIKDLNIRPKTIKTLEENLGSTIQDIGMGKDFMSETPKAMATKARIDQWDLIKLKSFC
TAKETAIRVNRQPTKWEKIFATYSSDKGLISRIYNEVKQIYQKKTNNPIKKWMKDMNRHF
SKEDIYAAKKTHEKMLTITGHQRKANQNHNEIPSHTS

>gi568815594f:73736751_73937286|GENSCAN_predicted_CDS_12|1374_bp
atgtctgcacttaatgagcgtctaaagaatcctgaaaattcagttaacacatttttccag
acacgacaaaatataacaactaatagaagtactttaaatcagcggtcctcaacctttttt
ttggcaccagggacctgttttgtggaagacaatttttccaaggatgcccaaaatctcctt
aagctgataagcaacttcagcaaagtctcaagatacaaaatcaatgtacaaaaatcacaa
gcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattc
acaattgcttcaaagagaataaaatacctaggaatccaacttacaagggacgtgaaggac
ctcaaggagaactacaaaccactgctcagtgaaataaaagaggatacaaacaaatggaag
aatattccatgctcatgggtaggaagaatcaatatcatgaaaatggccatactgctcaag
gtaatttatagattcaatgccatccccaccaagctacaaatgactttcttcacagaattg
gaaaaagctactttaaagttcatatggaaccaaaaaagagcctgcatcaccaagtcaatc
ctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaag
gctatggtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaac
agaacagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacct
gagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactgg
ctagccatatgtagaaagctgaaactggatcctttccttacaccttatacaaaaattaat
tcaagatggattaaagacttaaacattagacctaaaaccataaaaaccctagaagaaaac
ctaggcagtaccattcaggacataggcatgggcaaggacttcatgtctgaaacaccaaaa
gcaatggcaacaaaagccagaattgaccaatgggatctaattaaactaaagagcttctgc
acagcaaaagaaactgccatcagagtgaacaggcaacctacaaaatgggagaaaattttc
gcaacctactcatctgacaaagggctaatatccaggatctacaatgaagtcaaacaaatt
taccagaaaaaaacaaacaaccccatcaaaaagtggatgaaggacatgaacagacacttc
tcaaaagaagacatttatgcagccaaaaaaacacatgaaaaaatgctcaccatcactggc
catcagagaaaagcaaatcaaaaccacaatgagataccatctcacaccagttag