GENSCAN 1.0	Date run: 5-Nov-116	Time: 07:35:54

Sequence gi568815596f:113027687_113232868 : 205182 bp : 46.42% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3016   3087   72  0  0   93   77    76 0.474   8.07
 1.02 Intr +   9376   9472   97  0  1   85   67    65 0.210   3.68
 1.03 Intr +  23540  23709  170  0  2   49   37   105 0.039   1.17
 1.04 Intr +  31726  31781   56  0  2   82  117    55 0.552   5.58
 1.05 Intr +  33166  33251   86  1  2   89   81    14 0.876   0.26
 1.06 Intr +  34438  34565  128  2  2  112   78    55 0.587   7.30
 1.07 Term +  34767  34991  225  2  0  126   49   335 0.998  30.18
 1.08 PlyA +  36582  36587    6                               1.05

 2.00 Prom +  38064  38103   40                              -4.76
 2.01 Init +  42711  42928  218  2  2   63   13   108 0.202  -0.99
 2.02 Intr +  46643  46728   86  2  2   38   99   109 0.879   6.46
 2.03 Intr +  47037  47164  128  1  2  143   78    30 0.951   7.90
 2.04 Term +  47466  47678  213  2  0  111   50   175 0.648  13.23
 2.05 PlyA +  48136  48141    6                               1.05

 3.03 PlyA -  49002  48997    6                               1.05
 3.02 Term -  49903  49696  208  0  1   35   38   371 0.155  23.61
 3.01 Init -  84971  84955   17  2  2   84   93    25 0.369   2.27
 3.00 Prom -  88428  88389   40                              -4.26

 4.04 PlyA -  89815  89810    6                               1.05
 4.03 Term -  93471  93259  213  0  0   -2   43   335 0.663  17.23
 4.02 Intr -  97271  97146  126  2  0   45   72    62 0.393   1.18
 4.01 Init -  97413  97375   39  0  0   62   97    24 0.555   0.88
 4.00 Prom -  97628  97589   40                              -7.76

 5.00 Prom +  99873  99912   40                              -4.86
 5.01 Init +  99939 100054  116  2  2   54  110    88 0.835   5.35
 5.02 Intr + 101890 101978   89  1  2   96  113    70 0.943   9.91
 5.03 Intr + 103359 103471  113  1  2  150   83    32 0.976   9.00
 5.04 Term + 104970 105185  216  2  0  107   38   382 0.989  32.24
 5.05 PlyA + 106063 106068    6                               1.05

 6.00 Prom + 110186 110225   40                              -4.96
 6.01 Init + 129846 129978  133  2  1   66   20   214 0.888  10.70
 6.02 Intr + 130569 130667   99  1  0   64   82    40 0.584   1.08
 6.03 Intr + 130838 130992  155  0  2   84   82    55 0.597   4.29
 6.04 Intr + 141578 141618   41  1  2   87   96    10 0.115  -1.28
 6.05 Intr + 146943 146969   27  0  0  100   94    73 0.663   6.33
 6.06 Intr + 147772 147893  122  1  2   64   41    56 0.510  -1.36
 6.07 Intr + 148880 148926   47  0  2  101   77    24 0.668   0.73
 6.08 Intr + 154660 155826 1167  0  0  100   99   335 0.678  24.68
 6.09 Intr + 158340 158569  230  2  2   56   98    93 0.542   3.77
 6.10 Intr + 164694 164903  210  0  0   26   72   143 0.097   4.53
 6.11 Intr + 165362 165442   81  2  0  112   52    98 0.992   7.35
 6.12 Intr + 165572 165684  113  2  2  113   63   110 0.552  11.02
 6.13 Intr + 165906 165964   59  1  2   44  110    25 0.561  -1.10
 6.14 Intr + 166173 166262   90  2  0   73   95    88 0.945   8.19
 6.15 Intr + 168041 168084   44  1  2  115   67    89 0.996   6.54
 6.16 Intr + 168461 168621  161  2  2   54   70   233 0.543  17.73
 6.17 Intr + 169878 169948   71  1  2   62   97    44 0.988   1.60
 6.18 Intr + 170061 170227  167  2  2   44  103   232 0.736  19.06
 6.19 Intr + 171054 171198  145  0  1   72   68   168 0.976  13.48
 6.20 Intr + 171397 171540  144  0  0   73   83   351 0.982  33.58
 6.21 Term + 173472 173729  258  2  0   55   48   463 0.999  34.55
 6.22 PlyA + 175209 175214    6                               1.05

 7.08 PlyA - 175451 175446    6                               1.05
 7.07 Term - 179552 179356  197  0  2   95   55    78 0.462   2.77
 7.06 Intr - 187672 187597   76  1  1  118   82    27 0.803   4.19
 7.05 Intr - 190176 190061  116  1  2   67   78   104 0.934   7.47
 7.04 Intr - 192492 192406   87  1  0   95   80   126 0.644  12.44
 7.03 Intr - 199570 199469  102  2  0  121  110   104 0.870  16.05
 7.02 Intr - 201941 201908   34  2  1  109   97    -9 0.498  -0.10
 7.01 Init - 204932 204855   78  2  0   88   66    18 0.286   0.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 49926 49696 231 0 0 68 38 365 0.811 24.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_1|277_aa MTEFESEDTGSNGWNLKERNSKEKVRSEVLKNQVAAPSVPSHDEREHLCGSATTVGVSCW RAMVKLHSPAEASPQALEASEAQTRKAPECGSGVQKLPVALALFMGLSQQKLWGSLHPVE LKMVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVI LGVQGGSQCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPG WFLCTVPEADQPVRLTQLPENGGWNAPITDFYFQQCD >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_1|834_bp atgacggagtttgaatctgaggacacaggtagcaatggttggaacctgaaggagagaaac tccaaggagaaggtgcgctcagaagtgcttaagaaccaggtggccgctccctctgtacct tcccatgatgagcgtgagcatctgtgtggatcagcgaccacagttggagtttcctgctgg agagcaatggtcaaactacacagtccggcggaggcgagtccgcaggctctggaagcctct gaggcacagacgaggaaagcgcctgaatgtggatctggggtacagaagttgccggtggct ctggccctgttcatgggactcagtcagcagaaactgtgggggagtctacaccctgtggag ctcaagatggtcctgagtggggcgctgtgcttccgaatgaaggactcggcattgaaggtg ctttatctgcataataaccagcttctagctggagggctgcatgcagggaaggtcattaaa ggtgaagagatcagcgtggtccccaatcggtggctggatgccagcctgtcccccgtcatc ctgggtgtccagggtggaagccagtgcctgtcatgtggggtggggcaggagccgactcta acactagagccagtgaacatcatggagctctatcttggtgccaaggaatccaagagcttc accttctaccggcgggacatggggctcacctccagcttcgagtcggctgcctacccgggc tggttcctgtgcacggtgcctgaagccgatcagcctgtcagactcacccagcttcccgag aatggtggctggaatgcccccatcacagacttctacttccagcagtgtgactag >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_2|214_aa MGSSWTGPALNFFQLFELTLTQSGSHQLAIHLHPGPSTLLPHLDLGTATTLPTVLLLQPK KPQWFGQGEPIIKIKYADQKALYTRDGQLLVGDPVADNCCAEKICILPNRGLARTKVPIF LGIQGGSRCLACVETEEGPSLQLEDVNIEELYKGGEEATRFTFFQSSSGSAFRLEAAAWP GWFLCGPAEPQQPVQLTKESEPSARTKFYFEQSW >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_2|645_bp atgggatccagctggactggaccagcattgaatttcttccagctctttgagctgacactg acccagagtgggagtcatcagcttgctatccaccttcacccagggccctccactttgttg ccccacctagatctgggcacagctaccacactgcccactgtcctgctgctacaaccaaag 
aagccccagtggtttggccaaggggagcccatcatcaaaattaaatatgcagaccagaag gctctatacacaagagatggccagctgctggtgggagatcctgttgcagacaactgctgt gcagagaagatctgcatacttcctaacagaggcttggcccgcaccaaggtccccattttc ctggggatccagggagggagccgctgcctggcatgtgtggagacagaagaggggccttcc ctacagctggaggatgtgaacattgaggaactgtacaaaggtggtgaagaggccacacgc ttcaccttcttccagagcagctcaggctccgccttcaggcttgaggctgctgcctggcct ggctggttcctgtgtggcccggcagagccccagcagccagtacagctcaccaaggagagt gagccctcagcccgtaccaagttttactttgaacagagctggtag >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_3|74_aa MEEEERLGDRARPCPEEEEEEEEKEEEEEKEKKEEEEEEEEEEEEEERRKKRKKKKRKKK KKKGRRRRRNRNQV >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_3|225_bp atggaggaagaggaacgcctgggtgatagagcaagaccctgtcctgaggaggaggaggag gaggaggagaaggaggaggaggaggagaaggagaagaaggaggaggaggaggaagaagaa gaggaggaagaggaggaggagaggaggaagaagaggaagaagaagaagaggaagaagaag aagaagaaggggaggaggaggaggaggaataggaatcaagtgtga >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_4|125_aa MSLQIAEEKWLAKSEFREYSPGEKTLGAPGTSLLCVISPLCPEKWPSLSFVDMHRKTKTK TMKKNKRKRKKRKRKRKKKKKKRRRRRRKKKKRKKKKKKKRRRRRKKKRRRGRKKKKKKK KLLKA >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_4|378_bp atgtccttacagattgcagaagagaagtggcttgccaagtctgaattcagagagtattct cctggagaaaagacgctcggggctcctggaacctccctgctctgtgtcatttccccgctg tgtcctgagaagtggcccagtctgtcctttgtcgacatgcacaggaagacgaagacgaag acgatgaagaagaataagaggaagaggaagaagaggaagaggaagaggaagaagaagaag aagaagaggaggaggaggaggaggaagaagaagaagaggaagaagaagaagaagaagaag aggagaaggaggaggaagaagaagaggagaagggggaggaagaagaagaagaagaagaag aaactactcaaggcttag >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_5|177_aa MEICRGLRSHLITLLLFLFHSETICRPSGRKSSKMQAFRIWDVNQKTFYLRNNQLVAGYL QGPNVNLEEKIDVVPIEPHALFLGIHGGKMCLSCVKSGDETRLQLEAVNITDLSENRKQD KRFAFIRSDSGPTTSFESAACPGWFLCTAMEADQPVSLTNMPDEGVMVTKFYFQEDE >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_5|534_bp atggaaatctgcagaggcctccgcagtcacctaatcactctcctcctcttcctgttccat tcagagacgatctgccgaccctctgggagaaaatccagcaagatgcaagccttcagaatc 
tgggatgttaaccagaagaccttctatctgaggaacaaccaactagttgctggatacttg caaggaccaaatgtcaatttagaagaaaagatagatgtggtacccattgagcctcatgct ctgttcttgggaatccatggagggaagatgtgcctgtcctgtgtcaagtctggtgatgag accagactccagctggaggcagttaacatcactgacctgagcgagaacagaaagcaggac aagcgcttcgccttcatccgctcagacagtggccccaccaccagttttgagtctgccgcc tgccccggttggttcctctgcacagcgatggaagctgaccagcccgtcagcctcaccaat atgcctgacgaaggcgtcatggtcaccaaattctacttccaggaggacgagtag >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_6|1187_aa MGKGPRGGLGRAGAPPAVAALDGDRVPCSALATPASTCGAAVAEGWTCCSGQCTRQTSVS MVRMSRSGDQYQSEDWANVIFPVCEAYKEESFIHSSSRYLLSVNYESGMQALRILLLLFA YGLERENKQVGTTFSLQAQMGSRTILEKLEERPQVQTEASVLQLQAREFHSWFLPAATSV IRQLLGLQRDRTEPPQSSEKLKGHFNVEGRYGFPVSPAASAPPSPHSETVKADAPGHSWA AFAQWMMGDYRLPDHPQPMEILNLYLGDSLEPHPGECPRETCSHEDPPEPFEEQTWATDP PEPTRQNVPPWGSGVELTHLGSWVHQDGLEPCQEQTRATDPPESTRQDAPPWGSGVELTH LGSPSAQREHRQNTASPGSPVNSHLPGSPKQNRSTSTQVVFWAGILQAQMCVLDLEEELE KTEGLKAGLKCCLPTPPVDLPGDTGLHSSPPENEDSGEDSSEPEGEGQAWLREGTPDSSP QWGAEEESMFFSNPLFLASPCSENSASGECFSWGASDSHAGVRTGPESPATLEPPLPEDT VLWELESEPDLGDGAAISGHCTPPFPVPIYKPHSICWASVAAAEGAPAAPPGHGESEEGS PQLQHHSSGILPKWTLDASQSSLLETDGEQPSSLKKKEAGEAPKPGEEVKSEGTARPAET GDVQPDIHLTSAEHENLRTPMNSSWLPGSPMPQAQSPEEGQRPPAGDKLANGVRNNKVAW NLASRLYRLEGFRKSEVAAYLQKNNDFSRAVAEEYLSFFQFGGQSLDRALRSFLQALVLS GETQERERILYQFSRRFHHCNPGIFPSVDSVHTLTCAIMLLNTDLHGQNIGKSMSCQEFI TNLNGLRDGGNFPKELLKALYWSIRSEKLEWAVDEEDTARPEKAQPSLPAGKMSKPFLQL AQDPTVPTYKQGILARKMHQDADGKKTPWGKRGWKMFHTLLRGMVLYFLKQGEDHCLEGE SLVGQMVDEPVGVHHSLATPATHYTKKPHVFQLRTADWRLYLFQAPTAKEMSSWIARINL AAATHSAPPFPAAVGSQRRFVRPILPVGPAQSSLEEQHRSHENCLDAAADDLLDLQRNLP ERRGRGRELEEHRLRKEYLEYEKTRYETYVQLLVARLHCPSDALDLWEEQLGREAGGTRE PKLSLKKSHSSPSLHQDEAPTTAKVKRNISERRTYRKIIPKRNRNQL >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_6|3564_bp atgggtaagggaccacggggaggcctgggccgggcgggggcacccccggccgtcgccgcc ctcgacggtgaccgtgtgccgtgcagcgctctcgccacgccggcgagcacctgcggagcg gctgtcgctgagggatggacttgctgctctggtcaatgcactcgccagacctctgtgtcc 
atggtccgcatgtctaggtcaggggaccagtatcaaagtgaagactgggcaaacgtgata tttcctgtatgtgaagcatataaagaagagtcattcattcactcatctagcagatattta ctgagtgtcaactacgaatcaggcatgcaggcactgcgaatactactgctcttatttgca tatgggttggaaagagaaaataagcaagtaggcaccaccttctctctccaagctcagatg ggctccaggaccatcctggagaagctcgaggagagaccccaggttcagacagaggcctct gtcctgcagctccaggccagggagttccactcatggttcctgcctgcagcaacatctgta attcggcagcttctcgggctgcagagggacaggacagagccacctcagagctcagaaaaa ctgaagggccactttaatgtagagggtagatatggattcccagtttctccagcggccagt gctccccctagtccacacagtgagaccgtgaaagcagatgctccggggcactcctgggca gcttttgctcagtggatgatgggtgactacagactccctgaccacccccagcccatggaa attctcaacctgtacttgggagacagcctggagccccacccaggagagtgcccaagggaa acgtgcagccatgaggatccaccggagcctttcgaggagcaaacctgggccactgaccct cctgaacctaccagacaaaatgttcctccctggggctccggtgtggagctcacacacctg gggagctgggtccatcaggacgggctggagccttgccaggagcaaacccgggccactgac cctcctgaatctaccagacaagatgctcctccctggggctccggtgtggagctcacacac ctggggagcccctctgcccagagggagcacaggcagaacacagcatcaccagggtcacca gtgaacagccatctaccggggagcccaaagcagaaccggagcacgtccacacaggtagtg ttctgggcaggcatcctgcaggcccagatgtgtgtcctagacctggaggaggagctggag aagacggaagggctcaaggctgggctgaaatgctgtctccccacgccccctgtggacctc cccggggacacgggcctgcactccagcccacctgagaatgaagactcaggggaagacagc agtgagcctgagggagagggccaggcatggctgagagagggaaccccagactcttcccca cagtggggagctgaggaggagagcatgttcttcagcaaccccctcttcctggcgagtcct tgctcagagaacagtgcttctggagagtgcttttcctggggggcttcagactcccatgca ggtgtgaggactggacctgagagcccagcgactctggagcctcccctcccagaagacaca gtgctgtgggagctggaaagtgagccagatttgggggacggcgctgctatcagtgggcat tgtacccctccattccctgtgcccatctataaaccacactccatctgctgggcctcagtg gctgccgctgagggggctcctgcagcacctcctggtcacggggagagtgaggagggcagc ccgcagcttcaacaccacagctcaggcattttgcccaagtggacactagatgcttcacag tcttcactcttggagacggatggggaacagccaagttccttgaagaaaaaggaggcaggg gaggccccaaaaccaggcgaggaagtaaagagtgaaggaacagccaggcctgcagagact ggagacgtccagcctgacattcacctgacttctgcagaacatgagaatctgaggacaccg atgaactcttcttggcttcctgggagccctatgccccaagcacagtccccagaggaaggc 
cagagaccaccagctggagacaagctagctaatggcgtcaggaacaacaaggtagcctgg aacttggcctcacgcctctatcgcctggagggcttccggaagtctgaagtggctgcctac ctgcagaagaacaatgactttagcagggctgtggctgaggagtacctgtccttcttccag tttggaggccagagtctggaccgagccctccggagcttcctccaggccttggtgctcagt ggggagactcaggaacgggagcgaatcctctaccagttctccagacgcttccaccattgc aatccggggatcttcccctcagtagattctgtacacaccttgacatgtgcaatcatgctg cttaacacggacctgcatggacagaacattgggaagagcatgagctgccaggaattcata accaacctgaatgggctgagggatggcgggaacttccccaaggagctgctgaaggccctc tactggtctatccgcagcgagaagctcgagtgggccgtggatgaagaagacacagccaga cctgagaaggcccagccgtccctgccagctggcaagatgagcaagcccttccttcagctg gctcaggatcccacagtgcccacctacaagcagggcatcctggctcggaaaatgcatcaa gatgcagacggcaagaagacgccatggggcaagcgtggctggaagatgttccacacctta ctgcgagggatggttctctacttcctgaagcagggagaagaccactgtctggagggggag agcttggtggggcagatggtggatgagcccgtgggggtgcaccactcgctggccaccccc gccacgcattacaccaagaagccgcacgtcttccagctgcgcacggctgactggcgcctc tacctcttccaggcacccactgccaaggagatgagctcctggatcgcgcgcatcaacttg gctgcggccacgcactccgcgccgcccttccccgccgctgtgggctcccagcgcagattc gtgcggcccatcctgcccgtgggccccgcccagagctccctggaggagcagcatcgatcc cacgagaactgcctggacgctgccgcggacgacctgctggatctacagaggaacctgccg gagcggcggggccgtggccgcgagctggaggagcaccgcctgcggaaggagtacctggag tacgagaaaacccgctacgagacctacgtgcagctgctggtggcccgcctgcactgcccc tctgatgctctggacctgtgggaggagcagctggggagggaagctggaggcactcgggag cccaagctcagcctgaagaagtcccactcgagcccgtccctgcaccaggatgaggctccc accacggccaaggtgaagcgcaacatctcagagcgcagaacctaccggaagatcatccct aagcggaaccgcaatcagctgtga >gi568815596f:113027687_113232868|GENSCAN_predicted_peptide_7|229_aa MPTLQANCRTERSPNPWDKTAGSRHQPFICSLDDLVSGREMVGPTLPGYPPHIPTSGQGS YASSAIAGMVAGSEYSGNAYGHTPYSSYSEAWRFPNSSLLSIAPPQVLLQCGIPLAAAVS QASPRELKTDIPQGLGPWEVLAQARPAQGARSLQEIHVDQPTLAEVDSDLLKPLALVTGS DMGTRPKAGCYRRLFLDLSLKLPGSRESLSAIFAPRKWILRVPGAILEA >gi568815596f:113027687_113232868|GENSCAN_predicted_CDS_7|690_bp atgcccactctccaggccaactgcaggactgagaggtcacccaacccatgggacaaaaca gctggcagcagacaccagcccttcatttgttcccttgatgaccttgtgtcagggcgagag 
atggtggggcccacgctgcccggatacccaccccacatccccaccagcggacagggcagc tatgcctcctctgccatcgcaggcatggtggcaggaagtgaatactctggcaatgcctat ggccacaccccctactcctcctacagcgaggcctggcgcttccccaactccagcttgctg agtatagcaccaccccaggtcctcctgcagtgcggcatccccttggcagctgccgtcagc caggccagccccagggagcttaaaacagacattccacagggcctgggcccctgggaggtt ctagcacaggccaggccagcccagggtgctaggagcttgcaggaaattcatgtggaccaa ccaactctggcagaagtggactctgatctccttaaacccctggctctagtgactggctca gatatgggcacacggcccaaggcaggctgctacaggagactattcctggatttgagtttg aaacttccaggaagcagagagtcactttctgctatatttgcaccaagaaaatggatcttg agagttcctggggccatcttggaagcatga