GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:37:33 Sequence gi568815587f:8811450_9019477 : 208028 bp : 43.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 16830 17009 180 2 0 101 20 109 0.484 5.46 1.02 Intr + 17286 17449 164 2 2 45 69 130 0.472 5.57 1.03 Intr + 26283 26423 141 0 0 40 72 112 0.102 4.27 1.04 Intr + 36210 36279 70 0 1 56 83 51 0.007 0.58 1.05 Intr + 56688 56753 66 2 0 54 98 33 0.000 0.00 1.06 Term + 81248 82180 933 1 0 -8 48 280 0.002 6.53 1.07 PlyA + 82212 82217 6 1.05 2.00 Prom + 83478 83517 40 -2.46 2.01 Init + 100001 100222 222 1 0 103 78 136 0.630 12.67 2.02 Intr + 103377 103481 105 2 0 53 111 41 0.890 3.31 2.03 Intr + 107888 108025 138 1 0 128 -14 153 0.015 9.76 2.04 Intr + 114196 114330 135 0 0 94 33 75 0.002 3.36 2.05 Intr + 120597 120713 117 2 0 105 105 33 0.440 7.36 2.06 Term + 121329 121403 75 2 0 60 55 71 0.254 -1.16 2.07 PlyA + 123163 123168 6 1.05 3.03 PlyA - 125166 125161 6 -0.45 3.02 Term - 126724 126167 558 1 0 73 44 464 0.339 34.85 3.01 Init - 133940 133710 231 2 0 63 47 86 0.170 0.26 3.00 Prom - 134430 134391 40 -5.56 4.06 PlyA - 134589 134584 6 1.05 4.05 Term - 137026 136871 156 1 0 58 32 194 0.992 8.73 4.04 Intr - 141888 141754 135 0 0 54 98 19 0.525 0.26 4.03 Intr - 144849 144741 109 2 1 79 93 93 0.884 9.19 4.02 Intr - 150734 150643 92 2 2 70 111 4 0.831 -0.31 4.01 Init - 152864 152760 105 2 0 98 113 274 0.999 29.12 4.00 Prom - 159279 159240 40 -7.36 5.09 PlyA - 159323 159318 6 1.05 5.08 Term - 161137 161074 64 0 1 102 42 72 0.220 1.36 5.07 Intr - 172520 172426 95 2 2 43 89 103 0.570 4.66 5.06 Intr - 172675 172623 53 2 2 88 116 26 0.996 4.03 5.05 Intr - 174401 174262 140 1 2 45 99 101 0.997 7.01 5.04 Intr - 176181 176099 83 0 2 88 98 18 0.975 1.24 5.03 Intr - 176812 176669 144 1 0 55 95 101 0.447 7.98 5.02 Intr - 192473 192313 161 0 2 34 89 366 0.361 31.01 5.01 Intr - 193122 192900 223 0 1 68 99 149 0.110 11.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 59616 59505 112 0 1 48 99 145 0.843 9.97 S.002 Term + 107888 108031 144 1 0 128 41 150 0.983 12.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:8811450_9019477|GENSCAN_predicted_peptide_1|517_aa KCLDTEQCTFNSTTRPLRLHQNSFRSPALGKAKPPAPQDAALLWFVFRGSNAAVSAGKLG GTLTKQKIPRCFAAVMSEFTQEGKLLQQRKLIRPDKSGEQRWKTGMRQETVEDKKVQEFE HSHEQNQQSINVELGHVSKHRRSLEATTSKDQSPHDGEIKARMADIDSQGNETDYASAGC KRNRNSSPVVTPTSSLPSEPSYSLQGTEEVESLNRPITGSEIEAIINSLPTKKSPGPNGF TVEFYQRYKEELVPFLLKLFQSIEKEGILPNPFYEASIILIPKPGRDTTKKENFRPISLV NINAKILSKILANRIQQHIEKLIHHDQVGFIPGMQGWFNIRKSINIIQHINRTKDKNHMT ISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRATYDKPTANIILNGQKLEAFPLKTG TRQGCPLSPLLFNIVLKVLARAIRQEKEINGIQLGKEEVKLSLFADDMIVCLENPIVSAQ NLLKLISNFSKVSGYKINVQKSQHSYTPITDKQRAKS >gi568815587f:8811450_9019477|GENSCAN_predicted_CDS_1|1554_bp aagtgcctggacacagagcagtgcaccttcaacagcacaactcggccgttaaggctgcac caaaattcattcaggagcccagccctgggcaaagcaaagcctcctgccccccaggatgca gcacttctctggtttgtcttcaggggctccaatgcagctgtatctgcggggaagctgggg ggaacactgaccaaacagaagattccacgctgctttgctgcggtcatgtctgagtttacc caagaggggaagctgctccaacagaggaagttaattagaccagacaagagcggggagcag aggtggaagacagggatgagacaagaaactgtggaggacaaaaaggtacaagaatttgaa catagccatgaacagaaccaacaatcaataaatgtagagcttggacatgttagcaaacat cgaaggtctctggaagctacaacttccaaggaccaatctccgcatgatggagaaatcaaa gccagaatggctgatattgattctcaaggcaatgaaacagactacgcaagtgctggctgc aaaagaaacagaaacagcagccccgtggtcactccaactagctctttacccagtgaacca agctactcgctgcagggtacagaagaagttgaatccctgaatagaccaataacaggctct gaaattgaggcaataattaatagcttaccaaccaaaaaaagtccaggaccaaatggattc acagtcgaattctaccagaggtacaaggaggagctggtaccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaacccattttatgaggccagcatcattctg ataccaaagcctggcagagacacaacaaaaaaagagaattttagaccaatatccctagtg aacatcaatgcaaaaatcctcagtaaaatactggcaaaccgaatccagcagcacatcgaa aagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaacata cgcaaatcaataaacataatccagcatataaacagaaccaaagacaaaaaccacatgact atctcaatagatgcagaaaaggcctttgacaaaattcaacagcctttcatgctaaaaact ctcaataaattaggtattgatgggacgtatctcaaaataataagagctacttatgacaaa cccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaaccggc acaagacagggatgccctctctcaccactcctattcaacatagtgttgaaagttctggcc agggcaatcaggcaggagaaagaaataaatggtattcaattaggaaaagaagaagtcaaa ttgtccctgtttgcagatgacatgattgtatgtttagaaaaccccatcgtctcagcccaa aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacagcattcttatacaccaataacagacaaacagagagccaaatcatga >gi568815587f:8811450_9019477|GENSCAN_predicted_peptide_2|263_aa MDNCLAAAALNGVDRRSLQRSARLALEVLERAKRRAVDWHALERPKGCMGVLAREAPHLE KQPAAGPQRVLPGEKYYSSVPEEGGATHVYRYHRGESKLHMCLDIGNGQAENISKDLYIE VYPGTYSVTVGSNDLTKKTHVVAVDSGQSVDLVFPEEDEEEETARGACIATFSSLGPSRG KSCAVGSNSGSSLAMTSEVLEASLSTKVYHQQPRTKEKEKGIHFPQRGRDRYLARGLCPP CSEETAATRAQDQAALLQPTRAP >gi568815587f:8811450_9019477|GENSCAN_predicted_CDS_2|792_bp atggacaactgtttggcggccgcagcgctgaatggggtggaccgacgttccctgcagcgt tcagcaaggctggctctagaagtgctggagagggccaagaggagggcggtggactggcat gccctggagcgtcccaaaggctgcatgggggtccttgcccgggaggcgccccacctagag aaacagccggcagccggcccgcagcgcgttctcccgggagagaaatattattcatctgtg ccagaggaaggaggggcaacccatgtctatcgttatcacagaggcgagtcgaagctgcac atgtgcttggacatagggaatggtcaggctgagaacatctctaaggacctctacatagaa gtatatccagggacctattctgtcactgtgggctcaaatgacttaaccaagaagactcat gtggtagcagttgattctggacaaagcgtggacctggtcttccctgaggaggatgaggag gaagaaacagccaggggagcgtgcattgctactttctcctctttaggaccttccaggggc aaaagctgtgctgtgggctccaactctggaagttctctggccatgacttctgaggtcctg gaagcaagtctgtccaccaaggtctaccaccaacagccaagaacaaaggaaaaagaaaag ggcatccacttcccccagaggggcagagacaggtaccttgcaaggggcttgtgcccgccc tgttcagaggagacagctgccaccagggcccaagatcaagccgcactgctgcagcccacc agagccccgtga >gi568815587f:8811450_9019477|GENSCAN_predicted_peptide_3|262_aa MGKSPGHLTKEEIQMANKHMKRCSTSYVIREMQTETTMGYQYTPIRMTKIYDTDNTKCSQ GCGATGTFFYCWWEYKMVKEEMMDNRGNSSLPDKLPIFPDSARLPLTRSFYLEPMVTFHV HPEAPVSSPYSEELPRLPFPSDSLILGNYSEPCPFSFPMPYPNYRGCEYSYGPAFTRKRN ERERQRVKCVNEGYAQLRHHLPEEYLEKRLSKVETLRAAIKYINYLQSLLYPDKAETKNN PGKVSSMIATTSHHADPMFRIV >gi568815587f:8811450_9019477|GENSCAN_predicted_CDS_3|789_bp atgggcaaaagtcctggacacctcaccaaagaggagatacagatggcaaataagcatatg aaaagatgttccacatcatatgtcatcagggaaatgcaaactgaaacaacaatgggatac caatacacacctattagaatgaccaaaatctacgacactgataacaccaaatgctctcaa ggatgtggagcaacaggaactttcttttattgctggtgggaatacaaaatggttaaagag gaaatgatggacaacagaggcaactctagtctacctgacaaacttcctatcttccctgat tctgcccgcttgccactgaccaggtccttctatctggagcccatggtcactttccacgtg cacccagaggccccggtgtcatccccttactctgaggagctgccacggctgccttttccc agcgactctcttatcctgggaaattacagtgaaccctgccccttctctttcccgatgcct tatccaaattacagagggtgcgagtactcctacgggccagccttcacccggaaaaggaat gagcgggaaaggcagcgggtgaaatgtgtcaatgaaggctacgcccagctccgccatcat ctgccagaggagtatttggagaagcgactcagcaaagtggaaaccctcagagctgcgatc aagtacattaactacctgcagtctcttctgtaccctgataaagctgagaccaagaataac cctggaaaagtttcctccatgatagcaaccaccagccaccatgctgaccctatgttcaga attgtttga >gi568815587f:8811450_9019477|GENSCAN_predicted_peptide_4|198_aa MATLWGGLLRLGSLLSLSCLALSVLLLAQLSDAAKNFEDVRCKCICPPYKENSGHIYNKN ISQKDCDCLHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTIIIYLSILGLLLLYMV YLTLVEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWK LQVQEQRKSVFDRHVVLS >gi568815587f:8811450_9019477|GENSCAN_predicted_CDS_4|597_bp atggcgaccctgtggggaggccttcttcggcttggctccttgctcagcctgtcgtgcctg gcgctttccgtgctgctgctggcgcagctgtcagacgccgccaagaatttcgaggatgtc agatgtaaatgtatctgccctccctataaagaaaattctgggcatatttataataagaac atatctcagaaagattgtgattgccttcatgttgtggagcccatgcctgtgcgggggcct gatgtagaagcatactgtctacgctgtgaatgcaaatatgaagaaagaagctctgtcaca atcaaggttaccattataatttatctctccattttgggccttctacttctgtacatggta tatcttactctggttgagcccatactgaagaggcgcctctttggacatgcacagttgata cagagtgatgatgatattggggatcaccagccttttgcaaatgcacacgatgtgctagcc cgctcccgcagtcgagccaacgtgctgaacaaggtagaatatgcacagcagcgctggaag cttcaagtccaagagcagcgaaagtctgtctttgaccggcatgttgtcctcagctaa >gi568815587f:8811450_9019477|GENSCAN_predicted_peptide_5|320_aa SRGPVGGDRPEVGEIRSAPNLGGSRQSSGPGRWTLEPRLAAWRCVSEKPSSGAGGGTRGM ARLSVIPGSATAWTGLLTEGGRKETDMREAASLRQQRRMKQAVQFIHKDSADLLPLDGLK KLGSSKDMRRLMETNLSKLRSGPRVPWASKTNKLNQAKSEGLKKSEEDDMILVSCQCAGK DVKALVDTGCLYNLISLACVDRLGLKEHVKSHKHEGEKLSLPRHLKVVGQIEHLVITLGS LRLDCPAAVVDDNEKNLSLGLQTLRSLKCIINLDKHRLIMGKTDKEEIPFVETVSLNEDN PPITGPGDGVDDVLAPHYYF >gi568815587f:8811450_9019477|GENSCAN_predicted_CDS_5|963_bp tcacgtggcccggtgggtggggaccgacctgaagttggagaaatccggagcgctcccaac ctcggagggagtcgccagtcctccgggcccgggcggtggaccctggagccccggctggcg gcgtggaggtgcgtttctgagaagccgagcagcggcgcgggcggcgggactcgaggcatg gcccggctgtcggtgatccccgggtcggccacggcgtggacagggctcctcactgagggc ggccgcaaggagaccgacatgcgggaggcggcgtcactgcgacagcagcgccggatgaag caggcggtgcagttcatccacaaggactccgccgacctgctgcccctggacggcctcaag aagctgggctcgtccaaggacatgaggcgcctcatggaaaccaacctgtctaagctccga agcggtccccgtgtcccttgggcctctaagacgaacaaactcaatcaggctaagtctgag gggctaaagaagtctgaggaggatgacatgattttggtttcttgccagtgtgctggaaag gatgtgaaagccttggttgacacaggctgcctatataatctcatctctttggcctgtgtg gacagattgggactcaaggagcatgtcaaatcccacaagcatgaaggagaaaagctttct ctaccccggcatctcaaagtagtgggccagattgagcacctagtgatcacactgggctcc ctccgcctggactgcccagcagctgtggttgatgacaatgagaaaaacttgtcccttggt ctacagactctccgatctctgaagtgcatcataaacttggataagcaccggctgatcatg gggaagacagacaaggaagaaatcccttttgtggagacagtctctttgaatgaagacaat cctcctattactggtcctggagatggggtagatgatgttcttgctcctcattactacttc tag