GENSCAN 1.0    Date run: 8-Nov-116    Time: 05:14:15

Sequence gi568815582r:54183689_54386050 : 202362 bp : 46.63% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -     16     11    6                               1.05
 1.03 Term -  10229  10075  155  0  2  115   49   124 0.698   9.28
 1.02 Intr -  20663  20556  108  0  0   72  100    46 0.519   4.46
 1.01 Init -  23441  23333  109  2  1   49   78   109 0.601   6.38
 1.00 Prom -  27582  27543   40                              -3.66

 2.05 PlyA -  30279  30274    6                               1.05
 2.04 Term -  30525  30460   66  0  0   91   45    36 0.164  -2.46
 2.03 Intr -  33535  33428  108  1  0   75   81    47 0.183   3.18
 2.02 Intr -  37932  37830  103  2  1  107   40    79 0.407   5.18
 2.01 Init -  41566  41508   59  1  2   90   76    20 0.615   1.78
 2.00 Prom -  43740  43701   40                              -6.06

 3.03 PlyA -  44440  44435    6                               1.05
 3.02 Term -  44635  44469  167  2  2  101   44   123 0.930   7.28
 3.01 Init -  48339  48291   49  0  1   48  106    13 0.803   0.21
 3.00 Prom -  50067  50028   40                              -4.66

 4.00 Prom +  50877  50916   40                              -3.56
 4.01 Init +  56196  56314  119  2  2   79   74    77 0.363   5.07
 4.02 Intr +  67198  67261   64  1  1   82   89    18 0.130   0.02
 4.03 Intr +  73453  73539   87  0  0  105   29    79 0.173   3.87
 4.04 Intr +  74348  74495  148  1  1   62   47    88 0.151   1.91
 4.05 Intr +  81340  81529  190  2  1  126   78    47 0.123   6.14
 4.06 Intr +  86400  86649  250  0  1   64   30   117 0.117   0.94
 4.07 Term +  91692  91874  183  2  0   39   48   143 0.506   2.94
 4.08 PlyA +  91878  91883    6                               1.05

 5.08 PlyA -  93316  93311    6                               1.05
 5.07 Term - 100052  99998   55  1  1  118   32    48 0.430  -0.77
 5.06 Intr - 100624 100558   67  2  1   96   95    64 0.910   5.86
 5.05 Intr - 101925 100809 1117  0  1   67   62  1591 0.923 143.66
 5.04 Intr - 102347 102096  252  2  0   76   99   477 0.384  45.23
 5.03 Intr - 103797 103592  206  1  2   80   22    54 0.217  -3.08
 5.02 Intr - 104173 104065  109  1  1   76   86    74 0.344   5.86
 5.01 Init - 117345 117271   75  0  0   81   58    55 0.012   2.89
 5.00 Prom - 118468 118429   40                              -4.96

 6.00 Prom + 122178 122217   40                               0.04
 6.01 Init + 123540 123551   12  2  0  107   89    18 0.400   3.60
 6.02 Intr + 142964 143146  183  1  0   98   60    53 0.413   3.48
 6.03 Term + 146771 146836   66  1  0  127   42    47 0.610   1.94
 6.04 PlyA + 147780 147785    6                               1.05

 7.00 Prom + 149895 149934   40                              -4.76
 7.01 Init + 153155 153280  126  1  0   66   34   252 0.655  17.76
 7.02 Intr + 153296 153580  285  1  0 -222   75   835 0.715  48.74
 7.03 Intr + 159653 159731   79  1  1   76   52   110 0.422   5.32
 7.04 Intr + 172332 172375   44  1  2   41   97    66 0.007   0.76
 7.05 Intr + 186641 186809  169  1  1   71   72   114 0.085   7.62
 7.06 Term + 187133 187260  128  0  2   64   49    48 0.559  -3.06
 7.07 PlyA + 187723 187728    6                               1.05

 8.03 PlyA - 189240 189235    6                               1.05
 8.02 Term - 191657 191516  142  1  1   73   52   101 0.648   2.40
 8.01 Init - 200297 200281   17  2  2   83   98    32 0.681   3.39

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  78465  78133  333  0  0   86   55   152 0.823   7.73

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_1|123_aa
MSSGAHQPKDYPLKSRHAQVPKSSSQLNADPLQAPGDQRHFPFSAKRPEHSLTNGHPPTP
GMRETVIIGCLQGAMGARSTQVFRGRFSRAQAPSRKKCALVTPSGRSGALQRERPEAEDQ
EEG

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_1|372_bp
atgtcttctggggcccatcagcccaaggactatcccctgaaatccaggcatgcccaggtc
cctaaatcttcctcgcagctcaatgcagaccctctgcaggccccaggagatcaaaggcat
tttccattttctgccaagcgccccgagcactctttgaccaatggccaccccccgacccca
ggcatgagagaaactgtcattattggctgcttgcaaggtgccatgggcgcccggtccacg
caggtcttccgtggccgcttctctagggcccaggccccttccaggaagaagtgcgctctg
gtgacaccatcaggccgctcgggggcactgcagagggaaaggccggaagcagaggaccaa
gaggagggctga

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_2|111_aa
MDELKKKEKIKINLDFTPQRVMDNGLVFALSDFSLKYSQVPPKDYRANFTRRFLHSSHLV
GKTFGRAWGDSCPPPMQRGTESQSGFPGGQKPYLATQWKVGLIIPFDSTRL

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_2|336_bp
atggacgagctgaaaaaaaaagagaaaattaaaatcaaccttgatttcaccccacagagg
gtcatggacaatggtcttgtctttgctttgtctgatttctctctgaagtacagccaggta
cccccaaaagactacagagctaactttacaagacgcttcctgcattcctcacacttggtg
gggaagacatttggccgggcctggggagacagctgccctccaccaatgcaaagaggtact
gagtcacagtccggattccctggtggccagaaaccttatcttgccacccaatggaaagtg
ggcctcatcattccctttgacagcacccgattgtga

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_3|71_aa
MQKHLGEDLIEPSISQMFLVLKPSDSWTGINIIGSPGSQAFELDISFPRSPTCTSRLWDF
SASMIAEANIL

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_3|216_bp
atgcagaaacaccttggagaagatctcatagagcccagcatttcccaaatgttcctggtt
ctcaagccttcagactcctggactggcatcaacatcattggttcccctggttctcaggcc
tttgaactagacatcagctttcctaggtctcccacttgtactagcagattatgggacttt
tcagcttccatgattgcagaagccaatatcttatga

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_4|346_aa
MGNPAARRGAYPSVDSTSVQVSNSRLMEGVMEDVRPLQLGCLMIKVYASQNKVIIGLGGS
SLHARKKTVDEKAEAQEEQSDFMVVTHKTGRGPSECILLDISRGAGGHHSNWSPSGEGPI
LVQNYENAPQIIPKYSGERDFSSWLSVDLPGNFRLNLQWANVFSKPVLINLWVGSLNGVQ
GEGKEKEAAPFTGGPFGCCWNMSSGSSVNEGQGWRVNEEKGECQAGSGQSMNPPGVSHIN
AILLPFATLHAIGVWICQPYVPKLLESGYGELYGTQQHPSSYQGHRGSHIEERRSRKKAS
LFPANVDGLSQQNFDVQLPPKPPSTRSEAEKESKEWTEIFVLSYTY

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_4|1041_bp
atgggcaatccagctgcaagaagaggtgcatacccctctgtagacagcacatctgtgcag
gtgtctaacagccgcctcatggaaggagtcatggaagacgttaggcccctccagttggga
tgtcttatgataaaggtgtatgcatctcagaacaaagtgataataggacttggtggttcc
tcactccatgctagaaagaagactgtggatgagaaagctgaggcccaagaggaacaaagt
gactttatggttgtcacacataaaactgggcgtggccctagtgaatgcattctcctggac
atttcccgtggagccggcggccatcactccaactggtccccaagtggggagggccctata
ctggttcagaactatgagaatgcacctcaaattatcccaaagtactccggggagagagac
ttctcctcctggctcagtgtggatctaccaggaaacttcaggttaaatctacagtgggca
aatgtgttttccaaacctgttcttattaatctatgggtaggatccctgaatggagtgcag
ggagaaggaaaagagaaggaggcggcgcccttcacaggaggaccttttggttgctgctgg
aacatgagctccggttcaagtgtgaatgaaggacagggatggagggtcaatgaagagaag
ggggaatgtcaagctggctctgggcaaagtatgaatcccccaggagtgtcccatatcaat
gccatccttctgccctttgccacactccatgccattggggtctggatttgccaaccatat
gtgcccaagcttcttgaatcagggtacggagagctctatggtacccagcagcatcccagc
agctaccaaggccacaggggatcacacatcgaggagcgaagatctagaaagaaggcaagc
ctttttcctgctaatgtggatggcctctcacagcaaaactttgatgtgcaattaccccca
aagcctccttccaccaggagtgaagctgagaaggaatcaaaggagtggacagagatcttc
gtcctgtcctatacctactga

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_5|626_aa
MGGPISSPGQTCKAPDDQKFAMSTSAWPACGSPPRPKALSPQPRQCWEFQCRYRLHALIR
EAAWGAGLLHRFVAAPYPSSAPGLPPFSPRSQPAIFSKGFAHGPGGGVKWLISGFGPAGL
GAASPPQGSELGYQYIRPLYPSERPGAAGGSGGSAGARGGLGAGASELNASGSLSNVLSS
VYGAPYAAAAAAAAAQGYGAFLPYAAELPIFPQLGAQYELKDSPGVQHPAAAAAFPHPHP
AFYPYGQYQFGDPSRPKNATRESTSTLKAWLNEHRKNPYPTKGEKIMLAIITKMTLTQVS
TWFANARRRLKKENKMTWAPRSRTDEEGNAYGSEREEEDEEEDEEDGKRELELEEEELGG
EEEDTGGEGLADDDEDEEIDLENLDGAATEPELSLAGAARRDGDLGLGPISDSKNSDSED
SSEGLEDRPLPVLSLAPAPPPVAVASPSLPSPPVSLDPCAPAPAPASALQKPKIWSLAET
ATSPDNPRRSPPGAGGSPPGAAVAPSALQLSPAAAAAAAHRLVSAPLGKFPAWTNRPFPG
PPPGPRLHPLSLLGSAPPHLLGLPGAAGHPAAAAAFARPAEPEGGTDRCSALEVEKKLLK
TAFQPVPRRPQNHLDAALVLSALSSS

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_5|1881_bp
atgggcgggcccattagctccccggggcagacttgcaaagctcctgatgatcagaagttt
gccatgtccacctctgcctggccggcttgtgggtcgccgccgcggcctaaagctctgagc
ccccagccgcggcaatgctgggaatttcaatgccgctatcgactgcacgccctaatccgt
gaagccgcctggggcgcgggcctgcttcaccgatttgtggcggctccctacccttccagt
gcccccgggctgccacccttcagcccccggagccagcccgccatttttagtaagggattc
gcacacggacccgggggcggcgttaagtggctaatctctgggtttggtcctgctgggttg
ggggctgcctcgcccccgcaaggcagcgagctgggataccaatacatccgcccgctttac
ccgtccgagcgcccgggggccgctggcggcagcggcggcagcgcgggggcccggggcggc
ctgggtgccggagcctcggagctgaacgcctcggggtccctgtccaacgtgctctcgtcc
gtgtacggggcgccctacgccgcggccgctgcggccgccgccgcccaaggctacggcgcc
ttcctgccctacgccgcggagctgcccatcttcccgcagctgggcgcgcagtatgagctg
aaggacagccccggggtgcagcatccggccgcggctgccgcgtttccgcacccgcacccc
gccttctacccgtatggccagtaccagttcggggacccgtcccgtcccaagaacgccacc
agggagagcaccagcacgctgaaggcctggctcaacgagcaccgcaagaacccctacccc
accaagggcgagaagatcatgctggccatcatcaccaagatgaccctcacccaggtgtcc
acctggttcgccaacgcgcgccggcgcctcaagaaggagaataagatgacttgggcgcct
cgcagccgcactgacgaggagggaaacgcttatgggagcgagcgcgaggaggaagacgaa
gaggaggacgaggaggacggcaaacgcgagctagagctggaggaggaggagctcgggggg
gaggaggaggacacggggggcgagggcctggctgacgacgacgaggacgaggagatcgat
ttggagaacttagacggcgcggccaccgagcctgagctgtccctggctggggcggcgcgc
agggatggcgacctaggcctgggacccatttcggactccaaaaatagcgactcggaagat
agctctgagggcttagaggaccggccactaccggtcctgagtctggctccagcgccacca
ccagtggccgtggcctcgccgtctctgccgtcgccccccgtgagcctggacccctgcgct
cccgcaccagcccccgcctccgccctgcagaagcccaagatctggtccctcgcggagact
gccacaagcccggacaacccgcgccgctcgcctcccggcgcgggggggtctccaccgggg
gcagcggtcgcgccttccgccctgcagctctctccggccgccgccgccgccgccgctcac
agactggtctcagcgccgctgggcaagttcccggcttggaccaaccggccgtttccaggc
ccaccgcccggcccccgcctgcacccgctctccctgctgggctctgcccctccgcacctg
ctgggacttcccggagccgcgggccacccggctgccgccgccgccttcgctcggccagcg
gagcccgaaggcggaacagatcgctgtagtgccttggaagtggagaaaaagttactcaag
acagctttccagcccgtgcccaggcggccccagaaccatctggacgccgccctggtctta
tcggctctctcctcatcctag

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_6|86_aa
MAAVVLGLRKPPPSHATGVSTQSFQILGREGWRVKPAHTLLAMTSQRNKELRPSTCLEVE
EKWLQRWAVGLPVKDSGVSQAKIWGK

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_6|261_bp
atggctgctgtggtcctggggctgaggaagcctcctccatctcatgccactggcgtctcc
acacaaagcttccagatcctgggaagagaaggctggagagtcaagccggctcatacctta
ctggccatgactagtcagagaaacaaggaattacgaccctccacgtgtctggaagtagaa
gagaagtggcttcagaggtgggctgtgggcttaccagtgaaggactctggagtcagccag
gccaagatttggggcaagtga

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_7|276_aa
MKLFPSSSCHSSKGFGDMEEEEMEEEEEEEKAEEEEKEEEEEVEEEEEEEEEKEVEEEEK
VEVEEEEEVEQEEEKEVEEEEKVEVEEEEVEEEEEEVEEEEKEEEKEEDEEEEEKEEEKE
EDEEEEEKEGKKMEEEKHVAMSLVNAEPPSGAFTTVVYHIDLPGRFSPNIYSAIMLDQEH
PAGPMNGPWAKMMFKKGTDHLDDKWASGRCQNFQAAGTINGHMEAAASVMAERVAACSEV
HSFPSERRRAAWEEVLIVVSLHFTPQITNESHSGSV

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_7|831_bp
atgaagcttttccccagcagcagctgccacagcagtaagggctttggggacatggaggag
gaggagatggaggaggaggaggaggaagagaaggcagaggaggaggagaaggaggaggag
gaagaggtagaggaggaggaggaagaggaagaggagaaggaagtagaggaggaagagaag
gtggaggtggaggaggaggaagaggtggagcaggaggaggagaaggaggtagaggaggag
gagaaggtggaagtggaggaggaggaggtggaggaagaggaggaggaggtggaggaggag
gagaaggaggaggagaaggaggaggacgaggaggaggaggagaaggaggaggagaaggag
gaggacgaggaggaggaggagaaggagggaaagaagatggaggaggagaagcatgtagct
atgtcccttgtaaacgcagagcctccttctggggccttcaccactgtcgtctaccacatc
gaccttccaggaaggttttccccgaacatctattctgctatcatgctggaccaggaacac
ccggcaggccccatgaatggaccttgggccaaaatgatgtttaagaaggggacggaccat
ttagatgacaaatgggcttcagggcgctgtcagaacttccaggcggcggggactattaac
ggtcacatggaggccgccgcatctgtgatggcagagcgggtcgccgcctgctcggaagta
cacagcttcccgtcagagcgccgccgagctgcctgggaggaggtattaattgttgtcagc
ctccatttcacaccccagataacaaacgagagccactctggcagtgtttaa

>gi568815582r:54183689_54386050|GENSCAN_predicted_peptide_8|52_aa
MAEDEGALAVLWPLMTPYCTLIANQGTRNDQAASLSPYSDDNGQSGQLAVLT

>gi568815582r:54183689_54386050|GENSCAN_predicted_CDS_8|159_bp
atggcagaagatgaaggagctttggcagttttatggcctttaatgaccccctattgtacc
ctgattgccaatcagggcacacgtaatgaccaagcagcatcacttagcccttattccgat
gacaatggccagagcggccagctcgctgttttgacatga