GENSCAN 1.0 Date run: 3-Nov-116 Time: 05:56:24

Sequence gi568815597r:31808855_32019065 : 210211 bp : 45.17% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   4928   4923    6                               1.05
 1.04 Term -   6493   5088 1406  2  2   54   47   417 0.205  25.47
 1.03 Intr -   7788   7690   99  1  0   58   81    47 0.003   1.08
 1.02 Intr -  26339  26276   64  2  1  109   85    52 0.676   5.29
 1.01 Init -  27822  27766   57  0  0   59  105    60 0.621   6.11
 1.00 Prom -  43821  43782   40                              -2.46

 2.04 PlyA -  47112  47107    6                               1.05
 2.03 Term -  69130  69008  123  1  0   73   54    83 0.428   1.78
 2.02 Intr - 107133 107041   93  0  0   56   75    92 0.634   4.86
 2.01 Init - 110211 110116   96  0  0   82   94   108 0.988  11.11
 2.00 Prom - 114852 114813   40                              -7.36

 3.00 Prom + 121964 122003   40                              -3.56
 3.01 Init + 123839 123894   56  1  2   89   33    21 0.512  -2.45
 3.02 Intr + 129202 129839  638  1  2   23   48   455 0.042  26.74
 3.03 Intr + 165199 165303  105  2  0   54   92    53 0.017   2.49
 3.04 Term + 169103 169161   59  0  2  115   41    56 0.647   1.45
 3.05 PlyA + 169939 169944    6                               1.05

 4.03 PlyA - 170541 170536    6                               1.05
 4.02 Term - 173895 173833   63  0  0  113   47    71 0.940   3.39
 4.01 Init - 180975 180853  123  0  0   36  100   116 0.875   7.77
 4.00 Prom - 183807 183768   40                              -2.76

 5.00 Prom + 188326 188365   40                              -7.26
 5.01 Init + 189743 189753   11  1  2   88   63     4 0.233  -2.03
 5.02 Intr + 192120 192249  130  0  1   79   86    41 0.205   3.70
 5.03 Intr + 198475 198553   79  0  1   85   87    33 0.447   2.02
 5.04 Intr + 204753 205523  771  1  0   52  110   605 0.659  50.55
 5.05 Intr + 205881 205980  100  1  1   64   78    66 0.301   2.37

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  40137  40277  141  2  0   97   55    69 0.824   2.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:31808855_32019065|GENSCAN_predicted_peptide_1|541_aa
MPTEVIGMEVYIKETLQEEPTQHEDDKVVKTFMMIHDLFPRTWMELEAIILSKLMQERKN
QILHVLTYKWELNGRWTDMSQAGDVEGPSTGDPVLSPQHNCELLQNMEGASSMPGLSPDG
PGASSGPGVRAGSRRKIPRKEALRGGSSRAAGAAEVRPGVLELLAVVQSRGSMLAPGLHM
QLPSVPTQGRALTSKRLQVSLCDILDDSCPRKLCSRSAGLPERALACRERLAGVEEVSCL
RPREARDGGMSSPGCDRRSPTLSKEEPPGRPLTSSPDPVPVRVRKKWRRQGAHSECEEGA
GDFLWLDQSPRGDNLLSVGDPPQVADLESLGGPCRPPSPKDTGSGPGEPGGSGAGCASGT
EKFGYLPATGDGPQPGSPCGPVGFPVPSGGESLSSAAQAPPQSAALCLGASAQASAEQQE
AVCVVRTGSDEGQAPAQDQEELEAKAQPASRGRLEQGLAAPADTCASSREPLGGLSSSLD
TEASRACSGPFMEQRRSKGTKNLKKGPVPCAQDRGTDRSSDNSHQDRPEEPSPGGCPRLV
S

>gi568815597r:31808855_32019065|GENSCAN_predicted_CDS_1|1626_bp
atgccaacagaggtgataggcatggaggtgtacatcaaggaaaccctccaggaggagcct
actcagcatgaagacgacaaggttgtgaagacctttatgatgatccatgatctgtttcca
cgaacgtggatggagctggaggccattatccttagcaaactaatgcaggaacggaaaaac
caaatattgcatgttctcacttataagtgggagctaaatggtaggtggacagacatgtcc
caggcgggggacgtagaaggccccagcacaggagaccctgtgctcagtccccaacacaac
tgtgagcttttacagaacatggaaggagccagctccatgccaggcctgtcaccagatggg
ccgggagcaagctctgggcccggagtcagggctggcagcagaaggaagatccccaggaag
gaggcccttcgaggtggcagctcccgggctgcaggtgctgctgaggtccggccaggggtc
ttggagctgctagctgtggtacagagccggggctcgatgctggctcctgggctccacatg
cagctgccctcggtgcctactcaggggagagctctgacctccaagaggctccaggtttct
ctgtgtgacatcttagatgacagttgccccaggaaactttgtagcaggtctgctggcctc
ccagagagagctctggcctgcagggagaggcttgcaggagtggaggaggtgagctgcctc
aggcccagggaggccagagacggtggaatgagttctccagggtgtgacagaagaagcccc
acactcagcaaagaggagccccctggaaggcccctgacatcctcaccagacccagtccct
gtgagggtaagaaagaaatggaggaggcaaggggctcattcagagtgtgaggaaggggct
ggtgacttcctgtggcttgatcagagccctcgtggggacaacctcctgtctgtgggagac
cctccccaagttgctgacctggagtccttgggaggcccttgcagacctccctctccaaaa
gacactgggtctgggcctggagagccaggtggaagtggggcaggatgtgcctcagggact
gagaaatttggatatttgcccgctacaggggatgggccccagccaggcagcccctgtggc
cctgtcgggttcccagtgcccagtggaggggagtccctcagttcagctgcacaggctcct
ccacagagcgcagcactgtgcctgggggcgtcagcacaggcctctgcagagcagcaagaa
gctgtgtgtgtcgtgcggactggcagcgatgaaggccaggctccagcacaggaccaggag
gagctggaggccaaggctcagccagcttccaggggaaggctggagcaaggactcgctgcc
cccgctgacacctgtgccagctcccgggagcccttgggcggcctcagctcctccctggat
actgaagccagcagggcctgctcaggcccattcatggagcagagaagatccaagggcact
aagaacctgaagaaaggtccagtgccctgtgcccaagaccggggcacagacagaagctca
gacaactcccaccaggacaggccagaggaacccagcccaggaggctgccccagactggtg
agctga

>gi568815597r:31808855_32019065|GENSCAN_predicted_peptide_2|103_aa
MNRPAPVEISYENMRFLITHNPTNATLNKFTEELKKYGVTTLVRVCDATYDKAPVEKEGI
HVLVLGSGTGDMSTKVCFPQKQTLRHGLQCTVIQEVQETPVED

>gi568815597r:31808855_32019065|GENSCAN_predicted_CDS_2|312_bp
atgaaccgtccagcccctgtggagatctcctatgagaacatgcgttttctgataactcac
aaccctaccaatgctactctcaacaagttcacagaggaacttaagaagtatggagtgacg
actttggttcgagtttgtgatgctacatatgataaagctccagttgaaaaagaaggaatc
cacgttctagtcctgggaagtggcacaggagacatgtcaacaaaagtctgcttcccccag
aagcagaccctgagacatggacttcagtgcacagtaattcaggaagtacaggaaacacca
gtagaggactga

>gi568815597r:31808855_32019065|GENSCAN_predicted_peptide_3|285_aa
MGNCGKQTASIAVQLLSWSHLGGGGGGGSGGGGGDDSPPASARDTRPGPAAAAAARLHES
VLLLLLRPPLRLLLPPLPPLPPLPCGVTSRRARPPLPLQPLPDRRRLQPGRSRRHHHRCA
RSRNQLPPPPLRGRCGARRAGPPARPPRSLTGWLAGSRAVGPAHLPRGALTHCPGRLPRS
SSSWFVPLRTPGYGRRVHVIGEGRQNVAWGVARPRCAPPIRTWAGSRACSREQNMRLVTG
NPKMKDSVPTLTPNAENLVKAFNKLSDPVSPVAEPKKEPEGKAAC

>gi568815597r:31808855_32019065|GENSCAN_predicted_CDS_3|858_bp
atggggaactgtggaaagcagacagctagtattgctgtgcagcttctttcatggagtcac
ctcggcggcggcggcggcggcggtagcggtggcggcggcggcgacgactcccccccagcc
tcggcgcgcgacacccggcccggcccggcggcggcggcggcggcgcgtctccacgagtcc
gtcttgctgctgctgctgcggccgccgctgcgtctcctgctgccgccgctgcctccgctg
ccgccgctgccctgtggcgtcacctcgcgccgtgcccggccgccgctgccgctccagccg
ctcccggacagacgacggttacagcccggccgatcgcgccgccaccaccaccgctgcgcg
cgcagccggaaccagctcccgccgccgcccctcagggggcgctgcggggcgcggcgcgca
gggccaccggcccgccctccgcgttcgctgactggctggctggccggctcgcgggcagtt
gggcccgcccacctgccccgcggggcgctgacccactgccctgggcgcctcccccgctcc
tcctcctcgtggttcgttccgctccgcactcccggctacggaaggcgagtccatgtgatc
ggggagggaaggcagaacgtggcctggggagtcgctcgcccacgctgcgccccgcccatc
cgcacctgggctgggtcgagggcctgcagccgagaacaaaacatgcgtttagttactgga
aacccgaaaatgaaagattctgtacctaccctcactcccaatgcagaaaacttagtcaaa
gctttcaacaaactctcagacccagtgtccccagtggctgagcccaagaaggagccagaa
ggcaaagcagcctgctga

>gi568815597r:31808855_32019065|GENSCAN_predicted_peptide_4|61_aa
MKKVKDAGKRMMIRYPQQLIFPEHLLHAWALVITGLYPVDKATPGHPDWPIIFILTDSVN
V

>gi568815597r:31808855_32019065|GENSCAN_predicted_CDS_4|186_bp
atgaaaaaggtgaaggatgccggaaagaggatgatgattcggtacccacagcagctaata
tttcctgagcatcttctgcatgcttgggccctggtaataacaggactttatcctgttgac
aaggccactccagggcacccagactggcccatcatcttcatcctcacggactctgtgaat
gtttag

>gi568815597r:31808855_32019065|GENSCAN_predicted_peptide_5|364_aa
MLIRTYFYPVYLVFICLPYVESSTKAGTLSILFTAYPTSSTMAGIYSVPTVGVPDDPVGR
TQTIMRVAKGKPQKRGGDTCSASAESRGKRNERSPLPPAHAHQALAWRERRESQSREATS
RTPRRCAGAVAGLGTLFSAEKPKGGAGGWLRTRAASFPVLSLAGSLGSASVATAPALPPP
PTAARASVAAASLSRSLDRTSSQMQRRDDPAARMSRSSGRSGSMDPSGAHPSVRQTPSRQ
PPLPHRSRGGGGGSRGGARASPATQPPPLLPPSATGPDATVGGPAPTPLLPPSATASVKM
EPENKYLPELMAEKDSLDPSFTHAMQLLTAGKPFLIAGGDERLDRGVGGEDSQSGEGPWE
EGPS

>gi568815597r:31808855_32019065|GENSCAN_predicted_CDS_5|1092_bp
atgctgataagaacttatttctaccctgtttacttggtgtttatctgtctcccctatgta
gaaagcagcacaaaagcaggaaccctgtctatcctgtttaccgcttaccccacatccagt
acaatggctggcatctattcggtgcctactgtgggagtgcctgatgaccctgtgggcagg
actcagactattatgagggtggcaaagggcaagcctcaaaaaaggggaggagacacgtgt
tccgctagcgccgagtcacgtggaaaacgaaacgaacggagcccactgcctcccgcgcat
gcgcatcaagctctggcctggcgggagcggagggaaagccagagtcgggaggcgacttct
cggacgccgcggcggtgcgcaggcgccgtggccggactggggactttgttctccgcggag
aaacccaagggcggtgccggtggctggctgcgcacgcgcgccgcctcatttccggtgctc
tctctcgctgggtcgctcgggtcggcttcggtcgctaccgctcccgctctgccacccccg
ccaaccgccgctcgggcctccgtcgctgccgcgtcgctttctcgctccttggatcgcaca
tcctcccagatgcagcgccgggacgaccccgccgcgcgcatgagccggtcttcgggccgt
agcggctccatggacccctccggtgcccacccctcggtgcgtcagacgccgtctcggcag
ccgccgctgcctcaccggtcccggggaggcggagggggatcccgcgggggcgcccgggcc
tcgcccgccacgcagccgccaccgctgctgccgccctcggccacgggtcccgacgcgaca
gtgggcgggccagcgccgaccccgctgctgcccccctcggccacagcctcggtcaagatg
gagccagagaacaagtacctgcccgaactcatggccgagaaggactcgctcgacccgtcc
ttcactcacgccatgcagctgctgacggcagggaaaccctttttgatcgcgggtggcgat
gagcgcctcgaccgaggtgtaggaggtgaagacagccagagcggggagggaccgtgggag
gaagggccgagn