GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:32:39 Sequence gi568815579r:29602846_29808413 : 205568 bp : 50.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3199 3480 282 2 0 91 63 149 0.612 10.09 1.02 Intr + 5812 5864 53 2 2 107 82 44 0.996 4.33 1.03 Intr + 7564 7787 224 0 2 76 76 436 0.998 38.13 1.04 Intr + 9017 9094 78 2 0 83 113 11 0.761 1.77 1.05 Intr + 9272 9333 62 2 2 91 91 24 0.769 1.38 1.06 Intr + 11026 11127 102 2 0 107 95 46 0.925 7.35 1.07 Intr + 12399 12485 87 1 0 107 5 82 0.251 1.74 1.08 Intr + 13879 14022 144 2 0 53 30 106 0.008 1.55 1.09 Intr + 14559 14604 46 1 1 111 80 20 0.011 0.97 1.10 Intr + 18995 19100 106 2 1 39 61 67 0.274 -0.68 1.11 Intr + 21402 21553 152 2 2 84 75 182 0.801 15.46 1.12 Intr + 38838 39036 199 0 1 13 -2 241 0.096 6.95 1.13 Intr + 54986 55088 103 1 1 73 64 42 0.016 0.05 1.14 Intr + 68766 68814 49 1 1 76 74 45 0.059 -0.36 1.15 Intr + 70979 71807 829 2 1 112 -11 1733 0.151 157.21 1.16 Intr + 71831 72022 192 1 0 71 95 41 0.311 2.79 1.17 Intr + 74355 74539 185 2 2 136 40 43 0.163 2.89 1.18 Term + 79365 79590 226 0 1 56 55 133 0.295 3.05 1.19 PlyA + 82293 82298 6 1.05 2.00 Prom + 84787 84826 40 -7.46 2.01 Init + 87066 87152 87 2 0 70 100 108 0.616 10.84 2.02 Term + 94214 94897 684 1 0 -15 47 300 0.509 9.24 2.03 PlyA + 95689 95694 6 1.05 3.14 PlyA - 96762 96757 6 1.05 3.13 Term - 100132 99867 266 2 2 102 47 505 0.996 43.47 3.12 Intr - 105578 105409 170 1 2 58 111 205 0.210 19.39 3.11 Intr - 112454 112280 175 0 1 3 103 108 0.194 2.80 3.10 Intr - 120459 120366 94 0 1 101 98 67 0.593 8.54 3.09 Intr - 126004 125888 117 1 0 49 47 88 0.399 1.46 3.08 Intr - 131648 131565 84 2 0 49 29 119 0.007 2.12 3.07 Intr - 139037 138980 58 1 1 44 98 39 0.005 -0.61 3.06 Intr - 142032 141917 116 0 2 89 69 32 0.001 0.65 3.05 Intr - 158536 158419 118 0 1 69 52 102 0.246 5.17 3.04 Intr - 174933 174804 130 1 1 123 30 69 0.004 4.35 3.03 Intr - 183567 183447 121 0 1 93 82 -20 0.001 -2.03 3.02 Intr - 189455 189351 105 2 0 48 80 96 0.089 5.21 3.01 Init - 204582 204367 216 0 0 91 81 89 0.650 7.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 28118 27996 123 2 0 48 47 136 0.862 3.88 S.002 Intr + 138222 138292 71 0 2 49 77 51 0.832 -1.07 S.003 Term + 138478 138593 116 2 2 95 53 76 0.905 3.53 S.004 Term + 179524 179615 92 2 2 97 45 70 0.914 1.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:29602846_29808413|GENSCAN_predicted_peptide_1|1039_aa XLSKHSTCLYPNPTACPPKDAGGKDPRKRAHSSPLLSNTAYDVLSSRIPFQAHAPPRWAS AGDVTHSAISELRESATAAASASSESAGSGPRMKSVIYHALSQKEANDSDVQPSGAQRAE AFVRAFLKRSTPRMSPQAREDQLQRKAVVLEYFTRHKRKEKKKKAKGLSARQRRELRLFD IKPEQQRYSLFLPLHELWKQYIRDLCSGLKPDTQPQMIQAKLLKADLHGAIISVTKSKCP SYVGITGILLQETKHIFKIITKEDRLKVIPKLNCVFTVETDGFISYIYGSKFQLRSRCSE AAQVLSPKQHAVTAASDEHTHGLGTWSPSTLPPRTRAPAQRLPCGYTVALMAHPGVISPS PQCNHICLFHIIFFHWNVSCSDAGTLPMVFAAITIPSALKGGSPTRRVEMAPAVQAAEQG AVDKASVAKEAGAAAEVFSGEKGAGRARRCQKRKKEKQKRRRRRRKEEGGEGGGKKKKRK KRKRRKQKKWKKKMEEEKEEEKEKEGKERRVLQGVGRCWLEASVSPMGFSGGLHGPLHDS AASFPWSLQYTAKISKLKIAVLTRQLETMVDHLANTEINSQRIAAVESCFGASGQPLALP GRVLLGEGVLTKECRKKAKPRIFFLFNDILVYGSIVLNKRKYRSQHIIPLEEVTLELLPE TLQAKNRWMIKTAKKSFVVSAASATERQEWISHIEECVRRQLRATGRPPSTEHAAPWIPD KATDICMRCTQTRFSALTRRHHCRKCGFVVCAECSRQRFLLPRLSPKPVRVCSLCYRELA AQQRQEEAEEQGAGSPGQPAHLARPICGASSGDDDDSDEDKEGSRDGDWPSSVEFYASGL TPGLQNICPQASSTAQAPKRAAPEAAQGSGTPSHGGRCSGGEWLFLDSQCLFAGHCVLMA SLQGWNRTAAAPSPKVLGPVRAHGLGWPRTFAEPSHLRSCSRTSWAMPAAGDTAGSETNK ARTACCPRQHLGYGDLGSGILEQHTGTLRTVCKLLIPLKYQNAFGFQQPPDLLGAPTMKQ RPLNLACPSHNHLLAQPNP >gi568815579r:29602846_29808413|GENSCAN_predicted_CDS_1|3120_bp ngccttagcaagcattccacatgcctttaccccaaccctactgcctgtcccccaaaagat gcaggcggaaaagaccccaggaagcgagcccactcctccccactgctcagcaacaccgcc tacgacgttctcagtagccgaatccctttccaggcgcatgcgcccccgaggtgggcgagc gccggtgatgtcacgcatagcgccatctccgagctccgagagtctgcgacagcagctgcc agtgcgtcatcagagagcgccggaagcggtccgagaatgaagagtgtgatctaccatgca ttgtctcagaaagaggcgaatgactccgatgtccagccttcaggagcacagcgggccgag gccttcgtgagggccttcctgaagcgcagcacgccccgcatgagcccgcaggcccgcgag gaccagctgcagcgcaaggcggtggtcctggagtacttcacccgccacaagcgcaaggag aagaagaagaaagccaaaggcctctctgccaggcaaaggagggagctgcggctctttgac attaaaccagagcagcagagatacagccttttcctccctctccatgaactctggaaacag tacatcagggacctgtgcagtgggctcaagccagacacgcagccacagatgattcaggcc aagctcttaaaggcagatcttcacggggctattatttcagtgacaaaatccaaatgcccc tcttatgtgggtattacaggaatccttctacaggaaacaaagcacattttcaaaattatc accaaagaagaccgcctgaaagttatccccaagctaaactgcgtgttcactgtggaaacc gatggctttatttcctacatttacgggagcaaattccagcttcggtcaaggtgttctgag gcagctcaagtgctctcccccaagcagcacgcagtcaccgctgccagtgatgaacacacg cacgggctaggcacatggagcccaagcacactgccacctcggacacgggccccggcccag cgactgccttgtggatacactgttgccctcatggcccacccaggggtcatatctcccagt ccccagtgcaaccatatctgcctgtttcatatcattttcttccactggaatgtgagctgc tcggacgctgggacactgcccatggtgttcgccgccataaccattcccagtgccctgaag ggtggcagccccactaggcgagtggagatggctccagccgtccaagctgctgaacaaggg gctgtggacaaggccagcgtggccaaggaggcaggggctgcagctgaggtcttctcaggg gaaaagggagcaggcagagcaagacgctgtcagaaaaggaagaaagagaagcagaagagg aggaggaggaggagaaaagaagaaggaggtgaaggaggaggaaagaagaaaaagaggaag aagaggaagaggaggaagcagaagaagtggaagaagaagatggaggaggagaaagaggag gagaaggagaaggaggggaaagaaaggagagttttgcaaggggttggtcgctgttggctg gaggcctcagtgtctcccatgggcttctccggagggctgcatggacctcttcatgactca gcagccagcttcccctggagtttgcagtatacagccaagatcagcaagctaaaaatagca gttcttacccgccagctggagacgatggtggaccacttggccaacacggagatcaacagc cagcgcatcgcggcagtggagagctgcttcggggcctcggggcagccgctggcgctgcca ggccgagtgctgctgggcgagggcgtgctgaccaaagagtgccgcaagaaggccaagccg cgcatcttcttcctctttaacgacatcctggtgtatggcagcatcgtgctcaacaagcgc aagtaccgcagccagcacatcatccccctggaggaggtcacactggagctgttgccggag acgctgcaggccaagaaccgctggatgatcaagacggccaagaagtcctttgtggtgtcg gccgcctccgctacggagcgccaggaatggattagccacatcgaggagtgcgtgcggcgg caactgagggccacgggccgcccgcccagcacggagcacgcggcaccctggatccccgac aaggccacggacatctgcatgcgctgcacgcagacgcgcttctctgccctcacgaggcgc caccactgccgcaagtgcggcttcgtggtctgcgctgagtgctcgcgccagcgcttcctg ctcccgcgcctgtcccccaagcccgtgcgcgtctgcagcctctgctaccgcgaactggcc gcccagcagcggcaggaggaggcggaggagcagggcgcggggtccccagggcagccagcc cacctggcccggcccatctgcggagcgtccagtggagatgacgatgactccgacgaggac aaggagggcagcagggacggcgactggcccagcagcgtggagttctacgcctcggggctg acccccggcctgcagaacatctgtccccaagccagctccactgcccaggcccccaagagg gcagctccagaagctgcccagggctccgggaccccatcccatggtggcaggtgcagcggt ggggagtggctctttctggactcccagtgcctttttgctggacactgtgtccttatggct tcactgcagggctggaacagaactgctgctgccccaagtcccaaggtgttagggcctgta agggcccacggcttggggtggcccaggaccttcgcagagccttcacacctgcggtcttgc tcccgcaccagctgggccatgccagctgctggggacaccgctgggagtgagacaaacaag gcccgcacagcatgctgcccacggcagcacctgggctatggggacctgggctccgggatt ctggagcagcacactgggacacttcgtacagtttgcaaattactaatacccttaaaatac caaaatgcctttggatttcagcagccaccagacctcctgggggcccccaccatgaagcag aggcctttgaacctcgcctgccccagccataaccacttgctggcacaacccaatccctga >gi568815579r:29602846_29808413|GENSCAN_predicted_peptide_2|256_aa MLLAIIPYFPGIIASNMAAPSELSDEELKAPDTRHPDTWTPNTQTPRHSDTQTPRNLTPR HPDTQTPRNLTPRHPDTQTPDTQTPRNLTARHPDTQTPDTQTPGHSDTQTLRHPDIQTPD TQKPDTPPPGHPDILPPEHPATQSLGHLDTQPPHHPDTLPLQYPDTPTPRHPDTPLPGHL DNPPPGHLDTQTPRPSATQTPCHSDTQTPHHLDTWTPGHPDTRTLRHLTTQTPRNLTPRN LDTRTPRHLTPRHTDT >gi568815579r:29602846_29808413|GENSCAN_predicted_CDS_2|771_bp atgttgctggcaattatcccgtattttcctgggataattgccagcaacatggcagccccc tctgagcttagtgatgaagaattgaaggcacctgacaccagacacccagacacctggaca cccaacactcagacacccagacactcagatacccagacacccagaaacctgacacccaga cacccggatacccagacacccagaaacctgacacccagacacccagatacccagacaccc gacacccagacacccagaaacctgacagccagacacccggatacccagacacccgacacc cagacacccggacactcagacacccagacactcagacacccagacatccagacacctgat acccagaaacctgacaccccaccacccggacacccagacatcctgccacctgaacaccct gccactcagtcacttggacacctggatactcagccaccccaccatccagataccctgcca ctccaatacccagacaccccaacccccagacacccagacactccactacctggacacctg gacaacccaccacctggacacctggacacccaaacacccagaccctctgccacccagaca ccctgccactcagacacccagacaccccaccacctggacacctggacacccggacaccca gacacccggacactcagacacctcaccacccagacaccgagaaacctgacacccagaaac ctggacacccggacacccagacacctgacaccaagacacacagacacttga >gi568815579r:29602846_29808413|GENSCAN_predicted_peptide_3|589_aa MNRTRLSTTECLRPSAREREHGKALEAWLLSSLGGTICLHRPPSQPGKGPTHAPQGLVPA QNQRTLLCKATKRGATLQDVTVARQLSAVEAGPEGLQLEAMAVGFLQEGLQHFPKRPRAP SAGCAQLFTTLAGLTLGSLLPGPVCPAGGSGWSRALGHMDIRDALAVGARGAESRAFQCQ LQAMALPHASVLSPRVMTVSRCRQPCQAPPQTALRDSPERQSQCRILQEEDMSLSLSDAL DLVGCLPQAEHGLPERRETILCGDHSEQCSEYIFVWVEAHLHTLELWRQRSYGSKVPLEA AGLQEHQDDLCQGYARAPGEPRRYIEPSLATVDPTVKVCGTNTLCYRDFGEVPEKGPEPA PPSAVPDSEPGSVHYQEHGDTVATREGPAQPEGGTEGGASAGAATKACATLRGAGELGGE PGACQARAAAASDRRARRVDLRSPRPATMTIMVEDIMKLLCSLSGERKMKAAVKHSGKGA LVTGAMAFVGGLVGGPPGLAVGGAVGGLLGAWMTSGQFKPVPQILMELPPAEQQRLFNEA AAIIRHLEWTDAVQLTALVMGSEALQQQLLAMLVNYVTKELRAEIQYDD >gi568815579r:29602846_29808413|GENSCAN_predicted_CDS_3|1770_bp atgaacaggaccaggctgtcaaccacagagtgtctgcggcccagcgccagggaacgtgag catgggaaagccctggaggcttggctcctgtcctccctgggcgggacaatctgcctgcac aggccccccagccagccaggcaagggcccaacccatgctccccaaggcttggttcctgct cagaaccagcgcacactgctgtgtaaagccaccaagcgtggggccaccttgcaggatgtg actgtggcgaggcagctgtctgcagtggaggcaggcccagaagggctacagctggaggct atggcagtgggcttcctgcaggaaggcctccagcattttcccaagcggccaagagcaccc tctgctggatgtgcacagctgttcaccaccttagcagggctgactctggggagcctactc cctggccctgtttgccctgcaggagggagtggctggagccgtgctctgggtcacatggac attcgtgatgccctcgctgtaggtgcccgaggtgctgagtccagggccttccagtgccag cttcaagcgatggccctgccacatgcctctgtactaagtccccgtgtgatgacggtgagc aggtgccggcagccatgccaagcacctccccagactgctctgcgggactctcctgagagg cagtcccagtgccgcatcttacaggaggaggacatgtccctgtcactcagcgacgcttta gatcttgttggctgccttccccaagcagaacatgggctccctgagaggagggagaccatt ctctgtggtgaccattcagagcagtgtagtgagtacatctttgtctgggtagaagcacat cttcacactctggagctatggaggcaaagaagctatggctccaaggttcctctggaagca gcaggcctacaagaacatcaggatgacctgtgccaaggctatgccagggcacctggtgag ccacgccgctacattgaaccctcactggccactgtagacccaacagtcaaagtctgtggc accaacaccttgtgttatagagactttggggaagtccccgagaagggcccagaacccgcc cctccatcagcagttccagactctgagcctgggagcgtgcattaccaggaacatggagac acggtggctacacgtgaaggcccggcgcagccggaaggtgggacggagggcggggccagc gccggggccgccaccaaggcctgcgcgaccctccgcggggctggggagctgggcggggag cccggggcctgccaggcccgggctgcagccgcgtctgatcgccgagcgcgccgcgtagac ctccgctcccccaggcccgccacgatgactatcatggtggaggacatcatgaagctgctg tgctccctttctggggagaggaagatgaaggcggctgtcaagcactctgggaagggtgcc ctggtcacaggggccatggccttcgtcgggggtttggtgggcggcccaccgggactcgcc gttgggggggctgtcggggggctgttaggtgcctggatgacaagtggacagtttaagccg gttcctcagatcctaatggagctgccccctgccgagcaacagaggctctttaacgaagcc gcagccatcatcaggcacctggagtggacggacgccgtgcagctgaccgcgctggtcatg ggcagcgaggccctgcagcagcagctgctggccatgctggtgaactacgtcaccaaggag ctgcgggccgagatccagtatgatgactag