GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:52:53

Sequence gi568815579f:29573840_29774676 : 200837 bp : 49.64% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7788   7914  127  2  1   40   76   104 0.061   4.65
 1.02 Intr +  32205  32486  282  1  0   91   63   149 0.590  10.09
 1.03 Intr +  34818  34870   53  1  2  107   82    44 0.995   4.33
 1.04 Intr +  36570  36793  224  2  2   76   76   436 0.998  38.13
 1.05 Intr +  38023  38100   78  1  0   83  113    11 0.761   1.77
 1.06 Intr +  38278  38339   62  1  2   91   91    24 0.769   1.38
 1.07 Intr +  40032  40133  102  1  0  107   95    46 0.925   7.35
 1.08 Intr +  41405  41491   87  0  0  107    5    82 0.251   1.74
 1.09 Intr +  42885  43028  144  1  0   53   30   106 0.008   1.55
 1.10 Intr +  43565  43610   46  0  1  111   80    20 0.011   0.97
 1.11 Intr +  48001  48106  106  1  1   39   61    67 0.274  -0.68
 1.12 Intr +  50408  50559  152  1  2   84   75   182 0.801  15.46
 1.13 Intr +  67844  68042  199  2  1   13   -2   241 0.096   6.95
 1.14 Intr +  83992  84094  103  0  1   73   64    42 0.016   0.05
 1.15 Intr +  97772  97820   49  0  1   76   74    45 0.059  -0.36
 1.16 Intr +  99985 100813  829  1  1  112  -11  1733 0.151 157.21
 1.17 Intr + 100837 101028  192  0  0   71   95    41 0.311   2.79
 1.18 Intr + 103361 103545  185  1  2  136   40    43 0.163   2.89
 1.19 Term + 108371 108596  226  2  1   56   55   133 0.295   3.05
 1.20 PlyA + 111299 111304    6                               1.05

 2.00 Prom + 113793 113832   40                              -7.46
 2.01 Init + 116072 116158   87  1  0   70  100   108 0.616  10.84
 2.02 Term + 123220 123903  684  0  0  -15   47   300 0.509   9.24
 2.03 PlyA + 124695 124700    6                               1.05

 3.11 PlyA - 125768 125763    6                               1.05
 3.10 Term - 129138 128873  266  1  2  102   47   505 0.996  43.47
 3.09 Intr - 134584 134415  170  0  2   58  111   205 0.210  19.39
 3.08 Intr - 141460 141286  175  2  1    3  103   108 0.194   2.80
 3.07 Intr - 149465 149372   94  2  1  101   98    67 0.593   8.54
 3.06 Intr - 155010 154894  117  0  0   49   47    88 0.399   1.46
 3.05 Intr - 160654 160571   84  1  0   49   29   119 0.007   2.12
 3.04 Intr - 168043 167986   58  0  1   44   98    39 0.005  -0.61
 3.03 Intr - 171038 170923  116  2  2   89   69    32 0.001   0.65
 3.02 Intr - 187542 187425  118  2  1   69   52   102 0.251   5.17
 3.01 Intr - 200566 200533   34  2  1  108  100    20 0.082   2.48

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  31719  31794   76  2  1   86   58    11 0.838  -2.84
S.002 Term -  57124  57002  123  1  0   48   47   136 0.862   3.88
S.003 Intr + 167228 167298   71  2  2   49   77    51 0.832  -1.07
S.004 Term + 167484 167599  116  1  2   95   53    76 0.905   3.53

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:29573840_29774676|GENSCAN_predicted_peptide_1|1081_aa
LFLGLLTSWHFWLAAPIDEGAPGKEMQVLAIGSGSDMHKSGKGLSKHSTCLYPNPTACPP
KDAGGKDPRKRAHSSPLLSNTAYDVLSSRIPFQAHAPPRWASAGDVTHSAISELRESATA
AASASSESAGSGPRMKSVIYHALSQKEANDSDVQPSGAQRAEAFVRAFLKRSTPRMSPQA
REDQLQRKAVVLEYFTRHKRKEKKKKAKGLSARQRRELRLFDIKPEQQRYSLFLPLHELW
KQYIRDLCSGLKPDTQPQMIQAKLLKADLHGAIISVTKSKCPSYVGITGILLQETKHIFK
IITKEDRLKVIPKLNCVFTVETDGFISYIYGSKFQLRSRCSEAAQVLSPKQHAVTAASDE
HTHGLGTWSPSTLPPRTRAPAQRLPCGYTVALMAHPGVISPSPQCNHICLFHIIFFHWNV
SCSDAGTLPMVFAAITIPSALKGGSPTRRVEMAPAVQAAEQGAVDKASVAKEAGAAAEVF
SGEKGAGRARRCQKRKKEKQKRRRRRRKEEGGEGGGKKKKRKKRKRRKQKKWKKKMEEEK
EEEKEKEGKERRVLQGVGRCWLEASVSPMGFSGGLHGPLHDSAASFPWSLQYTAKISKLK
IAVLTRQLETMVDHLANTEINSQRIAAVESCFGASGQPLALPGRVLLGEGVLTKECRKKA
KPRIFFLFNDILVYGSIVLNKRKYRSQHIIPLEEVTLELLPETLQAKNRWMIKTAKKSFV
VSAASATERQEWISHIEECVRRQLRATGRPPSTEHAAPWIPDKATDICMRCTQTRFSALT
RRHHCRKCGFVVCAECSRQRFLLPRLSPKPVRVCSLCYRELAAQQRQEEAEEQGAGSPGQ
PAHLARPICGASSGDDDDSDEDKEGSRDGDWPSSVEFYASGLTPGLQNICPQASSTAQAP
KRAAPEAAQGSGTPSHGGRCSGGEWLFLDSQCLFAGHCVLMASLQGWNRTAAAPSPKVLG
PVRAHGLGWPRTFAEPSHLRSCSRTSWAMPAAGDTAGSETNKARTACCPRQHLGYGDLGS
GILEQHTGTLRTVCKLLIPLKYQNAFGFQQPPDLLGAPTMKQRPLNLACPSHNHLLAQPN
P

>gi568815579f:29573840_29774676|GENSCAN_predicted_CDS_1|3246_bp
ctgttcttggggctgctgacttcctggcacttctggcttgctgctcccattgacgaagga
gccccaggcaaagagatgcaggttctggccattggaagtgggagtgacatgcacaaaagt
gggaagggccttagcaagcattccacatgcctttaccccaaccctactgcctgtccccca
aaagatgcaggcggaaaagaccccaggaagcgagcccactcctccccactgctcagcaac
accgcctacgacgttctcagtagccgaatccctttccaggcgcatgcgcccccgaggtgg
gcgagcgccggtgatgtcacgcatagcgccatctccgagctccgagagtctgcgacagca
gctgccagtgcgtcatcagagagcgccggaagcggtccgagaatgaagagtgtgatctac
catgcattgtctcagaaagaggcgaatgactccgatgtccagccttcaggagcacagcgg
gccgaggccttcgtgagggccttcctgaagcgcagcacgccccgcatgagcccgcaggcc
cgcgaggaccagctgcagcgcaaggcggtggtcctggagtacttcacccgccacaagcgc
aaggagaagaagaagaaagccaaaggcctctctgccaggcaaaggagggagctgcggctc
tttgacattaaaccagagcagcagagatacagccttttcctccctctccatgaactctgg
aaacagtacatcagggacctgtgcagtgggctcaagccagacacgcagccacagatgatt
caggccaagctcttaaaggcagatcttcacggggctattatttcagtgacaaaatccaaa
tgcccctcttatgtgggtattacaggaatccttctacaggaaacaaagcacattttcaaa
attatcaccaaagaagaccgcctgaaagttatccccaagctaaactgcgtgttcactgtg
gaaaccgatggctttatttcctacatttacgggagcaaattccagcttcggtcaaggtgt
tctgaggcagctcaagtgctctcccccaagcagcacgcagtcaccgctgccagtgatgaa
cacacgcacgggctaggcacatggagcccaagcacactgccacctcggacacgggccccg
gcccagcgactgccttgtggatacactgttgccctcatggcccacccaggggtcatatct
cccagtccccagtgcaaccatatctgcctgtttcatatcattttcttccactggaatgtg
agctgctcggacgctgggacactgcccatggtgttcgccgccataaccattcccagtgcc
ctgaagggtggcagccccactaggcgagtggagatggctccagccgtccaagctgctgaa
caaggggctgtggacaaggccagcgtggccaaggaggcaggggctgcagctgaggtcttc
tcaggggaaaagggagcaggcagagcaagacgctgtcagaaaaggaagaaagagaagcag
aagaggaggaggaggaggagaaaagaagaaggaggtgaaggaggaggaaagaagaaaaag
aggaagaagaggaagaggaggaagcagaagaagtggaagaagaagatggaggaggagaaa
gaggaggagaaggagaaggaggggaaagaaaggagagttttgcaaggggttggtcgctgt
tggctggaggcctcagtgtctcccatgggcttctccggagggctgcatggacctcttcat
gactcagcagccagcttcccctggagtttgcagtatacagccaagatcagcaagctaaaa
atagcagttcttacccgccagctggagacgatggtggaccacttggccaacacggagatc
aacagccagcgcatcgcggcagtggagagctgcttcggggcctcggggcagccgctggcg
ctgccaggccgagtgctgctgggcgagggcgtgctgaccaaagagtgccgcaagaaggcc
aagccgcgcatcttcttcctctttaacgacatcctggtgtatggcagcatcgtgctcaac
aagcgcaagtaccgcagccagcacatcatccccctggaggaggtcacactggagctgttg
ccggagacgctgcaggccaagaaccgctggatgatcaagacggccaagaagtcctttgtg
gtgtcggccgcctccgctacggagcgccaggaatggattagccacatcgaggagtgcgtg
cggcggcaactgagggccacgggccgcccgcccagcacggagcacgcggcaccctggatc
cccgacaaggccacggacatctgcatgcgctgcacgcagacgcgcttctctgccctcacg
aggcgccaccactgccgcaagtgcggcttcgtggtctgcgctgagtgctcgcgccagcgc
ttcctgctcccgcgcctgtcccccaagcccgtgcgcgtctgcagcctctgctaccgcgaa
ctggccgcccagcagcggcaggaggaggcggaggagcagggcgcggggtccccagggcag
ccagcccacctggcccggcccatctgcggagcgtccagtggagatgacgatgactccgac
gaggacaaggagggcagcagggacggcgactggcccagcagcgtggagttctacgcctcg
gggctgacccccggcctgcagaacatctgtccccaagccagctccactgcccaggccccc
aagagggcagctccagaagctgcccagggctccgggaccccatcccatggtggcaggtgc
agcggtggggagtggctctttctggactcccagtgcctttttgctggacactgtgtcctt
atggcttcactgcagggctggaacagaactgctgctgccccaagtcccaaggtgttaggg
cctgtaagggcccacggcttggggtggcccaggaccttcgcagagccttcacacctgcgg
tcttgctcccgcaccagctgggccatgccagctgctggggacaccgctgggagtgagaca
aacaaggcccgcacagcatgctgcccacggcagcacctgggctatggggacctgggctcc
gggattctggagcagcacactgggacacttcgtacagtttgcaaattactaataccctta
aaataccaaaatgcctttggatttcagcagccaccagacctcctgggggcccccaccatg
aagcagaggcctttgaacctcgcctgccccagccataaccacttgctggcacaacccaat
ccctga

>gi568815579f:29573840_29774676|GENSCAN_predicted_peptide_2|256_aa
MLLAIIPYFPGIIASNMAAPSELSDEELKAPDTRHPDTWTPNTQTPRHSDTQTPRNLTPR
HPDTQTPRNLTPRHPDTQTPDTQTPRNLTARHPDTQTPDTQTPGHSDTQTLRHPDIQTPD
TQKPDTPPPGHPDILPPEHPATQSLGHLDTQPPHHPDTLPLQYPDTPTPRHPDTPLPGHL
DNPPPGHLDTQTPRPSATQTPCHSDTQTPHHLDTWTPGHPDTRTLRHLTTQTPRNLTPRN
LDTRTPRHLTPRHTDT

>gi568815579f:29573840_29774676|GENSCAN_predicted_CDS_2|771_bp
atgttgctggcaattatcccgtattttcctgggataattgccagcaacatggcagccccc
tctgagcttagtgatgaagaattgaaggcacctgacaccagacacccagacacctggaca
cccaacactcagacacccagacactcagatacccagacacccagaaacctgacacccaga
cacccggatacccagacacccagaaacctgacacccagacacccagatacccagacaccc
gacacccagacacccagaaacctgacagccagacacccggatacccagacacccgacacc
cagacacccggacactcagacacccagacactcagacacccagacatccagacacctgat
acccagaaacctgacaccccaccacccggacacccagacatcctgccacctgaacaccct
gccactcagtcacttggacacctggatactcagccaccccaccatccagataccctgcca
ctccaatacccagacaccccaacccccagacacccagacactccactacctggacacctg
gacaacccaccacctggacacctggacacccaaacacccagaccctctgccacccagaca
ccctgccactcagacacccagacaccccaccacctggacacctggacacccggacaccca
gacacccggacactcagacacctcaccacccagacaccgagaaacctgacacccagaaac
ctggacacccggacacccagacacctgacaccaagacacacagacacttga

>gi568815579f:29573840_29774676|GENSCAN_predicted_peptide_3|410_aa
XGQPWTIEDLQRLSPRVMTVSRCRQPCQAPPQTALRDSPERQSQCRILQEEDMSLSLSDA
LDLVGCLPQAEHGLPERRETILCGDHSEQCSEYIFVWVEAHLHTLELWRQRSYGSKVPLE
AAGLQEHQDDLCQGYARAPGEPRRYIEPSLATVDPTVKVCGTNTLCYRDFGEVPEKGPEP
APPSAVPDSEPGSVHYQEHGDTVATREGPAQPEGGTEGGASAGAATKACATLRGAGELGG
EPGACQARAAAASDRRARRVDLRSPRPATMTIMVEDIMKLLCSLSGERKMKAAVKHSGKG
ALVTGAMAFVGGLVGGPPGLAVGGAVGGLLGAWMTSGQFKPVPQILMELPPAEQQRLFNE
AAAIIRHLEWTDAVQLTALVMGSEALQQQLLAMLVNYVTKELRAEIQYDD

>gi568815579f:29573840_29774676|GENSCAN_predicted_CDS_3|1233_bp
ngaggccagccttggacaattgaggacttgcaaagactaagtccccgtgtgatgacggtg
agcaggtgccggcagccatgccaagcacctccccagactgctctgcgggactctcctgag
aggcagtcccagtgccgcatcttacaggaggaggacatgtccctgtcactcagcgacgct
ttagatcttgttggctgccttccccaagcagaacatgggctccctgagaggagggagacc
attctctgtggtgaccattcagagcagtgtagtgagtacatctttgtctgggtagaagca
catcttcacactctggagctatggaggcaaagaagctatggctccaaggttcctctggaa
gcagcaggcctacaagaacatcaggatgacctgtgccaaggctatgccagggcacctggt
gagccacgccgctacattgaaccctcactggccactgtagacccaacagtcaaagtctgt
ggcaccaacaccttgtgttatagagactttggggaagtccccgagaagggcccagaaccc
gcccctccatcagcagttccagactctgagcctgggagcgtgcattaccaggaacatgga
gacacggtggctacacgtgaaggcccggcgcagccggaaggtgggacggagggcggggcc
agcgccggggccgccaccaaggcctgcgcgaccctccgcggggctggggagctgggcggg
gagcccggggcctgccaggcccgggctgcagccgcgtctgatcgccgagcgcgccgcgta
gacctccgctcccccaggcccgccacgatgactatcatggtggaggacatcatgaagctg
ctgtgctccctttctggggagaggaagatgaaggcggctgtcaagcactctgggaagggt
gccctggtcacaggggccatggccttcgtcgggggtttggtgggcggcccaccgggactc
gccgttgggggggctgtcggggggctgttaggtgcctggatgacaagtggacagtttaag
ccggttcctcagatcctaatggagctgccccctgccgagcaacagaggctctttaacgaa
gccgcagccatcatcaggcacctggagtggacggacgccgtgcagctgaccgcgctggtc
atgggcagcgaggccctgcagcagcagctgctggccatgctggtgaactacgtcaccaag
gagctgcgggccgagatccagtatgatgactag