GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:14:42 Sequence gi568815575f:8365451_8566380 : 200930 bp : 38.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 9245 9164 82 1 1 37 105 88 0.054 4.12 1.01 Init - 19938 19910 29 0 2 79 80 62 0.058 2.43 1.00 Prom - 28680 28641 40 -1.85 2.00 Prom + 35900 35939 40 -3.65 2.01 Sngl + 48740 48922 183 1 0 99 42 132 0.708 4.49 2.02 PlyA + 49994 49999 6 1.05 3.03 PlyA - 50516 50511 6 1.05 3.02 Term - 51581 51427 155 0 2 31 46 167 0.323 3.90 3.01 Init - 61130 61082 49 2 1 97 87 20 0.425 3.96 3.00 Prom - 67223 67184 40 -4.55 4.00 Prom + 85371 85410 40 -5.85 4.01 Init + 86290 86374 85 0 1 100 101 117 0.789 15.23 4.02 Intr + 86955 87207 253 1 1 63 89 127 0.135 5.87 4.03 Term + 90871 90883 13 1 1 130 40 4 0.041 -3.50 4.04 PlyA + 91262 91267 6 1.05 5.02 PlyA - 91343 91338 6 1.05 5.01 Sngl - 93432 93022 411 0 0 50 33 214 0.743 8.14 5.00 Prom - 99149 99110 40 -5.65 6.00 Prom + 99190 99229 40 -10.15 6.01 Sngl + 100001 100933 933 1 0 66 42 882 0.924 77.60 6.02 PlyA + 101035 101040 6 1.05 7.06 PlyA - 102322 102317 6 1.05 7.05 Term - 120279 120092 188 1 2 35 42 147 0.118 1.47 7.04 Intr - 128280 128175 106 0 1 53 91 72 0.131 2.87 7.03 Intr - 132902 132755 148 1 1 61 60 77 0.084 1.52 7.02 Intr - 135638 135482 157 0 1 66 85 106 0.313 6.25 7.01 Init - 138931 138832 100 1 1 52 121 26 0.203 2.78 7.00 Prom - 143538 143499 40 -3.25 8.11 PlyA - 144188 144183 6 1.05 8.10 Term - 154762 154548 215 2 2 119 35 181 0.925 12.31 8.09 Intr - 157234 157158 77 0 2 68 64 27 0.193 -3.46 8.08 Intr - 161596 161496 101 1 2 53 98 37 0.138 -0.81 8.07 Intr - 167512 167375 138 1 0 17 52 155 0.235 4.54 8.06 Intr - 167807 167668 140 0 2 58 86 50 0.586 1.06 8.05 Intr - 169010 168869 142 2 1 88 79 58 0.965 3.91 8.04 Intr - 170361 170141 221 1 2 85 44 145 0.522 6.90 8.03 Intr - 171492 171321 172 0 1 91 98 41 0.906 3.99 8.02 Intr - 174308 174214 95 0 2 54 85 90 0.578 4.06 8.01 Init - 181931 181865 67 2 1 82 77 49 0.781 4.49 8.00 Prom - 187953 187914 40 -5.35 9.00 Prom + 189234 189273 40 -5.95 9.01 Sngl + 189558 190622 1065 2 0 45 48 408 0.689 29.38 9.02 PlyA + 190666 190671 6 1.05 10.02 PlyA - 192435 192430 6 1.05 10.01 Term - 200762 200562 201 2 0 43 55 253 0.699 13.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_1|37_aa MPVQKAALLSPRNAMHALLMISAQFSVKTTDEHIDTK >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_1|111_bp atgcccgtgcagaaggcagctctgctcagtcctagaaatgctatgcatgctttgctgatg atctcagcacagttctcagtgaagacaacagacgaacacattgacaccaag >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_2|60_aa MECPVRALVLVQKKDSMSYMLECPGPQRRIGRISSLCHEYCQGSTQTCMEQERGDIDQHY >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_2|183_bp atggagtgcccggtaagagccttggtcttggtgcagaagaaagacagcatgtcatacatg cttgagtgtccaggtccccagaggaggattgggagaataagctccctgtgccatgaatat tgccaggggtccacacagacctgcatggagcaagaacgaggagacattgaccaacactat tga >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_3|67_aa MTLEFRKAAPGNQHQKKGEETPALSLYHVRMWLPEASKRAFTRISTLLDLDPKLPILQYC EKRNSVA >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_3|204_bp atgaccttggagttcagaaaggctgctccagggaatcaacaccagaaaaaaggggaagag acaccagcactctctctctaccatgttagaatgtggctgcctgaagccagcaagagagcc tttaccagaatctccacgttgctagaccttgatcctaaacttcccatcctccagtactgt gagaaacggaattctgttgcttaa >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_4|116_aa MANMEKPCVRNWRVLGLTDFKNEATDPGDSGAQLASPSGSRTGAAGGAACQSRAVRSHSS ALGWSMGLGAVEQGVVLVGEARAAQEPMEWVGGSGKSGCRSRALPRRKAAKGRCNY >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_4|351_bp atggccaatatggagaaaccctgtgtccggaattggcgggttctgggtctcactgacttc aagaatgaagccacggaccctggcgactcaggagcccagctagcttcacccagtggatcc cgcaccggggctgcaggtggagctgcctgccagtcccgcgccgtgcgttcccattcctca gcgcttgggtggtcgatgggactgggcgccgtggagcagggggtggtgctcgttggggag gctcgggccgcacaggagcccatggagtgggtgggaggctcaggcaagtcgggctgcagg tcccgagccctgccccgcaggaaggcagctaagggtcggtgtaactactaa >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_5|136_aa MWGQSPTGVLPGRAVTRGPQTFRHQNGRSTDSLHCVPGKAADTQCQPMKAARREAVPCNA TGAELPKTMGTHLLHQHDLDVRPGVKEDHFGALKFDGTAAFWTCMGPATPLFWPISPIWN SCIYPILVPSLHLGSY >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_5|411_bp atgtggggtcagagccctactggggtactgccaggtagagctgtgacaagagggccacag accttcagacaccagaatggtagatcaactgacagcttgcactgtgtacctggaaaagct gcagacactcaatgccagcccatgaaagcagctaggagggaggctgtaccctgcaatgcc acaggggcagagctgcctaagaccatgggaacccacctcttgcatcagcatgacctggat gtgagacctggagtcaaagaagatcattttggagctttaaaatttgatggcactgctgca ttttggacttgcatgggccctgcaacccctttgttttggccaatttctcccatttggaac agctgtatttacccaatacttgtaccctcattgcatctaggaagttactag >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_6|310_aa MSPKPRASGPPAKAKEAGKRKSSSQPSPSDPKKKVSDPPKLLLVFPSPPSSQEASPVVTW HNPPTRPPPLLRTRPCSQPPPSSSLNRSPSVISLLSFQTTKVAKKGKAVRRGRRGKKGAA TKMAAVTAPEAESGPAAPGPSDQPSQELPQHELPPEEPVSEGTQHDPLSQESELEEPLSQ ESEVEEPLSQESQVEEPLSQESEVEEPLSQESQVEEPLSQESEVEEPLSQESEVEEPLSQ ESQVEEPLSQESEVEEPLSQESQVEEPLSQESEMEEPLSQESQVEEPLSQESEMEEPLSQ ESEMEELPSV >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_6|933_bp atgagtccaaagccgagagcctcgggacctccggccaaggccaaggaggcaggaaagagg aagtcctcctctcagccgagccccagtgacccgaagaagaaggtgagtgaccctcccaag ctcctcctcgtcttcccctcgcctccttcctcacaagaagcctctcctgtcgtcacttgg cacaaccccccaacccggcccccaccgcttctgaggacacgtccctgttcccagcctcct ccatcctcgtccctaaaccggagcccttctgtgatctccctgttgtccttccagactacc aaggtggccaagaagggaaaagcagttcgtagagggagacgcgggaagaaaggggctgcg acaaagatggcggccgtgacggcacctgaggcggagagcgggccagcggcacccggcccc agcgaccagcccagccaggagctccctcagcacgagctgccgccggaggagccagtgagc gaggggacccagcacgaccccctgagtcaggagagcgagctggaggaaccactgagtcag gagagcgaggtggaagaaccactgagtcaggagagccaggtggaggaaccactgagtcag gagagcgaggtggaagaaccactgagtcaggagagccaggtggaggaaccactgagtcag gagagcgaggtggaggaaccactgagtcaggagagcgaggtggaagaaccactgagtcag gagagccaggtggaggaaccactgagtcaggagagcgaggtggaagaaccactgagtcag gagagccaggtggaggaaccactgagtcaggagagcgagatggaagaaccactgagtcag gagagccaggtggaggaaccactgagtcaggagagcgagatggaagaaccactgagtcag gagagcgagatggaagaactaccgagtgtgtag >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_7|232_aa MHPLASLSWPHGRLGVGTREIRKGYILLTEHQQGPFLATRFSSGKAQSIQDMPGDNASYI GLQQWLKGVGDQQLQLPLVLGMRMLSLPAPRMMHPCRIVALQCCPLQQGAAFASCHGGTG LGERIWIGQWELSDGVTETADRETVDKATAVMHSLYFISECEQLQKTTWPENELSTVCGP HRDDTNGFFNGKLIAIIPCSIHYLLLTAAHVSILCQVQWKPASTQIRFSCDT >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_7|699_bp atgcaccctcttgcttctctgtcgtggccccatggtcgtctaggtgtgggaacgagggaa ataagaaagggttacattttgcttactgaacaccagcaagggcctttcctggctaccagg ttcagctccggcaaagcacaaagcatacaggacatgccaggagacaatgcctcctacatt ggcctccagcaatggctgaagggtgttggtgaccagcagctccagctccctcttgttttg ggaatgagaatgctgagccttcctgctccaagaatgatgcacccatgcaggatagtggct ctgcaatgctgtccactgcagcagggagccgcctttgcttcctgccatggtggtactggg cttggtgagaggatttggattggccagtgggaattgagtgatggcgtaactgaaactgca gacagagaaactgtggataaggcgactgctgtaatgcattcactatacttcatctctgaa tgtgagcaactccagaaaactacttggccagaaaatgaactcagtacagtgtgtggccct cacagagatgacacaaatggcttcttcaatgggaagctcatcgccatcatcccctgctcc atccattatttgttactgacagctgcccatgtgtcaatcttatgtcaagtacagtggaag ccagcttcaacacaaatccgcttttcatgtgacacctga >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_8|455_aa MNTSGPGHEEVQLGERSGMSSIDPTVNRYHVRWFPEACAHNRTTGSEASSGMTHENYIIL QDLSFSCKYKVTVQPIRPKSHSKAEAVFFTTPPCSALKGKSHKPVGCLGEAGHVLSKVLA KPENLSASFIVQDVNITGHFSWKMAKANLYQPMTGFQVTWAEVTTESRQNSLPNSIISQS QILPSDHYVLTVPNLRPSTLYRLEVQVLTPGGEGPATIKTFRTPELPPSSAHIWEDIKEE EKNRKRRERVWLYIQIQNRKDMVIGRNICQGLVLFLQIKMCKLVELRPRDMHTSRGSGNC SEARTLLCDFSAGRTSVNHGRIWRQIREAGATDPEIGLEGCNKNPTSFTFFYFLSYTTGS QSYAMGFIMNNRKHLTFLRRVVHLGAVLSFGQMSAFSSKAQSQNISFIQLLTTTAPMPSE EDTEESHREWILSPAKVLGISQRPKATPLCNMRTA >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_8|1368_bp atgaacacttctggcccaggacatgaagaggtgcagctgggagaaagatcgggaatgtca agtattgatcccactgtcaaccgatatcatgtgcggtggtttcctgaagcgtgtgcccac aacagaacaaccggatcagaggcatcatctggcatgacccacgaaaattacataattctt caagatctgtcattttcctgcaagtataaggtgactgtccaaccaatacggccaaaaagt cactccaaggcagaagctgttttcttcactactccaccatgctctgctcttaaggggaag agccacaagcctgttggctgcctgggcgaagcaggtcatgttctttctaaggtgctagct aagcctgagaacctttctgcttcattcatcgtccaggatgtgaacatcaccggtcacttt tcttggaagatggccaaggccaatctctatcagcccatgactgggtttcaagtgacttgg gctgaggtcactacggaaagcagacagaacagcctacccaacagcattatttcacagtcc cagatcctgccttccgatcattatgtcctaacagtgcccaatctgagaccatctactctt taccgactggaagtgcaagtgctgaccccaggaggggaggggccggccaccatcaagacg ttccggacgccggagctcccaccctcttcagcacacatttgggaagacatcaaagaggaa gagaaaaacaggaaaaggagagaaagagtgtggttgtacattcagattcagaataggaaa gacatggtcataggaaggaacatttgccagggtctcgtgctttttctacaaataaagatg tgtaagcttgttgaacttcggccacgagacatgcacacttccagaggcagtgggaactgc tcagaggcccggactctcctatgtgactttagtgcaggaagaacttctgtcaatcatgga cgcatctggagacaaatcagagaggctggggctactgacccagagatcgggcttgaaggc tgcaataagaatccaacttccttcacattcttttacttcttgagctacaccacaggcagc caatcatatgcaatgggcttcattatgaataataggaagcatcttacattccttcgtagg gtggtccacctcggtgctgtattgtcatttggacagatgtcagctttctctagcaaggcc caaagccaaaacatcagcttcattcagttgctgacaaccacagcacctatgccaagtgaa gaagacaccgaggagtcgcatagggagtggattctgagtcctgccaaggtgcttggcatc tcacagaggccaaaagcaaccccactgtgtaacatgagaacagcttaa >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_9|354_aa MEIITNSLSDHSAIKLELRIKKATQNHTTTWKLNNMLLNDYWVNNEIQAEINKFFETNEN KDTMYQNHWDTAKAAFRGKFIARNAHRRKRGRSKIDTLTSQLKELEKQEQTNSKASRRQE ITKIRGELKEIETQKTLPKINESRSWFFENINKTDRLLARLIKKKREKNQIDTILNDKGD ITNDPTEVQTTIREYYKHLYANKLENLEEMDKLLDTYTLPRLNQEEVESLNNSLPTKKSP GPDRFTAELYQRYKQELVSFLLKLFQTIEKEGLLPNSFYEVSIMLIPKPGRDTTKKENFR PISLMNIDVKILNKILANRIQQHIRKFIHHDQVGFIPGMQGWFNIHNPSHKQNQ >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_9|1065_bp atggaaatcataacaaacagtctctcagaccacagtgcaatcaaattagaactcaggatt aagaaagccactcaaaaccatacaactacatggaaactgaacaatatgctcctgaatgac tactgggtaaataatgaaattcaggcagaaataaataagttttttgaaaccaatgagaac aaagacacaatgtaccagaatcactgggacacagctaaagcagcgtttagagggaaattt atagcacgaaatgcccacaggagaaagcggggaagatctaaaatcgacaccctaacatca caattaaaagaactggagaagcaagagcaaacaaattcaaaagccagcagaagacaagaa ataactaagatcagaggagaactgaaggagatagagacacaaaaaacccttccaaaaatc aatgaatccagaagctggttttttgaaaacattaacaaaacagatagactactagccaga ctaataaagaagaaaagagagaagaatcaaatagacacaatattaaatgataaaggggat atcactaatgatcccacagaagtacaaactaccatcagagaatactacaaacacctctat gcaaataaactagaaaatctagaagaaatggataaactcctggacacatacaccctccca agactaaaccaggaagaagttgaatctctgaataatagcctaccaaccaaaaaaagccca ggaccagacagattcacagctgaattgtaccagagatacaaacaggagctggtatcattc cttctaaaactattccaaacaatagaaaaagagggactcctccctaactcattttatgag gtcagcatcatgctgataccaaaacctggcagagatacaacaaaaaaagaaaatttcagg ccgatatccctgatgaacattgatgtgaaaatcctcaataaaatactggcaaaccgaatc cagcagcacatcagaaagtttatccatcacgatcaagttggcttcatccctgggatgcaa ggctggttcaacatacacaatccatcacataaacagaaccaatga >gi568815575f:8365451_8566380|GENSCAN_predicted_peptide_10|66_aa TELQETVEAEMTNMELGNESHIIPHEVLKSERSAGPQGMDIIPHKRSSVDECRELEKRIA WAENSA >gi568815575f:8365451_8566380|GENSCAN_predicted_CDS_10|201_bp acagaactacaggaaacagtggaagcagaaatgaccaacatggagttggggaatgagagt cacatcatccctcatgaagtcctgaagagtgagcgatctgcaggcccacaaggaatggac atcattccacacaaacgcagcagcgtggatgagtgtagagagttagagaagcgcattgca tgggcagagaactcagcgtga