GENSCAN 1.0 Date run: 3-Nov-116 Time: 22:39:48

Sequence gi568815588f:70304402_70520100 : 215699 bp : 49.08% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -   1153   1148    6                               1.05
 1.06 Term -   7663   7572   92  2  2  108   41    37 0.136  -1.12
 1.05 Intr -  19629  19462  168  1  0   75   95   301 0.714  29.42
 1.04 Intr -  36301  36152  150  2  0   93   80   231 0.931  22.93
 1.03 Intr -  56069  55946  124  2  1   15   64   157 0.193   6.16
 1.02 Intr -  67115  66976  140  0  2   67   78    32 0.246   0.28
 1.01 Init -  72132  72051   82  0  1   47   92   111 0.479   8.53
 1.00 Prom -  73544  73505   40                              -6.86

 2.00 Prom +  84002  84041   40                              -3.26
 2.01 Sngl +  85025  86047 1023  1  0   52   48   361 0.505  25.57
 2.02 PlyA +  86209  86214    6                               1.05

 3.00 Prom +  98711  98750   40                              -6.96
 3.01 Init + 100001 100145  145  1  1   80   83   167 0.176  15.80
 3.02 Intr + 115513 115698  186  2  0  101   91    72 0.759   8.46
 3.03 Term + 118621 118694   74  2  2  127   54     6 0.627  -0.83
 3.04 PlyA + 121578 121583    6                               1.05

 4.04 PlyA - 122044 122039    6                               1.05
 4.03 Term - 128687 128535  153  2  0  115   45   122 0.558   8.52
 4.02 Intr - 131582 130885  698  0  2   92   75   559 0.992  46.21
 4.01 Init - 137266 137074  193  1  1   98  102   269 0.993  26.43
 4.00 Prom - 142299 142260   40                              -6.76

 5.00 Prom + 150611 150650   40                              -5.96
 5.01 Init + 153995 154106  112  1  1   79  100   127 0.609  11.41
 5.02 Intr + 155469 155557   89  1  2   82  -23   118 0.386  -0.21
 5.03 Term + 155939 156037   99  1  0   85   43    82 0.787   1.53
 5.04 PlyA + 156662 156667    6                               1.05

 6.00 Prom + 160893 160932   40                              -6.46
 6.01 Init + 174514 174658  145  0  1   64   86   195 0.955  15.18
 6.02 Intr + 176213 176327  115  0  1  104   37    43 0.539   0.31
 6.03 Intr + 177539 177732  194  2  2   84   68    59 0.376   2.64
 6.04 Intr + 181518 181697  180  1  0   84   66    37 0.028   0.94
 6.05 Term + 190718 190812   95  0  2  122   42    51 0.607   1.89
 6.06 PlyA + 191304 191309    6                               1.05

 7.06 PlyA - 191868 191863    6                               1.05
 7.05 Term - 192441 192354   88  2  1   85   44    52 0.386  -2.37
 7.04 Intr - 193122 192902  221  0  2   38   94   106 0.100   3.30
 7.03 Intr - 199551 199458   94  2  1   41  101    88 0.198   5.37
 7.02 Intr - 199662 199617   46  1  1  135   57    22 0.271   1.27
 7.01 Init - 209929 209773  157  1  1   70   86    97 0.087   5.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 173688 173481  208  0  1   88   97   111 0.899   9.80

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_1|251_aa
MLKKMGEAVARVARKVNETVESGSDTLGSWLCSVHGDWLLKNNRDTEEQDLKRPEARDST
THIPGSPQAPSTQQSETLSQKKRNEEQEKEEEEEEGEEEEGDGEEEEEEESNEEPDLAEC
KLVSFPIGIYKVLRNVSGQIHLITLANNELKSLTSKFMTTFSQLRELHLEGNFLHRLPSE
VSALQHLKAIDLSRNQFQDFPEQLTALPALETINLEENEIVGSHCGHHTGPGKHSGAIVC
NHPASQPVPLM
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_1|756_bp
atgctgaagaagatgggtgaggccgtggccagagtagcaaggaaggtcaacgagacggtg
gagagcggctctgacactctgggctcctggctctgcagtgtgcatggggactggctttta
aagaataatcgtgacacagaggagcaggatttgaaaaggcctgaggcccgagactctact
acacacatccctgggagcccccaggcacctagtacacagcagagtgagaccctgtctcaa
aagaaaagaaatgaagagcaagagaaggaggaagaagaggaggagggggaggaggaggag
ggagatggggaggaggaggaggaggaggaaagcaatgaggaaccagacctggccgagtgc
aagctggtctcctttcccattggcatctacaaggtcctgcggaatgtctctggccagatc
cacctcatcaccctggctaacaacgagcttaagtccctcaccagcaagttcatgaccaca
ttcagtcagctccgagagctccacctggaggggaacttcctacaccgcctccccagcgag
gtcagtgccctgcagcacctcaaggccattgacctgtcccggaaccagttccaggacttc
cctgagcagcttaccgccctgccggcgctggagaccatcaacctggaggagaacgagatc
gtagggtcccactgtgggcaccacactggaccaggaaagcattctggagccatcgtttgt
aatcatcctgcttctcagccggttccactgatgtaa
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_2|340_aa
MDSELMHSIVGSYHKPPERVFVPSFTQNEPSQNCHPANLEVTSPKILHSPNSQALILALK
TLQEKIHCLELERTQAEDDLNILSREAAQYKKALENETNERNLAHQELIKQKKDVSIQLS
SAQSRCTLLEKQLEYTKRMVLNVEREKNMILEQQAQLQREKEQDQMKLYAKLDKLDVLEK
ECFRLTTTQKTAEDKIKHLEEKLKEEEHQRKLFQDKASELQTGLEISKIMSSVSNLKHSK
EKKKSSKKTKCIKRGPPWQICSKFGALPFVAEKMRQHRDPHILQKPFNVTETRCLPKPSR
TTAWCKAITPDSEKSISICDNLSELLMAMQDELDQMSMEH
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_2|1023_bp
atggattctgaattaatgcatagtatagtaggaagctatcataaacctccagaaagagta
tttgttccctcattcacccagaatgaaccatctcagaattgccatcctgcgaacttagaa
gttacctctcctaagatacttcatagcccaaatagccaagctcttattttagccttaaaa
actcttcaggaaaaaattcattgtttagagctggagagaacacaagctgaagatgacctg
aacattctttccagagaagcagcacagtataagaaggccttagagaatgaaacaaatgag
agaaatctagcacatcaggagctgataaagcagaaaaaagatgtaagtatacagttaagc
tcagcccagtctcgttgcactcttctagagaagcaactagaatatacaaagagaatggtt
ctcaatgtagagcgagaaaagaacatgatcctagaacaacaggcccagcttcagagggaa
aaagaacaagatcagatgaagctgtatgcaaaacttgacaagcttgatgtcttagaaaaa
gagtgttttagacttacaacaactcagaaaactgctgaggacaagattaaacatttagaa
gaaaaacttaaggaagaagaacatcagcgtaagctatttcaagacaaagcttctgagctt
caaactggacttgaaatcagtaaaattatgtcttcagtttcaaatttaaagcactccaag
gaaaagaagaaatcttcaaagaaaactaaatgtataaagagaggaccaccttggcaaatt
tgttcaaagtttggagcactgccttttgtggctgaaaagatgaggcaacatcgtgaccca
catatccttcagaaaccttttaacgtgactgagactagatgtctccccaagccttctaga
acaactgcctggtgtaaagctattactcctgactcagaaaagtccatttccatttgtgac
aatttatctgaacttttgatggcaatgcaagatgagctggaccaaatgagcatggagcac
taa
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_3|134_aa
MSSSAGSGHQPSQSRAIPTRTVAISDAAQLPHDYCTTPGGTLFSTTPGGTRIIYDRKFLL
DRRNSPMAQTPPCHLPNIPGVTSPGTLIEDSKVEVNNLNNLNNHDRKHAVGHFSNLHQRY
LKCWYLRISVTPLI
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_3|405_bp
atgtcctcgtcagccggcagcggccaccagcccagccagagccgcgccatccccacccgc
accgtggccatcagcgacgccgcgcagctacctcatgactattgcaccacgcccgggggg
acgctcttctccaccacaccgggaggaactcgaatcatttatgacagaaagtttctgttg
gatcgtcgcaattctcccatggctcagaccccaccctgccacctgcccaatatcccagga
gtcactagccctggcaccttaattgaagactccaaagtagaagtaaacaatttgaacaac
ttgaacaatcacgacaggaaacatgcagttggccatttttcaaatttacatcaaagatac
ctgaagtgttggtatctgagaatatctgtcactcctcttatctga
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_4|347_aa
MHAHCLPFLLHAWWALLQAGAATVATALLRTRGQPSSPSPLAYMLSLYRDPLPRADIIRS
LQAEDVAVDGQNWTFAFDFSFLSQQEDLAWAELRLQLSSPVDLPTEGSLAIEIFHQPKPD
TEQASDSCLERFQMDLFTVTLSQVTFSLGSMVLEVTRPLSKWLKHPGALEKQMSRVAGEC
WPRPPTPPATNVLLMLYSNLSQEQRQLGGSTLLWEAESSWRAQEGQLSWEWGKRHRRHHL
PDRSQLCRKVKFQVDFNLIGWGSWIIYPKQYNAYRCEGECPNPVGEEFHPTNHAYIQSLL
KRYQPHRVPSTCCAPVKTKPLSMLYVDNGRVLLDHHKDMIVEECGCL
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_4|1044_bp
atgcacgcccactgcctgcccttccttctgcacgcctggtgggccctactccaggcgggt
gctgcgacggtggccactgcgctcctgcgtacgcgggggcagccctcgtcgccatcccct
ctggcgtacatgctgagcctctaccgcgacccgctgccgagggcagacatcatccgcagc
ctacaggcagaagatgtggcagtggatgggcagaactggacgtttgcttttgacttctcc
ttcctgagccaacaagaggatctggcatgggctgagctccggctgcagctgtccagccct
gtggacctccccactgagggctcacttgccattgagattttccaccagccaaagcccgac
acagagcaggcttcagacagctgcttagagcggtttcagatggacctattcactgtcact
ttgtcccaggtcaccttttccttgggcagcatggttttggaggtgaccaggcctctctcc
aagtggctgaagcaccctggggccctggagaagcagatgtccagggtagctggagagtgc
tggccgcggccccccacaccgcctgccaccaatgtgctccttatgctctactccaacctc
tcgcaggagcagaggcagctgggtgggtccaccttgctgtgggaagccgagagctcctgg
cgggcccaggagggacagctgtcctgggagtggggcaagaggcaccgtcgacatcacttg
ccagacagaagtcaactgtgtcggaaggtcaagttccaggtggacttcaacctgatcgga
tggggctcctggatcatctaccccaagcagtacaacgcctatcgctgtgagggcgagtgt
cctaatcctgttggggaggagtttcatccgaccaaccatgcatacatccagagtctgctg
aaacgttaccagccccaccgagtcccttccacttgttgtgccccagtgaagaccaagccg
ctgagcatgctgtatgtggataatggcagagtgctcctagatcaccataaagacatgatc
gtggaagaatgtgggtgcctctga
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_5|99_aa
MAGPTEPGEVSAPGRPAGLEAEAQAPALVGKRAEGDPASPLPSSCRTFSTAVVRGAYTPR
FTVTLHPTSPQNSGPLHPPAYRASPVGSLTVISGSIHQD
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_5|300_bp
atggcaggacctacggagcccggtgaggtgagcgcgccaggccggccggctgggctggag
gcagaggcccaggcgcccgccctcgtgggaaagcgcgcggagggcgacccggccagccct
ctcccctccagctgccgcaccttctccacagcagttgtccgtggtgcctacactccccgc
ttcacggtcacccttcacccgaccagtcctcagaactctggacccctgcacccaccggcc
taccgtgcgtctcccgtgggctccctcacagtcatctccggctccattcatcaagactga
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_6|242_aa
MGWRSWEERAGARARAGRAPSLPDSAPAVRAQHAGRRSTDAPRSRGLAGFCFSGALRGGG
SVLMLVALGAVLGVGVAELTRSPWLALVGSVTSTSLWVQWEKISAKHVAQCMAWSECSVD
VAVGAFDYEPSPVARTLQGFLSSQVHGPGPLDEDSGTVAVQMFSHGRVIQEHLRAPFMRQ
AVVLGASSYLIFLGFIPASYADCLGFERLLPGHLHGPFPLLPGMCLHTAGAFLSSYVGIG
LP
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_6|729_bp
atgggctggcgaagttgggaggagcgagctggagccagagcgcgcgccgggcgcgccccg
tcgctgcctgactcggcgcccgcagttcgggcgcagcacgccggccgcaggagcacggat
gccccccggagccgcgggctggcaggtttctgtttttctggagcacttcgtggaggtggc
agtgtgctcatgctcgtggccttgggtgcagttctgggagtgggagttgctgagctcacc
cgaagcccttggctggcactagtggggtcagtaacctctacctccttgtgggttcagtgg
gagaaaatcagtgcaaagcatgtggcccagtgcatggcatggagtgagtgctcagttgat
gtggcggtgggggcctttgattatgagccatcccctgtggccaggactctgcagggcttc
ctcagcagccaggtgcatggccctgggcctctggacgaggactctgggactgtggctgtg
cagatgttttctcatggcagagttatccaagaacatcttcgtgctccctttatgaggcaa
gctgttgttcttggggcatcatcttatttaatttttcttggcttcattccagcaagttat
gccgattgtttaggatttgagaggttactgccagggcatctgcacgggccgttccctctg
ctgccgggcatgtgtcttcacacggccggggccttcctgtcttcatacgtcggcataggc
cttccgtga
>gi568815588f:70304402_70520100|GENSCAN_predicted_peptide_7|201_aa
MQSFALQAGLPLRGAALTSSLQPAIPGEPLKQAPSPGRKEASWLPHTEMTSQGADTEAQM
VRVNLPQGCPSSYITLPKEVRKDKRRRDTKEATDMKEENSSGWRDGADMERAKGTTTQRV
GTVRRAVRVALRNDPNATSGQGVGGLVARHAPAPDRLGLSEAAVHTCTRQEHTPSGTSSV
SLLVKGGFYPSTQQFTPGSTS
>gi568815588f:70304402_70520100|GENSCAN_predicted_CDS_7|606_bp
atgcagtcctttgctctgcaggcggggctgccactgcggggggcagccctgacttccagc
cttcagcctgcaatccctggggagcctttgaagcaagcgccaagtcctggcaggaaggag
gcctcatggctgccacacactgagatgacaagtcaaggtgcagacactgaggcccagatg
gtccgggttaacttgccccaaggctgcccctcatcttacatcacgctcccaaaggaagtg
cggaaggataaaagaagacgtgatacaaaggaagccacagacatgaaagaagagaatagc
tcaggatggagagatggagcagatatggaaagagctaaaggcaccaccacccagagagtt
ggcacagtcaggagagctgtgcgagtggcgctgaggaatgacccaaacgctacctcgggt
cagggagttggcggccttgtggccaggcatgcgccggccccagacaggctgggtctgagc
gaggcagctgtgcacacctgcacccgccaggagcacacaccctctggaacctcctcggtc
tccctcttggtgaaaggaggattctaccccagcacccagcagttcacacctggctctacg
tcctag