GENSCAN 1.0 Date run: 7-Nov-116 Time: 20:18:08 Sequence gi568815589f:114487839_114697740 : 209902 bp : 44.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2252 2322 71 1 2 81 61 49 0.466 0.31 1.02 Term + 3742 3904 163 1 1 12 33 259 0.879 10.21 1.03 PlyA + 4558 4563 6 1.05 2.07 PlyA - 5570 5565 6 1.05 2.06 Term - 11022 10861 162 0 0 103 43 95 0.954 4.44 2.05 Intr - 13244 13184 61 1 1 92 76 12 0.299 -0.86 2.04 Intr - 15324 15269 56 0 2 31 109 67 0.649 0.88 2.03 Intr - 17079 16346 734 1 2 44 116 1203 0.073 109.86 2.02 Intr - 17301 17178 124 0 1 107 57 61 0.069 5.06 2.01 Init - 38001 37951 51 0 0 77 116 6 0.354 3.46 2.00 Prom - 49412 49373 40 -4.96 3.00 Prom + 53967 54006 40 -1.66 3.01 Init + 72474 72530 57 2 0 104 110 22 0.805 7.31 3.02 Term + 88334 88438 105 1 0 55 45 74 0.088 -1.79 3.03 PlyA + 90009 90014 6 1.05 4.00 Prom + 92854 92893 40 -6.36 4.01 Init + 100001 100082 82 1 1 118 87 156 0.938 19.63 4.02 Intr + 104714 104814 101 0 2 68 40 172 0.988 10.23 4.03 Term + 109732 109905 174 0 0 88 43 259 0.999 19.16 4.04 PlyA + 113308 113313 6 1.05 5.00 Prom + 135926 135965 40 -2.66 5.01 Init + 136469 136621 153 1 0 34 109 149 0.735 11.58 5.02 Intr + 139061 139168 108 1 0 76 79 114 0.927 9.78 5.03 Intr + 140263 140412 150 0 0 107 100 89 0.917 12.36 5.04 Intr + 145930 146040 111 0 0 79 53 132 0.566 9.38 5.05 Intr + 149152 149232 81 0 0 102 94 72 0.972 9.03 5.06 Intr + 150706 150888 183 0 0 110 77 20 0.859 3.08 5.07 Intr + 155296 155471 176 0 2 118 45 142 0.449 11.64 5.08 Intr + 170978 171112 135 2 0 45 81 112 0.005 5.88 5.09 Term + 172303 172414 112 1 1 58 46 90 0.609 -0.17 5.10 PlyA + 173718 173723 6 1.05 6.05 PlyA - 174045 174040 6 1.05 6.04 Term - 178908 178805 104 1 2 122 53 -10 0.514 -2.66 6.03 Intr - 180575 180368 208 2 1 93 96 79 0.169 7.85 6.02 Intr - 194385 194197 189 0 0 85 109 43 0.212 5.88 6.01 Intr - 208393 208049 345 1 0 -18 65 222 0.102 4.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 16963 16346 618 1 0 74 116 1214 0.860 116.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_1|77_aa MPRHGPSAPALCGHLASLRCCIPGKSSPPEEYEKAERASVKGKHQDDRLFAATRKQRDLE IMQQKEKKANEKQQEPK >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_1|234_bp atgccacgacacggtccatctgctccggctctgtgcggccacctcgcttccttgcgctgc tgcatcccaggcaagagctccccaccagaagagtatgaaaaagcagagcgcgcctcggtt aagggaaagcaccaagatgacaggctttttgccgccacccgcaagcagagggacttggag atcatgcagcagaaggagaaaaaggcaaacgagaagcagcaggaacccaagtag >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_2|395_aa MTGVSTISLLVAGKPGKNVAADPAPPGLQQQSPPPPGSSSQLLCLSGTRRRLRVPRIPGS SRRLRAPGDPAVAPVPARTAAPPTPRALRARPGLGVEMNAPLDGLSVSSSSTGSLGSAAG AGGGGGAGLRLLSANVRQLHQALTALLSEAEREQFTHCLNAYHARRNVFDLVRTLRVLLD SPVKRRLLPMLRLVIPRSDQLLFDQYTAEGLYLPATTPYRQPAWGGPDSAGPGEVRLVSL RRAKAHEGLGFSIRGGSEHGVGIYVSLVEPGSLAEKEGLRVGDQILRVNDKSLARVTHAE AVKAAEAHDYFLLSAVQWPVSRVDGPGAVQDPFQLGQSVICLGRQAGEEGHAQGDGTGEL KVPIGIRNQVASLLGPQPKDFVVSLLEDNASQSAY >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_2|1188_bp atgaccggtgttagcaccatatctctcttggtagcaggaaagccaggcaagaatgtagct gctgaccccgcgccccccggactccagcagcagtctccgccacctccgggctccagcagc caactcttgtgtctctcaggaacccgccgacgtctccgcgtcccccggattccaggctcc agccgtcgtctccgcgctcccggggatccagctgtcgcgccagtacccgcgaggacagcg gcaccgcccacaccccgcgcgctcagagcccggccgggcctcggcgtggagatgaacgcg ccgctggacggcctgtcggtgagctcgtcctccaccggctcgctgggctcggcggccggg gcgggcggcggcgggggcgcggggctgcggttactgtctgccaacgtgcgccagctgcac caagcgctgaccgcgctgctgagcgaggcggagcgggagcagttcacccactgcctgaac gcttaccacgcgcgccgcaacgtcttcgacctggtgcgcaccctgcgcgtgctgctggac agtccggtcaagcggcgcctgctgcccatgcttcgtctggtcatcccgcgctccgaccag ctgctcttcgaccaatacacggccgagggcctctacctgcccgccaccaccccctacagg cagcccgcctggggcggccccgacagcgcggggccaggggaggtgcgcctggtgagtttg cggcgtgccaaggcccacgagggcttgggcttcagcatccgtgggggctcggagcacggc gtgggcatctacgtgtctctggtggaaccaggctctctagctgagaaggaaggactgcgg gtcggggaccagattctgcgcgtcaacgacaaatccctggcccgggtgacccacgcggag gccgtcaaggcagcagaggcccacgactacttcctcctgagtgccgttcagtggcctgtg tccagggtggatgggccaggagctgtccaagatcccttccagcttggacagtctgtgatt tgcttggggaggcaggcaggcgaggagggtcatgctcaaggagatggaactggggagctt aaggtccccattgggattaggaaccaagtggcctccttgcttggaccccagcccaaggat tttgtagtcagcctcctggaagacaatgccagccagtccgcttactag >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_3|53_aa MATKRQELFVDASICRRAQLSNPPNLKPYPNVDTSSLVSYLFLILQSILSPEE >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_3|162_bp atggcaaccaaaagacaggaactatttgtagatgccagtatctgcagaagagctcagctc tccaaccctcctaacctcaaaccctaccccaatgtggacacctcaagcttggttagttac ctcttcctcattcttcagtctatattgtcacctgaagagtga >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_4|118_aa MASQSQGIQQLLQAEKRAAEKVSEARKRKNRRLKQAKEEAQAEIEQYRLQREKEFKAKEA AALGSRGSCSTEVEKETQEKMTILQTYFRQNRDEVLDNLLAFVCDIRPEIHENYRING >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_4|357_bp atggctagtcagtctcaggggattcagcagctgctgcaggccgagaagcgggcagccgag aaggtgtccgaggcccgcaaaagaaagaaccggaggctgaagcaggccaaagaagaagct caggctgaaattgaacagtaccgcctgcagagggagaaagaattcaaggccaaggaagct gcggcattgggatcccgtggcagttgcagcactgaagtggagaaggagacccaggagaag atgaccatcctccagacatacttccggcagaacagggatgaagtcttggacaacctcttg gcttttgtctgtgacattcggccagaaatccatgaaaactaccgcataaatggatag >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_5|402_aa MVITQPDEASGLLPELHNGQVLTVLRIDNTCAPISFDLGAAEEQLQTWGIQVPADQYRSL AESALLEPQVRRYIIYNSRPMRLAFAVVFYVVVWANIYSTSQMFALGNHWAGMLLVTLAA VSLTLTLVLVFERHQKKANTNTDLRLAAANGALLRHRVLLGVTDTVEGCQSVIQLWFVYF DLENCVQFLSDHVQEMKTSQESLLRSRLSQLCVVMETGVSPATAEGPENLEDAPLLPGNS CPNERPLMQTELHQLVPEAEPEEMARQLLAVFGGYYIRLLVTSQLPQAMGTRHTNSPRIP CPCQLIEAYILGTGCCPFLASPGTATNLLCGFGLAHTEAPRICNRKSVGCVIYLSKDSPS SDDRRLVQAGAVTGEQQGMSQQVRHHLPHPPAKLMDGFMDDL >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_5|1209_bp atggttatcactcagccagatgaagcttctggtctgcttccagagctccacaatggccag gtcctcactgttctccggattgacaatacctgtgcacccatctccttcgacctgggagcc gcagaagagcaactgcaaacttggggcatccaggtcccggctgaccagtacaggagcttg gctgagagtgccctcttggagccccaagtgagaagatatatcatctacaactcgaggcct atgcggctggcctttgctgtggttttctatgtggtggtgtgggccaatatctactctacc agtcagatgtttgccttggggaaccactgggctggcatgctgctcgtgaccctggccgcg gtgagcctgaccttgactcttgtgctggtctttgaaagacaccagaagaaggccaacacc aacacggacctgaggctggcagctgccaatggagccctcctgagacaccgggtgctgctg ggggtgacagacacagtggaaggatgccagagtgtgattcagctttggtttgtctacttc gacctggagaactgtgtgcagtttttgtctgatcatgttcaagaaatgaagactagccaa gagtccttgctgagaagcagattgagccagttgtgtgttgtcatggagactggggtgagc cctgcaacagcggaggggcctgagaacttggaggatgctcctctcctgcccggcaattct tgtcctaacgagaggccactcatgcagactgagcttcatcagcttgttcctgaggctgag ccggaggaaatggcccgccagctgctggcagtgtttggcggctactacatccggcttcta gtgacctcccagctccctcaggcaatggggacacgacacacgaactctccgagaattcca tgcccctgccagctcatagaagcctacatcctaggcacagggtgctgcccgttcctggcg agtcctggcactgccactaacttgctgtgtggctttgggctagcccatactgaggccccc cgcatctgcaacaggaagtcagtgggctgcgtcatctatctttccaaggattctcccagc tctgatgaccgaaggttggtccaagcaggtgctgtcaccggggagcagcaaggaatgtca caacaagtaagacaccacctgcctcatccacctgccaagctgatggatggctttatggat gacttatag >gi568815589f:114487839_114697740|GENSCAN_predicted_peptide_6|281_aa PNLKSVNELICKCGYGKINKKRIALTGNTLIAQSLGKYGIICTEDLIHEIYTVGKCFKQA SNFLWPSKPSSPEGGMKKKTTHFVEGGNADNRGDQINSLLRRINEDVYHDYFSNLVLPEA PGLVPQYTVWWPAWHRAQQNFSLLLAVVVQSKHPSPNIPWFPSSVTSQTAFCLLCQLQSL QNIFTSVISPKPPPRDPVEFSLPDLLLQKDELDRQNPKRINAVSHLPSRTPLIQTKKSTS SSSSEFEDLNAYASQRNFYKRNLNRYCQEHWPFQPCLTGRP >gi568815589f:114487839_114697740|GENSCAN_predicted_CDS_6|846_bp ccaaatctgaagtcagtaaatgaactaatctgtaagtgtggttatggcaaaatcaataag aagcgaattgctttgacaggtaacactttgattgctcaatctcttggtaaatatggtatc atctgcacggaagatctgattcatgagatctatactgttggaaaatgcttcaaacaagca agtaacttcctgtggccctccaaaccatcctctccagaaggtggaatgaagaaaaagacc acccattttgtagaaggtggaaatgctgacaacaggggggaccagatcaacagccttctt agaagaataaatgaagatgtctaccatgattatttttctaatctggtgcttcctgaggcc ccaggacttgtgccccagtatacagtctggtggccagcatggcatcgtgctcagcagaac ttctccctcctcctcgctgttgtggtccagtccaaacatcccagccccaacattccctgg ttccccagctctgtgacatcacagacagccttctgtctactctgccagctgcagagcttg caaaacatcttcacttccgtgatctcacctaagcctcccccacgtgaccctgtggaattc tcattgccagatttgctgcttcagaaggatgagcttgacagacaaaatcccaagcgcatt aacgcagtctcccatttgccttcgagaacacccctgatccagacaaaaaagagcacttcc tccagcagcagtgagtttgaggatctgaatgcatatgcttcccaaagaaatttttacaag agaaacttaaaccgctactgccaggagcactggccattccagccatgcctcactgggagg ccctga