GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:19:00 Sequence gi568815577f:42793334_43009018 : 215685 bp : 48.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 25410 25663 254 2 2 76 90 88 0.128 4.64 1.02 Intr + 42205 42286 82 1 1 33 105 51 0.050 0.94 1.03 Intr + 45842 45905 64 1 1 96 86 33 0.053 2.19 1.04 Intr + 49082 49291 210 0 0 76 82 65 0.325 3.58 1.05 Term + 49768 49922 155 2 2 52 42 153 0.683 5.18 1.06 PlyA + 52796 52801 6 1.05 2.15 PlyA - 54349 54344 6 1.05 2.14 Term - 56909 56716 194 0 2 104 43 231 0.987 17.78 2.13 Intr - 58991 58922 70 2 1 76 100 90 0.959 7.75 2.12 Intr - 60475 60236 240 1 0 47 94 296 0.804 23.85 2.11 Intr - 61223 61058 166 1 1 -39 72 185 0.653 4.16 2.10 Intr - 61293 61229 65 0 2 100 102 77 0.794 7.82 2.09 Intr - 62447 62349 99 2 0 83 70 84 0.031 6.41 2.08 Intr - 65532 65391 142 2 1 58 32 62 0.061 -2.04 2.07 Intr - 66154 66027 128 1 2 78 75 84 0.401 5.58 2.06 Intr - 66389 66329 61 1 1 136 98 31 0.958 7.64 2.05 Intr - 69151 68949 203 1 2 35 70 216 0.583 12.58 2.04 Intr - 70263 70107 157 2 1 81 66 140 0.830 11.11 2.03 Intr - 80358 80218 141 2 0 129 80 28 0.422 5.57 2.02 Intr - 83434 83369 66 0 0 92 76 66 0.697 3.82 2.01 Init - 86162 86074 89 2 2 76 94 130 0.508 10.70 2.00 Prom - 95734 95695 40 -4.26 3.00 Prom + 97033 97072 40 -9.95 3.01 Init + 100001 100048 48 1 0 107 98 86 0.838 10.61 3.02 Intr + 100217 100369 153 1 0 19 64 95 0.131 0.47 3.03 Intr + 103594 103714 121 0 1 49 69 78 0.229 2.07 3.04 Intr + 109849 110943 1095 2 0 98 36 546 0.592 40.35 3.05 Intr + 115124 115150 27 0 0 86 87 34 0.644 1.19 3.06 Term + 115531 115688 158 2 2 94 55 110 0.977 6.40 3.07 PlyA + 115746 115751 6 1.05 4.08 PlyA - 116362 116357 6 1.05 4.07 Term - 125958 125884 75 0 0 84 43 66 0.188 -0.46 4.06 Intr - 126227 126026 202 1 1 34 73 58 0.138 -1.81 4.05 Intr - 126925 126742 184 2 1 111 54 146 0.334 12.35 4.04 Intr - 130031 129827 205 2 1 47 115 73 0.306 4.67 4.03 Intr - 130470 130268 203 1 2 76 39 44 0.137 -2.70 4.02 Intr - 132281 132013 269 1 2 86 119 36 0.471 3.68 4.01 Init - 138480 138395 86 0 2 89 28 106 0.560 5.13 4.00 Prom - 143782 143743 40 -4.16 5.06 PlyA - 145045 145040 6 1.05 5.05 Term - 160389 160228 162 0 0 76 48 93 0.355 2.04 5.04 Intr - 161295 161195 101 1 2 110 28 33 0.513 -0.67 5.03 Intr - 162238 162108 131 0 2 24 81 152 0.415 8.44 5.02 Intr - 162462 162388 75 2 0 57 74 75 0.307 1.63 5.01 Init - 171873 171770 104 0 2 65 94 111 0.567 9.11 5.00 Prom - 178017 177978 40 -4.66 6.00 Prom + 180242 180281 40 -6.06 6.01 Init + 181899 181964 66 2 0 101 52 180 0.870 14.77 6.02 Intr + 199078 199241 164 0 2 58 18 86 0.000 -2.63 6.03 Intr + 209868 209985 118 0 1 77 53 84 0.040 4.27 6.04 Term + 214158 214289 132 2 0 102 39 101 0.730 4.69 6.05 PlyA + 215056 215061 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 62555 62349 207 2 0 78 70 202 0.885 16.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_1|254_aa MKVAVHKPGKEPLLGTRPGQHLDLRLLASRSPNAMRRYPRDLAPELSWCRVPDPKRDEES LLMGTAVAHRVPGPEKSGQDIHVRSPGTNQFSKDCQFHTLENSARKRRSECKEFALQGFS FRSLAEGVPVAFQGGIDVTCLVHSEPEATRPPPGDSLGKRPPDSQVPMKFCTGAAAVQEM GGPCLHPVWKGSGASGPLGLPGIVKNLLHRLSSGKTYPISRGQITDAPRLTMELHPNKLI VGRQIARKWSTDRT >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_1|765_bp atgaaggtagctgtccacaagccaggaaaggagcccttactgggaacccgaccaggccag caccttgatctcagacttcttgcctccagaagccccaatgcaatgaggaggtatccacgg gacctggccccagaattgtcctggtgcagagttccagatcccaaaagggatgaggagagt ttacttatggggacagcagtggcccacagagtgccaggacccgagaagagtggtcaggac atccatgttaggagtcctggaacaaaccagttctccaaggattgccagtttcatacgctg gagaatagtgctagaaaacgaagatctgagtgcaaggaatttgctctccagggtttttca ttccgctccttagcagaaggtgttcctgttgctttccaaggtggcattgacgtgacctgc ctagtgcacagtgagcctgaggcaacacggccaccaccaggagactccctggggaagagg cccccagacagccaggtgcccatgaagttctgcacaggggccgccgccgtccaggagatg ggagggccctgtctgcaccctgtctggaaggggagtggggcgagcgggccacttggcctc cctggtatagtcaaaaatttactacacagactcagtagcggaaaaacatatcccataagc agaggacaaatcacagatgctcctcgacttacgatggagctacatccgaacaagctcatc gtcggccgacagatcgcacgcaagtggtccacggatcgcacataa >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_2|606_aa MAGSVGLALCGQTLVVRGGSRFLATSIASSDDDSLFIYDCSAAEKKSQENKGEDAPLDQG SGAILASTFSKSGSYFALTDDSKRLILFRTKPWQCLSVRTVARRCTALTFIASEEKVLVA DKSGDVYSFSVLEPHGCGRLELGHLSMLLDVAAASLPSSMRDLIREDSGERLFLFWPLVM QAVSPDDRFILTADRDEKIRVSWAAAPHSIESFCLGHTEFVSRISVVPTQPGLLLSSSGD CVWGVRRCVCVRLWDELARVCGRLPSEGGCCIYDGIGDQDLRMADGPSGRFYVSSLIGLH SQPPVPALQINTQPRDLREEEKCGHNLQRDGTLRLWEYRSGRQLHCCHLASLQELVDPQA PQKFAASRIAFWCQENCVALLCDGCGGTSMASGSADLQGEELAGWPLAHVLNEEVLNGGG IGLAPPSENVTFDDCCCMRSGARGPAVLVIMLSFFVASTPVVYIFQLDARRQQLVYRQQL AFQHQVWDVAFEETQGLWVLQDCQEAPLVLYRPVGDQWQSVPESTVLKKVSGVLRGNWAM LEGSAGADASFSSLYKATFDNVTSYLKKKEERLQQQLEKKQRRRSPPPGPDGHAKKMRPG EATLSC >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_2|1821_bp atggcgggctctgtgggactggcgttgtgcgggcagacgttggtggtgcggggcggcagc cgattcctggccacctccatagcaagcagtgatgatgacagcctcttcatctatgactgc agtgctgcagaaaagaagtcacaagaaaataaaggggaggacgcgcccttggaccagggg agcggtgcgattctggcgtccaccttctccaagtctggcagctattttgctttaaccgat gacagtaagcgtctgattcttttccgtacaaaaccatggcaatgtctgagtgtcaggacc gtggcaaggaggtgtacagccctgactttcatagcctcggaggagaaggtcttggtggcc gacaagtctggagacgtctactccttttcggtgctggagccacacgggtgtggccgtcta gagctggggcacctgtctatgctgttagatgtggctgcagcttccctgccttcttccatg agggacttgattcgggaagactcaggtgagcggctttttcttttctggcccctggtgatg caggctgtgagtcctgatgaccgcttcatcctcactgccgaccgggacgagaagatccga gtcagctgggccgcggcgccccatagcatcgagtccttctgcttggggcacacagagttt gtgagccgtatctccgtggtgccaactcagcccgggctgcttctgtcctcctctggggac tgcgtgtggggtgtgcgtaggtgcgtgtgtgttcgcctgtgggatgaactagcacgggtg tgtggaaggctcccttctgagggcggctgctgcatctacgatggtatcggggatcaggat ctcaggatggctgatggaccttctggccgcttctatgtttcttctctgattgggcttcat tcacagcctcctgttcctgctcttcaaattaacacacagccacgcgacctgagagaagaa gaaaagtgtggccataatctccagagggacggcaccctgaggctctgggagtacaggagc ggccgccagctgcactgctgtcacctggccagtctgcaggagctggtggacccccaggcc ccccagaagtttgccgcgtccaggattgcattctggtgccaggagaactgcgtggcgctc ctgtgcgacggctgcggcggcacctctatggcctccggttctgcagacctgcagggggaa gagttagcgggttggccactggcccacgtcctcaatgaagaggtcctgaacggtggtggt attggtctagcacctccctctgagaatgtcacttttgatgactgctgctgcatgaggagc ggagcccgtgggcctgcagtcctagtaatcatgctctctttcttcgtggcaagcactcct gtggtctacatcttccagctggacgcccgcagacagcagttggtgtacaggcagcagctg gcgttccagcaccaagtgtgggacgtggctttcgaggagacccaggggctgtgggtgctc caggactgccaggaagcccccctggtgctctacaggcctgtgggcgaccagtggcagtct gttcctgagagcaccgtgttaaagaaagtctctggtgttcttcgtgggaactgggccatg ctggaaggctctgccggcgcagacgccagcttcagcagtctctacaaggccacgttcgac aacgtgacctcctacctgaagaagaaagaggagagactgcagcagcagctagagaagaag cagcggcgccggagtcccccgcctgggcccgacgggcatgccaagaagatgagaccgggg gaggcgacgctaagttgctga >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_3|533_aa MAAPCLLRQGRAGALKARMWPASVPRFPRAGPPPPLPEAQAEAGVPRADGRSRGLHAWAA DSGGRSQTMLQEAQVFRGLASTVSLSAESGKSEKGQPQNSKKQSPPKNVVEPKERGKLLA TQTAAELSKNLSSPSSYPPAVNKGRKVASPSPSGSVLFTDEGVPKFLSRKTLVEFPQKVL SPFRKQGSDSEARQVGRKVTSPSSSSSSSSSDSESDDEADVSEVTPRVVSKGRGGLRKPE ASHSFENRAPRVTVSAKEKTLLQKPHVDITDPEKPHQPKKKGSPAKPSEGRENARPKTTM PRSQVDEEFLKQSLKEKQLQKTFRLNEIDKESQKPFEVKGPLPVHTKSGLSAPPKGSPAP AVLAEEARAEGQLQASPPGAAEGHLEKPVPEPQRKAAPPLPRKETSGTQGIEGHLKGGQA IVEDQIPPSNLETVPVENNHGFHEKTAALKLEAEGEAMEDAAAPGDDRGGTQDRVDEERL VEPAPVPAEPFDNTTYKNLQHHDYSTYTFLDLNLELSKFRMPQPSSGRESPRH >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_3|1602_bp atggctgccccgtgtttgctgcggcaaggacgagccggggcgctgaaggcccggatgtgg ccagcgagcgtcccgcggttcccccgagccggtccgccgccgccgctgcccgaggcccag gccgaggctggcgtgccccgtgccgacgggcgctcccgagggctgcatgcctgggcggca gactcgggtggacgttcccagactatgctccaggaagcccaggtgtttcgaggacttgct tctacggtttctttgtctgcggaatcagggaagagtgaaaagggtcagccacagaattcc aagaagcaaagtccaccaaaaaatgtagtggaaccaaaggagaggggcaagctcctagcc acccagacagcagctgaattgtctaaaaacttatcttcacccagttcttacccgccagct gtgaataagggcaggaaggtagctagtcccagtcccagtggcagcgtgctattcacagat gaaggggttccgaaatttttgtcaagaaagactttggtagagtttccacagaaagttctg tctccattcagaaaacagggctctgattcagaagctcgtcaggtgggtcggaaagtgacg tcgccttcgtcttcatcctcttccagctcctctgattctgaatctgatgatgaggctgac gtttcagaggtcactcctcgagtggtgagcaaaggcagaggggggcttcgaaaaccagag gcctctcattcctttgaaaacagagccccccgagttacagtatcagcaaaagagaaaacc ttgctgcagaagccgcatgtggacattactgatccagagaagccccaccagccaaagaag aaagggtcccctgctaagccatcagaaggcagggaaaatgcgagaccaaaaaccacaatg cccagatctcaagtagatgaagagtttttgaagcaaagtttaaaggaaaaacaattgcag aaaacatttagattaaatgaaatagataaagaaagccaaaagccatttgaagttaaagga cccttacctgtccacacaaaatcagggttgtctgcgccaccgaagggcagcccagcgcct gctgtgttggcagaagaggccagagcagaggggcagctgcaagccagtcctcctggggcg gcagaggggcatctggaaaaacccgtgccagagccccagcgcaaggcggcccctcccctg cccagaaaggaaacctcagggacgcagggaatagaaggccacctgaagggtggacaggca atcgtggaagatcagataccaccaagcaatttggagacagttcctgttgagaataaccac ggtttccatgaaaagacagcagcgctgaagcttgaggccgagggcgaggccatggaagat gcagccgcgccaggggacgaccgaggcggcacacaggatagggttgatgaagaacgcttg gtagagccagccccagtgcctgctgagccgtttgacaacactacctacaagaacctgcag catcatgactacagcacgtacaccttcttagacctcaacctcgaactctcaaaattcagg atgcctcagccctcctcaggccgggagtcacctcgacactga >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_4|407_aa MGTDDPVVQEAALVALGSLQKDPAMRICGPQSVCIQVLIKTVCCSTPPRVVCWRALRVPT NARALGLALKPARGFRPRPQWTYHPSIWESRPQQPDKGSSSASSRLSVHAHRSLISPTED WDRIWTLAQAHADTIHHQAAAQPAGTEAVPSQDPHWDYQDGAFGCCHRDHMIVCLLAGLK KCAHKAPSKPLIIVMKKVKGKNRQSFECLPPPSGGPAGPRGRSSTREPPSKPPAPGVCFK CGNEGHRSRQCPNPEEKEDCQAQNLQKQGPRYVKEVRFAVLTLKQSLSSKASTTLSMSGT NLSCNVSALSSFVLVGNLPVVYEKSPFRPHETLATPKAAFFYSPFEIMYGRTFVLEPPPL TDSEPLRNYLPSLIQTRSFTREAAQTCLSAKLTLTKSYHRSGQAPTL >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_4|1224_bp atgggaacagacgacccagtggtgcaggaagctgctcttgtggccctggggagcctccag aaggatcctgccatgcggatctgtgggcctcagtctgtctgcatccaggtgcttattaaa acagtgtgttgctccacaccgcctcgtgttgtctgttggcgcgctctccgggttccaacc aatgcaagagccttggggctggccctgaaacctgcgaggggcttccgtccacgtccccag tggacctaccacccctccatctgggaaagcaggccacagcagccggacaaaggaagctcc tcagcctctagtcgcctctctgtgcatgcacatcggtcactgatctcgcctactgaagac tgggaccgtatctggaccctagctcaggcacatgctgatacaattcatcaccaagctgcc gcccagcctgctggcacagaggcagtccccagccaggacccccactgggattatcaagat ggggcctttggatgctgccatcgagaccacatgattgtgtgtctccttgcaggactcaaa aagtgtgcccataaagcgccctcaaagcctttaataattgtgatgaagaaagtgaaaggc aaaaacaggcagagtttcgaatgcttgcctccgccatcgggaggccctgcaggcccacgg ggccgcagctccacacgggagcctcctagcaagccacctgcacctggcgtctgtttcaag tgtggcaatgaaggccaccggtccaggcaatgcccaaacccagaagaaaaggaggactgc caagcccaaaaccttcaaaaacaaggaccacggtatgtcaaggaagtgcgcttcgctgtc ctcactctcaaacaatccctgtcctccaaagcctccacaactctttccatgtcgggtaca aacctgtcctgcaacgtctctgccctatcctcatttgtcctcgtcggaaatctgccagtc gtgtacgagaagtctccctttcgccctcatgagactctggcaacaccaaaggcagccttt ttttatagtccctttgagatcatgtatggccgaacttttgtcttagagcctccaccctta acagactctgagccactcaggaattacctcccctccttaatccagacacggtctttcact cgtgaagcagcacagacgtgtttatcggccaaactgaccctcacaaaaagctaccaccga agtggacaggcccctacactgtga >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_5|190_aa MEETKALGSLDGDSKRRLRVPDGQRLHLSQVCSDSVMRGKCQPFRPAYKLMEQPPLKISR LGLPLRISSDNGPAFVADSTEDGKGIGDHMETACRLLVSEFRKDIRKNVTGDVNTPAILG EVSSSPPLDIRNNITGEAIRNNITEGVYTPCDIGGNIILCPPEYYQQYQTEVVYTSCDIG SNIILSTPGY >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_5|573_bp atggaagagaccaaagccttggggagcctggatggagacagcaagagaaggctccgggtc cctgatggacaacggctgcacctgagccaggtgtgctcagacagcgtaatgcgaggcaag tgccagccgttccgcccggcatacaagcttatggagcagccccctttgaagatctccaga ttgggactgcccttacggatcagctcagataacgggcctgcgtttgtggctgacagtaca gaagatggcaaaggtatcggggatcacatggaaaccgcatgccgcctcctggtctcagag ttccggaaagatattaggaaaaatgtaactggggatgtgaacacccctgcgatattggga gaagtatcatcctctcccccgttggatattaggaacaatatcacaggggaggccattagg aacaatatcacagagggtgtttacacaccctgcgatattggaggtaatatcatcctctgc cccccggaatattaccaacaatatcaaacagaggttgtgtacacttcctgtgatattggg agtaatatcattctctccacccccggatattaa >gi568815577f:42793334_43009018|GENSCAN_predicted_peptide_6|159_aa MAAATPAPASAVASGWVGVLGASSLVTASVNPPVTLLVAKNHFIMWTNSVDQEYGEEVAV SPPGCLSPQLGRLSGWGAAQTLNVCDLHQQQQGHAGRGPARLLSPTLQCFSTPMLQMQVV TELKTEQDPNCSEPDAEGVSPPPVESQTPMDVDKQAIYR >gi568815577f:42793334_43009018|GENSCAN_predicted_CDS_6|480_bp atggcggccgcgaccccggccccggcctcggccgtggcgagcgggtgggtcggggtcctg ggggcgagtagtctggtcacagctagtgtgaaccccccggtcacactgttagtggcaaaa aaccattttattatgtggacaaattctgtggaccaggaatatggagaggaagtggccgtc tctcctccaggatgtctgagtcctcagctgggacgactcagcggctggggggcagcccag actctaaatgtctgtgacctgcaccagcagcagcaaggccatgccggccgtggtccagcc cggctgctctcccccactctgcagtgtttttccacacccatgctgcagatgcaagtagta acagagttaaagacagaacaagatccaaactgctctgaacccgatgcagaaggagtgagc cctccccctgtggagtctcagaccccgatggatgtggacaagcaggccatttataggtag