GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:09:26

Sequence gi568815581f:6345111_6550419 : 205309 bp : 49.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7198   7265   68  0  2   92   99    -4 0.128   1.86
 1.02 Intr +  10821  10937  117  0  0   40  113    80 0.138   5.28
 1.03 Term +  26964  27168  205  0  1  -18   48   237 0.010   5.94
 1.04 PlyA +  28418  28423    6                               1.05

 2.05 PlyA -  29044  29039    6                               1.05
 2.04 Term -  38199  38048  152  1  2   72   49   140 0.921   6.57
 2.03 Intr -  38390  38277  114  0  0  -90   36   280 0.536   5.42
 2.02 Intr -  48164  48032  133  2  1   31   79   111 0.272   4.72
 2.01 Init -  52842  52837    6  0  0   96   93     0 0.471   2.15
 2.00 Prom -  53366  53327   40                              -1.66

 3.00 Prom +  54562  54601   40                              -2.16
 3.01 Init +  60539  60597   59  1  2   94   85    11 0.419   2.04
 3.02 Intr +  63866  63973  108  2  0   44   94    93 0.370   4.90
 3.03 Intr +  67354  67490  137  1  2   97   32    55 0.218   1.01
 3.04 Intr +  68857  68910   54  2  0   93   85    18 0.138   0.95
 3.05 Term +  70813  70922  110  2  2   95   49    73 0.099   2.77
 3.06 PlyA +  73145  73150    6                               1.05

 4.00 Prom +  73915  73954   40                              -2.46
 4.01 Init +  76997  77210  214  1  1   62   63   134 0.748   7.21
 4.02 Intr +  77487  77654  168  1  0  115   32    35 0.212   0.52
 4.03 Term +  78542  78855  314  0  2   49   42   139 0.144   0.56
 4.04 PlyA +  79282  79287    6                              -1.75

 5.07 PlyA -  80169  80164    6                               1.05
 5.06 Term -  80720  80350  371  0  2   85   52   399 0.981  30.81
 5.05 Intr -  81646  81505  142  1  1   90   85   266 0.997  26.43
 5.04 Intr -  81959  81771  189  2  0   70   77   326 0.917  29.48
 5.03 Intr -  83396  83208  189  2  0   88   75   349 0.966  33.38
 5.02 Intr -  88988  88809  180  2  0    8   97   297 0.845  22.66
 5.01 Init -  89994  89899   96  0  0  104   72   129 0.809  13.22
 5.00 Prom -  95203  95164   40                              -5.26

 6.00 Prom +  96826  96865   40                              -7.86
 6.01 Init +  99163  99277  115  0  1   80   40    42 0.275  -0.79
 6.02 Intr +  99966 100294  329  1  2  130   90   130 0.554  12.82
 6.03 Term + 102353 102661  309  1  0   98   39   130 0.756   4.16
 6.04 PlyA + 105934 105939    6                               1.05

 7.19 PlyA - 106175 106170    6                               1.05
 7.18 Term - 110533 110228  306  1  0  130   55   619 0.988  57.92
 7.17 Intr - 112612 112484  129  1  0  110   94   209 0.999  24.59
 7.16 Intr - 116446 116263  184  0  1   52  105   318 0.990  29.69
 7.15 Intr - 118786 118622  165  0  0   69  117   109 0.686  10.98
 7.14 Intr - 119208 119060  149  0  2  101  110   154 0.999  17.93
 7.13 Intr - 119661 119545  117  0  0   98   77   188 0.994  19.36
 7.12 Intr - 123231 123115  117  0  0   80   69   163 0.926  14.26
 7.11 Intr - 125298 125150  149  1  2   71   84   169 0.994  14.75
 7.10 Intr - 126245 126051  195  0  0  101   73   123 0.993  11.49
 7.09 Intr - 127717 127547  171  2  0   82   86   147 0.956  13.81
 7.08 Intr - 129494 129322  173  1  2  140   66   205 0.879  23.09
 7.07 Intr - 132103 131919  185  1  2   77   63   221 0.999  17.09
 7.06 Intr - 132987 132865  123  0  0   83   92   193 0.965  19.98
 7.05 Intr - 133765 133437  329  2  2   95   91   327 0.843  29.12
 7.04 Intr - 135512 135401  112  2  1  130   89    -9 0.381   3.35
 7.03 Intr - 138642 138505  138  0  0   80   37   194 0.311  14.16
 7.02 Intr - 139182 139106   77  1  2  121   80    50 0.991   6.73
 7.01 Init - 144201 144132   70  0  1   81   73    51 0.641   4.11
 7.00 Prom - 150102 150063   40                              -3.06

 8.00 Prom + 151591 151630   40                              -6.06
 8.01 Init + 162907 163072  166  0  1   56   86   204 0.570  14.79
 8.02 Term + 173805 173887   83  1  2   58   42   131 0.935   3.36
 8.03 PlyA + 174904 174909    6                               1.05

 9.05 PlyA - 175516 175511    6                               1.05
 9.04 Term - 180113 179917  197  0  2   70   42    56 0.307  -3.23
 9.03 Intr - 180353 180246  108  0  0  103  100   178 0.906  20.76
 9.02 Intr - 192972 192877   96  1  0   93  105   128 0.820  14.98
 9.01 Init - 204129 203997  133  0  1   81   93    42 0.338   2.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_1|129_aa
MTKFCCLQIPHLPQFMHLEITPWEKFSKHKAMGKIVKKKTYRSDENIFKLLYLKRIISKT
ESKTLSQKERKKKKEEEEEEEEKEEEEEEEEEAENGEEEEEKKQQEGGGRMEEGEGRRKK
EEEGRRILG

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_1|390_bp
atgaccaagttttgctgtctccagattcctcacctgccccagttcatgcatcttgaaatc
actccctgggaaaaattttccaagcacaaagcaatgggtaaaatagtaaagaagaagacc
tacagatctgatgagaacattttcaaacttctgtacctcaagagaatcataagcaaaact
gaaagcaagaccctatctcaaaaagaaaggaaaaagaagaaagaagaagaagaagaagaa
gaagagaaagaggaagaggaagaagaagaggaggaggcggagaatggggaggaggaggag
gagaagaagcagcaggagggaggcgggaggatggaagagggagaagggagaaggaagaag
gaagaagaaggaagaagaatactaggttga

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_2|134_aa
MTVALQRTYSKHYLLGQILGELEVQLSALVGNWLPSFPPCVTAEAQDDDNDVDDDDNDVD
DDDNDVDDDDNDDGYYLHGAYGVPEGYNGLISVFQCLRQYSNLGLSNFILDAVNPMLLPE
NAFMQTNDVVVFAP

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_2|405_bp
atgactgtggctctgcagaggacgtactcgaagcactacctgctggggcagattctgggt
gagctagaggtgcagctctctgcgctcgtggggaactggctacccagcttccctccctgc
gtgacggcagaggcccaagatgatgataatgatgttgatgatgatgataatgatgttgat
gatgatgataatgatgttgatgatgatgataatgatgatggctattacttacatggtgcc
tacggtgtgccagagggatacaatggcttgatcagcgtctttcagtgcctcaggcaatac
tcaaacctgggtctttctaacttcatattagatgctgtcaaccccatgctgctacctgag
aatgcctttatgcagacaaatgacgtggttgtatttgcaccttag

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_3|155_aa
MDFPFRGGASTSRFSAPLHWSCHGYKKVVKIHSHLQPGSGDSASVQRAPEHGALRRGFQW
EEPTGSQREPSAYHPQDHLPEQGAGRRVESGLKRPTGDNQHNSHRWKKEFPNSKKGQSPG
DSQGKQCQLLTRQCLHPLPVTQNQFSIPFFAWQPP

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_3|468_bp
atggacttccccttccgtggtggagcatccacgagccggttctctgctccccttcattgg
agctgccacggctacaagaaggttgtgaaaattcactctcatctacagccaggaagtggg
gattctgccagtgtacagagggcacctgaacacggggctctcagaaggggcttccaatgg
gaagaacccactggaagccagagggagccatcggcgtatcacccacaggatcatcttcca
gagcagggtgcaggaaggagggtggagagtggattgaagagaccaacgggagacaaccag
cacaactcccacaggtggaagaaggagtttcccaatagtaagaagggccagtccccaggg
gattcacagggaaaacagtgccagctcctcacccggcagtgcctccaccccttgccagtg
acacagaatcagttctccatccccttcttcgcctggcagcccccctaa

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_4|231_aa
MVRADGEPPHPEPKAAISYPGDPSAWSLLPPAGNLKTKPSGWISGKEALGREALQTIATA
RPDLTSHALPLDIQRHQKGSCPHGMSRTGPSPSIGCIRASVQPGSPQATFSPQMRGAPSL
LVVPKRKGVTKLEPQRKQCSGSIALGPVGKHTYTATIMQMLFIDFQIVESTYGQSTEDLV
LLTEQKVDVNNKEDGKVKYIRTLGIQLALGEPLLGGDKLWGRREAQEALGS

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_4|696_bp
atggtcagggctgatggagaaccgccacatccggagcccaaagcggccatctcctaccct
ggggacccgtctgcttggtctctgctgccacctgctggcaacttgaaaacaaagccttcc
ggctggatctccgggaaagaagcccttggaagagaggccctgcaaactattgcaactgcc
cgcccagacctgaccagccacgcccttcctttagacatccagaggcatcagaaggggagc
tgtccacacgggatgagccggacaggaccaagcccgagcataggctgcattcgggcctct
gtccagccagggagcccacaggccacgttctcaccacaaatgaggggggcaccctcgctg
ctcgttgtccccaaaagaaagggggtcacgaaattggaaccacagagaaagcaatgcagt
ggaagcattgcgctgggccctgtggggaaacacacctacacagccacaatcatgcaaatg
ctgtttattgattttcagattgtagaatcaacctatggacaaagcacagaagatttagtt
ttgcttacagagcagaaggtagatgttaacaacaaggaagatggaaaagtaaagtacatc
agaactttagggatacagttagcgctgggggaacccctgctgggtggagacaagctctgg
ggcagaagagaagcccaggaagcccttggcagttga

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_5|388_aa
MDAALLLNVEGVKKTILHGGTGELPNFITGSRVIFHFRTMKCDEERTVIDDSRQVGQPMH
IIIGNMFKLEVWEILLTSMRVHEVAEFWCDTIHTGVYPILSRSLRQMAQGKDPTEWHVHT
CGLANMFAYHTLGYEDLDELQKEPQPLVFVIELLQLWGQVDAPSDYQRETWNLSNHEKMK
AVPVLHGEGNRLFKLGRYEEASSKYQEAIICLRNLQTKEKPWEVQWLKLEKMINTLILNY
CQCLLKKEEYYEVLEHTSDILRHHPGIVKAYYVRARAHAEVWNEAEAKADLQKVLELEPS
MQKAVRRELRLLENRMAEKQEEERLRCRNMLSQGATQPPAEPPTEPPAQSSTEPPAEPPT
APSAELSAGPPAEPATEPPPSPGHSLQH

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_5|1167_bp
atggatgccgctctgctcctgaacgtggaaggggtcaagaaaaccattctgcacgggggc
acgggcgagctcccaaacttcatcaccggatcccgagtgatctttcatttccgcaccatg
aaatgtgatgaggagcggacagtcattgacgacagtcggcaggtgggccagcccatgcac
atcatcatcggaaacatgttcaagctcgaggtctgggagatcctgcttacctccatgcgg
gtgcacgaggtggccgagttctggtgcgacaccatccacacgggggtctaccccatccta
tcccggagcctgaggcagatggcccagggcaaggaccccacagagtggcacgtgcacacg
tgcgggctggccaacatgttcgcctaccacacgctgggctacgaggacctggacgagctg
cagaaggagcctcagcctctggtctttgtgatcgagctgctgcagctctggggccaggtt
gatgccccgagtgattaccagagggagacctggaacctgagcaatcatgagaagatgaag
gcggtgcccgtcctccacggagagggaaatcggctcttcaagctgggccgctacgaggag
gcctcttccaagtaccaggaggccatcatctgcctaaggaacctgcagaccaaggagaag
ccatgggaggtgcagtggctgaagctggagaagatgatcaatactctgatcctcaactac
tgccagtgcctgctgaagaaggaggagtactatgaggtgctggagcacaccagtgatatt
ctccggcaccacccaggcatcgtgaaggcctactacgtgcgtgcccgggctcacgcagag
gtgtggaatgaggccgaggccaaggcggacctccagaaagtgctggagctggagccgtcc
atgcagaaggcggtgcgcagggagctgaggctgctggagaaccgcatggcggagaagcag
gaggaggagcggctgcgctgccggaacatgctgagccagggtgccacgcagcctcccgca
gagccacccacagagccacccgcacagtcatccacagagccacctgcagagccacccaca
gcaccatctgcagagctgtccgcagggccccctgcagagccagccacagagccacccccg
tccccagggcactcgctgcagcactga

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_6|250_aa
MNGHGRANPSAAFAAAREGANRIPGAVAVAGAGTEPISWSSGQRRLLARQMASRWQNMGT
SVRRRSLQHQEQLEDSKELQPVVSHQETSVGALGSLCRQFQRRLPLRAVNLNLRAGPSWK
RLETPEPGQQGLQAAARSAKSALGAVSQRIQESCQSGTKWLVETQVKARRRKRGAQKGSG
SPTHSLSQKSTRLSGAAPAHSAADPWEKEHHRLSVRMGSHAHPLRRSRREAAFRSPYSST
EPLCSPRQVG

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_6|753_bp
atgaatgggcacggccgggccaatccaagcgcagcatttgcagccgcgagagagggagcc
aatcggatccccggggcggtggcggtggcgggggcggggacggaaccaatcagctggtct
agtggacagagaagactcttggccaggcagatggcttctcggtggcagaacatggggacc
tccgtgcgccggagatctctccagcaccaggagcagctggaggacagcaaggagctgcag
cctgtggtcagccatcaggagacctctgtaggggccctggggtccctgtgcagacagttc
caaaggaggctgcccctgagagccgtcaacctcaacctccgcgcagggccctcctggaaa
cgcctggaaaccccagagccaggtcagcagggcctccaggctgcagctcgctcagctaag
agtgctttgggtgccgtgtcccagagaatccaggagtcctgccaaagtggcaccaagtgg
ctggtggagacccaggtgaaggccaggaggcggaagagaggagcacagaagggcagtgga
tccccaactcacagcctgagccagaagagcacccggctgtctggagccgcccctgcccac
tcagccgcagacccctgggagaaggagcatcaccgcctctctgtccggatgggctcacat
gcccacccattacggcgatcaaggcgggaggctgccttccggagcccctactcctcaaca
gagcccctctgctctcccaggcaagtgggatag

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_7|962_aa
MEEETECLALASGPSASTPVDPRGELYRVSLRRQRFPAQGSIEIHEDSEEGCPQRSCKTH
VLLLVLHGGNILDTGAGDPSCKAADIHTFSSVLEKDLARDLGLLQSPALRQSPHTPGLRS
WEGWEEVPSRVPGSCLGLPEGSLEPNSWPVCALFCLILSPKPANLPDPGWAHLGPPVCSL
NPYSHDEGCLSSSQDHVPLAALPLLAISSPQYQDAVATVIERANQVYREFLKSSDGIGFS
GQVCLIGDCVGGLLAFDAICYSAGPSGDSPASSSRKGSISSTQDTPVAVEEDCSLASSKR
LSKSNIDISSGLEDEEPKRPLPRKQSDSSTYDCEAITQHHAFLSSIHSSVLKDESETPAA
GGPQLPEVSLGRFDFDVSDFFLFGSPLGLVLAMRRTVLPGLDGFQVRPACSQVYSFFHCA
DPSASRLEPLLEPKFHLVPPVSVPRYQRFPLGDGQSLLLADALHTHSPLFLEGSSRDSPP
LLDAPASPPQASRFQRPGRRMSEGSSHSESSESSDSMAPVGASRITAKWWGSKRIDYALY
CPDVLTAFPTVALPHLFHASYWESTDVVAFILRQVMRYESVNIKESARLDPAALSPANPR
EKWLRKRTQVKLRNVTANHRANDVIAAEDGPQVLVGRFMYGPLDMVALTGEKVDILVMAE
PSSGRWVHLDTEITNSSGRITYNVPRPRRLGVGVYPVKMVVRQVSFRGDQTCAMSYLTVL
PRGMECVVFSIDGSFAASVSIMGSDPKVRPGAVDVVRHWQDLGYMILYITGRPDMQKQRV
VSWLSQHNFPQGMIFFSDGLVHDPLRQKAIFLRNLMQECFIKISAAYGSTKDISVYSVLG
LPASQIFIVGRPTKKYQTQCQFLSEGYAAHLAALEASHRSRPKKNNSRMILRKGSFGLHA
QPEFLRKRNHLRRTMSVQQPDPPAANPKPERAQSQPESDKDHERPLPALSWARGPPKFES
VP

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_7|2889_bp
atggaggaggagactgagtgtttggctttagccagtgggccctcagcatccacccctgtg
gaccccagaggagaactgtaccgggtttccttgagaagacagaggttcccagcccaggga
agcatcgagatccacgaagacagcgaggaaggctgcccgcagcgctcctgcaagacacat
gtcctcctgctggtcctgcatgggggaaacatcctggacacgggtgccggggacccgtcc
tgcaaggcagccgacatccacaccttcagctccgtgctggagaaggacctcgccagagac
ttggggctgctgcagagcccagccctcaggcagagccctcataccccggggctccggagc
tgggaagggtgggaagaggtccccagcagggtgccaggctcctgcctgggtcttcctgaa
gggtccctggagcccaattcctggccagtatgtgctctcttctgcctgatcctcagcccc
aaacctgccaacctgcccgatcctggctgggctcaccttgggccccctgtctgcagcctg
aacccctacagccacgatgagggctgcctcagcagcagccaggaccacgtccctctggcc
gcccttcccctgttggccatctcctccccgcagtaccaggatgctgtcgccaccgtcatc
gagcgagccaaccaggtctacagagagttcctgaagtcctctgatgggattggcttcagt
gggcaggtgtgtctcatcggggactgtgtggggggcctcctggccttcgatgccatctgc
tacagtgcggggccctcaggggacagccctgccagcagcagccggaaggggagcatcagc
agcacccaggacaccccagtcgcggtggaggaagattgcagcctggccagcagcaagcgt
ctcagcaaaagcaacattgacatctccagtgggttggaggatgaggagcccaagaggccg
ttgccgcggaaacagagcgactcctccacctatgactgcgaggccatcacccagcaccat
gccttcctctcaagcatccactccagcgtgctaaaggatgagtctgagaccccggcggct
ggggggccgcagctccctgaggtcagcctgggccgctttgacttcgatgtgtccgacttc
ttcctcttcggctcgccactgggcctggtcctggccatgcggaggacggtgctgcctggg
ctggacggcttccaggtgcgtcctgcctgcagccaggtctacagcttcttccattgcgca
gacccctctgcctcacggctcgagccactgctggagcccaagttccacctggtgccgcct
gtcagcgtgcctcgctaccagaggttcccactgggcgatgggcagtccctcctcctcgct
gatgccctacacacccacagccccctcttcctggagggcagctcccgggacagcccgcca
cttctggatgcccctgcctcgccccctcaggcctcgaggttccagcgcccaggacggagg
atgagcgaggggagctcccacagcgagagctcggagtcctcggacagcatggcacccgtg
ggtgcctcccgcatcacagccaagtggtggggaagcaagaggatcgactatgccctgtac
tgccctgatgtcctcacggccttccccaccgtggccctgccccacctcttccacgccagt
tactgggagtccacagacgtggtggccttcatcctgagacaggtaatgcgctatgagagc
gtgaacatcaaggaaagcgcccgcctggaccctgcagcactgagtcctgccaacccccgg
gagaagtggcttcgtaagcggactcaggtcaagctgaggaatgtcacggctaatcaccgg
gccaatgatgtgattgctgctgaagatggcccccaggtcctggtggggcggttcatgtac
gggcccctcgacatggtggctctgactggagagaaggtggacatcctagtaatggcagag
ccatcctcaggccgctgggtacacctggacacagagatcaccaacagcagtggtcgcatc
acatacaatgtgccgcggccccggcgcctgggggttggtgtctatcctgtgaagatggtc
gtcaggcaggtttctttcaggggcgaccagacctgtgccatgagctacctcacggtgttg
cccaggggcatggagtgtgtagtgttcagcattgatgggtccttcgcggccagcgtgtct
atcatgggaagcgaccccaaggtccggccgggtgcagtggatgttgtccggcactggcag
gacttgggctacatgatcctttacatcacgggacggccggacatgcagaagcagcgggtg
gtgtcgtggctgtcccagcacaacttcccacagggcatgatcttcttctccgatgggctg
gtgcatgacccgctgcggcagaaggccatcttcctgcgcaacctcatgcaggagtgcttc
atcaaaatcagtgcggcctatggctccacgaaggacatctctgtctacagcgtgctgggc
ctgcctgcctcccagatcttcattgtgggccggcccaccaagaagtaccaaacccagtgc
cagttcctgagcgagggctacgccgcacacctggccgcgctggaggccagccaccgctca
cgcccaaagaagaacaactcgcgcatgatcctgcgcaagggcagcttcgggctgcacgcg
cagccagagttcctgcggaagcgcaaccacctgcgcagaaccatgtcagtgcagcagccc
gacccgcccgccgccaaccccaagcccgagcgggcccagagccagcccgagtcggacaaa
gaccacgagcggccgctgccggcgctcagctgggcgcgtgggccccccaagttcgagtcg
gtgccctga

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_8|82_aa
MQLGLLGMLGLLGLLGMQLGLLGMQSSEAGRSSGTQVPRGQEVSVLETPVRKLHVALQSG
SNQKARYRLSLCIDLEIPCNAV

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_8|249_bp
atgcagctggggttgctgggaatgctggggttgctggggttgctggggatgcagctgggg
ttgctgggaatgcagagttcagaggccggaagaagctcggggacccaggtgccaagaggc
caggaggtgagtgtcctggaaacgccagtgcgcaagcttcacgtggcactacagtcagga
tccaaccagaaagcccgctaccgcttgtcactctgcattgacctggagatcccatgcaat
gcagtttaa

>gi568815581f:6345111_6550419|GENSCAN_predicted_peptide_9|177_aa
MGCSPCGQGWQGAAGGCICLWKAGVSGFVKLLSLGVLEKLFSRAGGPPPGGGAPWHLRNV
LSDSVESSDDEFFDAREEMAEGKNAILIGMSQWNSNDLVEQIETMGKLDEHQASLNFATY
RNSHGDFCGVFCGIKAPAYFNRASFVHRMTTPVARVVNGRCLGPSSLDCQVFGIGIC

>gi568815581f:6345111_6550419|GENSCAN_predicted_CDS_9|534_bp
atggggtgtagtccgtgtggccagggctggcaaggggctgcggggggttgcatctgcctc
tggaaagcaggagtctcaggttttgtgaagctgctctctctaggtgtcctggagaagctc
ttctccagggctggtggtcctcccccgggcggcggtgccccctggcaccttcgaaatgtc
ctcagtgactctgtggagagctcagatgatgaattctttgatgccagagaggagatggct
gaagggaagaatgccatcctcattgggatgagccagtggaactccaatgacctcgtggag
cagatcgagaccatggggaaactggacgagcatcaagcatccctgaattttgccacttac
aggaactctcatggggacttctgtggggttttctgtggcataaaagctccagcctacttc
aacagagcttcctttgtccacagaatgactaccccagtggcacgggtggttaatggcagg
tgcctgggtccaagctccttggactgccaggtctttggtattggaatctgctga