GENSCAN 1.0	Date run: 6-Nov-116	Time: 14:57:49

Sequence gi568815586f:9952688_10165629 : 212942 bp : 38.40% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1845   1840    6                               1.05
 1.03 Term -   7371   7258  114  0  0   58   34    95 0.146  -1.31
 1.02 Intr -  11067  10995   73  2  1   86   88    25 0.372   0.79
 1.01 Init -  13777  13569  209  1  2   61   98   140 0.786   8.70
 1.00 Prom -  17499  17460   40                              -4.55

 2.00 Prom +  22426  22465   40                              -4.85
 2.01 Sngl +  23400  23861  462  2  0   96   32   333 0.605  24.41
 2.02 PlyA +  24757  24762    6                               1.05

 3.00 Prom +  36204  36243   40                              -3.65
 3.01 Init +  36859  36910   52  0  1   51   73    72 0.399   3.37
 3.02 Term +  37584  37930  347  1  2   44   47   224 0.461   7.57
 3.03 PlyA +  37947  37952    6                               1.05

 4.06 PlyA -  38405  38400    6                               1.05
 4.05 Term -  40904  40737  168  2  0   38   54    73 0.258  -4.20
 4.04 Intr -  44313  44159  155  1  2   67   83   115 0.964   7.77
 4.03 Intr -  44592  44473  120  1  0   42   89    98 0.924   4.85
 4.02 Intr -  45693  45595   99  1  0   83   78    49 0.786   2.56
 4.01 Init -  46413  46350   64  0  1   87  111    71 0.930  10.66
 4.00 Prom -  46851  46812   40                              -5.75

 5.00 Prom +  55928  55967   40                              -4.35
 5.01 Init +  58073  58163   91  1  1   45   75   105 0.936   5.60
 5.02 Intr +  60098  60196   99  0  0  106   75    57 0.900   5.36
 5.03 Intr +  61836  62054  219  1  0   -8   89   206 0.671   8.45
 5.04 Intr +  70097  70201  105  0  0   77   81    74 0.427   4.87
 5.05 Term +  84740  85071  332  0  2   48   36   175 0.318   2.13
 5.06 PlyA +  85261  85266    6                              -0.45

 6.00 Prom +  85941  85980   40                              -4.05
 6.01 Init +  86466  86532   67  2  1   54   78   133 0.955   8.19
 6.02 Intr +  88779  88933  155  1  2   76   99   135 0.999  12.27
 6.03 Intr + 108729 108821   93  2  0   42   85    61 0.006   0.44
 6.04 Intr + 112045 112166  122  0  2   87   95    35 0.038   2.47
 6.05 Term + 115422 115455   34  0  1  115   48    26 0.010  -2.62
 6.06 PlyA + 115848 115853    6                               1.05

 7.07 PlyA - 116847 116842    6                               1.05
 7.06 Term - 118826 118646  181  1  1   85   37   140 0.342   4.70
 7.05 Intr - 120672 120606   67  1  1   86   67    53 0.717   0.04
 7.04 Intr - 128726 128550  177  0  0    2  107   136 0.193   5.87
 7.03 Intr - 136535 136437   99  0  0  117   75    72 0.466   7.96
 7.02 Intr - 146301 146121  181  0  1   41   80   213 0.509  14.32
 7.01 Init - 146654 146586   69  2  0   44   87    26 0.424  -0.80
 7.00 Prom - 147249 147210   40                              -9.75

 8.00 Prom + 152626 152665   40                              -6.15
 8.01 Sngl + 154077 154487  411  2  0   32   42   226 0.161   8.44
 8.02 PlyA + 154588 154593    6                               1.05

 9.00 Prom + 155834 155873   40                              -3.65
 9.01 Init + 156011 156090   80  1  2   83   47    53 0.790   1.28
 9.02 Term + 159004 159619  616  1  1   70   41   822 0.758  68.65
 9.03 PlyA + 160292 160297    6                               1.05

10.07 PlyA - 163027 163022    6                               1.05
10.06 Term - 163342 163193  150  1  0   72   48    81 0.055  -0.57
10.05 Intr - 172761 172610  152  1  2   83  111     5 0.479   1.26
10.04 Intr - 175158 175060   99  1  0   93   92    67 0.538   6.76
10.03 Intr - 175812 175636  177  1  0    8   28   157 0.469   0.67
10.02 Intr - 177464 177293  172  2  1  125   71   122 0.998  12.79
10.01 Init - 186292 186215   78  1  0   24   91    75 0.351   2.51
10.00 Prom - 188992 188953   40                              -6.45

11.00 Prom + 194686 194725   40                              -2.55
11.01 Sngl + 200715 201221  507  2  0   53   48   265 0.883  14.79
11.02 PlyA + 201886 201891    6                               1.05

12.03 PlyA - 203568 203563    6                               1.05
12.02 Term - 207334 207193  142  0  1   84   38   117 0.935   2.72
12.01 Init - 207814 207660  155  1  2   87   89    62 0.849   5.70

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_1|131_aa MSHLCKTPLEIRLSISPAATPIAPGTLPQGSLTDSFPDLLGLTTEDCHCPITSEAYRTIT DTLGNSHSGGFASHKVSSSSVDPLPPCVYLLIGQLVFSLMGFPGLQYDSIPWFQTGKRQV HTEMKSVGETS >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_1|396_bp atgtcccatctgtgcaagaccccactggaaatcagactgtccatctcaccggcagccact cccatagcccctggaactctgccccaaggctctctgactgactccttcccagatcttctc ggcttaacaactgaagactgtcactgcccaatcacctcagaagcctacaggaccatcaca gacactctaggtaactctcacagtggaggttttgcctcgcacaaggtctcttcttcctct gtggatcctctacctccatgtgtctacctgctaattggacagctagtcttcagcctgatg ggatttccaggtcttcagtatgattcgattccctggtttcaaacagggaagaggcaagtc catacagaaatgaagtctgttggagagactagttaa >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_2|153_aa MGCSWITSARTVQKGNVRSEAPHRVPTGVLSSGAVRSGPPSSSSQNGRSTDSMHCPLGKA ADTQYQPMKAAMRETVPCKATGAELPKDMGAYLLHQHDLDVRYRAKGDDFGALRFGALWT CMGPVAPLFWPISPIWNDRIYPMPVLSLYLGSN >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_2|462_bp atggggtgctcatggataacctctgctaggacagtgcagaagggaaatgtgaggtcagaa gctccacacagagtccccactggggtactgtccagtggagcagtgagaagtggaccaccg tcctccagctctcagaatggtagatccactgacagcatgcactgtccacttggaaaagct gcagacactcaatatcagcccatgaaagcagccatgagggagactgtaccttgcaaagcc acaggagcagagctgcccaaagacatgggagcctacctcttgcatcagcatgacctggat gtgagatatagagccaaaggagatgattttggagctttaagatttggtgccctttggact tgcatggggcctgtagcccctttgttttggccaatttctcccatttggaatgaccgtatt tatccaatgcctgtactctcattgtatctaggaagtaactaa >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_3|132_aa MGERGEEATEEMLEASREQSKLKTTWRQLTILDAIKKICDSWKEDKVSTLTDTWKKLNQT LMDNLQAFKTSVVEVTADVVEIARELELEVEPEDVAELMQFYQETLTNEELLFMDEQRKW FLRQILFLVNML >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_3|399_bp atgggtgagagaggtgaagaagctacagaagaaatgttggaagctagcagagagcaaagt aaattgaaaaccacctggagacaacttaccattctagatgccattaaaaagatttgtgat tcatggaaggaagacaaagtgtcaacattaacagacacttggaagaagttgaatcaaacc 
ctcatggataacttgcaggcgttcaagacttcagtggtggaagtaactgcagatgtggtg gaaatagcaagagaactagaactggaagtggagcctgaagatgtggctgaattgatgcaa ttttatcaggaaactttaacaaatgaggagttgctttttatggatgagcaaagaaaatgg ttcttgagacaaatattattcctggtgaacatgctgtga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_4|201_aa MQDEDGYITLNIKTRKPALISVGSASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRN YLQGENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN LTWEESKQYCTDMNATLLKIDNRNIVIMRFPIILEPSPSLLQGSYCTYCSLAPNFMLNEI DIGLFKYAYLVLPSRVELSQQ >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_4|606_bp atgcaggatgaagatggatacatcaccttaaatattaaaactcggaaaccagctctcatc tccgttggctctgcatcctcctcctggtggcgtgtgatggctttgattctgctgatcctg tgcgtggggatggttgtcgggctggtggctctggggatttggtctgtcatgcagcgcaat tacctacaaggtgagaatgaaaatcgcacaggaactctgcaacaattagcaaagcgcttc tgtcaatatgtggtaaaacaatcagaactaaagggcactttcaaaggtcataaatgcagc ccctgtgacacaaactggagatattatggagatagctgctatgggttcttcaggcacaac ttaacatgggaagagagtaagcagtactgcactgacatgaatgctactctcctgaagatt gacaaccggaacattgtgataatgcgttttcccattattttggagccatctccatcactt cttcagggctcctattgtacctactgttcccttgctcctaattttatgctaaatgaaatt gatattgggctttttaaatatgcatatcttgtactgccaagcagggtggaactttctcaa caatga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_5|281_aa MSEEVTYATLTFQDSAGARNNRDGNNLRKRGHPAPSPIWRHAALGLVTLCLMLLIGLVTL GMMFLQISNDINSDSEKLSQLQKTIQQQQDNLSQQLGNSNNLSMEEEFLKSQISSVLKRQ EQMAIKLCQELIIHTSVLEVGKCKLKAPAGLVLDEAILFFKKATVTLGTEEGGQRPFRQL RAKEGTDYEEATDYEEATIPSQGTLTRTRTHSHWDHVDTPIHLTFTFWDMGGNWRTQEKP MQTRGDRANSTQWPQLGSNFFSQHRYNKTILHKRMLLKNLL >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_5|846_bp atgtctgaagaagtgacctacgcgacactcacatttcaggattctgctggagcaaggaat aaccgagatggaaataacctaagaaaaagagggcatccagctccatctcccatttggcgt catgctgctctgggtctggtaactctttgcctgatgttgctgattgggctggtgacattg gggatgatgtttttgcagatatctaatgacattaactcagattcagagaaattgagtcaa cttcagaaaaccatccaacagcagcaggataacttatcccagcaactgggcaactccaac aacttgtccatggaggaggaatttctcaagtcacagatctccagtgtactgaagaggcag 
gaacaaatggccatcaaactgtgccaagagctaatcattcatacttcagttctagaagtt gggaagtgtaagttaaaggcaccagcaggtttggtgcttgatgaggccattctcttcttt aagaaggcaactgtaaccttgggaacggaggaaggtggccagagaccattcaggcagctc agggccaaggagggcactgactatgaagaggccaccgactatgaagaggccaccatccca tcgcagggcacactcacacgcacccgcactcactcacactgggaccatgtagacacgcca attcacctaacgttcacattttgggacatgggaggaaactggaggacccaggaaaaaccc atgcaaacacggggagaccgtgcaaactccacacagtggccccagctgggaagcaatttt ttttctcaacatcgttataacaaaacaatcttacataaaaggatgttactcaagaacctg ctttaa >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_6|156_aa MALILLILCMGMVVGLVALGIWCDYKRSPCDANWTYYGDSCYGFFKHNLTWKESEQYCTD MKATLLKTDNQNILFNLIYNIQYYVETLGIQKNATKGNVQSNAKEDFITGSLRKIKGSYD YWVGLSQDGHSGRWLWQDGSSPSPGLAHDLFPVDKS >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_6|471_bp atggctttgattctgctgatcctgtgcatggggatggttgtcgggctggtggctctgggg atttggtgtgactataaacgcagcccctgtgatgccaactggacatattatggagatagc tgctatgggttcttcaaacacaacttgacatggaaagagagtgagcagtactgcactgac atgaaggctactctcctgaagactgacaaccagaacattctgttcaatcttatttataac atccaatactacgtggaaacactagggatccaaaagaatgcaactaaaggaaatgtacag tcaaatgccaaagaggattttatcactggcagcttgaggaagattaaaggaagctatgat tactgggtggggttgtctcaggatggacacagcggacgctggctttggcaagatggctcc tctccttctcctggcctggctcatgacctgtttcctgttgataaatcctga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_7|257_aa MCVLPSYYYSADSEGTTVLHRAILTVARRPRAIRPHFTLTAVGIQMQAKYSSTRDMLDDD GDTTMSLHSQGSATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLLIGLAALGLLFFQY YQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRELYNKAGGLLRP DSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKDCKELKRCVCERRAGM VKPESLHVPPETLGEGD >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_7|774_bp atgtgtgtccttccaagttattattattctgcagactctgaaggaaccacagtgctacac cgtgctattctcacagtagcccggcggcccagggcaatccgaccacatttcactctcacc gctgtaggaatccagatgcaggccaagtacagcagcacgagggacatgctggatgatgat ggggacaccaccatgagcctgcattctcaaggctctgccacaactcggcatccagagccc cggcgcacagagcacagggctccctcttcaacgtggcgaccagtggccctgaccctgctg 
actttgtgcttggtgctgctgatagggctggcagccctggggcttttgttttttcagtac taccagctctccaatactggtcaagacaccatttctcaaatggaagaaagattaggaaat acgtcccaagagttgcaatctcttcaagtccagaatataaagcttgcaggaagtctgcag catgtggctgaaaaactctgtcgtgagctgtataacaaagctggagggcttttgcgccct gacagtggcaaggcctggctgtggatggatggaacccctttcacttctgaactgttccat attataatagatgtcaccagcccaagaagcagagactgtgtggccatccttaatgggatg atcttctcaaaggactgcaaagaattgaagcgttgtgtctgtgagagaagggcaggaatg gtgaagccagagagcctccatgtcccccctgaaacattaggcgaaggtgactga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_8|136_aa MMISIDAEKAFNKIQQPFMLKTLNKLDIDGTYLKIIRAFYDKPTANIILNGQQQEAFPLK TGTRQGCLLSPLLFNIVLEVLARTIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AENLLKLISNFRKVSG >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_8|411_bp atgatgatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcatgcta aaaactctcaataaactagatattgatgggacgtatctcaaaataataagagctttttat gacaaacccacagccaacatcatactgaatgggcaacaacaggaagcattccctttgaaa actggcacaagacaaggatgccttctctcaccactcctattcaacatagtgttggaagtt ctggccaggacaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctctttgcagatgacatgattgtatatttagaaaaccccatcgtctca gccgaaaatctcctcaagctgataagcaacttcagaaaagtctcaggataa >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_9|231_aa MDEAGNHHSEQTITRTENQTPHVLTRRRLAPTRTQAAPRASRRSDRSALERESKSKIAED EQINASKNEEDAAKMLVGGLSWDTSKKDLKDYFPKFGEVIDCTIKMDPNIGRARGFGFIL FKDATNVEKVLDQKEYRLDVRVIDPKKAMAMKKDPVKKIFAEGLNPEATEEKIREYFGEF GEIEAIEIPVDPKLNKRQGFVFITFKKEPVKKVLEKKFHTISGSKREITVA >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_9|696_bp atggatgaagctggaaaccatcattctgagcaaactatcacaaggacagaaaaccaaaca ccacatgttctcactcgtagacgcctggcgccaacgagaacccaggccgccccgagggcg agtcgtcggagcgaccgcagcgccctcgagcgggaatcaaaatcaaaaatcgccgaggac gaacagatcaacgccagcaagaacgaggaggacgcagcaaaaatgctcgttggtggcctg agctgggataccagcaaaaaagatttaaaagactatttccctaaatttggagaggtcatt gactgtacaataaaaatggatcccaacattggacgggcaagagggtttgggtttatcctg ttcaaagatgcaaccaatgtggagaaggtcctagaccagaaggagtacaggctggatgtc 
cgtgtcattgaccctaaaaaggccatggctatgaagaaggacccggtgaagaaaatcttc gccgagggtctgaatcctgaagccactgaggaaaagatcagggagtactttggcgagttt ggggagatcgaggccattgaaattccagtggatccaaagttgaacaaaagacaaggtttt gtgtttatcacctttaaaaaagaacctgtgaagaaagttctggagaaaaagttccatact atcagtggaagtaagcgtgagatcacggtggcctag >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_10|275_aa MSNKVTKVNAMEHRAAMGKIKRDDPQTVISGAERKELPNAISIQGLSRTMEYHPDLENLD EDGYTQLHFDSQSNTRIAVVSEKAISSKNQWDFRDPTNVKTCRCQCGTEIRSIQLSSQCA EITIPAIEEEGILTGGEAQQILGSCAASPPWRLIAVILGILCLVILVIAVVLGTMGVLSS PCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELLIWKYTLINRKHPP FSKRQKTSTLFQETENIHTFPRDRKGEIAASRTSV >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_10|828_bp atgtccaataaagtaacaaaagtgaatgcaatggaacacagagccgctatgggtaaaata aagagggatgacccacagacagtcatctcaggagcagaaagaaaagagctcccaaatgct atatctattcaggggctctcaagaacaatggaatatcatcctgatttagaaaatttggat gaagatggatatactcaattacacttcgactctcaaagcaataccaggatagctgttgtt tcagagaaagctatttcctctaagaatcagtgggatttcagggatcctactaatgtcaag acctgtagatgtcaatgtggaactgaaattaggagcatacagttatcaagccaatgtgcc gaaattacaattccagcaattgaagaagaaggtatccttactggtggagaagcacagcaa atcctaggatcgtgtgctgcatctcctccttggcgcctcattgctgtaattttgggaatc ctatgcttggtaatactggtgatagctgtggtcctgggtaccatgggggttctttccagc ccttgtcctcctaattggattatatatgagaagagctgttatctattcagcatgtcacta aattcctgggatggaagtaaaagacaatgctggcaactgggctctaatctcctaaagata gacagctcaaatgaattgcttatatggaaatatacgttaataaacagaaaacatccaccc ttttccaagagacagaaaacatccacacttttccaagagacagaaaacatccacactttt ccaagagacagaaaaggtgaaatagctgctagccgaacctcagtctga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_11|168_aa MAHKSNGQRCGFLRFSELATTSCGVAQRGELTPPAPLSSLRRVLSSHYRHSRISFARLSG LGSSLRRTPMAGRPSAPAPNLGISLPRFSETRDSLPAGAQAMNLGPALRRCTPQPRALAP ASTSAILHPRFRHGLRRGLGTAPKRLGKNSVRHVGLQSTQDGQWSLHR >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_11|507_bp atggcgcataagagtaacggccagcgctgcgggtttttacgtttcagcgagctcgcaaca acgtcctgtggtgtggcccagcgcggggagctgaccccaccagctccgctctcaagcctt 
cggagagtcctctccagtcactatcgtcattcccgcatttcctttgctaggctgtccggg ctgggcagctccctcaggcggacacccatggctggcagaccttccgcgcctgcccctaat ctgggcatatcactccctcggttttctgagactcgggactccttgcccgctggagcgcag gccatgaatctgggtcccgcactgcggcgctgcacgccgcagcccagggctttggcccca gccagcacatccgccatcctgcaccccaggttccggcacgggctgcgacggggcctcgga actgctcccaagcggctgggaaagaactcagtcaggcatgtcggcctgcaaagcacccag gatgggcagtggagtctgcaccgttga >gi568815586f:9952688_10165629|GENSCAN_predicted_peptide_12|98_aa MWIPQTQKHYSPQDFIQQAISYSSFPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVS QTYPSGTCAYIQRGAVYAENCILAAFSICQKKANLRAQ >gi568815586f:9952688_10165629|GENSCAN_predicted_CDS_12|297_bp atgtggattccacaaactcagaaacattactccccacaggacttcatccagcaagcaatt tcctattccagttttccattctggatggggctgtctcggaggaaccccagctacccatgg ctctgggaggacggttctcctttgatgccccacttatttagagtccgaggcgctgtctcc cagacatacccttcaggtacctgtgcatatatacaacgaggagctgtttatgcggaaaac tgcattttagctgccttcagtatatgtcagaagaaggcaaacctaagagcacagtga