GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:47:51 Sequence gi568815586r:9893146_10099100 : 205955 bp : 38.16% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 284 396 113 1 2 93 68 37 0.402 0.46 1.02 Intr + 2544 2680 137 0 2 106 54 131 0.318 10.79 1.03 Term + 6473 6531 59 0 2 40 49 75 0.056 -4.33 1.04 PlyA + 7598 7603 6 1.05 2.04 PlyA - 7641 7636 6 1.05 2.03 Term - 11954 11695 260 0 2 115 33 166 0.123 8.53 2.02 Intr - 13613 13242 372 0 0 65 -5 186 0.070 1.11 2.01 Init - 17637 17568 70 0 1 96 93 -14 0.391 1.34 2.00 Prom - 25084 25045 40 -1.85 3.04 PlyA - 25405 25400 6 1.05 3.03 Term - 28717 28658 60 1 0 102 39 99 0.992 3.23 3.02 Intr - 29087 28921 167 0 2 111 95 115 0.998 13.26 3.01 Init - 39184 39130 55 1 1 56 97 85 0.898 7.70 3.00 Prom - 45726 45687 40 -5.35 4.03 PlyA - 47258 47253 6 1.05 4.02 Term - 47679 47576 104 1 2 109 49 44 0.758 0.06 4.01 Init - 50051 49985 67 2 1 71 94 63 0.462 6.49 4.00 Prom - 64523 64484 40 -3.35 5.04 PlyA - 64742 64737 6 1.05 5.03 Term - 66913 66800 114 1 0 58 34 95 0.150 -1.31 5.02 Intr - 70609 70537 73 0 1 86 88 25 0.386 0.79 5.01 Init - 73319 73111 209 2 2 61 98 140 0.795 8.70 5.00 Prom - 77041 77002 40 -4.55 6.00 Prom + 81968 82007 40 -4.85 6.01 Sngl + 82942 83403 462 0 0 96 32 333 0.605 24.41 6.02 PlyA + 84299 84304 6 1.05 7.00 Prom + 95746 95785 40 -3.65 7.01 Init + 96401 96452 52 1 1 51 73 72 0.399 3.37 7.02 Term + 97126 97472 347 2 2 44 47 224 0.461 7.57 7.03 PlyA + 97489 97494 6 1.05 8.06 PlyA - 97947 97942 6 1.05 8.05 Term - 100446 100279 168 0 0 38 54 73 0.258 -4.20 8.04 Intr - 103855 103701 155 2 2 67 83 115 0.964 7.77 8.03 Intr - 104134 104015 120 2 0 42 89 98 0.924 4.85 8.02 Intr - 105235 105137 99 2 0 83 78 49 0.786 2.56 8.01 Init - 105955 105892 64 1 1 87 111 71 0.930 10.66 8.00 Prom - 106393 106354 40 -5.75 9.00 Prom + 115470 115509 40 -4.35 9.01 Init + 117615 117705 91 2 1 45 75 105 0.936 5.60 9.02 Intr + 119640 119738 99 1 0 106 75 57 0.900 5.36 9.03 Intr + 121378 121596 219 2 0 -8 89 206 0.671 8.45 9.04 Intr + 129639 129743 105 1 0 77 81 74 0.427 4.87 9.05 Term + 144282 144613 332 1 2 48 36 175 0.318 2.13 9.06 PlyA + 144803 144808 6 -0.45 10.00 Prom + 145483 145522 40 -4.05 10.01 Init + 146008 146074 67 0 1 54 78 133 0.955 8.19 10.02 Intr + 148321 148475 155 2 2 76 99 135 0.999 12.27 10.03 Intr + 168271 168363 93 0 0 42 85 61 0.006 0.44 10.04 Intr + 171587 171708 122 1 2 87 95 35 0.038 2.47 10.05 Term + 174964 174997 34 1 1 115 48 26 0.010 -2.62 10.06 PlyA + 175390 175395 6 1.05 11.06 PlyA - 176389 176384 6 1.05 11.05 Term - 178368 178188 181 2 1 85 37 140 0.343 4.70 11.04 Intr - 180214 180148 67 2 1 86 67 53 0.719 0.04 11.03 Intr - 188268 188092 177 1 0 2 107 136 0.203 5.87 11.02 Intr - 196077 195979 99 1 0 117 75 72 0.493 7.96 11.01 Intr - 205843 205663 181 1 1 41 80 213 0.580 14.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_1|102_aa EFIQNSLKPGHFGWIGLYVTFQGNLWMWIDEHFLVPELFSVIGPTDDRSCAVITGNWVYS EDCSSTFKGICQRDAILTHNGTSGPAKSAKAAFTPEAGEARE >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_1|309_bp gagttcatacagaacagtttaaaacctggacattttggttggattggactatatgttaca ttccaagggaacctatggatgtggatagatgaacactttttagttccagaattgttttca gtgattggaccaactgatgacaggagctgtgccgttatcacaggaaactgggtgtattct gaagactgtagctccacatttaagggcatttgccagagagatgcgatcttgacgcacaat ggaaccagtggtcctgccaagagtgccaaagcggctttcacccctgaagcaggggaggct agagaatag >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_2|233_aa MTNKPTLSATWVERILTYLPRQPDGSSQVIEGKRHNGYSIIDGEALVEIGSGRLANNWSA QTCELLALSQALKCLQNQEGTIYTDSKYSFEVAHTFGKIWTERGLINRKGQDLIHKELIT QVLNTLQLPKEIAIVHVPGQQKGFSFETESHLVINVTQTNHPLTLEFDACSVISCGDEQA QRQLSNVDKYLCPYRIESTKYKYGALKSPCGDWTDVWQTTQHGRWTARPPFSK >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_2|702_bp atgacgaataaacccacactctcagcaacgtgggtggaaaggattcttacctacctacca cggcaaccagatggctcctcccaggtaattgagggaaaaagacataatgggtattcaata atagatggagaagctttagtagaaatcgggtcaggaagattggctaataattggtctgcc caaacttgtgaactgcttgcattaagccaggctttaaaatgcttgcaaaaccaggaaggg accatttatactgattctaagtactcctttgaggtagctcataccttcgggaagatttgg actgaacgaggtcttattaaccgcaaaggccaagatctcatccacaaagagttaatcacc caagtactaaatactctccagttacccaaagagatagctattgtccatgttccaggacag caaaaaggcttttcctttgaaacagaaagtcacctcgtcatcaatgtgactcaaactaat catcccttaaccctcgagtttgatgcttgttcagttatctcatgtggagatgaacaagct caaaggcagctatcaaatgtagataagtatttatgtccataccgtattgagtcaaccaag tataagtatggagccttaaaaagtccctgtggtgactggacagatgtttggcagaccacc caacatggacggtggacagccaggccccctttttcaaagtag >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_3|93_aa MINPELRDGRADGFIHRIATWSKHAKPVACSGDWLGVRDKCFYFSDDTRNWTASKIFCSL QKAELAQIDTQEDMPTQYENNEDEDFYDVFTST >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_3|282_bp atgattaatccagagctgcgggatggcagagctgatggcttcatacatcggatagccaca tggtccaagcatgctaaacctgtggcatgttcaggggactggcttggagtgagagataag tgtttctatttttctgatgataccagaaattggacagccagtaaaatattttgtagtttg cagaaagcagaacttgctcagattgatacacaagaagacatgcctactcaatatgaaaac aacgaggatgaagacttttacgatgtcttcacttccacttaa >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_4|56_aa MEGEGMESKDGTEVVEQQTNTPGLNYIEPYCQLILPIEDFPTHRFEDEPVIYFSCG >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_4|171_bp atggagggagaagggatggagagtaaggatgggacggaggttgtggagcagcagaccaac acgccaggtctcaactatatagaaccatattgccagttgatccttcccattgaggatttt cccactcacagatttgaagatgaaccagtcatttattttagttgtggatag >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_5|131_aa MSHLCKTPLEIRLSISPAATPIAPGTLPQGSLTDSFPDLLGLTTEDCHCPITSEAYRTIT DTLGNSHSGGFASHKVSSSSVDPLPPCVYLLIGQLVFSLMGFPGLQYDSIPWFQTGKRQV HTEMKSVGETS >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_5|396_bp atgtcccatctgtgcaagaccccactggaaatcagactgtccatctcaccggcagccact cccatagcccctggaactctgccccaaggctctctgactgactccttcccagatcttctc ggcttaacaactgaagactgtcactgcccaatcacctcagaagcctacaggaccatcaca gacactctaggtaactctcacagtggaggttttgcctcgcacaaggtctcttcttcctct gtggatcctctacctccatgtgtctacctgctaattggacagctagtcttcagcctgatg ggatttccaggtcttcagtatgattcgattccctggtttcaaacagggaagaggcaagtc catacagaaatgaagtctgttggagagactagttaa >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_6|153_aa MGCSWITSARTVQKGNVRSEAPHRVPTGVLSSGAVRSGPPSSSSQNGRSTDSMHCPLGKA ADTQYQPMKAAMRETVPCKATGAELPKDMGAYLLHQHDLDVRYRAKGDDFGALRFGALWT CMGPVAPLFWPISPIWNDRIYPMPVLSLYLGSN >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_6|462_bp atggggtgctcatggataacctctgctaggacagtgcagaagggaaatgtgaggtcagaa gctccacacagagtccccactggggtactgtccagtggagcagtgagaagtggaccaccg tcctccagctctcagaatggtagatccactgacagcatgcactgtccacttggaaaagct gcagacactcaatatcagcccatgaaagcagccatgagggagactgtaccttgcaaagcc acaggagcagagctgcccaaagacatgggagcctacctcttgcatcagcatgacctggat gtgagatatagagccaaaggagatgattttggagctttaagatttggtgccctttggact tgcatggggcctgtagcccctttgttttggccaatttctcccatttggaatgaccgtatt tatccaatgcctgtactctcattgtatctaggaagtaactaa >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_7|132_aa MGERGEEATEEMLEASREQSKLKTTWRQLTILDAIKKICDSWKEDKVSTLTDTWKKLNQT LMDNLQAFKTSVVEVTADVVEIARELELEVEPEDVAELMQFYQETLTNEELLFMDEQRKW FLRQILFLVNML >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_7|399_bp atgggtgagagaggtgaagaagctacagaagaaatgttggaagctagcagagagcaaagt aaattgaaaaccacctggagacaacttaccattctagatgccattaaaaagatttgtgat tcatggaaggaagacaaagtgtcaacattaacagacacttggaagaagttgaatcaaacc ctcatggataacttgcaggcgttcaagacttcagtggtggaagtaactgcagatgtggtg gaaatagcaagagaactagaactggaagtggagcctgaagatgtggctgaattgatgcaa ttttatcaggaaactttaacaaatgaggagttgctttttatggatgagcaaagaaaatgg ttcttgagacaaatattattcctggtgaacatgctgtga >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_8|201_aa MQDEDGYITLNIKTRKPALISVGSASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRN YLQGENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN LTWEESKQYCTDMNATLLKIDNRNIVIMRFPIILEPSPSLLQGSYCTYCSLAPNFMLNEI DIGLFKYAYLVLPSRVELSQQ >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_8|606_bp atgcaggatgaagatggatacatcaccttaaatattaaaactcggaaaccagctctcatc tccgttggctctgcatcctcctcctggtggcgtgtgatggctttgattctgctgatcctg tgcgtggggatggttgtcgggctggtggctctggggatttggtctgtcatgcagcgcaat tacctacaaggtgagaatgaaaatcgcacaggaactctgcaacaattagcaaagcgcttc tgtcaatatgtggtaaaacaatcagaactaaagggcactttcaaaggtcataaatgcagc ccctgtgacacaaactggagatattatggagatagctgctatgggttcttcaggcacaac ttaacatgggaagagagtaagcagtactgcactgacatgaatgctactctcctgaagatt gacaaccggaacattgtgataatgcgttttcccattattttggagccatctccatcactt cttcagggctcctattgtacctactgttcccttgctcctaattttatgctaaatgaaatt gatattgggctttttaaatatgcatatcttgtactgccaagcagggtggaactttctcaa caatga >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_9|281_aa MSEEVTYATLTFQDSAGARNNRDGNNLRKRGHPAPSPIWRHAALGLVTLCLMLLIGLVTL GMMFLQISNDINSDSEKLSQLQKTIQQQQDNLSQQLGNSNNLSMEEEFLKSQISSVLKRQ EQMAIKLCQELIIHTSVLEVGKCKLKAPAGLVLDEAILFFKKATVTLGTEEGGQRPFRQL RAKEGTDYEEATDYEEATIPSQGTLTRTRTHSHWDHVDTPIHLTFTFWDMGGNWRTQEKP MQTRGDRANSTQWPQLGSNFFSQHRYNKTILHKRMLLKNLL >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_9|846_bp atgtctgaagaagtgacctacgcgacactcacatttcaggattctgctggagcaaggaat aaccgagatggaaataacctaagaaaaagagggcatccagctccatctcccatttggcgt catgctgctctgggtctggtaactctttgcctgatgttgctgattgggctggtgacattg gggatgatgtttttgcagatatctaatgacattaactcagattcagagaaattgagtcaa cttcagaaaaccatccaacagcagcaggataacttatcccagcaactgggcaactccaac aacttgtccatggaggaggaatttctcaagtcacagatctccagtgtactgaagaggcag gaacaaatggccatcaaactgtgccaagagctaatcattcatacttcagttctagaagtt gggaagtgtaagttaaaggcaccagcaggtttggtgcttgatgaggccattctcttcttt aagaaggcaactgtaaccttgggaacggaggaaggtggccagagaccattcaggcagctc agggccaaggagggcactgactatgaagaggccaccgactatgaagaggccaccatccca tcgcagggcacactcacacgcacccgcactcactcacactgggaccatgtagacacgcca attcacctaacgttcacattttgggacatgggaggaaactggaggacccaggaaaaaccc atgcaaacacggggagaccgtgcaaactccacacagtggccccagctgggaagcaatttt ttttctcaacatcgttataacaaaacaatcttacataaaaggatgttactcaagaacctg ctttaa >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_10|156_aa MALILLILCMGMVVGLVALGIWCDYKRSPCDANWTYYGDSCYGFFKHNLTWKESEQYCTD MKATLLKTDNQNILFNLIYNIQYYVETLGIQKNATKGNVQSNAKEDFITGSLRKIKGSYD YWVGLSQDGHSGRWLWQDGSSPSPGLAHDLFPVDKS >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_10|471_bp atggctttgattctgctgatcctgtgcatggggatggttgtcgggctggtggctctgggg atttggtgtgactataaacgcagcccctgtgatgccaactggacatattatggagatagc tgctatgggttcttcaaacacaacttgacatggaaagagagtgagcagtactgcactgac atgaaggctactctcctgaagactgacaaccagaacattctgttcaatcttatttataac atccaatactacgtggaaacactagggatccaaaagaatgcaactaaaggaaatgtacag tcaaatgccaaagaggattttatcactggcagcttgaggaagattaaaggaagctatgat tactgggtggggttgtctcaggatggacacagcggacgctggctttggcaagatggctcc tctccttctcctggcctggctcatgacctgtttcctgttgataaatcctga >gi568815586r:9893146_10099100|GENSCAN_predicted_peptide_11|234_aa LTVARRPRAIRPHFTLTAVGIQMQAKYSSTRDMLDDDGDTTMSLHSQGSATTRHPEPRRT EHRAPSSTWRPVALTLLTLCLVLLIGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQ ELQSLQVQNIKLAGSLQHVAEKLCRELYNKAGGLLRPDSGKAWLWMDGTPFTSELFHIII DVTSPRSRDCVAILNGMIFSKDCKELKRCVCERRAGMVKPESLHVPPETLGEGD >gi568815586r:9893146_10099100|GENSCAN_predicted_CDS_11|705_bp ctcacagtagcccggcggcccagggcaatccgaccacatttcactctcaccgctgtagga atccagatgcaggccaagtacagcagcacgagggacatgctggatgatgatggggacacc accatgagcctgcattctcaaggctctgccacaactcggcatccagagccccggcgcaca gagcacagggctccctcttcaacgtggcgaccagtggccctgaccctgctgactttgtgc ttggtgctgctgatagggctggcagccctggggcttttgttttttcagtactaccagctc tccaatactggtcaagacaccatttctcaaatggaagaaagattaggaaatacgtcccaa gagttgcaatctcttcaagtccagaatataaagcttgcaggaagtctgcagcatgtggct gaaaaactctgtcgtgagctgtataacaaagctggagggcttttgcgccctgacagtggc aaggcctggctgtggatggatggaacccctttcacttctgaactgttccatattataata gatgtcaccagcccaagaagcagagactgtgtggccatccttaatgggatgatcttctca aaggactgcaaagaattgaagcgttgtgtctgtgagagaagggcaggaatggtgaagcca gagagcctccatgtcccccctgaaacattaggcgaaggtgactga