GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:00:51 Sequence gi568815584f:73850333_74061904 : 211572 bp : 43.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1144 1183 40 -2.76 1.01 Init + 20035 20083 49 0 1 86 58 45 0.848 0.41 1.02 Term + 20779 20903 125 2 2 86 47 142 0.913 8.45 1.03 PlyA + 21121 21126 6 1.05 2.00 Prom + 21960 21999 40 -6.26 2.01 Init + 26789 26836 48 1 0 59 97 30 0.675 1.16 2.02 Intr + 28203 28358 156 2 0 68 38 75 0.535 0.71 2.03 Intr + 29723 29844 122 1 2 85 93 105 0.979 9.99 2.04 Intr + 30873 30960 88 0 1 122 70 9 0.872 2.47 2.05 Intr + 43465 43600 136 0 1 111 98 1 0.961 3.54 2.06 Intr + 47739 47930 192 1 0 124 77 92 0.973 11.16 2.07 Intr + 53628 53778 151 1 1 53 97 64 0.954 3.02 2.08 Intr + 54570 54751 182 0 2 83 89 203 0.951 19.41 2.09 Intr + 59009 59098 90 0 0 102 69 55 0.866 4.97 2.10 Intr + 59693 59722 30 0 0 73 92 30 0.338 0.00 2.11 Term + 70150 70295 146 2 2 72 44 119 0.436 3.97 2.12 PlyA + 70990 70995 6 1.05 3.10 PlyA - 71374 71369 6 1.05 3.09 Term - 84062 83924 139 1 1 95 37 42 0.767 -2.76 3.08 Intr - 85756 85617 140 1 2 66 91 112 0.965 8.56 3.07 Intr - 87369 87270 100 2 1 35 93 92 0.977 4.51 3.06 Intr - 87780 87616 165 2 0 85 77 108 0.732 8.48 3.05 Intr - 90721 90594 128 1 2 111 109 51 0.996 8.98 3.04 Intr - 92383 92037 347 2 2 106 94 329 0.999 30.41 3.03 Intr - 94553 94003 551 1 2 107 65 446 0.905 36.71 3.02 Intr - 96273 95954 320 0 2 88 94 254 0.991 20.76 3.01 Init - 97745 97743 3 2 0 66 66 0 0.517 -4.40 3.00 Prom - 98259 98220 40 -5.36 4.00 Prom + 99459 99498 40 -3.66 4.01 Init + 100001 100163 163 1 1 108 94 219 0.999 22.39 4.02 Intr + 103103 103237 135 0 0 69 67 83 0.587 4.84 4.03 Intr + 105119 105177 59 0 2 101 49 33 0.609 -0.70 4.04 Intr + 105473 105596 124 1 1 110 81 163 0.865 17.96 4.05 Intr + 107815 107945 131 2 2 105 -4 67 0.792 -0.39 4.06 Intr + 108639 108746 108 2 0 36 94 88 0.773 4.68 4.07 Intr + 108830 108892 63 1 0 84 111 -8 0.459 0.01 4.08 Intr + 109083 109190 108 2 0 39 59 90 0.676 1.68 4.09 Intr + 110841 111043 203 2 2 18 86 214 0.919 12.28 4.10 Intr + 111123 111238 116 0 2 72 82 58 0.913 3.69 4.11 Intr + 111405 111571 167 1 2 96 69 65 0.846 5.08 4.12 Term + 112638 112667 30 2 0 102 48 22 0.614 -2.55 4.13 PlyA + 112758 112763 6 1.05 5.12 PlyA - 112921 112916 6 1.05 5.11 Term - 113154 113125 30 0 0 99 42 15 0.021 -4.15 5.10 Intr - 119793 119678 116 1 2 98 110 98 0.996 13.17 5.09 Intr - 121576 121520 57 2 0 74 115 15 0.757 1.66 5.08 Intr - 122692 122552 141 2 0 99 97 140 0.868 16.32 5.07 Intr - 126727 126692 36 2 0 125 90 -12 0.617 0.93 5.06 Intr - 127042 126967 76 1 1 121 81 72 0.955 8.89 5.05 Intr - 132829 132686 144 1 0 25 107 82 0.483 4.28 5.04 Intr - 136561 136482 80 2 2 23 109 16 0.435 -3.53 5.03 Intr - 137699 137554 146 1 2 85 111 162 0.374 18.13 5.02 Intr - 152963 152879 85 0 1 75 63 9 0.208 -4.02 5.01 Init - 153333 153309 25 0 1 102 98 24 0.684 4.49 5.00 Prom - 160755 160716 40 -4.56 6.00 Prom + 161058 161097 40 -5.06 6.01 Init + 169147 169202 56 0 2 104 72 53 0.271 6.06 6.02 Intr + 172584 172816 233 0 2 88 82 127 0.691 9.52 6.03 Intr + 173766 173794 29 1 2 78 93 30 0.507 0.33 6.04 Intr + 178852 178917 66 0 0 77 97 69 0.709 5.80 6.05 Intr + 183696 183839 144 2 0 84 95 93 0.992 10.08 6.06 Intr + 199370 199863 494 1 2 96 34 257 0.335 12.90 6.07 Intr + 205252 205353 102 1 0 68 61 64 0.441 0.99 6.08 Intr + 206574 206648 75 0 0 90 86 45 0.797 3.13 6.09 Intr + 206812 206926 115 1 1 88 70 83 0.887 6.95 6.10 Term + 207255 207299 45 2 0 99 49 30 0.827 -2.59 6.11 PlyA + 208354 208359 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_1|57_aa MGFRHVGQAGLKLLTSGRGQQVKVEFITNGQWFNQSCLYNDAFIKTQKDLIEEFPNS >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_1|174_bp atggggtttcgccatgttggccaggctggtctcaaactcctgacctcaggtaggggacag caggtgaaggttgagtttatcaccaatggccagtggtttaatcaatcatgtctatataat gacgccttcatcaaaacccaaaaggacctgattgaggagtttccaaatagctga >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_2|446_aa MVVSGAAGACGSVAGQASDISMTLSRDAGQRQQAVAPGQPPYVAEQLRESCPAGVDVYFD NDHKGKQLMNENSHIILCGQISQYNKDVPYPPPLSPAIEAIQKERNITRERFLVLNYKDK FEPGILQLSQWFKEGKLKLLVQFVQNTSIPLGQGLVESEAKDITCLSLLPVTEASECSRL MLPGLGSSAEHLVFVQDEAEDSGNDFLSSESTDSSIPWFLRVQELAHDSLIAATRAQLAK NAKTSSNGENVHLGSGDGQSKDSGPLPQVEKKLKCTVEGCDRTFVWPAHFKYHLKTHRND RSFICPAEGCGKSFYVLQRLKVHMRTHNGEKPFMCHESGCGKQFTTAGNLKNHRRIHTGE KPFLCEAQGCGRSFAEYSSLRKHLVVHSDISHPLNQKSGRMETESLGLEDIEDNLDYYVV RATSVNVDDPKEGGREYDKGILGTQD >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_2|1341_bp atggttgtcagtggggccgcaggtgcctgtggatctgtggctgggcaggctagcgatatc agtatgacactctctcgagatgctgggcagcggcaacaagctgtagctcctggtcagcca ccgtatgtggctgaacaactccgtgaatcatgcccagctggagtggatgtttattttgac aatgaccataagggtaaacaactgatgaatgagaacagccacatcatcctgtgtggtcaa atttctcagtacaacaaagatgtgccttatcctcccccgctatcccctgctatagaggca atccagaaagaaagaaacatcacaagggaaagatttctggtattaaattataaagacaaa tttgagcctggcattctacagctgagtcagtggtttaaagaaggaaagctaaagctcctg gtacagtttgttcagaatacgtccatcccattgggacaggggcttgtagaatcagaagct aaagatattacttgcttgtccctccttcccgtgactgaagcctcagaatgcagtcggcta atgttaccaggtctgggctcttcagctgagcacttagtgtttgtacaggatgaggcagaa gattcagggaatgatttcctctccagtgagagcacagacagtagcattccatggttcctc cgggttcaggagttggcccatgacagtttgattgctgctactcgtgcacaactggcaaag aatgcaaaaaccagcagcaatggagaaaatgtccaccttggttctggtgatgggcagtca aaagattctgggccccttcctcaagtggaaaagaagctcaagtgtacagttgaaggttgt gaccggacatttgtatggccagctcactttaaataccacctcaagactcatcgaaatgac cgctccttcatctgtcctgcagaaggttgtgggaaaagcttctatgtgctgcagaggctg aaggtgcacatgaggacccacaatggagagaagccctttatgtgccatgagtctggctgt ggtaagcagtttactacagctggaaacctgaagaaccaccggcgcatccacacaggagag aaacctttcctttgtgaagcccaaggatgtggccgttcctttgctgagtattctagcctc cgaaaacatctggtggttcactcagatatatctcatcctctcaaccagaagtcaggtaga atggagacggagagtttgggacttgaagacatagaggacaatttagattattatgttgta agagccaccagtgtgaatgttgatgaccctaaagagggtggcagggaatatgacaagggg atattaggaacacaggattga >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_3|630_aa MIFPPESFADTEAGEELSGDGLVLPRASKLDEFLSPEEEIDSTSDSTGSIYQNLQELKQK GRWCLLESLFQSDPESDENLSEDEEDLESFFQDKDRGMVQVQCPQALRCGSTRRCSSLNN LPSNIPRPQTQPPSGSRPPSQHRSVSSWASSITVPRPFRMTLREARKKAEWLGSPASFEQ ERQRAQRQGEEEAECHRQFRAQPVPAHVYLPLYQEIMERSEARRQAGIQKRKELLLSSLK PFSFLEKEEQLKEAARQRDLAATAEAKISKQKATRRIPKSILEPALGDKLQEAELFRKIR IQMRALDMLQMASSPIASSSNRANPQPRTATRTQQEKLGFLHTNFRFQPRVNPVVPDYEG LYKAFQRRAAKRRETQEATRNKPFLLRTANLRHPQRPCDAATTGRRQDSPQPPATPLPRS RSLSGLASLSANTLPVHITDATRKRESAVRSALEKKNKADESIQWLEIHKKKSQAMSKSV TLRAKAMDPHKSLEEVFKAKLKENRNNDRKRAKEYKKELEEMKQRIQTRPYLFEQVAKDL AKKEAEQWYLDTLKQAGLEEDFVRNKGQGTRAVQEKETKIKDFPRFQETTKLSIRDPEQG LEGSLEQPASPRKVLEELSHQSPENLVSLA >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_3|1893_bp atgatatttccccccgagtccttcgcagacacagaggcaggagaggagctgtccggggat gggctggttttgcccagggccagcaaacttgacgagttcctcagcccagaggaggagata gattctacttctgactcaactgggagcatttaccagaacttacaggaactgaagcagaaa gggagatggtgtctgttggagtctctctttcagtctgacccagagagtgatgaaaacctc tctgaagatgaggaggacctggagagtttcttccaagacaaggacagggggatggtgcag gtccagtgcccgcaggctctgaggtgtggctccacaaggcgctgcagctccctgaacaac cttccctccaacattcccaggcctcagacccagccaccctcaggctcccggcctccctcc cagcacagaagcgtcagctcctgggcatcatccattactgtccctcggccattccgcatg acgctgcgcgaggcccggaagaaggccgagtggctgggctcacctgcctcctttgagcag gagaggcagcgggcccagaggcagggtgaggaagaggccgagtgccacaggcagttccgg gcacagcctgtgcctgcacatgtctacctgcccctctaccaagagatcatggagcgcagc gaggcccgaaggcaggcagggatccagaagaggaaggaactgctcctctcttctttgaag cccttcagcttcctggagaaggaggagcagctaaaggaagctgctcgacagagagacttg gcagccacagctgaagccaagatctccaagcagaaggccaccagaaggattcccaagtcc attctggagccagcccttggggataaactccaggaagctgagctcttcaggaaaattcgc atccaaatgagagccctggacatgctccagatggcctcttcccctatcgcctcctctagt aaccgggctaacccacagccccgcacagccacccgaacccagcaggaaaagcttgggttt ctgcacactaacttcagattccagcctcgggtgaatcctgtggtccctgactatgagggc ctttacaaggccttccagagaagagcagccaaaagaagagaaacccaagaggccactcgc aacaagcccttcttgctgaggaccgccaacctgcgccaccctcagcggccctgtgatgct gccaccaccggaaggaggcaggattccccacagccaccagctacacccctgccaaggagt cgttctctgagcggccttgcttccctctctgccaacactctccctgtgcacatcacagat gccaccaggaagagggaatctgcagtcagaagtgcacttgaaaaaaagaacaaagcagat gagagtattcagtggctggagatacacaaaaagaagtctcaagcaatgtccaaatctgtg accttgcgtgcaaaagccatggatccccataaaagcctggaggaagtgttcaaagcaaag ctgaaagagaaccggaacaatgaccgtaaaagagcgaaagaatataagaaagaactggag gaaatgaagcagcgaatacaaacaaggccctatctctttgaacaagttgccaaggatcta gccaagaaagaagcagaacagtggtatctagacaccctgaagcaggctgggctggaggaa gactttgtgagaaacaagggtcaaggcacccgggctgttcaagagaaagagaccaaaatc aaggattttcccaggttccaagaaactacaaaactcagcatcagagatccagagcagggt ttagaaggatctctagaacagcctgcaagccccaggaaagtactggaggagctgtctcat cagtcaccagaaaatctcgtatcacttgcttaa >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_4|468_aa MAARLVSRCGAVRAAPHSGPLVSWRRWSGASTDTVYDVVVSGGGLVGAAMACALGYDIHF HDKKILLLEAGPKKVLEKLSETYSNRVSSISPGSATLLSSFGAWDHICNMRYRAFRRMQV WDACSEALIMFDKDNLDDMGYIVENDVIMHALTKQLEAVSDRVTVLYRSKAIRYTWPCPF PMADSSPWVHITLGDGSTFQTKLLIGADGHNSGVRQAVGIQNVSWNYDQSAVVATLHLSE ATENNVAWQRFLPSGPIALLPLSDTLSSLVWSTSHEHAAELVSMDEEKFVDAVNSAFWSD ADHTDFIDTAGAMLQYAVSLLKPTKVSARQLPPSVARVDAKSRVLFPLGLGHAAEYVRPR VALIGDAAHRVHPLAGQGVNMGFGDISSLAHHLSTAAFNGKDLGSVSHLTGYETERQRHN TALLAATDLLKRLYSTSASPLVLLRTWGLQATNAVSPLKEQIMAFASK >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_4|1407_bp atggcggcccggcttgtcagccgatgcggggctgtgcgtgcagctccccacagcggcccg ctggtgtcctggcgcaggtggtccggcgcctcaacagacaccgtgtatgacgtggtggtg tcgggtggaggcctggtgggcgctgccatggcctgtgccttgggatatgatattcacttt catgacaagaaaatcctgttgctcgaagcaggtccaaagaaagtactggagaaattgtca gaaacttacagcaacagggtcagctccatttcccctggctctgcaacgcttctcagtagt tttggtgcctgggaccatatctgcaacatgagatacagagcctttcggcgaatgcaggtg tgggacgcctgctcagaggccctgataatgtttgataaggataatttagatgacatgggc tatatcgtggagaatgatgtcatcatgcatgctctcactaagcagttggaggctgtgtct gaccgagtgacggttctctacaggagcaaagccattcgctatacctggccttgtccattt cctatggccgactccagcccttgggttcatattaccctaggtgatggcagcaccttccag accaaattgttgataggtgcagatggtcacaactccggagtacggcaggctgttggaatc cagaatgtgagctggaactatgaccagtctgctgttgtggctactctgcatttatcagag gccacagaaaacaacgtagcctggcagagatttcttccctctgggcctattgctctgctc ccgctctcagacaccttgagttccttggtttggtccacgtcccatgaacatgcagcagag ctagttagcatggatgaggaaaaatttgtggatgccgttaactctgccttttggagtgat gctgaccacacggacttcatcgacacagctggtgccatgctgcagtatgctgtcagcctt ctgaagcccactaaggtctcggctcgccagctgcccccaagcgtagccagggtggatgcc aaaagccgagttctgtttcctcttgggttgggacatgctgctgagtacgtcaggcctcgg gtggcgctcattggggatgcagcccacagagtccatccgcttgcaggacagggtgtcaac atgggctttggggatatctccagcttggcccatcacctcagtacggcagccttcaatggg aaggacttaggttccgtgagccacctcacaggttatgaaacagaaagacagcgtcacaac actgctcttctggctgctacagacttactaaaaaggctctattctaccagtgcctccccg cttgtgttgctcaggacgtggggcttgcaggccacaaatgcagtgtctccactcaaagaa cagattatggcctttgcaagcaaatga >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_5|311_aa MGSFGYEQGSWVTPVNKKISALEELIFYQRNTDNKYQNQQTWFEGIFLSSMCPINVSAST LYGIMFDAGSTGTRIHVYTFVQKMPGQLPILEGEVFDSVKPGLSAFVDQPKQGAETVQGL LEVAKDSIPRSHWKKTPVVLKATAGLRLLPEHKAKALLFEVKEIFRKSPFLVPKGSVSIM DGSDEGILAWVTVNFLTGEVGFEPCYAEVLRVVRGKLHQPEEVQRGSFYAFSYYYDRAVD TDMIDYEKGGILKVEDFERKAREVCDNLENFTSGSPFLCMDLSYITALLKDGFGFADSTV LQKVQYTCNKG >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_5|936_bp atgggcagttttggttatgagcaaggttcttgggttacaccagtaaacaaaaagatttct gctcttgaggagcttatattctatcagaggaacacagacaataaatatcagaaccagcag acttggtttgagggtatcttcctgtcttccatgtgccccatcaatgtcagcgccagcacc ttgtatggaattatgtttgatgcagggagcactggaactcgaattcatgtttacaccttt gtgcagaaaatgccaggacagcttccaattctagaaggggaagtttttgattctgtgaag ccaggactttctgcttttgtagatcaacctaagcagggtgctgagaccgttcaagggctc ttagaggtggccaaagactcaatcccccgaagtcactggaaaaagaccccagtggtccta aaggcaacagcaggactacgcttactgccagaacacaaagccaaggctctgctctttgag gtaaaggagatcttcaggaagtcacctttcctggtaccaaagggcagtgttagcatcatg gatggatccgacgaaggcatattagcttgggttactgtgaattttctgacaggggaggtg ggctttgagccctgctatgccgaagtgctgagggtggtacgaggaaaacttcaccagcca gaggaggtccagagaggttccttctatgctttctcttactattatgaccgagctgttgac acagacatgattgattatgaaaaggggggtattttaaaagttgaagattttgaaagaaaa gccagggaagtgtgtgataacttggaaaacttcacctcaggcagtcctttcctgtgcatg gatctcagctacatcacagccctgttaaaggatggctttggctttgcagacagcacagtc ttacagaaagtccagtatacctgtaacaaaggttaa >gi568815584f:73850333_74061904|GENSCAN_predicted_peptide_6|452_aa MPSKGKDKKKGKSKGKDTKKLIKTDESVVDRAKANASLWEARLEVTELSRIKYRDTSRIL AKSNEDLKKKQCKMEKDIMSVLSYLKKQDQEKDNMVDEEIDFQKGQIEKLKQQLNETKEK AQEEKDKLEQKYTRQINELEGQFHQKAKEIGMIHTELKAVRQFQKRKIQVERELDDEIND LLVKEKIMQLVQQRSQIQTLQKKVVNLETALSYMTKEFESEVLKLQQHAMIENQAGQVEI DKLQHLLQMKDREMNRVKKLAKNILDERTEVERFFLDALHQVKQQILISRKHYKQIAQAA FNLKMRAACTGRTEYPKIRTFDGREHSTNSVNQDLLEAEKWTHIEGNVDIGDLTWEQKEK VLRLLFAKMNGCPSRKYNQSSRPPVPDYVVSDSGETKEFGDESKLQDKIFITQQIAISDS SGEVVLPTIPKEPQESDTVGSQSHYNLEDKGL >gi568815584f:73850333_74061904|GENSCAN_predicted_CDS_6|1359_bp atgccgtcgaagggaaaggacaaaaagaaaggcaagagcaaaggcaaagacacgaagaag ttaataaaaacagatgaatctgtggtggacagagccaaggccaatgcctccctttgggag gccaggttggaagtcacagaactctctaggattaagtatcgtgatacttcacggatactg gcaaaaagtaatgaggacttaaagaaaaagcaatgtaaaatggagaaagacataatgtca gtattaagttacctgaagaagcaggatcaggagaaagataatatggtagatgaagaaatt gactttcagaaaggacagattgaaaaactgaaacagcaattaaatgaaacaaaggaaaaa gcccaagaggagaaggataaattggaacaaaagtataccaggcaaattaatgaactagag ggacagttccatcaaaaagccaaagaaattggcatgattcacacagagctgaaagcagta agacaattccagaagagaaaaatccaagtggagagagagttagatgatgagatcaatgat ctgttggttaaggaaaagattatgcaacttgtccagcagagatcacaaatccaaaccctt cagaagaaggtagtaaacttggagactgctctgagttacatgaccaaagagtttgagagt gaagttttaaaactgcagcaacacgcaatgatagagaaccaagcaggtcaggtagaaatt gacaagctgcagcaccttcttcagatgaaggacagggaaatgaatcgtgtgaagaagctg gccaagaacatactggatgagagaacagaagtggaaagattctttttagatgctctgcac caagtgaagcaacagatcctaattagcaggaagcattataagcagatagcacaagctgct ttcaatttaaaaatgagagcagcatgtacaggaagaacagaatatcccaaaatcagaaca tttgatggcagagagcacagcaccaatagtgtgaatcaggatcttctggaggccgaaaaa tggacacatattgaaggaaatgtggatattggagatttgacctgggagcagaaggaaaaa gtattgcgattgctctttgcaaaaatgaatggctgtccttctaggaaatacaaccagagt tctaggcctccagttccagactatgttgtttctgacagtggggaaacaaaggaatttggg gatgaaagtaagcttcaagataaaatcttcatcacccagcaaattgcaatatcagactct tctggtgaagtggtgctacccactattccaaaagaacctcaggagtctgacacagtggga agtcagagtcattacaacctagaggacaaaggcttataa