GENSCAN 1.0    Date run: 6-Nov-116    Time: 14:36:39

Sequence gi568815585r:46734840_46995906 : 261067 bp : 38.56% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -    304    299    6                               1.05
 1.08 Term -   6039   5898  142  2  1   53   47   109 0.794  -0.28
 1.07 Intr -   7138   7073   66  0  0   27   97    83 0.443   0.20
 1.06 Intr -   7328   7215  114  1  0   72   89    80 0.805   5.14
 1.05 Intr -   7605   7489  117  2  0   92   25    82 0.336   0.96
 1.04 Intr -  12250  12089  162  0  0  111   38   104 0.258   5.87
 1.03 Intr -  17606  17426  181  0  1   83   52    91 0.251   3.00
 1.02 Intr -  27335  27190  146  1  2   64   73    92 0.583   4.31
 1.01 Init -  27596  27439  158  2  2   65   56   131 0.572   7.03
 1.00 Prom -  31197  31158   40                              -6.15

 2.06 PlyA -  31565  31560    6                               1.05
 2.05 Term -  31982  31759  224  0  2  -34   39   211 0.004   0.10
 2.04 Intr -  42784  42613  172  1  1  118    6   182 0.092  11.59
 2.03 Intr -  47952  47828  125  1  2   36   76   168 0.916   9.78
 2.02 Intr -  49511  49413   99  0  0   83   99    34 0.873   3.16
 2.01 Init -  52251  52182   70  0  1   58  111    30 0.860   3.56
 2.00 Prom -  55162  55123   40                              -7.15

 3.00 Prom +  59143  59182   40                              -5.35
 3.01 Init +  61679  61774   96  1  0   65  109    91 0.362   9.26
 3.02 Term +  62033  62305  273  1  0   31   42   161 0.599   0.29
 3.03 PlyA +  62640  62645    6                               1.05

 4.11 PlyA -  63368  63363    6                               1.05
 4.10 Term -  69294  69247   48  0  0  105   40    46 0.367  -2.17
 4.09 Intr -  83508  83395  114  0  0  108   47    44 0.350   2.02
 4.08 Intr -  83789  83643  147  2  0   80   66    44 0.076   0.91
 4.07 Intr - 100800 100013  788  1  2  108  105   498 0.185  43.76
 4.06 Intr - 108971 108863  109  2  1   69  106    17 0.060   0.54
 4.05 Intr - 123665 123577   89  0  2   72  103    10 0.035  -0.33
 4.04 Intr - 132605 132477  129  0  0    6  101    79 0.003   0.75
 4.03 Intr - 142842 142750   93  1  0   90   86    41 0.062   3.12
 4.02 Intr - 157751 157551  201  0  0  115   97   127 0.775  14.64
 4.01 Init - 161067 160656  412  0  1   74  101   216 0.929  18.22
 4.00 Prom - 190441 190402   40                              -3.35

 5.03 PlyA - 191104 191099    6                               1.05
 5.02 Term - 201610 201099  512  2  2   14   36   357 0.102  16.65
 5.01 Init - 227639 227555   85  2  1   99   31   101 0.483   6.53
 5.00 Prom - 228379 228340   40                              -6.65

 6.03 PlyA - 228979 228974    6                               1.05
 6.02 Term - 229291 229121  171  1  0   59   36   155 0.522   4.24
 6.01 Intr - 256754 256589  166  1  1   60   71   135 0.394   8.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  42784  42596  189  1  0  118   33   170 0.907  10.97

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_1|361_aa
MEAEEFHDMPSESWRTRGAGSVAQSKSEGLRSREASGVTLSLTPETGNLGDCSSSRGLEN
LQFRRTGEASAPASGETENSPFLCLSVLSEPLADWVVPALIETSPHLPKSGGQPLPWLML
VCADCTLNPPPGGVFSFASPSTFVLQTVAAAALKTSNGFSRRPPEEAQRCPQITLLRLEG
CKKTHTGKQKPYTSMAILVILSQDWNEDSGGNYTLRKFGRNPIEWPSRQCEYVYTSNLVS
SGIIKVLYRSSWAKMAAAHREGEEELRGTAAGTMWMRKHAVCIHFAVVVVVFHFKTREIT
EDMTGMEVCKAKAQSHHVRKQNTYFSANCKLRGSVFCVDFQQPIAVRKQTIVSHAEFLNR
K

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_1|1086_bp
atggaggctgaggagttccacgatatgccatctgagagctggagaaccagaggagctggt
agtgtggctcaatccaagtctgaaggcctgagaagcagggaagccagtggtgtcactctc
agtctgacgccggagactgggaacctgggggactgcagttctagcagagggctggagaac
ctgcagttcaggagaacaggagaagccagtgccccagcttcaggagaaacagagaattca
cctttcctctgcctttctgttctatctgagcccttggccgattgggtggtgcccgccctc
attgaaacctccccccatcttcccaagagtggagggcagccattgccttggttgatgttg
gtctgtgctgactgcacactgaatcctcctcctggtggtgtcttttcttttgcgtccccc
tctacatttgtcttacaaactgtggctgctgctgctctaaaaacttccaatggcttttcg
aggagaccccctgaggaagcccagaggtgccctcagattactttgctgaggttggaggga
tgtaagaagactcacacaggaaaacagaaaccttatacaagcatggccattctggttatt
ctatcacaggattggaatgaagactctggtgggaattacaccttgaggaaatttgggagg
aatccaatcgaatggccatcaaggcagtgtgagtacgtgtacacgtccaacttggtcagt
agtggtattattaaagttttgtatagatcaagttgggccaaaatggcagcagctcaccgg
gagggagaggaggaactgcgtggcactgctgctggcactatgtggatgaggaagcatgcg
gtgtgcatacactttgcagtcgtcgtcgttgtatttcacttcaaaactcgagagataact
gaagacatgacagggatggaagtttgcaaggcaaaggctcaaagtcaccatgtaaggaag
cagaacacctatttctcagctaactgcaaactaagaggctcagttttctgcgttgacttt
cagcagcctattgctgtcagaaagcaaacaattgtctcacatgctgaattcctgaaccgg
aaatga

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_2|229_aa
MKFAVYLPPKAETGKCPALYWLSGLTCTEQNFISKSGYHQSASEHGLVVIAPDTSPRGCN
IKGEDESWDFGTGAGFYVDATEDPWKTNYRMYSYVTEEAYDATHLVKSYPGSQLDILIDQ
GKDDQFLLDGQLLPDNFIAACTEKKIPVVFRLQEAKLGFRMKKGNVTVGVPVMGMATAVA
KKTLAEVAGGVVGGSKDAAMATAAQGTMTKEASTTVEMEDKVLEPVRVS

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_2|690_bp
atgaaatttgctgtctacttaccaccaaaggcagaaacaggaaagtgccctgcactgtat
tggctctcaggtttaacttgcacagagcaaaattttatatcaaaatctggttatcatcag
tctgcttcagaacatggtcttgttgtcattgctccagataccagccctcgtggctgcaat
attaaaggtgaagatgagagctgggactttggcactggtgctggattttatgttgatgcc
actgaagatccttggaaaaccaactacagaatgtactcttatgtcacagaggaggcttat
gatgctacccaccttgtgaaatcctatccaggatctcagctggacatactaattgatcaa
gggaaagatgaccagtttcttttagatggacagttactccctgataacttcatagctgcc
tgtacagaaaagaaaatccccgttgtttttcgattgcaagaggcaaaattggggtttcgg
atgaagaaaggaaatgtcactgttggtgttccagtgatggggatggcaacagcagtagct
aagaagacattggcagaggttgccgggggggtggtggggggaagtaaagatgcagcgatg
gccacagcagcacaggggaccatgacaaaagaggcatccactacagtcgaaatggaggac
aaagtgttagaaccagtaagagtttcgtaa

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_3|122_aa
MTSSPPQRSESILRPEQILSNSKDTCSPRHHQVGGSEKEKGLEARLRVTGKGPWPPVMDG
GAQGPGSPSLRSCLRIPAPQDSASAPITPSRAAAAGRLTTSRPLPEPSLTYGGGVTRRSG
PK

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_3|369_bp
atgacttccagtcccccacagcgctctgagagcattttacgacccgaacagattttgtca
aactccaaagacacgtgctcccctcggcaccatcaggttggaggctcagagaaggaaaag
ggattagaggcgaggctccgggtcacgggaaaaggtccttggcctcctgtcatggatggt
ggcgcccaaggcccagggtcgccgtctctgcgctcgtgcctgcggatacctgcaccgcag
gactcggcgtccgccccgatcacgcctagccgtgccgcagccgcaggaaggctgacaacg
tccaggcctctacccgaacccagtcttacctacggtggcggagtgaccagaagaagcggg
ccgaagtaa

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_4|709_aa
MDILCEENTSLSSTTNSLMQLNDDTRLYSNDFNSGEANTSDAFNWTVDSENRTNLSCEGC
LSPSCLSLLHLQEKNWSALLTAVVIILTIAGNILVIMAVSLEKKLQNATNYFLMSLAIAD
MLLGFLVMPVSMLTILYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCAISLDRYVAIQNP
IHHSRFNSRTKAFLKIIAVWTISVGHKFSECKEWKAKQKMYFILSTNYVIENAYPVNSGT
RQIQIENDPFPEMLLTCQVIWITFIFHDEGNQSASAFKCIKHDRQNPSRIIPCLPSTALC
TTCNSSWHPPLLSPCYIEILEPLLIGVAFQGKESHQNWSRAKMKSISMPIPVFGLQDDSK
VFKEGSCLLADDNFVLIGSFVSFFIPLTIMVITYFLTIKSLQKEATLCVSDLGTRAKLAS
FSFLPQSSLSSEKLFQRSIHREPGSYTGRRTMQSISNEQKACKVLGIVFFLFVVMWCPFF
ITNIMAVICKESCNEDVIGALLNVFVWIGYLSSAVNPLVYTLFNKTYRSAFSRYIQCQYK
ENKKPLQLILVNTIPALAYKSSQLQMGQKKNSKQDAKTTDNDCSMVALGKQHSEEASKDN
SDGVNEKLSVERILLPAAAGHLPHLSILCSLSYLPILCPALAGPRAFMDLTGSPEEPFMA
PGSGPNPALRSEQAGVKRGQAVGADTPEPARTEGHIDVIASFRWTLGWP

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_4|2130_bp
atggatattctttgtgaagaaaatacttctttgagctcaactacgaactccctaatgcaa
ttaaatgatgacaccaggctctacagtaatgactttaactccggagaagctaacacttct
gatgcatttaactggacagtcgactctgaaaatcgaaccaacctttcctgtgaagggtgc
ctctcaccgtcgtgtctctccttacttcatctccaggaaaaaaactggtctgctttactg
acagccgtagtgattattctaactattgctggaaacatactcgtcatcatggcagtgtcc
ctagagaaaaagctgcagaatgccaccaactatttcctgatgtcacttgccatagctgat
atgctgctgggtttccttgtcatgcccgtgtccatgttaaccatcctgtatgggtaccgg
tggcctctgccgagcaagctttgtgcagtctggatttacctggacgtgctcttctccacg
gcctccatcatgcacctctgcgccatctcgctggaccgctacgtcgccatccagaatccc
atccaccacagccgcttcaactccagaactaaggcatttctgaaaatcattgctgtttgg
accatatcagtaggtcacaaattttcagagtgcaaggaatggaaagcaaagcagaaaatg
tacttcatcttgagtacaaactatgtaatagaaaatgcctatccagtaaacagtggaaca
agacagattcaaattgaaaatgatccctttcctgagatgcttttaacttgtcaagttatt
tggataaccttcattttccatgatgaaggaaatcagtcagcgtcagcctttaaatgcata
aaacatgacaggcaaaacccatcaagaattattccctgccttccatccacagccctctgt
actacctgcaattcctcatggcatccgccactcctaagcccctgctacatagaaatcctg
gagcccctactcattggtgtagcatttcagggaaaggaaagtcatcaaaattggtccagg
gcaaaaatgaaaagtatatccatgccaataccagtctttgggctacaggacgattcgaag
gtctttaaggaggggagttgcttactcgccgatgataactttgtcctgatcggctctttt
gtgtcatttttcattcccttaaccatcatggtgatcacctactttctaactatcaagtca
ctccagaaagaagctactttgtgtgtaagtgatcttggcacacgggccaaattagcttct
ttcagcttcctccctcagagttctttgtcttcagaaaagctcttccagcggtcgatccat
agggagccagggtcctacacaggcaggaggactatgcagtccatcagcaatgagcaaaag
gcatgcaaggtgctgggcatcgtcttcttcctgtttgtggtgatgtggtgccctttcttc
atcacaaacatcatggccgtcatctgcaaagagtcctgcaatgaggatgtcattggggcc
ctgctcaatgtgtttgtttggatcggttatctctcttcagcagtcaacccactagtctac
acactgttcaacaagacctataggtcagccttttcacggtatattcagtgtcagtacaag
gaaaacaaaaaaccattgcagttaattttagtgaacacaataccggctttggcctacaag
tctagccaacttcaaatgggacaaaaaaagaattcaaagcaagatgccaagacaacagat
aatgactgctcaatggttgctctaggaaagcagcattctgaagaggcttctaaagacaat
agcgacggagtgaatgaaaagctctcagtggagaggatactcctccctgcagctgctggt
catcttcctcatctctccatcctctgctctcttagttatctgcccatcctctgccctgct
ctggctgggcccagggcttttatggacctcacaggcagcccagaagagccattcatggcc
ccggggtctggccccaaccctgctctgagatcagagcaggcaggagtgaagagaggccag
gcagtgggagcagacacccccgagcctgcaaggactgaggggcacatagatgtcattgct
tccttccgatggactctgggctggccttag

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_5|198_aa
MDQLLHESEIMHIVIIRDHTFQPSRPSGGRCGGRGASGNRGCVRHLRASWSSGWAWAWRA
PTRSSQPALLAPGNEGLSTRASGCGGCTGSPSSASPPALHSISHRALAAFPRGRARDLQP
AMPEPPTHSMGSCAAQASPTSTTPCSMAPSPIDHPRAEECERTAWDWQAAPPAALVWDPL
GEASWAPESGGDMESLYI

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_5|597_bp
atggatcagcttctgcatgagtcagagatcatgcacattgtcatcatccgggatcacact
tttcaaccatcaagaccaagtggagggaggtgtggaggtagaggcgccagcgggaaccgg
ggctgtgtgcggcacttgcgggccagctggagttctgggtgggcatgggcttggcgggcc
cccactcggagcagccagccagccctgctggccccgggcaatgagggacttagcacccgg
gccagtggctgcggagggtgcactgggtcccccagcagtgccagcccaccggcgctgcac
tcgatttctcaccgagccttagctgccttcccgcggggcagggctcgggacctgcagccc
gccatgcctgagcctcccacccactccatgggctcctgtgcggcccaagcctccccaacg
agcaccactccctgctccatggcgccaagtcccatcgaccacccaagggctgaggaatgc
gagcgcacggcgtgggactggcaggcagctccacctgcagccctggtgtgggatccacta
ggtgaagccagctgggctcctgagtctggtggggacatggagagtctttatatctag

>gi568815585r:46734840_46995906|GENSCAN_predicted_peptide_6|112_aa
XSTEAEAVNRNTSDEGSVNLAPNCNSFSGATLSQWLYIFAFKACMASRVDDPLQDEWEKD
LARFLLAIKGAESLALCSSSATPTAAHWAHCVACWIWGQGNRYCPVDALAVL

>gi568815585r:46734840_46995906|GENSCAN_predicted_CDS_6|339_bp
nngagcactgaggctgaagctgttaatagaaacacctccgatgagggctctgtaaatcta
gctcccaactgcaactctttcagtggtgcaacactgagccagtggctatacatctttgcc
ttcaaggcctgcatggcttccagggtagatgatcccctacaagacgagtgggagaaggat
ctggctcgcttcctgcttgctataaaaggtgctgagtccttggccctgtgttcctcttct
gcaacaccaaccgctgcccactgggctcattgcgtggcctgttggatttggggccaagga
aaccggtactgccctgttgatgctcttgctgtgctgtaa