GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:45:38 Sequence gi568815595f:71654050_71855174 : 201125 bp : 41.49% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4264 4353 90 1 0 77 74 78 0.354 3.49 1.02 Term + 9492 9597 106 0 1 60 48 94 0.155 -0.50 1.03 PlyA + 11177 11182 6 1.05 2.03 PlyA - 11382 11377 6 1.05 2.02 Term - 17100 16899 202 2 1 56 54 115 0.888 0.78 2.01 Init - 17413 17253 161 1 2 71 70 203 0.975 14.15 2.00 Prom - 24870 24831 40 -5.75 3.11 PlyA - 25077 25072 6 1.05 3.10 Term - 30679 30633 47 2 2 108 32 80 0.657 0.79 3.09 Intr - 36116 35961 156 0 0 117 111 183 0.999 22.56 3.08 Intr - 36725 36555 171 0 0 4 -2 200 0.731 1.59 3.07 Intr - 39892 39826 67 1 1 58 103 84 0.931 4.46 3.06 Intr - 42471 42411 61 2 1 60 90 57 0.708 0.92 3.05 Intr - 49458 49317 142 1 1 119 67 110 0.763 10.49 3.04 Intr - 56031 55911 121 0 1 50 44 59 0.107 -3.15 3.03 Intr - 56435 56363 73 1 1 73 110 54 0.196 4.59 3.02 Intr - 65480 65299 182 2 2 61 71 73 0.082 0.64 3.01 Init - 67660 67607 54 1 0 71 89 98 0.147 9.53 3.00 Prom - 68485 68446 40 -10.75 4.00 Prom + 69564 69603 40 -6.45 4.01 Init + 70461 70554 94 2 1 78 89 47 0.106 4.39 4.02 Intr + 70796 71001 206 0 2 28 45 221 0.056 9.80 4.03 Intr + 71688 72023 336 2 0 13 80 254 0.108 11.89 4.04 Intr + 77365 77472 108 0 0 80 11 92 0.384 0.26 4.05 Intr + 83151 83219 69 2 0 69 105 65 0.876 4.76 4.06 Term + 84094 84231 138 0 0 82 42 75 0.896 -0.72 4.07 PlyA + 84475 84480 6 1.05 5.00 Prom + 94238 94277 40 -5.15 5.01 Sngl + 100001 101128 1128 1 0 116 55 1029 0.725 96.76 5.02 PlyA + 102036 102041 6 1.05 6.07 PlyA - 102090 102085 6 1.05 6.06 Term - 105984 105851 134 1 2 101 47 63 0.727 0.77 6.05 Intr - 107825 107620 206 1 2 70 80 53 0.776 0.62 6.04 Intr - 110513 110393 121 0 1 85 85 110 0.239 9.03 6.03 Intr - 120458 120396 63 0 0 98 94 56 0.053 4.97 6.02 Intr - 127543 127423 121 1 1 81 15 39 0.042 -4.95 6.01 Init - 131003 130908 96 2 0 104 86 182 0.947 17.96 6.00 Prom - 136643 136604 40 -3.55 7.04 PlyA - 137807 137802 6 1.05 7.03 Term - 145866 145678 189 0 0 50 49 95 0.028 -1.73 7.02 Intr - 148758 148587 172 2 1 75 45 149 0.040 8.32 7.01 Init - 180758 180721 38 2 2 90 111 25 0.017 2.61 7.00 Prom - 196662 196623 40 -1.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_1|65_aa XGCKHPALEYFTNLQLENYCVQPGNWENWKLGEDGAQHALPRHQQDALQGSGLDIGVRTP ICRTE >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_1|198_bp nngggatgtaaacatcctgccctggaatacttcaccaacctgcagctagaaaactactgt gtacaaccgggcaactgggagaattggaaactgggtgaagatggtgcccagcatgctctc cccaggcatcagcaggatgctttacaaggcagtggcctagatattggggtgaggaccccc atctgcaggacagagtga >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_2|120_aa MSTIPRELALPLRAAPDAAMLSLPLFSDTGAPSRGQPSPWSLCSPLLGPLLLTRVYGHCK CGTRLRVPRPSPTFTSHTALHTEVQLLDELKEQHSHQLIRDPILGNPFSALGGLLFWFAK >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_2|363_bp atgagcacaataccccgagagttagccctccccctgcgagcggctccggatgctgccatg ctgtctctcccactcttctccgacacgggtgctccaagccgcggccagccaagcccctgg tctctctgcagccctctgctgggccccctcctgctcacccgtgtgtatggccattgtaag tgtggcactagattaagggttcccaggccttctcccacctttacatcacacacagctctg cacacagaggttcagctgttagatgagttgaaggagcaacattcccaccagctcatccga gatcccatcttaggaaacccattctccgcgcttgggggattgctgttttggtttgcgaag tga >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_3|357_aa MVEAKGEAGTFFTGQQYQKGEPSYTAVTNPTQSWFSSTKPVFLKVYSLIRPSESPGCACY NTGSPPCPRPTHSDLCGERSLPGATAAECASNLKKIYTVQTVQLCQCLSVPLFPISERQW RLPCRVTKGLEVTWGEQDSVEREGLASAFTLHFIITFSSEIAVSLYRGGNVGFTGYMTCE GYAENLGLKSREEESNAKGGVWKMKVPKDSTSTVWKELLLATIGEQFTDCAAAEKGFLEG GIRKQEEFIGKEQTSKPQKADEQSRMHNLAPVRVGADSADHSKVPESLAGNDEVIGVSVS VRDREDVVQVWNVNASLVGEATVLEKIYELLPHITFKAVFYKPHEEHHAFEGGRGKH >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_3|1074_bp atggtagaagccaaaggagaagcaggcaccttcttcacagggcagcagtaccaaaaaggt gagccatcatatacagctgtcaccaaccctacccaatcatggtttagctccaccaagcct gtgtttctcaaagtgtattccctcatccgcccatcagaatcacctggctgtgcttgttac aatacaggctccccaccctgccccagacctactcattcagacctctgtggtgaaagatcc ctccctggtgccacagcagctgagtgcgcatcaaatctgaagaaaatctacacagtacag acagtacagctctgccagtgcctctctgtgcctctgttccccatcagtgagaggcagtgg cgccttccctgcagggttactaaaggtttagaggtcacatggggagagcaggatagtgtg gagcgtgaaggcctagcaagtgctttcactttgcatttcatcattacattttccagtgaa atagctgtatccttatacagaggaggcaacgtgggtttcacgggctacatgacctgtgaa ggttatgctgagaacctaggtcttaaatccagggaagaggagagtaatgcaaagggtggc gtatggaagatgaaagtccccaaggacagcacgtccacagtttggaaagagttgctgtta gcaaccatcggggaacagttcacagactgtgccgcagcagaaaagggatttctagaggga ggcatcaggaagcaggaagaattcataggaaaggagcagacaagtaaaccgcagaaagca gatgagcagtcgagaatgcacaatctggctccagttcgtgttggggcagatagcgctgac cactccaaggtccctgaaagcttggcaggaaatgatgaagtaataggagttagtgtcagt gttcgggaccgagaagacgtcgtccaagtctggaatgtaaatgcctctttagtgggtgaa gcgactgttttagaaaagatctatgaacttctgccccacataacttttaaagcagtattt tataaaccccatgaagagcatcatgcttttgaaggtggacgtggaaaacactaa >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_4|316_aa MAPGLSLEKAIIHPLPDKVSCCSLLNLQRIAAGTRPPPARLPSPTALPARRVPIGREQGV ESVAETFRAALSSPGPEARTPNSRPARPEAGFRPGGASPGPCKRAAGWSGRLGTDGGMDG TGDGAPDPRGAREGGMDRHTSSQKDGVKAGRWLEERRRDGGQMEAQLGVTDKWRGEPMAK WIKRLQCPRVQNKGRERRTNKRTTDWGMRLEQASTLKQVFSPSSLFFSVHVSAAHINCRS SETEYPVQLQLQETLGKVSLLRSHLYDLENETQRWRSWQPENANKRRQEESSFSSQRSSL ARQKTNYASQMQQKKL >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_4|951_bp atggcacctgggttgtcccttgagaaagccataattcacccccttcctgacaaggtcagc tgctgctcattattgaatctccagcgcatagcagcagggacgcggccacccccagctcgc ctaccctctcccacggccctgccagcacgccgcgttccgatcggaagggaacaaggggta gaaagtgttgccgaaacttttcgggctgcgctgtcctcccccggcccggaggcgaggacg ccgaacagccgcccggcgcggcccgaagctggcttccgacccggcggggcgagtcccggg ccatgcaaaagggcagcgggatggagcggcaggctagggacagatggagggatggatggg acaggggatggagccccagacccacgtggtgcgagggaaggagggatggacagacatacc agcagtcagaaggatggagtgaaggctggcagatggctggaggagagaaggagggatgga gggcagatggaagcacagttgggtgtgacagataaatggagaggggaaccgatggccaaa tggataaagagactccagtgcccgcgagtacagaacaaagggcgagaaagaagaacgaac aagagaaccaccgactggggcatgaggttggagcaggcctccactctaaagcaggtcttc tccccttcctcattatttttttcagtgcacgtgtcagctgcacacatcaactgccgatct tccgaaacagaatatccagtccaattgcaacttcaggaaactttaggaaaagtttcactt ctccgaagtcatctctatgatctggaaaatgagactcagagatggagaagctggcaacct gaaaatgctaacaagcgcagacaagaagagtctagtttctctagccaaaggagcagcctt gcaagacagaaaactaactatgccagccaaatgcaacagaaaaaactgtga >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_5|375_aa MANASEPGGSGGGEAAALGLKLATLSLLLCVSLAGNVLFALLIVRERSLHRAPYYLLLDL CLADGLRALACLPAVMLAARRAAAAAGAPPGALGCKLLAFLAALFCFHAAFLLLGVGVTR YLAIAHHRFYAERLAGWPCAAMLVCAAWALALAAAFPPVLDGGGDDEDAPCALEQRPDGA PGALGFLLLLAVVVGATHLVYLRLLFFIHDRRKMRPARLVPAVSHDWTFHGPGATGQAAA NWTAGFGRGPTPPALVGIRPAGPGRGARRLLVLEEFKTEKRLCKMFYAVTLLFLLLWGPY VVASYLRVLVRPGAVPQAYLTASVWLTFAQAGINPVVCFLFNRELRDCFRAQFPCCQSPR TTQATHPCDLKGIGL >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_5|1128_bp atggcgaacgcgagcgagccgggtggcagcggcggcggcgaggcggccgccctgggcctc aagctggccacgctcagcctgctgctgtgcgtgagcctagcgggcaacgtgctgttcgcg ctgctgatcgtgcgggagcgcagcctgcaccgcgccccgtactacctgctgctcgacctg tgcctggccgacgggctgcgcgcgctcgcctgcctcccggccgtcatgctggcggcgcgg cgtgcggcggccgcggcgggggcgccgccgggcgcgctgggctgcaagctgctcgccttc ctggccgcgctcttctgcttccacgccgccttcctgctgctgggcgtgggcgtcacccgc tacctggccatcgcgcaccaccgcttctatgcagagcgcctggccggctggccgtgcgcc gccatgctggtgtgcgccgcctgggcgctggcgctggccgcggccttcccgccagtgctg gacggcggtggcgacgacgaggacgcgccgtgcgccctggagcagcggcccgacggcgcc cccggcgcgctgggcttcctgctgctgctggccgtggtggtgggcgccacgcacctcgtc tacctccgcctgctcttcttcatccacgaccgccgcaagatgcggcccgcgcgcctggtg cccgccgtcagccacgactggaccttccacggcccgggcgccaccggccaggcggccgcc aactggacggcgggcttcggccgcgggcccacgccgcccgcgcttgtgggcatccggccc gcagggccgggccgcggcgcgcgccgcctcctcgtgctggaagaattcaagacggagaag aggctgtgcaagatgttctacgccgtcacgctgctcttcctgctcctctgggggccctac gtcgtggccagctacctgcgggtcctggtgcggcccggcgccgtcccccaggcctacctg acggcctccgtgtggctgaccttcgcgcaggccggcatcaaccccgtcgtgtgcttcctc ttcaacagggagctgagggactgcttcagggcccagttcccctgctgccagagcccccgg accacccaggcgacccatccctgcgacctgaaaggcattggtttatga >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_6|246_aa MRSLCCAPLLLLLLLPPLLLTPRAGDAAVITGACDKDSQCGGGMCCAVSIWVKSIRICTP MGKLGDSCHPLTQQFWKWKAGKKKEEEKQKEKGVVAMQRVGSVGRHTHPSKTGLDVGTDG GCDNTYYSHNGMFCAGITGMSCRNCYLFSKSAAPSLSLCPSPLPNTETGFLLAFNIYWLD GFIQVVFQIPVIRFMISKKFGPGREGPEVQWSWQPTQGAHLKNSILGQLHFITPATSVHT ENLKYQ >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_6|741_bp atgaggagcctgtgctgcgccccactcctgctcctcttgctgctgccgccgctgctgctc acgccccgcgctggggacgccgccgtgatcaccggggcttgtgacaaggactcccaatgt ggtggaggcatgtgctgtgctgtcagtatctgggtcaagagcataaggatttgcacacct atgggcaaactgggagacagctgccatccactgactcaacaattttggaaatggaaggca ggaaagaagaaagaggaagagaagcaaaaggaaaaaggagtggttgccatgcagagagtg ggatcagtcggcaggcacactcacccatccaaaaccggtttggatgttgggactgatgga gggtgtgacaacacttattattcacataatggaatgttctgtgctggaattacaggcatg agctgccgcaactgctatctctttagcaagagtgctgccccttccttatccctgtgtcct tctccactgcccaacacagaaactggcttcttattagctttcaatatctattggttagat ggatttattcaggtggttttccagattcctgtaattagatttatgatatcaaaaaagttt ggtccagggagagagggtcctgaagttcagtggtcttggcagcccacacagggtgcacat ctgaagaacagtatcttgggtcagctgcattttataaccccagccacctcggtgcacaca gaaaacctaaaataccagtaa >gi568815595f:71654050_71855174|GENSCAN_predicted_peptide_7|132_aa MLLMRVLFCSFFRHRPRAGDGAAMVPPPGAMCEDQHSKNGWVGRWKKPLPLMASWSMGTS PGLLSPDCFYRLRSPSYGQLCCFFSPGEIQRARRSLMGVYRVTLLCSEVGYTALAFAAAA SIPSSFVNLYFP >gi568815595f:71654050_71855174|GENSCAN_predicted_CDS_7|399_bp atgctgctcatgagagtgttgttctgttcattcttcagacataggccccgtgctggtgat ggagcagccatggtgcccccaccaggagctatgtgtgaggaccaacattccaagaatggc tgggtgggaagatggaagaagcccctgcctctgatggcatcgtggagcatgggcaccagc cctggactgctgtctccagattgcttttataggttgagaagtccctcttacgggcagtta tgttgcttcttcagtcctggagagatacaaagggcacgtcggtcactgatgggtgtgtac agagtaaccctgctttgctcagaagtgggatacacggctcttgcctttgccgctgcagca tccatcccctcttcctttgttaacctttattttccttag