GENSCAN 1.0    Date run: 4-Nov-116    Time: 15:15:09

Sequence gi568815587f:27481698_27682051 : 200354 bp : 39.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    649    644    6                               1.05
 1.07 Term -  17107  16952  156  1  0   76   37   201 0.998  10.75
 1.06 Intr -  17871  17662  210  0  0   74   79   162 0.999  12.09
 1.05 Intr -  19869  19798   72  0  0   66   94    77 0.957   4.78
 1.04 Intr -  20223  20105  119  1  2   57  105    90 0.845   6.86
 1.03 Intr -  24758  24531  228  0  0   68   20   121 0.271   0.22
 1.02 Intr -  25068  25019   50  2  2   78  105   106 0.332   8.71
 1.01 Init -  40679  40588   92  2  2   66   13   148 0.234   5.11
 1.00 Prom -  42148  42109   40                              -4.65

 2.04 PlyA -  45348  45343    6                               1.05
 2.03 Term -  64497  64390  108  0  0   57   44   106 0.388   0.63
 2.02 Intr -  67199  67039  161  0  2   56   78    71 0.161   1.69
 2.01 Init -  71061  70842  220  0  1   71  103   126 0.243  11.24
 2.00 Prom -  85909  85870   40                              -5.45

 3.00 Prom +  97491  97530   40                              -5.75
 3.01 Sngl +  99983 100357  375  1  0   61   42   510 0.702  39.59
 3.02 PlyA + 100387 100392    6                               1.05

 4.00 Prom + 118215 118254   40                              -5.35
 4.01 Init + 120226 120328  103  0  1  105   54    92 0.621   7.95
 4.02 Term + 123579 123658   80  1  2  105   54    20 0.285  -2.75
 4.03 PlyA + 123940 123945    6                               1.05

 5.00 Prom + 140147 140186   40                              -5.95
 5.01 Sngl + 142487 142798  312  1  0   56   48   286 0.807  17.08
 5.02 PlyA + 142915 142920    6                              -0.45

 6.00 Prom + 142927 142966   40                              -6.75
 6.01 Init + 149731 149770   40  0  1   68   59    55 0.514   1.01
 6.02 Intr + 150554 150735  182  0  2  -40  109   234 0.560  11.37
 6.03 Term + 155830 156093  264  0  0   47   47   174 0.529   3.72
 6.04 PlyA + 157550 157555    6                               1.05

 7.00 Prom + 160894 160933   40                               0.35
 7.01 Init + 164146 164214   69  0  0   72  105    56 0.884   6.80
 7.02 Term + 164327 164443  117  1  0  111   40    92 0.888   4.26
 7.03 PlyA + 168116 168121    6                               1.05

 8.04 PlyA - 170695 170690    6                               1.05
 8.03 Term - 176888 176124  765  2  0   90   40   610 0.359  48.82
 8.02 Intr - 184513 184316  198  1  0   61   36   153 0.000   6.03
 8.01 Intr - 192527 192363  165  2  0   21  113   117 0.038   6.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_1|308_aa
MDSSSEDRMDEGTQAKMQAGLEEVMGTTCGRLREKMAALGEPVRLERDTRRTRAPLVSTA
EAAFGDPYTRKANVTHPVRTDYTHCLYSTRTDTRYNSTEPPFPAVCTQTPSLLSPQPRLY
SGWDICRAIELLEKLQRSGEVPPQKLQALQRVLQSEFCNAVREVYEHVYETVDISSSPEV
RANATAKATVAAFAASEGHSHPRVVELPKTEEGLGFNIMGGKEQNSPIYISRIIPGGIAD
RHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQGKVKLVVRYTPKVLEEMESRFEKMR
SAKRRQQT

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_1|927_bp
atggacagttcctcagaggacagaatggatgaggggacacaggcgaagatgcaagcaggt
ttggaagaagtgatgggaaccacatgtggaaggttaagggagaagatggcggcgctaggg
gaacccgtgcggctggagagagacacacgccgcacacgcgctcccttggtgagcacagca
gaagcagcattcggagacccgtacacccgaaaagcaaacgtcacacacccggttcgcaca
gactacactcattgcctgtacagcacgagaacggacacacggtacaattcaacagaacct
cctttccccgctgtttgcacacaaactccttcattactttctccacaacctcgcttgtat
tcgggttgggatatttgtagagcaattgaattattggaaaaactacaaaggagtggagaa
gtaccaccacagaaacttcaggctttgcaaagagtccttcaaagtgaattctgcaatgct
gtgagagaggtatatgaacatgtctatgagactgtggacatcagtagcagtcctgaagtg
agagcgaacgctactgcaaaggctactgttgctgcatttgctgccagtgaaggacattct
catcctcgagttgttgagctaccaaaaacagaagagggccttggattcaatattatggga
ggcaaagaacaaaactctccaatctatatatcccgaataattccaggtggaattgctgat
agacatgggggcctcaaacgtggagatcaactcctctctgttaatggagtgagtgttgaa
ggagaacatcatgaaaaagctgtagaactgctgaaagccgcacaaggaaaggttaaatta
gtggtacgatacacacccaaagtcttagaagaaatggagtcgcgctttgaaaaaatgaga
tcagcaaaacgcaggcaacagacctaa

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_2|162_aa
MKRAKEKQFLELLPPQKIVNKKQYCIPGGTAEISATVKDLKDAGVAIPITFPFNTPIWSM
QKTDGAWRITVDYVCKETIMDLLGLQLVLVLWRILTNTITKAHLQPSIFPFPKFNQSPLL
EMTVMWEKPDQQFAHAALCFQVPFATATAISSPHTDTVSCVH

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_2|489_bp
atgaagagggccaaggagaagcagttcctagaattgcttccacctcaaaaaatagtaaat
aaaaagcaatactgcattcctggagggactgcagagattagtgctaccgttaaggacttg
aaagatgcaggggtggcgattcccatcacattcccattcaacactcctatttggtctatg
cagaagacagatggagcttggagaataacagtggattatgtttgcaaagaaacaatcatg
gatcttctgggcctccaattagttctggttctctggagaatcctgactaatacaatcacc
aaggctcatcttcagccatcgatttttccctttccgaagttcaaccagtctccactatta
gagatgacagtgatgtgggagaaacctgatcagcagtttgctcatgctgctctctgtttc
caggtcccttttgcaacagcaactgccatctcctctcctcacacagacacagtcagctgt
gtacattga

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_3|124_aa
MPPKDDKKKDAGKSAKKDKDPVNKSGGKAKKKKWSKGKVRDKLNNLVLFDKATYDKLCKE
VPNYKLITPAVVSERLKIRGSLARAALQELLSKGLIKLVSKHRAKVIYTRNTKGGDAAAA
GEDA

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_3|375_bp
atgccacccaaggatgacaagaagaaggatgctggaaagtcggccaagaaagacaaagac
ccagttaacaaatccgggggcaaggccaaaaagaagaagtggtccaaaggcaaagttcgg
gacaagctcaataacttagtcttgtttgacaaagctacctatgacaaactctgtaaggaa
gttcccaattataaacttataaccccagctgtggtctctgagagactgaagattcgaggc
tccctggccagggcagcccttcaggagctccttagtaaaggacttatcaaactggtttca
aagcacagagctaaagtaatttacaccagaaataccaagggcggagatgctgcagctgct
ggtgaagatgcatga

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_4|60_aa
MERSMPLEQDLLLTFPKRRGYTTPRRATEKAPGFGFHSESSGEECLREVVGFQDALHLPS

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_4|183_bp
atggagaggtcaatgccattggaacaagacttattacttacatttcccaagagaagggga
tatacgacaccacgcagggccacagagaaagcaccaggttttggattccattcagaatcc
tctggagaagagtgtcttagggaagtagtaggatttcaggatgccctgcatctgccttcc
tga

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_5|103_aa
MLEGQDRDAKSDVRMEEPTKWAIGDTVEEWSDSSKRVRSRNSAEMHTQPRSYPALNDSEL
AWNGHPKTKIHGRGDTGSTGSGPGHQVGETVGDGGEEGIDDEP

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_5|312_bp
atgttggaaggtcaagacagagatgccaaatcagatgtcaggatggaggagccaacaaaa
tgggcaataggggacacagttgaggaatggagtgacagtagcaagagagtgagaagtagg
aactcagcagagatgcacacccagccaagaagttaccctgcattaaatgacagtgagctg
gcctggaatggacaccccaaaaccaagatccatggcagaggggacactggcagtacaggg
tctggccctggacatcaggtgggtgaaactgttggggatggaggagaagaaggcattgat
gatgaaccatga

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_6|161_aa
MGAVCEALQQYSLEGDRLAFHCESYPKLCVCDGLGGFRGHRAVSVFSCEAEKIWEGVSQR
ALGQSSRASGSGCQWKQNTHNATDIGHLVQHPTSTWQDSNFFPLVPVVFDLHKVGNGDLH
LGIQWVISGQDEEQSQTRSRAVKLLSGLTECDQIDKESSLS

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_6|486_bp
atgggggctgtctgtgaagctttgcagcagtacagcctagaaggggacaggcttgccttc
cactgtgagagttacccgaagctctgcgtctgtgatggtctagggggcttccgaggccat
cgggcagtgtcagtcttcagctgcgaagccgagaagatctgggaaggagtcagtcagaga
gccttgggccagagttccagggcctctgggagtggctgccagtggaagcaaaatacacat
aacgctacagacattggtcaccttgttcagcacccaacatcgacctggcaagactcgaac
tttttcccgttggtccctgttgtctttgatctacacaaagtggggaatggtgacctccat
ctgggaattcaatgggtgatctctgggcaagatgaagagcagtcacaaacccgatccagg
gctgttaaacttctttcaggactcactgaatgtgaccagatagataaggagagttctctg
agttag

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_7|61_aa
MGWKAAVIAQARCNEDLNQSSGKGHVLDKGETGCSLAVSDVTTPNFPLDKRRPKIASGPI
G

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_7|186_bp
atgggttggaaggctgctgtgatagcccaggcgagatgcaatgaagacctgaaccagagt
agtggaaagggacatgtcctggataagggtgaaactggatgctccttggcagtttcagat
gtcaccacccctaactttccattggataaaagaaggcccaaaatagcctctggtcccata
ggctag

>gi568815587f:27481698_27682051|GENSCAN_predicted_peptide_8|375_aa
NAEFLQKGLQVHTCFGVYPHASVWHDCASQKKGCAVYLHVSVEFNKLIPENGFIKSGPSA
AGLLEFAGGPLQTLFAWVSPAEAAEQQILQNGKCCSLIVPLEASSQRGTWPYEVSVGPYW
EFHQVRRVMTILFLTMVISYFGCMKAAPMKEANIRGQGGLAYPGVRTHGTLESVNGPKAG
SRGLTSLADTFEHVIEELLDEDQKVRPNEENNKDADLYTSRVMLSSQVPLEPPLLFLLEE
YKNYLDAANMSMRVRRHSDPARRGELSVCDSISEWVTAADKKTAVDMSGGTVTVLEKVPV
SKGQLKQYFYETKCNPMGYTKEGCRGIDKRHWNSQCRTTQSYVRALTMDSKKRIGWRFIR
IDTSCVCTLTIKRGR

>gi568815587f:27481698_27682051|GENSCAN_predicted_CDS_8|1128_bp
aatgctgagtttctacagaaagggttgcaggtccacacatgttttggcgtctacccacac
gcttctgtatggcatgactgtgcatcccagaagaagggctgtgctgtgtacctccacgtt
tcagtggaatttaacaaactgatccctgaaaatggtttcataaagtcaggaccctcagct
gcaggtctgttggagtttgctggaggtccactccagaccctgtttgcttgggtatcacca
gcagaggctgcagaacagcaaatattgcagaacggcaaatgttgctccctgattgttcct
ctggaagcttcgtctcagaggggcacctggccgtatgaggtgtcagtcggcccctactgg
gagttccaccaggtgagaagagtgatgaccatccttttccttactatggttatttcatac
tttggttgcatgaaggctgcccccatgaaagaagcaaacatccgaggacaaggtggcttg
gcctacccaggtgtgcggacccatgggactctggagagcgtgaatgggcccaaggcaggt
tcaagaggcttgacatcattggctgacactttcgaacacgtgatagaagagctgttggat
gaggaccagaaagttcggcccaatgaagaaaacaataaggacgcagacttgtacacgtcc
agggtgatgctcagtagtcaagtgcctttggagcctcctcttctctttctgctggaggaa
tacaaaaattacctagatgctgcaaacatgtccatgagggtccggcgccactctgaccct
gcccgccgaggggagctgagcgtgtgtgacagtattagtgagtgggtaacggcggcagac
aaaaagactgcagtggacatgtcgggcgggacggtcacagtccttgaaaaggtccctgta
tcaaaaggccaactgaagcaatacttctacgagaccaagtgcaatcccatgggttacaca
aaagaaggctgcaggggcatagacaaaaggcattggaactcccagtgccgaactacccag
tcgtacgtgcgggcccttaccatggatagcaaaaagagaattggctggcgattcataagg
atagacacttcttgtgtatgtacattgaccattaaaaggggaagatag