GENSCAN 1.0	Date run: 5-Nov-116	Time: 11:34:09

Sequence gi568815588f:73811556_74016562 : 205007 bp : 47.94% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.22 PlyA -    836    831    6                               1.05
 1.21 Term -   3692   3460  233  0  2   87   49   304 0.986  23.04
 1.20 Intr -   5562   5468   95  2  2  112   78    21 0.999   3.11
 1.19 Intr -   5999   5924   76  0  1  117  113   113 0.999  15.27
 1.18 Intr -   8090   7977  114  0  0   50  100    42 0.781   2.02
 1.17 Intr -  10175  10127   49  2  1   54  100    68 0.991   2.85
 1.16 Intr -  12529  12485   45  1  0   70  113    43 0.877   3.61
 1.15 Intr -  12737  12600  138  2  0   84   75    29 0.692   1.86
 1.14 Intr -  25956  25913   44  1  2  102  115    47 0.966   6.76
 1.13 Intr -  26561  26461  101  1  2  108   45    84 0.949   5.85
 1.12 Intr -  30656  30577   80  2  2  100  105     0 0.434   1.25
 1.11 Intr -  30986  30903   84  2  0   78  113    93 0.990  10.82
 1.10 Intr -  35792  35670  123  2  0   77   75   169 0.984  15.28
 1.09 Intr -  36527  36433   95  0  2  124   67   122 0.986  13.38
 1.08 Intr -  37054  36971   84  2  0  109  103    67 0.978  10.09
 1.07 Intr -  37560  37458  103  0  1   84   87   167 0.850  15.95
 1.06 Intr -  37778  37706   73  1  1   99   74    77 0.880   6.81
 1.05 Intr -  40764  40699   66  2  0  120   91   122 0.999  13.72
 1.04 Intr -  41691  41637   55  1  1  126   95    41 0.998   6.64
 1.03 Intr -  49334  49275   60  0  0  107  109    73 0.964  10.01
 1.02 Intr -  61528  61434   95  0  2  119   84   111 0.546  13.41
 1.01 Init -  62906  62842   65  2  2  111   42   144 0.994  12.72
 1.00 Prom -  65928  65889   40                              -3.96

 2.00 Prom +  78672  78711   40                              -4.46
 2.01 Init + 100001 100057   57  1  0   97   87   104 0.855  11.72
 2.02 Intr + 100486 100513   28  0  1   90   84    25 0.920  -0.01
 2.03 Intr + 100660 100767  108  2  0  109   52    84 0.964   7.16
 2.04 Intr + 101369 101543  175  0  1  121   94   107 0.997  13.60
 2.05 Intr + 101735 101826   92  2  2   80   89    60 0.999   4.94
 2.06 Intr + 101984 102203  220  0  1  120   37   153 0.944  10.76
 2.07 Intr + 102425 102663  239  2  2   88   56   305 0.514  24.56
 2.08 Intr + 103221 103361  141  1  0  101   94   117 0.997  13.92
 2.09 Intr + 103696 103844  149  2  2   88  109   192 0.999  21.25
 2.10 Term + 104834 105010  177  1  0   82   55    91 0.727   2.89
 2.11 PlyA + 105906 105911    6                               1.05

 3.00 Prom + 105976 106015   40                              -4.86
 3.01 Init + 114330 114471  142  2  1   52   75    74 0.566   2.80
 3.02 Term + 119524 119588   65  2  2   64   48   115 0.903   3.15
 3.03 PlyA + 121959 121964    6                               1.05

 4.03 PlyA - 123065 123060    6                               1.05
 4.02 Term - 125826 125664  163  2  1  103   42    85 0.523   2.81
 4.01 Init - 133438 133329  110  1  2   99   74    95 0.484   7.09
 4.00 Prom - 142285 142246   40                              -2.66

 5.00 Prom + 151116 151155   40                              -4.06
 5.01 Init + 159613 159642   30  0  0   75  105    49 0.704   3.36
 5.02 Intr + 163409 163508  100  1  1   58   66    93 0.757   3.78
 5.03 Term + 171339 171367   29  1  2  134   38     6 0.225  -1.56
 5.04 PlyA + 173692 173697    6                               1.05

 6.00 Prom + 176470 176509   40                              -2.26
 6.01 Sngl + 186653 186997  345  1  0   90   52   495 0.674  42.14
 6.02 PlyA + 187137 187142    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   1945   2103  159  0  0  110   43    80 0.801   3.64

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_1|625_aa
MATTATCTRFTDDYQLFEELGKGAFSVVRRCVKKTSTQEYAAKIINTKKLSARDHQKLER
EARICRLLKHPNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIHQI
LESVNHIHQHDIVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGEQQAWFGFAGTPGY
LSPEVLRKDPYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT
VTPEAKNLINQMLTINPAKRITADQALKHPWVCQRSTVASMMHRQETVECLRKFNARRKL
KGAILTTMLVSRNFSGMFSQLCTLIMPRSREWTLGKGWQLGPSEPGEDLLSEGFNPLAKP
SSAKSLLNKKSDGGVKPSAAAKIQGYPSCPGPQLSRDILSSGDGWWAGLSGHHRVCGGPG
VQEPQTTVVHNATDGIKGSTESCNTTTEDEDLKAAPLRTGNGSSVPEGRSSRDRTAPSAG
MQPQPSLCSSAMRKQEIIKITEQLIEAINNGDFEAYTKICDPGLTSFEPEALGNLVEGMD
FHKFYFENLLSKNSKPIHTTILNPHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETR
VWHRRDGKWLNVHYHCSGAPAAPLQ

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_1|1878_bp
atggccaccaccgccacctgcacccgtttcaccgacgactaccagctcttcgaggagctt
ggcaagggtgctttctctgtggtccgcaggtgtgtgaagaaaacctccacgcaggagtac
gcagcaaaaatcatcaataccaagaagttgtctgcccgggatcaccagaaactagaacgt
gaggctcggatatgtcgacttctgaaacatccaaacatcgtgcgcctccatgacagtatt
tctgaagaagggtttcactacctcgtgtttgaccttgttaccggcggggagctgtttgaa
gacattgtggccagagagtactacagtgaagcagatgccagccactgtatacatcagatt
ctggagagtgttaaccacatccaccagcatgacatcgtccacagggacctgaagcctgag
aacctgctgctggcgagtaaatgcaagggtgccgccgtcaagctggctgattttggccta
gccatcgaagtacagggagagcagcaggcttggtttggttttgctggcaccccaggttac
ttgtcccctgaggtcttgaggaaagatccctatggaaaacctgtggatatctgggcctgc
ggggtcatcctgtatatcctcctggtgggctatcctcccttctgggatgaggatcagcac
aagctgtatcagcagatcaaggctggagcctatgatttcccatcaccagaatgggacacg
gtaactcctgaagccaagaacttgatcaaccagatgctgaccataaacccagcaaagcgc
atcacggctgaccaggctctcaagcacccgtgggtctgtcaacgatccacggtggcatcc
atgatgcatcgtcaggagactgtggagtgtttgcgcaagttcaatgcccggagaaaactg
aagggtgccatcctcacgaccatgcttgtctccaggaacttctcaggtatgttttcccag
ctgtgtactttgattatgccgaggtccagagagtggacactggggaaagggtggcagcta
ggacccagtgaacctggtgaggacctgctcagtgaaggcttcaaccccctggcaaaaccc
tcctctgccaaaagcctattgaacaagaagtcggatggcggtgtcaagcccagtgctgct
gccaaaatccagggctatcccagctgcccgggaccccagttgagccgggatattttgtct
tctggagatggctggtgggcaggcctcagtggtcatcatagggtctgcgggggtcctggg
gtgcaggagccacaaaccactgtggtacacaacgctacagatgggatcaagggctccaca
gagagctgcaacaccaccacagaagatgaggacctcaaagctgccccgctccgcactggg
aatggcagctcggtgcctgaaggacggagctcccgggacagaacagccccctctgcaggc
atgcagccccagccttctctctgctcctcagccatgcgaaaacaggagatcattaagatt
acagaacagctgattgaagccatcaacaatggggactttgaggcctacacgaagatttgt
gatccaggcctcacttcctttgagcctgaggcccttggtaacctcgtggaggggatggat
ttccataagttttactttgagaatctcctgtccaagaacagcaagcctatccataccacc
atcctaaacccacacgtccacgtgattggggaggacgcagcgtgcatcgcctacatccgc
ctcacccagtacatcgacgggcagggtcggcctcgcaccagccagtcagaagagacccgg
gtctggcaccgtcgggatggcaagtggctcaatgtccactatcactgctcaggggcccct
gccgcaccgctgcagtga

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_2|461_aa
MRALLARLLLCVLVVSDSKGSNELHQVPSNCDCLNGGTCVSNKYFSNIHWCNCPKKFGGQ
HCEIDKSKTCYEGNGHFYRGKASTDTMGRPCLPWNSATVLQQTYHAHRSDALQLGLGKHN
YCRNPDNRRRPWCYVQVGLKLLVQECMVHDCADGKKPSSPPEELKFQCGQKTLRPRFKII
GGEFTTIENQPWFAAIYRRHRGGSVTYVCGGSLISPCWVISATHCFIDYPKKEDYIVYLG
RSRLNSNTQGEMKFEVENLILHKDYSADTLAHHNDIGEGEPRDYCGHNGLGRVGPRERLE
LRLKLPALLKIRSKEGRCAQPSRTIQTICLPSMYNDPQFGTSCEITGFGKENSTDYLYPE
QLKMTVVKLISHRECQQPHYYGSEVTTKMLCAADPQWKTDSCQGDSGGPLVCSLQGRMTL
TGIVSWGRGCALKDKPGVYTRVSHFLPWIRSHTKEENGLAL

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_2|1386_bp
atgagagccctgctggcgcgcctgcttctctgcgtcctggtcgtgagcgactccaaaggc
agcaatgaacttcatcaagttccatcgaactgtgactgtctaaatggaggaacatgtgtg
tccaacaagtacttctccaacattcactggtgcaactgcccaaagaaattcggagggcag
cactgtgaaatagataagtcaaaaacctgctatgaggggaatggtcacttttaccgagga
aaggccagcactgacaccatgggccggccctgcctgccctggaactctgccactgtcctt
cagcaaacgtaccatgcccacagatctgatgctcttcagctgggcctggggaaacataat
tactgcaggaacccagacaaccggaggcgaccctggtgctatgtgcaggtgggcctaaag
ctgcttgtccaagagtgcatggtgcatgactgcgcagatggaaaaaagccctcctctcct
ccagaagaattaaaatttcagtgtggccaaaagactctgaggccccgctttaagattatt
gggggagaattcaccaccatcgagaaccagccctggtttgcggccatctacaggaggcac
cgggggggctctgtcacctacgtgtgtggaggcagcctcatcagcccttgctgggtgatc
agcgccacacactgcttcattgattacccaaagaaggaggactacatcgtctacctgggt
cgctcaaggcttaactccaacacgcaaggggagatgaagtttgaggtggaaaacctcatc
ctacacaaggactacagcgctgacacgcttgctcaccacaacgacattggtgagggggaa
ccccgcgactactgtggccataatggcttggggagagtgggacccagggagagactggag
ctgaggttgaagctgcccgccttgctgaagatccgttccaaggagggcaggtgtgcgcag
ccatcccggactatacagaccatctgcctgccctcgatgtataacgatccccagtttggc
acaagctgtgagatcactggctttggaaaagagaattctaccgactatctctatccggag
cagctgaaaatgactgttgtgaagctgatttcccaccgggagtgtcagcagccccactac
tacggctctgaagtcaccaccaaaatgctgtgtgctgctgacccacagtggaaaacagat
tcctgccagggagactcagggggacccctcgtctgttccctccaaggccgcatgactttg
actggaattgtgagctggggccgtggatgtgccctgaaggacaagccaggcgtctacacg
agagtctcacacttcttaccctggatccgcagtcacaccaaggaagagaatggcctggcc
ctctga

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_3|68_aa
MKDFELSCDVELLKILEQCEKLIKGRFLGLGETTSRENQFGDYCNQRGTSKRNQEYGRSM
KVDKECDH

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_3|207_bp
atgaaggactttgaactttcttgtgatgtggaattgttaaagattttggagcagtgtgaa
aaattgatcaaaggaagattcttgggcctgggtgagactaccagcagggaaaaccagttt
ggagactattgcaatcagcgaggaacttcaaagaggaaccaggagtacgggaggagcatg
aaagtggacaaggagtgtgaccattga

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_4|90_aa
MGLYPSALGRSMAPGAAEQGAVLIREAWAAQEPTAAGLTAEGILGVHEIPVLLDAPGRGS
SEIIYVGNTIYTAPSCIHQVEVQKVLKTKF

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_4|273_bp
atgggcctgtacccctcagcccttgggcggtctatggcaccaggtgccgcagagcagggg
gcagtgctcatcagggaggcttgggctgcccaggagcccacggccgcgggactaacagct
gagggtatccttggagtccatgaaatacctgtcctgctagatgctcctggaagagggagt
tctgagatcatctatgttgggaacaccatctacacggccccatcctgtatccatcaggtt
gaagtccaaaaagtgctaaaaactaagttttaa

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_5|52_aa
MALVPRLELLNTLKVTQYQFWIQALRLPDNFGFCTFENSPELPCVDTFIFER

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_5|159_bp
atggccctggtaccccgtttggagctcctgaatacactgaaagtgacacagtaccagttc
tggatccaagctttaagactacctgacaattttggcttttgtacttttgaaaacagtcct
gaactgccatgtgttgataccttcatttttgaaaggtaa

>gi568815588f:73811556_74016562|GENSCAN_predicted_peptide_6|114_aa
MPVFHTRTIESILEPVAQQISHLVIMHEEGEVDGKAIPDLTAPVAAVQAAVSNLVRVSAQ
GLARERAREVSPGPRPASRLPVAGSLAVAFRVGSGARRRGFEWPLLRPVTLSES

>gi568815588f:73811556_74016562|GENSCAN_predicted_CDS_6|345_bp
atgccagtgtttcatacgcgcacgatcgagagcatcctggagccggtggcacagcagatc
tcccacctggtgataatgcacgaggagggcgaggtggacggcaaagccattcctgacctc
accgcgcccgtggccgccgtgcaggcggccgtcagcaacctcgtccgggtgagcgcgcag
ggcctggcgcgggagcgggcgcgggaggtatccccggggccccggcccgcgtcgcggctg
cctgtggccggctcgctggccgtggctttccgcgtgggttccggagccaggcgccgaggg
ttcgaatggcctcttctccgccctgtgaccttgagcgagtcctga