GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:42:03 Sequence gi568815596r:10684261_10902641 : 218381 bp : 47.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5332 5424 93 2 0 70 12 99 0.262 0.44 1.02 Term + 5646 5866 221 1 2 62 42 146 0.450 4.60 1.03 PlyA + 7577 7582 6 1.05 2.00 Prom + 15198 15237 40 -3.76 2.01 Init + 23786 23806 21 1 0 87 96 24 0.702 3.10 2.02 Intr + 25065 25197 133 2 1 108 94 75 0.881 10.32 2.03 Intr + 36700 36737 38 2 2 78 113 64 0.506 5.88 2.04 Intr + 36774 36872 99 2 0 102 96 -13 0.316 1.21 2.05 Intr + 42242 42309 68 1 2 82 93 79 0.742 5.50 2.06 Intr + 69721 69806 86 1 2 98 92 58 0.033 6.66 2.07 Intr + 75006 75266 261 1 0 84 72 119 0.035 7.36 2.08 Intr + 79933 80165 233 2 2 69 100 152 0.015 11.99 2.09 Intr + 84459 84550 92 2 2 43 31 180 0.232 6.59 2.10 Intr + 87579 87677 99 0 0 96 89 151 0.999 15.23 2.11 Intr + 88282 88350 69 1 0 101 108 31 0.952 4.70 2.12 Intr + 90528 90620 93 0 0 54 80 114 0.513 6.28 2.13 Intr + 90718 90811 94 1 1 68 73 85 0.916 4.97 2.14 Intr + 93325 93462 138 0 0 62 99 54 0.787 4.56 2.15 Intr + 94312 94409 98 0 2 127 96 194 0.859 22.91 2.16 Intr + 97983 98146 164 0 2 55 72 140 0.499 8.72 2.17 Term + 98914 98948 35 2 2 56 41 12 0.122 -8.95 2.18 PlyA + 99164 99169 6 1.05 3.21 PlyA - 99623 99618 6 1.05 3.20 Term - 100066 99998 69 1 0 84 54 129 0.701 7.04 3.19 Intr - 100770 100674 97 2 1 61 105 -10 0.667 -1.99 3.18 Intr - 103179 103021 159 2 0 62 91 184 0.754 15.20 3.17 Intr - 104509 104437 73 2 1 39 98 3 0.426 -5.14 3.16 Intr - 104721 104637 85 0 1 101 99 148 0.981 16.59 3.15 Intr - 105629 105489 141 2 0 64 82 129 0.997 10.45 3.14 Intr - 106573 106459 115 0 1 5 84 139 0.985 5.65 3.13 Intr - 107665 107535 131 1 2 44 106 140 0.981 10.89 3.12 Intr - 108942 108836 107 1 2 101 71 97 0.901 9.23 3.11 Intr - 112947 112821 127 0 1 78 116 64 0.994 8.45 3.10 Intr - 113497 113440 58 0 1 83 106 15 0.983 1.69 3.09 Intr - 118380 118239 142 1 1 111 115 96 0.614 14.01 3.08 Intr - 135094 135014 81 2 0 129 131 -32 0.633 4.81 3.07 Intr - 136643 136511 133 2 1 46 84 127 0.121 8.32 3.06 Intr - 137044 136970 75 1 0 76 84 25 0.106 0.61 3.05 Intr - 146187 146051 137 1 2 68 21 81 0.063 -0.31 3.04 Intr - 151201 151064 138 2 0 70 32 124 0.829 5.44 3.03 Intr - 153402 153264 139 0 1 28 103 81 0.639 3.64 3.02 Intr - 157207 157031 177 1 0 71 64 125 0.706 8.52 3.01 Init - 157366 157328 39 1 0 92 58 66 0.611 2.30 3.00 Prom - 168829 168790 40 -4.96 4.04 PlyA - 170242 170237 6 1.05 4.03 Term - 170745 170596 150 0 0 65 54 140 0.961 6.21 4.02 Intr - 182568 182430 139 2 1 93 -6 99 0.101 1.47 4.01 Init - 190221 189977 245 0 2 84 38 134 0.125 5.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 50567 50509 59 2 2 66 65 72 0.802 3.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:10684261_10902641|GENSCAN_predicted_peptide_1|104_aa XSGAVTAVSLSPNADTSPGNCQPPLGLPESGACSSTVIPGPPSPCSAACGRWRRLMTHFL EPEHAGKFPEEGRWLNPGRAAAGRKTSEALTTRRGCRRRCYGQD >gi568815596r:10684261_10902641|GENSCAN_predicted_CDS_1|315_bp ntttctggcgcggtgacggcagtgtccttgtctcccaatgccgacacctcccccggaaat tgtcagccgcctctgggcctccccgagtcgggggcgtgctcgagcaccgtaatcccggga cctccgagcccctgctccgcggcgtgcggccgctggcgccgactgatgacgcacttcctg gagccggaacacgcgggcaagttcccggaagagggaaggtggctgaatccgggaagagcg gcggccgggagaaaaactagtgaggctctgacaacccggcgtggctgtcggcgacgctgc tacgggcaagactag >gi568815596r:10684261_10902641|GENSCAN_predicted_peptide_2|606_aa MSLMHVQEGQRVFSRMAGALGSRGFYTSVEVSGSGSSSLEGGDLQLPTASSDSTDHELIS DRRTGNSEGGEGEVGVSWATGRTPAPTRTGKASLSTQVGTLDSLVGLSDELGKLDTFAES LIRRMAQSVVEVMEDSKGKVQEHLLANGAELGAGRTQTALIAATAGKHTLPGLGPGMPTE DRNPRGLSGLLGDGPQSGWSQSLMLLPAGQASFLFSDKTIIAQELPACAQHILMREPCAC PPALPMMAAFVSTLMLGLGFRSLNHGCRAQAHVLKCVCILPVDLTSFVTHFEWDMAKYPV KQPLVSVVDTIAKQLAQIEMDLKSRTAAYNTLKTNLENLEKKSMGNLFTRTLSDIVSKED FVLDSEYLVTLLVIVPKPNYSQWQKTYESLSDMVVPRSTKLITEDKEGGLFTVTLFRKVI EDFKTKAKENKFTVREFYYDEKEIEREREEMARLLSDKKQQYQTSCVALKKGSSTFPDHK VKVTPLGNPDRPAAGQTDRERESEGEGEGPLLRWLKVNFSEAFIAWIHIKALRVFVESVL RYGLPVNFQAVLLQPHKKSSTKRLREVLNSVFRHLDEVAATSILDVGIQKQQYFYSICGD PGTATQ >gi568815596r:10684261_10902641|GENSCAN_predicted_CDS_2|1821_bp atgagcctgatgcacgtgcaggaggggcagagggtgttcagcaggatggcaggagccctg gggagcaggggcttctacacaagtgtggaggtatcggggtctggcagcagcagcctggag ggcggagacctgcagttgcccacagcatcctcagacagcaccgatcatgaactcatctct gatcgtcgtacgggcaattcggaaggtggagagggtgaggtgggagtttcctgggcaacc gggcgcacacccgcgcccacacggacaggaaaagcttcgctgtctacgcaggtggggacc ttggattccctggttggcctctctgatgagttggggaaactcgacacctttgctgaaagc ctcataaggagaatggctcagagcgtggtggaagtcatggaggactcaaaggggaaggtc caggagcacctcctggcaaacggagcagagctgggagcaggaaggacacagacagctttg attgcagccacagctggcaaacacaccttgcccggcctgggccctggaatgcccacagaa gacagaaaccctcgaggcttgtctggcttgctgggcgatgggccgcagtctggttggtca cagtccctcatgctgctgcctgccgggcaggcctcctttttgttctcggacaagaccatc attgctcaagagttgccagcctgtgcacagcatattttgatgagggagccctgtgcctgc ccgccggcgttgcccatgatggctgccttcgtgtccacgctgatgcttggcttgggattc cgaagtctcaatcatggatgcagagcgcaggctcatgtgttgaaatgtgtttgtattctt ccagttgacttaacatcctttgtgacccactttgaatgggacatggccaaatatcctgtc aagcagccgctcgtgagtgtggtggacacaatagccaagcaactggcgcagatcgagatg gacctgaagtcccgaacggccgcctacaacactctgaagacaaacctggagaacctggaa aagaaatccatggggaacctcttcacccggacactgagtgatattgtgagcaaagaggac ttcgtgctggattctgaatatctcgtcacacttctggtcatcgtccccaaaccaaactac tcacaatggcaaaaaacctacgaatctctctcagacatggtggtccctcgatcaaccaaa ctcattactgaggacaaggaagggggccttttcactgtgactctgtttcgaaaagtgatt gaagatttcaaaaccaaggccaaagaaaacaagttcactgttcgtgaattttactatgat gagaaggaaattgaaagggaaagggaggagatggccagattgctgtctgataagaagcaa cagtatcaaacttcctgtgttgctcttaaaaagggatcatccaccttcccggaccacaag gttaaggtaaccccgctaggtaaccctgataggcctgctgcggggcagaccgacagagag agagagagtgagggcgagggtgagggccccctgctgcgctggctcaaggtgaacttcagt gaagccttcattgcctggatccacatcaaggccctgagagtgtttgtggagtccgtgctc aggtatggactaccagtgaacttccaggcagtgctcctgcagccgcataagaagtcatcc accaagcgtttaagagaggttctaaactctgtcttccgacatctggatgaagtagccgct acaagtatactggatgtaggtatccagaaacagcagtatttctacagcatctgtggagat cccgggactgcaactcaataa >gi568815596r:10684261_10902641|GENSCAN_predicted_peptide_3|740_aa MAFPPVAQLPVGLAAVWAGPLDFSGSHAGKSPHQQQFLVFQQQLHPLLKVWGSTRGGGSL SSFIVDSLSALKKLGAPEQAAILCLGFSDCKMRIITAPASKVSRGSNELMILARRSDRGW GLGFPGGRTWKEYAPDKARDGNGTRLAHCQRPGQEGGLRSPTAGAVPAGKGARENLDPLA RTRQACKLTEFHQFKHKHSAKTNGELIGTEVKLINRILPSTGLLCNGRDWKQQTRSTASR GEKIPPASMRRDLREKLVWVCRPLAPVEVPANISSDFQPCSPTSPAHSLSRKSPIMYPST TMANAPGLVSCTFFLAVNGLYSSSDDVIELTPSNFNREVIQSDSLWLVEFYAPWCGHCQR LTPEWKKAATALKDVVKVGAVDADKHHSLGGQYGVQGFPTIKIFGSNKNRPEDYQGGRTG EAIVDAALSALRQLVKDRLGGRSGGYSSGKQGRSDSSSKKDVIELTDDSFDKNVLDSEDV WMVEFYAPWCGHCKNLEPEWAAAASEVKEQTKGKVKLAAVDATVNQVLASRYGIRGFPTI KIFQKGESPVDYDGGRTRSDIVSRALDLFSDNAPPPELLEIINEDIAKRTCEEHQLCVVA VLPHILDTGAAGRNSYLEVLLKLADKYKKKMWGWLWTEAGAQSELETALGIGGFGYPAMA AINARKMKFALLKGSFSEQGINEFLRELSFGRGSTAPVGGGAFPTIVEREPWDGRDGELP VEDDIDLSDVELDDLGKDEL >gi568815596r:10684261_10902641|GENSCAN_predicted_CDS_3|2223_bp atggcgtttcctcctgtggcacagctgccagtggggctggccgccgtctgggctggtcct ctagacttctctggctcccacgctggcaaatctccccaccagcagcagtttttggtgttc cagcagcagctgcatcccctcctgaaggtctggggctccactcgagggggcggaagtctt agttccttcattgttgactccctctcagccctaaagaagcttggagcccctgaacaagct gcaattctgtgcctgggcttttctgactgcaaaatgaggataatcacagcccctgcctcc aaagtgtcccgaggatcaaatgagttgatgatacttgcacggcgctcggatcgtggctgg ggactgggcttccccggaggccgcacctggaaggaatacgccccggataaggcccgcgat ggaaacggaacccggctggctcactgccagcggcctggtcaagaaggcgggctccggagc ccgactgccggggctgtgcccgcaggaaaaggtgcccgagagaaccttgaccctctggcc agaaccaggcaggcttgcaaactcactgagttccaccagttcaaacacaagcattcagcc aaaacaaatggagaactgataggcacggaggtaaaattgattaaccgaatcttgccctcc actggcttactgtgcaacggtagagactggaagcaacagacgaggagcacagcttccaga ggagagaagatacccccagcatccatgagaagagacctccgggagaagcttgtgtgggtg tgccgaccactggccccagttgaggtcccagccaacatcagctccgacttccagccatgc tcaccaacttctcctgcccactccctatcaaggaaaagtcccataatgtatccatcaacc accatggctaatgcacccggtctggtgagctgtaccttctttctggcagtgaatggtctg tattcctctagtgatgatgtgatcgaattaactccatcgaatttcaaccgagaagttatt cagagtgatagtttgtggcttgtagaattctatgctccatggtgtggtcactgtcaaaga ttaacaccagaatggaagaaagcagcaactgcattaaaagatgttgtcaaagttggtgca gttgatgcagataagcatcattccctaggaggtcagtatggtgttcagggatttcctacc attaagatttttggatccaacaaaaacagaccagaagattaccaaggtggcagaactggt gaagccattgtagatgctgcgctgagtgctctgcgccagctcgtgaaggatcgcctcggg ggacggagcggaggatacagttctggaaaacaaggcagaagtgatagttcaagtaagaag gatgtgattgagctgacagacgacagctttgataagaatgttctggacagtgaagatgtt tggatggttgagttctatgctccttggtgtggacactgcaaaaacctagagccagagtgg gctgccgcagcttcagaagtaaaagagcagacgaaaggaaaagtgaaactggcagctgtg gatgctacagtcaatcaggttctggcctcccgatacgggattagaggatttcctacaatc aagatatttcagaaaggcgagtctcctgtggattatgacggtgggcggacaagatccgac atcgtgtcccgggcccttgatttgttttctgataacgccccacctcctgagctgcttgag attatcaacgaggacattgccaagaggacgtgtgaggagcaccagctctgtgttgtggct gtgctgccccatatccttgatactggagctgcaggcagaaattcttatctggaagttctt ctgaagttggcagacaaatacaaaaagaaaatgtgggggtggctgtggacagaagctgga gcccagtctgaacttgagaccgcgttggggattggagggtttgggtaccccgccatggcc gccatcaatgcacgcaagatgaaatttgctctgctaaaaggctccttcagtgagcaaggc atcaacgagtttctcagggagctctcttttgggcgtggctccacggcacctgtaggaggc ggggctttccctaccatcgttgagagagagccttgggacggcagggatggcgagcttccc gtggaggatgacattgacctcagtgatgtggagcttgatgacttagggaaagatgagttg tga >gi568815596r:10684261_10902641|GENSCAN_predicted_peptide_4|177_aa MKHNRWKPRGIDNRVCRRCKGQILMPNIGYGSNKNTKHLGPSGLQKFLVYNVREMEVLLK CNRVKDSRGHTTLKSWHCLSLEEKVALISPKDIIAFAENWELLLIDLTSSARIDDPMSAP QELPESLKNPPGEITEQEKQTRGTNDKNNEIFEIWTADSYQKWCLVSTAATSITSTC >gi568815596r:10684261_10902641|GENSCAN_predicted_CDS_4|534_bp atgaagcataacaggtggaaacccagaggtattgacaacagggtttgtagaaggtgcaag ggccagatcttaatgcccaacattggttatgggagcaacaaaaacacaaagcacctgggg cctagtggcttacagaagttcctggtctacaatgtcagggagatggaagtgctgctgaag tgcaacagagttaaggacagcagaggccacaccacgctcaagagctggcactgcctatcc ttggaagaaaaagtagctttaatatcccccaaggacattatagcatttgcagagaactgg gaactactgctgattgatttaacttcctcagcaagaattgatgaccccatgagtgctccc caggagcttccagaatccctgaaaaatccacctggagaaattacagagcaagagaaacag accagaggaacaaatgacaagaacaatgagatttttgaaatctggacagccgacagctat cagaagtggtgcctggtgtcaacagctgccaccagcatcaccagcacctgttga