GENSCAN 1.0 Date run: 18-Nov-116 Time: 17:35:20 Sequence gi568815590r:85948465_86169622 : 221158 bp : 38.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 8 3 6 -3.84 1.02 Term - 400 217 184 0 1 74 43 141 0.289 4.23 1.01 Init - 642 584 59 0 2 93 61 114 0.285 10.03 1.00 Prom - 5733 5694 40 -4.45 2.06 PlyA - 5965 5960 6 1.05 2.05 Term - 13389 13160 230 1 2 74 54 106 0.579 1.61 2.04 Intr - 15977 15898 80 1 2 95 85 52 0.210 3.88 2.03 Intr - 25817 25793 25 0 1 90 61 27 0.007 -3.63 2.02 Intr - 39967 39779 189 2 0 108 43 143 0.559 10.44 2.01 Init - 42057 42039 19 0 1 65 75 21 0.538 -1.25 2.00 Prom - 42775 42736 40 -5.55 3.04 PlyA - 43261 43256 6 1.05 3.03 Term - 54220 54134 87 1 0 82 50 123 0.886 4.58 3.02 Intr - 67692 67492 201 0 0 -4 82 267 0.437 15.46 3.01 Init - 72230 72183 48 2 0 85 119 -2 0.780 3.61 3.00 Prom - 73541 73502 40 -5.35 4.00 Prom + 77555 77594 40 -4.15 4.01 Init + 87883 88047 165 0 0 65 42 98 0.604 2.58 4.02 Intr + 92315 92556 242 1 2 42 76 188 0.843 8.43 4.03 Term + 92642 92879 238 2 1 57 42 146 0.853 1.56 4.04 PlyA + 93200 93205 6 1.05 5.00 Prom + 93683 93722 40 -7.35 5.01 Sngl + 94817 95089 273 1 0 46 42 229 0.723 9.18 5.02 PlyA + 95100 95105 6 1.05 6.07 PlyA - 95698 95693 6 1.05 6.06 Term - 100303 99998 306 1 0 64 47 119 0.143 -0.47 6.05 Intr - 116167 115501 667 0 1 95 94 559 0.990 48.01 6.04 Intr - 120489 120383 107 0 2 28 59 92 0.509 -1.61 6.03 Intr - 121354 120974 381 1 0 24 103 231 0.064 12.38 6.02 Intr - 126749 126604 146 0 2 70 9 179 0.052 7.28 6.01 Init - 127718 127523 196 2 1 81 32 85 0.719 1.34 6.00 Prom - 127895 127856 40 -3.65 7.00 Prom + 128187 128226 40 -7.65 7.01 Init + 129254 129433 180 1 0 70 41 120 0.019 4.81 7.02 Intr + 150440 150644 205 1 1 92 115 244 0.855 25.35 7.03 Intr + 162381 162430 50 1 2 46 99 45 0.382 -1.12 7.04 Intr + 173090 173227 138 1 0 6 63 274 0.801 16.44 7.05 Intr + 185202 185278 77 2 2 15 87 98 0.172 -0.21 7.06 Intr + 190993 191171 179 1 2 95 63 158 0.798 12.64 7.07 Intr + 192986 193065 80 0 2 69 90 53 0.644 1.95 7.08 Intr + 201648 201824 177 2 0 21 30 224 0.019 9.09 7.09 Intr + 203002 203076 75 0 0 84 64 88 0.903 4.79 7.10 Intr + 204352 204394 43 0 1 136 78 4 0.834 1.09 7.11 Intr + 208982 209123 142 0 1 72 64 99 0.051 4.39 7.12 Intr + 215409 215531 123 0 0 74 97 40 0.035 2.28 7.13 Term + 217152 217290 139 0 1 37 39 97 0.027 -3.85 7.14 PlyA + 217317 217322 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 126749 126559 191 0 2 70 45 185 0.838 9.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_1|80_aa MEESPVEGEVEVRESLQAATYWQKQPGERVTSVRMLQWILKMRPLEVSANYTLEGAQLLL RGTASWLLSGKATQAREGNR >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_1|243_bp atggaggaaagtcccgtggaaggtgaagtagaagtcagggaaagccttcaagcagcaacg tactggcagaagcaacctggggagagagtcacctcagtgcgaatgctccagtggatccta aaaatgcggccgctggaagtgtcagccaactacacacttgaaggggcacagcttcttctg aggggcacagcttcatggctgctcagtgggaaagccacacaggctagagaggggaacaga tga >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_2|180_aa MGQRTGGNRVLCCDKGHKMVSEEPGQNLMLLSPLPSALRTQTPQEELSHLNYVRCLGAGS PSRVLELAIEKVYRTPEGNKGTGQHNPQEMFHDMKEYHESCYTAVLKSESQVCPNSGWKG ELQGLGGAFGIGDHVVAVLGKYSLLQPTNRRLQEKVYARRLPNFSGNENVISSILVRNWK >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_2|543_bp atggggcagaggacaggaggaaacagagttctttgctgtgataaaggccacaagatggtc agtgaagagccaggacagaacttgatgctgcttagccccttgcccagtgctctccgcaca caaactccacaagaagagctttcccatttgaactatgttagatgtctgggagcaggaagc ccctcacgggtgctggagcttgctatagaaaaggtttacagaacgccagagggaaacaaa gggacaggacaacataacccacaggaaatgttccatgacatgaaagaatatcatgaaagc tgctatacagctgtgcttaagagtgagtcacaagtttgcccaaattcaggatggaagggg gagctgcaaggtcttggaggagcatttgggataggggatcatgttgtggctgtccttgga aaatacagtctgctgcagcctactaatcgcaggttgcaggagaaggtctacgccagaaga ctgccaaatttctcaggaaatgaaaatgtgatttctagcattttagtgagaaactggaaa tga >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_3|111_aa MTGERSGILLFRCSDLDCPPKLTSWGTKGWKSRQGIGTSTDDSDGGHDDDDGDGDSYGKD KATSIYLALTVDQVLSLGGGKRKAFAGVNSHYEARTLMATQQEKSPIGEYN >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_3|336_bp atgacaggtgagagatctggcatcttactgttcagatgcagtgatctggattgcccaccg aagctaaccagctggggaacaaagggctggaaatccagacaaggaattggcacttctaca gatgacagtgatggtggtcatgatgatgatgatggcgatggtgacagttatggtaaggac aaagcaactagtatttatttagcactaactgtggaccaggttctcagtctaggcggtggg aaacgaaaggcatttgcaggtgttaattcccattatgaggcacgcacactgatggctaca cagcaggaaaaatctcctattggagaatacaattga >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_4|214_aa MQLAQTLDVFIIIVYMNGKSPTDPLTSDNVPPPLIAYYSYKEFLRTLEDMGSSRQQAPIS EENPGGIQGYALIPPPCGGEAASVQRVCGTGCQPSPLAASSPTWLCPELALQAATLQVMS RVGPGITAVQCKPKSGNPDKHGAPSLDHVLLSSLDLFSSNYPLADIEGHYEQEPVETAHC VYYEDVADCIALVNYIVHPPWEIPQEILFSLGQE >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_4|645_bp atgcagctagcacaaaccttggatgtttttatcatcatagtttatatgaacggaaaaagt cctacagaccctttaacaagtgacaatgtgccacctccactcatagcttattatagctac aaggagttccttagaaccctagaggacatgggttcttcccgacagcaggctccaatttct gaagaaaatcctggtggcatccaaggatatgccctcatcccacctccatgtggaggggag gcagcttctgtccagcgtgtgtgtggcacaggttgtcagcccagcccactggctgccagc tcacccacatggctgtgccctgagctggcactgcaggcagccaccttgcaggtaatgtca cgtgtaggacctggaatcactgctgtgcagtgcaaaccaaaatcaggtaatccagacaaa catggagctccatccctggaccatgttcttttgagctccctggacttgttttcatcaaat tacccactggccgacattgagggacattatgaacaagaacctgtggaaactgctcactgt gtttactacgaggatgttgctgattgcattgcacttgttaattacattgtacatcctccc tgggagatcccacaagaaattcttttcagtctgggtcaggaatga >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_5|90_aa MGATKKHKAHAADGFRKFLVHNAKELEVLLMCNKSYFAEITHNVSFQNRKATWKGPPSWP SESPTTMPGCAAKKMKRQLMCMFCVSIKPP >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_5|273_bp atgggcgcaacaaaaaaacacaaagcacatgctgctgatggcttccggaagttcctggtc cacaacgccaaggagctggaagtgctgctgatgtgcaacaaatcttactttgctgagatc actcacaatgtttccttccagaaccgcaaagccacatggaaagggccacccagctggcca tcagagtcaccaaccacaatgccaggttgtgcagcaaagaaaatgaagagacagctcatg tgcatgttttgtgtttcaataaaaccaccataa >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_6|600_aa MGESGNHHFQQTNTGTENQTAHVLTHKWELDNENIWAQGGEHHKLGPVMGWKARSGKTLG EIPNVGTLTLLTGYGGCQLPCCKDTQAAYGETHVVRSGGLLPTASWELRPADSHTVTSDD PGVSVVSGYPGGCLPDHDPPVGFLSEGPAPRSCSLIKGGGTGLAASRVPRSRERRACCGY GVRRQQEGGPGATSAGLGQARRSKPSRRRRRGAWARGGGPGGAEDTGGSLPSQVRPPGPC QCPVQFLFDISEQGVQRMGKKRAGAAANKGRNSYLRRYDIKALIGTGSFSRVVRVEQKTT KKPFAIKVMETREREGREACVSELSVLRRVSHRYIVQLMEIFETEDQVYMVMELATGGEL FDRLIAQGSFTERDAVRILQMVADGIRYLHALQITHRNLKPENLLYYHPGEESKILITDF GLAYSGKKSGDWTMKTLCGTPEYIAPEVLLRKPYTSAVDMWALGVITYALLSGFLPFDDE SQTRLYRKILKGKYNYTGEPWPSISHLAKDFIDKLLILEAGHRMSAGQALDHPWVITMAA GSSMKNLQRAISRNLMQRASPHSQSPGSAQSSKSHYSHKSRHMWSKRNLRIVESPLSALL >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_6|1803_bp atgggtgaaagtggaaaccatcattttcagcaaactaacacaggaacagaaaaccaaaca gcacatgttctcactcataagtgggagttggacaatgaaaacatatgggcacagggaggg gaacatcacaaactgggacctgtcatgggttggaaggctaggagtgggaaaacattagga gaaatacctaacgtaggcacactcacactcctcactggctatgggggatgccagctgcca tgctgcaaggacactcaggcagcctatggagaaacccacgtggtgcggagtggaggcctt ctgccaacagccagctgggaactgaggcctgctgacagtcacacggtgaccagcgatgat ccaggcgtctcggtcgttagcgggtatcctgggggctgtctccctgaccacgacccccca gtggggtttctttccgagggtcccgcccctcgcagctgctctttgataaagggcggagga acggggctggctgcttcccgagtccccaggtcccgcgagcggcgggcgtgttgcgggtat ggggtgcggcgccagcaggaaggtggtcccggggccaccagcgctggcttgggccaagca cgaaggtcaaaaccaagccggcgtcggaggcgcggggcctgggcccgaggcggcggccca ggcggcgcagaggatacaggtggctcgcttccgagccaagttcgacccccgggtccttgc cagtgcccagtacaatttctctttgacatctctgaacagggagttcagaggatgggaaaa aagagagcaggagcagcagcaaacaagggaaggaattcctatcttcggagatatgacatc aaagctcttattgggacaggcagtttcagcagggttgtcagggtagagcagaagaccacc aagaaaccttttgcaataaaagtgatggaaaccagagagagggaaggtagagaagcgtgc gtgtctgagctgagcgtcctgcggcgggttagccatcgttacattgtccagctcatggag atctttgagactgaggatcaagtttacatggtaatggagctggctaccggaggggagctc tttgatcgactcattgctcagggatcctttacagagcgggatgccgtcaggatcctccag atggttgctgatgggattaggtatttgcatgcgctgcagataactcataggaatctaaag cctgaaaacctcttatactatcatccaggtgaagagtcgaaaattttaattacagatttt ggtttggcatactccgggaaaaaaagtggtgactggacaatgaagacactctgtgggacc ccagagtacatagctcctgaggttttgctaaggaagccttataccagtgcagtggacatg tgggctcttggtgtgatcacatatgctttacttagcggattcctgccttttgatgatgaa agccagacaaggctttacaggaagattctgaaaggcaaatataattatacaggagagcct tggccaagcatttcccacttggcgaaggactttatagacaaactactgattttggaggct ggtcatcgcatgtcagctggccaggccctggaccatccctgggtgatcaccatggctgca gggtcttccatgaagaatctccagagggccatatcccgaaacctcatgcagagggcctct ccccactctcagagtcctggatctgcacagtcttctaagtcacattattctcacaaatcc aggcatatgtggagcaagagaaacttaaggatagtagaatcgccactgtctgcgcttttg taa >gi568815590r:85948465_86169622|GENSCAN_predicted_peptide_7|535_aa MDKFLDTYTLPSLNQEEVESLNKPITSSEIEAVINSLPTKESPGPDGLTAEFYQRYKEEL ASANLQGPSRTTELFHPTLASISSPMLEGAELYFNVDHGYLEGLVRGCKASLLTQQDYIN LVQCETLEGQDRTGQDWHHKTSARQSKTPPFQKKEEEEEEEEEEEEEKEKEGKGEGEGEG GKKKEYTNGGKCHGKNCNPTPVSKCSMGTQMSTLTRLCSYMIDNVILLMNGALQKKSVKE ILGKCHPLGRFTEMEAVNIAETPSDLFNAILIETPLAPFFQDCMSENALDELNIELLRNK LYKFEADRRAFIITLNSFGTELSKEDRETLYPTFGKLYPEGLRLLAQAEDFDQMKNVADH YGVYKPLFEAVGGSGGKTLEDVFYEREVQMNVLAFNRQFHYVAFPLTPPFSSVIRAAVFS FHHLLNVWSVIEIQLITALLNINDIWLLIVTRSSGSHLTNYPVQQKESLLSNSNKNVLGL PVTGLPRHFRNGKGQDSEFEKVSLPHSESIEQHFRKDPCCPFTDESLEHTAMFST >gi568815590r:85948465_86169622|GENSCAN_predicted_CDS_7|1608_bp atggataaattcctggacacatacaccctcccaagtctaaaccaggaagaagtcgaatca ctgaataaaccaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaaa gaaagtccaggaccagatggactcacagctgaattctaccagaggtacaaagaggagctg gctagtgcaaatcttcaggggccgtccaggactacagagctgtttcaccctaccttggct tcaatctcttcccccatgctcgaaggtgcggagctgtacttcaacgtggaccatggctac ctggagggcctggttcgaggatgcaaggccagcctcctgacccagcaagactatatcaac ctggtccagtgtgagaccctagaaggacaggacaggacaggacaggactggcatcataag acctccgccaggcagagcaagactccaccatttcaaaagaaggaggaggaggaggaggag gaggaggaggaggaggaggagaaggagaaagaaggaaaaggagaaggagaaggagaagga ggaaagaaaaaagaatatacaaatggtggaaagtgccatgggaaaaactgcaaccccacc cctgtcagcaaatgctcgatggggacccagatgtctacccttacccggctgtgcagttat atgatagacaatgtgattctgctgatgaatggtgcattgcagaaaaaatctgtgaaagaa attctggggaagtgccaccccttgggccgtttcacagaaatggaagctgtcaacattgca gagacaccttcagatctctttaatgccattctgatcgaaacgccattagctccattcttc caagactgcatgtctgaaaatgctctagatgaactgaatattgaattgctacgcaataaa ctatacaagtttgaggccgacagacgtgcttttatcatcactcttaactcctttggcact gaattgagcaaagaagaccgagagaccctctatccaaccttcggcaaactctatcctgag gggttgcggctgttggctcaagcagaagactttgaccagatgaagaacgtagcggatcat tacggagtatacaaacctttatttgaagctgtaggtggcagtgggggaaagacattggag gacgtgttttacgagcgtgaggtacaaatgaatgtgctggcattcaacagacagttccac tacgtggcttttccactgactccccctttctcctcagtcattcgtgctgcagttttcagc ttccaccacttgctgaacgtttggtcagtaattgaaatccaactcatcacagccctactc aacatcaatgacatctggctccttattgtcacaaggagctcaggctcccatctcaccaac tacccagtccagcagaaagagtccctcttgtccaatagcaacaaaaatgttctgggtttg ccagtgactggtctgccaaggcacttcagaaatggaaaaggccaagattctgaatttgaa aaggtgtcactgcctcactcagaaagcatagagcaacactttagaaaggacccctgctgt cccttcactgatgaatcactggagcatactgcaatgttcagcacatag